Turkish Medical Text Classification Using BERT

Küçük Resim Yok

Tarih

2021

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

Ieee

Erişim Hakkı

info:eu-repo/semantics/closedAccess

Özet

Medical text classification is mostly carried out on English data sets. The limited number of studies in Turkish is due to the compelling morphological structure of Turkish for natural language processing and the limited number of data sets in the medical domain. In addition, the use of domain specific words and abbreviations makes natural language processing studies more challenging. In this study, a classification model is implemented to assign article abstracts to appropriate disease categories using multilingual BERT and BERTurk models on a data set consisting of Turkish medical article abstracts. As a result of the experimental study, 0.82 and 0.93 F-score are obtained for multilingual BERT and BERTurk, respectively. The results show that the BERTurk is more successful than other compared models for Turkish medical text classification.

Açıklama

29th IEEE Conference on Signal Processing and Communications Applications (SIU) -- JUN 09-11, 2021 -- ELECTR NETWORK

Anahtar Kelimeler

Text classification, medical texts, disease classification, BERT, BERTurk

Kaynak

29th Ieee Conference On Signal Processing And Communications Applications (Siu 2021)

WoS Q Değeri

N/A

Scopus Q Değeri

N/A

Cilt

Sayı

Künye