The effect of part-of-speech tagging on IR performance for Turkish

dc.contributor.author: Dincer, BT
dc.contributor.author: Karaoglan, B
dc.contributor.editor: Aykanat, C
dc.contributor.editor: Dayar, T
dc.contributor.editor: Korpeoglu, I
dc.date.accessioned: 2019-10-27T18:37:18Z
dc.date.available: 2019-10-27T18:37:18Z
dc.date.issued: 2004
dc.department: Ege Üniversitesi [en_US]
dc.description: 19th International Symposium on Computer and Information Sciences (ISCIS 2004) -- OCT 27-29, 2004 -- Kemer Antalya, TURKEY [en_US]
dc.description.abstract: In this paper, we experimentally evaluate the effect of Part-of-Speech (POS) tagging on Information Retrieval performance for Turkish. We used four term-weighting schemes to index the SABANCI-METU Turkish Treebank corpus. The term-weighting schemes are "tf", "tf x idf", "Ltu.ltu", and "Okapi". Each weighting scheme is factored over three POS tagging cases, namely "No POS tagging", "POS tag with no history (i.e. 1-gram)", and "POS tag with one step history (i.e. 2-gram)". The Meta-scoring function is used to analyze the effect of these nine factors on IR performance. Results show that the weighting schemes are significantly different from each other with a p-value of 0.04 (Friedman Non-parametric Test), but there is not enough evidence in the corpus to reject the null hypothesis that the three weighting schemes, on average, show equal performance over the three cases of POS tagging with a p-value of 0.36. [en_US]
dc.description.sponsorship: Bilkent Univ, Dept Comp Engn, Inst Elect & Elect Engineers Turkey Sect, Working Grp, Int Federat Informat Proc, Sci & Tech Res Council Turkey [en_US]
dc.identifier.endpage: 778 [en_US]
dc.identifier.isbn: 3-540-23526-4
dc.identifier.issn: 0302-9743
dc.identifier.issn: 1611-3349
dc.identifier.startpage: 771 [en_US]
dc.identifier.uri: https://hdl.handle.net/11454/36368
dc.identifier.volume: 3280 [en_US]
dc.identifier.wos: WOS:000225096700077 [en_US]
dc.identifier.wosquality: N/A [en_US]
dc.indekslendigikaynak: Web of Science [en_US]
dc.language.iso: en [en_US]
dc.publisher: Springer-Verlag Berlin [en_US]
dc.relation.ispartof: Computer and Information Sciences - Iscis 2004, Proceedings [en_US]
dc.relation.ispartofseries: Lecture Notes in Computer Science
dc.relation.publicationcategory: Article - International Refereed Journal - Institutional Faculty Member [en_US]
dc.rights: info:eu-repo/semantics/closedAccess [en_US]
dc.title: The effect of part-of-speech tagging on IR performance for Turkish [en_US]
dc.type: Article [en_US]
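
The abstract contrasts indexing with and without POS tags under several term-weighting schemes. As a rough illustration only, the sketch below computes "tf" and "tf x idf" weights for a toy collection, once with plain words as index terms and once with each word suffixed by its own POS tag (the "no history" case). Everything here is an assumption made for illustration and is not taken from the paper: the three toy documents, the tag names `Noun`, `Verb`, and `Adj`, the helper names, and the idf variant log(N/df). The "Ltu.ltu" and "Okapi" schemes and the 2-gram tagging case are not covered.

```python
import math
from collections import Counter

# Hypothetical toy collection: each document is a list of (word, POS tag) pairs.
# "yüz" is ambiguous in Turkish (e.g. "face" as a noun, "swim" as a verb).
DOCS = [
    [("yüz", "Noun"), ("güzel", "Adj")],
    [("yüz", "Verb"), ("deniz", "Noun")],
    [("deniz", "Noun"), ("güzel", "Adj")],
]

def index_terms(tagged_tokens, use_pos):
    """Build index terms: the bare word, or word/TAG when use_pos is True."""
    return [f"{word}/{tag}" if use_pos else word for word, tag in tagged_tokens]

def tf_weights(terms):
    """Raw term frequency per index term in one document."""
    return dict(Counter(terms))

def tf_idf_weights(docs_terms):
    """tf * log(N / df) per document; this idf variant is an assumption."""
    n_docs = len(docs_terms)
    # Document frequency: number of documents containing each term.
    df = Counter(term for terms in docs_terms for term in set(terms))
    return [
        {term: freq * math.log(n_docs / df[term])
         for term, freq in Counter(terms).items()}
        for terms in docs_terms
    ]

# Weights without POS tags and with each word's own tag attached.
plain = tf_idf_weights([index_terms(doc, use_pos=False) for doc in DOCS])
tagged = tf_idf_weights([index_terms(doc, use_pos=True) for doc in DOCS])

if __name__ == "__main__":
    print("no POS :", plain[0])
    print("1-gram :", tagged[0])
```

In this toy setup, attaching the tag splits the ambiguous word into two distinct index terms, so each becomes rarer in the collection and receives a higher idf than the untagged form. Whether such a change helps retrieval is the empirical question the paper tests; the sketch makes no claim about the outcome.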
