Please use this identifier to cite or link to this item: http://hdl.handle.net/123456789/4868
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Elnily, A. | en_US |
| dc.contributor.author | Abdelghany, A. | en_US |
| dc.date.accessioned | 2023-09-06T03:03:19Z | - |
| dc.date.available | 2023-09-06T03:03:19Z | - |
| dc.date.issued | 2022 | - |
| dc.identifier.isbn | Institute of Electrical and Electronics Engineers Inc | - |
| dc.identifier.uri | http://hdl.handle.net/123456789/4868 | - |
| dc.description | Scopus | en_US |
| dc.description.abstract | Automatic POS tagging is the process of automatically assigning the correct part-of-speech (POS) tag to each word in a text based on its context. It is a crucial step in most NLP applications. This study proposes a machine learning-based Arabic POS tagger built with the YAMCHA tool, which uses Support Vector Machines (SVM) as its learning algorithm. SVM classifies data with high accuracy because it makes use of part of the data in the training process. As a result, training the system requires a substantial amount of data annotated at the POS level. A corpus of 100,039 words is used in this study, divided into a training part of 64,608 words and a testing part of 35,431 words. A tag set of 48 morphological tags was used in training and testing. To reach the best result in automatic POS tagging, the system was trained multiple times while varying the range of linguistic information used in the training process, and new texts were then tested and evaluated. The lowest error rate achieved was 11.4%. This rate was reached when the word preceding the target word was considered in the training process without considering its POS tag (F:-10: 0). | en_US |
| dc.publisher | Institute of Electrical and Electronics Engineers Inc | en_US |
| dc.subject | machine learning | en_US |
| dc.subject | POS tagging | en_US |
| dc.subject | support vector machine | en_US |
| dc.title | Automatic POS tagging of Arabic words using the YAMCHA machine learning tool | en_US |
| dc.type | Printed | en_US |
| dc.relation.conference | Proceedings of the 20th Conference on Language Engineering, ESOLEC 2022 | en_US |
| dc.identifier.doi | 10.1109/ESOLEC54569.2022.10009473 | - |
| dc.description.page | 72-77 | en_US |
| dc.relation.seminar | 20th International Conference on Language Engineering, ESOLEC 2022 | en_US |
| dc.date.seminarstartdate | 2022-10-12 | - |
| dc.date.seminarenddate | 2022-10-13 | - |
| dc.description.placeofseminar | Cairo | en_US |
| dc.description.type | Indexed Proceedings | en_US |
| dc.contributor.correspondingauthor | abdelghany.ma@umk.edu.my | en_US |
| item.fulltext | No Fulltext | - |
| item.openairetype | Printed | - |
| item.grantfulltext | none | - |
Appears in Collections: Faculty of Language Studies and Human Development - Proceedings

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.