Please use this identifier to cite or link to this item: http://hdl.handle.net/123456789/5944
DC FieldValueLanguage
dc.contributor.authorBakar, J. A.en_US
dc.contributor.authorYusoff, Nen_US
dc.contributor.authorHarun, N. H.en_US
dc.contributor.authorNadzir, M. M.en_US
dc.contributor.authorOmar, S.en_US
dc.date.accessioned2024-01-29T04:23:36Z-
dc.date.available2024-01-29T04:23:36Z-
dc.date.issued2023-
dc.identifier.issn2158107X-
dc.identifier.urihttp://hdl.handle.net/123456789/5944-
dc.descriptionWeb of Science / Scopusen_US
dc.description.abstractText Simplification (TS) is an emerging field in Natural Language Processing (NLP) that aims to make complex text more accessible. However, there is limited research on TS in the Malay language, known as Bahasa Malaysia, which is widely spoken in Southeast Asia. The challenges in this domain revolve around data availability, feature engineering, and the suitability of methods for text simplification. Previous studies predominantly employed single methods such as semantic compression, or machine learning with the Support Vector Machine (SVM) classifier consistently achieving an accuracy of approximately 70% in identifying troll sentences—statements containing threats from online trolls notorious for their disruptive online behavior. This study combines semantic compression and machine learning methods across lexical, syntactic, and semantic levels, utilizing frequency dictionaries as semantic features. Support Vector Machine and Decision Tree classifiers are applied and tested on 6,836 datasets, divided into training and testing sets. When comparing SVM and Decision Tree with and without semantic features, SVM with semantics achieves an average accuracy of 92.37%, while Decision Tree with semantics reaches 91.21%. The proposed TS method is evaluated on troll sentences, which are often associated with cyberbullying. Furthermore, it is worth noting that cyberbullying has been reported to be a significant issue, with Malaysia ranking as the second worst out of the 28 countries surveyed in Asia. Therefore, the outcomes of the study could potentially offer means, such as machine translation and relation extraction, to help prevent cyberbullying in Malaysia.en_US
dc.language.isoenen_US
dc.publisherScience and Information Organizationen_US
dc.relationFRGS-RACERen_US
dc.relation.ispartofInternational Journal of Advanced Computer Science and Applicationsen_US
dc.subjectText simplificationen_US
dc.subjectsemantic compressionen_US
dc.subjectmachine learningen_US
dc.subjectnatural language processingen_US
dc.subjectcyber bullyingen_US
dc.titleText Simplification using Hybrid Semantic Compression and Support Vector Machine for Troll Threat Sentencesen_US
dc.typeInternationalen_US
dc.identifier.doi10.14569/IJACSA.2023.0141035-
dc.description.page322-320en_US
dc.volume14(10)en_US
dc.description.articleno10en_US
dc.description.typeArticleen_US
item.fulltextWith Fulltext-
item.openairetypeInternational-
item.languageiso639-1en-
item.grantfulltextopen-
crisitem.author.deptUniversiti Malaysia Kelantan-
crisitem.author.deptUniversiti Malaysia Kelantan-
crisitem.author.orcid0000-0003-2703-2531-
crisitem.author.orcid0000-0003-2703-2531-
Appears in Collections:Faculty of Data Science and Computing - Journal (Scopus/WOS)
Files in This Item:
File Description SizeFormat
SCOPUS_IJACSA_Text Simplification.pdf1.45 MBAdobe PDFView/Open
Show simple item record

Google ScholarTM

Check

Altmetric

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.