Please use this identifier to cite or link to this item: http://hdl.handle.net/123456789/4239
Title: Malay lexical simplification model for non-native speaker
Authors: Salehah Omar 
Juhaida Abu Bakar 
Maslinda Mohd Nadzir 
Nor Hazlyna Harun 
Yusoff, N 
Keywords: text simplification;complex word identification model;machine learning;NLP component;text analysis
Issue Date: 2022
Publisher: Institute of Electrical and Electronics Engineers Inc.
Project: FRGS-RACER 
Conference: 2022 International Conference on Intelligent Systems and Computer Vision, ISCV 2022 
Abstract: 
Vocabulary is an important language skill that can affect a person’s understanding of a sentence. Thus, lexical simplification is the task of converting difficult words into simpler words. It is to make it easier for the reader to understand the sentences. The biggest challenge in lexical simplification is to simplify the words needed without changing the meaning of the sentence. Past studies have shown that there are weaknesses in this task, where simple words are also identified as complex words. This issue has led to the simplification of unnecessary words. The purpose of the study is to produce a complex word identification model for the Malay language into words that are more easily understood by nonnative speakers. Experiments was performed on the appropriate features to obtain the required results. Machine learning was used to ensure the results were more accurate. This study is a novelty in text simplification of the Malay language in the field of Natural Language Processing (NLP) and may be used as a preprocessing tool to improve other tasks in NLP.
Description: 
Scopus
URI: http://hdl.handle.net/123456789/4239
ISBN: 978-166549558-5
DOI: 10.1109/ISCV54655.2022.9806133
Appears in Collections:Faculty of Data Science and Computing - Proceedings

Show full item record

Google ScholarTM

Check

Altmetric

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.