Please use this identifier to cite or link to this item:
http://hdl.handle.net/123456789/4239
Title: | Malay lexical simplification model for non-native speaker | Authors: | Salehah Omar Juhaida Abu Bakar Maslinda Mohd Nadzir Nor Hazlyna Harun Yusoff, N |
Keywords: | text simplification;complex word identification model;machine learning;NLP component;text analysis | Issue Date: | 2022 | Publisher: | Institute of Electrical and Electronics Engineers Inc. | Project: | FRGS-RACER | Conference: | 2022 International Conference on Intelligent Systems and Computer Vision, ISCV 2022 | Abstract: | Vocabulary is an important language skill that can affect a person’s understanding of a sentence. Thus, lexical simplification is the task of converting difficult words into simpler words. It is to make it easier for the reader to understand the sentences. The biggest challenge in lexical simplification is to simplify the words needed without changing the meaning of the sentence. Past studies have shown that there are weaknesses in this task, where simple words are also identified as complex words. This issue has led to the simplification of unnecessary words. The purpose of the study is to produce a complex word identification model for the Malay language into words that are more easily understood by nonnative speakers. Experiments was performed on the appropriate features to obtain the required results. Machine learning was used to ensure the results were more accurate. This study is a novelty in text simplification of the Malay language in the field of Natural Language Processing (NLP) and may be used as a preprocessing tool to improve other tasks in NLP. |
Description: | Scopus |
URI: | http://hdl.handle.net/123456789/4239 | ISBN: | 978-166549558-5 | DOI: | 10.1109/ISCV54655.2022.9806133 |
Appears in Collections: | Faculty of Data Science and Computing - Proceedings |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Malay lexical simplification model for non-native speaker _ IEEE Conference Publication _ IEEE Xplore.pdf | 384.71 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.