Positive region: An enhancement of partitioning attribute based rough set for categorical data

Baroad, Muftah Mohamed; Mohd Hashim, Siti Zaiton; Ahsan, Jamal Uddin; Zainal, Anazida

Please use this identifier to cite or link to this item: http://hdl.handle.net/123456789/1524

Title:	Positive region: An enhancement of partitioning attribute based rough set for categorical data
Authors:	Baroad, Muftah Mohamed Mohd Hashim, Siti Zaiton Ahsan, Jamal Uddin Zainal, Anazida
Keywords:	Clustering, Rough Set Theory;Performance, Partitioning Categorical Data;Attribute Dependency
Issue Date:	2020
Publisher:	Periodicals of Engineering and Natural Sciences
Journal:	Periodicals of Engineering and Natural Sciences
Abstract:	Datasets containing multi-value attributes are often involved in several domains, like pattern recognition, machine learning and data mining. Data partition is required in such cases. Partitioning attributes is the clustering process for the whole data set which is specified for further processing. Recently, there are already existing prominent rough set-based approaches available for group objects and for handling uncertainty data that use indiscernibility attribute and mean roughness measure to perform attribute partitioning. Nevertheless, most of the partitioning attribute methods for selecting partitioning attribute algorithm for categorical data in clustering datasets are incapable of optimal partitioning. This indiscernibility and mean roughness measures, however, require the calculation of the lower approximation, which has less accuracy and it is an expensive task to compute. This reduces the growth of the set of attributes and neglects the data found within the boundary region. This paper presents a new concept called the Positive Region Based Dependency (PRD), that calculates the attribute dependency. In order to determine the mean dependency of the attributes, that is acceptable for categorical datasets, using a positive region-based mean dependency measure (PRD) defines the method. By avoiding the lower approximation, PRD is an optimal substitute for the conventional dependency measure in partitioning attribute selection. Contrary to traditional RST partitioning methods, the proposed method can be employed as a measure of data output uncertainty and as a tailback for larger and multiple data clustering. The performance of the method presented is evaluated and compared with the algorithms of Information-Theoretical Dependence Roughness (ITDR) and Maximum Indiscernible Attribute (MIA).
Description:	Others
URI:	http://hdl.handle.net/123456789/1524
ISSN:	2303-4521
Appears in Collections:	Faculty of Bioengineering and Technology - Other Publication

Files in This Item:

File	Description	Size	Format
Positive regionAn enhancement of partitioning attribute based rough set for categorical data UMK.pdf		423.24 kB	Adobe PDF	View/Open

Show full item record

Google Scholar^TM

Check

Files in This Item:

Google ScholarTM

Google Scholar^TM