Please use this identifier to cite or link to this item: http://hdl.handle.net/123456789/6164
Title: Leveraging Correlation and Clustering: An Exploration of Data Scientist Salaries
Authors: Chandra Agoeng 
Nurul Dini Faqriah Miza Azmi 
Hakimah Mat Harun 
Nurzulaikha Abdullah 
Wan Azani Mustafa 
Fakhitah Ridzuan 
Keywords: Salary;exploratory data analysis;data scientist
Issue Date: 2024
Publisher: Penerbit Akademia Baru
Journal: Journal of Advanced Research in Computing and Applications 
Abstract: 
Data science is a dynamic field with ever-evolving job descriptions and salary structures. While data science offers high earning potential, the factors influencing data scientist salaries remain unclear. This lack of clarity makes it challenging for both employers to determine competitive compensation packages and for employees to understand how career choices like experience level and job title can impact their earning potential. Thus, this study aims to explore the interrelationship between related variables with salary. To achieve the objectives of this research, correlation analysis was employed to identify the strength and direction of linear relationships between these attributes in the dataset. Additionally, k-means clustering was utilized to group data scientists with similar characteristics, allowing for the exploration of potential salary segments within the data science field. It was found that there was a very strong correlation between employee residence and company location (r=0.90). There was a significant moderate positive correlation between salary with company location (r=0.46), residence (r=0.48) and experience level (r=0.41) respectively. Based on the clustering analysis, the group was divided into four different popular roles in data science salary group. Therefore, employers can leverage this knowledge to design the salary packages considering location and experience.
Description: 
Mycite
URI: http://hdl.handle.net/123456789/6164
ISSN: 2462 - 1927
DOI: 10.37934/arca.35.1.1020
Appears in Collections:Journal Indexed MyCite - FSDK

Files in This Item:
File Description SizeFormat
5282-Article Text-26649-1-10-20240517.pdf651.03 kBAdobe PDFView/Open
Show full item record

Google ScholarTM

Check

Altmetric

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.