Please use this identifier to cite or link to this item:
http://hdl.handle.net/123456789/6164
Title: | Leveraging Correlation and Clustering: An Exploration of Data Scientist Salaries | Authors: | Chandra Agoeng Nurul Dini Faqriah Miza Azmi Hakimah Mat Harun Nurzulaikha Abdullah Wan Azani Mustafa Fakhitah Ridzuan |
Keywords: | Salary;exploratory data analysis;data scientist | Issue Date: | 2024 | Publisher: | Penerbit Akademia Baru | Journal: | Journal of Advanced Research in Computing and Applications | Abstract: | Data science is a dynamic field with ever-evolving job descriptions and salary structures. While data science offers high earning potential, the factors influencing data scientist salaries remain unclear. This lack of clarity makes it challenging for both employers to determine competitive compensation packages and for employees to understand how career choices like experience level and job title can impact their earning potential. Thus, this study aims to explore the interrelationship between related variables with salary. To achieve the objectives of this research, correlation analysis was employed to identify the strength and direction of linear relationships between these attributes in the dataset. Additionally, k-means clustering was utilized to group data scientists with similar characteristics, allowing for the exploration of potential salary segments within the data science field. It was found that there was a very strong correlation between employee residence and company location (r=0.90). There was a significant moderate positive correlation between salary with company location (r=0.46), residence (r=0.48) and experience level (r=0.41) respectively. Based on the clustering analysis, the group was divided into four different popular roles in data science salary group. Therefore, employers can leverage this knowledge to design the salary packages considering location and experience. |
Description: | Mycite |
URI: | http://hdl.handle.net/123456789/6164 | ISSN: | 2462 - 1927 | DOI: | 10.37934/arca.35.1.1020 |
Appears in Collections: | Journal Indexed MyCite - FSDK |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
5282-Article Text-26649-1-10-20240517.pdf | 651.03 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.