Developing a Robust Median and MAD-Based Estimator for Casewise and Cellwise Outlier Detection in South Sulawesi Socio-Economic Data

Authors

  • Agung Tri Utomo Department of Statistics, Universitas Negeri Makassar, Makassar, 90223, Indonesia
  • Abdul Rahman Department of Mathematics, Universitas Negeri Makassar, Makassar, 90223, Indonesia

DOI:

https://doi.org/10.35877/454RI.jinav4864

Keywords:

Robust Estimator, Median Absolute Deviation, Cellwise Outlier, Casewise Outlier, Socio-Economic Data

Abstract

Socio-economic data analysis at the regional level frequently faces problems of spatial heterogeneity and extreme variability that give rise to outliers. Traditional approaches using casewise deletion often discard valuable information entirely just because of anomalies in a small fraction of attributes. This study proposes the development and application of a robust estimator based on Median and Median Absolute Deviation (MAD) to detect outliers using a cellwise paradigm (data cell-based) and compares it with a casewise approach (observation row-based) on socio-economic datasets of Regencies/Cities in South Sulawesi Province. The analyzed indicators include the Human Development Index (HDI), Regional Gross Domestic Product (RGDP), Percentage of Poor Population, Per Capita Expenditure, and the Open Unemployment Rate (OUR). Experimental results show that the robust estimator with a threshold of 2.24 is able to map cellwise outliers accurately without reducing the dimension of the observation data. Heatmap and Robust Z-Score Scatter Plot visualizations reveal that Makassar City is an extreme outlier in the RGDP indicator, while other anomalies are found in the OUR indicator in the Enrekang, Bulukumba, and Bone regions. Ultimately, this approach proves to be superior in maintaining the integrity of macroeconomic datasets compared to classical methods.

References

Alqallaf, F., Van Aelst, S., Yohai, V. J., & Zamar, R. H. (2009). Propagation of outliers in multivariate data. The Annals of Statistics, 37(1), 311–331.

Badan Pusat Statistik Provinsi Sulawesi Selatan. (2025). Provinsi Sulawesi Selatan Dalam Angka 2025. Badan Pusat Statistik Provinsi Sulawesi Selatan. https://sulsel.bps.go.id/id/publication/2025/02/28/a7beacd81481497cecb44084/provinsi-sulawesi-selatan-dalam-angka-2025.html

Barnett, V., & Lewis, T. (1994). Outliers in Statistical Data (3rd ed.). John Wiley & Sons.

Hampel, F. R. (1974). The influence curve and its role in robust estimation. Journal of the American Statistical Association, 69(346), 383–393.

Hawkins, D. M. (1980). Identification of Outliers. Chapman and Hall.

Huber, P. J. (1981). Robust Statistics. John Wiley & Sons.

Leys, C., Ley, C., Klein, O., Bernard, P., & Licata, L. (2013). Detecting outliers: Do not use standard deviation around the mean, use absolute deviation around the median. Journal of Experimental Social Psychology, 49(4), 764–766.

Raymaekers, J., & Rousseeuw, P. J. (2021). Fast robust correlation for high-dimensional data. Technometrics, 63(2), 184–198.

Rousseeuw, P. J., & Croux, C. (1993). Alternatives to the median absolute deviation. Journal of the American Statistical Association, 88(424), 1273–1283.

Rousseeuw, P. J., & den Bossche, W. (2018). Detecting Deviating Data Cells. Technometrics, 60(2), 135–145.

Rousseeuw, P. J., & Leroy, A. M. (2005). Robust Regression and Outlier Detection. John Wiley & Sons.

Rousseeuw, P. J., & Van Driessen, K. (1999). A fast algorithm for the minimum covariance determinant estimator. Technometrics, 41(3), 212–223.

Sembiring, F., & others. (2020). Penerapan Data Mining untuk Analisis Tingkat Kemiskinan di Indonesia. Jurnal SIFO Mikroskil, 21(1), 45–56.

Voloh, B., Watson, M. R., Konig, S., & Womelsdorf, T. (2020). MAD saccade: statistically robust saccade threshold estimation via the median absolute deviation. Journal of Eye Movement Research, 12. https://doi.org/10.16910/jemr.12.8.3

Yulianto, A., Soelistyo, A., & Handayani, D. (2019). Analisis Robust Principal Component Analysis (RPCA) pada Data Ekonomi Regional. Jurnal Statistika Nusantara, 5(2), 112–120.

Published

2026-04-30

How to Cite

Utomo, A. T., & Rahman, A. (2026). Developing a Robust Median and MAD-Based Estimator for Casewise and Cellwise Outlier Detection in South Sulawesi Socio-Economic Data. JINAV: Journal of Information and Visualization, 7(1). https://doi.org/10.35877/454RI.jinav4864

Issue

Section

Articles