PT - JOURNAL ARTICLE AU - Ashkar, Ziad M. AU - Gottumukkala, Raju TI - Using Machine Learning to Identify Geographic and Socioeconomic Disparities in Dialysis Facility Outcomes Across the United States AID - 10.31486/toj.25.0040 DP - 2025 Sep 21 TA - Ochsner Journal PG - 170--180 VI - 25 IP - 3 4099 - http://www.ochsnerjournal.org/content/25/3/170.short 4100 - http://www.ochsnerjournal.org/content/25/3/170.full SO - Ochsner J2025 Sep 21; 25 AB - Background Despite progress in dialysis care, the patient outcomes of mortality, hospitalization, and readmission rates remain unsatisfactory because of complex clinical, demographic, and socioeconomic interactions. For this study, we used unsupervised machine learning to identify clusters of dialysis facilities based on quality metrics and sociodemographic factors, with attention to racial and geographic disparities.Methods We sourced facility-level data from data.cms.gov and sourced ZIP Code Tabulation Area–level sociodemographic data from the 2021 American Community Survey via the US Census Bureau application programming interface. Datasets were linked by ZIP code, standardized, and analyzed using principal component analysis and k-means clustering. We examined geographic patterns by US Census Bureau regions. Analyses were conducted in Python version 3.11.6 (Python Software Foundation) with the following libraries: pandas for data manipulation, scikit-learn for machine learning and principal component analysis, Matplotlib and Seaborn for data visualization, and GeoPandas for geographic mapping and spatial analysis.Results Two facility clusters emerged: Cluster 0 (n=4,609) and Cluster 1 (n=2,857). Cluster 1 was characterized by poorer outcomes (higher mortality, hospitalization, readmission, anemia, catheter use, and hyperphosphatemia); lower rates of fistula use; and lower dialysis adequacy compared to Cluster 0. Cluster 1 facilities were more prevalent in regions with lower income, higher unemployment, and lower college education, and they served populations with greater proportions of Black and Hispanic residents. Geographically, Cluster 1 facilities were concentrated in the southern and western United States. Compared to Cluster 0, a larger share of Cluster 1 facilities were for-profit facilities (91.4% vs 88.5%).Conclusion This study highlights a distinct cluster of underperforming dialysis clinics serving socioeconomically disadvantaged and racially diverse populations. Addressing these disparities requires multifaceted strategies including patient-level, institutional, and policy-level interventions.