KAMILA

KAy-means for MIxed LArge data(English)

algorithm that enables simultaneous handling of continuous and categorical data. This method has the capacity to manage mixed data types while maintaining the data’s native scale, integrating k-means for continuous variables with k-modes for categorical variables. The optimal number of clusters can be identified using multiple validation methods: silhouette analysis (cluster package), within-cluster sum of squares, and gap statistics
ICM, SHAP
Statistics, Classification, Cardiology
https://doi.org…554-025-03541-4