KAMILA

KAy-means for MIxed LArge data(English)

  • algorithm that enables simultaneous handling of continuous and categorical data. This method has the capacity to manage mixed data types while maintaining the data’s native scale, integrating k-means for continuous variables with k-modes for categorical variables. The optimal number of clusters can be identified using multiple validation methods: silhouette analysis (cluster package), within-cluster sum of squares, and gap statistics
  • ICM, SHAP
  • Statistics, Classification, Cardiology
  • https://doi.org…554-025-03541-4