CAH

Classification Ascendante Hiérarchique (CAH)

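Classification Ascendante Hiérarchique (CAH), known in English as agglomerative hierarchical clustering, is a clustering algorithm that builds a hierarchy of clusters from the bottom up: each observation starts in its own cluster, and the closest clusters are successively merged into larger ones.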
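The merging procedure can be illustrated with a minimal pure-Python sketch. It assumes Euclidean distance and single linkage (distance between the closest pair of members); the linkage criterion actually used by the hierarchicalClustering process is not stated here, and the function name `agglomerative` is hypothetical.

```python
import math
from itertools import combinations


def agglomerative(points, n_clusters):
    """Merge the two closest clusters until n_clusters remain.

    Illustrative only: single linkage and Euclidean distance are assumptions.
    """
    # Every point starts in its own cluster (clusters hold point indices).
    clusters = [[i] for i in range(len(points))]

    def linkage(a, b):
        # Single linkage: distance between the closest pair of members.
        return min(math.dist(points[i], points[j]) for i in a for j in b)

    merges = []  # merge history, i.e. the hierarchy from bottom to top
    while len(clusters) > n_clusters:
        # Pick the pair of clusters with the smallest linkage distance.
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda pair: linkage(clusters[pair[0]], clusters[pair[1]]),
        )
        merges.append((clusters[a], clusters[b]))
        merged = clusters[a] + clusters[b]
        clusters = [c for k, c in enumerate(clusters) if k not in (a, b)]
        clusters.append(merged)
    return clusters, merges


points = [(0.0, 0.0), (0.0, 1.0), (5.0, 5.0), (5.0, 7.0), (20.0, 20.0)]
clusters, merges = agglomerative(points, n_clusters=2)
```

With these sample points, the four nearby points end up in one cluster and the distant point stays alone. This brute-force version recomputes every pairwise linkage at each step, so it is only suitable for small inputs.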

Hyper-parameters

  • maxNbClusters: the maximum number of clusters.
  • numberOfClusters: the minimum number of clusters.
  • seed: the random seed; running the process with the same seed value yields the same results on every execution.
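A quick client-side sanity check of these values could look like the sketch below. The helper `check_hyper_parameters` is hypothetical, and the rules it enforces (integer values, numberOfClusters at least 1 and not greater than maxNbClusters) are assumptions inferred from the descriptions above, not documented constraints of the service.

```python
def check_hyper_parameters(hp):
    """Return a list of problems found in a hyperParameters dict (empty if none)."""
    problems = []
    for key in ("maxNbClusters", "numberOfClusters", "seed"):
        value = hp.get(key)
        # bool is a subclass of int in Python, so reject it explicitly.
        if not isinstance(value, int) or isinstance(value, bool):
            problems.append(f"{key} must be an integer")
    if not problems:
        # Assumed rule: the minimum must be positive and not exceed the maximum.
        if hp["numberOfClusters"] < 1:
            problems.append("numberOfClusters must be at least 1")
        if hp["numberOfClusters"] > hp["maxNbClusters"]:
            problems.append("numberOfClusters must not exceed maxNbClusters")
    return problems


ok = check_hyper_parameters({"maxNbClusters": 8, "numberOfClusters": 2, "seed": 1})
bad = check_hyper_parameters({"maxNbClusters": 2, "numberOfClusters": 8, "seed": 1})
```

Here `ok` is an empty list, while `bad` reports that the minimum exceeds the maximum.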

Example JSON payload template:

{
  "processingKeyword": "hierarchicalClustering",
  "customer": "hephia",
  "name": "hephia_hierarchicalClustering",
  "creationTS": 1675333316,
  "latestUpdateTS": 1675333316,
  "status": "1",
  "dataLocations": [
    {
      "role": "parquet",
      "dataLocationId": "63db8494eac14d53c1dce3ed"
    }
  ],
  "processingContext": {
    "processingName": "hierarchicalClustering",
    "editionContext": "notebook",
    "callingContext": "ds-lab",
    "view": {
      "name": "view_02-02-2023_09:29:56",
      "id": "63db82947926d269918c5113"
    },
    "dataset": {
      "name": "cii_0006"
    },
    "project": {
      "id": 190242736,
      "name": "datasets"
    }
  },
  "stepId": 1,
  "hyperParameters": {
    "maxNbClusters": 8,
    "numberOfClusters": 2,
    "seed": 1
  }
}
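A payload of this shape can be assembled programmatically, as in the following sketch. The helper `build_payload` is hypothetical. It assumes that the two timestamp fields are Unix epoch seconds and that `name` follows the pattern customer_processingKeyword, both inferred from the single example above. The `processingContext` block is omitted for brevity and would need to be added for a real request.

```python
import json
import time


def build_payload(customer, data_location_id, hyper_parameters, now=None):
    """Assemble a dict shaped like the template above (subset of its fields)."""
    # Assumption: timestamps are Unix epoch seconds.
    ts = int(time.time()) if now is None else int(now)
    keyword = "hierarchicalClustering"
    return {
        "processingKeyword": keyword,
        "customer": customer,
        # Assumption: name is "<customer>_<processingKeyword>".
        "name": f"{customer}_{keyword}",
        "creationTS": ts,
        "latestUpdateTS": ts,
        "status": "1",
        "dataLocations": [
            {"role": "parquet", "dataLocationId": data_location_id}
        ],
        "stepId": 1,
        "hyperParameters": dict(hyper_parameters),
    }


payload = build_payload(
    "hephia",
    "63db8494eac14d53c1dce3ed",
    {"maxNbClusters": 8, "numberOfClusters": 2, "seed": 1},
    now=1675333316,
)
body = json.dumps(payload, indent=2)  # JSON text ready to send or store
```

Passing `now` explicitly keeps the output reproducible; leaving it out stamps the payload with the current time.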