Denormalization
Denormalization is the process of reversing normalization in a database. The main denormalization routine integrates the following functions:
- denormalizationStand
- denormalizationLog
- denormalizationCOS
- denormalizationDivide
- denormalizationMinMaxAndPercentile
Representation
The keys _id, name, stepId, domainInformation, dataset, project, processingInfo, creationTS and latestUpdateTS
are unchanged and follow the standard representation. Only the dataSpecification keys
change; they are described below:
- dataSpecification:
  - keyword: Its value is “denormalization”. See the mandatory keys.
  - valueType: The type of the processed data values; it therefore depends on them. See the mandatory keys.
  - meaning: Its value is “Vector of denormalized numerical features”. See the mandatory keys.
  - view: See the mandatory keys.
  - dataLocationId: See the mandatory keys.
Currently, only numerical vectors can be saved during raw data loading, so there is a single template at the moment:
 
{
  "dataSpecification": {
    "keyword": "denormalization",
    "valueType": {
      "dataType": "numerical",
      "structureType": "vector"
    },
    "meaning": "Vector of denormalized numerical features",
    "view": {
      "name": "view_02-02-2023_11:07:45",
      "id": "63db99816b332aad216c21e6"
    },
    "dataLocationId": "63db9d34f130060d1ce25ad5",
    "parameters": [
      {
        "columnName": "cp_a_1_bar",
        "preprocessingMethod": "fstand",
        "parameters": [
          {
            "name": "A",
            "value": 0
          }
        ],
        "outPutParameters": [
          {
            "name": "Means",
            "value": 26.98525809029851
          },
          {
            "name": "Standardization",
            "value": 41.44575354738084
          }
        ]
      },
      {
        "columnName": "cp_a_2_bar",
        "preprocessingMethod": "fstand",
        "parameters": [
          {
            "name": "A",
            "value": 0
          }
        ],
        "outPutParameters": [
          {
            "name": "Means",
            "value": 24.02843988227177
          },
          {
            "name": "Standardization",
            "value": 30.083848994236583
          }
        ]
      },
      {
        "columnName": "cp_r_bar",
        "preprocessingMethod": "fstand",
        "parameters": [
          {
            "name": "A",
            "value": 0
          }
        ],
        "outPutParameters": [
          {
            "name": "Means",
            "value": 22.13542299052926
          },
          {
            "name": "Standardization",
            "value": 33.69735064026772
          }
        ]
      }
    ]
  }
}
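As an illustration of how the outPutParameters in the template above could be used, here is a minimal Python sketch that reverses a standardization for one column. It rests on assumptions not stated in this document: that "fstand" computed z = (x - mean) / std, that "Means" holds the mean, and that "Standardization" holds the standard deviation. The role of the input parameter "A" is not specified here, so the sketch ignores it. The function names output_params and denormalize_fstand are hypothetical and are not the actual denormalizationStand implementation.

```python
from typing import Dict, List


def output_params(entry: dict) -> Dict[str, float]:
    """Flatten the outPutParameters list of one column entry into a name -> value dict."""
    return {p["name"]: p["value"] for p in entry["outPutParameters"]}


def denormalize_fstand(values: List[float], entry: dict) -> List[float]:
    """Reverse a standardization, assuming z = (x - mean) / std was applied."""
    params = output_params(entry)
    mean = params["Means"]
    std = params["Standardization"]  # assumption: this is the standard deviation
    return [z * std + mean for z in values]


# One column entry copied from the template above.
entry = {
    "columnName": "cp_a_1_bar",
    "preprocessingMethod": "fstand",
    "parameters": [{"name": "A", "value": 0}],
    "outPutParameters": [
        {"name": "Means", "value": 26.98525809029851},
        {"name": "Standardization", "value": 41.44575354738084},
    ],
}

# Standardized values 0, 1 and -1 map back to mean, mean + std and mean - std.
restored = denormalize_fstand([0.0, 1.0, -1.0], entry)
```

Under these assumptions, a standardized value of 0 is restored to the stored mean of the column, and the same function can be applied to each entry of the parameters list in turn.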
Observations
We save the observations in Parquet.