Denormalization

Denormalization is the process of reversing normalization in a database. In the denormalization main we integrate these functions:

  • denormalizationStand
  • denormalizationLog
  • denormalizationCOS
  • denormalizationDivide
  • denormalizationMinMaxAndPercentile

Representation

Keys _id, name, stepId, domainInformation, dataset, project, processingInfo, creationTS, latestUpdateTS are unchanged and follow the classic of representation only dataSpecification keys are changed and are described as above:

{
  "dataSpecification": {
    "keyword": "denormalization",
    "valueType": {
      "dataType": "numerical",
      "structureType": "vector"
    },
    "meaning": "Vector of denormalized numerical features",
    "view": {
      "name": "view_02-02-2023_11:07:45",
      "id": "63db99816b332aad216c21e6"
    },
    "dataLocationId": "63db9d34f130060d1ce25ad5",
    "parameters": [
      {
        "columnName": "cp_a_1_bar",
        "preprocessingMethod": "fstand",
        "parameters": [
          {
            "name": "A",
            "value": 0
          }
        ],
        "outPutParameters": [
          {
            "name": "Means",
            "value": 26.98525809029851
          },
          {
            "name": "Standardization",
            "value": 41.44575354738084
          }
        ]
      },
      {
        "columnName": "cp_a_2_bar",
        "preprocessingMethod": "fstand",
        "parameters": [
          {
            "name": "A",
            "value": 0
          }
        ],
        "outPutParameters": [
          {
            "name": "Means",
            "value": 24.02843988227177
          },
          {
            "name": "Standardization",
            "value": 30.083848994236583
          }
        ]
      },
      {
        "columnName": "cp_r_bar",
        "preprocessingMethod": "fstand",
        "parameters": [
          {
            "name": "A",
            "value": 0
          }
        ],
        "outPutParameters": [
          {
            "name": "Means",
            "value": 22.13542299052926
          },
          {
            "name": "Standardization",
            "value": 33.69735064026772
          }
        ]
      }
    ]
  }
}

Observations

We save the observations in Parquet.
Payload exchange schema