Core concepts
This chapter gathers HephIA’s concepts glossary.
Along your data crunching journeys using HephIA product you will need to manipulate many concepts, some of them have already been established by the community when others are specific to us.
Some of these concepts exist on both side, they often share the same semantic. We simply add the little something that allow you to dig down into your problematics. To make it easier for you will use following syntactic rules to distinguish :
- Well established concepts on both theoretical and / or technical sides used Roman style.
- HephIA own concepts are in italic.
- Concepts merging well established and HephIA ones are in bold.
Page organization :
Concepts
- Id
- Model
- Instance
- Predictor // is there exists a general concept of predictor, i do not believe it, but check and/or ask others
- Evaluator // is there exists a general concept of predictor, i do not believe it, but check and/or ask others
- Evaluation // is there exists a general concept of predictor, i do not believe it, but check and/or ask others
- Pipe // is there exists a general concept of predictor, i do not believe it, but check and/or ask others
- Algorithm
- Clustering
- Hard clustering
- Soft clustering
- Dimensions reduction
- Outliers detection
- Quality indices
- Internal
- External
- Vectorization
- Categorical binarization // It is properly define theoretically but done technically by us, so which syntax
- Image
- Text
- Audio
- Video
- Monovariate time series
- Multivariate time series
- Clustering
- Scalability
- Algorithmic complexity
- Time
- Space
- Algorithmic complexity
- DAG : Directed Acyclic Graph
Minimal required concepts
The minimal required concepts’set include for few various examples :
Clustering
- Cluster
- Dissimilarity measure
- ClusterId
Dimensions reduction
Outliers detection
Predictor
At least algorithms you want to apply
Quick overlook of some important concepts
- Model
- TraversableModel
- TraversableMemory
- GlobalModel
- GlobalMemory
- TreeModel
- TreeMemory
- GTreeMemory
- GTreeModel
- Instance
- InstanceMemory
- List, Vector, Array, ParVector, ParArray, RDD InstanceMemory/Instance
- GInstanceMemory
- GInstance
- List, Vector, Array, ParVector, ParArray, RDD InstanceMemory/Instance
- Extension
- GExtension
- Clustering
- ClusteringEntity
- Discrete Distribution
- Hard Clustering
- HardClusteringEntity
- Constant
- Aggregator
- HardClusteringEntity
- Soft Clustering
- Categorical Clustering
- CategoricalDistribution
- Categorical Clustering
- ClusteringEntity
- Cluster
- GInstanceMemory
- ProtoCluster
- Clusters
- ProtoClusters
- Pipe
- Algorithm1
- TraversableAlgorithm
- GTraversableAlgorithm
- InstanceAlgorithm
- …
- ClusteringAlgorithm
- ClustersAlgorithm
- KMeans
- IterativeAlgo
- TraversableAlgorithm
- Predictor
- Evaluator
- Evaluation
- Internal / External
- FuncData
- FuncDataExtension
- Outlier detection
- Dimension reduction
- Matrix
- NN
- KNN
- ANN
- Hash
- Preprocessing
- Gradient ascent
- Vectorizer