-
Z-Score or Extreme Value Analysis (parametric)
-
Probabilistic and Statistical Modeling (parametric)
-
Linear Regression Models (PCA, LMS)
-
Proximity Based Models (non-parametric)
-
Information Theory Models
-
High Dimensional Outlier Detection Methods (high dimensional sparse data)
-
How to check model is stable(validate performance)
Cross-validation
- Randomly split your entire dataset into k ”folds”
- For each k-fold in your dataset, build your model on folds of the dataset. Then, test the model to check the effectiveness for kth fold
- Record the error you see on each of the predictions
- Repeat this until each of the k-folds has served as the test set
- The average of your k recorded errors is called the cross-validation error and will serve as your performance metric for the model