Data cleaning: Difference between revisions

Jump to navigation Jump to search
254 bytes added ,  8 January 2018
Line 405: Line 405:


== Outlier / Anomaly detection ==
== Outlier / Anomaly detection ==
Anomaly detection
Anomaly detection of numeric data
* Median
* Median
* Range Checks
* Range Checks
* All values is event
* The values are the same even the column is totally different
Anomaly detection of categorical data (qualitative variable)
* Normal distribution e.g. The interest of audiences should be very different NOT coherent


Anomaly detection for time series data
Anomaly detection for time series data

Navigation menu