Data cleaning: Difference between revisions

Jump to navigation Jump to search
Line 293: Line 293:
[[Validate the datetime value]]
[[Validate the datetime value]]


=== Time data: Data was generated in 10 years ===
=== Time data: Data was generated in N years ===
Define the abnormal values of the time data ([http://en.wikipedia.org/wiki/Time_series time series]) if they
Define the abnormal values of the time data ([http://en.wikipedia.org/wiki/Time_series time series])
* were generated 10 years before or
* Verfiy the data were generated in 10 years
* newer than today
* Verfiy the data were not newer than today
* Verfiy the year of data were not {{kbd | key=1900}} if the data were imported from Microsoft Excel file. Datevalue<ref>[https://support.microsoft.com/zh-tw/office/datevalue-%E5%87%BD%E6%95%B8-df8b07d4-7761-4a93-bc33-b7471bbff252 DATEVALUE 函數 - Office 支援]</ref> was started from the year {{kbd | key=1900}}.
* Verfiy the value of data were not {{kbd | key=0000-00-00 00:00:00}}


List of the possible abnormal values:
List of the possible abnormal values: