Data cleaning: Difference between revisions

Jump to navigation Jump to search
287 bytes added ,  27 April 2015
m
Line 160: Line 160:
* {{code | code = 0001-01 00:00:00}} occurred in MySQL {{code | code = datetime}} type
* {{code | code = 0001-01 00:00:00}} occurred in MySQL {{code | code = datetime}} type
* {{code | code = 1900/1/0}} (converted time formatted value from 0), {{code | code = 1900/1/1}} (converted time formatted value from 1), {{code | code = 1900/1/2}} ... occurred in MS Excel
* {{code | code = 1900/1/0}} (converted time formatted value from 0), {{code | code = 1900/1/1}} (converted time formatted value from 1), {{code | code = 1900/1/2}} ... occurred in MS Excel
find normal values: Assume the data was generated in recent 10 years
* MySQL:
** {{code | code = SELECT * FROM `my_table` WHERE YEAR( CURDATE() ) - YEAR( `my_time_column`) <= 10;}}
** {{code | code = SELECT * FROM `my_table` WHERE  `my_time_column` >=  CURDATE() - INTERVAL 10 YEAR;}}


== duplicate data ==
== duplicate data ==

Navigation menu