Data cleaning: Difference between revisions

354 bytes added ,  27 April 2015
* MySQL: {{code | code = SELECT * FROM `my_table` WHERE `my_column` REGEXP '^[0-9]+$'}}<ref>[http://stackoverflow.com/questions/14343767/mysql-regexp-with-and-numbers-only regex - Mysql REGEXP with . and numbers only - Stack Overflow]</ref>
* Excel: [http://www.techonthenet.com/excel/formulas/isnumber.php ISNUMBER Function]
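Outside the database, the same digits-only test can be sketched in Python. This is a minimal illustration, not part of the cited answer: the function names {{code | code = is_digits_only}} and {{code | code = digits_only_rows}} are hypothetical, and the pattern simply mirrors the {{code | code = ^[0-9]+$}} expression used in the MySQL query above.

```python
import re

# Same character class as the MySQL REGEXP above; fullmatch() anchors both ends.
DIGITS_ONLY = re.compile(r"[0-9]+")


def is_digits_only(value):
    """Return True when value is a non-empty string made only of ASCII digits."""
    return isinstance(value, str) and DIGITS_ONLY.fullmatch(value) is not None


def digits_only_rows(values):
    """Keep the values that pass the digits-only check (hypothetical helper)."""
    return [v for v in values if is_digits_only(v)]
```

For example, {{code | code = digits_only_rows(["123", "12.5", "abc"])}} keeps only the first value, because the pattern rejects the decimal point and the letters.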
=== time data ===
Values that should be treated as abnormal if the data was generated in recent years:
* {{code | code = 0001-01 00:00:00}} appearing in a MySQL {{code | code = datetime}} column
* {{code | code = 1900/1/0}} (the number 0 formatted as a date), {{code | code = 1900/1/1}} (the number 1 formatted as a date), {{code | code = 1900/1/2}} ... appearing in MS Excel
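A check for such values could be sketched in Python as follows. This is only an illustration under stated assumptions: the cutoff date {{code | code = EARLIEST_PLAUSIBLE}}, the function name {{code | code = is_suspicious_timestamp}}, and the input format {{code | code = YYYY-MM-DD HH:MM:SS}} are all hypothetical and should be adapted to the data set at hand.

```python
from datetime import datetime

# Assumed cutoff: anything earlier is implausible for recently generated data.
EARLIEST_PLAUSIBLE = datetime(1990, 1, 1)


def is_suspicious_timestamp(text, earliest=EARLIEST_PLAUSIBLE,
                            fmt="%Y-%m-%d %H:%M:%S"):
    """Flag timestamps that cannot be parsed or that fall before the cutoff."""
    try:
        parsed = datetime.strptime(text, fmt)
    except ValueError:
        # Malformed or impossible dates are flagged as well.
        return True
    return parsed < earliest
```

With the default cutoff, a value such as {{code | code = 1900-01-01 00:00:00}} is flagged, while {{code | code = 2015-04-27 10:00:00}} is not.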


== duplicate data ==
