Count number of characters: Difference between revisions
Jump to navigation
Jump to search
mNo edit summary |
mNo edit summary |
||
Line 1: | Line 1: | ||
Counting number of characters in different approaches | Counting number of characters (or bytes) in different approaches | ||
<table border="1" class="wikitable sortable"> | <table border="1" class="wikitable sortable"> |
Revision as of 14:41, 11 March 2020
Counting number of characters (or bytes) in different approaches
String example | Number of characters | Number of bytes |
---|---|---|
fox | 3 | 3 |
The quick brown fox jumps over the lazy dog | 43 | 43 |
狐 | 1 | 3 |
象 | 1 | 3 |
🐘 | 1 | 4 |
敏捷的棕毛狐狸從懶狗身上躍過 | 14 | 28 |
PHP
- PHP: strlen & PHP mb_strlen function
// number of characters echo mb_strlen("狐", 'UTF-8') . PHP_EOL; // return 1 echo mb_strlen("《王大文 Dawen》", 'UTF-8') . PHP_EOL; // return 11 // string length (number of bytes) echo strlen("狐") . PHP_EOL; // return 3 echo strlen("《王大文 Dawen》") . PHP_EOL; // return 21
MySQL
- MySQL: MySQL CHAR_LENGTH() function
// number of characters SELECT CHAR_LENGTH("狐"); /* return 1 */ SELECT CHAR_LENGTH("《王大文 Dawen》"); /* return 11 */ // number of bytes SELECT LENGTH("狐"); /* return 3 */ SELECT LENGTH("《王大文 Dawen》"); /* return 21 */
- MySQL :: MySQL 8.0 Reference Manual :: 11.4.1 The CHAR and VARCHAR Types e.g. VARCHAR(5) or CHAR(5) means can hold up to 5 characters.
SQLite
Length function
SELECT LENGTH("狐"); /* return 1 */ SELECT LENGTH("《王大文 Dawen》"); /* return 11 */
Excel
- Excel: LEN, LENB functions / LEN、LENB 函數 Result of the function LENB is not the same with the result in other programming language.
// number of characters =LEN("狐") // return 1 =LEN("《王大文 Dawen》") // return 11 // number of bytes =LENB("狐") // return 2 =LENB("《王大文 Dawen》") // return 16
- Calculate String Length Online
- string - How many bytes does one Unicode character take? - Stack Overflow
BASH
Step1: Using Linux wc command
# print the character counts of txt files (contains the count of return symbol) wc -m *.txt # print the newline counts of txt files wc -l *.txt # print the whitespaces counts of txt files grep -c ' ' *.txt
Step2: Check the Return symbol
- e.g. \r\n costs 2 characters
Step3: final formula
Number of characters (not contains the return symbol) = result of wc -m *.txt - result of wc -m *.txt * 2 - 1 (the last blank line costs 1 character) - number of the whitespaces