Regular expression

Latest revision as of 11:18, 29 February 2024

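Regular expressions (正規表示法) let you quickly search for, or replace, strings that match a given rule when processing text files. Matching is normally done line by line^[1]. In Chinese the term is also rendered as 正規表示式, 正規表達式, 正則表達式, 正規運算式, 規則運算式, and 常規表示法^[2].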

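Stuck? Try debugging the pattern yourself with one of the online tools that explain regex syntax. You can also ask on the PTT RegExp board (看板 RegExp 文章列表 - 批踢踢實業坊) or on another Q&A service.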

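Quick reference

Notes: (1) the blue-shaded part of each sample is the text that matches the rule; (2) the same rule can usually be written in several equivalent ways.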

Rule	Sample	Opposite rule	Sample
Any single character (including a space, but not a newline) .	What Does the Fox Say? 12 狐狸怎叫 34
Any character (including a space), zero or one time .? = .{0,1}	What Does the Fox Say? 12 狐狸怎叫 34
Any characters (including spaces), zero or more times .* = .{0,}	What Does the Fox Say? 12 狐狸怎叫 34
Any characters (including spaces), one or more times .+ = .{1,}	What Does the Fox Say? 12 狐狸怎叫 34
One or more whitespace characters (spaces or newlines) \s+	What Does the Fox Say? 12 狐狸怎叫 34	One or more non-whitespace characters [^\s]+ = [^\s]{1,} = [\S]+ ([^ ]+ is similar but excludes only the space character)	What Does the Fox Say? 12 狐狸怎叫 34
One or more ASCII characters (letters, digits, spaces, and so on) demo^[3] [\x00-\x7F]+ or [[:ascii:]]+ ^[4]	What Does the Fox Say? 12 狐狸怎叫 34	One or more non-ASCII characters, for example Chinese [^\x00-\x7F]+	What Does the Fox Say? 12 狐狸怎叫 34
One or more letters, digits, or underscores ( _ ), not including spaces (demo) [\w]+ = [a-zA-Z0-9_]+ (in PHP, adding the u modifier makes \w match Chinese characters too)	What Does the Fox Say? 12 狐狸怎叫 _34	One or more characters that are not letters, digits, or underscores ( _ ) \W+ = [^a-zA-Z0-9_]+	demo
One or more digits (not including spaces) [\d]+ = [0-9]+	What Does the Fox Say? 12 狐狸怎叫 34	One or more non-digit characters (including spaces) [^\d]+ = [^0-9]+ = \D+	What Does the Fox Say? 12 狐狸怎叫 34
One or more Chinese characters [\p{Han}]+ (demo, detailed explanation)	What Does the Fox Say? 12 狐狸怎叫 34	One or more characters that are not Chinese characters [^\p{Han}]+ (demo)
Lines that start with 狐狸 ^狐狸.*$ ^[5]	狐狸怎叫 34 What Does the Fox Say? 柴犬怎叫 What Does the shiba inu say?	Lines that do not start with 狐狸 ^(?!狐狸).*$ ^[6]	狐狸怎叫 34 What Does the Fox Say? 柴犬怎叫 What Does the shiba inu say?
Lines that end with 怎叫 ^.*怎叫$	What Does the Fox Say? 12 狐狸怎叫 34 What Does the shiba inu say? 柴犬怎叫	Lines that do not end with 怎叫 .*(?<!怎叫)$ ^[7]	What Does the Fox Say? 12 狐狸怎叫 34 What Does the shiba inu say? 柴犬怎叫
Lines that contain 狐狸 ^.*狐狸.*$ or (狐狸) (demo)	What Does the Fox Say? 12 狐狸怎叫 34 What Does the shiba inu say? 柴犬怎叫	Lines that do not contain 狐狸 (demo) ^((?!狐狸).)*$	What Does the Fox Say? 12 狐狸怎叫 34 What Does the shiba inu say? 柴犬怎叫
Boolean AND: lines that contain both 狐狸 and 叫 (demo)^[8] (?=.*狐狸)(?=.*叫).* or 狐狸.*叫|叫.*狐狸	What Does the Fox Say? 12 狐狸怎叫 34 What Does the Fox Say? 12 不叫狐狸 34 What Does the shiba inu say? 柴犬怎叫
Boolean OR: lines that contain 狐狸 or 叫 (demo) .*(狐狸|叫).*	What Does the Fox Say? 12 狐狸怎叫 34 What Does the shiba inu say? 柴犬怎叫 What Does the shiba inu say? 柴犬怎了	Boolean logic: lines that contain neither 狐狸 nor 柴犬 ^((?!狐狸|柴犬).)*$	What Does the Fox Say? 12 狐狸怎叫 34 What Does the shiba inu say? 柴犬怎叫 What Does the Husky say? 哈士奇怎叫
Boolean NOT: lines that contain 柴犬 but not 狐狸 (demo)^[9] ^((?!狐狸).)*(柴犬).*$ = ^(柴犬).*((?!狐狸).)*$ = (柴犬).*((?!狐狸).)* (these give wrong results when a line contains both 狐狸 and 柴犬; ^(?!.*狐狸).*柴犬.*$ avoids that)	What Does the Fox Say? 12 狐狸怎叫 34 What Does the shiba inu say? 柴犬怎叫
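The Boolean AND and NOT rows can be tried out with a short Python sketch. The sample lines and variable names below are made up for illustration; the two patterns use lookaheads anchored at the start of the line, which is one common way to express these rules.

```python
import re

# Hypothetical sample lines, modelled on the samples in the table.
lines = [
    "What Does the Fox Say? 12",
    "狐狸怎叫 34",
    "不叫狐狸 34",
    "柴犬怎叫",
    "狐狸和柴犬",
]

# AND: both 狐狸 and 叫 appear somewhere on the line, in either order.
both = re.compile(r"^(?=.*狐狸)(?=.*叫).*$")
# NOT: 柴犬 appears, and 狐狸 appears nowhere on the line.
shiba_only = re.compile(r"^(?!.*狐狸).*柴犬.*$")

and_matches = [s for s in lines if both.search(s)]
not_matches = [s for s in lines if shiba_only.search(s)]

print(and_matches)
print(not_matches)
```

Because each lookahead scans the whole line from the start, the order in which the words appear does not matter.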

Regular expression online tools

Websites for testing regular expression syntax

RegEx101 "Online regex tester and debugger: PHP, PCRE, Python, Golang and JavaScript" (example). Explains the syntax of your pattern. Tutorial: RegEx101正規表示法線上產生器，有沒有選到立馬告訴你|梅問題．教學網
RegExr: Learn, Build, & Test RegEx (example). Explains the syntax of your pattern. Tutorial: RegExr: 功能強大的正規式撰寫協助工具
Regexper: explains the syntax as a diagram, e.g. \d{3}(.*)
Regulex: JavaScript Regular Expression Visualizer: explains the syntax as a diagram, e.g. ^(a|b)*?$
Rubular: a Ruby regular expression editor and tester (example)
PHP Live Regex [Last visited: 2014-11-25]
Regex Tester and Debugger Online - Javascript, PCRE, PHP [Last visited: 2016-01-07]
Regular Expression (RegExp) in JavaScript - 石頭閒語 [Last visited: 2017-11-14]

Examples

Regular Expression Library: pattern examples contributed by users

Cases

Replace newlines with commas

Turn a list of email addresses into a recipient list that an email client can use

Before
[email protected]
[email protected]
[email protected]

After
[email protected],[email protected],[email protected]

Option 1: Sublime Text, EmEditor

This syntax works in Sublime Text and EmEditor (the steps below are for EmEditor).

Menu: Search -> Replace
click "Use Regular Expression"
1. Find: \n (the newline character. Windows ends lines with \r\n, while Linux and macOS use \n, so \n is the character the two share. Classic Mac OS, before OS X, used \r.)
2. Replace with: ,
click "Replace all"
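Outside an editor, the same replacement can be scripted. The following is a minimal Python sketch; the function name `join_lines` and the sample strings are illustrative assumptions. It matches `\r?\n` so that both Windows and Unix line endings are handled.

```python
import re

def join_lines(text: str, separator: str = ",") -> str:
    """Replace every line break (\\r\\n or \\n) with `separator`."""
    # strip() drops a trailing newline so the result does not end with a separator.
    return re.sub(r"\r?\n", separator, text.strip())

unix_text = "Elmo\nEmie\nGranny Bird\n"
windows_text = "Elmo\r\nEmie\r\nGranny Bird\r\n"

print(join_lines(unix_text))           # comma only
print(join_lines(windows_text, ", "))  # comma plus a space
```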

Remove the line breaks and separate the lines with commas

// before
Elmo
Emie
Granny Bird

// after
Elmo, Emie, Granny Bird

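Method: use Sublime Text or EmEditor.

Find what: \n
Replace with: , followed by a space (this example separates the lines with a comma plus a space; to use another separator, such as the ideographic comma, use Replace with: 、)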

Restore comma-separated text to one item per line and remove the separator (,)

// before
Elmo, Emie, Granny Bird

// after
Elmo
Emie
Granny Bird

Method: use Sublime Text or EmEditor. Each output line may start with a space.

Find what: ([^,]+),
Replace with: \1\n
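As a scripted counterpart, here is a small Python sketch that splits on a comma together with any whitespace after it, so the leading space mentioned above does not appear. The function name `split_items` is an assumption made for this example.

```python
import re

def split_items(text: str) -> list[str]:
    # Split on a comma plus any whitespace after it, and skip empty items.
    return [item for item in re.split(r",\s*", text.strip()) if item]

lines = "\n".join(split_items("Elmo, Emie, Granny Bird"))
print(lines)
```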

Option 2: Notepad++

Using Notepad++

Menu: Search -> Replace
Search Mode: select "Extended" (not "Regular expression")
1. Find what: \n (newline)
2. Replace with: ,
click "Replace All"

Related: How To Replace Line Ends, thus changing the line layout [Last visited: 2010-01-27]

Option 3: Microsoft Word

Using Microsoft Word 2002

Menu: Edit -> Replace
Enable the extended search mode
1. Find what: ^p (paragraph mark)
2. Replace with: ,
click "Replace All"

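Option 4: sed on Linux

sed 's/string to replace/new string/g' old.filename > new.filename ^[10]

A plain s/\n/.../ does not work, because sed reads one line at a time and the newline is not part of the line it sees. The prefix :a;N;$!ba first loops over the input and appends every line to the pattern space; s/\n/; /g then replaces each newline with "; ":

sed ':a;N;$!ba;s/\n/; /g' old.filename > new.filename ^[11]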

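Option 5: an editor with hexadecimal (HEX) editing

Use an editor that supports hexadecimal (HEX) editing, for example iHex - Hex Editor for Mac.

Menu: Edit -> Find
Find: 0A (the newline character, LF)
Replace: 2C 20 (2C is a comma and 20 is a space)
Save the file

Related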

Hex Dictionary | Convert Hex / Hexadecimal Numbers to Binary and Decimal

Find IP address (IPv4)

For Notepad++ v5.9.5

Menu: Search -> Replace
Search Mode: select "Regular expression"
1. Find what: \d\d?\d?\.\d\d?\d?\.\d\d?\d?\.\d\d?\d?

Note: this version of Notepad++ does not support the {n} quantifier syntax.

For Sublime Text v. 3.2.21

Find: (?:\d{1,3}\.){3}\d{1,3}
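The pattern above checks only the shape of an address, so it also accepts values such as 999.1.1.1. The following Python sketch shows one possible way to find candidates with that pattern and then discard invalid ones using the standard `ipaddress` module. The function name `find_ipv4`, the sample text, and the added `(?<!\d)` / `(?!\d)` guards are assumptions of this example, not part of the original pattern.

```python
import ipaddress
import re

# The pattern from the text, with guards so a match cannot start or end inside a longer number.
IPV4_CANDIDATE = re.compile(r"(?<!\d)(?:\d{1,3}\.){3}\d{1,3}(?!\d)")

def find_ipv4(text: str) -> list[str]:
    """Return the dotted-quad strings in `text` that are valid IPv4 addresses."""
    found = []
    for candidate in IPV4_CANDIDATE.findall(text):
        try:
            ipaddress.IPv4Address(candidate)  # raises ValueError if an octet is out of range
        except ValueError:
            continue
        found.append(candidate)
    return found

log = "ok 192.168.0.1, bad 999.1.1.1, also 10.0.0.254"
print(find_ipv4(log))
```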

References:

How to Find or Validate an IP Address [Last visited: 2019-06-05]
SourceForge.net: Notepad++: Regular expression for IP addresses
regex - Regular expression that matches valid IPv6 addresses - Stack Overflow [Last visited: 2015-08-10]

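Remove the black squares shown in Notepad for plain-text files (UNIX line endings, LF)

Using Notepad++

Menu: Search -> Replace
Search Mode: select "Extended"
1. Find what: \n\n (note: two LF characters)
2. Replace with: \r\n (note: CR followed by LF)

After this, the black squares no longer appear when the file is opened in Windows Notepad.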

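Wrap each item in quotation marks

Wrap each array element in quotation marks

Elmo, Emie, Granny Bird, Herry Monster, 喀喀獸
becomes
'Elmo', 'Emie', 'Granny Bird', 'Herry Monster', '喀喀獸'

Method 1: PHP. This does not work if an element contains a newline.

$users = array('Elmo', 'Emie', 'Granny Bird', 'Herry Monster', '喀喀獸');
// Wrap each element in single quotes, then join with a comma and a space
$result = implode(", ", preg_replace('/^(.*?)$/', "'$1'", $users));
echo $result . PHP_EOL;

// Wrap each element in double quotes, then join with a comma and a space
$result = implode(", ", preg_replace('/^(.*?)$/', "\"$1\"", $users));
echo $result . PHP_EOL;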

Thanks, Joshua! More on PHP - Wrap Implode Array Elements in Quotes » Me Like Dev

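Method 2: Sublime Text or EmEditor

Find: ([^\s|,]+) (this also splits on spaces, so a multi-word element such as Granny Bird is quoted word by word; ([^,\s][^,\n]*) keeps such elements whole)
Replacement, depending on the quote style
- Wrap each element in single quotes: Replace with: '\1'
- Wrap each element in double quotes: Replace with: "\1"

Method 3: Notepad++, with the "Regular expression" search mode enabled

Find: ([^\s|,]+)
Replacement, depending on the quote style
- Wrap each element in single quotes: Replace with: '$1'
- Wrap each element in double quotes: Replace with: "$1"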
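For comparison, here is a minimal Python sketch of the same idea. It uses a pattern that keeps multi-word elements such as Granny Bird together, which is a variation on the editor pattern above rather than the same expression. The names `ELEMENT` and `quote_elements` are made up for this example.

```python
import re

# One element: starts with a character that is neither a comma nor whitespace,
# then runs up to the next comma or line break.
ELEMENT = re.compile(r"[^,\s][^,\n]*")

def quote_elements(text: str, quote: str = "'") -> str:
    """Wrap every comma-separated element of `text` in `quote`."""
    return ELEMENT.sub(lambda m: f"{quote}{m.group(0).rstrip()}{quote}", text)

print(quote_elements("Elmo, Emie, Granny Bird, Herry Monster, 喀喀獸"))
print(quote_elements("Elmo, Emie", '"'))
```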

Wrap each line in quotation marks and remove the line breaks

// before
Elmo
Emie
Granny Bird

// after
'Elmo', 'Emie', 'Granny Bird'

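Method 1: Sublime Text, Notepad++, or EmEditor. This method allows for one or more spaces before or after the text on each line.

On macOS

Find what: (\S+)(\s?)+$\n
Replace with: '\1', 
(to wrap in double quotes instead, use Replace with: "\1", )

On Windows, change the newline \n to \r\n

Find what: (\S+)(\s?)+$\r\n
Replace with: '\1', 
(to wrap in double quotes instead, use Replace with: "\1", )

Method 2: Sublime Text or EmEditor. This method does not handle spaces at the end of a line.

Find what: (.*)$\n or (\S+)$\n or (\S+)\n
Replace with: '\1', 

(\S+) captures only the last word of a line, so use (.*)$\n for lines that contain spaces, such as Granny Bird. The result ends with a trailing comma, which has to be removed by hand.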

More details on the page add quotation at the start and end of each line.
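The line-to-quoted-list conversion can also be sketched in Python. The pattern below trims spaces and tabs on both sides of each line and keeps multi-word lines whole; the names `LINE_CONTENT` and `quote_lines` are assumptions for this example.

```python
import re

# Capture the text of each non-blank line without surrounding spaces, tabs, or a trailing \r.
LINE_CONTENT = re.compile(r"^[ \t]*(\S.*?)[ \t\r]*$", re.MULTILINE)

def quote_lines(text: str, quote: str = "'") -> str:
    """Quote every non-blank line and join the results with a comma and a space."""
    return ", ".join(f"{quote}{item}{quote}" for item in LINE_CONTENT.findall(text))

print(quote_lines("Elmo\n Emie \r\nGranny Bird  \n"))
```

Joining the captured items, instead of replacing each line break, avoids the trailing comma that the editor replacement leaves behind.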

Restore quoted text to one item per line and remove the separator (,)

// before
'Elmo', 'Emie', 'Granny Bird'

// after
Elmo
Emie
Granny Bird

Method: use Sublime Text or EmEditor. This method allows for a space after each separator.

Find what: '([^',]+)',?\s?
Replace with: \1\n
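In a script, it is often simpler to extract the quoted items than to rewrite the separators. A minimal Python sketch, with illustrative variable names:

```python
import re

quoted = "'Elmo', 'Emie', 'Granny Bird'"

# Capture whatever sits between each pair of single quotes.
items = re.findall(r"'([^']*)'", quoted)
lines = "\n".join(items)
print(lines)
```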

Wrap spreadsheet cell values in double quotes

Add double quotes before and after text cell values in Google Sheets

Find non-ASCII characters (Chinese and other non-English text)

Find non-ASCII characters in Google sheet

Works with: the regular expression functions of Google Sheets, such as REGEXMATCH, REGEXEXTRACT, and REGEXREPLACE, and with Notepad++ search

[^\x00-\x7F]+

Find non-ASCII characters in LibreOffice

Works with: the LibreOffice REGEX function^[12] and the Multi-Rename Tool of Total Commander^[13]^[14]

[^\u0000-\u007F]+
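The same character class works in Python's `re` module. The sketch below, with made-up variable names, lists every run of non-ASCII characters and also shows the built-in `str.isascii()` check for the simple yes/no case. ASCII covers code points 0x00 to 0x7F, so the class ends at `\x7F`.

```python
import re

NON_ASCII_RUN = re.compile(r"[^\x00-\x7F]+")

sample = "What Does the Fox Say? 12 狐狸怎叫 34"
runs = NON_ASCII_RUN.findall(sample)  # every run of non-ASCII characters
only_ascii = sample.isascii()         # quick yes/no check, no regex needed

print(runs, only_ascii)
```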

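Find Chinese characters in Google sheet

Example: if A2 contains any Chinese character, the cell shows 中文 (Chinese); otherwise it shows 英文 (English):

=IF(REGEXMATCH(A2, "[\一-\龥]"), "中文", "英文")

Google Sheets rejects the following forms with the error "... is not a valid regular expression.":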

[\u4e00-\u9fa5]
[^\u4e00-\u9fa5]
[\p{Script=Hans}]
[\p{Han}]
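For readers who want to reproduce the spreadsheet formula in a script, here is a minimal Python sketch. Python's standard `re` module has no `\p{Han}` either, so the example uses the explicit range U+4E00 to U+9FA5, the same range the formula spells out with 一 and 龥. The function name `label` is an assumption for this example.

```python
import re

# Same range as the spreadsheet formula: U+4E00 (一) to U+9FA5 (龥).
CHINESE_CHAR = re.compile(r"[\u4e00-\u9fa5]")

def label(value: str) -> str:
    """Return "中文" if `value` contains a Chinese character, else "英文"."""
    return "中文" if CHINESE_CHAR.search(value) else "英文"

print(label("狐狸怎叫 34"))
print(label("What Does the Fox Say?"))
```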

Find Chinese characters in MySQL

Find rows whose `column_name` value contains Chinese characters. Works with: MySQL^[15]^[16]

SELECT `column_name`
FROM `table_name`
WHERE HEX(`column_name`) REGEXP '^(..)*(E[4-9])';

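Explanation

HEX() turns the value into a string of two hexadecimal digits per byte. The pattern '^(..)*(E[4-9])' starts at the beginning of that string (^), skips zero or more whole bytes ((..)*), and then requires a byte from E4 to E9.
The ^(..)* prefix forces E[4-9] to sit on a byte boundary, so the pattern cannot match an E followed by a digit from 4 to 9 that merely straddles two neighbouring bytes (for example the E4 inside 4E 49). In UTF-8, a byte from E4 to E9 is the lead byte of a three-byte sequence that encodes U+4000 to U+9FFF, the range that holds most common Chinese characters.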
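The effect of the byte alignment can be checked outside MySQL. This Python sketch assumes the column is stored as UTF-8 and imitates HEX() with `bytes.hex()`; the helper name `utf8_hex` is made up for the example.

```python
import re

ALIGNED = re.compile(r"^(..)*(E[4-9])")  # the pattern from the query
UNALIGNED = re.compile(r"E[4-9]")        # the same test without byte alignment

def utf8_hex(text: str) -> str:
    """Imitate MySQL HEX() for a UTF-8 string: two uppercase hex digits per byte."""
    return text.encode("utf-8").hex().upper()

ascii_hex = utf8_hex("NI")     # two ASCII letters: bytes 4E and 49
chinese_hex = utf8_hex("a狐")  # an ASCII byte followed by a three-byte character

print(ascii_hex, bool(UNALIGNED.search(ascii_hex)), bool(ALIGNED.search(ascii_hex)))
print(chinese_hex, bool(ALIGNED.search(chinese_hex)))
```

The pure-ASCII string "NI" contains the digits E4 across a byte boundary, so the unaligned pattern reports a false positive while the aligned one does not.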

Find non-ASCII characters in MySQL

Find rows whose `column_name` value is not entirely ASCII

SELECT `column_name`
FROM `table_name`
WHERE `column_name` <> CONVERT(`column_name` USING ASCII)

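Find non-ASCII characters in PHP

Find values made up of Chinese characters, both Traditional and Simplified, excluding punctuation (for example ,), full-width punctuation (for example ，), and special symbols such as the emoji ⭐. PHP: exact match

// approach 1: explicit code point range
$string = '繁體中文';
if (preg_match('/^[\x{4e00}-\x{9fa5}]+$/u', $string)) {
	echo "Every character is a Chinese character" . PHP_EOL;
} else {
	echo "Some characters are not Chinese characters" . PHP_EOL;
}

// approach 2: Unicode script property
if (preg_match('/^[\p{Han}]+$/u', $string)) {
	echo "Every character is a Chinese character" . PHP_EOL;
} else {
	echo "Some characters are not Chinese characters" . PHP_EOL;
}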

partial match (online demo hosted by PHP Sandbox)

// approach 1
$string = '繁體中文-简体中文-English-12345-。，！-.,!-⭐';
$pattern = '/[\p{Han}]+/u';
preg_match_all($pattern, $string, $matches, PREG_OFFSET_CAPTURE);

var_dump($matches);

// approach 2
$string = '繁體中文-简体中文-English-12345-。，！-.,!-⭐';
$pattern = '/[\x{4e00}-\x{9fa5}]+/u';
preg_match_all($pattern, $string, $matches, PREG_OFFSET_CAPTURE);

var_dump($matches);

Troubleshooting: error message

preg_match(): Compilation failed: character value in \x{} or \o{} is too large at offset 8

Fix: add the u modifier to the pattern passed to preg_match()^[17].

Find non-ASCII characters in JavaScript

References:

regex - Javascript unicode string, chinese character but no punctuation - Stack Overflow

Find English text

Find ASCII characters in MySQL

-- Find rows whose `my_column` value consists of ASCII characters

SELECT * 
FROM `my_table` 
WHERE `my_column` LIKE CONVERT(`my_column` USING ASCII)

Related: 解決英文字的搜尋：搜尋 app 而不是 apple (searching English words: matching app but not apple)

References

How can I find non-ASCII characters in MySQL? - Stack Overflow

Find letters, digits, hyphens (-), or underscores (_) in MySQL

-- Find rows whose `my_column` value contains a letter, digit, hyphen (-), or underscore (_)

SELECT * 
FROM `my_table` 
WHERE `my_column` REGEXP '[a-zA-Z0-9_-]'
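Note that an unanchored character class tests whether the value contains at least one allowed character, not whether it consists only of allowed characters. The difference is sketched below in Python; the function names are assumptions for this example, and the hyphen is placed last in the class so that it is read literally.

```python
import re

ALLOWED = r"[a-zA-Z0-9_-]"

def contains_allowed(value: str) -> bool:
    """True if at least one character is a letter, digit, hyphen, or underscore."""
    return re.search(ALLOWED, value) is not None

def only_allowed(value: str) -> bool:
    """True if every character is a letter, digit, hyphen, or underscore."""
    return re.fullmatch(ALLOWED + "+", value) is not None

print(contains_allowed("狐狸-1"), only_allowed("狐狸-1"))
print(contains_allowed("my_column-1"), only_allowed("my_column-1"))
```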

Add a comma at the start of each line

Adding characters to document lines

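You know the start and the end of the text but forgot the middle

Using Notepad++

Menu: Search -> Replace
Search Mode: select "Regular expression"
1. Find what: a(.*)le finds text that starts with a and ends with le, such as (1) apple and (2) apps lesson; spaces may appear in between. When searching for Chinese strings, converting the document to UTF-8 encoding is recommended.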

Remove blank lines

# (before) lines may be separated by one or more blank lines
尼歐
崔妮蒂

莫斐斯


史密斯
祭師

# (after) every line directly follows the previous one
尼歐
崔妮蒂
莫斐斯
史密斯
祭師

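Remove one or more blank lines (a blank line may contain spaces or tabs)

Tool: Sublime Text and EmEditor, with "Use Regular Expression" enabled. The syntax below does not work in Notepad++^[18]
- Find: ^[\s\t]*$\n --> Replace with: nothing (leave the field empty)
Tool: Notepad++ v7.8.7
- Notepad++ menu: Edit -> Line Operations -> Remove Empty Lines (Containing Blank characters)^[19]
For details, see Regular replace blank lines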
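A scripted version of the same clean-up might look like the Python sketch below. It uses `[ \t]*` rather than `\s*` so that the match stays within a single line; the variable names are illustrative.

```python
import re

# A blank line: optional spaces or tabs, then a line break (\n or \r\n).
BLANK_LINE = re.compile(r"^[ \t]*\r?\n", re.MULTILINE)

text = "尼歐\n崔妮蒂\n\n莫斐斯\n \t\n\n史密斯\n祭師\n"
compact = BLANK_LINE.sub("", text)
print(compact)
```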

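Find non-whitespace text

Find: [^\s]+ online demo
This helps when a paragraph that looks empty makes a program fail and stop: "... characters that look like spaces but cannot be removed with the TRIM function may be some other kind of whitespace character. The fix is to check whether the paragraph contains any Chinese or English letters or digits before processing it further."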

Remove punctuation, special symbols, and so on

regex - PHP strip punctuation - Stack Overflow

Put text separated by a given symbol on separate lines

Example:

# (before) text separated by the ideographic comma (、)
尼歐、莫斐斯、崔妮蒂、史密斯、祭師

# (after) one item per line
尼歐
莫斐斯
崔妮蒂
史密斯
祭師

Using Sublime Text or EmEditor

Find: ([^、]+)([、]{1})
Replace with: \1\n

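Syntax explanation

[^、] : any single character that is not the ideographic comma (、)
[^、]+ : one or more characters that are not 、
([^、]+) : captures the text matching "one or more characters that are not 、" as group 1
[、] : exactly one 、
[、]{1} : 、 exactly once (the same as [、])
([、]{1}) : captures that single 、 as group 2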
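The find-and-replace above can be reproduced in Python as a quick check. This sketch applies the same group-and-backreference idea with `re.sub`, and also shows `re.split` as a simpler alternative when a list is wanted. Variable names are illustrative.

```python
import re

text = "尼歐、莫斐斯、崔妮蒂、史密斯、祭師"

# Same idea as the editor replacement: keep group 1, turn the separator into a line break.
one_per_line = re.sub(r"([^、]+)、", r"\1\n", text)

# Alternative: split on the separator and get a list of items.
items = re.split("、", text)

print(one_per_line)
print(items)
```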

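Append one space (half-width) to the end of each line

Method 1: for Sublime Text, EmEditor

Menu: Search -> Replace
click "Use Regular Expression"
1. Find: \n
2. Replace with: _\n (replace the _ before \n with a half-width space)
click "Replace all"

Method 2: for Sublime Text, EmEditor

Menu: Search -> Replace
click "Use Regular Expression"
1. Find: $
2. Replace with: _ (replace the _ with a half-width space; $ is a zero-width anchor, so the replacement needs nothing else)
click "Replace all"

With method 1, check whether the file ends with a blank line: if the last line has no trailing newline, the rule is not applied to it.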

Replace the spaces inside each line with tabs

Replace space-separated values with tab-separated values, so that the result can be pasted straight into MS Excel or Google spreadsheet.

# \t stands for the Tab key
# before
aaa bbb    ccc

# after
aaa\tbbb\tccc

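Explanation: \S matches a non-whitespace character, and \r and \n are the line-break characters. [^\S\r\n] therefore matches a character that is neither non-whitespace nor a line break; in other words, whitespace other than a line break.

Using Sublime Text (references^[20] ^[21])

Menu: Search -> Replace
click "Use Regular Expression"
1. Find: ([^\S\n]+) or ([^\S\r\n]+) or \s\s+ or _{1,} (replace _ with a half-width space). \s matches line breaks as well as spaces, so \s+ cannot be used directly as the search pattern; \s\s+ matches only runs of two or more whitespace characters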
2. Replace with: \t
click "Replace all"
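The double-negative character class can be tried in Python as well. In this minimal sketch (variable names are illustrative), each run of spaces becomes one tab while the line break between the two rows is left alone.

```python
import re

text = "aaa bbb    ccc\nddd  eee"

# [^\S\r\n] = whitespace that is not a line break; + collapses each run into a single tab.
tabbed = re.sub(r"[^\S\r\n]+", "\t", text)
print(repr(tabbed))
```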

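Remove extra spaces before and after the text on each line

Remove extra spaces at the start of each line

Find: ^\s+ --> Replace with: nothing (leave the field empty). (Works in Sublime Text and EmEditor with "Use Regular Expression" enabled)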

# before
aaa 
 bbb
    ccc

# after
aaa 
bbb
ccc

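Remove extra spaces at the end of each line

Find: \s+$ --> Replace with: nothing (leave the field empty). (Works in Sublime Text and EmEditor with "Use Regular Expression" enabled)

Remove extra spaces at the start or the end of each line

Find: (^\s+|\s+$) --> Replace with: nothing (leave the field empty). (Works in Sublime Text and EmEditor with "Use Regular Expression" enabled)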
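A Python sketch of the combined trim is shown below. It swaps `\s` for `[ \t]`, an adjustment made for this example so that the pattern cannot swallow line breaks and merge lines; the variable names are illustrative.

```python
import re

# Leading or trailing spaces and tabs on every line (MULTILINE makes ^ and $ work per line).
TRIM = re.compile(r"^[ \t]+|[ \t]+$", re.MULTILINE)

text = "aaa \n bbb\n    ccc"
trimmed = TRIM.sub("", text)
print(repr(trimmed))
```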

Find lines that contain text rather than only digits

Every line is expected to hold only digits; find the lines that contain anything else

[^\d\n]
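When the data is processed line by line in a script, the line break is no longer part of each line, so plain `\D` is enough. The Python sketch below reports the line numbers as well; the sample data and variable names are made up.

```python
import re

data = "12345\n678a9\n42\nfox\n"

# Keep (line number, line) for every line that contains a non-digit character.
bad_lines = [
    (number, line)
    for number, line in enumerate(data.splitlines(), start=1)
    if re.search(r"\D", line)
]
print(bad_lines)
```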

Search unmatched string

Find un-commented console.log calls.

Original format: some lines contain un-commented JavaScript debug output

   console.log("un-commented debug information");

  //console.log("commented debug information");

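Search pattern: find console.log that is not directly preceded by a / character. The pattern needs one character in front of console.log, so it misses a call at the very start of a line, and it still matches // console.log when a space follows the slashes.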

   [^/](console\.log)
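One way to make the search stricter is a negative lookahead at the start of the line, sketched here in Python. The pattern and the sample source are assumptions for illustration. It skips any line whose first non-space characters are //, but it would still report a call that appears inside a trailing comment after real code.

```python
import re

# A line that contains console.log and does not begin (after optional spaces) with //.
UNCOMMENTED_LOG = re.compile(r"^(?!\s*//).*console\.log.*$", re.MULTILINE)

source = '''   console.log("un-commented debug information");
  //console.log("commented debug information");
console.log("at line start");
  // console.log("commented, with a space");
'''

hits = [m.group(0).strip() for m in UNCOMMENTED_LOG.finditer(source)]
print(hits)
```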

Text editor with support for regular expression

Text editor with support for regular expression

Regular expression batch tools

Multiple regular expression operations on the same file

RegReplace: runs several replace commands in sequence. "Simple find and replace sequencer plugin for Sublime Text" Quoted from official webpage. [Last visited: 2014-10-25]
$ EmEditor (Text Editor) - Batch Replace & EmEditor (文字編輯器) | 規則運算式

One regular expression operation on multiple files

$ EmEditor (Text Editor) | Find and Replace

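Syntax

Line break: \r\n (works in Notepad++ search modes: Extended and Regular expression)
Tab character: \t (works in Notepad++ search mode: Extended)
Digit: \d (works in Notepad++ search mode: Regular expression; does not work in Extended mode)
\S non-whitespace character: matches neither half-width nor full-width spaces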

Troubleshooting of regular expression

Tips

Use an online tool such as regex101 (build, test, and debug regex) to get an explanation of your syntax
Test on small data: (1) prepare a small data file to verify the syntax, (2) try it in the online tools
Highlight or print the matched text, e.g. with the --color^[22] option of grep, or by printing the matches from the PHP preg_match() function.
Simplify the syntax
Because of compatibility differences between tools, try an equivalent syntax, e.g. [0-9] instead of \d.

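Alternatives

Data separated by tabs is split into separate columns automatically when it is pasted into a Google Drive spreadsheet or MS Excel. So put a tab before and after each piece of data you want to extract from the raw data, copy the result, and paste it into a Google Drive spreadsheet or MS Excel; each piece lands in its own column, ready for further processing.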

Copy multiple rows & paste

Copy to dreamweaver from MS Excel 2002: ok
Copy to dreamweaver from Google Docs: not ok
Copy to MS Excel 2002 from Google Docs: ok


[1] 鳥哥的 Linux 私房菜 -- 正規表示法 (regular expression, RE) 與文件格式化處理

[2] 正規表示式 - 維基百科，自由的百科全書

[3] Ascii Table - ASCII character codes and html, octal, hex and decimal chart conversion

[4] Regex for Any English ASCII Character Including Special Characters - Stack Overflow

[5] Regex Examples: Matching Whole Lines of Text That Satisfy Certain Requirements

[6] regex - Regular expression to match text that *doesn't* contain a word? - Stack Overflow

[7] Regex not ending with - Stack Overflow

[8] regex - Regular Expressions: Is there an AND operator? - Stack Overflow

[9] regex - Regular expression for a string containing one word but not another - Stack Overflow

[10] 鳥哥的 Linux 私房菜 -- 正規表示法 (regular expression, RE) 與文件格式化處理

[11] See unix - sed: How can I replace a newline?

[12] List of Regular Expressions

[13] To replace non-English text while leaving the . character alone: [^\u0000-\u0080|.]+

[14] javascript - Regular expression to match non-english characters? - Stack Overflow

[15] How to detect rows with chinese characters in MySQL? - Stack Overflow

[16] How can I find non-ASCII characters in MySQL? - Stack Overflow

[17] preg_match(): Compilation failed: character value in \x{} or \o{} is too large at offset 27 on line number 25 - Stack Overflow

[18] Regex: delete multiple blank lines

[19] regex - Removing empty lines in Notepad++ - Stack Overflow

[20] Quickly replace multiple space characters with a tab character - TechRepublic

[21] regex - Match whitespace but not newlines (Perl) - Stack Overflow

[22] Grep -color command Examples - nixCraft
