Text file encoding: Difference between revisions
Jump to navigation
Jump to search
mNo edit summary |
m (Text replacement - "Category:Text file processing" to "Category:String manipulation") Tags: Mobile edit Mobile web edit |
||
| (3 intermediate revisions by one other user not shown) | |||
| Line 63: | Line 63: | ||
<td>UTF-16 BE with BOM</td> | <td>UTF-16 BE with BOM</td> | ||
</tr> | </tr> | ||
<tr> | |||
<td>[https://en.wikipedia.org/wiki/File_(command) file (command)] on {{Mac}} e.g. {{kbd | key=<nowiki>file <filename></nowiki>}}</td> | |||
<td>ISO-8859 text</td> | |||
<td>UTF-8 Unicode text</td> | |||
<td>UTF-8 Unicode (with BOM) text</td> | |||
<td>data</td> | |||
<td>Little-endian UTF-16 Unicode text</td> | |||
<td>data</td> | |||
<td>Big-endian UTF-16 Unicode text</td> | |||
</tr> | |||
<tr> | <tr> | ||
<td>[https://en.wikipedia.org/wiki/File_(command) file (command)] on {{Mac}} e.g. {{kbd | key=<nowiki>file -I <filename></nowiki>}}</td> | <td>[https://en.wikipedia.org/wiki/File_(command) file (command)] on {{Mac}} e.g. {{kbd | key=<nowiki>file -I <filename></nowiki>}}</td> | ||
| Line 73: | Line 85: | ||
<td>text/plain; charset=utf-16be</td> | <td>text/plain; charset=utf-16be</td> | ||
</tr> | </tr> | ||
</table> | </table> | ||
</div> | </div> | ||
| Line 79: | Line 92: | ||
* [http://osxdaily.com/2017/09/02/determine-file-encoding-mac-command-line/ How to Determine File Encoding in Mac OS by Command Line] | * [http://osxdaily.com/2017/09/02/determine-file-encoding-mac-command-line/ How to Determine File Encoding in Mac OS by Command Line] | ||
* [[Byte order mark]] | |||
* [https://en.wikipedia.org/wiki/Byte_order_mark Byte order mark - Wikipedia] | * [https://en.wikipedia.org/wiki/Byte_order_mark Byte order mark - Wikipedia] | ||
* [https://ephrain.net/sublime-text-%E8%AE%93-sublime-text-%E6%94%AF%E6%8F%B4-big5-%E7%B7%A8%E7%A2%BC%E7%9A%84%E6%96%87%E5%AD%97%E6%AA%94/ (Sublime Text) 讓 Sublime Text 支援 BIG5 編碼的文字檔 | EPH 的程式日記] | * [https://ephrain.net/sublime-text-%E8%AE%93-sublime-text-%E6%94%AF%E6%8F%B4-big5-%E7%B7%A8%E7%A2%BC%E7%9A%84%E6%96%87%E5%AD%97%E6%AA%94/ (Sublime Text) 讓 Sublime Text 支援 BIG5 編碼的文字檔 | EPH 的程式日記] | ||
* [https://docs.microsoft.com/en-us/windows-server/administration/windows-commands/chcp chcp | Microsoft Docs] | * [https://docs.microsoft.com/en-us/windows-server/administration/windows-commands/chcp chcp | Microsoft Docs] | ||
* [http://mocha-c-163-com.iteye.com/blog/583064 | * [http://mocha-c-163-com.iteye.com/blog/583064 (Charset) UTF-8, UTF-16, UTF-16LE, UTF-16BE的区别 - 伊苏 - ITeye博客] | ||
* [https://en.wikipedia.org/wiki/UTF-8 UTF-8 - Wikipedia] | * [https://en.wikipedia.org/wiki/UTF-8 UTF-8 - Wikipedia] | ||
* [https://en.wikipedia.org/wiki/UTF-16 UTF-16 - Wikipedia] | * [https://en.wikipedia.org/wiki/UTF-16 UTF-16 - Wikipedia] | ||
[[Category:Data Science]] [[Category:Software]] [[Category: | [[Category:Data Science]] [[Category:Software]] [[Category:String manipulation]] | ||
Latest revision as of 19:33, 9 March 2021
Text file encoding mapping on different softwares 「純文字編碼」在不同軟體的用字
| Software | ANSI/Big5 | UTF-8 | UTF-8 with BOM | UTF-16 LE | UTF-16 LE with BOM | UTF-16 BE | UTF-16 BE with BOM |
|---|---|---|---|---|---|---|---|
| Notepad (記事本) on Win |
ANSI | UTF-8 | Unicode | Unicode big endian | |||
| Notepad++ on Win |
ANSI 碼 | UTF-8 碼 | UTF-8 碼 (BOM 檔首) | UCS2 Little Endian 碼 (UCS2 LE BOM) | UCS2 Big Endian 碼 (UCS2 BE BOM) | ||
| TextEdit (文字編輯) on macOS |
繁體中文 (Big5) | Unicode (UTF-8) | Unicode (UTF-16) | ||||
| EmEditor on Win |
繁體中文 (Big5) (950, big5) | UTF-8 不帶簽名 | UTF-8 帶簽名 | UTF-16LE 不帶簽名 | UTF-16LE 帶簽名 | UTF-16 BE 不帶簽名 | UTF-16 BE 帶簽名 |
| Sublime text on Win |
UTF-8 | UTF-8 with BOM | UTF-16 LE | UTF-16 LE with BOM | UTF-16 BE | UTF-16 BE with BOM | |
| file (command) on macOS |
ISO-8859 text | UTF-8 Unicode text | UTF-8 Unicode (with BOM) text | data | Little-endian UTF-16 Unicode text | data | Big-endian UTF-16 Unicode text |
| file (command) on macOS |
text/plain; charset=iso-8859-1 | text/plain; charset=utf-8 | text/plain; charset=utf-8 | application/octet-stream; charset=binary | text/plain; charset=utf-16le | application/octet-stream; charset=binary | text/plain; charset=utf-16be |
References