Text file encoding: Difference between revisions

From LemonWiki共筆
(3 intermediate revisions by one other user not shown)
Line 63: Line 63:
<td>UTF-16 BE with BOM</td>
</tr>
</tr>
<tr>
<td>[https://en.wikipedia.org/wiki/File_(command) file (command)] on {{Mac}} e.g. {{kbd | key=<nowiki>file <filename></nowiki>}}</td>
<td>ISO-8859 text</td>
<td>UTF-8 Unicode text</td>
<td>UTF-8 Unicode (with BOM) text</td>
<td>data</td>
<td>Little-endian UTF-16 Unicode text</td>
<td>data</td>
<td>Big-endian UTF-16 Unicode text</td>
</tr>
<tr>
<tr>
<td>[https://en.wikipedia.org/wiki/File_(command) file (command)] on {{Mac}} e.g. {{kbd | key=<nowiki>file -I <filename></nowiki>}}</td>
Line 73: Line 85:
<td>text/plain; charset=utf-16be</td>
</tr>
</tr>
</table>
</table>
</div>
</div>
Line 79: Line 92:


* [http://osxdaily.com/2017/09/02/determine-file-encoding-mac-command-line/ How to Determine File Encoding in Mac OS by Command Line]
* [[Byte order mark]]
* [https://en.wikipedia.org/wiki/Byte_order_mark Byte order mark - Wikipedia]
* [https://ephrain.net/sublime-text-%E8%AE%93-sublime-text-%E6%94%AF%E6%8F%B4-big5-%E7%B7%A8%E7%A2%BC%E7%9A%84%E6%96%87%E5%AD%97%E6%AA%94/ (Sublime Text) 讓 Sublime Text 支援 BIG5 編碼的文字檔 | EPH 的程式日記]
* [https://docs.microsoft.com/en-us/windows-server/administration/windows-commands/chcp chcp | Microsoft Docs]
* [http://mocha-c-163-com.iteye.com/blog/583064 (Charset) UTF-8, UTF-16, UTF-16LE, UTF-16BE的区别 - 伊苏 - ITeye博客]
* [https://en.wikipedia.org/wiki/UTF-8 UTF-8 - Wikipedia]
* [https://en.wikipedia.org/wiki/UTF-16 UTF-16 - Wikipedia]


[[Category:Data Science]] [[Category:Software]] [[Category:Text file processing]]
[[Category:Data Science]] [[Category:Software]] [[Category:String manipulation]]

Latest revision as of 19:33, 9 March 2021

Text file encoding names in different software

Software ANSI/Big5 UTF-8 UTF-8 with BOM UTF-16 LE UTF-16 LE with BOM UTF-16 BE UTF-16 BE with BOM
Notepad (記事本) on Windows ANSI UTF-8 Unicode Unicode big endian
Notepad++ on Windows ANSI 碼 UTF-8 碼 UTF-8 碼 (BOM 檔首) UCS2 Little Endian 碼 (UCS2 LE BOM) UCS2 Big Endian 碼 (UCS2 BE BOM)
TextEdit (文字編輯) on macOS 繁體中文 (Big5) Unicode (UTF-8) Unicode (UTF-16)
EmEditor on Windows 繁體中文 (Big5) (950, big5) UTF-8 不帶簽名 UTF-8 帶簽名 UTF-16LE 不帶簽名 UTF-16LE 帶簽名 UTF-16 BE 不帶簽名 UTF-16 BE 帶簽名
Sublime Text on Windows & macOS UTF-8 UTF-8 with BOM UTF-16 LE UTF-16 LE with BOM UTF-16 BE UTF-16 BE with BOM
file (command) on macOS e.g. file <filename> ISO-8859 text UTF-8 Unicode text UTF-8 Unicode (with BOM) text data Little-endian UTF-16 Unicode text data Big-endian UTF-16 Unicode text
file (command) on macOS e.g. file -I <filename> text/plain; charset=iso-8859-1 text/plain; charset=utf-8 text/plain; charset=utf-8 application/octet-stream; charset=binary text/plain; charset=utf-16le application/octet-stream; charset=binary text/plain; charset=utf-16be
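
In the two file (command) rows above, UTF-16 files are identified only when they start with a byte order mark; without one they are reported as data. The following is a minimal Python sketch of a BOM check that illustrates why the mark makes a difference. It is not how file is implemented. The function name sniff_bom and the returned labels are hypothetical, chosen to match the column headers of the table, and the sketch deliberately ignores UTF-32, whose little-endian BOM begins with the same two bytes as the UTF-16 LE BOM.

```python
import codecs

# BOM byte sequences taken from the standard library's codecs module.
# Labels are hypothetical and simply mirror the table's column headers.
BOMS = [
    (codecs.BOM_UTF8, "UTF-8 with BOM"),          # EF BB BF
    (codecs.BOM_UTF16_LE, "UTF-16 LE with BOM"),  # FF FE
    (codecs.BOM_UTF16_BE, "UTF-16 BE with BOM"),  # FE FF
]


def sniff_bom(data: bytes) -> str:
    """Return a label for the BOM that `data` starts with, or "no BOM".

    Only the leading bytes are inspected; the rest of the content is not
    validated, and UTF-32 is not handled (assumption of this sketch).
    """
    for bom, label in BOMS:
        if data.startswith(bom):
            return label
    return "no BOM"


if __name__ == "__main__":
    text = "編碼"
    # "utf-8-sig" writes the UTF-8 BOM before the encoded text.
    print(sniff_bom(text.encode("utf-8-sig")))                        # UTF-8 with BOM
    # "utf-16-le" / "utf-16-be" write no BOM, so it is prepended by hand.
    print(sniff_bom(codecs.BOM_UTF16_LE + text.encode("utf-16-le")))  # UTF-16 LE with BOM
    print(sniff_bom(codecs.BOM_UTF16_BE + text.encode("utf-16-be")))  # UTF-16 BE with BOM
    # Same bytes without the mark: nothing at the start identifies them.
    print(sniff_bom(text.encode("utf-16-be")))                        # no BOM
```

To check a real file, read only its first few bytes in binary mode, for example sniff_bom(open(path, "rb").read(4)), where path is a placeholder for the file being inspected. A "no BOM" result does not say which encoding the file uses; it only means the file carries no mark.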

References