Comparison of docx to txt converter: Difference between revisions

From LemonWiki共筆
Jump to navigation Jump to search
mNo edit summary
 
(4 intermediate revisions by the same user not shown)
Line 1: Line 1:
[[File Convert]]: Comparison of docx to txt converter which could be executed under commad line mode
[[File Convert]]: Comparison of docx to txt converter which could be executed under commad line mode
{{LanguageSwitcher | content = [[Comparison of docx to txt converter | EN]], [https://errerrors.blogspot.com/2020/09/comparison-of-docx-to-txt-converter.html  漢字] }}


== Suggestion ==
== Suggestion ==
Line 8: Line 10:
* [https://pandoc.org/ Pandoc] v. 2.10.1
* [https://pandoc.org/ Pandoc] v. 2.10.1
* [https://zh-tw.libreoffice.org/ LibreOffice] v. 6.3.4.2 (x64)
* [https://zh-tw.libreoffice.org/ LibreOffice] v. 6.3.4.2 (x64)
** {{Mac}} Path of LibreOffice<ref>[https://ask.libreoffice.org/t/using-libreoffice-with-terminal/45653 Using LibreOffice with Terminal - English - Ask LibreOffice]</ref>: {{kbd | key=<nowiki>/Applications/LibreOffice.app/Contents/MacOS/soffice</nowiki>}}
** Ubuntu: Path of LibreOffice binary: {{kbd | key=/usr/bin/libreoffice}} or {{kbd | key=/usr/bin/soffice}}. You can verify with {{kbd | key=which libreoffice}} or {{kbd | key=which soffice}}.
* [https://tika.apache.org/ Apache Tika] v. 1.24.1
* [https://tika.apache.org/ Apache Tika] v. 1.24.1


Line 49: Line 53:
</table>
</table>


Ths content in Madarian (Traditional Chinese): [https://errerrors.blogspot.com/2020/09/comparison-of-docx-to-txt-converter.html 可在命令提示字元使用的 DOCX 轉 TXT 軟體比較]
Ths content written in Madarian (Traditional Chinese): [https://errerrors.blogspot.com/2020/09/comparison-of-docx-to-txt-converter.html 可在命令提示字元使用的 DOCX 轉 TXT 軟體比較]


[[Category:Software]] [[Category:Tool]]
[[Category:Software]] [[Category:Tool]]

Latest revision as of 17:01, 23 March 2026

File Convert: Comparison of docx to txt converter which could be executed under commad line mode

🌐 Switch language: EN, 漢字


Suggestion[edit]

  • It the file size is smaller e.g. 2 MB or smaller, you can use Pandoc or Apache Tika
  • It the file size is larger e.g. 4 MB or larger, you can use Apache Tika

DOCX to TXT converter which could be executed under commad line mode[edit]

  • Pandoc v. 2.10.1
  • LibreOffice v. 6.3.4.2 (x64)
    • macOS icon_os_mac.png Path of LibreOffice[1]: /Applications/LibreOffice.app/Contents/MacOS/soffice
    • Ubuntu: Path of LibreOffice binary: /usr/bin/libreoffice or /usr/bin/soffice. You can verify with which libreoffice or which soffice.
  • Apache Tika v. 1.24.1

Comparison of costs time (seconds)[edit]

Number of characters of input file File size (MB) of input file Costs time (seconds) of Pandoc Costs time (seconds) of LibreOffice Costs time (seconds) of Apache Tika
25399 1.07 4.396870136 35.96195006 9.429663897
95147 2.39 27.40566111 290.2933881 Icon_exclaim.gif 7.883637905
190295 4.65 60.29140496 Icon_exclaim.gif 837.8451748 Icon_exclaim.gif 11.16180587
380591 9.18 518.632391 Icon_exclaim.gif 3204.57426 Icon_exclaim.gif 15.61542296

Ths content written in Madarian (Traditional Chinese): 可在命令提示字元使用的 DOCX 轉 TXT 軟體比較