Search the full text in PDF files: Difference between revisions

From LemonWiki共筆
Jump to navigation Jump to search
No edit summary
No edit summary
Line 12: Line 12:




'''Comparion of Solutions'''
'''Comparison of Solutions'''


{| border="1"
{| border="1"
Line 73: Line 73:


PDF type
PDF type
* Text-PDF: 由文件檔轉成的PDF檔
* Text-PDF: The PDF file generated from text files. 由文件檔轉成的PDF檔
* Image-PDF: 由圖檔轉成的PDF檔
* Image-PDF: The PDF file generated from image files. 由圖檔轉成的PDF檔


[[Category:Search]] [[Category:Software]]
[[Category:Search]] [[Category:Software]]

Revision as of 13:53, 6 May 2007

尋找多個PDF檔案裡的資料


Suggestion

  • full text search: Adobe reader or Google Desktop search are both good choices because they highlight and locate the keywords you type.
  • metadata search: PDF Explorer or xPDFSearch (Total Commander extension) are both good choices to perform the metadata search.


567171_35fe5e9654.jpg

Snapshot of xPDFSearch (Image hosted on Zooomr Photo Sharing)


Comparison of Solutions

PDF type Software full text search metadata search comments
Text-PDF Adobe reader 7.0.7 or Adobe acrobat OK OK (but slow) (1)the search function combined the full-text and metadata search, (2) locate the keywords you type
Text-PDF Google desktop search v4 OK, but only index the first 10,000 words Title only
Text-PDF Windows Desktop Search 02.06.5000.5378 OK (with PDF IFilter[1]) OK ex: author:someone
Text-PDF PDF Explorer 1.5 OK OK (1)not highlight and locate the keywords you type; (2)extract and index the internal images
Text-PDF xPDFSearch 1.02 (Total Commander extension) OK OK not highlight and locate the keywords you type
Text-PDF Yahoo! Desktop Search 1.2 OK No (1)not highlight and locate the keywords you type; (2)not support Chinese folder name
Image-PDF Google desktop search + OmniPage Search Indexer OK, but only index the first 10,000 words Title only Quick, English Only
Fox Reader 2.0 No (single PDF file only)


PDF type

  • Text-PDF: The PDF file generated from text files. 由文件檔轉成的PDF檔
  • Image-PDF: The PDF file generated from image files. 由圖檔轉成的PDF檔