Convert webpage to markdown: Difference between revisions

From LemonWiki共筆
Jump to navigation Jump to search
No edit summary
 
Line 3: Line 3:


== Convert webpage to markdown ==
== Convert webpage to markdown ==
* [https://github.com/deathau/markdownload?tab=readme-ov-file deathau/markdownload]: "A Firefox and Google Chrome extension to clip websites and download them into a readable markdown file." {{Gd}} Automatically identify and extract the primary content area of a webpage (integration of [https://github.com/mozilla/readability mozilla/readability.js])
1. [https://github.com/deathau/markdownload?tab=readme-ov-file deathau/markdownload]: "A Firefox and Google Chrome extension to clip websites and download them into a readable markdown file." {{Gd}} Automatically identify and extract the primary content area of a webpage (integration of [https://github.com/mozilla/readability mozilla/readability.js])
* [https://jina.ai/reader/ Jina Reader API]


2. [https://jina.ai/reader/ Jina Reader API]


Bookmarklet
Bookmarklet
<pre>
<pre>
javascript:(function(){location.href='https://r.jina.ai/'+location.href})();
javascript:(function(){location.href='https://r.jina.ai/'+location.href})();
</pre>
</pre>
3. ''$'' [https://www.firecrawl.dev/ Firecrawl - The API to search, scrape, and interact with the web at scale. 🔥]
You may define the scrape result format, such as `markdown`, `rawHtml`, or `json`. For example, Firecrawl’s scrape endpoint supports body formats like those documented [https://docs.firecrawl.dev/api-reference/endpoint/scrape#body-formats here].


[[Category: Tool]]
[[Category: Tool]]

Latest revision as of 12:14, 29 May 2026


icon_scale_pencil.png This article "Convert webpage to markdown" is still being written. If there are any incomplete parts, you are welcome to directly edit them. 這篇文章「Convert webpage to markdown」內容還在撰寫中,如果有不完整的部分,歡迎你直接動手修改


Convert webpage to markdown[edit]

1. deathau/markdownload: "A Firefox and Google Chrome extension to clip websites and download them into a readable markdown file." Good.gif Automatically identify and extract the primary content area of a webpage (integration of mozilla/readability.js)

2. Jina Reader API

Bookmarklet

javascript:(function(){location.href='https://r.jina.ai/'+location.href})();

3. $ Firecrawl - The API to search, scrape, and interact with the web at scale. 🔥

You may define the scrape result format, such as `markdown`, `rawHtml`, or `json`. For example, Firecrawl’s scrape endpoint supports body formats like those documented here.