Editing
Archive of webpage
Jump to navigation
Jump to search
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
A comparison of web page archiving backup tools. Comparison criteria include (1) whether embedded link text remains clickable, (2) whether basic information like archive date and original URL are preserved, and (3) how information is organized, such as reorganizing archived pages through tags. {{LanguageSwitcher | content = [[Archive of webpage | EN]], [[網頁備份 | 漢字]] }} {{Tips}} # '''Free''' Services: Primary recommendation is [http://archive.is/ Archive.is], which can store webpage-embedded images. Even if the original webpage is lost, it preserves complete information. The secondary option is [https://archive.org/web/ Internet Archive: Wayback Machine], which allows you to view archived versions of webpages from different time periods. # '''Paid''' Services: Primary recommendation is the desktop version of [http://www.evernote.com/ Evernote], as it can successfully capture webpages that require login credentials. For bookmarking public webpages only, consider [https://app.raindrop.io/ Raindrop] or [http://pinboard.in/ Pinboard], which '''automatically''' capture webpage content and embedded images after adding the bookmark URL. == Comparing the Article Backup Results of Different Social Media Websites == Medium: * [https://web.archive.org/ Wayback Machine]: Possible backup failure, successful backups may lack images. Examples of failures[https://web.archive.org/web/*/https://medium.com/%E5%93%88%E5%98%8D-%E4%B8%96%E7%95%8C/%E9%AB%98%E6%95%88%E5%B7%A5%E7%A8%8B%E5%B8%AB-effective-engineer-%E9%87%8D%E9%BB%9E%E7%AD%86%E8%A8%98-ca66e589653c] and successes[https://web.archive.org/web/20130915000000*/https://policy.medium.com/medium-terms-of-service-9db0094a1e0f] are given. * [https://archive.today/ Webpage archive]: Possible successful backup ([https://archive.is/3qZZc link]), with examples including one with blurred images [https://archive.is/2017.01.29-002950/https://medium.com/@taylorhu/%E5%A5%87-app-%E5%85%B1%E8%B3%9E-%E5%A8%81%E7%A7%80%E5%BD%B1%E5%9F%8E-7d7971c5d421]. * [https://perma.cc/ Perma.cc]: Shows an [https://perma.cc/BM2W-X62C example] of a successful backup. * [https://historio.us/ historio]: When loading the backup, the content is visible for a few seconds, but it seems to conflict with CSS, resulting in a blank display. Using the mhtml format is necessary to read the backup. * Diigo (private access): Notes on reading backups in mhtml format. PTT: * [https://web.archive.org/ Wayback Machine]: Mentions partial success with restrictions due to adult content warnings[https://web.archive.org/web/20240209204000/https://www.ptt.cc/ask/over18?from=%2Fbbs%2FGossiping%2FM.1707508785.A.344.html][https://web.archive.org/web/20130915000000*/https://www.ptt.cc/bbs/Boy-Girl/M.1378051232.A.3E0.html]. * [https://archive.today/ Webpage archive]: Successful backup. * Perma.cc: Backup failure due to 18+ warnings. * historio: Successful backup. * Diigo (private access): Successful backup. Facebook: * [https://web.archive.org/ Wayback Machine]: Backup results in a login screen, even when set to public. * [https://archive.today/ Webpage archive]: Error message "Not Found (yet?)" * Perma.cc: "You’re Temporarily Blocked" message. * historio: Using bookmarklet had no effect, backup was not successful. * Diigo (private access): Reading backups in mhtml format. Dcard * [https://web.archive.org/ Wayback Machine]: Backup failed due to [https://zh.wikipedia.org/zh-tw/HTTP_403 HTTP 403 error]. * [https://archive.today/ Webpage archive]: Backup failed[https://archive.is/yEJMT] * Diigo (private access): Reading backups in mhtml format. YouTube * [https://web.archive.org/ Wayback Machine]: (1) Videos cannot be played, (2) Comments are not visible [https://web.archive.org/web/*/https://www.youtube.com/watch?v=W95p-Ag4RMg] * [https://archive.today/ Webpage archive]: (1) Videos cannot be played, (2) Comments are visible [https://archive.is/EY1ZH] == Desktop tools == {| border="1" | check || approach || filetype || cached media (images, flash...) || clickable text embeded with links || kept the saved time* || kept the original URL || Comments |- | || [[MozillaFirefox|Fx]] 2.0: Save as HTML (kept images) || html || saved with another directory || yes || yes || ''no'' || |- | || Fx 2.0: Save as HTML (html only) || html || ''no'' || yes || yes || ''no'' || |- | ☆ || Fx 2.0 + [http://amb.vis.ne.jp/mozilla/scrapbook/ ScrapBook] 1.2 || html || saved with another directory || yes || yes* || yes || |- | ☆ || Fx 1.5 + [http://maf.mozdev.org/ MAF] 0.6.3: Save as MAF MHT Archive || mht || embeded into a single file || yes || yes || yes || |- | || Fx 2.0 + [http://www.google.com/tools/firefox/toolbar/FT3/intl/en/ Google Toolbar for Firefox] 3: Send with Gmail || html ||''no'', they use the original URL of media || yes || yes || yes || |- | ☆ || [http://www.microsoft.com/windows/ie/ IE] 6.0.x: Save as MHT || mht || embeded into a single file ||yes || yes || yes || |- | || [[Acrobat PDFMaker]] 7.0.5 || pdf || embeded into a single file || yes || yes || yes || |- | || Print to Adobe Acrobat Printer || pdf || embeded into a single file ||''no'' || yes || yes || |- | || Print to [http://www.pdffactory.com/ pdfFactory Pro] v2.45 || pdf || embeded into a single file ||''no'' || yes || yes || |- | || IE + [http://www.adobe.com/products/acrobat/main.html Adobe Acrobat 7]: Convert web page to PDF || pdf || embeded into a single file || yes || yes || ''no'' || |- | || [http://unipage.org/index.html Unipage Unifier] 1.0 RC3(kept images or flash...) || html || embeded into a single file || yes || yes || ''no'' || |- |} == Online services == {| border="1" | check || approach || filetype || cached media (images, flash...) || clickable text embeded with links || kept the saved time* || kept the original URL || Information organization / Comments |- | || [http://backupurl.com/ BackupUrl.com] (cache image) || html |class="yes" | '''yes''' |class="yes" | yes |class="yes" | yes |class="yes" | yes || no (visited: 2009-04-09) |- | ☆ || [http://preview.evernote.com/Home.action Evernote Web] (no cache image) || html || ''no'', they use the original URL of media || yes || yes || yes || tags; It also offer the sync software ((visited: 2008-03-29)) |- | || [http://www.furl.net Furl] (no cache image) || html || ''no'', they use the original URL of media || yes || yes || yes || Topic (tags) |- | || [http://myweb2.search.yahoo.com/myweb?ei=UTF-8 Yahoo My Web 2.0 Beta] (no cache image) || html || ''no'', they use the original URL of media || yes || yes || yes || tags |- | ☆ || [http://www.google.com/notebook/ Google Notebook] (no cache image) || html || ''no'', they use the original URL of media ||yes || yes || yes || tags |- | || [http://info.jkn.com/ "Jump" Knowledge] || html |class="no" | ''no'', they use the original URL of media |class="yes" |yes |class="yes" | yes |class="yes" | yes || You can annotate the webpages, and share the link to others. |- | || [http://toread.cc/ toread] (no cache image) || html(Email) || ''no'', they use the original URL of media(written in related path will appear normally) ||yes || yes || yes || |- | || [http://www.webcitation.org/index WebCite](access error: 2007-05-07) || html || ''no'', they use the original URL of media(written in related path will appear normally) || yes || yes || yes || You can browse or backup the same page at different time. |- |} About kept the saved time: Most files already have this property. It varied easily if we saved to different storage media or FTP to another location. But the solution of Fx 1.5 + [http://amb.vis.ne.jp/mozilla/scrapbook/ ScrapBook] 0.18.4 saved this property with another function (metadata). '''Winner is Firefox!''' [[Category:Software]] [[Category:PKM]]
Summary:
Please note that all contributions to LemonWiki共筆 are considered to be released under the Creative Commons Attribution-NonCommercial-ShareAlike (see
LemonWiki:Copyrights
for details). If you do not want your writing to be edited mercilessly and redistributed at will, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource.
Do not submit copyrighted work without permission!
Cancel
Editing help
(opens in new window)
Templates used on this page:
Template:LanguageSwitcher
(
edit
)
Template:Tips
(
edit
)
Navigation menu
Personal tools
Not logged in
Talk
Contributions
Log in
Namespaces
Page
Discussion
English
Views
Read
Edit
View history
More
Search
Navigation
Main page
Current events
Recent changes
Random page
Help
Categories
Tools
What links here
Related changes
Special pages
Page information