Archive of webpage: Difference between revisions
Jump to navigation
Jump to search
mNo edit summary Tags: Mobile edit Mobile web edit |
mNo edit summary Tags: Mobile edit Mobile web edit |
||
| Line 8: | Line 8: | ||
Medium: | Medium: | ||
* Wayback Machine: Possible backup failure, successful backups may lack images. Examples of failures[https://web.archive.org/web/*/https://medium.com/%E5%93%88%E5%98%8D-%E4%B8%96%E7%95%8C/%E9%AB%98%E6%95%88%E5%B7%A5%E7%A8%8B%E5%B8%AB-effective-engineer-%E9%87%8D%E9%BB%9E%E7%AD%86%E8%A8%98-ca66e589653c] and successes[https://web.archive.org/web/20130915000000*/https://policy.medium.com/medium-terms-of-service-9db0094a1e0f] are given. | * [https://web.archive.org/ Wayback Machine]: Possible backup failure, successful backups may lack images. Examples of failures[https://web.archive.org/web/*/https://medium.com/%E5%93%88%E5%98%8D-%E4%B8%96%E7%95%8C/%E9%AB%98%E6%95%88%E5%B7%A5%E7%A8%8B%E5%B8%AB-effective-engineer-%E9%87%8D%E9%BB%9E%E7%AD%86%E8%A8%98-ca66e589653c] and successes[https://web.archive.org/web/20130915000000*/https://policy.medium.com/medium-terms-of-service-9db0094a1e0f] are given. | ||
* Webpage archive: Possible successful backup ([https://archive.is/3qZZc link]), with examples including one with blurred images [https://archive.is/2017.01.29-002950/https://medium.com/@taylorhu/%E5%A5%87-app-%E5%85%B1%E8%B3%9E-%E5%A8%81%E7%A7%80%E5%BD%B1%E5%9F%8E-7d7971c5d421]. | * [https://archive.today/ Webpage archive]: Possible successful backup ([https://archive.is/3qZZc link]), with examples including one with blurred images [https://archive.is/2017.01.29-002950/https://medium.com/@taylorhu/%E5%A5%87-app-%E5%85%B1%E8%B3%9E-%E5%A8%81%E7%A7%80%E5%BD%B1%E5%9F%8E-7d7971c5d421]. | ||
* Perma.cc: Shows an example of a successful backup. | * [https://perma.cc/ Perma.cc]: Shows an [https://perma.cc/BM2W-X62C example] of a successful backup. | ||
* historio: | * [https://historio.us/ historio]: When loading the backup, the content is visible for a few seconds, but it seems to conflict with CSS, resulting in a blank display. Using the mhtml format is necessary to read the backup. | ||
* Diigo (private access): Notes on reading backups in mhtml format. | * Diigo (private access): Notes on reading backups in mhtml format. | ||
PTT: | PTT: | ||
* Wayback Machine: Mentions partial success with restrictions due to adult content warnings[https://web.archive.org/web/20240209204000/https://www.ptt.cc/ask/over18?from=%2Fbbs%2FGossiping%2FM.1707508785.A.344.html][https://web.archive.org/web/20130915000000*/https://www.ptt.cc/bbs/Boy-Girl/M.1378051232.A.3E0.html]. | * [https://web.archive.org/ Wayback Machine]: Mentions partial success with restrictions due to adult content warnings[https://web.archive.org/web/20240209204000/https://www.ptt.cc/ask/over18?from=%2Fbbs%2FGossiping%2FM.1707508785.A.344.html][https://web.archive.org/web/20130915000000*/https://www.ptt.cc/bbs/Boy-Girl/M.1378051232.A.3E0.html]. | ||
* Webpage archive: Successful backup. | * [https://archive.today/ Webpage archive]: Successful backup. | ||
* Perma.cc: Backup failure due to 18+ warnings. | * Perma.cc: Backup failure due to 18+ warnings. | ||
* historio: Successful backup. | * historio: Successful backup. | ||
| Line 24: | Line 24: | ||
Facebook: | Facebook: | ||
* Wayback Machine: Backup results in a login screen, even when set to public. | * [https://web.archive.org/ Wayback Machine]: Backup results in a login screen, even when set to public. | ||
* Webpage archive: Error message "Not Found (yet?)" | * [https://archive.today/ Webpage archive]: Error message "Not Found (yet?)" | ||
* Perma.cc: "You’re Temporarily Blocked" message. | * Perma.cc: "You’re Temporarily Blocked" message. | ||
* historio: Using bookmarklet had no effect, backup was not successful. | * historio: Using bookmarklet had no effect, backup was not successful. | ||
| Line 31: | Line 31: | ||
Dcard | Dcard | ||
* Wayback Machine: Backup failed due to [https://zh.wikipedia.org/zh-tw/HTTP_403 HTTP 403 error]. | * [https://web.archive.org/ Wayback Machine]: Backup failed due to [https://zh.wikipedia.org/zh-tw/HTTP_403 HTTP 403 error]. | ||
* Webpage archive: Backup failed[https://archive.is/yEJMT] | * [https://archive.today/ Webpage archive]: Backup failed[https://archive.is/yEJMT] | ||
* Diigo (private access): Reading backups in mhtml format. | * Diigo (private access): Reading backups in mhtml format. | ||
YouTube | YouTube | ||
* Wayback Machine: (1) Videos cannot be played, (2) Comments are not visible [https://web.archive.org/web/*/https://www.youtube.com/watch?v=W95p-Ag4RMg] | * [https://web.archive.org/ Wayback Machine]: (1) Videos cannot be played, (2) Comments are not visible [https://web.archive.org/web/*/https://www.youtube.com/watch?v=W95p-Ag4RMg] | ||
* | * [https://archive.today/ Webpage archive]: (1) Videos cannot be played, (2) Comments are visible [https://archive.is/EY1ZH] | ||
Revision as of 23:09, 16 February 2024
Archive of webpage for backup purpose
Comparing the Article Backup Results of Different Social Media Websites
Medium:
- Wayback Machine: Possible backup failure, successful backups may lack images. Examples of failures[1] and successes[2] are given.
- Webpage archive: Possible successful backup (link), with examples including one with blurred images [3].
- Perma.cc: Shows an example of a successful backup.
- historio: When loading the backup, the content is visible for a few seconds, but it seems to conflict with CSS, resulting in a blank display. Using the mhtml format is necessary to read the backup.
- Diigo (private access): Notes on reading backups in mhtml format.
PTT:
- Wayback Machine: Mentions partial success with restrictions due to adult content warnings[4][5].
- Webpage archive: Successful backup.
- Perma.cc: Backup failure due to 18+ warnings.
- historio: Successful backup.
- Diigo (private access): Successful backup.
Facebook:
- Wayback Machine: Backup results in a login screen, even when set to public.
- Webpage archive: Error message "Not Found (yet?)"
- Perma.cc: "You’re Temporarily Blocked" message.
- historio: Using bookmarklet had no effect, backup was not successful.
- Diigo (private access): Reading backups in mhtml format.
Dcard
- Wayback Machine: Backup failed due to HTTP 403 error.
- Webpage archive: Backup failed[6]
- Diigo (private access): Reading backups in mhtml format.
YouTube
- Wayback Machine: (1) Videos cannot be played, (2) Comments are not visible [7]
- Webpage archive: (1) Videos cannot be played, (2) Comments are visible [8]
Desktop tools
| check | approach | filetype | cached media (images, flash...) | clickable text embeded with links | kept the saved time* | kept the original URL | Comments |
| Fx 2.0: Save as HTML (kept images) | html | saved with another directory | yes | yes | no | ||
| Fx 2.0: Save as HTML (html only) | html | no | yes | yes | no | ||
| ☆ | Fx 2.0 + ScrapBook 1.2 | html | saved with another directory | yes | yes* | yes | |
| ☆ | Fx 1.5 + MAF 0.6.3: Save as MAF MHT Archive | mht | embeded into a single file | yes | yes | yes | |
| Fx 2.0 + Google Toolbar for Firefox 3: Send with Gmail | html | no, they use the original URL of media | yes | yes | yes | ||
| ☆ | IE 6.0.x: Save as MHT | mht | embeded into a single file | yes | yes | yes | |
| Acrobat PDFMaker 7.0.5 | embeded into a single file | yes | yes | yes | |||
| Print to Adobe Acrobat Printer | embeded into a single file | no | yes | yes | |||
| Print to pdfFactory Pro v2.45 | embeded into a single file | no | yes | yes | |||
| IE + Adobe Acrobat 7: Convert web page to PDF | embeded into a single file | yes | yes | no | |||
| Unipage Unifier 1.0 RC3(kept images or flash...) | html | embeded into a single file | yes | yes | no |
Online services
| check | approach | filetype | cached media (images, flash...) | clickable text embeded with links | kept the saved time* | kept the original URL | Information organization / Comments |
| BackupUrl.com (cache image) | html | yes | yes | yes | yes | no (visited: 2009-04-09) | |
| ☆ | Evernote Web (no cache image) | html | no, they use the original URL of media | yes | yes | yes | tags; It also offer the sync software ((visited: 2008-03-29)) |
| Furl (no cache image) | html | no, they use the original URL of media | yes | yes | yes | Topic (tags) | |
| Yahoo My Web 2.0 Beta (no cache image) | html | no, they use the original URL of media | yes | yes | yes | tags | |
| ☆ | Google Notebook (no cache image) | html | no, they use the original URL of media | yes | yes | yes | tags |
| "Jump" Knowledge | html | no, they use the original URL of media | yes | yes | yes | You can annotate the webpages, and share the link to others. | |
| toread (no cache image) | html(Email) | no, they use the original URL of media(written in related path will appear normally) | yes | yes | yes | ||
| WebCite(access error: 2007-05-07) | html | no, they use the original URL of media(written in related path will appear normally) | yes | yes | yes | You can browse or backup the same page at different time. |
About kept the saved time: Most files already have this property. It varied easily if we saved to different storage media or FTP to another location. But the solution of Fx 1.5 + ScrapBook 0.18.4 saved this property with another function (metadata).
Winner is Firefox!