I used to use archive.today to archive news stories. However, between recently experiencing issues accessing the site and now that FBI trying to shut it down, I need to find a new way to archive.
I came across ArchiveBox, but it seems like you have to self-host and I don’t have the tech skills yet to self-host an application. I also know the Internet Archive has the WayBack Machine, but never have good luck using the site.
I am hoping that I can find a site that is similar to archive.today and is not on the FBI’s watchlist.
If you have a machine and/or the storage for it, you could deploy a docker container of linkwarden and do it yourself for a lot of things.
It says it’s for “bookmarking” but in addition to storing the outbound link, it takes backups of pages as text, html, and PDF and can do so recursively with the pages links. Nice interface, makes stuff searchable and taggable etc.
That’s really cool. I didn’t know Linkwarden could do that. I’ll further take a look at this, thank you!
I used to use archive.today to archive news stories.
Why are specifically are you using archive.today? To post links that bypass paywalls, or for something else? Because if it’s for something else then there may be other solutions, like using archive.org or saving the page locally.
I mainly use it to read articles that have paywalls.
AFAIK, archive.today is the best around for that, outside of installing the Bypass Paywalls Clean browser extension for Firefox or Chrome.
I’ll take a look at the browser extensions. Thank you!
Archive.ph is good, especially because I’ve never met a paywall it couldn’t bypass. 😏
archive.ph is just another alternate hostname for archive.today.
Oh. In which case it still works fine for me, what can I say. 😅
As others have said, SingleFile extenstion works well. I’ve also found zotero with the web extension quite good. Its useful for added organisation/catagoriesation especially since I’m already using it for academic work.
There is also zimit for use with kiwix, both a comandline version(see github) and website if you want something simpler.
Although I’ve found the website has long queues quite often and it may not get a clean backup if the website uses cloudflare or the like. But its useful if I need an offline copy of a website with many pages.
I recommend having a look at the archive team wiki page on software, here, see if anything fits your needs.
They all look like they can work. Zimit especially looks interesting. I’ll take a look at all of them. Thank you!
singlefile or webrecorder in chromium based browsers maybe?
self hosting is actually pretty easy actually :) we’re here to help too.
for large scale crawling i usually use archiveteam’s grab-site.
You can save as HTML but animation and videos won’t work. Try singlefile extension
Just learned about Readeck the other day. Self-hosted for now, but it sounds like they’re planning to launch a centrally-hosted instance at some point, maybe keep an eye on that.
I’ll definetely keep an eye on Readeck. Thank you!
Tbh internet archive and wayback machine is the best option I can think of. It’s easy to use and I only had problems with it when I was looking for old archives from late 90s and early 2000s, it sometimes didn’t load. That’s the only problem I had w wayback m.
What’s your use case?
Mainly to bypass paywalls.





