76
votes
Wikipedia blacklists archive.today, starts removing 695,000 archive links
Link information
This data is scraped automatically and may be incorrect.
- Title
- Wikipedia bans Archive.today after site executed DDoS and altered web captures
- Published
- Feb 20 2026
- Word count
- 419 words
From the article:
[...]
[...]
Woof. I understand the decision but that’s potentially a LOT of valuable archived data that will end up being discarded. Hopefully most of it can be safely re-archived elsewhere but I doubt that will be possible for a significant portion.
Until now I hadn’t heard the accusations of archive.today tampering with saved page content. I want to give it the benefit of the doubt because it’s always seemed trustworthy to me. But without knowing the operators and their agendas who can say for sure (and this DDoS situation does not inspire much confidence). If a site is no longer online there’s no way to guarantee the authenticity of the version archive.today is serving. Honestly the same could be said for any of the alternatives though… we believe that the Wayback Machine is preserving content in good faith, but they could be selectively modifying specific URLs and no one would be the wiser.
I don’t think page archival is a solved problem yet. The strengths of archive.today revealed the shortcomings of Wayback. There are a number of improvements I’d like to see, personally. After reading this I think some kind of cryptographic signature scheme for pages might be helpful too. It could at least prove that archives haven’t been changed over time, but you’d still have to trust the accuracy of the original scrape. Though In the LLM era that might not be enough either. 🤔
Hopefully cleverer minds than mine are already thinking about this.
I don't think the operator should be given the benefit of the doubt here, they have admitted as much in a blog post
Well said.
I am concerned that we are going to see a massive cascade of websites blocking them now. I have never had any access issues with archive working on any website, whereas the alternatives can be spotty.
Tildes users regularly provide "archives" using this site. Should we consider switching elsewhere?
Here's the previous discussion from the first thread about this, for continuity's sake. Wayback Machine and ghostarchive seem to be the main two mentioned there.
I've been using ghostarchive since the first post about this but there was some question about whether it was also affiliated with the archive.today person. Does anyone know more? I've liked ghostarchive so far (unfortunately not as much as archive.today but ah well)
I don't know a good solution. I'm glad that I regularly extract quotes, so at least there's some context. Maybe I'll start archiving pages privately?
Is there any connection between this and Wikipedia being unreachable a few hours ago?