• Showing only topics with the tag "archiving".
    1. Why won't the Wayback Machine archive my page?

      I have updated the Portuguese section of my blog with many posts that I scavenged from past blogs I've had since 2005. So that everyone can go through them chronologically, I gave them their original dates. At the end of each of these posts there is a link to the original publication, many of which came from the Internet Archive itself.

      One of my oldest blogs was removed from Blogspot decades ago, either by a hacker or because of something obscure on Blogspot's side. So I had to use the archived version to reconstruct my history. I was very surprised to find it there, because it was seemingly archived a decade after Blogspot removed it. I have no idea what happened, but I was so glad to find it!

      I have been trying to archive that page for days. The posts within that page are archived, but not the page itself. The current August 2025 snapshot is not shown, and if I click on the link that they give me after the archiving process is done, I am directed to a snapshot I took back in May. I have no idea why this is happening, and the "help" section of the Wayback Machine doesn't seem to have any way for me to talk to someone.

      Can someone help?

      This is the page: https://daviramos.com/br/. It is also available at https://daviramos.bearblog.dev/br/, and yes, I tried archiving that one too.

      Thanks!

      9 votes
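
One way to see which snapshot the Wayback Machine itself reports as closest to a given date is to ask its availability endpoint directly, instead of relying on the link shown after a save. Below is a minimal sketch, assuming the endpoint at `https://archive.org/wayback/available` returns JSON with an `archived_snapshots.closest` object; the helper names `build_availability_url` and `closest_snapshot` are made up for this illustration.

```python
import json
import urllib.request
from urllib.parse import urlencode

# Assumed endpoint of the Wayback Machine availability API.
AVAILABILITY_ENDPOINT = "https://archive.org/wayback/available"


def build_availability_url(page_url, timestamp=None):
    """Build the query URL; timestamp is an optional YYYYMMDD-style prefix."""
    params = {"url": page_url}
    if timestamp:
        params["timestamp"] = timestamp
    return AVAILABILITY_ENDPOINT + "?" + urlencode(params)


def closest_snapshot(response_text):
    """Return (timestamp, snapshot_url) from a JSON response, or None."""
    data = json.loads(response_text)
    closest = data.get("archived_snapshots", {}).get("closest")
    if not closest or not closest.get("available"):
        return None
    return closest["timestamp"], closest["url"]


if __name__ == "__main__":
    # Ask for the snapshot closest to August 2025 for the page in question.
    query = build_availability_url("https://daviramos.com/br/", "202508")
    with urllib.request.urlopen(query) as response:
        print(closest_snapshot(response.read().decode("utf-8")))
```

If the timestamp that comes back is still from May, the newer capture has probably not been indexed (or was not stored), which at least narrows down where the problem is.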
    2. Does anyone have experience with tools for locally archiving the web, like ArchiveBox for example?

      I found myself on the ArchiveBox website earlier today. After reading some of it, that's the kind of program I could use. The ephemeral nature of the web is bothersome; so much content is lost for one reason or another. ArchiveBox seems to be one of the most popular tools, and it can automatically mirror my locally downloaded website to archive.org, which is great. It seems complex though, maybe more complex than I usually tolerate these days. That is why I am asking if anyone has personal experience with ArchiveBox or other similar programs. Do you find them useful and reliable? Have you ever found in your local storage a webpage that you really liked, which was gone from the web? How's your setup?

      Thanks ;)

      19 votes
    3. [SOLVED] Archiving a deceased loved one's Twitter timeline, including media

      Recently a loved one of a friend has died and they would like to archive their entire timeline (no retweets), including media they posted.

      I've looked around a little bit, and the Twitter API only allows the most recent 3,200 tweets to be exported. As this includes RTs, this only goes back to about 2018, while the account was made in 2011, so it's missing about 90% of their tweets. Also, getting all the media isn't really possible.

      Do any of you know a way to accomplish this? Or can anyone direct me to scripts that crawl the page and save every non-RT tweet plus any attached media? I'm not very tech-oriented, but I can at least run Python scripts.

      I should mention that I've so far checked out Allmytweets.net (returns RTs) and the Twitter archival project (or whatever it's called), which is a group of people that help in archiving accounts, but they haven't responded yet.

      13 votes
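
Once tweets have been collected by whatever means, dropping the retweets and gathering the media links is a small filtering step. Below is a minimal sketch, assuming each tweet is a dict shaped like a Twitter API v1.1 tweet object (retweets carry a `retweeted_status` key, and attached media sit under `extended_entities.media` with a `media_url_https` field). The function names are hypothetical, and the "RT @" prefix check is only a fallback heuristic for data that lacks `retweeted_status`.

```python
def is_retweet(tweet):
    """Treat a tweet as a retweet if it embeds the original or starts with 'RT @'."""
    if "retweeted_status" in tweet:
        return True
    text = tweet.get("full_text") or tweet.get("text") or ""
    return text.startswith("RT @")


def original_tweets_with_media(tweets):
    """Keep only non-retweets and list the media URLs attached to each one."""
    results = []
    for tweet in tweets:
        if is_retweet(tweet):
            continue
        media = tweet.get("extended_entities", {}).get("media", [])
        urls = [m["media_url_https"] for m in media if "media_url_https" in m]
        results.append({
            "id": tweet.get("id_str"),
            "text": tweet.get("full_text") or tweet.get("text") or "",
            "media_urls": urls,
        })
    return results
```

The returned `media_urls` can then be downloaded one by one and stored next to the tweet text, so the archive stays readable without the original site.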