Fact Check: How to Recover Deleted Web Pages

Have you ever clicked on a link only to land on an empty webpage with the message "Error 404" or "404 Not Found?" If so, you're not alone. There are several reasons this can happen — the simplest being a misspelled URL. But increasingly, the cause is that the page has been deleted or moved, sometimes intentionally. That's why DW Fact Check has put together a guide to help you find deleted or altered content.

As online fact-checking practices have grown in importance, tools for archiving digital content have become essential. They allow users to take 'snapshots' of websites or social media posts, capturing how they looked at a specific moment in time, and ensuring they remain accessible — even if the original content disappears.

Internet content changes constantly — pages vanish, links break, and information gets edited or removed. According to a study by the Pew Research Center, 38% of the webpages from 2013 are no longer available. Archiving is more than a technical solution — it's a tool for accountability, transparency, and preserving history.

Real-world cases illustrate why archiving is so crucial. In January 2025, the White House shut down its Spanish-language page. The Library of US Congress removed certain parts of the US Constitution from its online archive. In September 2022, Iran restricted internet access in parts of Tehran and Kurdistan, blocking Instagram and WhatsApp during protests following the death of a Kurdish woman in police custody. And in China, a once-extensive internet archive run by Peking University, which allowed searches of more than 2.5 billion historical Chinese web pages, is no longer accessible.

Why Archiving Matters

Web archives have been important in providing evidence in court cases and public discussions. Images like screenshots can easily be manipulated. "Web archives, on the other hand, record the entire contents of a web page, including its source HTML and embedded images, stylesheets, or JavaScript source," Michele Weigle, professor of computer science at Old Dominion University wrote in her article On the Importance of Web Archiving.

Four Go-To Tools for Web Archiving

DW Fact check has put together a list of four go-to tools for web archiving. One of the most widely used free archiving tools is the Wayback Machine launched in 2001 by the non-profit Internet Archive. Its mission is to "preserve those [digital] artifacts and create an Internet library for researchers, historians, and scholars."

  • Pros: Comprehensive, free, and widely used. Cons: Occasionally inaccessible due to hacking, keyword searches can be tricky.
  • The Internet Archive's Wayback Machine is the oldest and biggest public web archive, but it's not the only one. Many countries and national libraries have their own web archives, too.
  • Launched in 2012, Archive.today is a user-driven tool that saves web pages without active elements or scripts. It is great for archiving dynamic content like social media posts. It saves functional links. And it is not as large as the Wayback Machine, but more personal and responsive.
  • Pros: Fast, easy, and free. Cons: Relies on user initiative, smaller archive.

The other two tools include Perma.cc and Ghostarchive. Perma.cc combats link rot as this is "a big problem, especially for academic scholarship and judicial opinions, which depend heavily on citations to stable sources that readers can access."

  • Pros: Reliable for scholarly use. Cons: Limited free access.
  • Launched in 2021, Ghostarchive specializes in archiving videos and dynamic content, which is often used on social media — areas where other tools often struggle. It has a high success rate with video content, but is not always reliable.
  • Pros: High success rate with video content. Cons: Not 100% reliable.

Additional Tools and Considerations

A Chrome extension called Web Archives also bundles several archiving tools, reflecting the growing need to preserve online content as it continues to expand.

"We can share at least the digital archive of our reality," says Henk van Ess, expert in online research and open-source intelligence. "If politicians say something many, many years ago and they change their opinion, it's super important to find out what they actually said. So, it's basically the best way of sharing reality together again."

Not all pages are archived equally, and to archive all online content would be impossible. Popular sites are regularly scraped, while smaller ones are archived more sporadically.

Tools like Archive.today depend on users to initiate the archiving process. "Every hour, there's so much material produced on the web that it's technically impossible to copy and paste it," says Van Ess.

The Importance of Web Archiving

"It’s not about trying to archive the stuff that’s true, but archiving the conversation," Mark Graham, director of the Wayback Machine, put it in an interview with the Financial Times. Not all pages are archived equally, and to archive all online content would be impossible.

The most important takeaway is that the saying, "The internet never forgets!" is often true and we can use it to our advantage, and find older versions of websites, or even deleted websites, in the archives of the net.