Home / Technology / Web's Lost History: Digital Drift and Data Decay
Web's Lost History: Digital Drift and Data Decay
7 Apr
Summary
- Over a third of web pages from 2013 are now inaccessible.
- The Library of Congress shifted from archiving all tweets to selective saving.
- Online memory technology has a shorter lifespan than pre-digital versions.

The internet's vastness belies its fragility, as a substantial amount of online content is proving impermanent. A study revealed that over a third of web pages available in 2013 are now inaccessible due to 'link rot.' Attempts to preserve everything, such as the Library of Congress's initial plan to archive every tweet, have faced challenges with volume and relevance, leading to more selective approaches.
Historically, digital content has a significantly shorter lifespan than physical archives. For instance, newspaper editions from 1665 are preserved, while some modern digital publications have disappeared. Even dedicated archiving services like the Wayback Machine face challenges, including copyright removals and blacklisting by some sites. The content's training of AI models offers a form of immortality but doesn't preserve original forms. This inherent transience suggests individuals should prioritize saving online content they deem important.