Chapter 1: Introduction to Archaeobytology
Opening Vignette: The Day GeoCities Died
On October 26, 2009, Yahoo announced that GeoCities—one of the earliest and largest web hosting services—would shut down in less than three weeks. What followed was a frantic rescue operation. A loose network of digital preservationists calling themselves "Archive Team" mobilized immediately, recruiting volunteers to scrape as many sites as possible before the November deadline. Working around the clock across time zones, they managed to save approximately 650 gigabytes of data—a fraction of the estimated terabytes that had accumulated since GeoCities launched in 1994.
When the servers went dark on October 26, 2009, roughly 30 million websites vanished from the internet. Personal homepages that teenagers had built in the late 1990s. Fan sites dedicated to Sailor Moon and Dragon Ball Z. Memorial pages for loved ones. Experimental art projects. Tutorials on HTML and web design. An entire era of digital culture—messy, earnest, strange, and deeply human—was murdered by a single corporate decision.
This wasn't obsolescence. These sites didn't decay naturally or become technically incompatible. They were deliberately killed by a platform that no longer found them profitable. And because GeoCities users didn't own their digital ground—they were renting space at geocities.com/neighborhood/username—they had no recourse. When Yahoo turned off the servers, those 30 million voices simply... disappeared.
Or almost disappeared. Thanks to Archive Team's heroic effort, fragments survived. But even those rescued artifacts present new problems: they exist in a massive, unsearchable data dump. No context. No curation. No way to understand why a particular site mattered or what community it represented. The bits are preserved, but the meaning is lost.
This is the problem Archaeobytology was created to solve.
What Is Archaeobytology?
Archaeobytology is the study and practice of excavating, preserving, interpreting, and building with digital artifacts—particularly those that have been "murdered" by platform shutdowns or rendered obsolete by technological change.
The term combines three roots:
- Archaeo- (Greek: ancient, old) — invoking archaeology's commitment to recovering and interpreting the past
- -byte- (computing unit) — grounding the discipline in digital materiality
- -ology (study of) — claiming status as a rigorous intellectual discipline
But Archaeobytology is more than just "digital archaeology." It encompasses:
- Theoretical frameworks for understanding digital mortality, platform power, and technological sovereignty
- Practical methods for excavation, preservation, and forensic analysis
- Institutional design for building organizations that can sustain preservation work for decades
- Political advocacy for laws and policies that protect digital culture from corporate erasure
- Creative practice of forging new tools, monuments, and systems that embody principles of digital sovereignty
Unlike adjacent fields—digital history, media archaeology, library science, or computer science—Archaeobytology insists on a dual commitment: we are both archivists and builders, both scholars and smiths. We preserve what platforms murder, and we forge alternatives that resist future murders.
Why We Need a New Discipline
The Inadequacy of Existing Fields
When GeoCities died, where were the experts? Digital historians could analyze what was lost, but they lacked the technical skills to mount a rescue. Computer scientists could write scraping scripts, but they lacked frameworks for ethical triage or cultural curation. Librarians understood preservation, but most weren't equipped to reverse-engineer dying platforms or navigate copyright gray areas.
The problem wasn't lack of expertise—it was fragmentation. The skills needed to address platform death are scattered across multiple disciplines, none of which fully claim this territory:
- Computer Science treats digital objects as technical problems, not cultural artifacts
- History studies the past but rarely intervenes to preserve the present
- Library Science excels at cataloging but often lacks technical depth for complex digital formats
- Media Archaeology theorizes dead media but doesn't always prioritize active preservation
- Cultural Studies critiques platforms but rarely builds alternatives
Archaeobytology argues that digital preservation requires a unified discipline that combines:
- Technical skill (excavation, forensics, emulation)
- Humanistic interpretation (curation, contextualization, ethics)
- Institutional expertise (sustainable organizations, governance models)
- Political engagement (policy advocacy, movement building)
No existing field does all of this. That's why we need a new one.
The Scale of the Crisis
Platform death isn't a niche problem. It's accelerating:
Major Platform Shutdowns (2000-2024):
- GeoCities (2009): 30 million sites
- Google Reader (2013): Millions of curated RSS feeds
- Vine (2017): 200 million short videos
- Google+ (2019): User profiles, communities, content
- Mixer (2020): Game streaming platform
- Tumblr NSFW purge (2018): Millions of posts deleted
- Twitter's chaos (2022-present): Uncertain future, mass exodus
- Countless smaller platforms: Ello, Peach, Path, Plurk, FriendFeed...
Each shutdown represents not just technical infrastructure dying, but communities, memories, identities, and cultural artifacts being erased. And unlike physical artifacts—which decay slowly, giving civilizations time to respond—digital artifacts can vanish overnight.
Moreover, we're not just losing content. We're losing:
- Social graphs: Who was connected to whom
- Affordances: How platforms shaped communication (Twitter's 140 characters, Vine's 6 seconds)
- Aesthetics: Platform-specific design cultures (GeoCities'
- Communities: Specialized subcultures that formed around platform features
This isn't just cultural loss—it's cultural murder. And it's happening faster than any single discipline can address.
The Three Pillars: A Framework for Digital Sovereignty
At the heart of Archaeobytology lies a normative commitment: we believe digital culture should be sovereign—independent from corporate control, resistant to platform shutdown, and owned by the people who create it.
We call this framework The Three Pillars, drawing on both ancient philosophy (the Greek concept of authentikos, self-originating authority) and practical infrastructure design. A digitally sovereign presence requires three interdependent foundations:
Pillar 1: Declaration (I Am)
The Principle: You should be able to declare your identity and existence without permission from a platform or intermediary.
When you create a Facebook profile, Facebook owns your identity. If they ban you, "you" cease to exist—at least in that digital space. Your name, photos, relationships, and history are controlled by an entity that can revoke them at will.
Sovereign declaration means:
- Your identity is self-hosted (username@yourdomain.com, not username@gmail.com)
- Your presence is persistent (your website outlives any platform)
- Your voice is uncensorable (no corporate ToS can silence you on your own ground)
This doesn't mean freedom from all consequences—legal systems, community norms, and social accountability still apply. But it means platforms cannot unilaterally erase you.
Historical precedent: In the early web (1990s-2000s), personal homepages embodied this principle. You bought a domain, hosted your site, and declared yourself to the internet. GeoCities degraded this model by making addresses hierarchical (geocities.com/neighborhood/you) rather than sovereign (you.com). Social media completed the enclosure by eliminating personal domains entirely.
Pillar 2: Connection (Instant Message)
The Principle: You should be able to communicate directly with others without a platform mediating, monitoring, or monetizing your relationships.
Platforms don't just host our content—they control our connections. Facebook decides who sees your posts (algorithmic curation). Twitter can prevent you from messaging someone (shadowbanning). Instagram owns the graph of your followers (you can't export it).
Sovereign connection means:
- Communication is peer-to-peer or federated (not routed through corporate servers)
- Relationships are exportable (if you leave a platform, your network comes with you)
- Discovery is intentional (you choose who to connect with, not an algorithm)
This is why email—for all its flaws—remains more sovereign than social media. If Gmail shuts down, you can take your address to another provider. If your contacts have their own domains (name@theirdomain.com), you can reach them directly.
The challenge: Network effects make this hard. If everyone is on Twitter, leaving Twitter means losing access to your community. Sovereignty requires interoperability—the ability to communicate across platforms, or to bring your network with you when you migrate.
Pillar 3: Ground (Digital Real Estate)
The Principle: You should own the infrastructure your digital life is built on—not rent it from a landlord who can evict you.
GeoCities users thought they had websites. They didn't. They had leases on someone else's servers. When Yahoo decided those leases weren't profitable, they terminated them. No appeals, no alternatives, no recourse.
Sovereign ground means:
- You own your domain name (example.com, not facebook.com/example)
- You control your hosting (self-hosted or a provider you can migrate away from)
- Your data is exportable (you can take it with you, in usable formats)
This doesn't require technical expertise. Thousands of people own domains and use managed hosting services like WordPress.com or Ghost(Pro). The key is portability: if the service shuts down or changes terms, you can move.
The analogy: Owning ground is like owning land versus renting an apartment. A landlord can raise rent, change rules, or evict you. But if you own land, you have sovereignty—subject to laws, but not to arbitrary corporate power.
The Dual Soul: Archive and Anvil
Archaeobytology is not just a preservationist discipline. We insist on a dual practice:
The Archive: Preservation and Memory
The Archive represents our commitment to:
- Excavate murdered platforms before they vanish completely
- Preserve artifacts with technical and cultural fidelity
- Curate collections that make sense of vast data dumps
- Interpret artifacts so future generations understand their significance
Archival work requires:
- Technical skills (web scraping, forensic recovery, emulation)
- Ethical frameworks (what should be preserved? what should be forgotten?)
- Institutional knowledge (how do you build organizations that last 50 years?)
The Archive is retrospective: it looks backward to save what's endangered.
The Anvil: Creation and Resistance
The Anvil represents our commitment to:
- Forge tools that empower digital sovereignty (domain registration services, self-hosting platforms, open protocols)
- Build monuments that embody our values (websites, platforms, frameworks designed for permanence)
- Design institutions that resist platform capture (non-profit archives, cooperative hosting services, federated networks)
The work of the Anvil requires:
- Creative practice (making things that don't yet exist)
- Systems thinking (how do you design for resilience?)
- Political imagination (what does a post-platform future look like?)
The Anvil is prospective: it looks forward to build alternatives.
Why Both Are Necessary
You cannot be only an archivist. If you preserve everything but build nothing, you're a curator in a warehouse—keeping records of a world dominated by platforms, never challenging that dominance.
You cannot be only a builder. If you forge alternatives but never preserve the past, you lose the lessons of history. Each new generation reinvents the wheel, repeating old mistakes.
The Archaeobytologist embodies both: we save the murdered web, and we build systems that can't be murdered.
Triage: The Central Methodology
The most painful truth of Archaeobytology: you cannot save everything.
When a platform announces shutdown, you have limited time, limited storage, limited volunteers. You must make choices. This is triage—borrowed from emergency medicine, where doctors must decide which patients to treat first when resources are scarce.
The Custodial Filter
We use the Custodial Filter as our ethical framework for triage decisions. Before preserving an artifact, we ask five questions:
-
Cultural Significance: Does this artifact represent a community, movement, or cultural moment that would otherwise be lost?
-
Technical Fragility: How close to disappearance is this? (A site archived by Internet Archive is less urgent than one that isn't.)
-
Rescue Difficulty: How hard is this to preserve? (Simple HTML is easier than complex Flash applications.)
-
Existing Redundancy: Is someone else already preserving this? (Don't duplicate effort when time is scarce.)
-
Consent and Ethics: Should we preserve this? Does it violate someone's privacy, contain traumatic content, or cause harm by existing?
The fifth question is critical. Not everything that can be preserved should be. Revenge porn, doxxing, harassment campaigns—these are digital artifacts too, but preserving them can perpetuate harm. The Custodial Filter requires us to think beyond technical feasibility to ethical responsibility.
Triage in Practice: The Archive Team Model
When Vine announced its shutdown in 2016, Archive Team had roughly six weeks to save 200 million videos. Impossible to save them all. They triaged:
- Highest priority: Videos with significant cultural impact (viral memes, influential creators, historically important moments)
- Medium priority: Representative samples of different communities, genres, and time periods
- Lower priority: Duplicates, spam, commercial advertisements
Even then, they couldn't manually curate 200 million items. So they used algorithmic triage: view counts, shares, and community-submitted nominations. Imperfect, but pragmatic.
The result: they saved millions of videos, but not all. Some Vines are lost forever. Triage accepts this tragedy as unavoidable, while working to minimize the loss.
A Brief History of Digital Mortality
Digital culture has always been ephemeral, but the causes of mortality have evolved:
Era 1: Technological Obsolescence (1960s-1990s)
Early digital artifacts died because the hardware or software became incompatible:
- Floppy disks degraded physically
- File formats became unreadable (WordStar, AppleWorks)
- Storage media evolved (punch cards → magnetic tape → hard drives)
This was passive death—artifacts decayed like ancient papyrus. The solution was technical: emulation, format migration, hardware preservation.
Era 2: Link Rot and Neglect (1990s-2000s)
As the web grew, artifacts died because:
- Website owners stopped paying for hosting
- Domains expired and were re-registered by squatters
- Links broke as sites moved or vanished
This was death by neglect—the equivalent of abandoning a physical archive to water damage and mold. The solution was institutional: projects like the Internet Archive's Wayback Machine, which proactively crawled and preserved sites.
Era 3: Platform Murder (2000s-present)
In the social media era, artifacts die because platforms choose to kill them:
- Corporate shutdowns (GeoCities, Vine, Google Reader)
- Terms of Service purges (Tumblr NSFW ban, YouTube's algorithmic demonetization)
- Acquisition and closure (platforms bought and killed by competitors)
This is active murder—deliberate erasure. The solution isn't just technical or institutional—it's political. We need laws, rights, and alternatives.
Archaeobytology emerged in response to this third era. We're not just fighting entropy or neglect—we're fighting corporate power.
What Makes Archaeobytology Different?
Not Digital History
Digital historians study the past. Archaeobytologists intervene in the present to create a future past. When Archive Team scraped GeoCities, they weren't analyzing history—they were making it possible for future historians to have something to analyze.
Not Media Archaeology
Media archaeologists theorize dead media. Archaeobytologists rescue dying media before they become dead. We're applied, not purely theoretical. We get our hands dirty with code, servers, and scrapers.
Not Library Science
Librarians excel at cataloging, access, and preservation—but within established frameworks. Archaeobytology operates in legal and technical gray areas: scraping platforms that didn't consent, preserving copyrighted material under dubious fair use claims, reverse-engineering proprietary formats.
We respect librarians deeply. But we do things they often can't or won't do.
Not Computer Science
Computer scientists can write scrapers and build emulators. But they often lack frameworks for curation, ethics, and cultural interpretation. A computer scientist might preserve every byte. An Archaeobytologist asks: Should we? What does this mean? How do we make it legible?
Not Just Activism
Archaeobytology isn't pure advocacy. We build theoretical frameworks, develop rigorous methods, and create institutions. We're scholars and activists—but the scholarship matters.
The Crisis of Legitimacy
Archaeobytology faces a credibility problem: we don't exist yet.
There are no Archaeobytology departments at universities. No tenure-track jobs with "Archaeobytologist" in the title. No dedicated funding streams from NSF or NEH. When we tell people we're Archaeobytologists, they ask, "What's that?"
This book is part of solving that problem. By codifying our theories, methods, and practices, we make the discipline real. By teaching courses, publishing research, and building institutions, we establish legitimacy.
Disciplines don't emerge naturally—they're constructed through collective action:
- Journals and conferences create scholarly community
- Textbooks standardize knowledge
- Degree programs train new generations
- Professional organizations provide structure
- Public advocacy wins recognition
This book is a founding document. You're reading it early in the discipline's life. In 20 years, Archaeobytology might be as established as Data Science or Digital Humanities. Or it might remain a niche practice, known only to specialists.
That outcome depends on us.
Who Is This Book For?
Undergraduate Students
If you're considering a career in digital preservation, museum curation, or tech ethics, this book provides foundational knowledge. Each chapter includes exercises and case studies to build practical skills.
Graduate Students and Researchers
If you're writing a dissertation on platform death, digital memory, or technological sovereignty, this book offers theoretical frameworks and methodologies you can adapt.
Practitioners
If you work in libraries, archives, museums, or tech companies, this book gives you tools to advocate for preservation work and design sustainable institutions.
Activists and Advocates
If you're fighting for digital rights, platform accountability, or data sovereignty, this book provides evidence and arguments for policy change.
The Curious Public
If you've ever wondered what happened to your MySpace profile, your LiveJournal, or that website you made in 2003, this book explains why they disappeared—and what we can do about it.
How to Use This Book
Structure
The book is organized into five parts:
Part I: Foundations (Chapters 1-6) introduces core concepts: the Archaeobyte taxonomy, the Archive/Anvil framework, the Three Pillars, and triage methodology.
Part II: Excavation & Forensics (Chapters 7-10) teaches practical methods for recovering and analyzing digital artifacts.
Part III: Institution Building (Chapters 11-14) shows how to design organizations that sustain preservation work for decades.
Part IV: Systems & Movements (Chapters 15-16) addresses political economy: who controls digital infrastructure, and how do we build alternatives?
Part V: Public Scholarship & The Future (Chapters 17-18) explores how Archaeobytologists can translate research into public discourse, policy, and cultural change.
Pedagogy
Each chapter includes:
- Case Studies: Real-world examples of platform death, preservation projects, and institution-building
- Discussion Questions: Prompts for classroom or reading group conversations
- Exercises: Hands-on activities to build skills (accessible to readers with varying technical backgrounds)
- Further Reading: Curated bibliography for deeper exploration
Teaching with This Book
For a 15-week undergraduate survey course, cover one chapter per week. Focus on Part I (Foundations) and Part II (Methods), with selected chapters from Part III.
For a graduate seminar, assume students have read the entire book. Use class time for deep discussion of case studies, triage dilemmas, and institutional design challenges. Assign a capstone project: design a preservation organization, memory institution, or movement campaign.
For professional development, organize a reading group among librarians, archivists, or tech workers. Each week, one person presents a chapter and leads discussion. Focus on how frameworks apply to your workplace.
A Provocation: Why Bother?
Let's be honest: most people don't care that GeoCities died. They don't think about digital preservation. They assume "the internet remembers everything" (it doesn't) or that "tech companies will handle it" (they won't).
So why bother? Why build a discipline around saving things most people forgot existed?
Three answers:
1. Memory Is Power
Who controls the past controls the present. Platforms curate our memories—deciding which photos Facebook shows you in "On This Day," which tweets trend, which YouTube videos get recommended. When platforms die, they take our memories with them.
Preserving murdered platforms is an act of resistance against corporate memory control. It insists that our digital lives belong to us, not to companies that can erase them at will.
2. Culture Dies in Darkness
Every generation deserves access to the cultural artifacts of previous generations. Historians study ancient Rome through pottery fragments. Future historians will study early internet culture through GeoCities sites—if we save them.
If we don't preserve digital culture, it vanishes. No ruins, no fragments. Just absence. Future generations won't even know what they're missing.
3. Building Alternatives Requires Understanding Failures
You can't design a sovereign internet if you don't understand how platforms murdered the old one. Every shutdown teaches lessons:
- Why didn't GeoCities users own their domains?
- Why couldn't Vine videos be exported?
- Why did Google Reader's death destroy thousands of curated feeds?
Studying murdered platforms isn't nostalgia—it's learning how to build systems that can't be murdered.
The Archaeobytologist's Vow
As you read this book, you're joining a community. We're small now—scattered practitioners, archivists, activists, scholars. But we're growing.
If you embrace this work, you're making an implicit commitment:
I will not let digital culture die in silence.
I will excavate what platforms murder.
I will build systems that resist future murder.
I will teach others to do the same.
I am a scholar and a smith, a custodian and a strategist.
I own my ground. I tell my story. I forge my future.
I am an Archaeobytologist.
Looking Ahead
The rest of this book will equip you with:
- Theory: Rigorous frameworks for understanding digital death (Chapters 2-6)
- Methods: Practical skills for excavation and preservation (Chapters 7-10)
- Institutions: Models for sustainable organizations (Chapters 11-14)
- Systems: Alternatives to platform capitalism (Chapters 15-16)
- Impact: Strategies for public scholarship and policy change (Chapters 17-18)
By the end, you'll be able to:
- Classify digital artifacts using the Archaeobyte taxonomy
- Conduct triage using the Custodial Filter
- Excavate a dying platform before it shuts down
- Design a preservation organization that can last 50 years
- Advocate for laws that protect digital culture
- Build alternatives that embody digital sovereignty
You'll be, in short, an Archaeobytologist.
Welcome to the discipline. Now let's get to work.
Discussion Questions
-
On GeoCities: Why do you think Yahoo shut down GeoCities instead of maintaining it as a historical archive? What does this decision reveal about corporate priorities?
-
On Definitions: How is Archaeobytology different from "digital archiving" or "data preservation"? Does it need to be a separate discipline, or could existing fields do this work?
-
On The Three Pillars: Audit your own digital presence. Do you have Declaration (sovereign identity)? Connection (direct communication)? Ground (owned infrastructure)? If not, what would it take to achieve them?
-
On Triage: Imagine a platform announces shutdown in 48 hours. You can save 10% of its content. How do you decide what to save? What ethical dilemmas arise?
-
On The Dual Soul: Can you be only an archivist (save the past) without being a builder (create the future)? Or are both commitments necessary?
-
On Legitimacy: What would it take for Archaeobytology to be recognized as a legitimate academic discipline? Journals? Conferences? University departments? All of the above?
Exercise: Your First Triage
Scenario: You discover that Ello (a social network launched in 2014 as an "ad-free alternative" to Facebook) is shutting down in one week. You have time to preserve approximately 1,000 user profiles out of 50,000 active accounts.
Task:
- Research Ello: What communities formed there? What made it culturally significant?
- Define Criteria: Using the Custodial Filter, list 5 criteria you'd use to select profiles
- Identify Examples: Find 10 specific Ello users you'd prioritize and explain why
- Ethical Dilemmas: Identify at least 3 ethical challenges in this scenario (privacy, consent, harm, etc.)
- Reflection: After making your choices, what did you have to leave behind? How does that feel?
Further Reading
Foundational Texts
- Kirschenbaum, Matthew. Mechanisms: New Media and the Forensic Imagination. MIT Press, 2008.
- Parikka, Jussi. What Is Media Archaeology? Polity, 2012.
- Chun, Wendy Hui Kyong. "The Enduring Ephemeral, or the Future Is a Memory." Critical Inquiry 35, no. 1 (2008): 148-171.
On Platform Death
- Gillespie, Tarleton. Custodians of the Internet: Platforms, Content Moderation, and the Hidden Decisions That Shape Social Media. Yale University Press, 2018.
- Zuboff, Shoshana. The Age of Surveillance Capitalism. PublicAffairs, 2019.
- Doctorow, Cory. The Internet Con: How to Seize the Means of Computation. Verso, 2023.
On Digital Sovereignty
- Lessig, Lawrence. Code: Version 2.0. Basic Books, 2006.
- Schneier, Bruce. Data and Goliath: The Hidden Battles to Collect Your Data and Control Your World. W.W. Norton, 2015.
- Véliz, Carissa. Privacy Is Power: Why and How You Should Take Back Control of Your Data. Melville House, 2020.
On Archives and Memory
- Derrida, Jacques. Archive Fever: A Freudian Impression. University of Chicago Press, 1996.
- Ernst, Wolfgang. Digital Memory and the Archive. University of Minnesota Press, 2013.
- Manoff, Marlene. "Theories of the Archive from Across the Disciplines." Portal: Libraries and the Academy 4, no. 1 (2004): 9-25.
Primary Sources
- Archive Team. "GeoCities: We Didn't Start the Fire." https://archiveteam.org/index.php?title=GeoCities
- Internet Archive. Wayback Machine. https://web.archive.org
- Kahle, Brewster. "Preserving the Internet." Scientific American 276, no. 3 (1997): 82-83.
End of Chapter 1
Next: Chapter 2 — The Archaeobyte Taxonomy: Understanding Digital Mortality