AI Is Racing to Save the 3,000 Languages That Will Die This Century. The Root of Language Death Goes Back to Empire.
Every two weeks, a language dies. AI tools are now helping linguists and indigenous communities document, teach, and revitalize languages before the last speakers are gone. The pattern of language death is as old as conquest itself.
Key Takeaways
- •A language dies approximately every two weeks — by 2100, an estimated 3,000 of the world's 7,000+ languages will be extinct
- •Meta's No Language Left Behind AI translates 200 languages including many low-resource and endangered ones
- •US Indian boarding schools (1879-1960s) systematically punished Native children for speaking indigenous languages
- •AI tools now help communities create dictionaries, pronunciation guides, and language-learning apps for languages with no written tradition
Root Connection
From the Roman Empire imposing Latin across Europe, to Spanish colonizers suppressing Quechua, to US boarding schools beating Native children for speaking their languages, to AI helping those same communities reclaim their tongues — language has always been the first casualty and the last hope of cultural survival.
World's Languages by Number of Speakers
Nearly half the world's 7,000+ languages have fewer than 10,000 speakers
Source: Ethnologue / UNESCO
Timeline
Roman expansion imposes Latin across Western Europe, beginning the extinction of Gaulish, Celtiberian, Etruscan, and dozens of other languages
Spanish colonization of the Americas begins 500 years of indigenous language suppression. An estimated 1,500 languages existed in the Americas; today, fewer than 600 remain
The Carlisle Indian Industrial School opens in Pennsylvania with the motto 'Kill the Indian, save the man.' Native children are punished for speaking their languages.
UNESCO launches the Endangered Languages Project. Estimates: 50-90% of the world's ~7,000 languages will disappear by 2100
Meta (Facebook) releases No Language Left Behind — an AI model that can translate 200 languages, including many with little digital presence
AI tools now assist language documentation in 500+ endangered languages. Google's 1,000 Languages Initiative trains speech models on low-resource languages worldwide
Marie Smith Jones died on January 21, 2008, in Anchorage, Alaska. She was 89 years old. With her death, the Eyak language — spoken for thousands of years along the copper river delta of southern Alaska — ceased to exist. Its last fluent speaker was gone. An entire way of encoding reality, of describing the relationship between humans and salmon and glaciers, of telling jokes that only work in Eyak, vanished from the Earth.
This happens every two weeks. On average, one of the world's approximately 7,000 languages dies every 14 days. Linguists estimate that between 50% and 90% of today's languages will be extinct by the end of this century. That means somewhere between 3,000 and 6,000 languages — each one a unique cognitive architecture for understanding reality — will disappear within our lifetimes.
AI can't stop language death. But it's giving endangered language communities something they've never had before: tools powerful enough to document, teach, and revitalize their languages at the speed that's needed.
THE ROOT
When an elder dies, a library burns. When a language dies, a civilization's way of understanding the world disappears forever. No amount of archaeology can recover it.
Languages have always died. What's different about the current extinction wave is its speed and its cause. Most language death in recorded history can be traced to a single pattern: empire.
When Rome expanded across Western Europe, Latin didn't just coexist with local languages — it replaced them. Gaulish, the Celtic language spoken across what is now France, was dead by the 6th century. Etruscan, the language of the civilization that preceded Rome in Italy, left behind inscriptions that scholars still struggle to fully decipher. Celtiberian, Lusitanian, Dacian — all gone. Latin didn't erase these languages through violence alone. It erased them through administration, trade, law, and prestige. If you wanted a government job, you spoke Latin. If you wanted to trade, you spoke Latin. Within a few generations, parents stopped teaching their children the old language because it had no economic value.
The same pattern repeated worldwide. Spanish colonizers suppressed Quechua, Nahuatl, and hundreds of other indigenous languages across the Americas. British colonial education systems imposed English in India, Africa, and Southeast Asia. French colonial policy explicitly aimed to replace local languages in West Africa and Indochina.
But the most systematic language destruction happened in North America. In 1879, Captain Richard Henry Pratt opened the Carlisle Indian Industrial School in Pennsylvania with the explicit mission to "Kill the Indian, save the man." Native American children were taken from their families, often by force, and placed in boarding schools where they were beaten for speaking their native languages. Their hair was cut. Their names were changed. They were forbidden from practicing their cultures.
The boarding school system operated for nearly a century. By the time it ended, many Native American languages had lost an entire generation of speakers. Languages that had been spoken for thousands of years were reduced to a handful of elders.
The Cherokee syllabary was created by Sequoyah in 1821. He was illiterate in English. He invented a writing system for his people's language in 12 years. Now AI helps his descendants teach it.
THE AI LIFELINE
Traditional language documentation is slow, expensive, and labor-intensive. A trained linguist works with a native speaker for months or years to create a grammar, a dictionary, and a collection of recorded texts. This work is essential, but there aren't enough linguists, and many endangered languages have only a handful of elderly speakers left. The clock is ticking.
AI is accelerating every step of this process. Speech recognition models can now be trained on relatively small amounts of audio data — as few as a hundred hours — to create speech-to-text systems for languages that have never had one. Machine learning can identify grammatical patterns in transcribed texts, helping linguists build grammars faster. Neural machine translation can create rough translation tools between an endangered language and a major language, even with limited training data.
Meta's No Language Left Behind project, released in 2022, demonstrated that AI translation could work across 200 languages, including many with minimal digital presence. Google's 1,000 Languages Initiative is building speech and text models for the world's most-spoken 1,000 languages — covering 98% of the world's population but also including many languages that are on the brink.
On the ground, the most impactful work is being done by indigenous communities themselves, using AI tools to build language-learning apps, create interactive dictionaries, and produce educational content. The Māori community in New Zealand has built AI-powered tools for the te reo Māori language. First Nations communities in Canada are using AI to create pronunciation guides from elder recordings. Cherokee Nation is building on Sequoyah's 200-year-old syllabary with modern speech recognition.
Sequoyah's story is worth noting. Born around 1770 as a Cherokee silversmith who could not read or write English, he spent 12 years developing a complete writing system for the Cherokee language — 85 characters representing every syllable. By 1825, the Cherokee Nation was more literate than the surrounding white population. It was one of the most remarkable intellectual achievements in human history. Now AI tools help teach his syllabary to a new generation.
WHAT'S LOST WHEN A LANGUAGE DIES
Languages are not just communication tools. They are cognitive frameworks — different ways of parsing reality. The Hopi language has no grammatical tense, suggesting a fundamentally different relationship with time. The Guugu Yimithirr language of Australia uses cardinal directions (north, south, east, west) instead of relative directions (left, right), and its speakers develop extraordinary spatial awareness as a result. The Pirahã language of Brazil has no words for specific numbers, only "few" and "many."
When a language dies, we don't just lose words. We lose a way of thinking. We lose metaphors, stories, songs, jokes, and knowledge systems that evolved over thousands of years. We lose medicinal plant knowledge encoded in vocabulary that has no equivalent in English. We lose navigational knowledge, ecological knowledge, and astronomical knowledge that exists only in the oral traditions of small communities.
The root of language preservation isn't technology. It's recognition — the same recognition that Abbé de l'Épée brought to sign language, and that Sequoyah brought to Cherokee. The recognition that every language, no matter how few its speakers, encodes something irreplaceable about the human experience.
AI can't save a language that no one wants to speak. But for the communities that are fighting to keep their languages alive — and there are hundreds of them — AI is the most powerful ally they've ever had.
Every two weeks, a library burns. AI is trying to copy the books before the flames reach them.
How did this make you feel?
Recommended Gear
View all →Disclosure: Some links on this page may be affiliate links. If you make a purchase through these links, we may earn a small commission at no extra cost to you. We only recommend products we genuinely believe in.
Framework Laptop 16
The modular, repairable laptop that lets you upgrade every component. The right-to-repair movement in action.
Flipper Zero
Multi-tool for pentesters and hardware hackers. RFID, NFC, infrared, GPIO — all in your pocket.
The Innovators by Walter Isaacson
The untold story of the people who created the computer, internet, and digital revolution. Essential tech history.
reMarkable 2 Paper Tablet
E-ink tablet that feels like writing on real paper. No distractions, no notifications — just thinking.
Keep Reading
Want to dig deeper? Trace any technology back to its origins.
Start Research