
USC researchers use AI to help translate Bible into very rare languages

(RNS) — Their goal is to target ‘very low-resource languages that are not even in the top 500.’
The Bible in different languages. Image courtesy of Creative Commons

(RNS) — Out of the 7,100 languages that exist, the Bible has been translated into more than 700, making it the most-translated book in the world. Yet, those remaining languages — many of them extremely rare — have vexed Bible translators for decades. Two scientists are looking to new advancements in artificial intelligence to help close the gap.

“We want to reach all the languages on earth; the goal is to reach everyone,” said Joel Mathew, a research engineer who alongside Ulf Hermjakob recently launched the Greek Room, an AI-powered technology to help streamline the highly technical process of biblical translation.

Combining Hermjakob’s long experience with natural language processing technologies and Mathew’s field knowledge of Bible translation, the two USC Information Sciences Institute researchers developed the technology with an aim to target “very low-resource languages that are not even in the top 500,” said Mathew.


The Greek Room includes three main tools: spell-checking; word alignment, which ensures consistency in translation; and Wildebeest, used to detect improper characters in a script.
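To give a sense of what detecting improper characters can involve, here is a minimal Python sketch. It is not the Greek Room's or Wildebeest's actual code or interface; the function name `find_suspect_chars` and its rules are assumptions made for illustration. It flags invisible control or format characters and letters that belong to a different script than the one the translation is expected to use.

```python
import unicodedata


def find_suspect_chars(text, expected_script):
    """Return (index, char, reason) tuples for characters that look out of place.

    Hypothetical helper for illustration only. It relies on Unicode character
    names, which begin with the script name (e.g. "DEVANAGARI LETTER NA").
    """
    suspects = []
    prefix = expected_script.upper()
    for i, ch in enumerate(text):
        category = unicodedata.category(ch)
        # Control, format, private-use and unassigned characters are usually
        # invisible on screen. Simplification: this also flags zero-width
        # joiners, which some scripts use legitimately.
        if category in ("Cc", "Cf", "Co", "Cn") and ch not in "\n\t":
            suspects.append((i, ch, "control or invisible character"))
        # Letters should come from the expected script; combining marks,
        # digits, punctuation and spaces are not checked here.
        elif category.startswith("L"):
            name = unicodedata.name(ch, "")
            if not name.startswith(prefix):
                suspects.append((i, ch, "letter from another script"))
    return suspects


if __name__ == "__main__":
    # A Devanagari word with a zero-width space and a stray Latin letter.
    sample = "नम\u200bस्ते a"
    for index, char, reason in find_suspect_chars(sample, "Devanagari"):
        print(index, repr(char), reason)
```

A production tool would need far more nuance, such as per-language lists of permitted characters, but the sketch shows why such checks are tedious for people and easy for software.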

Joel Mathew. Photo via USC


The two scientists met in 2015 when Mathew joined USC to complete a master’s degree in computer science. There, he met Hermjakob in the AI division of the Information Sciences Institute. They bonded over a shared passion for languages and their Christian faith.

Mathew, the son of two Bible translators, has observed firsthand the difficulties that come with manual translation by local church members. In his hometown, New Delhi, he took note of all the tasks that technology could accomplish.

Spell-checking usually requires many people and a great deal of time, he explained. In the context of translation into rare languages, only local church members are qualified to do it, and they don’t have technology to back their work.

“These are not trivial problems; these are very hard problems. But big companies are not interested in solving them; it’s not their business model to target very rare languages,” he said.

When Mathew shared with him some of the problems Indian translators faced on the ground, Hermjakob jumped at the opportunity.


“I always had this feeling to know how, at some point, I could apply my skills to my faith,” said Hermjakob, who earned a Ph.D. in computer science from the University of Texas.

With their project, Mathew and Hermjakob want to work on languages that do not even have a writing system, a codified grammar, dictionaries or spell-checkers.

Ulf Hermjakob. Photo via USC


“We are thinking of languages like Uyghur or Oromo (Oromo is spoken in Ethiopia and Northern Kenya),” said Hermjakob.

Recently, they were approached by an Indian consultant specifically interested in the spell-checking and word-alignment tools for Bible translation in Kolami, a language spoken in western India that counts 130,000 native speakers.

The Greek Room also hopes to change the traditional model of Bible translation. Historically, translations were done by Western missionaries, who could only work on two languages at most in their lifetime, explained Hermjakob. With the Greek Room, the two researchers encourage a local church-driven model.

“Local churches and local language communities are asking for translations of the Bible in their heart language,” explained Mathew, adding that in a multilingual context, the heart language is the one in which people express their deepest feelings and is usually their native language.


This first version of the Greek Room focuses on quality control so translators can prioritize other tasks that require more judgment, like finding a way to translate a concept that doesn’t exist in a given language. In their next version, the two researchers want the tool to suggest better translations.

Now that their code and data are available on GitHub, they hope other users will integrate their research into their own tools and innovate further.

Their initiative, supported by Wycliffe Bible Translators USA, is part of a broader program directed by Every Tribe, Every Nation that hopes to make Scripture available in every language by 2033.
