Google Translate, a tool that has long been a bridge across language barriers, is taking a monumental leap forward. The tech giant has announced the addition of 110 new languages to its translation service, marking the largest expansion in its history. This update, powered by the advanced PaLM 2 language model, aims to enhance global communication by supporting languages spoken by over 614 million people, or around 8% of the world's population.
Isaac Caswell, a senior software engineer at Google, highlighted the significance of this update: "From Cantonese to Qʼeqchiʼ, these new languages represent more than 614 million speakers, opening up translations for around 8% of the world’s population". This expansion includes major world languages with over 100 million speakers, as well as languages spoken by small indigenous communities and those undergoing revitalization efforts.
A Focus on African Languages
One of the most notable aspects of this update is the inclusion of a significant number of African languages. About a quarter of the new additions come from Africa, making it the largest expansion of African languages on Google Translate to date. Languages such as Fon, Kikongo, Luo, Ga, Swati, Venda, and Wolof are now supported, reflecting Google's commitment to linguistic diversity and inclusion.
Revitalizing Endangered Languages
Google's efforts also extend to languages that are on the brink of extinction. For instance, Manx, the Celtic language of the Isle of Man, which nearly died out in 1974, has been included in this update. Thanks to a revival movement, there are now thousands of Manx speakers. Similarly, NKo, a standardized form of the West African Manding languages, and Punjabi (Shahmukhi), the most spoken language in Pakistan, have also been added.
The Role of AI in Language Translation
The integration of these new languages was made possible by Google's PaLM 2 AI language model. This model excels in learning languages that share similarities, such as those closely related to Hindi like Awadhi and Marwadi, and French creoles like Seychellois Creole and Mauritian Creole. The AI's ability to handle regional varieties, dialects, and different spelling standards has been crucial in this expansion.
Caswell explained the complexity of integrating languages like Cantonese, which has long been one of the most requested languages for Google Translate. "Because Cantonese often overlaps with Mandarin in writing, it’s tricky to find data and train models," he noted. Despite these challenges, the addition of Cantonese is a significant milestone for the platform.
Looking Ahead
Google's commitment to breaking down language barriers doesn't stop here. The company has announced the 1,000 Languages Initiative, a long-term goal to build AI models that will support the 1,000 most spoken languages around the world. This initiative, combined with ongoing partnerships with expert linguists and native speakers, promises to bring even more language varieties and spelling conventions to Google Translate in the future.
The addition of 110 new languages to Google Translate is a testament to the power of AI in enhancing global communication. By supporting a diverse range of languages, including those spoken by small communities and those undergoing revitalization, Google is making significant strides in its mission to make information accessible to everyone, regardless of language.