Home > Columns > CRM Columns

Building a True Multilingual Contact Center with GenAI

CrmXchange

Presented By: CrmXchange



How advances in AI and real-time translation are eliminating language barriers, once and for all, in the customer contact center 

Contributed Article by Maik Hummel, Principal AI Evangelist for Parloa   

In the contact center world, resolving a customer service issue requires effective, efficient communication. Agents must know how and when to ask the right questions. They need to serve as caring, empathetic professionals and detectives who can quickly sleuth out and solve a problem.    

There’s nothing more frustrating for a customer than a communication breakdown. Imagine calling a contact center for help, only to realize that the person on the other end of the phone doesn’t understand what you’re trying to say. It’s equally frustrating for the call center agent, who may not have a translator on staff with the corresponding language skills to help bridge the communication gap.    

The good news is that this scenario is swiftly becoming a thing of the past, thanks to significant advancements in genAI technology. As more contact center solutions embed advanced AI technologies, enterprises can finally build a true, multilingual customer contact center.

The Growing Need for Multilingual Support

Most U.S. companies already understand the importance of multilingual support centers. The U.S. doesn’t have an official designated language. While English is the most widely used language, approximately 430 languages are spoken by a diverse population. According to the most recent U.S. Census, more than 25 million people in the United States have limited English proficiency (LEP), about 8% of the population. A LEP customer is someone who does not speak English as their primary language. They may have limited ability to read, speak, write, or understand English. Every LEP customer is different. They may speak English well, not well, or not at all. They may struggle to find or understand words in English. This makes it more essential for contact centers to seamlessly support multiple languages, 24/7.    

However, many contact centers still struggle with properly executing multilingual customer support. For years, the approach to multilingual customer support involved hiring contact center agents fluent in multiple languages and ensuring direct communication with customers in their preferred language. Some contact centers tried to organize dedicated teams of agents or departments for specific language groups to handle calls, chats, or emails. Others turn to third-party interpretation services to support agents while working with a customer.    

Given the sheer number of languages spoken across the U.S. and the volume of calls coming into larger contact centers, these approaches have proven to be unsustainable. There simply aren’t enough skilled contact center agents who speak enough languages to support every language at scale. Additionally, the costs of recruiting, hiring, training, and retaining multilingual agents are relatively high, even when outsourcing these agents from a third party. Moreover, third-party interpretation services can sometimes be ineffective in a call center setting due to several factors, including the lack of visual cues, potential delays in connecting with an interpreter, potential audio quality issues, difficulty managing complex technical jargon, and the need for context specific to the company and its products, which a remote interpreter might not fully grasp, leading to potential misunderstandings and customer frustration.     

Removing Barriers With Advanced AI Technologies

You may have contact center agents on staff who can speak a few languages fluently, which already puts you ahead of the game. And yet, if your contact center supports a large market with thousands of incoming calls, that won’t be much help. For enterprises with contact centers that must scale to operate across multiple regions, it may seem like forced off-shore customer service or hiring a larger team for broad language support are your only options. However, AI and advanced real-time translation can provide an engine to help your agents converse easily in non-native languages.

Real-time translation (RTT) technology has advanced significantly over the last few years due to improvements in artificial intelligence (AI), machine learning (ML), and natural language processing (NLP). Today’s RTT is faster, seamless to the customer, more accurate, and more accessible across different industries, such as e-commerce, healthcare, and financial services. AI-powered RTT systems empower businesses with multilingual contact centers to communicate with international customers in real time with customized vocabularies for industry-specific terms. This is a massive leap beyond outsourcing translation to human agents, which can cause lags in response time.

Moreover, traditional, phrase-based language translation models are evolving thanks to advances in Neural Machine Translation (NMT). This technology uses neural networks to provide more natural, context-aware language translations. To date, Google Translate, Microsoft Translator, DeepL, and other major players have all integrated NMT, leading to more accurate real-time translations.

While NMT excels in literal translation from one language to another, large language models (LLMs) enable the possibility of further refining the translation. LLMs are a type of AI that can generate human-like written responses to user queries. In the context of language translation, we call this “polishing” LLM interpolation.    

LLMs can be instructed to pay attention to industry- or domain-specific wordings, bridge cultural differences (like the different politeness levels in Japanese), and correct potential mistakes from NMT engines.    

In addition to NMT and LLMs, we have seen a significant advancement in the quality and accuracy of Text-to-Speech capabilities. Today’s TTS can give human agents an artificial speaking voice that can talk to the customer in any language. This was not possible in the past because the AI-powered voice sounded robotic and non-authentic. But now, the capability exists to provide an artificial, multilingual voice to human agents that sounds and feels like the “real thing” to the customer.     

Today’s RTT works by using the original transcript from the text-to-speech and the NMT translation and combining these with a dedicated prompt (context) that defines all the rules that we want to have considered by the LLM when proposing an interpolated translation. While this approach increases the latency slightly, it greatly impacts the end-customer experience due to the enhanced translation quality.    

With AI acting as a real-time translator across calls and chat, human agents don’t need to speak a language fluently—or at all. AI can immediately translate an audio or written exchange for the human agent and then back into the customer's preferred language. Thanks to AI, customers may never be aware they are not chatting with someone who speaks their language.     

This capability enhances customer satisfaction, eliminates errors and confusion due to language barriers, and allows for more effective, nuanced communication between customers and live agents. Advanced genAI models can even help agents be more empathetic with their customers to build stronger customer loyalty and deliver a more human experience, regardless of the languages spoken.     

In essence, generative AI has elevated real-time translation into a sophisticated, adaptive tool that allows for more natural, contextually aware, and personalized multilingual communication in the customer contact center. This leap makes RTT more valuable for businesses, travelers, customer service teams, and global collaborators alike. The days of language as a barrier to seamless communication between agents and customers are about to be over.    

About the author

Maik Hummel

Maik Hummel is the Principal AI Evangelist for Parloa, a leader in AI-powered automation and assistance for customer service.