Lost in translation: AI chatbots still too English-language centric, Stanford study finds (2024)

Lost in translation: AI chatbots still too English-language centric, Stanford study finds (1)

AI solutions and associated chatbots coming to the fore may lack the global diversity needed to serve international user bases. Many of today's large language models tend to favor "Western-centric tastes and values," asserts a recent study by researchers at Stanford University. Attempts to achieve what is referred to as "alignment" with intended users of systems or chatbots often fall short, they claimed.

It's not for lack of trying as the researchers, led by Diyi Yang, assistant professor at Stanford University and part of Stanford Human-Centered Artificial Intelligence (HAI), recount in the study. "Before the creators of a new AI-based chatbot can release their latest apps to the general public, they often reconcile their models with the various intentions and personal values of the intended users." However, efforts to achieve this alignment "can introduce its own biases, which compromise the quality of chatbot responses."

Also: Nvidia will train 100,000 California residents on AI in a first-of-its-kind partnership

The Stanford researchers offer the following recommendations to increase awareness of global diversity:

Recognize that the alignment of language models is not a one-size-fits-all solution. "Various groups are impacted differently by alignment procedures."

Strive for transparency. This "is of the utmost importance in disclosing the design decisions that go into aligning an LLM. Each step of alignment adds additional complexities and impacts on end users." Most human-written preference datasets do not include the demographics of their regional preference annotators. "Reporting such information, along with decisions about what prompts or tasks are in the domain, is essential for the responsible dissemination of aligned LLMs to a diverse audience of users."

Seek multilingual datasets. The researchers looked at the Tülu dataset used in language models, of which 13% is non-English. "Yet this multilingual data leads to performance improvements in six out of nine tested languages for extractive QA and all nine languages for reading comprehension. Many languages can benefit from multilingual data."

Also:AI scientist: 'We need to think outside the large language model box'

Working closely with local users is also essential to overcome cultural or language deficiencies or missteps with AI chatbots. "Collaborating with local experts and native speakers is crucial for ensuring authentic and appropriate adaptation," wrote Vuk Dukic, software engineer and founder at Anablock, in a recent LinkedIn article. "Thorough cultural research is necessary to understand the nuances of each target market. Implementing continuous learning algorithms allows chatbots to adapt to user interactions and feedback over time."

Dukic also urged "extensive testing with local users before full deployment to help identify and resolve cultural missteps." In addition, "offering language selection allows users to choose their preferred language and cultural context."

Featured

Everything announced at Made by Google 2024
You can upgrade your old PC to Windows 11 - even if Microsoft says it's 'incompatible'. Here's how
The best smart rings you can buy: Expert tested
I'm a diehard Pixel user, but I'm considering a change for two reasons (and I'm not alone)