Lost in translation: AI chatbots still too English-language centric, Stanford study finds (2024)

Lost in translation: AI chatbots still too English-language centric, Stanford study finds (1)

AI solutions and associated chatbots coming to the fore may lack the global diversity needed to serve international user bases. Many of today's large language models tend to favor "Western-centric tastes and values," asserts a recent study by researchers at Stanford University. Attempts to achieve what is referred to as "alignment" with intended users of systems or chatbots often fall short, they claimed.

It's not for lack of trying as the researchers, led by Diyi Yang, assistant professor at Stanford University and part of Stanford Human-Centered Artificial Intelligence (HAI), recount in the study. "Before the creators of a new AI-based chatbot can release their latest apps to the general public, they often reconcile their models with the various intentions and personal values of the intended users." However, efforts to achieve this alignment "can introduce its own biases, which compromise the quality of chatbot responses."

Also: Nvidia will train 100,000 California residents on AI in a first-of-its-kind partnership

In theory, "alignment should be universal and make large language models more agreeable and helpful for a variety of users across the globe and, ideally, for the greatest number of users possible," they state. However, annotators seeking to adapt datasets and LLMs within different regions may misinterpret those instruments.

AI chatbots for various purposes -- from customer interactions to intelligent assistants -- keep proliferating at a significant pace, so there's a lot at stake. The global AI chatbot market size is expected to be worth close to $67 billion by 2033, growing at a rate of 26% annually from its current size of more than $6 billion, according to estimates by MarketsUS.

"The AI chatbot market is experiencing rapid growth due to increased demand for automated customer support services and advancements in AI technology," the report's authors detail. "Interestingly, over 50% of enterprises are expected to invest more annually in bots and chatbot development than in traditional mobile app development."

Also:If these chatbots could talk: The most popular ways people are using AI tools

The bottom line is that a huge variety of languages and communities across the globe are currently being underserved by AI and chatbots. English-language instructions or engagements may include phrases or idioms that are open to misinterpretation.

The Stanford study asserts that LLMs are likely to be based on the preferences of their creators, who, at this point, are likely to be based in English-speaking countries. Human preferences are not universal, and LLMs must reflect "the social context of the people it represents -- leading to variations in grammar, topics, and even moral and ethical value systems."

The Stanford researchers offer the following recommendations to increase awareness of global diversity:

Recognize that the alignment of language models is not a one-size-fits-all solution. "Various groups are impacted differently by alignment procedures."

Strive for transparency. This "is of the utmost importance in disclosing the design decisions that go into aligning an LLM. Each step of alignment adds additional complexities and impacts on end users." Most human-written preference datasets do not include the demographics of their regional preference annotators. "Reporting such information, along with decisions about what prompts or tasks are in the domain, is essential for the responsible dissemination of aligned LLMs to a diverse audience of users."

Seek multilingual datasets. The researchers looked at the Tülu dataset used in language models, of which 13% is non-English. "Yet this multilingual data leads to performance improvements in six out of nine tested languages for extractive QA and all nine languages for reading comprehension. Many languages can benefit from multilingual data."

Also:AI scientist: 'We need to think outside the large language model box'

Working closely with local users is also essential to overcome cultural or language deficiencies or missteps with AI chatbots. "Collaborating with local experts and native speakers is crucial for ensuring authentic and appropriate adaptation," wrote Vuk Dukic, software engineer and founder at Anablock, in a recent LinkedIn article. "Thorough cultural research is necessary to understand the nuances of each target market. Implementing continuous learning algorithms allows chatbots to adapt to user interactions and feedback over time."

Dukic also urged "extensive testing with local users before full deployment to help identify and resolve cultural missteps." In addition, "offering language selection allows users to choose their preferred language and cultural context."

Featured

  • Everything announced at Made by Google 2024
  • You can upgrade your old PC to Windows 11 - even if Microsoft says it's 'incompatible'. Here's how
  • The best smart rings you can buy: Expert tested
  • I'm a diehard Pixel user, but I'm considering a change for two reasons (and I'm not alone)
Lost in translation: AI chatbots still too English-language centric, Stanford study finds (2024)
Top Articles
Redesign your Google sites - Sites Help
The Ultimate Google Sites Tutorial [20+ Templates & Examples]
Terraria Enchanting
Gabrielle Abbate Obituary
Irving Hac
What Happened To Father Anthony Mary Ewtn
Geometry Escape Challenge A Answer Key
Our Facility
Keurig Refillable Pods Walmart
Craigslist Farm And Garden Cincinnati Ohio
Google Feud Unblocked 6969
Beebe Portal Athena
Roof Top Snipers Unblocked
Csi Tv Series Wiki
Sadie Proposal Ideas
What Is Vioc On Credit Card Statement
Rufus Benton "Bent" Moulds Jr. Obituary 2024 - Webb & Stephens Funeral Homes
[PDF] NAVY RESERVE PERSONNEL MANUAL - Free Download PDF
Panola County Busted Newspaper
Chamberlain College of Nursing | Tuition & Acceptance Rates 2024
Craig Woolard Net Worth
Bay Area Craigslist Cars For Sale By Owner
Albert Einstein Sdn 2023
Craigslist Fort Smith Ar Personals
Bfsfcu Truecar
Cvs Sport Physicals
Shoe Station Store Locator
Till The End Of The Moon Ep 13 Eng Sub
Calvin Coolidge: Life in Brief | Miller Center
Ff14 Sage Stat Priority
Bursar.okstate.edu
Golden Tickets
John F Slater Funeral Home Brentwood
Mistress Elizabeth Nyc
Boggle BrainBusters: Find 7 States | BOOMER Magazine
Tokyo Spa Memphis Reviews
Heelyqutii
Elisabeth Shue breaks silence about her top-secret 'Cobra Kai' appearance
Vision Source: Premier Network of Independent Optometrists
Daly City Building Division
Armageddon Time Showtimes Near Cmx Daytona 12
Umd Men's Basketball Duluth
Todd Gutner Salary
Thotsbook Com
Garland County Mugshots Today
Catchvideo Chrome Extension
Barback Salary in 2024: Comprehensive Guide | OysterLink
Parks And Rec Fantasy Football Names
Lake County Fl Trash Pickup Schedule
Invitation Quinceanera Espanol
Coors Field Seats In The Shade
Latest Posts
Article information

Author: Virgilio Hermann JD

Last Updated:

Views: 6223

Rating: 4 / 5 (41 voted)

Reviews: 80% of readers found this page helpful

Author information

Name: Virgilio Hermann JD

Birthday: 1997-12-21

Address: 6946 Schoen Cove, Sipesshire, MO 55944

Phone: +3763365785260

Job: Accounting Engineer

Hobby: Web surfing, Rafting, Dowsing, Stand-up comedy, Ghost hunting, Swimming, Amateur radio

Introduction: My name is Virgilio Hermann JD, I am a fine, gifted, beautiful, encouraging, kind, talented, zealous person who loves writing and wants to share my knowledge and understanding with you.