30 October 2025
CUHK launches world’s first dynamic evaluation platform and ecosystem for Cantonese large language models

The Chinese University of Hong Kong (CUHK) today (30 October) announced the launch of CLEVA-Cantonese, the world’s first dynamic evaluation platform and ecosystem dedicated to the Cantonese language. Cantonese is a vital language for communities in Hong Kong, Guangdong and other Cantonese-speaking regions. This pioneering platform delivers fair, dynamic and informative benchmarking that reveals how well various large language models (LLMs) support Cantonese. It provides researchers and developers with meaningful insights to accelerate the improvement and real-world application of Cantonese-capable LLMs.
This project is a collaboration between CUHK’s InnoHK Centre for Perceptual and Interactive Intelligence (CPII) and the CUHK Language and Vision (LaVi) Lab. It is co-led by Professor Helen Meng Mei-ling, Patrick Huen Wing Ming Professor of Systems Engineering and Engineering Management and Director of CPII, and Professor Wang Liwei, Assistant Professor in the Department of Computer Science and Engineering at CUHK, Leader of the LaVi Lab and CLEVA project leader.
An evolving ecosystem for Cantonese LLM evaluation
CLEVA (Chinese Language Models EVAluation Platform), developed by CUHK’s LaVi Lab, is widely recognised as one of the largest and most comprehensive evaluation benchmarks for Mandarin Chinese LLMs. Building upon this foundation, CLEVA-Cantonese establishes the world’s first evolving ecosystem for Cantonese LLM evaluation. It integrates a collaborative, automated workflow that cycles through four key phases: data import and filtering, language model understanding, evaluation, and feedback. This continuous process provides timely insights to guide LLM innovation, improves services for Cantonese-speaking populations and generates research outcomes that can assist in the evaluation of other low-resource languages.
Cantonese evaluation for LLMs is crucial, as it provides clear performance signals that pinpoint model strengths and areas for improvement, thereby accelerating their development. It also enables scalable, timely assessment that keeps pace with rapid model iteration cycles, while ensuring trustworthy comparisons through standardised tasks, prompts and multi-metric evaluations.
CLEVA-Cantonese is built to meet the special challenges of creating a high-quality Cantonese benchmark:

It can evaluate written vernacular Cantonese (粵語白話文), the written form of everyday spoken Cantonese, capturing unique linguistic traits such as colloquial expressions and slang, code-switching with English and Mandarin, and romanisation in the form of Jyutping (粵拼).
CLEVA-Cantonese standardises the end-to-end workflow for evaluation, including constructing representative tasks with up-to-date data, evaluating LLMs using consistent prompts and selecting a suite of informative metrics.
Through collaboration with data providers such as Phoenix TV, CLEVA-Cantonese continuously adopts the latest data, which naturally reflects emerging language trends in Cantonese and mitigates data contamination.

Professor Wang said: “We utilise natural language understanding technology based on LLMs to assist in constructing a series of multidimensional evaluation tasks. These tasks are designed around linguistic features, ensuring the benchmark faithfully reflects the language’s structural and knowledge-based characteristics. CLEVA-Cantonese marks the beginning of an ecosystem that brings together academic research, data contributors and state-of-the-art model developers to drive LLM advancement across languages, with immediate benefits for Cantonese-speaking communities.”
Early findings and the continuous improvement loop
The CLEVA-Cantonese team has completed an initial round of evaluation with a range of international and domestic LLMs, spanning open-source and proprietary models. The findings show that even the latest models still struggle to fully capture the nuances of Cantonese, leaving substantial room for improvement in grammar, pronunciation and vocabulary. These insights will guide the next generation of LLMs, enhancing their alignment with Cantonese and performance in related tasks. As stronger models emerge, CLEVA-Cantonese will iteratively refine its evaluation criteria – completing the continuous cycle of data import, language model understanding, evaluation and feedback.
Professor Meng concluded: “Building upon CUHK’s interdisciplinary expertise, we will continuously refresh the benchmark through expanded data partnerships, develop an open evaluation platform for researchers, developers and institutions, extend CLEVA-Cantonese to support more languages, tasks and spoken Cantonese, and provide shared tools to advance collaborative research across linguistics, education, culture and related domains. CLEVA-Cantonese elevates evaluation to a systematic process. It makes gaps for improvement visible, guides research and product roadmaps, and helps ensure Cantonese is well supported across areas such as education, healthcare, public services and cultural life.”
