Advances in Multilingual Large Language Models

The field of multilingual large language models (LLMs) is advancing rapidly, with a focus on improving performance in low-resource languages and closing cross-lingual gaps. Recent research has highlighted the importance of controlling for grammaticality in multilingual datasets and the need for more nuanced evaluations of cultural understanding. Noteworthy papers in this area include 'Measuring the Effect of Disfluency in Multilingual Knowledge Probing Benchmarks', which demonstrates the impact of grammaticality on knowledge-retrieval scores, and 'Language over Content: Tracing Cultural Understanding in Multilingual Large Language Models', which examines how LLMs internally represent cultural understanding. 'Rethinking Cross-lingual Gaps from a Statistical Viewpoint' offers a new perspective on the cross-lingual gap, attributing it to variance in responses rather than divergence in latent representations. Other notable work introduces new benchmarks, such as ChiKhaPo, and develops methods for multilingual prompt optimization and difficulty-controllable question generation.

Sources
ChiKhaPo: A Large-Scale Multilingual Benchmark for Evaluating Lexical Comprehension and Generation in Large Language Models
Parameter-Efficient Fine-Tuning for Low-Resource Languages: A Comparative Study of LLMs for Bengali Hate Speech Detection
Difficulty-Controllable Multiple-Choice Question Generation Using Large Language Models and Direct Preference Optimization
CrossNews-UA: A Cross-lingual News Semantic Similarity Benchmark for Ukrainian, Polish, Russian, and English