Multilingual Language Understanding and Generation
Natural language processing research is converging on more capable models for multilingual understanding and generation. To bridge the gap between languages, researchers are exploring approaches that incorporate visual information and cultural awareness. Key challenges include narrowing the performance gap between high-resource and low-resource languages and mitigating biases in multilingual models. Notable papers include IRLBench, which introduces a benchmark for evaluating large language models in multilingual settings; Cross-Lingual Representation Alignment Through Contrastive Image-Caption Tuning, which aligns sentence representations across languages using visual information; and TransBench and ScholarBench, which highlight the need for more robust evaluation frameworks for machine translation and academic reasoning in multilingual contexts.
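The contrastive image-caption idea above can be sketched briefly. The exact objective used in the cited paper is not reproduced here; this is a minimal sketch assuming a CLIP-style symmetric InfoNCE loss, where captions in different languages that describe the same image are pulled toward that image's embedding and thereby toward each other. All function and variable names are illustrative.

```python
import numpy as np

def info_nce(image_emb, caption_emb, temperature=0.07):
    """Symmetric InfoNCE loss over image/caption embedding matrices.

    Row i of each matrix is assumed to belong to the same image-caption
    pair; off-diagonal rows serve as in-batch negatives.
    """
    # L2-normalize so the dot product is cosine similarity
    img = image_emb / np.linalg.norm(image_emb, axis=1, keepdims=True)
    cap = caption_emb / np.linalg.norm(caption_emb, axis=1, keepdims=True)
    logits = img @ cap.T / temperature      # pairwise similarity matrix
    diag = np.arange(len(logits))           # matching pairs on the diagonal

    def xent(l):
        # cross-entropy with the diagonal as the correct class per row
        l = l - l.max(axis=1, keepdims=True)
        log_p = l - np.log(np.exp(l).sum(axis=1, keepdims=True))
        return -log_p[diag, diag].mean()

    # average both retrieval directions: image->caption and caption->image
    return 0.5 * (xent(logits) + xent(logits.T))
```

Training captions from multiple languages against the same shared image embeddings gives the cross-lingual alignment effect: aligned pairs yield a low loss, while shuffled (mismatched) pairs yield a high one, so minimizing this objective pushes translations of the same caption toward a common region of the embedding space.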
Sources
IRLBench: A Multi-modal, Culturally Grounded, Parallel Irish-English Benchmark for Open-Ended LLM Reasoning Evaluation
Breaking Language Barriers or Reinforcing Bias? A Study of Gender and Racial Disparities in Multilingual Contrastive Vision Language Models