Updates
Jan 2025: | Invited talk on Pangea at TikTok. |
Jan 2025: | Pangea, our fully open multilingual multimodal LLM, is on CMU news! [Link] |
Dec 2024: | Invited talk on API-Based Agents at CAMEL-AI. [Video] |
Nov 2024: | Our work on image transcreation won Best Paper Award at EMNLP 2024! |
Aug 2024: | Started research at CMU Neulab as a BS/MS student! :) |
Publications
Beyond Browsing: API-Based Web Agents
, Frank Xu, Shuyan Zhou, Graham Neubig
Under Review
Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages
Xiang Yue*, , Akari Asai, Seungone Kim, Jean de Dieu Nyandwi, Simran Khanuja,
Anjali Kantharuban, Lintang Sutawika, Sathyanarayanan Ramamoorthy, Graham Neubig
ICLR 2025
OpenHands: An Open Platform for AI Software Developers as Generalist Agents
Xingyao Wang, Boxuan Li, Yufan Song, Frank F. Xu, Xiangru Tang, Mingchen Zhuge, Jiayi Pan, ,
Bowen Li, Jaskirat Singh, Hoang H. Tran, Fuqiang Li, Ren Ma, Mingzhang Zheng, Bill Qian, Yanjun Shao, Niklas Muennighoff,
Yizhe Zhang, Binyuan Hui, Junyang Lin, Robert Brennan, Hao Peng, Heng Ji, Graham Neubig
ICLR 2025
What Is Missing in Multilingual Visual Reasoning and How to Fix It
, Simran Khanuja, Graham Neubig
NAACL 2025 Findings
🏆 Best Paper
An image speaks a thousand words, but can everyone listen? On translating images for cultural relevance
Simran Khanuja, Sathyanarayanan Ramamoorthy, , Graham Neubig
EMNLP 2024
GlobalBench: A Benchmark for Global Progress in Natural Language Processing
, Catherine Cui, Simran Khanuja, Pengfei Liu, Fahim Faisal, Alissa Ostapenko, Genta Indra Winata,
Alham Fikri Aji, Samuel Cahyawijaya, Yulia Tsvetkov, Antonios Anastasopoulos, Graham Neubig
EMNLP 2023