I am an incoming PhD student at Carnegie Mellon University, where I am doing research in Natural Language Processing, funded by DSTA. I also pursued a BS/MS as CMU, with a dual degree in Computer Science and Statistics and Machine Learning.

I am broadly interested in NLP. Previously I worked on Multimodal LLM, Web Agents, Multi-Agents, Multilingual NLP, and benchmarking. I am also open to exploring interesting new directions.

I have been fortunate to work with an amazing set of researchers in the past. I am very grateful to be advised by Professor Graham Neubig and to have the opportunity to work closely with many great people.

In addition to my time in research, I interned at Intuit as a software engineer in AI/ML team, and PreVeil as a software developer. For more details, please check my CV or hit me up on my email: yueqis@cs.cmu.edu :)

Updates:

. .
Apr 2025: Invited talk on VisualPuzzles at OpenCompass, ModelScope, and other organizations.
Mar 2025: Joining CMU LTI NeuLab as a PhD student starting from 2025 Fall!
Jan 2025: Invited talk on Pangea at TikTok
Jan 2025: Pangea, our fully open multilingual multimodal LLM, is on CMU news! [Link]
Dec 2024: Invited talk on API-Based Agent at CAMEL-AI. [Video]
Nov 2024: Our work on image transcreation won Best Paper Award at EMNLP 2024!
Aug 2024: Started research at CMU Neulab as a BS/MS student! :)

Publications

FieldWorkArena: Agentic AI Benchmark for Real Field Work Tasks
Atsunori Moteki, Shoichi Masui, Fan Yang, Yueqi Song, Yonatan Bisk, Graham Neubig, Ikuo Kusajima, Yasuto Watanabe, Hiroyuki Ishida, Jun Takahashi, Shan Jiang
Under Review
PDF| Abstract

VisualPuzzles: Decoupling Multimodal Reasoning Evaluation from Domain Knowledge
Yueqi Song*, Tianyue Ou*, Yibo Kong†, Zecheng Li†, Graham Neubig, Xiang Yue
Under Review
PDF| Abstract| Leaderboard| Model Outputs| Code

SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills
Boyuan Zheng*, Michael Y Fatemi*, Xiaolong Jin*, Zora Zhiruo Wang, Apurva Gandhi, Yueqi Song, Yu Gu, Jayanth Srinivasa, Gaowen Liu, Graham Neubig, Yu Su
Under Review
PDF| Abstract

Beyond Browsing: API-Based Web Agents
Yueqi Song, Frank Xu, Shuyan Zhou, Graham Neubig
ACL 2025 (Findings)
PDF| Abstract| Project Website

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia
Samuel Cahyawijaya, Holy Lovenia, Joel Ruben Antony Moniz, Tack Hwa Wong, Mohammad Rifqi Farhansyah, Thant Thiri Maung, Frederikus Hudi, David Anugraha, Muhammad Ravi Shulthan Habibi, Muhammad Reza Qorib, Amit Agarwal, Joseph Marvin Imperial, Hitesh Laxmichand Patel, Vicky Feliren, Bahrul Ilmi Nasution, Manuel Antonio Rufino, Genta Indra Winata, Rian Adam Rajagede, Carlos Rafael Catalan, Mohamed Fazli Imam, Priyaranjan Pattnayak, Salsabila Zahirah Pranida, Kevin Pratama, Yeshil Bangera, Adisai Na-Thalang, Patricia Nicole Monderin, Yueqi Song, Christian Simon, Lynnette Hui Xian Ng, et al.
ACL 2025
PDF| Abstract|

Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages
Xiang Yue*, Yueqi Song*, Akari Asai, Seungone Kim, Jean de Dieu Nyandwi, Simran Khanuja, Anjali Kantharuban, Lintang Sutawika, Sathyanarayanan Ramamoorthy, Graham Neubig
ICLR 2025
PDF| Abstract| Project Website| Model| Demo| Train Data| Benchmark

OpenHands: An Open Platform for AI Software Developers as Generalist Agents
Xingyao Wang, Boxuan Li, Yufan Song, Frank F. Xu, Xiangru Tang, Mingchen Zhuge, Jiayi Pan, Yueqi Song, Bowen Li, Jaskirat Singh, Hoang H. Tran, Fuqiang Li, Ren Ma, Mingzhang Zheng, Bill Qian, Yanjun Shao, Niklas Muennighoff, Yizhe Zhang, Binyuan Hui, Junyang Lin, Robert Brennan, Hao Peng, Heng Ji, Graham Neubig
ICLR 2025
PDF| Abstract| Code

What Is Missing in Multilingual Visual Reasoning and How to Fix It
Yueqi Song, Simran Khanuja, Graham Neubig
NAACL 2025 Findings
PDF| Abstract

🏆 Best Paper
An image speaks a thousand words, but can everyone listen?
On translating images for cultural relevance

Simran Khanuja, Sathyanarayanan Ramamoorthy, Yueqi Song, Graham Neubig
EMNLP 2024
PDF| Abstract| Project Website| Code

GlobalBench: A Benchmark for Global Progress in Natural Language Processing
Yueqi Song, Catherine Cui, Simran Khanuja, Pengfei Liu, Fahim Faisal, Alissa Ostapenko, Genta Indra Winata, Alham Fikri Aji, Samuel Cahyawijaya, Yulia Tsvetkov, Antonios Anastasopoulos, Graham Neubig
EMNLP 2023
PDF| Abstract