Profile photo of Manish Shetty
I am a PhD student at UC Berkeley in the Sky Lab studying AI4Code.
I work on turning language models into capable software agents.
My focus has been building benchmarks ("evals") and environments that either stage long-horizon and challenging coding tasks or elicit more from models by scaling compute. My work spans the software lifecycle: code completion, optimization, translation, and deployment.
From 2020 to 2022, I was a research fellow at Microsoft Research.
Email · CV · Scholar · GitHub · Notes · 𝕏

News

Sep 2025. ✔︎ Passed the PhD Qualifying Exam at UC Berkeley!
May 2025. Interning at Google DeepMind! Working on Gemini post-training for computer-use tasks.
Apr 2024. 🏆 Received the Tong Leong Lim Pre-Doctoral Prize at UC Berkeley.
Mar 2024. 🏆 Received the 2024 Outstanding Graduate Student Instructor Award at UC Berkeley.
Sept 2023. ✔︎ Passed the PhD Preliminary Exam at UC Berkeley!
May 2023. Taught my first class: CS164: Compilers and Programming Languages at UC Berkeley!
Nov 2022. 🏆 Our empirical study @ Microsoft Research on production incidents in large-scale cloud services received the Best Paper Award 🏆 at SoCC 2022.
Aug 2022. Started my Ph.D. at UC Berkeley advised by Prof. Koushik Sen. Joining the Sky Lab and the Programming Systems group!

Papers

ICLR 2026
Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces
Mike A. Merrill, Alexander G. Shaw, ..., Manish Shetty, ..., Ludwig Schmidt
NeurIPS 2025
GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents
Manish Shetty, Naman Jain, Jinjian Liu, Vijay Kethanaboyina, Koushik Sen, Ion Stoica
COLM 2025
R2E-Gym: Procedural Environments and Hybrid Verifiers for Scaling Open-Weights SWE Agents
Naman Jain*, Jaskirat Singh*, Manish Shetty, Liang Zheng, Koushik Sen, Ion Stoica
ICML 2025
Challenges and Paths Towards AI for Software Engineering
Alex Gu, Naman Jain*, Wen-Ding Li*, Manish Shetty*, Yijia Shao, Ziyang Li, Diyi Yang, Kevin Ellis, Koushik Sen, Armando Solar-Lezama
ICSE 2025
Syzygy: Dual Code-Test C to Rust Translation using LLMs and Dynamic Analysis
Manish Shetty*, Naman Jain*, Adwait Godbole*, Sanjit Seshia, Koushik Sen
ICML 2024
R2E: Turning any GitHub Repository into a Programming Agent Environment
Manish Shetty*, Naman Jain*, Tianjun Zhang, King Han, Koushik Sen, Ion Stoica
MLSys 2025
AIOpsLab: A Holistic Framework to Evaluate AI Agents for Enabling Autonomous Clouds
Yinfang Chen, Manish Shetty, Gagan Somashekar, Minghua Ma, Yogesh Simmhan, Jonathan Mace, Chetan Bansal, Rujia Wang, Saravan Rajmohan
SoCC 2024
Building AI Agents for Autonomous Clouds: Challenges and Design Principles
Manish Shetty, Yinfang Chen, Gagan Somashekar, Minghua Ma, Yogesh Simmhan, Xuchao Zhang, Jonathan Mace, Dax Vandevoorde, Pedro Las-Casas, Shachee Mishra Gupta, Suman Nath, Chetan Bansal, Saravan Rajmohan

See all papers →

Awards

Teaching

Service