Andrew Wagenmaker

Postdoctoral Scholar
Electrical Engineering & Computer Sciences
UC Berkeley

ajwagen AT berkeley DOT edu
Twitter / Scholar

I am a postdoc in EECS at UC Berkeley working with Sergey Levine. Previously, I completed a PhD in Computer Science at the University of Washington, where I was advised by Kevin Jamieson. While in graduate school, I also spent time at Microsoft Research, mentored by Dylan Foster, as well as the Simons Institute, and my work was supported by an NSF Graduate Research Fellowship. Before that, I completed a master's and bachelor's degree at the University of Michigan, both in Electrical Engineering.

My research focuses on learning in dynamic, sequential, and interactive settings, and spans the spectrum from fundamental theory to practical algorithms for real-world decision-making. In particular, much of my work focuses on developing effective and scalable approaches to exploration, with the goal of enabling efficient online policy improvement. That is, how should we attempt novel behaviors during online deployment in order to identify better solution strategies than those currently known, and how can we use the experience collected from this process to improve our performance over time?

I seek to develop a fundamental theoretical understanding of such questions, and to apply these theoretical insights to motivate novel algorithmic techniques that lead to real-world impact, particularly towards enabling fast and efficient learning in robotic control settings.

I am on the 2025-2026 academic job market.

Selected Publications (Show All):

RoboReward: General-Purpose Vision-Language Reward Models for Robotics
Tony Lee^c, Andrew Wagenmaker^c, Karl Pertsch^c, Percy Liang, Sergey Levine, and Chelsea Finn
In Submission, 2026

Posterior Behavioral Cloning: Pretraining BC Policies for Efficient RL Finetuning
Andrew Wagenmaker, Perry Dong, Raymond Tsao, Chelsea Finn, and Sergey Levine
In Submission, 2025

Robust Finetuning of Vision-Language-Action Robot Policies via Parameter Merging
Yajat Yadav^c, Zhiyuan Zhou^c, Andrew Wagenmaker, Karl Pertsch, and Sergey Levine
In Submission, 2025

Steering Your Diffusion Policy with Latent Space Reinforcement Learning
Andrew Wagenmaker^c, Mitsuhiko Nakamoto^c, Yunchu Zhang^c, Seohong Park, Waleed Yagoub, Anusha Nagabandi, Abhishek Gupta^c, and Sergey Levine^c
CoRL, 2025 (Oral, Best Paper Award Nomination) [Website] [Code]

Behavioral Exploration: Learning to Explore via In-Context Adaptation
Andrew Wagenmaker, Zhiyuan Zhou, and Sergey Levine
ICML, 2025

Overcoming the Sim-to-Real Gap: Leveraging Simulation to Learn to Explore for Real-World RL
Andrew Wagenmaker, Kevin Huang, Liyiming Ke, Byron Boots, Kevin Jamieson, and Abhishek Gupta
NeurIPS, 2024

Active Learning of Neural Population Dynamics Using Two-Photon Holographic Optogenetics
Andrew Wagenmaker*, Lu Mi*, Marton Rozsa, Matthew S. Bull, Karel Svoboda, Kayvon Daie^†, Matthew D. Golub^†, and Kevin Jamieson^†
NeurIPS, 2024

Sample Complexity Reduction via Policy Difference Estimation in Tabular Reinforcement Learning
Adhyyan Narang, Andrew Wagenmaker, Lillian Ratliff, and Kevin Jamieson
NeurIPS, 2024 (Spotlight)

Humor in AI: Massive Scale Crowd-Sourced Preferences and Benchmarks for Cartoon Captioning
Jifan Zhang, Lalit Jain, Yang Guo, Jiayi Chen, Kuan Lok Zhou, Siddharth Suresh, Andrew Wagenmaker, Scott Sievert, Timothy Rogers, Kevin Jamieson, Robert Mankoff, and Robert Nowak
NeurIPS, 2024 (Datasets & Benchmarks Track, Spotlight)

Corruption-Robust Linear Bandits: Minimax Optimality and Gap-Dependent Misspecification
Haolin Liu^α, Artin Tajdini^α, Andrew Wagenmaker^α, and Chen-Yu Wei^α
NeurIPS, 2024

Fair Active Learning in Low-Data Regimes
Romain Camilleri, Andrew Wagenmaker, Jamie Morgenstern, Lalit Jain, and Kevin Jamieson
UAI, 2024

ASID: Active Exploration for System Identification in Robotic Manipulation
Marius Memmel, Andrew Wagenmaker, Chuning Zhu, Patrick Yin, Dieter Fox, and Abhishek Gupta
ICLR, 2024 (Oral)

Optimal Exploration for Model-Based RL in Nonlinear Systems
Andrew Wagenmaker, Guanya Shi, and Kevin Jamieson
NeurIPS, 2023 (Spotlight) [Code]

Instance-Optimality in Interactive Decision Making: Toward a Non-Asymptotic Theory
Andrew Wagenmaker and Dylan Foster
COLT, 2023 [Talk]

Leveraging Offline Data in Online Reinforcement Learning
Andrew Wagenmaker and Aldo Pacchiano
ICML, 2023 [Talk]

Instance-Dependent Near-Optimal Policy Identification in Linear MDPs via Online Experiment Design
Andrew Wagenmaker and Kevin Jamieson
NeurIPS, 2022

Active Learning with Safety Constraints
Romain Camilleri, Andrew Wagenmaker, Jamie Morgenstern, Lalit Jain, and Kevin Jamieson
NeurIPS, 2022

Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov Decision Processes
Andrew Wagenmaker, Yifang Chen, Max Simchowitz, Simon S. Du, and Kevin Jamieson
ICML, 2022

First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach
Andrew Wagenmaker, Yifang Chen, Max Simchowitz, Simon S. Du, and Kevin Jamieson
ICML, 2022 (Long Talk) [Talk]

Beyond No Regret: Instance-Dependent PAC Reinforcement Learning
Andrew Wagenmaker, Max Simchowitz, and Kevin Jamieson
COLT, 2022 [Talk]

Best Arm Identification with Safety Constraints
Zhenlin Wang, Andrew Wagenmaker, and Kevin Jamieson
AISTATS, 2022

Task-Optimal Exploration in Linear Dynamical Systems
Andrew Wagenmaker, Max Simchowitz, and Kevin Jamieson
ICML, 2021 (Long Talk)

Experimental Design for Regret Minimization in Linear Bandits
Andrew Wagenmaker*, Julian Katz-Samuels*, and Kevin Jamieson
AISTATS, 2021

Active Learning for Identification of Linear Dynamical Systems
Andrew Wagenmaker and Kevin Jamieson
COLT, 2020 [Talk]

Robust Photometric Stereo via Dictionary Learning
Andrew Wagenmaker, Brian Moore, and Raj Rao Nadakuditi
IEEE Transactions on Computational Imaging, 2018

Robust Photometric Stereo Using Learned Image and Gradient Dictionaries
Andrew Wagenmaker, Brian Moore, and Raj Rao Nadakuditi
ICIP, 2017

Robust Surface Reconstruction from Gradients via Adaptive Dictionary Regularization
Andrew Wagenmaker, Brian Moore, and Raj Rao Nadakuditi
ICIP, 2017

A Bisimulation-Like Algorithm for Abstracting Control Systems
Andrew Wagenmaker and Necmiye Ozay
Allerton, 2016

* equal contribution, ^† equal advising, ^α alphabetical ordering, ^c core contributor