Bhavya Sukhija

PhD at ETH Zürich.

webpage_pic.jpeg

OAT Y19, ETH Zürich

Hi!

I am pursuing a PhD in computer science at ETH Zürich. I am co-supervised by Prof. Andreas Krause and Prof. Stelian Coros. My research interests are Reinforcement learning (RL): model-based RL, nonepisodic RL, safe RL, meta RL, continuous time RL, exploration (multimodal) in RL, active learning and robotics.

Since December 2024, I have started an internship at Amazon Web Services in Berlin as a research scientist. Currently, I am working on leveraging RL for fine-tuning LLM agents to master real-world applications.

From July 2024 – December 2024 I was a research visitor at University of California, Berkeley at the Berkeley Robot Learning lab where I was supervised by Prof. Pieter Abbeel. During the visit, I worked on developing a class of simple, efficient, and scalable algorithms for exploration in RL, see MaxInfoRL.

Prior to my PhD, I completed a BSc in Mechanical Engineering and a masters in Robotics at ETH. I completed my master thesis at the RWTH Aachen university under the supervision of Prof. Dominik Baumann, Prof. Sebastian Trimpe, and Prof. Andreas Krause. I received the ETH medal for my thesis.

Besides research, I enjoy playing football and support Liverpool FC.

News

Dec 1, 2024 Started as a research scientist intern at at Amazon Web Services, working on leveraging RL for fine-tuning LLM agents to master real-world applications.
Jul 1, 2024 Started as a visiting research in University of California, Berkeley at the Berkeley Robot Learning lab under the supervision of Dr. Carlo Sferrazza and Prof. Pieter Abbeel.
Jun 30, 2024 Sim-FSVGD got accepted as oral presentation at IROS 2024.
Feb 29, 2024 My great master student (now PhD at LAS), Jonas Hübotter, is awarded the ETH medal for his master thesis.

Selected publications

  1. humanoid_walk_learning.gif
    MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization
    Bhavya Sukhija, Stelian Coros, Andreas Krause, and 2 more authors
    arXiv preprint arXiv:2412.12098, 2024
  2. actsafe.gif
    ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning
    Yarden As*, Bhavya Sukhija*, Lenart Treven, and 3 more authors
    arXiv preprint arXiv:2410.09486, 2024
  3. nonepisodic_rl.jpeg
    NeoRL: Efficient Exploration for Nonepisodic RL
    Bhavya Sukhija, Lenart Treven, Florian Dörfler, and 2 more authors
    Proc. Neural Information Processing Systems (NeurIPS), 2024
    Spotlight
  4. racecar_simfsvgd.gif
    Bridging the Sim-to-Real Gap with Bayesian Inference
    Jonas Rothfuss*, Bhavya Sukhija*, Lenart Treven*, and 3 more authors
    IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024
    Oral presentation
  5. opax_teaser.gif
    Optimistic Active Exploration of Dynamical Systems
    Bhavya Sukhija, Lenart Treven, Cansu Sancaktar, and 3 more authors
    In Proc. Neural Information Processing Systems (NeurIPS), 2023
  6. franka.gif
    GoSafeOpt: Scalable safe exploration for global optimization of dynamical systems
    Bhavya Sukhija, Matteo Turchetta, David Lindner, and 3 more authors
    Artificial Intelligence Journal (AIJ), 2023
  7. HAMBO.png
    Hallucinated Adversarial Control for Conservative Offline Policy Evaluation
    Jonas Rothfuss*, Bhavya Sukhija*, Tobias Birchler*, and 2 more authors
    In Conference on Uncertainty in Artificial Intelligence (UAI), 2023
  8. go1gosafeopt.gif
    Tuning Legged Locomotion Controllers via Safe Bayesian Optimization
    Daniel Widmer*, Dongho Kang*, Bhavya Sukhija, and 3 more authors
    In Conference on Robot Learning (CoRL), 2023
  9. spot_gradientbased.gif
    Gradient-Based Trajectory Optimization With Learned Dynamics
    Bhavya Sukhija, Nathanael Köhler, Miguel Zamora, and 4 more authors
    IEEE International Conference on Robotics and Automation (ICRA), 2023