Publications

An up-to-date list is available on Google Scholar.

2025

  1. ICLR
    ORSO: Accelerating Reward Design via Online Reward Selection and Policy Optimization
    Chen Bo Calvin Zhang, Zhang-Wei Hong, Aldo Pacchiano, and Pulkit Agrawal
    In The Thirteenth International Conference on Learning Representations, 2025
  2. arXiv
    Humanity’s Last Exam
    Long Phan, Alice Gatti, Ziwen Han, Nathaniel Li, Josephina Hu, Hugh Zhang, Chen Bo Calvin Zhang, Mohamed Shaaban, John Ling, and 2 more authors
    arXiv preprint arXiv:2501.14249, 2025
  3. Preprint
    SHADE-Arena: Evaluating Sabotage and Monitoring in LLM Agents
    Jonathan Kutasov, Yuqi Sun, Paul Colognese, Teun Weij, Linda Petrini, Chen Bo Calvin Zhang, John Hughes, Xiang Deng, Henry Sleight, and 3 more authors
    2025

2023

  1. ICML
    HIP-RL: Hallucinated Inputs for Preference-based Reinforcement Learning in Continuous Domains
    Chen Bo Calvin Zhang and Giorgia Ramponi
    In ICML 2023 Workshop: The Many Facets of Preference-Based Learning, 2023
  2. arXiv
    Zero-Shot Transfer in Imitation Learning
    Alvaro Cauderan, Gauthier Boeshertz, Florian Schwarb, and Calvin Zhang
    arXiv preprint arXiv:2310.06710, 2023