Yi Zhao

I am a PhD candidate at the Robot Learning Group at Aalto University, advised by Joni Pajarinen and Juho Kannala. My research interests include reinforcement learning and robot learning. I received my MSc degree from Aalto University in Finland and my BEng degree from Huazhong University of Science and Technology in China. From February to December 2024, I was a visiting researcher at the Max Planck Institute for Intelligent Systems, working with Dieter Büchler and Bernhard Schölkopf.


Experience
  • Aalto University, Finland
    Aalto University, Finland
    Doctoral Candidate
    Feb. 2021 - now
  • Max Planck Institute for Intelligent Systems, Germany
    Max Planck Institute for Intelligent Systems, Germany
    Research Visit
    Feb. 2024 - Dec. 2024
  • Aalto University, Finland
    Aalto University, Finland
    Master of Science
    2020
  • Huazhong University of Science and Technology, China
    Huazhong University of Science and Technology, China
    Bachelor of Engineering
    2017
Publications (view all )
Generalist World Model Pre-Training for Efficient Reinforcement Learning
Generalist World Model Pre-Training for Efficient Reinforcement Learning

Yi Zhao, Aidan Scannell, Yuxin Hou, Tianyu Cui, Le Chen, Dieter Büchler, Arno Solin, Juho Kannala, Joni Pajarinen

Preprint to appear soon 2025

Generalist World Model Pre-Training for Efficient Reinforcement Learning
Generalist World Model Pre-Training for Efficient Reinforcement Learning

Yi Zhao, Aidan Scannell, Yuxin Hou, Tianyu Cui, Le Chen, Dieter Büchler, Arno Solin, Juho Kannala, Joni Pajarinen

Preprint to appear soon 2025

Discrete Codebook World Models for Continuous Control
Discrete Codebook World Models for Continuous Control

Aidan Scannell, Mohammadreza Nakhaeinezhadfard, Kalle Kujanpää, Yi Zhao, Kevin Sebastian Luck, Arno Solin, Joni Pajarinen

International Conference on Learning Representations (ICLR) 2025

Discrete Codebook World Models for Continuous Control
Discrete Codebook World Models for Continuous Control

Aidan Scannell, Mohammadreza Nakhaeinezhadfard, Kalle Kujanpää, Yi Zhao, Kevin Sebastian Luck, Arno Solin, Joni Pajarinen

International Conference on Learning Representations (ICLR) 2025

RP1M: A Large-Scale Motion Dataset for Piano Playing with Bi-Manual Dexterous Robot Hands
RP1M: A Large-Scale Motion Dataset for Piano Playing with Bi-Manual Dexterous Robot Hands

Yi Zhao*, Le Chen*, Jan Schneider, Quankai Gao, Juho Kannala, Bernhard Schölkopf, Joni Pajarinen, Dieter Büchler (* equal contribution)

Conference on Robot Learning (CoRL) 2024

RP1M: A Large-Scale Motion Dataset for Piano Playing with Bi-Manual Dexterous Robot Hands
RP1M: A Large-Scale Motion Dataset for Piano Playing with Bi-Manual Dexterous Robot Hands

Yi Zhao*, Le Chen*, Jan Schneider, Quankai Gao, Juho Kannala, Bernhard Schölkopf, Joni Pajarinen, Dieter Büchler (* equal contribution)

Conference on Robot Learning (CoRL) 2024

Bi-Level Motion Imitation for Humanoid Robots
Bi-Level Motion Imitation for Humanoid Robots

Wenshuai Zhao, Yi Zhao, Joni Pajarinen, Michael Muehlebach

Conference on Robot Learning (CoRL) 2024

Bi-Level Motion Imitation for Humanoid Robots
Bi-Level Motion Imitation for Humanoid Robots

Wenshuai Zhao, Yi Zhao, Joni Pajarinen, Michael Muehlebach

Conference on Robot Learning (CoRL) 2024

iQRL--Implicitly Quantized Representations for Sample-efficient Reinforcement Learning
iQRL--Implicitly Quantized Representations for Sample-efficient Reinforcement Learning

Aidan Scannell, Kalle Kujanpää, Yi Zhao, Mohammadreza Nakhaei, Arno Solin, Joni Pajarinen

International Conference on Machine Learning, Workshop (ICML Workshop) 2024

iQRL--Implicitly Quantized Representations for Sample-efficient Reinforcement Learning
iQRL--Implicitly Quantized Representations for Sample-efficient Reinforcement Learning

Aidan Scannell, Kalle Kujanpää, Yi Zhao, Mohammadreza Nakhaei, Arno Solin, Joni Pajarinen

International Conference on Machine Learning, Workshop (ICML Workshop) 2024

Optimistic Multi-Agent Policy Gradient
Optimistic Multi-Agent Policy Gradient

Wenshuai Zhao, Yi Zhao, Zhiyuan Li, Juho Kannala, Joni Pajarinen

International Conference on Machine Learning (ICML) 2024

Optimistic Multi-Agent Policy Gradient
Optimistic Multi-Agent Policy Gradient

Wenshuai Zhao, Yi Zhao, Zhiyuan Li, Juho Kannala, Joni Pajarinen

International Conference on Machine Learning (ICML) 2024

Hscnet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer
Hscnet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer

Shuzhe Wang, Zakaria Laskar, Iaroslav Melekhov, Xiaotian Li, Yi Zhao, Giorgos Tolias, Juho Kannala

International Journal of Computer Vision (IJCV) 2024

Hscnet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer
Hscnet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer

Shuzhe Wang, Zakaria Laskar, Iaroslav Melekhov, Xiaotian Li, Yi Zhao, Giorgos Tolias, Juho Kannala

International Journal of Computer Vision (IJCV) 2024

Continuous Monte Carlo Graph Search
Continuous Monte Carlo Graph Search

Kalle Kujanpää*, Amin Babadi*, Yi Zhao, Juho Kannala, Alexander Ilin, Joni Pajarinen (* equal contribution)

International Conference on Autonomous Agents and Multiagent Systems (AAMAS) 2023

Continuous Monte Carlo Graph Search
Continuous Monte Carlo Graph Search

Kalle Kujanpää*, Amin Babadi*, Yi Zhao, Juho Kannala, Alexander Ilin, Joni Pajarinen (* equal contribution)

International Conference on Autonomous Agents and Multiagent Systems (AAMAS) 2023

Simplified Temporal Consistency Reinforcement Learning
Simplified Temporal Consistency Reinforcement Learning

Yi Zhao, Wenshuai Zhao, Rinu Boney, Juho Kannala, Joni Pajarinen

International Conference on Machine Learning (ICML) 2023

Simplified Temporal Consistency Reinforcement Learning
Simplified Temporal Consistency Reinforcement Learning

Yi Zhao, Wenshuai Zhao, Rinu Boney, Juho Kannala, Joni Pajarinen

International Conference on Machine Learning (ICML) 2023

Adaptive Behavior Cloning Regularization for Stable Offline-to-Online Reinforcement Learning
Adaptive Behavior Cloning Regularization for Stable Offline-to-Online Reinforcement Learning

Yi Zhao*, Rinu Boney*, Alexander Ilin, Juho Kannala, Joni Pajarinen (* equal contribution)

European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning 2022

Adaptive Behavior Cloning Regularization for Stable Offline-to-Online Reinforcement Learning
Adaptive Behavior Cloning Regularization for Stable Offline-to-Online Reinforcement Learning

Yi Zhao*, Rinu Boney*, Alexander Ilin, Juho Kannala, Joni Pajarinen (* equal contribution)

European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning 2022

All publications