Yi Zhao

I am a PhD candidate in the Robot Learning Group at Aalto University, advised by Prof. Joni Pajarinen and Juho Kannala. My research focuses on reinforcement learning, imitation learning, and planning, with an emphasis on learning versatile robot skills in a sample-efficient manner. I have developed algorithms that have been applied to various domains, including locomotion, robotic manipulation, and dexterous robotic hands. I hold an MSc in Robotics from Aalto University, Finland, and a BEng from Huazhong University of Science and Technology, China. In 2024, I was a visiting researcher at the Max Planck Institute for Intelligent Systems, where I collaborated with Prof. Dieter Büchler and Bernhard Schölkopf. There, we developed the first agent capable of autonomously learning to play Flight of the Bumblebee with bimanual dexterous robotic hands, without human demonstrations.


Experience
  • Aalto University, Finland
    Aalto University, Finland
    Doctoral Candidate
    Feb. 2021 - now
  • Max Planck Institute for Intelligent Systems, Germany
    Max Planck Institute for Intelligent Systems, Germany
    Research Visit
    Feb. 2024 - Dec. 2024
  • Aalto University, Finland
    Aalto University, Finland
    Master of Science
    Oct. 2020
  • Huazhong University of Science and Technology, China
    Huazhong University of Science and Technology, China
    Bachelor of Engineering
    June 2017
Publications (view all )
Efficient Reinforcement Learning by Guiding Generalist World Models with Non-Curated Data
Efficient Reinforcement Learning by Guiding Generalist World Models with Non-Curated Data

Yi Zhao, Aidan Scannell, Wenshuai Zhao, Yuxin Hou, Tianyu Cui, Le Chen, Dieter Büchler, Arno Solin, Juho Kannala, Joni Pajarinen

Preprint 2025

Efficient Reinforcement Learning by Guiding Generalist World Models with Non-Curated Data
Efficient Reinforcement Learning by Guiding Generalist World Models with Non-Curated Data

Yi Zhao, Aidan Scannell, Wenshuai Zhao, Yuxin Hou, Tianyu Cui, Le Chen, Dieter Büchler, Arno Solin, Juho Kannala, Joni Pajarinen

Preprint 2025

Symbolically-Guided Visual Plan Inference from Uncurated Video Data
Symbolically-Guided Visual Plan Inference from Uncurated Video Data

Wenyan Yang, Ahmet Tikna, Yi Zhao, Yuying Zhang, Luigi Palopoli, Marco Roveri, Joni Pajarinen

Preprint 2025

Symbolically-Guided Visual Plan Inference from Uncurated Video Data
Symbolically-Guided Visual Plan Inference from Uncurated Video Data

Wenyan Yang, Ahmet Tikna, Yi Zhao, Yuying Zhang, Luigi Palopoli, Marco Roveri, Joni Pajarinen

Preprint 2025

Discrete Codebook World Models for Continuous Control
Discrete Codebook World Models for Continuous Control

Aidan Scannell, Mohammadreza Nakhaeinezhadfard, Kalle Kujanpää, Yi Zhao, Kevin Sebastian Luck, Arno Solin, Joni Pajarinen

International Conference on Learning Representations (ICLR) 2025

Discrete Codebook World Models for Continuous Control
Discrete Codebook World Models for Continuous Control

Aidan Scannell, Mohammadreza Nakhaeinezhadfard, Kalle Kujanpää, Yi Zhao, Kevin Sebastian Luck, Arno Solin, Joni Pajarinen

International Conference on Learning Representations (ICLR) 2025

RP1M: A Large-Scale Motion Dataset for Piano Playing with Bi-Manual Dexterous Robot Hands
RP1M: A Large-Scale Motion Dataset for Piano Playing with Bi-Manual Dexterous Robot Hands

Yi Zhao*, Le Chen*, Jan Schneider, Quankai Gao, Juho Kannala, Bernhard Schölkopf, Joni Pajarinen, Dieter Büchler (* equal contribution)

Conference on Robot Learning (CoRL) 2024

RP1M: A Large-Scale Motion Dataset for Piano Playing with Bi-Manual Dexterous Robot Hands
RP1M: A Large-Scale Motion Dataset for Piano Playing with Bi-Manual Dexterous Robot Hands

Yi Zhao*, Le Chen*, Jan Schneider, Quankai Gao, Juho Kannala, Bernhard Schölkopf, Joni Pajarinen, Dieter Büchler (* equal contribution)

Conference on Robot Learning (CoRL) 2024

Bi-Level Motion Imitation for Humanoid Robots
Bi-Level Motion Imitation for Humanoid Robots

Wenshuai Zhao, Yi Zhao, Joni Pajarinen, Michael Muehlebach

Conference on Robot Learning (CoRL) 2024

Bi-Level Motion Imitation for Humanoid Robots
Bi-Level Motion Imitation for Humanoid Robots

Wenshuai Zhao, Yi Zhao, Joni Pajarinen, Michael Muehlebach

Conference on Robot Learning (CoRL) 2024

iQRL--Implicitly Quantized Representations for Sample-efficient Reinforcement Learning
iQRL--Implicitly Quantized Representations for Sample-efficient Reinforcement Learning

Aidan Scannell, Kalle Kujanpää, Yi Zhao, Mohammadreza Nakhaei, Arno Solin, Joni Pajarinen

International Conference on Machine Learning, Workshop (ICML Workshop) 2024

iQRL--Implicitly Quantized Representations for Sample-efficient Reinforcement Learning
iQRL--Implicitly Quantized Representations for Sample-efficient Reinforcement Learning

Aidan Scannell, Kalle Kujanpää, Yi Zhao, Mohammadreza Nakhaei, Arno Solin, Joni Pajarinen

International Conference on Machine Learning, Workshop (ICML Workshop) 2024

Optimistic Multi-Agent Policy Gradient
Optimistic Multi-Agent Policy Gradient

Wenshuai Zhao, Yi Zhao, Zhiyuan Li, Juho Kannala, Joni Pajarinen

International Conference on Machine Learning (ICML) 2024

Optimistic Multi-Agent Policy Gradient
Optimistic Multi-Agent Policy Gradient

Wenshuai Zhao, Yi Zhao, Zhiyuan Li, Juho Kannala, Joni Pajarinen

International Conference on Machine Learning (ICML) 2024

Hscnet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer
Hscnet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer

Shuzhe Wang, Zakaria Laskar, Iaroslav Melekhov, Xiaotian Li, Yi Zhao, Giorgos Tolias, Juho Kannala

International Journal of Computer Vision (IJCV) 2024

Hscnet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer
Hscnet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer

Shuzhe Wang, Zakaria Laskar, Iaroslav Melekhov, Xiaotian Li, Yi Zhao, Giorgos Tolias, Juho Kannala

International Journal of Computer Vision (IJCV) 2024

Continuous Monte Carlo Graph Search
Continuous Monte Carlo Graph Search

Kalle Kujanpää*, Amin Babadi*, Yi Zhao, Juho Kannala, Alexander Ilin, Joni Pajarinen (* equal contribution)

International Conference on Autonomous Agents and Multiagent Systems (AAMAS) 2023

Continuous Monte Carlo Graph Search
Continuous Monte Carlo Graph Search

Kalle Kujanpää*, Amin Babadi*, Yi Zhao, Juho Kannala, Alexander Ilin, Joni Pajarinen (* equal contribution)

International Conference on Autonomous Agents and Multiagent Systems (AAMAS) 2023

Simplified Temporal Consistency Reinforcement Learning
Simplified Temporal Consistency Reinforcement Learning

Yi Zhao, Wenshuai Zhao, Rinu Boney, Juho Kannala, Joni Pajarinen

International Conference on Machine Learning (ICML) 2023

Simplified Temporal Consistency Reinforcement Learning
Simplified Temporal Consistency Reinforcement Learning

Yi Zhao, Wenshuai Zhao, Rinu Boney, Juho Kannala, Joni Pajarinen

International Conference on Machine Learning (ICML) 2023

All publications