CV

Education

Ph.D. Candidate at Aalto University (Feb. 2020 – Present)
- Research Topic: Reinforcement Learning with a Handful of Trials
- Co-supervised by Prof. Juho Kannala and Prof. Joni Pajarinen
MSc at Aalto University (Sep. 2018 – Oct. 2020)
- Grade: 4.7/5
- Major in Control, Robotics and Autonomous Systems
BEng at Huazhong University of Science and Technology

Learning to Drive (L2D) as a Low-Cost Benchmark for Real-World RL (Project Page)

We introduce Learning to Drive (L2D), a simple and easily reproducible reinforcement learning benchmark built around a real small-scale car.
Apply Dreamer, a model-based RL algorithm, to a small-scale racing car, which learns to drive from scratch from raw pixels in less than five minutes of interaction.

Hierarchical Scene Coordinate Classification and Regression for Visual Localization (Paper)

We propose a hierarchical network that predicts pixel scene coordinates in a coarse-to-fine manner to improve performance in large and ambiguous environments. Published at the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020.

Master's Thesis: Model-based Reinforcement Learning from Pixels (Thesis, Code, Talk)

Reproduce Dreamer, a model-based visual reinforcement learning algorithm, and evaluate the results in the MuJoCo simulator.
Extend Dreamer by combining policy learning and decision-time planning (CEM).
Partially reproduce MuZero, a model-based reinforcement learning algorithm that combines learning with Monte Carlo Tree Search to improve sample efficiency.
Extend the MuZero algorithm to continuous control by discretizing the action space.

Bachelor's Thesis: Hardware and Software Design of Drones

Research Assistant: Aalto Computer Vision Group

With colleagues, propose a hierarchical network that predicts pixel scene coordinates in a coarse-to-fine manner to improve performance in large and ambiguous environments.
Implement a simple structure-from-motion algorithm to construct sparse 3D models of indoor environments with repeated feature patterns.

Summer Intern: Aalto Computer Vision Group

Reproduce Model-free RL Algorithms

Reproduce two model-free reinforcement learning methods, PPO and SAC. (Code)

Bayesian Deep Learning

Implement and compare a set of Bayesian neural network methods, including MC Dropout, Concrete Dropout and Variational Dropout. (Slides, Code)

ViTa-bot: A Bio-mimetically Transformed Youbot

The ViTa-Whisker array is a biomimetic tactile sensor array inspired by rodent whiskers. It explores its immediate environment with whisking movements and generates a point-cloud model describing a 3D map of the surroundings. (Project Page)

Reinforcement Learning Course Project

Programming: Proficient in Python; past experience with C, C++, Java, MATLAB, and other languages.
Tools: Proficient in PyTorch, NumPy, OpenCV, Git, Bash, etc. Experience with TensorFlow, ROS, JAX, etc.