|
Zechu (Steven) Li
I am a PhD student at PEARL Lab advised by Prof. Georgia Chalvatzaki from Oct 2024.
My research interest lies in reinforcement learning, especially its applications (e.g., robotics, finance, and transportation) and high-performance and scalable systems.
Prior to this, I was a visiting researcher at MIT CSAIL, advised by Prof. Pulkit Agrawal,
where I conducted research on massively parallel simulation and sim-to-real in robotics.
I received my bachelor's degree from Columbia University in May 2022, majoring in computer science.
During my undergraduate studies, I was fortunate to work with Prof. Xiaodong Wang, Prof. Anwar Walid and Prof.
Sharon (Xuan) Di.
|
|
|
Selected Publications [* Equal contribution]
|
|
Scalable Multi-Task Data Generation via Reinforcement Learning for Language-Conditioned Bimanual Dexterous Manipulation
Zechu Li,
Yufeng Jin,
Puze Liu,
Jan Peters,
Georgia Chalvatzaki
IROS, 2026
paper /
website
An RL-based data-generation pipeline for language-conditioned bimanual arm-hand manipulation.
|
|
Reconciling Reality through Simulation: A Real-to-Sim-to-Real Approach for Robust Manipulation
Marcel Torne,
Anthony Simeonov,
Zechu Li,
April Chan,
Tao Chen,
Abhishek Gupta*,
Pulkit Agrawal*
Robotics: Science and Systems (RSS), 2024
paper /
website /
code
A system for robustifying real-world imitation learning policies via reinforcement learning in "digital twin" simulation environments constructed on the fly from small amounts of real-world data.
|
|
Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation
Zechu Li*,
Tao Chen*,
Zhang-Wei Hong,
Anurag Ajay,
Pulkit Agrawal
ICML, 2023
paper /
code
A novel parallel Q-learning framework that scales off-policy learning to 10000+ parallel environments.
|
|
Homomorphic Matrix Completion
Xiao-Yang Liu*,
Zechu Li*,
Xiaodong Wang
NeurIPS, 2022
paper
A homomorphic matrix completion algorithm that satisfies the differential privacy property and reduces the best-known error bound to EXACT recovery at a price of more samples.
|
|
Nautilus: From One Prompt to Plug-and-Play Robot Learning
Yufeng Jin*,
Jianfei Guo*,
Xiaogang Jia,
Yu Deng,
Zechu Li,
Han Liu,
Weiran Liao,
Vignesh Prasad,
Mathias Franzius,
Gerhard Neumann,
Georgia Chalvatzaki
arXiv, 2026
paper /
website
An open-source agentic harness that turns a single natural-language prompt into ready-to-use reproduction, evaluation, fine-tuning, and deployment workflows for robot learning research.
|
|
SE(3)-PoseFlow: Estimating 6D Pose Distributions for Uncertainty-Aware Robotic Manipulation
Yufeng Jin,
Niklas Funk,
Vignesh Prasad,
Zechu Li,
Mathias Franzius,
Jan Peters,
Georgia Chalvatzaki
ICRA, 2026
paper /
website
A probabilistic framework that leverages flow matching on the SE(3) manifold to estimate full 6D object pose distributions, enabling uncertainty-aware robotic manipulation under partial observability, occlusions, and symmetries.
|
|
Social Learning for Sequential Driving Dilemmas
Xu Chen,
Sharon (Xuan) Di,
Zechu Li
Games, 2023
paper
Identified whether social dilemmas exist in AVs' sequential decision making to help policymakers and AV manufacturers better understand under what circumstances SDDs arise and how to design rewards.
|
|
Stationary Deep Reinforcement Learning with Quantum K-spin Hamiltonian Equation
Xiao-Yang Liu*,
Zechu Li*,
Shixun Wu,
Xiaodong Wang
Workshop on Physics for Machine Learning, ICLR, 2023
paper
Proposed a K-spin Hamiltonian regularization term (called H-term) to help a policy network converge to a high-quality local minima from a quantum perspective.
|
|
Social Learning In Markov Games: Empowering Autonomous Driving
Xu Chen,
Zechu Li,
Sharon (Xuan) Di
IEEE Intelligent Vehicles Symposium (IV), 2022
paper /
code
Applied the social learning scheme to Markov games and leverage RL to investigate how individual AVs learn policies and form social norms in traffic scenarios.
|
|
FinRL-Podracer: High Performance and Scalable Deep Reinforcement Learning for Quantitative Finance
Zechu Li,
Xiao-Yang Liu,
Jiahao Zheng,
Zhaoran Wang,
Anwar Walid,
Jian Guo
ACM International Conference on AI in Finance (ICAIF), 2021
paper /
code
A framework to accelerate the development pipeline of RL-driven trading strategy and show the high scalability by training a trading agent in 10 minutes with 80 A100 GPUs, on NASDAQ-100 constituent stocks with minute-level data over 10 years.
|
|
ElegantRL-Podracer: Scalable and Elastic Library for Cloud-native Deep Reinforcement Learning
Xiao-Yang Liu*,
Zechu Li*,
Zhuoran Yang,
Jiahao Zheng,
Zhaoran Wang,
Anwar Walid,
Jian Guo,
Michael Jordan
Deep Reinforcement Learning Workshop, NeurIPS, 2021
paper /
code
A scalable and elastic library ElegantRL-podracer for cloud-native deep reinforcement learning, which efficiently supports millions of GPU cores to carry out massively parallel training at multiple levels.
|
|
FinRL: Financial Reinforcement Learning
project page /
code /
GitHub Star
The first open-source framework to show the great potential of financial reinforcement learning.
|
|
ElegantRL “小雅”: Massively Parallel Library for Cloud-native Deep Reinforcement Learning
project page /
code /
GitHub Star
A massively parallel library for cloud-native deep reinforcement learning (DRL) applications.
As a leader of this project, I have been contributing to
- develop a series of large-scale training frameworks,
- implemente SOTA algorithms and techniques,
- build the documentation website.
Starting from Mar. 2021, I started to write tutorial blogs for the community,
- ElegantRL: Much More Stable Deep Reinforcement Learning Algorithms than Stable-Baseline3,
MLearning.ai, Mar. 3, 2022.
- ElegantRL-Podracer: A Scalable and Elastic Library for Cloud-Native Deep Reinforcement Learning,
Towards data science, Dec. 11, 2021.
- ElegantRL: Mastering PPO Algorithms,
Towards data science, May. 3, 2021.
- ElegantRL Demo: Stock Trading Using DDPG (Part II),
MLearning.ai, Apr. 19, 2021.
- ElegantRL Demo: Stock Trading Using DDPG (Part I),
MLearning.ai, Mar. 28, 2021.
- ElegantRL-Helloworld: A Lightweight and Stable Deep Reinforcement Learning Library,
Towards data science, Mar. 4, 2021.
|
|
High-performance Tensor Decompositions for Compressing and Accelerating Deep Neural Networks
Xiao-Yang Liu,
Yiming Fang,
Liuqing Yang,
Zechu Li,
Anwar Walid
Tensors for Data Processing, Elsevier, 2021
chapter /
book
This chapter takes a practical approach to seek a better efficiency-accuracy trade-off, which utilizes high performance tensor decompositions to compress and accelerate neural networks by exploiting low-rank structures of the network weight matrix.
|
|