I start studying at the University of Toronto! I am supervised by Prof. Animesh Garg.
footoredo [at] gmail [dot] com
Google Scholar
Curriculum Vitae
Github
Twitter
GPG public key
Reinforcement Learning / Robotics
Zihan Zhou, Animesh Garg, Dieter Fox, Caelan Garrett, Ajay Mandlekar. “SPIRE: Synergistic Planning, Imitation, and Reinforcement Learning for Long-Horizon Manipulation”. (* equal contribution).
🤜 Accepted by CoRL 2024. [arXiv] [OpenReview] [Website]
Marta Skreta*, Zihan Zhou*, Jia Lin Yuan*, Kourosh Darvish, Alán Aspuru-Guzik, Animesh Garg. “RePLan: Robotic Replanning with Perception and Language Models”. (* equal contribution).
🤜 Preprint. [arXiv] [Website]
Zihan Zhou, Animesh Garg. “Learning Achievement Structure for Structured Exploration in Domains with Sparse Reward”.
🤜 Accepted by ICLR 2023. [arXiv] [OpenReview] [Code]
Zihan Zhou*, Wei Fu*, Bingliang Zhang, Yi Wu. “Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization”. (* equal contribution).
🤜 Accepted by ICLR 2022. [arXiv] [OpenReview] [Website]
Weizhe Chen*, Zihan Zhou*, Yi Wu, Fei Fang. “Temporal Induced Self-Play for Stochastic Bayesian Games”. (* equal contributions).
🤜 Accepted by IJCAI 2021. [Paper]
Qian Long*, Zihan Zhou*, Abhinav Gupta, Fei Fang, Yi Wu†, Xiaolong Wang†. “Evolutionary Population Curriculum for Scaling Multi-Agent Reinforcement Learning”. (* equal contribution, † equal advising).
🤜 Accepted by ICLR 2020. [arXiv] [OpenReview] [Website]
Zihan Zhou, Zheyuan Ryan Shi, Fei Fang, Yi Wu. “Approximated Temporal-Induced Neural Self-Play for Finitely Repeated Bayesian Games”.
🤜 Accepted by AAAI 2020 Workshop on Reinforcement Learning in Games (oral presentation). [Paper]
Huichu Zhang, Siyuan Feng, Chang Liu, Yaoyao Ding, Yichen Zhu, Zihan Zhou, Weinan Zhang, Yong Yu, Haiming Jin, Zhenhui Li. “CityFlow: A Multi-Agent Reinforcement Learning Environment for Large Scale City Traffic Scenario”.
🤜 Accepted by WWW 2019 Demonstration. [Website] [arXiv]
Xuehui Sun, Zihan Zhou, Yuda Fan. “Image Based Review Text Generation with Emotional Guidance”.
🤜 Preprint. [arXiv]