Currently, I am a Ph.D. candidate in the School of Computer Science at Peking University , advised by Prof. Zongqing Lu. My research interests include Reinforcement Learning, Generative Modeling, Multimodal LLMs. Feel free to contact me if you are interested in discussing or collaborating.
[Email / Google Scholar / DBLP / Github / CV]
Education
- School of Computer Science, Peking University.
- Ph.D. Candidate. Advised by Prof. Zongqing Lu.
- 2022 — Now
- Department of Computer Science and Technology, Tsinghua University.
- M.Sc. Degree. Advised by Prof. Xi Xiao.
- 2019 — 2022
- School of Mathematical Sciences, Nankai University.
- B.Sc. Degree.
- 2015 — 2019
Publication
- Tackling Non-Stationarity in Reinforcement Learning via Causal-Origin Representation. (ICML’24)
- AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback. (NAACL’24)
- Entity Divider with Language Grounding in Multi-Agent Reinforcement Learning. (ICML’23)
- Model-Based Opponent Modeling. (NeurIPS’22)
- iGrow: A Smart Agriculture Solution to Autonomous Greenhouse Control. (AAAI’22)
- Efficient and Stable Information Directed Exploration for Continuous Reinforcement Learning. (ICASSP’22)
- Robust Model-based Reinforcement Learning for Autonomous Greenhouse Control. (ACML’21)
- Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation. (ICRA’21)
- A Simulator-based Planning Framework for Optimizing Autonomous Greenhouse Control Strategy. (ICAPS’21)
- Self-Paced Probabilistic Principal Component Analysis for Data with Outliers. (ICASSP’20)
Preprint
- MBDP: A Model-based Approach to Achieve both Robustness and Sample Efficiency via Double Dropout Planning. (arXiv’21.08)
Patent
- Method, device and equipment for determining parameters and storage medium. (CN112527104A)
Experience
- Beijing Academy of Artificial Intelligence (BAAI)
- Research Intern
- 2024.05 — Now
- Tencent AI Lab
- Research Intern
- 2020.06 — 2021.07
- Availink
- Research Intern
- 2018.08 — 2018.10
Academic Service
- Conference Reviewer
- ICML 2022, 2023, 2024
- NeurIPS 2022, 2023, 2024
- ICLR 2024