I am currently a Ph.D. candidate in the School of Computer Science at Peking University, advised by Prof. Zongqing Lu.
I received my M.Sc. degree from the Department of Computer Science and Technology at Tsinghua University in June 2022, advised by Prof. Xi Xiao. I received my B.Sc. degree from the School of Mathematical Sciences at Nankai University in June 2019. I also worked as a research intern at Tencent AI Lab in 2021, advised by Dijun Luo.
My research interests include Reinforcement Learning and Language Modeling. Feel free to contact me if you are interested in discussing or collaborating.
[Email / Google Scholar / DBLP / GitHub]
Publications
- AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback. (NAACL’24)
- Entity Divider with Language Grounding in Multi-Agent Reinforcement Learning. (ICML’23)
- Model-Based Opponent Modeling. (NeurIPS’22)
- iGrow: A Smart Agriculture Solution to Autonomous Greenhouse Control. (AAAI’22)
- Efficient and Stable Information Directed Exploration for Continuous Reinforcement Learning. (ICASSP’22)
- Robust Model-based Reinforcement Learning for Autonomous Greenhouse Control. (ACML’21)
- Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation. (ICRA’21)
- A Simulator-based Planning Framework for Optimizing Autonomous Greenhouse Control Strategy. (ICAPS’21)
- Self-Paced Probabilistic Principal Component Analysis for Data with Outliers. (ICASSP’20)
Preprints
- Tackling Non-Stationarity in Reinforcement Learning via Causal-Origin Representation. (arXiv’23.06)
- MBDP: A Model-based Approach to Achieve both Robustness and Sample Efficiency via Double Dropout Planning. (arXiv’21.08)
Patents
- Method, device and equipment for determining parameters and storage medium. (CN112527104A)
Academic Service
- Reviewer
  - ICML 2022, 2023, 2024
  - NeurIPS 2022, 2023
  - ICLR 2024