
I am a Ph.D. candidate in the School of Computer Science at Peking University, advised by Prof. Zongqing Lu. My research interests include Multimodal LLMs, Reinforcement Learning, and Embodied Agents. Feel free to contact me if you are interested in discussing ideas or collaborating. For more details, please refer to my CV or CV (Chinese).
Education
- Peking University. School of Computer Science.
  - Ph.D. Candidate. (Sep. 2022 — Present)
  - Supervisor: Prof. Zongqing Lu.
- Tsinghua University. Department of Computer Science and Technology.
  - Master of Science Degree. (Sep. 2019 — Jun. 2022)
  - Supervisor: Prof. Xi Xiao.
- Nankai University. School of Mathematical Sciences.
  - Bachelor of Science Degree. (Sep. 2015 — Jun. 2019)
  - Advisor: Prof. Jishou Ruan.
Experience
- BeingBeyond
  - Research Intern. (Mar. 2025 — Present)
  - Multimodal LLMs / Embodied Agents
- Beijing Academy of Artificial Intelligence (BAAI)
  - Research Intern. (May 2024 — Mar. 2025)
  - Multimodal LLMs / Embodied Agents
- Tencent AI Lab
  - Research Intern. (Jun. 2020 — Jul. 2021)
  - Reinforcement Learning / AI for Science
Selected Publications
(For the full list of publications, please see my Google Scholar profile.)
1. MLLM
   - (ICCV’25) Unified Multimodal Understanding via Byte-Pair Visual Encoding.
   - (ICCV’25) VideoOrion: Tokenizing Object Dynamics in Videos.
   - (ICLR’25) From Pixels to Tokens: Byte-Pair Encoding on Quantized Visual Modalities.
2. RL & Agent
   - (NAACL’25) LLM-Based Explicit Models of Opponents for Multi-Agent Games.
   - (ICML’24) Tackling Non-Stationarity in Reinforcement Learning via Causal-Origin Representation.
   - (NAACL’24) AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback.
   - (ICML’23) Entity Divider with Language Grounding in Multi-Agent Reinforcement Learning.
   - (NeurIPS’22) Model-Based Opponent Modeling.
Patents
- Multimodal data processing method, device, storage medium, and electronic equipment. (CN119226992A)
- Method, device and equipment for determining parameters and storage medium. (CN112527104A)
Awards
- Award for Scientific Research of Peking University. (Dec. 2024)
- Presidential Scholarship of Peking University. (Nov. 2024)
- Rhino-bird Elite Training Program of Tencent AI Lab. (Jul. 2021)
- Mathematical Contest in Modeling (MCM/ICM), Meritorious Winner (First Prize). (Apr. 2017)
- China Undergraduate Mathematical Contest in Modeling (CUMCM), Second Prize. (Jan. 2016)
Service
- Conference Reviewer
  - ICML / NeurIPS / ICLR / ICCV / AAAI / ICRA / AISTATS
- Journal Reviewer
  - TNNLS / TIST
- Teaching Assistant
  - Deep Reinforcement Learning, Peking University. (Spring, 2025)