I am currently a fourth-year PhD student in the College of Computer Science and Technology at Zhejiang University, supervised by Prof. Fei Wu in the DCD (Digital media Computing & Design) Lab. During my PhD, I have also been fortunate to collaborate remotely with Prof. Li Shen from Sun Yat-sen University and Zhengyu Chen from Meituan.
My research interests include:
- Agentic RL for complex interaction scenarios (e.g. Tool Use, DeepResearch, Code Agent);
- LLM Reasoning, especially for real-world open-domain unverifiable scenarios;
- Trustworthiness, including model generalization, safety, and hallucination.
★★★ Feel free to reach out to me for academic discussions and collaborations!
★★★ I am currently seeking job opportunities and will graduate in June 2027. If you have any suitable positions, please feel free to reach out.
📝 Publications
Please see my Google Scholar profile for my publications.
💻 Internships
- 2025.08 - 2026.03: Meituan, LongCat Team
Responsible for Agentic Tool Use: (1) Cold Start and RL Strategy; (2) Data Synthesis and Data Selection
Core Contributor of LongCat-Flash-Thinking-2601 Technical Report
- 2024.06 - 2025.06: AntGroup, Ling Team
Responsible for LLM Post-Training: (1) Multi-Objective Policy Optimization; (2) Data Mixture Optimization
📖 Education
- 2022.09 - 2027.06 (Expected), PhD candidate, Zhejiang University
- 2018.09 - 2022.06, Undergraduate, Nanjing University of Aeronautics and Astronautics