Jinhao Jiang (蒋锦昊)
Gaoling School of Artificial Intelligence, Renmin University of China (RUC)
Address: No. 59 Zhongguancun Street, Haidian District, Beijing 100872, P.R. China
Email: jiangjinhao [at] ruc.edu.cn

About Me [GitHub] [Google Scholar]

I am a fourth-year Ph.D. student (expected to graduate in June 2026) at GSAI, Renmin University of China, supervised by Prof. Xin Zhao. Before that, I obtained a bachelor's degree from the University of Electronic Science and Technology of China in July 2021. I have broad interests in natural language processing, large language models (LLMs), and agents.

Research Interests

My research focuses on LLMs and agents, with an emphasis on the fundamental capabilities of LLMs (world knowledge and complex reasoning) and on agent applications, specifically:


  • Enhancing internal reasoning capabilities: Through continual pre-training (CPT), supervised fine-tuning (SFT), and reinforcement learning (RL), expand the knowledge boundaries of LLMs and enhance their inherent general reasoning abilities (such as encyclopedic knowledge, mathematics, and code).

  • Enhancing the ability to call external tools: Improve the ability of LLMs to call external tools (such as code, calculators, and search engines).

  • Agent applications in vertical fields: Enhance the application of LLM-based agents in vertical scenarios, including complex structured data (knowledge graphs, databases, Excel spreadsheets, and tables) and general retrieval scenarios (AI Searcher, Deep Research).

I am currently seeking job opportunities in both academia and industry. I expect to graduate in July 2026. If you are interested, please do not hesitate to contact me by email.

News


Experience


Selected Publications

(* indicates equal contribution, † indicates corresponding author)


Here I list the work on which I am a (co-)first author. For the complete list of my publications, please visit my Google Scholar profile.


  • Enhancing LLM Reasoning with Reward-guided Tree Search

    Jinhao Jiang*, Zhipeng Chen*, Yingqian Min*, Jie Chen, Xiaoxue Cheng, Jiapeng Wang, Yiru Tang, Haoxiang Sun, Jia Deng, Wayne Xin Zhao†, Zheng Liu, Dong Yan, Jian Xie, Zhongyuan Wang, Ji-Rong Wen

    Technical Report, 2024


Awards

  • 2021 Outstanding Graduate of Sichuan Province (award rate 3.7%), Education Department of Sichuan Province.
  • 2020 China National Scholarship (top 1.5%), Ministry of Education of the People's Republic of China.
  • 2019 China National Scholarship (top 1.5%), Ministry of Education of the People's Republic of China.

Professional Service

学术服务

  • Journal: TALLIP, Computational Intelligence, Information Retrieval Journal
  • Conference: ICLR, NeurIPS, ACL, EMNLP