Publications
Preprint
A Survey of Large Language Models
Wayne Xin Zhao,
Kun Zhou†,
Junyi Li†,
Tianyi Tang, Xiaolei Wang, Yupeng Hou, Yingqian Min, Beichen Zhang, Junjie Zhang, Zican Dong, Yifan Du, Chen Yang, Yushuo Chen, Zhipeng Chen, Jinhao Jiang, Ruiyang Ren, Yifan Li, Xinyu Tang, Zikang Liu, Peiyu Liu,
Jian-Yun Nie,
Ji-Rong Wen
arXiv, 2023
pdf / code
Language is essentially a complex, intricate system of human expressions governed by grammatical rules. Developing capable AI algorithms for comprehending and mastering a language therefore poses a significant challenge. As a major approach, language modeling has been widely studied for language understanding and generation over the past two decades, evolving from statistical language models to neural language models. More recently, pre-trained language models (PLMs) have been proposed, which pre-train Transformer models over large-scale corpora and show strong capabilities in solving various NLP tasks. Since researchers have found that model scaling leads to performance improvements, they have further studied the scaling effect by increasing the model size even further. Interestingly, when the parameter scale exceeds a certain level, these enlarged language models not only achieve significant performance improvements but also exhibit special abilities that are not present in small-scale language models. To distinguish models by parameter scale, the research community has coined the term large language models (LLMs) for PLMs of significant size. Recently, research on LLMs has been advanced rapidly by both academia and industry, and a remarkable milestone is the launch of ChatGPT, which has attracted widespread attention from society. The technical evolution of LLMs has been making an important impact on the entire AI community and is likely to revolutionize the way we develop and use AI algorithms. In this survey, we review the recent advances in LLMs by introducing the background, key findings, and mainstream techniques. In particular, we focus on four major aspects of LLMs: pre-training, adaptation tuning, utilization, and capacity evaluation. We also summarize the available resources for developing LLMs and discuss remaining issues and future directions.
In this paper, we aim to improve the reasoning ability of large language models (LLMs) over knowledge graphs (KGs) to answer complex questions. Inspired by existing methods that design interaction strategies between LLMs and KGs, we propose an autonomous LLM-based agent framework, called KG-Agent, which enables a small LLM to actively make decisions until it finishes the reasoning process over the KG. In KG-Agent, we integrate the LLM, a multifunctional toolbox, a KG-based executor, and knowledge memory, and develop an iteration mechanism that autonomously selects a tool and then updates the memory for reasoning over the KG. To guarantee effectiveness, we leverage a program language to formulate the multi-hop reasoning process over the KG, and synthesize a code-based instruction dataset to fine-tune the base LLM. Extensive experiments demonstrate that tuning LLaMA-7B with only 10K samples can outperform state-of-the-art methods that use larger LLMs or more data, on both in-domain and out-of-domain datasets. Our code and data will be publicly released.
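To make the iteration mechanism concrete, the sketch below outlines a KG-Agent-style decision loop in Python. All class and method names (Action, decide, snapshot, etc.) are illustrative assumptions for exposition, not the paper's actual implementation.

```python
from dataclasses import dataclass

# Illustrative sketch of the select-tool / execute / update-memory loop
# described above; every name here is an assumption, not the paper's code.

@dataclass
class Action:
    name: str        # tool to invoke, or "finish"
    argument: str    # tool input, or the final answer

class KGAgentLoop:
    def __init__(self, llm, toolbox, executor, memory, max_steps=10):
        self.llm = llm            # small fine-tuned LLM that picks actions
        self.toolbox = toolbox    # multifunctional toolbox: name -> callable
        self.executor = executor  # KG-based executor running tool calls
        self.memory = memory      # knowledge memory of intermediate results
        self.max_steps = max_steps

    def run(self, question):
        for _ in range(self.max_steps):
            # The LLM autonomously selects the next action, conditioned on
            # the question and everything gathered so far.
            action = self.llm.decide(question, self.memory.snapshot())
            if action.name == "finish":
                return action.argument
            # Execute the selected tool against the KG, then update memory.
            result = self.executor.run(self.toolbox[action.name], action.argument)
            self.memory.update(action, result)
        return self.memory.best_answer()  # fallback when the budget runs out
```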
2023
In this paper, we study how to improve the zero-shot reasoning ability of large language models (LLMs) over structured data in a unified way. Inspired by studies on tool augmentation for LLMs, we develop an Iterative Reading-then-Reasoning (IRR) approach for solving question answering tasks based on structured data, called StructGPT. In our approach, we construct specialized functions to collect relevant evidence from structured data (i.e., reading), and let LLMs concentrate on the reasoning task based on the collected information (i.e., reasoning). Specifically, we propose an invoking-linearization-generation procedure to support LLMs in reasoning on structured data with the help of external interfaces. By iterating this procedure with the provided interfaces, our approach can gradually approach the target answer to a given query. Extensive experiments conducted on three types of structured data demonstrate the effectiveness of our approach, which can significantly boost the performance of ChatGPT and achieve performance comparable to full-data supervised-tuning baselines. Our code and data are publicly available at https://github.com/RUCAIBox/StructGPT.
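The sketch below shows what one invoking-linearization-generation iteration might look like in Python. The interfaces and llm objects and their methods are assumptions for illustration, not StructGPT's actual API.

```python
# Hedged sketch of an Iterative Reading-then-Reasoning loop in the spirit
# of the approach above; all interface and method names are assumed.

def iterative_reading_then_reasoning(question, interfaces, llm, max_iters=5):
    """Alternate between reading structured data and reasoning over it."""
    context = []
    output = None
    for _ in range(max_iters):
        # Invoking: call a specialized interface to collect relevant
        # evidence from the structured data (table, KG, or database).
        evidence = interfaces.collect(question, context)
        # Linearization: flatten the structured evidence into plain text
        # that the LLM can consume.
        context.append(interfaces.linearize(evidence))
        # Generation: the LLM reasons over the evidence gathered so far
        # and either produces a final answer or requests more evidence.
        output = llm.generate(question, context)
        if output.is_final:
            break
    return output.text
```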
Question Answering over Knowledge Graph (KGQA) aims to find answer entities for a natural language question from a large-scale Knowledge Graph (KG). To better perform reasoning on the KG, recent work typically adopts a pre-trained language model (PLM) to model the question, and a graph neural network (GNN) based module to perform multi-hop reasoning on the KG. Despite their effectiveness, due to the divergence in model architecture, the PLM and GNN are not closely integrated, limiting knowledge sharing and fine-grained feature interactions. To address this, we aim to simplify the above two-module approach and develop a more capable PLM, namely ReasoningLM, that can directly support subgraph reasoning for KGQA. In our approach, we propose a subgraph-aware self-attention mechanism that imitates a GNN for performing structured reasoning, and adopt an adaptation tuning strategy that adapts the model parameters using 20,000 subgraphs paired with synthesized questions. After adaptation, the PLM can be fine-tuned on downstream tasks in a parameter-efficient manner. Experiments show that ReasoningLM surpasses state-of-the-art models by a large margin, even with fewer updated parameters and less training data. Our codes and data are publicly available at https://github.com/RUCAIBox/StructGPT.
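One way to picture a subgraph-aware self-attention mechanism is as an attention mask built from the subgraph's adjacency, so entity tokens exchange information only with their KG neighbors while question tokens attend globally. The PyTorch sketch below is a minimal illustration under that assumption; the paper's exact masking scheme may differ.

```python
import torch

def subgraph_attention_mask(num_question_tokens, adjacency):
    """Build a boolean attention mask: True = attention allowed.

    adjacency: (E, E) 0/1 tensor over E subgraph node tokens; the exact
    masking scheme here is an assumption for illustration.
    """
    e = adjacency.size(0)
    n = num_question_tokens + e
    mask = torch.zeros(n, n, dtype=torch.bool)
    # Question tokens attend everywhere; all tokens attend to the question.
    mask[:num_question_tokens, :] = True
    mask[:, :num_question_tokens] = True
    # Entity tokens attend only to themselves and their KG neighbors,
    # imitating GNN message passing inside the Transformer.
    neighbors = adjacency.bool() | torch.eye(e, dtype=torch.bool)
    mask[num_question_tokens:, num_question_tokens:] = neighbors
    return mask  # feed to a standard self-attention layer as its mask
```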
Multi-hop Question Answering over Knowledge Graph (KGQA) aims to find the answer entities that are multiple hops away from the topic entities mentioned in a natural language question on a large-scale Knowledge Graph (KG). To cope with the vast search space, existing work usually adopts a two-stage approach: it first retrieves a relatively small subgraph related to the question and then performs reasoning on the subgraph to accurately find the answer entities. Although these two stages are highly related, previous work employs very different technical solutions for developing the retrieval and reasoning models, neglecting how closely related the two tasks are in essence. In this paper, we propose UniKGQA, a novel approach for the multi-hop KGQA task that unifies retrieval and reasoning in both model architecture and parameter learning. For model architecture, UniKGQA consists of a semantic matching module based on a pre-trained language model (PLM) for question-relation semantic matching, and a matching information propagation module that propagates the matching information along KG edges. For parameter learning, we design a shared pre-training task based on question-relation matching for both the retrieval and reasoning models, and then propose retrieval- and reasoning-oriented fine-tuning strategies. Compared with previous studies, our approach is more unified and ties the retrieval and reasoning stages closely together. Extensive experiments on three benchmark datasets demonstrate the effectiveness of our method on the multi-hop KGQA task. Our code and data are publicly available at https://github.com/RUCAIBox/UniKGQA.
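A minimal sketch of the two components follows, assuming cosine similarity for question-relation matching and a simple score-propagation rule along edges; both choices are illustrative, not the paper's exact formulation.

```python
import torch
import torch.nn.functional as F

def match_scores(question_emb, relation_embs):
    """Semantic matching: similarity between the question and each relation."""
    q = F.normalize(question_emb, dim=-1)   # (dim,)
    r = F.normalize(relation_embs, dim=-1)  # (num_relations, dim)
    return r @ q                            # (num_relations,)

def propagate(entity_scores, edges, edge_relation, rel_scores, steps=3):
    """Propagate matching information along KG edges: an entity is boosted
    by neighbors reached through well-matched relations. The update rule
    here is a simplified assumption for illustration."""
    src, dst = edges  # (num_edges,) long tensors of entity indices
    for _ in range(steps):
        messages = entity_scores[src] * rel_scores[edge_relation]
        entity_scores = entity_scores.scatter_add(0, dst, messages)
        entity_scores = entity_scores / entity_scores.max().clamp(min=1e-6)
    return entity_scores  # higher score = more likely an answer entity
```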
2022
Commonsense reasoning in natural language is a desired ability of artificial intelligence systems. For solving complex commonsense reasoning tasks, a typical solution is to enhance pre-trained language models (PTMs) with a knowledge-aware graph neural network (GNN) encoder that models a commonsense knowledge graph (CSKG). Despite their effectiveness, these approaches are built on heavy architectures and cannot clearly explain how external knowledge resources improve the reasoning capacity of PTMs. To investigate this issue, we conduct an in-depth empirical analysis and find that it is indeed relation features from CSKGs (not node features) that mainly contribute to the performance improvement of PTMs. Based on this finding, we design a simple MLP-based knowledge encoder that utilizes statistical relation paths as features. Extensive experiments conducted on five benchmarks demonstrate the effectiveness of our approach, which also largely reduces the parameters needed to encode CSKGs. Our code and data are publicly available at https://github.com/RUCAIBox/SAFE.
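The core finding translates into a very small model: score a question-answer pair with an MLP over a count vector of the relation paths connecting the two in the CSKG. The PyTorch sketch below is a minimal rendering of that idea; the feature-extraction details and layer sizes are assumptions.

```python
import torch.nn as nn

class RelationPathEncoder(nn.Module):
    """MLP knowledge encoder over statistical relation-path features;
    sizes and structure are illustrative assumptions."""

    def __init__(self, num_relation_paths, hidden_dim=128):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(num_relation_paths, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, 1),
        )

    def forward(self, path_counts):
        # path_counts: (batch, num_relation_paths) counts of each relation
        # path connecting the question and candidate-answer concepts.
        return self.mlp(path_counts)  # knowledge score, fused with the PTM score
```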
Knowledge base question answering (KBQA) aims to answer a question over a knowledge base (KB). Early studies mainly focused on answering simple questions over KBs and achieved great success. However, their performance on complex questions is still far from satisfactory. Therefore, in recent years, researchers have proposed a large number of novel methods that address the challenges of answering complex questions. In this survey, we review recent advances in KBQA with a focus on solving complex questions, which usually contain multiple subjects, express compound relations, or involve numerical operations. In detail, we begin by introducing the complex KBQA task and relevant background. Then, we describe benchmark datasets for the complex KBQA task and introduce the construction process of these datasets. Next, we present two mainstream categories of methods for complex KBQA, namely semantic parsing-based (SP-based) methods and information retrieval-based (IR-based) methods. Specifically, we illustrate their procedures with flow diagrams and discuss their major differences and similarities. After that, we summarize the challenges that these two categories of methods encounter when answering complex questions, and explicate the advanced solutions and techniques used in existing work. Finally, we conclude and discuss several promising directions for future research on complex KBQA.
2021
TextBox: A Unified, Modularized, and Extensible Framework for Text Generation
Junyi Li†,
Tianyi Tang†,
Gaole He,
Jinhao Jiang,
Xiaoxuan Hu,
Puzhao Xie,
Zhipeng Chen,
Zhuohao Yu,
Wayne Xin Zhao*,
Ji-Rong Wen
The 59th Annual Meeting of the Association for Computational Linguistics (ACL), 2021, System Demonstration
pdf / code
We release an open library, called TextBox, which provides a unified, modularized, and extensible text generation framework. TextBox aims to support a broad set of text generation tasks and models. In TextBox, we implement several text generation models on benchmark datasets, covering VAEs, GANs, pre-trained language models, and more. Meanwhile, our library maintains sufficient modularity and extensibility by decomposing the model architecture, inference, and learning process into highly reusable modules, which makes it easy to incorporate new models into our framework. It is especially suitable for researchers and practitioners who want to efficiently reproduce baseline models and develop new ones. TextBox is implemented on top of PyTorch and released under the Apache License 2.0 at https://github.com/RUCAIBox/TextBox.
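To illustrate the kind of decomposition the library describes, the sketch below separates dataset, model, learning, and inference into swappable components. This is not TextBox's actual API; every name here is hypothetical.

```python
# NOT TextBox's actual API: a hypothetical sketch of the modular
# decomposition described above, with architecture, learning, and
# inference as separate, swappable components.

class TextGenerationPipeline:
    def __init__(self, dataset, model, trainer, generator):
        self.dataset = dataset      # benchmark dataset module
        self.model = model          # e.g., a VAE, GAN, or pre-trained LM
        self.trainer = trainer      # learning-process module
        self.generator = generator  # inference/decoding module

    def run(self):
        # Swapping any single component (say, a new model) leaves the rest
        # intact, which is what makes reproducing baselines cheap.
        self.trainer.fit(self.model, self.dataset.train_split)
        return self.generator.generate(self.model, self.dataset.test_split)
```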
Knowledge base question answering (KBQA) aims to answer a question over a knowledge base (KB). Recently, a large number of studies have focused on semantically or syntactically complicated questions. In this paper, we systematically summarize the typical challenges and solutions for complex KBQA. We begin by introducing the background of the KBQA task. Next, we present the two mainstream categories of methods for complex KBQA, namely semantic parsing-based (SP-based) methods and information retrieval-based (IR-based) methods. We then comprehensively review advanced methods from the perspective of these two categories and, specifically, explicate their solutions to the typical challenges. Finally, we conclude and discuss some promising directions for future research.
* Corresponding author † Equal contribution