基于大语言模型的财务报告指标抽取智能体方法
Agent-Based Financial Report Indicator Extraction with Large Language Models
DOI: 10.12677/airr.2025.146127, PDF,   
作者: 张亚豪, 秦 疆:北京信息科技大学计算机学院,北京;施水才, 王洪俊:北京信息科技大学计算机学院,北京;拓尔思信息技术股份有限公司,北京
关键词: 大语言模型智能体框架检索增强生成财务报告分析Large Language Models (LLMs) Agent Framework Retrieval-Augmented Generation (RAG) Financial Report Analysis
摘要: 金融年报中的信息抽取因其复杂的PDF格式和超长上下文而极具挑战。传统的检索增强生成(RAG)方法受限于单步、静态的检索范式,一旦初始查询与文档表述不匹配,便容易失败。为解决这一检索脆弱性问题,本文提出了一个名为LedgerLens的多智能体协作框架。该框架借鉴人类分析师的认知模式,其核心研究智能体(Researcher Agent)通过“检索–分析–精炼”的迭代循环,在初步检索结果不佳时能够自主重构查询并进行多轮尝试,直至精准定位目标信息。在自建的银行年报问答数据集(BAR-QA)上的实验结果表明,LedgerLens在指标抽取任务中取得了94.1%的F1分数,并在大多数任务上实现了领先表现。研究结果证明,引入基于智能体的迭代查询优化机制,是突破传统RAG在复杂真实场景中检索瓶颈的有效途径。
Abstract: Information extraction from financial annual reports is highly challenging due to their complex PDF formatting and extremely long contexts. Traditional retrieval-augmented generation (RAG) methods rely on a single-step, static retrieval paradigm, which is prone to failure when the initial query does not align with document expressions. To address this retrieval vulnerability, this study proposes LedgerLens, a multi-agent collaborative framework. Inspired by the cognitive process of human analysts, the core Researcher Agent operates through an iterative “retrieve-nalyze-refine” loop, enabling it to autonomously reformulate queries and conduct multiple retrieval attempts until the target information is accurately located. Experiments on a self-constructed Bank Annual Report Question Answering dataset (BAR-QA) demonstrate that LedgerLens achieves an F1 score of 94.1% in the indicator extraction task and outperforms baselines on most tasks. These results indicate that introducing an agent-based iterative query optimization mechanism provides an effective solution to overcoming the retrieval bottlenecks of traditional RAG in complex real-world scenarios.
文章引用:张亚豪, 施水才, 王洪俊, 秦疆. 基于大语言模型的财务报告指标抽取智能体方法[J]. 人工智能与机器人研究, 2025, 14(6): 1361-1371. https://doi.org/10.12677/airr.2025.146127

参考文献

[1] Huang, Y., Chen, Y., Zhang, H., Li, K., Fang, M., Yang, L., Li, X., Shang, L., Xu, S., Hao, J., Shao, K. and Wang, J. (2025) Deep Research Agents: A Systematic Examination and Roadmap. 10.48550/arXiv.2506.18096. [Google Scholar] [CrossRef
[2] Zheng, Y., Fu, D., Hu, X., Cai, X., Ye, L., Lu, P. and Liu, P. (2025) Deep Researcher: Scaling Deep Research via Reinforcement Learning in Real-World Environments. ArXiv, abs/2504.03160.
[3] Sarmah, B., Mehta, D., Hall, B., Rao, R., Patel, S. and Pasquali, S. (2024) Hybridrag: Integrating Knowledge Graphs and Vector Retrieval Augmented Generation for Efficient Information Extraction. Proceedings of the 5th ACM International Conference on AI in Finance, Brooklyn, 14-17 November 2024, 608-616. [Google Scholar] [CrossRef
[4] Wang, J., Ding, W. and Zhu, X. (2025) Financial Analysis: Intelligent Financial Data Analysis System Based on LLM-RAG. Applied and Computational Engineering, 145, 182-189. [Google Scholar] [CrossRef
[5] Setty, S., Jijo, K., Chung, E. and Vidra, N. (2024) Improving Retrieval for RAG based Question Answering Models on Financial Documents. ArXiv, abs/2404.07221.
[6] Kim, S., Song, H., Seo, H. and Kim, H. (2025) Optimizing Retrieval Strategies for Financial Question Answering Documents in Retrieval-Augmented Generation Systems. ArXiv, abs/2503.15191.
[7] Lee, J. and Roh, M. (2024) Multi-Reranker: Maximizing Performance of Retrieval-Augmented Generation in the Finance RAG Challenge. ArXiv, abs/2411.16732.
[8] Reddy, V., Koncel-Kedziorski, R., Lai, V., Krumdick, M., Lovering, C. and Tanner, C. (2024) DocFinQA: A Long-Context Financial Reasoning Dataset. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Bangkok, 11-16 August 2024, 445-458. [Google Scholar] [CrossRef
[9] Islam, P., Kannappan, A., Kiela, D., Qian, R., Scherrer, N. and Vidgen, B. (2023) Finance Bench: A New Benchmark for Financial Question Answering. ArXiv, abs/2311.11944.
[10] Jimeno-Yepes, A., You, Y., Milczek, J., Laverde, S. and Li, R. (2024) Financial Report Chunking for Effective Retrieval Augmented Generation. ArXiv, abs/2402.05131.
[11] Jiang, C., Zhang, P., Ni, Y., Wang, X., Peng, H., Liu, S., et al. (2025) Multimodal Retrieval-Augmented Generation for Financial Documents: Image-Centric Analysis of Charts and Tables with Large Language Models. The Visual Computer, 41, 7657-7670. [Google Scholar] [CrossRef
[12] Gondhalekar, C., Patel, U. and Yeh, F. (2025) MultiFinRAG: An Optimized Multimodal Retrieval-Augmented Generation (RAG) Framework for Financial Question Answering. ArXiv, abs/2506.20821.
[13] Lai, V.D., Krumdick, M., Lovering, C., Reddy, V., Schmidt, C.W. and Tanner, C. (2024) SEC-QA: A Systematic Evaluation Corpus for Financial QA. ArXiv, abs/2406.14394.
[14] Wang, X., Chi, J., Tai, Z., Kwok, T.S., Li, M., Li, Z., He, H., Hua, Y., Lu, P., Wang, S., Wu, Y., Huang, J., Tian, J. and Zhou, L. (2025) FinSage: A Multi-Aspect RAG System for Financial Filings Question Answering. ArXiv, abs/2504.14493.