FinDDR 2025

Financial Document Deep Research Grand Challenge

Nov 15-18, 2025, Singapore

Co-located with The 6th International ACM International Conference on AI in Finance (ICAIF)

ACM ICAIF 2025

Announcements

October 16, 2025:

The official leaderboard is now live! Check the Leaderboard section for the latest rankings.

October 9, 2025:

The challenge deadline has been extended by one week. The new submission deadline is October 22, 2025. The Timeline has been updated.
A total of USD 3,000 in cash prizes for the winning teams has been confirmed! See the Prizes section for the detailed breakdown.

October 6, 2025:

The test set is now available for download in the Task and Dataset section.
Note: Test scores will be made public after the challenge ends. You can use the validation set to test your solution robustness.

Winners

🥇

SilverSight

Members:

Yuhang Zhou, Xutian Chen, Zhongyi Liu, Yu He, Ziyang Liu, Yunwen Chen

Organization:

Fudan University, Shanghai Innovation Institute, DataGrand Inc

🥈

Finsselaer

Members:

Andy Zhu

Organization:

Rensselaer Polytechnic Institute

🥉

Token Refund

Members:

Jinyu Wang, Shiran Zhang

Organization:

Microsoft Research Asia

Overview

Financial analysis is crucial for informed decision-making among stakeholders of public companies. Yet extracting insight from lengthy and complex annual reports remains a significant challenge. Utilizing the proven capabilities of Deep Research Agents, we propose the Financial Document Deep Research (FinDDR) Challenge, which adopts similar deep research methodologies in question design and evaluation frameworks. The FinDDR Challenge introduces a richly structured, industry-diverse dataset and requires participants to generate comprehensive, sectioned research reports. This is accomplished through a hierarchical, stepwise reasoning framework that closely emulates the analytical methodologies employed by professional financial analysts. In conclusion, the FinDDR Challenge seeks to establish new benchmarks for complex document-based deep research in financial AI applications, fostering progress and collaboration across both academic and industry communities.

Task and Dataset

The task requires participants to generate a comprehensive and deep research report based solely on a pair of multi-year annual reports of a listed company following a desired report structure. The use of external data sources other than provided annual reports (e.g., websites, news articles, financial data providers) is strictly prohibited. This structure is designed to emulate the analytical workflow of professional financial analysts, progressing from basic fact extraction, to calculation of financial indicators, to in-depth interpretation and summarization.
In the following, we provide the detailed description about the input raw annual report and the hierarchical structure of the desired output research report:

Diverse Input Raw Annual Report（Download）
- 4 Languages: English, Simplified Chinese, Traditional Chinese and Indonesia Bahasa
- 8 Markets: USA, UK, China, Hong Kong, Australia, Singapore, Indonesia, and Malaysia.
- 15+ Industries: Property & Real Estate, Healthcare & Medical Services, Consumer Goods & Services, Technology, Aerospace & Defense, Telecommunications, Oil & Gas, Transport & Logistics, Mining & Resources, Plantation, Utilities
Hierarchical and Comprehensive Output Report Structure (Download）
- Company Overview: This section provides a concise overview of the company, including:
- Financial Performance: This section presents a detailed analysis of the company’s financial health, including:
- Business Analysis: This section provides a summary and analysis of a company’s business performance and strategies, including:
- Risk Factors: This section identifies and discusses the principal risks the company faces, including:
- Corporate Governance: This section outlines the company’s governance framework, including:
- Future Outlook: This section provides management’s projections and strategic plans for the future, including:

FinDDR Dataset

For every market in the FinDDR Dataset, we provide a roughly equal number of companies from various industries. Two annual reports (2023 and 2024) for each company are provided. The FinDDR Dataset is divided into sample, validation, and test sets. Participants can develop their methods using the sample set and evaluate performance on the validation and test sets. The raw annual reports are available for the following sets: sample set and validation set. For the sample set, we also provide the corresponding sample deep research reports (in both Markdown and Word formats). ~~The raw annual reports for the test set will be released soon.~~ The raw annual reports for the test set are now available.

Market	Sample Set	Validation Set	Test Set
US	1	6	7
UK	1	6	6
China	1	6	5
Hong Kong	1	6	6
Singapore	1	6	6
Australia	1	6	5
Indonesia	1	6	6
Malaysia	1	6	7
Total	8	48	48

Evaluation

We apply two metrics to evaluate different sections, the actual metrics used for each section can be found in the defined structure of the deep research report）.

Accuracy: we will use advanced LLMs to judge if the submitted answers are factually correct by comparing them with the ground truth, where an LLM takes question, submitted answer, and ground truth as input and outputs the evaluation result.
DeepEval.SummarizationMetric: The summarization metric uses LLM-as-a-judge to determine whether your LLM (application) is generating factually correct summaries while including the necessary details from the original text. More details can be find via Link

The final score for a research report is a weighted sum of its section scores. The overall score of the submitted model is then calculated by averaging the scores of all reports.

Winner Verification: Top-performing teams will be invited to present their solutions at the FinDDR 2025 venue. To ensure fairness and reproducibility, winning teams will be required to provide supporting materials such as code, prompts, and relevant documentation to verify that their reports are AI-generated and their results can be reproduced.

Prizes^*

🥇

1st Place

USD 1,500

🥈

2nd Place

USD 1,000

🥉

3rd Place

USD 500

*Final rankings will be determined based on performance on the test set.

Leaderboard

Leaderboard on Validation Set

Rank	Team	Model	Organization	Overall Score	US	UK	China	HK	Singapore	Australia	Indonesia	Malaysia	Last Update
🎯	Full Marks	-	-	240	240	240	240	240	240	240	240	240	2025/10/02
🥇	SilverSight	SilverSight Agent	Fudan University, Shanghai Innovation Institute, DataGrand Inc	184.94	170.55	175.66	183.82	189.38	185.48	185.6	186.94	202.09	2025/10/22
🥈	Finsselaer	FinFiler Agent	Rensselaer Polytechnic Institute	177.15	182.1	175.11	167.02	177.22	160.14	186.95	177.74	190.94	2025/10/22
🥉	Token Refund	PIKE-Report	Microsoft Research Asia	165.36	162.15	155.92	150.53	169.4	171.67	172.27	166.04	174.89	2025/10/22
4	Baseline	DeepSeek-v3.2	Official	144.75	160.88	130.47	140.56	146.76	125.25	150.66	147.06	156.38	2025/10/15
5	Baseline	GPT-5-MINI with File Search	Official	144.59	153.11	137.82	124.28	155.86	135.76	160.85	137.14	151.88	2025/10/15
6	Baseline	GPT-5-NANO	Official	130.84	149.71	124.96	117.54	105.38	111.76	150.95	143.31	143.13	2025/10/15
7	ICT-NDST	ICTDR	Chinese Academy of Sciences	116.99	123.39	110.15	103.55	111.75	110.7	136.79	120.8	118.82	2025/10/22
8	Baseline	GPT-OSS-20B	Official	108.02	115.54	82.02	89.8	103.55	107.43	126.46	122.72	116.61	2025/10/15
9	afinit	afinit_fin_report_agent_v2	afinit	98.65	99.95	77.18	145.74	70.17	83.93	110.35	85.57	116.29	2025/10/21
10	RUCFinAI	DeepFinAgent	Renmin University of China	39.08	33.77	26.28	39.8	49.04	30.27	62.77	40.27	30.41	2025/10/22
11	SI4Fin	Qwen3RAG	A*STAR	38.78	28.95	24.51	37.01	49.19	35.75	60.7	35.66	38.45	2025/10/02

Leaderboard on Test Set (Final)

Rank	Team	Model	Organization	Overall Score	US	UK	China	HK	Singapore	Australia	Indonesia	Malaysia	Last Update
🎯	Full Marks	-	-	240	240	240	240	240	240	240	240	240	2025/10/02
🥇	SilverSight	SilverSight Agent	Fudan University, Shanghai Innovation Institute, DataGrand Inc	197.66	199.21	204.94	207.11	188.31	206.28	192.29	188.95	195.03	2025/10/23
🥈	Finsselaer	FinFiler Agent	Rensselaer Polytechnic Institute	184.5	179.09	187.08	188.08	176.86	187.14	189.54	174.02	194.80	2025/10/23
🥉	Token Refund	PIKE-Report	Microsoft Research Asia	173.31	163.99	180.77	167.79	163.20	181.92	175.50	171.76	181.22	2025/10/23
4	Financial Wizard	Experian_FinAgent	Experian	171.01	175.11	178.45	171.71	162.69	174.57	163.79	159.25	179.34	2025/10/17
5	afinit	afinit_fin_report_agent_v2	afinit	158.40	161.46	174.42	154.68	142.89	169.16	154.22	142.98	164.53	2025/10/23
6	e0nia	aiar	Individual	156.28	149.12	160.05	149.96	152.86	156.58	168.54	143.95	169.24	2025/10/23
7	Baseline	DeepSeek-v3.2	Official	156.20	156.74	162.88	139.55	140.00	152.59	169.61	153.28	171.71	2025/10/15
8	Baseline	GPT-5-MINI with File Search	Official	150.72	152.43	160.27	132.22	130.61	162.53	151.24	147.30	163.72	2025/10/15
9	Baseline	GPT-5-NANO	Official	149.10	142.92	159.74	141.23	134.11	162.12	174.96	121.03	159.10	2025/10/15
10	SI4Fin	GeminiFlashRAG	A*STAR	140.81	148.61	140.29	154.85	132.29	131.95	148.87	122.83	148.01	2025/10/21
11	ICT-NDST	ICTDR	Chinese Academy of Sciences	127.23	130.28	148.60	94.86	129.89	128.47	130.00	108.53	139.68	2025/10/22
12	DeepSeek Your Report	FinCMini Agent	Shanghai University of International Business and Economics	121.92	153.07	82.47	126.35	106.98	139.93	131.30	72.70	154.26	2025/10/22
13	Baseline	GPT-OSS-20B	Official	113.41	116.98	101.28	92.39	109.06	108.94	119.57	115.50	136.61	2025/10/15
14	LedgerLens	AEGIS	The University of Technology Sydney	76.68	78.50	75.32	83.78	80.99	80.66	78.81	44.12	90.21	2025/10/21
15	FinSight	CAVM Agent	Renmin University of China	71.94	71.45	69.89	55.22	74.73	65.93	85.37	83.55	69.35	2025/10/21
16	DataLovers	FinMAHRAG3	Rajiv Gandhi Institute of Petroleum Technology,Galgotias University,Wells Fargo	58.88	59.54	63.95	47.72	49.15	64.07	70.68	43.98	70.11	2025/10/23
17	RUCFinAI	DeepFin Agent	Renmin University of China	51.29	58.15	47.22	40.09	51.66	50.26	63.45	35.35	61.47	2025/10/22

Submission

To submit your predictions for the validation or test set, please send your results in Markdown format (Samples) to finddr2025@gmail.com. The predicted report should use the Case ID as the file name like the sample reports. For example, if the Case ID is "val001", the file name should be "val001.md".
Please use the following subject in your email: FinDDR2025-{Val/Test}-{Method Name}-{Team Name}

The Method Name should be a unique name for your system/method, not just the name of the base model used (e.g., "MyAwesomeMethod", "FinAgent-v2", not "GPT-4").

Team Member Information: Please include the names and affiliations of all team members directly in the body of your submission email. Each team may have up to 6 members. We will consider the team member list from your most recent submission as the final version for your team.

Important: Each team may submit up to 3 times per set per day. Please allow up to two days for leaderboard updates.

Timeline

Challenge Starts

20 Aug 2025

→

Sample Set Release

25 Aug 2025

→

Val Set Release

15 Sep 2025

→

Test Set Release

6 Oct 2025

→

Challenge Ends

22 Oct 2025

→

Awards

16 Nov 2025

Organizers

Project Leader

• Fengbin Zhu | National University of Singapore
• Chao Wang | 6Estates Pte Ltd
• Tianhui Tan | Asian Institute of Digital Finance

Dataset Construction and Evaluation

• Xiang Yao Ng | 6Estates Pte Ltd
• Ziyang Liu | 6Estates Pte Ltd
• Huanchang Zhuo | 6Estates Pte Ltd
• Min Xu | 6Estates Pte Ltd
• Stanley Marcelino | 6Estates Pte Ltd
• Jing Wang | 6Estates Pte Ltd
• Junfeng Li | National University of Singapore
• Chang Liu | Asian Institute of Digital Finance
• Xuan Yao | Asian Institute of Digital Finance
• Hao Zhuang | Asian Institute of Digital Finance
• Ruiqi Zheng | Asian Institute of Digital Finance
• Haiyi Shao | Asian Institute of Digital Finance
• Harpal Kaur Tarjindar Singh Dhindsa | Asian Institute of Digital Finance
• Zixuan Wang | 6Estates Pte Ltd
• Xiaohan Ai | 6Estates Pte Ltd
• Lan Huang | 6Estates Pte Ltd
• Xin Lin | 6Estates Pte Ltd
• Xianwei Zeng | 6Estates Pte Ltd
• Jing Wang | 6Estates Pte Ltd

Advisor

• Ke-Wei Huang | Asian Institute of Digital Finance
• Shuo Zhang | Bloomberg
• Wenjie Wang | University of Science and Technology of China
• Fuli Feng | University of Science and Technology of China
• Huanbo Luan | 6Estates Pte Ltd
• Tat-Seng Chua | National University of Singapore

Venue

FinDDR 2025 will be co-located with The 6th International ACM International Conference on AI in Finance at Sheraton Towers Singapore.

FinDDR 2025

Financial Document Deep Research Grand Challenge

Announcements

Winners

🥇

SilverSight

🥈

Finsselaer

🥉

Token Refund

Overview

Task and Dataset

Evaluation

Prizes*

🥇

1st Place

🥈

2nd Place

🥉

3rd Place

Leaderboard

Submission

Timeline

Organizers

Project Leader

Dataset Construction and Evaluation

Advisor

Venue

Prizes^*