Affiliation(s): 1Department of Industrial Engineering and Operations Research, Columbia University, New York 10027, USA;
moreAffiliation(s): 1Department of Industrial Engineering and Operations Research, Columbia University, New York 10027, USA; 2School of Information Management and Engineering, Shanghai University of Finance and Economics, Shanghai 200433, China; 3JF SmartInvest Holdings Ltd., Shanghai 201702, China; 4Department of Computer Science, Johns Hopkins University, Baltimore 21218, USA;
less
Abstract: Current financial large language models (FinLLMs) exhibit two major limitations: the absence of standardized evaluation metrics for stock analysis quality and insufficient analytical depth. We address these limitations with two contributions. First, we introduce AnalyScore, a systematic framework for evaluating the quality of stock analysis. Second, we construct Stocksis, an expert-curated dataset designed to enhance LLMs'financial analysis capabilities. Building on Stocksis, together with a novel integration framework and quantitative tools, we develop FinSphere, an AI agent that generates professional-grade stock analysis reports. Evaluations with AnalyScore show that FinSphere consistently surpasses general-purpose LLMs, domain-specific FinLLMs, and existing agent-based systems, even when the latter are enhanced with real-time data access and few-shot guidance. The findings highlight FinSphere's significant advantages in analytical quality and real-world applicability.
Darkslateblue:Affiliate; Royal Blue:Author; Turquoise:Article
Reference
Open peer comments: Debate/Discuss/Question/Opinion
Open peer comments: Debate/Discuss/Question/Opinion
<1>