5.4 Data Collection Module
Utilizes Yahoo Finance API to collect real-time and
historical stock price data, financial reports, and key
performance indicators (KPIs). The Yahoo Finance
API automatically updates stock data at defined
intervals for continuous monitoring. Cleans raw
financial data by handling missing values, anomalies,
and noise. Transforms data into structured formats
(CSV, JSON, Pandas DataFrames) for faster
processing and analysis.
Table 1 show the Input
Specifications:
Phidata’s automation capabilities streamline the
data pipeline, ensuring real-time stock data updates
without manual intervention.
5.5 AI-Based Analysis Module
Utilizes NLP models for sentiment analysis from
financial reports and news articles:
Large Language Models (LLMs) (e.g., GPT-
based models, FinBERT) analyze financial news,
earnings reports, and social media sentiment.
Extracts positive, negative, or neutral sentiment
for stocks and industries.
Identifies market-moving news and predicts investor
sentiment shifts.
Implements machine learning models for trend
forecasting and anomaly detection:
Uses time series forecasting models (LSTMs,
ARIMA, and Transformer-based models) to predict
stock price movements.
Detects unusual price fluctuations and alerts users
about potential risks or investment opportunities.
Phidata facilitates efficient data pipelines, making it
easier to process large volumes of stock market data
quickly.
5.6 Query Interface Module
Accepts user-defined inputs (e.g., stock symbol,
timeframe, financial metric):
Users can enter stock symbols, date ranges, and
financial KPIs (e.g., P/E ratio, market cap, revenue
growth).
Allows for custom queries related to stock trends,
risk factors, and sector-specific performance.
Retrieves relevant stock trends, sentiment
analysis, and predictive insights:
Combines real-time data with AI predictions to
provide actionable investment insights.
Uses Phidata workflows to efficiently process and
return query results in a structured format.
Enables interactive exploration of stock trends
without requiring financial expertise.
5.7 Summary Generation Module
AI-powered engine for generating natural language
summaries based on analysed financial information:
LLM-based report generation from stock data,
sentiment analysis, and forecasts.
General impact of the financial news across stock
price movements of the typical stocks and the future
assessments of risk.
Instead of just stock data gives you intuitive insights:
Instead of showing complicated graphs and
numbers, the system gives plain-text summaries (for
example, “Stock X is seeing a good upward trend in
light of strong quarterly earnings”).
Allows investors lacking coding knowledge to
get a picture of financial trends without advanced
analytics capabilities.
5.8 LLM Judge Evaluation Module
Employs LLMs to review the output produced by the
summary generation module:
Uses reinforcement learning with another Judge
LLM to verify AI generated summaries are factually
accurate and in the right context.
Aims Open source LLMs Llama 3.1 70 b
versatile assume as true financial summaries through
comparing to live inventory piece of evidence.
Minimizes human intervention and evaluates the
accuracy of the output:
Conducts cross-check against the original data
and results of sentiment analysis.
Note: Potential hallucinations or misleading
information detected, providing reliable investment
insights.
Lowers manual verification overhead, enabling
automated, high-accuracy financial reporting.
6 CONCLUSIONS AND FUTURE
WORK
This paper uses AI agents to collect real-time data,
analyse sentiment, and query data based on user
preferences to address major issues in Financial
analysis. The system improves the decision-making
process for investors and analysts by automating
stock analysis and delivering AI-powered financial
insights.