Sentiment trading with large language models

Sentiment trading with large language models

Kemal Kirtac
Guido Germano
Published on 12/26/2024
Equities
Stocks
United States (US)
Sentiment
Sentiment trading
LLM
AI
Machine learning
Stock picking
Long short equity

This research paper investigates the application of large language models (LLMs) in sentiment analysis of U.S. financial news to predict stock market returns, comparing advanced LLMs with traditional methods. The study analyzes a comprehensive dataset of 965,375 news articles from 2010 to 2023, evaluating models including BERT, OPT, FINBERT, and the Loughran-McDonald dictionary. It finds that LLMs, particularly OPT (a GPT-3 based model), significantly outperform traditional approaches in sentiment prediction accuracy and their association with subsequent daily stock returns.

The methodology involves sentiment scoring of news articles using the various models, followed by regression analyses to quantify the impact on returns and portfolio performance assessments through long-short strategies. Results show OPT achieving the highest accuracy (74.4%) and the strongest predictive power in regressions, with coefficients indicating a positive effect on next-day returns. In portfolio management, OPT-based strategies yield a superior Sharpe ratio of 3.05, highlighting its practical utility. These findings challenge the dominance of dictionary-based methods like Loughran-McDonald, which show minimal effectiveness, and underscore a shift towards LLMs in financial analysis, with implications for market prediction, regulation, and policy.

Highlights

  • 1Investigates the efficacy of large language models (LLMs) in sentiment analysis of U.S. financial news for predicting stock market returns
  • 2Compares performance of various LLMs (BERT, OPT, FINBERT) against the traditional Loughran-McDonald dictionary model
  • 3Documents a significant association between LLM sentiment scores and subsequent daily stock returns, with OPT showing the highest accuracy
  • 4Demonstrates superior portfolio performance of OPT-based long-short strategies with a Sharpe ratio of 3.05
  • 5Challenges the effectiveness of traditional dictionary-based methods in modern financial contexts

Methods

  • M
    Sentiment analysis using large language models (BERT, OPT, FINBERT) and the Loughran-McDonald dictionary model
  • M
    Regression analyses to assess the impact of sentiment scores on next-day stock returns
  • M
    Portfolio performance evaluation through long-short strategies based on sentiment predictions
  • M
    Analysis of a dataset comprising 965,375 news articles from January 1, 2010, to June 30, 2023

Results

  • R
    OPT model achieves the highest sentiment prediction accuracy at 74.4%, followed by BERT (72.5%) and FINBERT (72.2%), while Loughran-McDonald dictionary model shows only 50.1% accuracy
  • R
    Regression analyses indicate a robust positive impact of OPT model scores on next-day stock returns, with coefficients of 0.274 and 0.254 in different specifications
  • R
    Long-short OPT strategy yields the highest Sharpe ratio of 3.05, compared to 2.11 for BERT, 2.07 for FINBERT, and 1.23 for Loughran-McDonald strategies
  • R
    No significant relationship observed between Loughran-McDonald dictionary model scores and stock returns, questioning its current efficacy
0/5

Analyze Paper

Generate insights from "Sentiment trading with large language models".

Suggested Actions