Sentiment trading with large language models
Sentiment trading with large language models
This research paper investigates the application of large language models (LLMs) in sentiment analysis of U.S. financial news to predict stock market returns, comparing advanced LLMs with traditional methods. The study analyzes a comprehensive dataset of 965,375 news articles from 2010 to 2023, evaluating models including BERT, OPT, FINBERT, and the Loughran-McDonald dictionary. It finds that LLMs, particularly OPT (a GPT-3 based model), significantly outperform traditional approaches in sentiment prediction accuracy and their association with subsequent daily stock returns.
The methodology involves sentiment scoring of news articles using the various models, followed by regression analyses to quantify the impact on returns and portfolio performance assessments through long-short strategies. Results show OPT achieving the highest accuracy (74.4%) and the strongest predictive power in regressions, with coefficients indicating a positive effect on next-day returns. In portfolio management, OPT-based strategies yield a superior Sharpe ratio of 3.05, highlighting its practical utility. These findings challenge the dominance of dictionary-based methods like Loughran-McDonald, which show minimal effectiveness, and underscore a shift towards LLMs in financial analysis, with implications for market prediction, regulation, and policy.
Highlights
- 1Investigates the efficacy of large language models (LLMs) in sentiment analysis of U.S. financial news for predicting stock market returns
- 2Compares performance of various LLMs (BERT, OPT, FINBERT) against the traditional Loughran-McDonald dictionary model
- 3Documents a significant association between LLM sentiment scores and subsequent daily stock returns, with OPT showing the highest accuracy
- 4Demonstrates superior portfolio performance of OPT-based long-short strategies with a Sharpe ratio of 3.05
- 5Challenges the effectiveness of traditional dictionary-based methods in modern financial contexts
Methods
- MSentiment analysis using large language models (BERT, OPT, FINBERT) and the Loughran-McDonald dictionary model
- MRegression analyses to assess the impact of sentiment scores on next-day stock returns
- MPortfolio performance evaluation through long-short strategies based on sentiment predictions
- MAnalysis of a dataset comprising 965,375 news articles from January 1, 2010, to June 30, 2023
Results
- ROPT model achieves the highest sentiment prediction accuracy at 74.4%, followed by BERT (72.5%) and FINBERT (72.2%), while Loughran-McDonald dictionary model shows only 50.1% accuracy
- RRegression analyses indicate a robust positive impact of OPT model scores on next-day stock returns, with coefficients of 0.274 and 0.254 in different specifications
- RLong-short OPT strategy yields the highest Sharpe ratio of 3.05, compared to 2.11 for BERT, 2.07 for FINBERT, and 1.23 for Loughran-McDonald strategies
- RNo significant relationship observed between Loughran-McDonald dictionary model scores and stock returns, questioning its current efficacy
Analyze Paper
Generate insights from "Sentiment trading with large language models".