Cover image

When the Market Speaks: A New Dataset That Actually Listens

In financial sentiment analysis, the devil has always been in the labeling. Most datasets — even the industry-standard Financial-Phrasebank — ask human annotators to tag headlines as positive, negative, or neutral. But here’s the problem: the market often disagrees. Take a headline reporting widening losses. Annotators marked it “negative.” Yet the stock rose the next day. Welcome to the disconnect. Enter FinMarBa, a bold new dataset that cuts out the middleman — the human — and lets the market itself do the labeling. Developed by Lefort et al. (2025), this 61,252-item dataset uses next-day price reactions to classify financial news, creating a labeling method that is empirically grounded, scalable, and (critically) aligned with investor behavior. ...

August 3, 2025 · 3 min · Zelina
Cover image

🚀 All Talk, No Stocks? What Reddit Sentiment *Doesn't* Predict

In the wake of the GameStop and AMC frenzies, financial firms and researchers have been racing to decode one question: Can social media sentiment predict stock prices? A new paper from researchers at Wrocław University of Science and Technology provides a sobering answer: not really. Despite employing advanced sentiment models—including a ChatGPT-annotated and emoji-savvy version of Financial-RoBERTa—the study found only weak and inconsistent relationships between sentiment and price movement for GME and AMC. ...

August 1, 2025 · 3 min · Zelina
Cover image

Stacking Alpha: How HARLF's Three-Tier Reinforcement Learner Beats the Market

The idea of merging language models and financial algorithms isn’t new — but HARLF takes it a step further by embedding them in a hierarchical reinforcement learning (HRL) framework that actually delivers. With a stunning 26% annualized ROI and a Sharpe ratio of 1.2, this isn’t just another LLM-meets-finance paper. It’s a blueprint for how sentiment and structure can be synergistically harnessed. From FinBERT to Fortune: Integrating Text with Tickers Most financial LLM pipelines stop at score generation: classify sentiment and call it a signal. But HARLF builds a full sentiment pipeline using FinBERT, generating monthly sentiment scores from scraped Google News articles for each of 14 assets. These scores aren’t just inputs — they form a complete observation vector that includes: ...

July 27, 2025 · 3 min · Zelina
Cover image

The Sentiment Edge: How FinDPO Trains LLMs to Think Like Traders

Financial markets don’t reward the loudest opinions. They reward the most timely, well-calibrated ones. FinDPO, a new framework by researchers from Imperial College London, takes this lesson seriously. It proposes a bold shift in how we train language models to read market sentiment. Rather than relying on traditional supervised fine-tuning (SFT), FinDPO uses Direct Preference Optimization (DPO) to align a large language model with how a human trader might weigh sentiment signals in context. And the results are not just academic — they translate into real money. ...

July 27, 2025 · 3 min · Zelina
Cover image

Signals & Sentiments: How GPT-2 and FinBERT Beat Buy-and-Hold on the S&P 500

When it comes to trading the S&P 500, tradition says: trust the chart. But a new study from UCLA researchers proposes a smarter compass—one that listens not only to price momentum but also to the tone of the news. By merging language model-powered sentiment scores with technical indicators and time-series forecasting, the authors build a hybrid strategy that outperforms a buy-and-hold baseline during a volatile 3-month window. ...

July 20, 2025 · 3 min · Zelina