Reading Between the Lines: NLP for Long-Horizon Factor Investing (Part 1 of 2)

Key Points

  • Natural language processing (NLP) can extract investment signals from unstructured text that aren’t already captured by traditional factors like value, quality, momentum, and low volatility.

  • Unlike news sentiment, year-over-year textual change in SEC filings is a slow-moving, durable signal that investors can use to potentially enhance long-horizon factor strategies.

  • The signal's persistence may be supported by three drivers that technology doesn't easily erode: limited investor attention, management incentives to bury bad news in revised filings, and compensation for the genuine uncertainty signaled by meaningful rewrites.

  • The signal is most useful as a value-trap filter: it has historically produced the strongest return spread among stocks that already look risky on conventional measures — the cheap, aggressively investing, unprofitable, high-volatility, and low-momentum segments.

Que Nguyen is the corresponding author.

Introduction

When it comes to systematic investing, numbers tell only part of the story. Traditional quantitative models rely on prices, earnings, and balance sheet data, but words matter too. News articles, SEC filings, earnings call transcripts, and social media all help shape how markets value companies. With online text expanding at a staggering pace and computing power advancing just as quickly, natural language processing (NLP)—the technology that lets computers read and interpret human language—has become an increasingly relevant and valuable tool in modern investment management.

Transforming Text into Investment Signals

The idea that words can move markets isn’t new, but quantifying it took time. The first solid evidence came almost two decades ago when Tetlock (2007) demonstrated that pessimistic coverage in The Wall Street Journal predicted short-term downward pressure on stock prices.

From there, researchers kept finding signals the numbers missed.1 Feldman et al. (2010) showed that tonal shifts in management discussion and analysis (MD&A) sections forecast returns even after controlling for earnings surprises.2 In a paper aptly titled "Lazy Prices," Cohen, Malloy, and Nguyen (2020) found that when firms meaningfully changed their 10-K text year over year, the stocks underperformed for the next six months. The implication: investors simply weren't reading carefully enough.

Today the field has entered the era of large language models (LLMs). Lopez-Lira and Tang (2023) demonstrated that modern AI models significantly outperform dictionary-based methods at interpreting financial news, and researchers are now building entirely new fundamental factors from qualitative disclosures — Eisfeldt et al. (2026), for example, parse 10-K language to construct an "intangible intensity" metric that captures investment the financial statements miss.

Our Approach: Building a Signal That Lasts

The dominant NLP application in investment management targets short-term trading, with daily news sentiment driving moves that play out over hours or days. That’s not much use to long-term factor investors, who need durable, slow-moving signals.

The dominant NLP application in investment management targets short-term trading, with daily news sentiment driving moves that play out over hours or days. That’s not much use to long-term factor investors, who need durable, slow-moving signals.

This article takes a different path, focusing on SEC 10-K and 10-Q filings. Unlike daily news, these are dense regulatory disclosures that reveal lasting information about a company’s trajectory. We constructed our signal using the full history (since mid-1990s) of filings from the SEC’s Electronic Data Gathering, Analysis, and Retrieval (EDGAR) database, converting the raw HTML to plain text.3

We chose a simple metric: the Jaccard similarity score (formal definition in the appendix). It asks a single question: Of all the unique words used across two filings, what fraction appears in both? A score of 1.0 means identical vocabulary; 0.0 means no overlap. A lower score means that management has substantially rewritten its language compared to the same quarter a year earlier — a pattern that the “Lazy Prices” research linked to weaker future stock performance.4 Using this transparent measure, we test whether filing language changes contain information not already captured by traditional factors, such as value, quality, momentum, and low volatility.

Read more: Records on the Tape. Savings at a Three-Year Low.