APEX factor · NLP

NLP factor — explained

NLP scores the tone of management's narrative in the 10-K MD&A section using a finance-specific dictionary. Negative-leaning language predicts negative forward returns; obscure or hedge-laden language predicts uncertainty. The factor reads what management is signalling, not what the spreadsheet says.

Where this comes from

Academic anchor

Loughran-McDonald 2011 — When Is a Liability Not a Liability? Textual Analysis, Dictionaries, and 10-Ks
Builds the canonical finance-specific sentiment dictionary, demonstrating that the off-the-shelf Harvard-IV General Inquirer mislabels words like "liability", "tax", or "vice" as negative when they are neutral in financial context. The LM dictionary correctly classifies ~85% of 10-K language. The negative word fraction in MD&A predicts forward returns — a one-standard-deviation increase in negative tone corresponds to ~2-3% annualised underperformance, robust to size, B/M, and momentum controls.

Li 2008 separately established that 10-K readability (measured by the Fog index) negatively correlates with future earnings persistence: complex, hard-to-read filings hide bad news.
Plain English

What it actually measures

When a CEO writes 'we faced significant headwinds in our consumer segment, characterised by inventory recalibration and competitive repositioning' instead of 'sales fell because customers switched to a cheaper competitor', the obfuscation is the signal. Loughran-McDonald counts how many such hedge words, negative-finance words, and uncertain qualifiers appear relative to the section length. Li adds the readability dimension — sentences over 25 words, paragraphs over 100 words, Latin-derived rare vocabulary. Both compress to one truth: when management writes opaquely, the next four quarters tend to disappoint.

No calibration constants

Math sketch

for each 10-K MD&A section:
  neg_frac      = count(LM_negative_words)  / count(words)
  unc_frac      = count(LM_uncertain_words) / count(words)
  litig_frac    = count(LM_litigious_words) / count(words)
  fog_index     = 0.4 · ( avg_words_per_sentence + 100·hard_word_frac )
  fog_z         = z_score( fog_index )      // vs. the cross-sectional universe
nlp_raw = -1 · ( w₁·neg_frac + w₂·unc_frac + w₃·litig_frac + w₄·fog_z )
nlp = z_score( nlp_raw )       // sign flipped: less negative = bullish

Sign flip again so the factor reads bullish-when-high. Fog index is z-scored against the universe before being blended — large companies tend to have systematically denser legal language, so the absolute Fog level matters less than where a ticker sits relative to its peers. The four blend weights are not disclosed, but the four anchors (negative, uncertain, litigious, readability) are public LM/Li categories.
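The sketch above can be made concrete in a few lines of Python. The word lists below are tiny toy subsets standing in for the full LM dictionaries (which contain thousands of entries), the syllable counter is a crude vowel-group proxy for "hard words", and the equal blend weights are purely illustrative — the real weights are not disclosed.

```python
import re
import statistics

# Toy subsets standing in for the full Loughran-McDonald word lists.
LM_NEGATIVE  = {"loss", "decline", "impairment", "adverse", "headwinds"}
LM_UNCERTAIN = {"may", "approximately", "uncertain", "could", "believe"}
LM_LITIGIOUS = {"litigation", "plaintiff", "claims", "settlement"}

def tone_features(text: str) -> dict:
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[a-z]+", text.lower())
    n = len(words) or 1
    # "Hard" words proxied as 3+ syllables via a rough vowel-group count.
    hard = sum(1 for w in words if len(re.findall(r"[aeiouy]+", w)) >= 3)
    return {
        "neg_frac":   sum(w in LM_NEGATIVE  for w in words) / n,
        "unc_frac":   sum(w in LM_UNCERTAIN for w in words) / n,
        "litig_frac": sum(w in LM_LITIGIOUS for w in words) / n,
        "fog":        0.4 * (len(words) / max(len(sentences), 1) + 100 * hard / n),
    }

def z(xs):
    mu, sd = statistics.mean(xs), statistics.pstdev(xs) or 1.0
    return [(x - mu) / sd for x in xs]

def nlp_scores(mdna_texts: list[str], weights=(1.0, 1.0, 1.0, 1.0)) -> list[float]:
    feats = [tone_features(t) for t in mdna_texts]
    fog_z = z([f["fog"] for f in feats])          # Fog z-scored vs. the universe
    raw = [-(weights[0] * f["neg_frac"] + weights[1] * f["unc_frac"]
             + weights[2] * f["litig_frac"] + weights[3] * fz)
           for f, fz in zip(feats, fog_z)]
    return z(raw)   # higher = cleaner, more readable narrative
```

Run on a plain-spoken filing and an opaque, hedge-laden one, the plain filing scores higher — which is exactly the sign convention the factor uses.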

Pipeline

How DeepVane implements it

The NLP cron runs Sundays at 11:30 and 11:45 UTC. It pulls the latest 10-K from SEC EDGAR XBRL, extracts the Item 7 (MD&A) section via a regex over the embedded HTML, tokenises with the LM dictionaries (negative, positive, uncertain, litigious, modal, constraining), and scores. Coverage is currently ~74% of the universe — the regex fails on a long tail of non-standard 10-K formats, mostly older filings or REITs with irregular item-numbering. We're tuning the regex pass each month.

One coherent posterior

How it composes with APEX

NLP is the strongest companion factor for PEAD — together they decompose earnings news into the number (PEAD) and the narrative around the number (NLP). Bullish PEAD + bullish NLP fires EARNINGS MOMENTUM CASCADE; bullish PEAD + bearish NLP fires EARNINGS DISSONANCE (Tetlock 2007's mechanism — when management hedges around a beat, the beat is suspect). NLP also pairs with Insider: officers buying while writing optimistically is doubly bullish (INSIDER + NARRATIVE CONFLUENCE pattern). Finally, NLP and Quality interact — high quality with deteriorating tone is an early warning the moat is cracking.

Honest limitations

When it fails

Three known failure modes. (1) Boilerplate inflation. Compliance counsel adds risk-factor language each year; the same company's 10-K has more LM-negative words in 2025 than in 2015 even if the business is unchanged. We partly mitigate via z-scoring across the universe, but the absolute trend in negative tone has slowly drifted up. (2) Foreign filers. 20-F filings (used by ADRs) follow a different structure than 10-Ks, and our extractor's coverage is weaker there — pending fix. (3) Annual-only signals. The factor only updates on annual filings; intra-year tone shifts (10-Q MD&A, conference calls, press releases) are not yet incorporated. Adding 10-Q tone is on the post-16-May roadmap.
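The boilerplate-inflation mitigation in (1) amounts to comparing each filing against its own year's cross-section, so a universe-wide drift in negative tone cancels out. A minimal sketch (the keying scheme and helper name are illustrative, not the production code):

```python
from collections import defaultdict
import statistics

def demean_by_year(neg_fracs: dict[tuple[str, int], float]) -> dict[tuple[str, int], float]:
    """Subtract each filing year's cross-sectional mean negative-tone fraction,
    so universe-wide boilerplate growth does not read as deterioration."""
    by_year = defaultdict(list)
    for (ticker, year), v in neg_fracs.items():
        by_year[year].append(v)
    means = {y: statistics.mean(vs) for y, vs in by_year.items()}
    return {(t, y): v - means[y] for (t, y), v in neg_fracs.items()}
```

With this adjustment, a ticker whose tone worsens exactly in line with the universe keeps a flat adjusted score; only deterioration relative to peers registers.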

Read next

Related factors

PEAD · Insider · Quality

See NLP score on a real ticker

Every ticker page shows the per-factor decomposition. The NLP score is one of twelve composing the 0–100 APEX composite.

Try NVDA → · Full methodology · 35 invariants live