Predicting Returns with Text Data

Topics - Machine Learning

Read Time - 25min

We introduce a new text-mining methodology that extracts sentiment information from news articles to predict asset returns. Unlike more common sentiment scores used for stock return prediction (e.g., those sold by commercial vendors or built with dictionary-based methods), our supervised learning framework constructs a sentiment score that is specifically adapted to the problem of return prediction. Our method proceeds in three steps: 1) isolating a list of sentiment terms via predictive screening, 2) assigning sentiment weights to these words via topic modeling, and 3) aggregating terms into an article-level sentiment score via penalized likelihood. We derive theoretical guarantees on the accuracy of estimates from our model with minimal assumptions. In our empirical analysis, we text-mine one of the most actively monitored streams of news articles in the financial system—the Dow Jones Newswires—and show that our supervised sentiment model excels at extracting return-predictive signals in this context.
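To make the three steps concrete, here is a minimal Python sketch on a toy corpus. It is an illustration under stated assumptions, not the authors' implementation: the function names, the sign-frequency screening rule, the rank-based least-squares fit, the grid search, and the parameters `alpha`, `min_docs`, and `lam` are all hypothetical choices made for this example.

```python
import numpy as np

def screen_terms(counts, returns, alpha=0.1, min_docs=2):
    """Step 1 (assumed rule): keep words that co-occur with positive
    returns much more or much less often than half the time."""
    present = counts > 0                          # (n_docs, n_words)
    doc_freq = present.sum(axis=0)                # articles containing each word
    pos = (returns > 0)[:, None]
    frac_pos = (present & pos).sum(axis=0) / np.maximum(doc_freq, 1)
    keep = (np.abs(frac_pos - 0.5) >= alpha) & (doc_freq >= min_docs)
    return np.flatnonzero(keep)

def fit_term_weights(counts, returns, terms):
    """Step 2 (assumed two-topic model): regress word frequencies on
    return ranks to get a positive and a negative word distribution."""
    sub = counts[:, terms].astype(float)
    totals = sub.sum(axis=1)
    ok = totals > 0                               # skip articles with no screened words
    freq = sub[ok] / totals[ok, None]
    ranks = returns[ok].argsort().argsort() + 1   # 1 = lowest return
    p = ranks / ok.sum()
    W = np.column_stack([p, 1.0 - p])
    O, *_ = np.linalg.lstsq(W, freq, rcond=None)  # shape (2, n_terms)
    O = np.clip(O, 1e-8, None)                    # floor avoids log(0) later
    return O / O.sum(axis=1, keepdims=True)       # row 0: positive, row 1: negative

def score_article(word_counts, O, lam=0.1):
    """Step 3 (assumed penalty): maximize a penalized likelihood over a
    grid of sentiment scores in (0, 1); the penalty pulls toward 0.5."""
    total = word_counts.sum()
    if total == 0:
        return 0.5                                # no sentiment words: neutral
    grid = np.linspace(0.01, 0.99, 99)
    mix = np.outer(grid, O[0]) + np.outer(1.0 - grid, O[1])
    loglik = (word_counts * np.log(mix)).sum(axis=1) / total
    penalty = lam * np.log(grid * (1.0 - grid))
    return float(grid[np.argmax(loglik + penalty)])

# Toy corpus (made up): columns are the words "good", "bad", "the".
counts = np.array([[3, 0, 5], [2, 0, 4], [4, 0, 6],
                   [0, 3, 5], [0, 2, 4], [0, 4, 6]])
returns = np.array([0.02, 0.01, 0.03, -0.02, -0.01, -0.03])

terms = screen_terms(counts, returns)
O = fit_term_weights(counts, returns, terms)
score_good = score_article(np.array([5, 0, 7])[terms], O)
score_bad = score_article(np.array([0, 5, 7])[terms], O)
```

In this toy setup the neutral word "the" is screened out because it appears equally often with positive and negative returns, and an unseen article dominated by "good" scores above 0.5 while one dominated by "bad" scores below it. A real application would need far larger samples and out-of-sample tuning of the screening and penalty parameters.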

 

AQR Capital Management, LLC, (“AQR”) provides links to third-party websites only as a convenience, and the inclusion of such links does not imply any endorsement, approval, investigation, verification or monitoring by us of any content or information contained within or accessible from the linked sites. If you choose to visit the linked sites, you do so at your own risk, and you will be subject to such sites' terms of use and privacy policies, over which AQR.com has no control. In no event will AQR be responsible for any information or content within the linked sites or your use of the linked sites.

 

The information contained herein is only as current as of the date indicated, and may be superseded by subsequent market events or for other reasons. The views and opinions expressed herein are those of the author and do not necessarily reflect the views of AQR Capital Management, LLC, its affiliates or its employees. This information is not intended to, and does not relate specifically to any investment strategy or product that AQR offers. It is being provided merely to provide a framework to assist in the implementation of an investor’s own analysis and an investor’s own view on the topic discussed herein. Past performance is not a guarantee of future results.

 

Hypothetical performance results have many inherent limitations, some of which, but not all, are described herein. Hypothetical performance results are presented for illustrative purposes only.

 

Diversification does not eliminate the risk of experiencing investment loss.

 

Certain publications may have been written prior to the author being an employee of AQR.

This material is intended for informational purposes only and should not be construed as legal or tax advice, nor is it intended to replace the advice of a qualified attorney or tax advisor.