skip to content

The Cambridge-INET Institute - continuing as the Janeway Institute

 
WP Cover

Mueller, H. and Rauh, C.

Reading Between the Lines: Prediction of Political Violence Using Newspaper Text

WP Number: 1605

Abstract: This article provides a new methodology to predict conflict by using newspaper text. Through machine learning, vast quantities of newspaper text are reduced to interpretable topic shares. We use changes in topic shares to predict conflict one and two years before it occurs. In our predictions we distinguish between predicting the likelihood of conflict across countries and the timing of conflict within each country. Most factors identified by the literature, though performing well at predicting the location of conflict, add little to the prediction of timing. We show that news topics indeed can predict the timing of conflict onset. We also use the estimated topic shares to document how reporting changes before conflict breaks out.

Keywords: Conflict, Forecasting, Machine Learning, Panel Data, Topic Models, Latent Dirichlet Allocation

Author links: Christopher Rauh  

PDF: wp1605.pdf

Open Access Link: 10.17863/CAM.1086


Theme: transmission


Published Version of Paper: Reading Between the Lines: Prediction of Political Violence Using Newspaper Text, Mueller, H. and Rauh, C., American Political Science Review (2018)