Replication data for: Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts (doi:10.7910/DVN/FQBHP8)

View:

Part 1: Document Description
Part 2: Study Description
Part 5: Other Study-Related Materials
Entire Codebook

Document Description
Citation
Title:	Replication data for: Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts
Identification Number:	doi:10.7910/DVN/FQBHP8
Distributor:	Harvard Dataverse
Date of Distribution:	2012-07-23
Version:	2
Bibliographic Citation:	Grimmer, Justin; Stewart, Brandon, 2012, "Replication data for: Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts", https://doi.org/10.7910/DVN/FQBHP8, Harvard Dataverse, V2
Study Description
Citation
Title:	Replication data for: Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts
Identification Number:	doi:10.7910/DVN/FQBHP8
Authoring Entity:	Grimmer, Justin (Stanford University)
	Stewart, Brandon (Harvard University)
Producer:	Political Analysis
Date of Production:	2012-07-09
Distributor:	Harvard Dataverse
Distributor:	IQSS Dataverse Network
Access Authority:	Brandon Stewart
Date of Deposit:	2012-07-09
Series Name:	Volume 21, Issue 3
Holdings Information:	https://doi.org/10.7910/DVN/FQBHP8
Study Scope
Keywords:	statistics, text analysis, content analysis
Abstract:	Replication Materials (Data and Code) for 'Text as Data' Abstract: Politics and political conflict often occur in the written and spoken word. Scholars have long recognized this, but the massive costs of analyzing even moderately sized collections of texts have hindered their use in political science research. Here lies the promise of automated text analysis: it substantially reduces the costs of analyzing large collections of text. We provide a guide to this exciting new area of research and show how, in many instances, the methods have already obtained part of their promise. But there are pitfalls to using automated methods--they are no substitute for careful thought and close reading and require extensive and problem specific validation. We survey a wide range of new methods, provide guidance on how to validate the output of the models, and clarify misconceptions and errors in the literature. To conclude, we argue that for automated text methods to become a standard tool for political scientists, methodologists must contribute new methods and new methods of validation.
Country:	United States
Methodology and Processing
Sources Statement
Data Access
Notes:	<a href="http://creativecommons.org/publicdomain/zero/1.0">CC0 1.0</a>
Other Study Description Materials
Related Publications
Citation
Title:	Justin Grimmer and Brandon Stewart. 2013. "Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts." Political Analysis (Summer 2013) 21 (3): 267-297. <a href= "http://stanford.edu/~jgrimmer/tad2.pdf" target= "_new">article available here</a>
Identification Number:	10.1093/pan/mps028
Bibliographic Citation:	Justin Grimmer and Brandon Stewart. 2013. "Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts." Political Analysis (Summer 2013) 21 (3): 267-297. <a href= "http://stanford.edu/~jgrimmer/tad2.pdf" target= "_new">article available here</a>
Other Study-Related Materials
Label:	ReplicationFile.zip
Text:	Replication Code and Data in R Statistical Computing Language
Notes:	application/octet-stream