<?xml version='1.0' encoding='UTF-8'?><codeBook xmlns="ddi:codebook:2_5" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="ddi:codebook:2_5 https://ddialliance.org/Specification/DDI-Codebook/2.5/XMLSchema/codebook.xsd" version="2.5"><docDscr><citation><titlStmt><titl>Replication Data for: Not Just Conspiracy Theories:  Vaccine Opponents and Proponents add to the COVID-19 ‘Infodemic’ on Twitter</titl><IDNo agency="DOI">doi:10.7910/DVN/9ICICY</IDNo></titlStmt><distStmt><distrbtr source="archive">Harvard Dataverse</distrbtr><distDate>2020-08-26</distDate></distStmt><verStmt source="archive"><version date="2020-08-26" type="RELEASED">1</version></verStmt><biblCit>Broniatowski, David, 2020, "Replication Data for: Not Just Conspiracy Theories: Vaccine Opponents and Proponents add to the COVID-19 ‘Infodemic’ on Twitter", https://doi.org/10.7910/DVN/9ICICY, Harvard Dataverse, V1</biblCit></citation></docDscr><stdyDscr><citation><titlStmt><titl>Replication Data for: Not Just Conspiracy Theories:  Vaccine Opponents and Proponents add to the COVID-19 ‘Infodemic’ on Twitter</titl><IDNo agency="DOI">doi:10.7910/DVN/9ICICY</IDNo></titlStmt><rspStmt><AuthEnty affiliation="The George Washington University">Broniatowski, David</AuthEnty></rspStmt><prodStmt/><distStmt><distrbtr source="archive">Harvard Dataverse</distrbtr><contact affiliation="The George Washington University" email="broniatowski@gwu.edu">Broniatowski, David</contact><depositr>Broniatowski, David</depositr><depDate>2020-08-25</depDate></distStmt><holdings URI="https://doi.org/10.7910/DVN/9ICICY"/></citation><stdyInfo><subject><keyword xml:lang="en">Computer and Information Science</keyword><keyword xml:lang="en">Medicine, Health and Life Sciences</keyword><keyword xml:lang="en">Social Sciences</keyword></subject><abstract>These files contain the data required to replicate all findings in the referenced paper. Files include:&#xd;
&#xd;
1) 2000_Account_IDs.txt -- a tab-separated text file listing the top 2000 accounts mentioning vaccine-related keywords in CY 2019.&#xd;
2) users_ids.csv -- a comma-separated file listing all tweet IDs containing coronavirus-related keywords generated by each of the 2000 accounts. The first entry on each line is a username, followed by a list of tweet IDs.&#xd;
3) users_botscores.txt -- a tab-separated text file listing the bot scores generated from querying Botometer on March 2, 2020. The first entry is the raw (English) bot score and the second entry is the CAP score. &#xd;
4) corona_topic_keys.txt -- the top 20 words for each of 35 topics generated using the LDA algorithm fit to all tweets listed in users_ids.csv&#xd;
5) corona_doc_topics.txt -- LDA model topic results fit to each tweet in users_ids.csv. The second column corresponds to the tweet ID, and the following 35 columns are topic proportions for topics 0-34, respectively.</abstract><sumDscr/></stdyInfo><method><dataColl><sources/></dataColl><anlyInfo/></method><dataAccs><setAvail/><useStmt/><notes type="DVN:TOU" level="dv">&lt;a href="http://creativecommons.org/publicdomain/zero/1.0">CC0 1.0&lt;/a></notes></dataAccs><othrStdyMat/></stdyDscr><otherMat ID="f4033950" URI="https://dataverse.harvard.edu/api/access/datafile/4033950" level="datafile"><labl>2000_Account_IDs.txt</labl><txt>User ID, retweet count, and annotations for the 2000 most prolific accounts in the vaccine stream Twitter archive for calendar year 2019.</txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/plain</notes></otherMat><otherMat ID="f4033949" URI="https://dataverse.harvard.edu/api/access/datafile/4033949" level="datafile"><labl>corona_doc_topics.txt</labl><txt>LDA model topic results fit to each tweet in users_ids.csv. The second column corresponds to the tweet ID, and the following 35 columns are topic proportions for topics 0-34, respectively.</txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/plain</notes></otherMat><otherMat ID="f4033948" URI="https://dataverse.harvard.edu/api/access/datafile/4033948" level="datafile"><labl>corona_topic_keys.txt</labl><txt>The top 20 words for each of 35 topics generated using the LDA algorithm fit to all tweets listed in users_ids.csv</txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/plain</notes></otherMat><otherMat ID="f4033951" URI="https://dataverse.harvard.edu/api/access/datafile/4033951" level="datafile"><labl>users_botscores.txt</labl><txt>A tab-separated text file listing the bot scores generated from querying Botometer on March 2, 2020. The first entry is the raw (English) bot score and the second entry is the CAP score. </txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/plain</notes></otherMat><otherMat ID="f4033947" URI="https://dataverse.harvard.edu/api/access/datafile/4033947" level="datafile"><labl>users_ids.csv</labl><txt>A comma-separated file listing all tweet IDs containing coronavirus-related keywords generated by each of the 2000 accounts. The first entry on each line is a username, followed by a list of tweet IDs.</txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/csv</notes></otherMat></codeBook>