<codeBook xmlns="ddi:codebook:2_5" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="ddi:codebook:2_5 https://ddialliance.org/Specification/DDI-Codebook/2.5/XMLSchema/codebook.xsd" version="2.5"><docDscr><citation><titlStmt><titl>Open e-commerce 1.0:  Five years of crowdsourced U.S. Amazon purchase histories with user demographics</titl><IDNo agency="DOI">doi:10.7910/DVN/YGLYDY</IDNo></titlStmt><distStmt><distrbtr source="archive">Harvard Dataverse</distrbtr><distDate>2023-12-02</distDate></distStmt><verStmt source="archive"><version date="2023-12-02" type="RELEASED">1</version></verStmt><biblCit>Alex Berke; Dan Calacci; Robert Mahari; Takahiro Yabe; Kent Larson; Sandy Pentland, 2023, "Open e-commerce 1.0: Five years of crowdsourced U.S. Amazon purchase histories with user demographics", https://doi.org/10.7910/DVN/YGLYDY, Harvard Dataverse, V1, UNF:6:mV4isMgPXhqWeiQ3gZmmNQ== [fileUNF]</biblCit></citation></docDscr><stdyDscr><citation><titlStmt><titl>Open e-commerce 1.0:  Five years of crowdsourced U.S. Amazon purchase histories with user demographics</titl><IDNo agency="DOI">doi:10.7910/DVN/YGLYDY</IDNo></titlStmt><rspStmt><AuthEnty affiliation="MIT Media Lab">Alex Berke</AuthEnty><AuthEnty affiliation="Princeton University &amp; MIT Media Lab">Dan Calacci</AuthEnty><AuthEnty affiliation="MIT Media Lab &amp; Harvard Law School">Robert Mahari</AuthEnty><AuthEnty affiliation="MIT Institute of Data, Systems, and Society (IDSS) &amp; New York University Center for Urban Science and Progress">Takahiro Yabe</AuthEnty><AuthEnty affiliation="MIT Media Lab">Kent Larson</AuthEnty><AuthEnty affiliation="MIT Media Lab">Sandy Pentland</AuthEnty></rspStmt><prodStmt/><distStmt><distrbtr source="archive">Harvard Dataverse</distrbtr><contact affiliation="Massachusetts Institute of Technology" email="aberke@mit.edu">Alex Berke</contact><depositr>Alex Berke</depositr><depDate>2023-12-02</depDate></distStmt><holdings URI="https://doi.org/10.7910/DVN/YGLYDY"/></citation><stdyInfo><subject><keyword xml:lang="en">Social Sciences</keyword><keyword xml:lang="en">Other</keyword><keyword>e-commerce</keyword><keyword>purchase histories</keyword><keyword>crowdsourced</keyword></subject><abstract>This dataset contains longitudinal purchases data from 5027 Amazon.com users in the US, spanning 2018 through 2022: amazon-purchases.csv&lt;br>
It also includes demographic data and other consumer level variables for each user with data in the dataset. These consumer level variables were collected through an online survey and are included in survey.csv
&lt;br>
fields.csv describes the columns in the survey.csv file, where fields/survey columns correspond to survey questions. 
&lt;br>
&lt;br>
The dataset also contains the survey instrument used to collect the data.
More details about the survey questions and possible responses, and the format in which they were presented can be found by viewing the survey instrument.
&lt;br>
&lt;br>
A 'Survey ResponseID' column is present in both the amazon-purchases.csv and survey.csv files. It links a user's survey responses to their Amazon.com purchases. The 'Survey ResponseID' was randomly generated at the time of data collection. 
&lt;br>&lt;br>
&lt;b>amazon-purchases.csv&lt;/b>
&lt;br>
Each row in this file corresponds to an Amazon order. Each such row has the following columns: 
&lt;ul>
&lt;li>Survey ResponseID&lt;/li>
&lt;li>Order date&lt;/li>
&lt;li>Shipping address state&lt;/li>
&lt;li>Purchase price per unit&lt;/li>
&lt;li>Quantity&lt;/li>
&lt;li>ASIN/ISBN (Product Code)&lt;/li>
&lt;li>Title &lt;/li>
&lt;li>Category&lt;/li> 
&lt;/ul>

&lt;br>
The data were exported by the Amazon users from Amazon.com and shared by users with their informed consent. PII and other information not listed above were stripped from the data. This processing occurred on users' machines before sharing with researchers.</abstract><sumDscr/><notes>The dataset is provided for research purposes and should not be used to re-identify study participants.
&lt;br>&lt;br>
The Amazon.com purchases data were crowdsourced and shared through an online survey. Surrey participants were recruited via online research platforms Prolific and CloudResearch.
They were offered $0.35 for an estimated 1 minute prescreen and $1.50 for the main survey, with an estimated 4-7 minute completion time.
In order to be eligible for the survey, participants had to meet the following requirements: 18 years or older, U.S. resident and English speaker, have an active Amazon account that they could sign into during the survey and which they had been using since 2018.
&lt;br>
The survey prompted participants to share their Amazon data with informed consent, with the option to consent or decline to share. Participants were paid for completing the survey whether or not they chose to share their data.
&lt;br>&lt;br>
The survey tool also embedded an experiment designed to test the impact of various data transparency levels and incentives on participants' likelihood to share their Amazon data. In addition, the survey tool enabled an empirical study of the privacy paradox. More information about the survey tool, data collection process, experiment design, and experiment results can be found in our related publication.
&lt;br>&lt;br>
All software used in the data collection process is available via an open source repository: https://github.com/aberke/amazon-study 
&lt;br>&lt;br>
This data collection and publication was approved by the MIT Institutional Review Board (protocol #2205000649).</notes></stdyInfo><method><dataColl><sources/></dataColl><anlyInfo/></method><dataAccs><setAvail/><useStmt/><notes type="DVN:TOU" level="dv">&lt;a href="http://creativecommons.org/publicdomain/zero/1.0">CC0 1.0&lt;/a></notes></dataAccs><othrStdyMat><relPubl><citation><titlStmt><titl>Berke, A., Mahari, R., Pentland, S., Larson, K., Calacci, D. Insights from an experiment crowdsourcing data from thousands of US Amazon users: The importance of transparency, money, and data use. (In review).</titl></titlStmt><biblCit>Berke, A., Mahari, R., Pentland, S., Larson, K., Calacci, D. Insights from an experiment crowdsourcing data from thousands of US Amazon users: The importance of transparency, money, and data use. (In review).</biblCit></citation><ExtLink URI="https://github.com/aberke/amazon-study/blob/master/data-collection-survey-experiment.pdf"/></relPubl></othrStdyMat></stdyDscr><fileDscr ID="f7616231" URI="https://dataverse.harvard.edu/api/access/datafile/7616231"><fileTxt><fileName>survey.csv</fileName><dimensns><caseQnty>5027</caseQnty><varQnty>23</varQnty></dimensns><fileType>text/tab-separated-values</fileType></fileTxt><notes level="file" type="VDC:UNF" subject="Universal Numeric Fingerprint">UNF:6:mV4isMgPXhqWeiQ3gZmmNQ==</notes><notes level="file" type="DATAVERSE:FILEDESC" subject="DataFile Description">Survey responses, including only the responses from participants who chose to share their Amazon data (N=5027). Columns relevant to only the Qualtrics survey software and experiment setup were removed.</notes></fileDscr><dataDscr><var ID="v32057804" name="Survey ResponseID" intrvl="discrete"><location fileid="f7616231"/><labl level="variable">Survey ResponseID</labl><varFormat type="character"/><notes subject="Universal Numeric Fingerprint" level="variable" type="Dataverse:UNF">UNF:6:iz2A5v9x7adZNdEPWLCm2Q==</notes></var><var ID="v32057818" name="Q-demos-age" intrvl="discrete"><location fileid="f7616231"/><labl level="variable">Q-demos-age</labl><varFormat type="character"/><notes subject="Universal Numeric Fingerprint" level="variable" type="Dataverse:UNF">UNF:6:dLUBf2tOG8/JsQRLSkMKZQ==</notes></var><var ID="v32057806" name="Q-demos-hispanic" intrvl="discrete"><location fileid="f7616231"/><labl level="variable">Q-demos-hispanic</labl><varFormat type="character"/><notes subject="Universal Numeric Fingerprint" level="variable" type="Dataverse:UNF">UNF:6:BLxin5ByfSPzCrpTY2BalQ==</notes></var><var ID="v32057822" name="Q-demos-race" intrvl="discrete"><location fileid="f7616231"/><labl level="variable">Q-demos-race</labl><varFormat type="character"/><notes subject="Universal Numeric Fingerprint" level="variable" type="Dataverse:UNF">UNF:6:sRU0N9Bmj7ODj0GAIv8YrA==</notes></var><var ID="v32057803" name="Q-demos-education" intrvl="discrete"><location fileid="f7616231"/><labl level="variable">Q-demos-education</labl><varFormat type="character"/><notes subject="Universal Numeric Fingerprint" level="variable" type="Dataverse:UNF">UNF:6:Dy85SedHQmVhbJ802JE1/A==</notes></var><var ID="v32057809" name="Q-demos-income" intrvl="discrete"><location fileid="f7616231"/><labl level="variable">Q-demos-income</labl><varFormat type="character"/><notes subject="Universal Numeric Fingerprint" level="variable" type="Dataverse:UNF">UNF:6:4fDoYjBlQeNq22Qvd7KG5Q==</notes></var><var ID="v32057815" name="Q-demos-gender" intrvl="discrete"><location fileid="f7616231"/><labl level="variable">Q-demos-gender</labl><varFormat type="character"/><notes subject="Universal Numeric Fingerprint" level="variable" type="Dataverse:UNF">UNF:6:vnpt1jm20rWh3wrzffkGpg==</notes></var><var ID="v32057810" name="Q-sexual-orientation" intrvl="discrete"><location fileid="f7616231"/><labl level="variable">Q-sexual-orientation</labl><varFormat type="character"/><notes subject="Universal Numeric Fingerprint" level="variable" type="Dataverse:UNF">UNF:6:fkX0UbYjqdeFWX2UxelKdw==</notes></var><var ID="v32057824" name="Q-demos-state" intrvl="discrete"><location fileid="f7616231"/><labl level="variable">Q-demos-state</labl><varFormat type="character"/><notes subject="Universal Numeric Fingerprint" level="variable" type="Dataverse:UNF">UNF:6:S3CNaQVpJhjFtvKEBkP85Q==</notes></var><var ID="v32057812" name="Q-amazon-use-howmany" intrvl="discrete"><location fileid="f7616231"/><labl level="variable">Q-amazon-use-howmany</labl><varFormat type="character"/><notes subject="Universal Numeric Fingerprint" level="variable" type="Dataverse:UNF">UNF:6:aNqb7RcK8awCBKZBfwsp7A==</notes></var><var ID="v32057813" name="Q-amazon-use-hh-size" intrvl="discrete"><location fileid="f7616231"/><labl level="variable">Q-amazon-use-hh-size</labl><varFormat type="character"/><notes subject="Universal Numeric Fingerprint" level="variable" type="Dataverse:UNF">UNF:6:UqCXolcN5Wi82ozhI8u3lg==</notes></var><var ID="v32057817" name="Q-amazon-use-how-oft" intrvl="discrete"><location fileid="f7616231"/><labl level="variable">Q-amazon-use-how-oft</labl><varFormat type="character"/><notes subject="Universal Numeric Fingerprint" level="variable" type="Dataverse:UNF">UNF:6:hrnSS1ocB/NDqHGL+ibJ1g==</notes></var><var ID="v32057819" name="Q-substance-use-cigarettes" intrvl="discrete"><location fileid="f7616231"/><labl level="variable">Q-substance-use-cigarettes</labl><varFormat type="character"/><notes subject="Universal Numeric Fingerprint" level="variable" type="Dataverse:UNF">UNF:6:g96p55a8iPlmB0Fx4q+N2w==</notes></var><var ID="v32057825" name="Q-substance-use-marijuana" intrvl="discrete"><location fileid="f7616231"/><labl level="variable">Q-substance-use-marijuana</labl><varFormat type="character"/><notes subject="Universal Numeric Fingerprint" level="variable" type="Dataverse:UNF">UNF:6:1Mj8LvdkinBqhiavZAtSEw==</notes></var><var ID="v32057814" name="Q-substance-use-alcohol" intrvl="discrete"><location fileid="f7616231"/><labl level="variable">Q-substance-use-alcohol</labl><varFormat type="character"/><notes subject="Universal Numeric Fingerprint" level="variable" type="Dataverse:UNF">UNF:6:X5i/cwJ5iFdwr5IBf+n5Mw==</notes></var><var ID="v32057808" name="Q-personal-diabetes" intrvl="discrete"><location fileid="f7616231"/><labl level="variable">Q-personal-diabetes</labl><varFormat type="character"/><notes subject="Universal Numeric Fingerprint" level="variable" type="Dataverse:UNF">UNF:6:368f0rkAv8tNlzayKVrwbg==</notes></var><var ID="v32057820" name="Q-personal-wheelchair" intrvl="discrete"><location fileid="f7616231"/><labl level="variable">Q-personal-wheelchair</labl><varFormat type="character"/><notes subject="Universal Numeric Fingerprint" level="variable" type="Dataverse:UNF">UNF:6:QImnKtVfuomhxAwRuRjCkw==</notes></var><var ID="v32057805" name="Q-life-changes" intrvl="discrete"><location fileid="f7616231"/><labl level="variable">Q-life-changes</labl><varFormat type="character"/><notes subject="Universal Numeric Fingerprint" level="variable" type="Dataverse:UNF">UNF:6:zpnoYT+sa6I173rcsBipBQ==</notes></var><var ID="v32057823" name="Q-sell-YOUR-data" intrvl="discrete"><location fileid="f7616231"/><labl level="variable">Q-sell-YOUR-data</labl><varFormat type="character"/><notes subject="Universal Numeric Fingerprint" level="variable" type="Dataverse:UNF">UNF:6:mzyywvGaJDSTQZBPBco2+w==</notes></var><var ID="v32057816" name="Q-sell-consumer-data" intrvl="discrete"><location fileid="f7616231"/><labl level="variable">Q-sell-consumer-data</labl><varFormat type="character"/><notes subject="Universal Numeric Fingerprint" level="variable" type="Dataverse:UNF">UNF:6:yDbCN1ySU/lNJTKNEPcUrw==</notes></var><var ID="v32057821" name="Q-small-biz-use" intrvl="discrete"><location fileid="f7616231"/><labl level="variable">Q-small-biz-use</labl><varFormat type="character"/><notes subject="Universal Numeric Fingerprint" level="variable" type="Dataverse:UNF">UNF:6:lI2+2goCVkQ2IT0U3F1oYQ==</notes></var><var ID="v32057807" name="Q-census-use" intrvl="discrete"><location fileid="f7616231"/><labl level="variable">Q-census-use</labl><varFormat type="character"/><notes subject="Universal Numeric Fingerprint" level="variable" type="Dataverse:UNF">UNF:6:SVUC8pNogySI/We8UABzqA==</notes></var><var ID="v32057811" name="Q-research-society" intrvl="discrete"><location fileid="f7616231"/><labl level="variable">Q-research-society</labl><varFormat type="character"/><notes subject="Universal Numeric Fingerprint" level="variable" type="Dataverse:UNF">UNF:6:ATmCZMPJdmbiz/6LgdBlEA==</notes></var></dataDscr><otherMat ID="f7616235" URI="https://dataverse.harvard.edu/api/access/datafile/7616235" level="datafile"><labl>amazon-purchases.csv</labl><txt>Amazon purchases data</txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/csv</notes></otherMat><otherMat ID="f7616233" URI="https://dataverse.harvard.edu/api/access/datafile/7616233" level="datafile"><labl>fields.csv</labl><txt>Names and descriptions of columns in survey.csv</txt><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">text/csv</notes></otherMat><otherMat ID="f7616232" URI="https://dataverse.harvard.edu/api/access/datafile/7616232" level="datafile"><labl>prescreen-survey-instrument.pdf</labl><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">application/pdf</notes></otherMat><otherMat ID="f7616234" URI="https://dataverse.harvard.edu/api/access/datafile/7616234" level="datafile"><labl>survey-instrument.pdf</labl><notes level="file" type="DATAVERSE:CONTENTTYPE" subject="Content/MIME Type">application/pdf</notes></otherMat></codeBook>