{"@context":{"@language":"en","@vocab":"https://schema.org/","citeAs":"cr:citeAs","column":"cr:column","conformsTo":"dct:conformsTo","cr":"http://mlcommons.org/croissant/","rai":"http://mlcommons.org/croissant/RAI/","data":{"@id":"cr:data","@type":"@json"},"dataType":{"@id":"cr:dataType","@type":"@vocab"},"dct":"http://purl.org/dc/terms/","examples":{"@id":"cr:examples","@type":"@json"},"extract":"cr:extract","field":"cr:field","fileProperty":"cr:fileProperty","fileObject":"cr:fileObject","fileSet":"cr:fileSet","format":"cr:format","includes":"cr:includes","isLiveDataset":"cr:isLiveDataset","jsonPath":"cr:jsonPath","key":"cr:key","md5":"cr:md5","parentField":"cr:parentField","path":"cr:path","recordSet":"cr:recordSet","references":"cr:references","regex":"cr:regex","repeated":"cr:repeated","replace":"cr:replace","sc":"https://schema.org/","separator":"cr:separator","source":"cr:source","subField":"cr:subField","transform":"cr:transform","wd":"https://www.wikidata.org/wiki/"},"@type":"sc:Dataset","conformsTo":"http://mlcommons.org/croissant/1.0","name":"Open e-commerce 1.0:  Five years of crowdsourced U.S. 
Amazon purchase histories with user demographics","url":"https://doi.org/10.7910/DVN/YGLYDY","creator":[{"@type":"Person","givenName":"Alex","familyName":"Berke","affiliation":{"@type":"Organization","name":"MIT Media Lab"},"sameAs":"https://orcid.org/0000-0001-5996-0557","@id":"https://orcid.org/0000-0001-5996-0557","identifier":"https://orcid.org/0000-0001-5996-0557","name":"Alex Berke"},{"@type":"Person","givenName":"Dan","familyName":"Calacci","affiliation":{"@type":"Organization","name":"Princeton University & MIT Media Lab"},"sameAs":"https://orcid.org/0000-0002-9552-1137","@id":"https://orcid.org/0000-0002-9552-1137","identifier":"https://orcid.org/0000-0002-9552-1137","name":"Dan Calacci"},{"@type":"Person","givenName":"Robert","familyName":"Mahari","affiliation":{"@type":"Organization","name":"MIT Media Lab & Harvard Law School"},"sameAs":"https://orcid.org/0000-0003-2372-2746","@id":"https://orcid.org/0000-0003-2372-2746","identifier":"https://orcid.org/0000-0003-2372-2746","name":"Robert Mahari"},{"@type":"Person","givenName":"Takahiro","familyName":"Yabe","affiliation":{"@type":"Organization","name":"MIT Institute of Data, Systems, and Society (IDSS) & New York University Center for Urban Science and Progress"},"sameAs":"https://orcid.org/0000-0001-8967-1967","@id":"https://orcid.org/0000-0001-8967-1967","identifier":"https://orcid.org/0000-0001-8967-1967","name":"Takahiro Yabe"},{"@type":"Person","givenName":"Kent","familyName":"Larson","affiliation":{"@type":"Organization","name":"MIT Media Lab"},"name":"Kent Larson"},{"@type":"Person","givenName":"Sandy","familyName":"Pentland","affiliation":{"@type":"Organization","name":"MIT Media Lab"},"name":"Sandy Pentland"}],"description":"This dataset contains longitudinal purchases data from 5027 Amazon.com users in the US, spanning 2018 through 2022: amazon-purchases.csv It also includes demographic data and other consumer level variables for each user with data in the dataset. 
These consumer level variables were collected through an online survey and are included in survey.csv. fields.csv describes the columns in the survey.csv file, where fields/survey columns correspond to survey questions. The dataset also contains the survey instrument used to collect the data. More details about the survey questions and possible responses, and the format in which they were presented, can be found by viewing the survey instrument. A 'Survey ResponseID' column is present in both the amazon-purchases.csv and survey.csv files. It links a user's survey responses to their Amazon.com purchases. The 'Survey ResponseID' was randomly generated at the time of data collection. amazon-purchases.csv: Each row in this file corresponds to an Amazon order. Each such row has the following columns: Survey ResponseID, Order date, Shipping address state, Purchase price per unit, Quantity, ASIN/ISBN (Product Code), Title, Category. The data were exported by the Amazon users from Amazon.com and shared by users with their informed consent. PII and other information not listed above were stripped from the data. This processing occurred on users' machines before sharing with researchers.","keywords":["Social Sciences","Other","e-commerce","purchase histories","crowdsourced"],"license":"http://creativecommons.org/publicdomain/zero/1.0","datePublished":"2023-12-02","dateModified":"2023-12-02","includedInDataCatalog":{"@type":"DataCatalog","name":"Harvard Dataverse","url":"https://dataverse.harvard.edu"},"publisher":{"@type":"Organization","name":"Harvard Dataverse"},"version":"1.0","citeAs":"@data{DVN/YGLYDY_2023,author = {Alex Berke and Dan Calacci and Robert Mahari and Takahiro Yabe and Kent Larson and Sandy Pentland},publisher = {Harvard Dataverse},title = {Open e-commerce 1.0: Five years of crowdsourced U.S. 
Amazon purchase histories with user demographics},year = {2023},url = {https://doi.org/10.7910/DVN/YGLYDY}}","citation":[{"@type":"CreativeWork","name":"Berke, A., Mahari, R., Pentland, S., Larson, K., Calacci, D. Insights from an experiment crowdsourcing data from thousands of US Amazon users: The importance of transparency, money, and data use. (In review).","@id":"https://github.com/aberke/amazon-study/blob/master/data-collection-survey-experiment.pdf","identifier":"https://github.com/aberke/amazon-study/blob/master/data-collection-survey-experiment.pdf","url":"https://github.com/aberke/amazon-study/blob/master/data-collection-survey-experiment.pdf"}],"distribution":[{"@type":"cr:FileObject","@id":"amazon-purchases.csv","name":"amazon-purchases.csv","encodingFormat":"text/csv","md5":"3549d72735b1211ab009ff0177b94bd9","contentSize":"313070173","description":"Amazon purchases data","contentUrl":"https://dataverse.harvard.edu/api/access/datafile/7616235"},{"@type":"cr:FileObject","@id":"fields.csv","name":"fields.csv","encodingFormat":"text/csv","md5":"45dcdf6a3f4c51ab5721a0a09e8dfdfe","contentSize":"2532","description":"Names and descriptions of columns in 
survey.csv","contentUrl":"https://dataverse.harvard.edu/api/access/datafile/7616233"},{"@type":"cr:FileObject","@id":"prescreen-survey-instrument.pdf","name":"prescreen-survey-instrument.pdf","encodingFormat":"application/pdf","md5":"56ddd736c0f30a7b1e8999aedc826b52","contentSize":"393144","description":"","contentUrl":"https://dataverse.harvard.edu/api/access/datafile/7616232"},{"@type":"cr:FileObject","@id":"survey-instrument.pdf","name":"survey-instrument.pdf","encodingFormat":"application/pdf","md5":"77bf0cb81796bb409c6c4a752c5e2f1b","contentSize":"1333077","description":"","contentUrl":"https://dataverse.harvard.edu/api/access/datafile/7616234"},{"@type":"cr:FileObject","@id":"survey.csv","name":"survey.csv","encodingFormat":"text/csv","md5":"8340a44a5691c3763702683155cb5cc7","contentSize":"1342356","description":"Survey responses, including only the responses from participants who chose to share their Amazon data (N=5027). Columns relevant to only the Qualtrics survey software and experiment setup were removed.","contentUrl":"https://dataverse.harvard.edu/api/access/datafile/7616231?format=original"}],"recordSet":[{"@type":"cr:RecordSet","field":[{"@type":"cr:Field","name":"Survey ResponseID","description":"Survey ResponseID","dataType":"sc:Text","source":{"@id":"32057804","fileObject":{"@id":"survey.csv"},"extract":{"column":"Survey 
ResponseID"}}},{"@type":"cr:Field","name":"Q-demos-age","description":"Q-demos-age","dataType":"sc:Text","source":{"@id":"32057818","fileObject":{"@id":"survey.csv"},"extract":{"column":"Q-demos-age"}}},{"@type":"cr:Field","name":"Q-demos-hispanic","description":"Q-demos-hispanic","dataType":"sc:Text","source":{"@id":"32057806","fileObject":{"@id":"survey.csv"},"extract":{"column":"Q-demos-hispanic"}}},{"@type":"cr:Field","name":"Q-demos-race","description":"Q-demos-race","dataType":"sc:Text","source":{"@id":"32057822","fileObject":{"@id":"survey.csv"},"extract":{"column":"Q-demos-race"}}},{"@type":"cr:Field","name":"Q-demos-education","description":"Q-demos-education","dataType":"sc:Text","source":{"@id":"32057803","fileObject":{"@id":"survey.csv"},"extract":{"column":"Q-demos-education"}}},{"@type":"cr:Field","name":"Q-demos-income","description":"Q-demos-income","dataType":"sc:Text","source":{"@id":"32057809","fileObject":{"@id":"survey.csv"},"extract":{"column":"Q-demos-income"}}},{"@type":"cr:Field","name":"Q-demos-gender","description":"Q-demos-gender","dataType":"sc:Text","source":{"@id":"32057815","fileObject":{"@id":"survey.csv"},"extract":{"column":"Q-demos-gender"}}},{"@type":"cr:Field","name":"Q-sexual-orientation","description":"Q-sexual-orientation","dataType":"sc:Text","source":{"@id":"32057810","fileObject":{"@id":"survey.csv"},"extract":{"column":"Q-sexual-orientation"}}},{"@type":"cr:Field","name":"Q-demos-state","description":"Q-demos-state","dataType":"sc:Text","source":{"@id":"32057824","fileObject":{"@id":"survey.csv"},"extract":{"column":"Q-demos-state"}}},{"@type":"cr:Field","name":"Q-amazon-use-howmany","description":"Q-amazon-use-howmany","dataType":"sc:Text","source":{"@id":"32057812","fileObject":{"@id":"survey.csv"},"extract":{"column":"Q-amazon-use-howmany"}}},{"@type":"cr:Field","name":"Q-amazon-use-hh-size","description":"Q-amazon-use-hh-size","dataType":"sc:Text","source":{"@id":"32057813","fileObject":{"@id":"survey.csv"},"extract":
{"column":"Q-amazon-use-hh-size"}}},{"@type":"cr:Field","name":"Q-amazon-use-how-oft","description":"Q-amazon-use-how-oft","dataType":"sc:Text","source":{"@id":"32057817","fileObject":{"@id":"survey.csv"},"extract":{"column":"Q-amazon-use-how-oft"}}},{"@type":"cr:Field","name":"Q-substance-use-cigarettes","description":"Q-substance-use-cigarettes","dataType":"sc:Text","source":{"@id":"32057819","fileObject":{"@id":"survey.csv"},"extract":{"column":"Q-substance-use-cigarettes"}}},{"@type":"cr:Field","name":"Q-substance-use-marijuana","description":"Q-substance-use-marijuana","dataType":"sc:Text","source":{"@id":"32057825","fileObject":{"@id":"survey.csv"},"extract":{"column":"Q-substance-use-marijuana"}}},{"@type":"cr:Field","name":"Q-substance-use-alcohol","description":"Q-substance-use-alcohol","dataType":"sc:Text","source":{"@id":"32057814","fileObject":{"@id":"survey.csv"},"extract":{"column":"Q-substance-use-alcohol"}}},{"@type":"cr:Field","name":"Q-personal-diabetes","description":"Q-personal-diabetes","dataType":"sc:Text","source":{"@id":"32057808","fileObject":{"@id":"survey.csv"},"extract":{"column":"Q-personal-diabetes"}}},{"@type":"cr:Field","name":"Q-personal-wheelchair","description":"Q-personal-wheelchair","dataType":"sc:Text","source":{"@id":"32057820","fileObject":{"@id":"survey.csv"},"extract":{"column":"Q-personal-wheelchair"}}},{"@type":"cr:Field","name":"Q-life-changes","description":"Q-life-changes","dataType":"sc:Text","source":{"@id":"32057805","fileObject":{"@id":"survey.csv"},"extract":{"column":"Q-life-changes"}}},{"@type":"cr:Field","name":"Q-sell-YOUR-data","description":"Q-sell-YOUR-data","dataType":"sc:Text","source":{"@id":"32057823","fileObject":{"@id":"survey.csv"},"extract":{"column":"Q-sell-YOUR-data"}}},{"@type":"cr:Field","name":"Q-sell-consumer-data","description":"Q-sell-consumer-data","dataType":"sc:Text","source":{"@id":"32057816","fileObject":{"@id":"survey.csv"},"extract":{"column":"Q-sell-consumer-data"}}},{"@type":"cr:Fiel
d","name":"Q-small-biz-use","description":"Q-small-biz-use","dataType":"sc:Text","source":{"@id":"32057821","fileObject":{"@id":"survey.csv"},"extract":{"column":"Q-small-biz-use"}}},{"@type":"cr:Field","name":"Q-census-use","description":"Q-census-use","dataType":"sc:Text","source":{"@id":"32057807","fileObject":{"@id":"survey.csv"},"extract":{"column":"Q-census-use"}}},{"@type":"cr:Field","name":"Q-research-society","description":"Q-research-society","dataType":"sc:Text","source":{"@id":"32057811","fileObject":{"@id":"survey.csv"},"extract":{"column":"Q-research-society"}}}]}]}