HarvardX Person-Course Academic Year 2013 De-Identified dataset, version 3.0 (doi:10.7910/DVN/26147)

View:

Part 1: Document Description
Part 2: Study Description
Part 3: Data Files Description
Part 4: Variable Description
Part 5: Other Study-Related Materials
Entire Codebook

Document Description

Citation

Title:

HarvardX Person-Course Academic Year 2013 De-Identified dataset, version 3.0

Identification Number:

doi:10.7910/DVN/26147

Distributor:

Harvard Dataverse

Date of Distribution:

2014-05-27

Version:

11

Bibliographic Citation:

HarvardX, 2014, "HarvardX Person-Course Academic Year 2013 De-Identified dataset, version 3.0", https://doi.org/10.7910/DVN/26147, Harvard Dataverse, V11, UNF:6:WSoYmsP5KeX2t/6g2JiEuw== [fileUNF]

Study Description

Citation

Title:

HarvardX Person-Course Academic Year 2013 De-Identified dataset, version 3.0

Identification Number:

doi:10.7910/DVN/26147

Authoring Entity:

HarvardX (HarvardX)

Date of Production:

2019-11-12

Distributor:

Harvard Dataverse

Distributor:

Harvard Dataverse Network

Access Authority:

Jon Daries

Date of Deposit:

2019-12-18

Date of Distribution:

2019-12-18

Holdings Information:

https://doi.org/10.7910/DVN/26147

Study Scope

Keywords:

Social Sciences, HarvardX

Abstract:

This release is comprised of de-identified data from the first year (Academic Year 2013: Fall 2012, Spring 2013, and Summer 2013) of HarvardX courses on the edX platform along with related documentation. These data are aggregate records, and each record represents one individual's activity in one edX course. For more information about the existing analyses of these data and the first year of HarvardX courses, please see the HarvardX and MITx working paper "HarvardX and MITx: The first year of open online courses" by Andrew Ho, Justin Reich, Sergiy Nesterko, Daniel Seaton, Tommy Mullaney, Jim Waldo, and Isaac Chuang (http://papers.ssrn.com/sol3/papers.cfm?abstract_id=2381263). The first release of this dataset is the HarvardX Person-Course Academic Year 2013 De-Identified dataset, version 3.0, created on November 12, 2019. File name: HXPC13_DI_v3_11-13-2019.csv The md5sum for this release (HXPC13_DI_v3_11-13-2019.csv) is: 53419b486c3b19c14d2f06612980f630

Methodology and Processing

Sources Statement

Data Access

Notes:

Users of the dataset agree to abide by the Dataverse Community Norms, specifically: Maintaining anonymity of human subjects Users of the Service should not abuse the available data that relate to human subjects and use the materials to: -obtain information that could directly or indirectly identify any research subjects, or obtain information to attempt to directly or indirectly identify any research subjects; -produce and/or publish connections among datasets that could identify individuals or organizations; or -obtain (additional) information about or (additional) means of contact for already-identified subjects.

Reminder that the terms of use include the Dataverse community norms, specifically: Maintaining anonymity of human subjects Users of the Service should not abuse the available data that relate to human subjects and use the materials to: -obtain information that could directly or indirectly identify any research subjects, or obtain information to attempt to directly or indirectly identify any research subjects; -produce and/or publish connections among datasets that could identify individuals or organizations; or -obtain (additional) information about or (additional) means of contact for already-identified subjects.

Other Study Description Materials

File Description--f3624095

File: HXPC13_DI_v3_11-13-2019.tab

  • Number of cases: 338223

  • No. of variables per record: 20

  • Type of File: text/tab-separated-values

Notes:

UNF:6:WSoYmsP5KeX2t/6g2JiEuw==

HarvardX Person-Course Academic Year 2013 De-identified Dataset, Version 2.0

Variable Description

List of Variables:

Variables

course_id

f3624095 Location:

Variable Format: character

Notes: UNF:6:5nIllYPWiMOKY55zRzdn8Q==

userid_DI

f3624095 Location:

Variable Format: character

Notes: UNF:6:/oHwfUCpOxX0sTgB4D3ndA==

registered

f3624095 Location:

Summary Statistics: Mean 1.0; Valid 338223.0; Min. 1.0; Max. 1.0; StDev 0.0

Variable Format: numeric

Notes: UNF:6:32yUU6Bx4Rmy7ZVE80rLZw==

viewed

f3624095 Location:

Summary Statistics: Max. 1.0; Min. 0.0; Valid 338223.0; Mean 0.5731603113923918; StDev 0.4946193406767063;

Variable Format: numeric

Notes: UNF:6:+rCj+Ggq9+nAK49F07UFbw==

explored

f3624095 Location:

Summary Statistics: Max. 1.0; StDev 0.23928943145615522; Min. 0.0; Mean 0.06097752074816919; Valid 338223.0;

Variable Format: numeric

Notes: UNF:6:eGaImUAyi1DL3+tyWW863Q==

certified

f3624095 Location:

Summary Statistics: StDev 0.13801368585510165; Valid 338223.0; Min. 0.0; Max. 1.0; Mean 0.019425053884562393

Variable Format: numeric

Notes: UNF:6:QzjVoaS2bXdrOcMlIvRrDQ==

final_cc_cname_DI

f3624095 Location:

Variable Format: character

Notes: UNF:6:6DKENVTo72iTvo++ow8rwg==

LoE_DI

f3624095 Location:

Variable Format: character

Notes: UNF:6:lp1xPwHS0D91HTZm5I87aw==

YoB

f3624095 Location:

Summary Statistics: StDev 9.60372961855612; Max. 2013.0; Mean 1984.0447252259614; Valid 299719.0; Min. 1931.0

Variable Format: numeric

Notes: UNF:6:aQKnrTLUvTQcEeELcFbypA==

gender

f3624095 Location:

Variable Format: character

Notes: UNF:6:5dWKqKBaOLqYpXMoeZGEcg==

grade

f3624095 Location:

Variable Format: character

Notes: UNF:6:EF2Z/9gfK4Y/IyEDZFjPCA==

start_time_DI

f3624095 Location:

Variable Format: character

Notes: UNF:6:KaN+o9rI8Xw99jhPZz8LOw==

last_event_DI

f3624095 Location:

Variable Format: character

Notes: UNF:6:hp3WdRyr+IeBf5EXfDbE1Q==

nevents

f3624095 Location:

Summary Statistics: Min. 1.0; Valid 178945.0; StDev 939.1013106434568; Mean 231.9927016680034; Max. 43880.0

Variable Format: numeric

Notes: UNF:6:Mlqnzo3hXq74Dk5IY3VslA==

ndays_act

f3624095 Location:

Summary Statistics: Mean 4.864899112475554; Min. 1.0; Valid 195713.0; Max. 176.0; StDev 9.187188638265702;

Variable Format: numeric

Notes: UNF:6:gEYLGb/P/1M5S2ThaXJ/dQ==

nplay_video

f3624095 Location:

Summary Statistics: Mean 146.9641794633158; StDev 518.4680024057607; Max. 34596.0; Valid 33277.0; Min. 1.0;

Variable Format: numeric

Notes: UNF:6:QojOM0KVdYSCIQesIorh3A==

nchapters

f3624095 Location:

Summary Statistics: Valid 193758.0; Max. 34.0; Mean 3.4877269583712587; Min. 1.0; StDev 4.720402574034257;

Variable Format: numeric

Notes: UNF:6:CsW8C9CYuDwEjIZ2UjAqwA==

nforum_posts

f3624095 Location:

Summary Statistics: Mean 0.011421458623430308; StDev 0.15018599425779372; Min. 0.0; Max. 7.0; Valid 338223.0

Variable Format: numeric

Notes: UNF:6:3Q32yxI9sFOZACpKXkRUUQ==

roles

f3624095 Location:

Summary Statistics: Mean NaN; Max. NaN; Min. NaN; Valid 0.0; StDev NaN

Variable Format: numeric

Notes: UNF:6:OHbv3tC9xEmIUT/knzNDpg==

incomplete_flag

f3624095 Location:

Summary Statistics: Mean 1.0; Max. 1.0; StDev 0.0; Valid 77385.0; Min. 1.0;

Variable Format: numeric

Notes: UNF:6:fIO+MmUTANhorcRux5soDA==

Other Study-Related Materials

Label:

Person Course Deidentification.pdf

Text:

Description of the process for de-identifying the HarvardX-MITx Person-Course Academic Year 2013 Dataset

Notes:

application/pdf

Other Study-Related Materials

Label:

Person Course Documentation.pdf

Text:

Description of data sources and variables for the HarvardX-MITx Person-Course Academic Year 2013 De-identified Dataset

Notes:

application/pdf