Replication Data for: 'Race to the Bottom: Competition and Quality in Science' (doi:10.7910/DVN/KD7A8B)

View:

Part 1: Document Description
Part 2: Study Description
Part 3: Data Files Description
Part 4: Variable Description
Part 5: Other Study-Related Materials
Entire Codebook

(external link)

Document Description

Citation

Title:

Replication Data for: 'Race to the Bottom: Competition and Quality in Science'

Identification Number:

doi:10.7910/DVN/KD7A8B

Distributor:

Harvard Dataverse

Date of Distribution:

2025-01-23

Version:

2

Bibliographic Citation:

Hill, Ryan; Stein, Carolyn, 2025, "Replication Data for: 'Race to the Bottom: Competition and Quality in Science'", https://doi.org/10.7910/DVN/KD7A8B, Harvard Dataverse, V2, UNF:6:NiV5uyFxWELxKJbH7j7h5w== [fileUNF]

Study Description

Citation

Title:

Replication Data for: 'Race to the Bottom: Competition and Quality in Science'

Identification Number:

doi:10.7910/DVN/KD7A8B

Authoring Entity:

Hill, Ryan (Northwestern University)

Stein, Carolyn (UC Berkeley)

Distributor:

Harvard Dataverse

Access Authority:

Stein, Carolyn

Depositor:

Baranga, Thomas

Date of Deposit:

2025-01-23

Holdings Information:

https://doi.org/10.7910/DVN/KD7A8B

Study Scope

Keywords:

Social Sciences, Asymmetric and Private Information; Mechanism Design, Analysis of Health Care Markets, Higher Education; Research Institutions, Innovation and Invention: Processes and Incentives, Open Innovation

Abstract:

The data and programs replicate tables and figures from "Race to the Bottom: Competition and Quality in Science," by Hill and Stein. Please see the README file for additional details.

Methodology and Processing

Sources Statement

Data Access

Notes:

<a href="http://creativecommons.org/publicdomain/zero/1.0">CC0 1.0</a>

Other Study Description Materials

Related Publications

Citation

Title:

Hill, Ryan, and Carolyn Stein, "Race to the Bottom: Competition and Quality in Science," Quarterly Journal of Economics, vol. 140 no.2, May 2025, pp. 1111-1185.

Bibliographic Citation:

Hill, Ryan, and Carolyn Stein, "Race to the Bottom: Competition and Quality in Science," Quarterly Journal of Economics, vol. 140 no.2, May 2025, pp. 1111-1185.

File Description--f11716964

File: db_drug_index.tab

  • Number of cases: 11355

  • No. of variables per record: 17

  • Type of File: text/tab-separated-values

Notes:

UNF:6:jGp+KjY8ltxkG1QnhF0Kzw==

File Description--f11716973

File: db_target_all.tab

  • Number of cases: 5120

  • No. of variables per record: 13

  • Type of File: text/tab-separated-values

Notes:

UNF:6:VJzzodUucm+AWCKb6LqpqA==

File Description--f11716962

File: db_target_pharmacologically_active.tab

  • Number of cases: 1256

  • No. of variables per record: 13

  • Type of File: text/tab-separated-values

Notes:

UNF:6:iFxBY6BeSg1agqMpPlJ+4A==

File Description--f11716969

File: pdb_Citation.tab

  • Number of cases: 145586

  • No. of variables per record: 11

  • Type of File: text/tab-separated-values

Notes:

UNF:6:klhqzfzZpB+jNb62Bga7qg==

File Description--f11716983

File: pdb_ClusterEntity.tab

  • Number of cases: 429807

  • No. of variables per record: 14

  • Type of File: text/tab-separated-values

Notes:

UNF:6:nMSUPTeAmJRNK0PSpWtr8w==

File Description--f11716957

File: pdb_DataCollectionDetails.tab

  • Number of cases: 137067

  • No. of variables per record: 5

  • Type of File: text/tab-separated-values

Notes:

UNF:6:k1djg+Ea6Li8n+H0Ll9RIQ==

File Description--f11716992

File: pdb_RefinementDetails.tab

  • Number of cases: 131836

  • No. of variables per record: 7

  • Type of File: text/tab-separated-values

Notes:

UNF:6:tLgOOBp8+mwRlUi3/RcBJg==

File Description--f11716963

File: pdb_RefinementParameters.tab

  • Number of cases: 132131

  • No. of variables per record: 4

  • Type of File: text/tab-separated-values

Notes:

UNF:6:qyjh4HspX1I+H8rV4YYVtQ==

File Description--f11716966

File: pdb_Sequence.tab

  • Number of cases: 490717

  • No. of variables per record: 9

  • Type of File: text/tab-separated-values

Notes:

UNF:6:2BlzdVnrtseUU8459XdsVg==

File Description--f11716982

File: pdb_StructureSummary.tab

  • Number of cases: 144173

  • No. of variables per record: 15

  • Type of File: text/tab-separated-values

Notes:

UNF:6:LbvKE2F9w3kIA1rKtEaIEQ==

File Description--f11716993

File: phat.tab

  • Number of cases: 20434

  • No. of variables per record: 3

  • Type of File: text/tab-separated-values

Notes:

UNF:6:MzvZM/pj52KAuuPMeFzNOw==

File Description--f11716981

File: scooped_groups.tab

  • Number of cases: 10410

  • No. of variables per record: 6

  • Type of File: text/tab-separated-values

Notes:

UNF:6:7ZGzMJ8oWA4bFvYJFdDeNg==

File Description--f11716979

File: survey_data_050623.tab

  • Number of cases: 10564

  • No. of variables per record: 31

  • Type of File: text/tab-separated-values

Notes:

UNF:6:Zc+9Dd7iS1rrnVfIVjpV9w==

File Description--f11716960

File: wos_pubmed2pubmed.tab

  • Number of cases: 604016

  • No. of variables per record: 9

  • Type of File: text/tab-separated-values

Notes:

UNF:6:xRsi/Hljcx8dsJ/TG3gy0Q==

Variable Description

List of Variables:

Variables

DrugBank ID

f11716964 Location:

Summary Statistics: StDev NaN; Min. NaN; Valid 0.0; Max. NaN; Mean NaN

Variable Format: numeric

Notes: UNF:6:tgzvlz9Pw9c8tGf4dIT6ZA==

Name

f11716964 Location:

Summary Statistics: Valid 0.0; StDev NaN; Min. NaN; Max. NaN; Mean NaN

Variable Format: numeric

Notes: UNF:6:tgzvlz9Pw9c8tGf4dIT6ZA==

CAS Number

f11716964 Location:

Summary Statistics: StDev NaN; Max. NaN; Min. NaN; Valid 0.0; Mean NaN;

Variable Format: numeric

Notes: UNF:6:tgzvlz9Pw9c8tGf4dIT6ZA==

Drug Groups

f11716964 Location:

Summary Statistics: Valid 0.0; Min. NaN; Mean NaN; Max. NaN; StDev NaN;

Variable Format: numeric

Notes: UNF:6:tgzvlz9Pw9c8tGf4dIT6ZA==

InChIKey

f11716964 Location:

Summary Statistics: Mean NaN; Valid 0.0; StDev NaN; Max. NaN; Min. NaN;

Variable Format: numeric

Notes: UNF:6:tgzvlz9Pw9c8tGf4dIT6ZA==

InChI

f11716964 Location:

Summary Statistics: StDev NaN; Valid 0.0; Mean NaN; Max. NaN; Min. NaN;

Variable Format: numeric

Notes: UNF:6:tgzvlz9Pw9c8tGf4dIT6ZA==

SMILES

f11716964 Location:

Summary Statistics: Max. NaN; Min. NaN; StDev NaN; Valid 0.0; Mean NaN

Variable Format: numeric

Notes: UNF:6:tgzvlz9Pw9c8tGf4dIT6ZA==

Formula

f11716964 Location:

Summary Statistics: Max. NaN; StDev NaN; Min. NaN; Mean NaN; Valid 0.0;

Variable Format: numeric

Notes: UNF:6:tgzvlz9Pw9c8tGf4dIT6ZA==

KEGG Compound ID

f11716964 Location:

Summary Statistics: StDev NaN; Max. NaN; Min. NaN; Valid 0.0; Mean NaN

Variable Format: numeric

Notes: UNF:6:tgzvlz9Pw9c8tGf4dIT6ZA==

KEGG Drug ID

f11716964 Location:

Summary Statistics: Mean NaN; Valid 0.0; Max. NaN; StDev NaN; Min. NaN;

Variable Format: numeric

Notes: UNF:6:tgzvlz9Pw9c8tGf4dIT6ZA==

PubChem Compound ID

f11716964 Location:

Summary Statistics: Max. NaN; Mean NaN; StDev NaN; Min. NaN; Valid 0.0

Variable Format: numeric

Notes: UNF:6:tgzvlz9Pw9c8tGf4dIT6ZA==

PubChem Substance ID

f11716964 Location:

Summary Statistics: Min. NaN; Mean NaN; Max. NaN; StDev NaN; Valid 0.0;

Variable Format: numeric

Notes: UNF:6:tgzvlz9Pw9c8tGf4dIT6ZA==

ChEBI ID

f11716964 Location:

Summary Statistics: StDev NaN; Valid 0.0; Max. NaN; Mean NaN; Min. NaN

Variable Format: numeric

Notes: UNF:6:tgzvlz9Pw9c8tGf4dIT6ZA==

ChEMBL ID

f11716964 Location:

Summary Statistics: Valid 0.0; StDev NaN; Mean NaN; Max. NaN; Min. NaN

Variable Format: numeric

Notes: UNF:6:tgzvlz9Pw9c8tGf4dIT6ZA==

HET ID

f11716964 Location:

Summary Statistics: StDev NaN; Valid 0.0; Mean NaN; Max. NaN; Min. NaN;

Variable Format: numeric

Notes: UNF:6:tgzvlz9Pw9c8tGf4dIT6ZA==

ChemSpider ID

f11716964 Location:

Summary Statistics: Mean NaN; Min. NaN; Valid 0.0; Max. NaN; StDev NaN

Variable Format: numeric

Notes: UNF:6:tgzvlz9Pw9c8tGf4dIT6ZA==

BindingDB ID

f11716964 Location:

Summary Statistics: Mean 0.0; Valid 11355.0; Min. 0.0; Max. 0.0; StDev 0.0

Variable Format: numeric

Notes: UNF:6:2jUh8AubANfzHkKM1ZS7lw==

ID

f11716973 Location:

Summary Statistics: Min. 0.0; Mean 0.0; Valid 5120.0; Max. 0.0; StDev 0.0;

Variable Format: numeric

Notes: UNF:6:KR+FyhCsnj6ODpFlHULLXg==

Name

f11716973 Location:

Summary Statistics: Mean NaN; Valid 0.0; Min. NaN; StDev NaN; Max. NaN

Variable Format: numeric

Notes: UNF:6:Apn3V6haGq1svhrSsO2pJQ==

Gene Name

f11716973 Location:

Summary Statistics: Mean NaN; Min. NaN; StDev NaN; Max. NaN; Valid 0.0

Variable Format: numeric

Notes: UNF:6:Apn3V6haGq1svhrSsO2pJQ==

GenBank Protein ID

f11716973 Location:

Summary Statistics: Max. 0.0; Valid 5120.0; StDev 0.0; Mean 0.0; Min. 0.0;

Variable Format: numeric

Notes: UNF:6:KR+FyhCsnj6ODpFlHULLXg==

GenBank Gene ID

f11716973 Location:

Summary Statistics: Min. NaN; Max. NaN; Valid 0.0; StDev NaN; Mean NaN;

Variable Format: numeric

Notes: UNF:6:Apn3V6haGq1svhrSsO2pJQ==

UniProt ID

f11716973 Location:

Summary Statistics: Mean NaN; Min. NaN; Valid 0.0; StDev NaN; Max. NaN

Variable Format: numeric

Notes: UNF:6:Apn3V6haGq1svhrSsO2pJQ==

Uniprot Title

f11716973 Location:

Summary Statistics: Min. NaN; Max. NaN; Mean NaN; Valid 0.0; StDev NaN;

Variable Format: numeric

Notes: UNF:6:Apn3V6haGq1svhrSsO2pJQ==

PDB ID

f11716973 Location:

Summary Statistics: Valid 0.0; Max. NaN; Mean NaN; Min. NaN; StDev NaN;

Variable Format: numeric

Notes: UNF:6:Apn3V6haGq1svhrSsO2pJQ==

GeneCard ID

f11716973 Location:

Summary Statistics: Valid 0.0; Mean NaN; StDev NaN; Min. NaN; Max. NaN;

Variable Format: numeric

Notes: UNF:6:Apn3V6haGq1svhrSsO2pJQ==

GenAtlas ID

f11716973 Location:

Summary Statistics: Mean NaN; StDev NaN; Min. NaN; Max. NaN; Valid 0.0;

Variable Format: numeric

Notes: UNF:6:Apn3V6haGq1svhrSsO2pJQ==

HGNC ID

f11716973 Location:

Summary Statistics: Max. NaN; Valid 0.0; Mean NaN; Min. NaN; StDev NaN

Variable Format: numeric

Notes: UNF:6:Apn3V6haGq1svhrSsO2pJQ==

Species

f11716973 Location:

Summary Statistics: Max. NaN; StDev NaN; Min. NaN; Valid 0.0; Mean NaN

Variable Format: numeric

Notes: UNF:6:Apn3V6haGq1svhrSsO2pJQ==

Drug IDs

f11716973 Location:

Summary Statistics: Min. NaN; Max. NaN; Mean NaN; Valid 0.0; StDev NaN

Variable Format: numeric

Notes: UNF:6:Apn3V6haGq1svhrSsO2pJQ==

ID

f11716962 Location:

Summary Statistics: Min. 0.0; Max. 0.0; StDev 0.0; Mean 0.0; Valid 1256.0

Variable Format: numeric

Notes: UNF:6:PJZHTS9Qde4FyxeGpF3qfQ==

Name

f11716962 Location:

Summary Statistics: StDev NaN; Valid 0.0; Min. NaN; Max. NaN; Mean NaN

Variable Format: numeric

Notes: UNF:6:/XCeuZ3BnTEG46NJQqC1YA==

Gene Name

f11716962 Location:

Summary Statistics: Valid 0.0; Mean NaN; Max. NaN; Min. NaN; StDev NaN

Variable Format: numeric

Notes: UNF:6:/XCeuZ3BnTEG46NJQqC1YA==

GenBank Protein ID

f11716962 Location:

Summary Statistics: Mean 0.0; Min. 0.0; Valid 1256.0; Max. 0.0; StDev 0.0

Variable Format: numeric

Notes: UNF:6:PJZHTS9Qde4FyxeGpF3qfQ==

GenBank Gene ID

f11716962 Location:

Summary Statistics: Max. NaN; StDev NaN; Mean NaN; Min. NaN; Valid 0.0

Variable Format: numeric

Notes: UNF:6:/XCeuZ3BnTEG46NJQqC1YA==

UniProt ID

f11716962 Location:

Summary Statistics: Mean NaN; StDev NaN; Valid 0.0; Min. NaN; Max. NaN

Variable Format: numeric

Notes: UNF:6:/XCeuZ3BnTEG46NJQqC1YA==

Uniprot Title

f11716962 Location:

Summary Statistics: Max. NaN; Min. NaN; Valid 0.0; Mean NaN; StDev NaN

Variable Format: numeric

Notes: UNF:6:/XCeuZ3BnTEG46NJQqC1YA==

PDB ID

f11716962 Location:

Summary Statistics: StDev NaN; Valid 0.0; Min. NaN; Mean NaN; Max. NaN

Variable Format: numeric

Notes: UNF:6:/XCeuZ3BnTEG46NJQqC1YA==

GeneCard ID

f11716962 Location:

Summary Statistics: Mean NaN; Max. NaN; Valid 0.0; StDev NaN; Min. NaN;

Variable Format: numeric

Notes: UNF:6:/XCeuZ3BnTEG46NJQqC1YA==

GenAtlas ID

f11716962 Location:

Summary Statistics: Min. NaN; Valid 0.0; Mean NaN; StDev NaN; Max. NaN

Variable Format: numeric

Notes: UNF:6:/XCeuZ3BnTEG46NJQqC1YA==

HGNC ID

f11716962 Location:

Summary Statistics: Valid 0.0; StDev NaN; Mean NaN; Max. NaN; Min. NaN;

Variable Format: numeric

Notes: UNF:6:/XCeuZ3BnTEG46NJQqC1YA==

Species

f11716962 Location:

Summary Statistics: Valid 0.0; Mean NaN; Max. NaN; StDev NaN; Min. NaN

Variable Format: numeric

Notes: UNF:6:/XCeuZ3BnTEG46NJQqC1YA==

Drug IDs

f11716962 Location:

Summary Statistics: Max. NaN; Valid 0.0; StDev NaN; Mean NaN; Min. NaN;

Variable Format: numeric

Notes: UNF:6:/XCeuZ3BnTEG46NJQqC1YA==

structureId

f11716969 Location:

Variable Format: character

Notes: UNF:6:spZ07Q7m5YdTorQcI6QvMg==

authors

f11716969 Location:

Variable Format: character

Notes: UNF:6:iHECryIWt1JGvEst9q/yiA==

publicationYear

f11716969 Location:

Summary Statistics: Min. 0.0; StDev 8.856074016123205; Valid 120686.0; Max. 2018.0; Mean 2009.0579851846962

Variable Format: numeric

Notes: UNF:6:TgvjL3ZirMJk7MFiI+OVjw==

title

f11716969 Location:

Variable Format: character

Notes: UNF:6:9E1urN589szmxgzFhZiLWA==

journalName

f11716969 Location:

Variable Format: character

Notes: UNF:6:DRtO0jApVaQxEequ+mx3yg==

volumeId

f11716969 Location:

Variable Format: character

Notes: UNF:6:+99GfBQOD0a0aXnvm0J+yQ==

firstPage

f11716969 Location:

Variable Format: character

Notes: UNF:6:EjF2qH5lu0CJq1yo95KPzw==

lastPage

f11716969 Location:

Variable Format: character

Notes: UNF:6:fqM+Lz8y310LRknuOZ2u7g==

pubmedId

f11716969 Location:

Summary Statistics: Valid 117771.0; Mean 2.016163786417924E7; Max. 2.9741882E7; Min. 5.0; StDev 6352898.198012639

Variable Format: numeric

Notes: UNF:6:XmQSFKvRvRApA4gx+eDc0A==

pmc

f11716969 Location:

Variable Format: character

Notes: UNF:6:JOPJ0CfcTn9i3DfJOU71Sw==

doi

f11716969 Location:

Variable Format: character

Notes: UNF:6:YGO3e5LLAr2HMG2U0N8rnQ==

structureId

f11716983 Location:

Variable Format: character

Notes: UNF:6:943AoBGGaGmUUywiYrZV6Q==

chainId

f11716983 Location:

Variable Format: character

Notes: UNF:6:1fbwa7c1MlriRNG+TNdu3w==

entityId

f11716983 Location:

Summary Statistics: Mean 4.752253918608449; Max. 91.0; StDev 10.743163631936978; Valid 429807.0; Min. 1.0

Variable Format: numeric

Notes: UNF:6:xVRZpPDTkLcjCPwfC0FneQ==

clusterNumber100

f11716983 Location:

Summary Statistics: Max. 67020.0; Mean 11813.966554798702; Valid 396918.0; Min. 1.0; StDev 15609.582324114106

Variable Format: numeric

Notes: UNF:6:TjeVYZB3ODVwc4BbAvBIWw==

clusterNumber95

f11716983 Location:

Summary Statistics: Min. 1.0; Mean 8558.07345597312; StDev 11938.829273842279; Valid 396918.0; Max. 54215.0;

Variable Format: numeric

Notes: UNF:6:TgZ9I1YXi8bb13DFSyK/oQ==

clusterNumber90

f11716983 Location:

Summary Statistics: Mean 7899.684914768714; Valid 396918.0; Max. 51390.0; StDev 11158.22193538808; Min. 1.0;

Variable Format: numeric

Notes: UNF:6:H3n01PjK12AEn6qnj+u5Cw==

uniprotAcc

f11716983 Location:

Variable Format: character

Notes: UNF:6:uP+ON0JOP+0fX16RJMSG5A==

uniprotRecommendedName

f11716983 Location:

Variable Format: character

Notes: UNF:6:H6UIVLEH+0cyzUStG4MKfA==

uniprotAlternativeNames

f11716983 Location:

Variable Format: character

Notes: UNF:6:pujF9TYHIpjWE7wqhOMQQw==

geneName

f11716983 Location:

Variable Format: character

Notes: UNF:6:nSo1LfTrYzOQoSpcKtIXEQ==

authorAssignedEntityName

f11716983 Location:

Variable Format: character

Notes: UNF:6:tbZ5ASqxoQ9oD+jF4hye2A==

synonym

f11716983 Location:

Variable Format: character

Notes: UNF:6:GOmRzf/p7p75Qj4RsbltMQ==

taxonomy

f11716983 Location:

Variable Format: character

Notes: UNF:6:upO+usjuhDwmIYFUvwGHuw==

taxonomyId

f11716983 Location:

Variable Format: character

Notes: UNF:6:Uv7dcA+hpbFaSyRSNc3cmQ==

structureId

f11716957 Location:

Variable Format: character

Notes: UNF:6:LUJwnrDDWhA5g0so8+EwRw==

device

f11716957 Location:

Variable Format: character

Notes: UNF:6:ckKYMl6jTA+OztrkwjN65w==

diffractionSource

f11716957 Location:

Variable Format: character

Notes: UNF:6:hWdEmvWvjUnCejax9W5g7g==

collectionDate

f11716957 Location:

Variable Format: character

Notes: UNF:6:JQpG9w+3AtYbLg7eUzqwyg==

collectionTemperature

f11716957 Location:

Summary Statistics: StDev 51.698349026037334; Valid 123536.0; Mean 115.96184561583769; Max. 1400.0; Min. 1.0

Variable Format: numeric

Notes: UNF:6:tQOmJ9KiCyopY7BOHo1CJw==

structureId

f11716992 Location:

Variable Format: character

Notes: UNF:6:Fj0ZHbzBQhgEyDQ5+5sZaQ==

rObserved

f11716992 Location:

Summary Statistics: Min. 0.035; Valid 118535.0; StDev 0.034159976206063064; Max. 0.97; Mean 0.1953621630741974;

Variable Format: numeric

Notes: UNF:6:Mo8m+J4fm9zdU685J8cVWQ==

rAll

f11716992 Location:

Summary Statistics: Mean 0.19960813938807287; Min. 0.017; Max. 1.169; Valid 23761.0; StDev 0.0393016202758425;

Variable Format: numeric

Notes: UNF:6:HT1AxishuGFBfSjxiSdPkg==

rWork

f11716992 Location:

Summary Statistics: StDev 0.033770573559456306; Valid 126927.0; Mean 0.19486690775012386; Min. 0.042; Max. 0.615

Variable Format: numeric

Notes: UNF:6:wcrgNMNxzwTX8HIm7tfJHg==

rFree

f11716992 Location:

Summary Statistics: Min. 0.049; Valid 124114.0; Max. 0.516; Mean 0.23597594953027043; StDev 0.0393557523674627

Variable Format: numeric

Notes: UNF:6:XD4Ap46XgspVfgGpXYy8ww==

averageBFactor

f11716992 Location:

Summary Statistics: Mean 37.78244337708995; StDev 27.52450401504627; Valid 95147.0; Min. -13.76; Max. 696.1;

Variable Format: numeric

Notes: UNF:6:a+F7gjMg4ys4Mp2hQXA9yA==

refinementResolution

f11716992 Location:

Summary Statistics: Mean 2.2007510547879336; Max. 70.0; Min. 0.48; StDev 1.1543708645205406; Valid 131069.0

Variable Format: numeric

Notes: UNF:6:Qb/JLk04y9Y1QRXi2gD+lQ==

structureId

f11716963 Location:

Variable Format: character

Notes: UNF:6:IP2GEmZzoH5yMAXIsUtgjQ==

highResolutionLimit

f11716963 Location:

Summary Statistics: Mean 2.2013536168676335; Valid 131281.0; Max. 70.0; StDev 1.1544724127937342; Min. 0.48

Variable Format: numeric

Notes: UNF:6:03iAAH5UkVf8eSRvlNyTYw==

reflectionsForRefinement

f11716963 Location:

Summary Statistics: Valid 127787.0; Max. 1.5015304E7; Mean 57738.24101826785; StDev 138983.7495465496; Min. 0.0

Variable Format: numeric

Notes: UNF:6:hH66WqvUOhg3S86PKnJSoQ==

structureDeterminationMethod

f11716963 Location:

Variable Format: character

Notes: UNF:6:pZxh/bwYI2NYfpBCvcywYA==

structureId

f11716966 Location:

Variable Format: character

Notes: UNF:6:knNbD6en7DxXOkpvmq0ymg==

origChainId

f11716966 Location:

Variable Format: character

Notes: UNF:6:ahZbrTB/aFA9LAX8kjYkRw==

entityId

f11716966 Location:

Summary Statistics: Mean 5.397157628518579; Min. 1.0; StDev 11.379686083032961; Max. 163.0; Valid 490717.0

Variable Format: numeric

Notes: UNF:6:lB4O570lS5fOYL9C0o2Vqg==

db_code

f11716966 Location:

Variable Format: character

Notes: UNF:6:SHAM7Mh9vCnoM0NfERtk/A==

db_name

f11716966 Location:

Variable Format: character

Notes: UNF:6:+c5hMNHFCF/i9Tk2a62HVg==

sequence

f11716966 Location:

Variable Format: character

Notes: UNF:6:wjS5l9QDrNIQyIWvAqhDwA==

chainLength

f11716966 Location:

Summary Statistics: Min. 1.0; Max. 5070.0; StDev 263.26143932527845; Valid 490717.0; Mean 248.38850090780178

Variable Format: numeric

Notes: UNF:6:fNK4X0LMTp680tannNgw0w==

molecularWeight

f11716966 Location:

Summary Statistics: Min. 158.179; StDev 59265.9632409755; Max. 1641910.0; Mean 30613.250154936333; Valid 490717.0

Variable Format: numeric

Notes: UNF:6:0Rzl9fLIr0J5HarKz3uYvQ==

entityMacromoleculeType

f11716966 Location:

Variable Format: character

Notes: UNF:6:nonyFqvoa7E1x/cHqLGb7g==

structureId

f11716982 Location:

Variable Format: character

Notes: UNF:6:luyght8MPbMzScybj9BOCA==

structureTitle

f11716982 Location:

Variable Format: character

Notes: UNF:6:R3XPlI0M+zdbt1bUo2k7mQ==

experimentalTechnique

f11716982 Location:

Variable Format: character

Notes: UNF:6:clywTQwhhPNdiprNY2vCVw==

ndbId

f11716982 Location:

Variable Format: character

Notes: UNF:6:aPNsG7T04p95StXbAHUw0w==

resolution

f11716982 Location:

Summary Statistics: Mean 2.2640487983297; Max. 70.0; Valid 131234.0; Min. 0.48; StDev 1.404899836868882

Variable Format: numeric

Notes: UNF:6:Itut/j0i/ZnWCGARd4tnMw==

classification

f11716982 Location:

Variable Format: character

Notes: UNF:6:9zdS2jLhktEJ1eoRCTaNJA==

releaseDate

f11716982 Location:

Variable Format: character

Notes: UNF:6:K3ZyMsw3eAMbLwUaQJkdYg==

depositionDate

f11716982 Location:

Variable Format: character

Notes: UNF:6:V+ojzIX0Tu4EqyarHWlLfA==

revisionDate

f11716982 Location:

Variable Format: character

Notes: UNF:6:+iJ55sI4JZtLhMzrQrd+Jw==

authors

f11716982 Location:

Variable Format: character

Notes: UNF:6:sH5tjQwJQqeDSSUuxKcxPg==

structureMolecularWeight

f11716982 Location:

Summary Statistics: Valid 144173.0; Min. 314.38; StDev 617138.5139571822; Mean 113072.19783128629; Max. 9.7730536E7

Variable Format: numeric

Notes: UNF:6:/vX4tAHWjN0ZhK5R71Uj1Q==

macromoleculeType

f11716982 Location:

Variable Format: character

Notes: UNF:6:lpX3DTr5WGz2NMcpNoBVfw==

residueCount

f11716982 Location:

Summary Statistics: Min. 0.0; Max. 313236.0; Mean 828.4346098086216; StDev 2125.357475585923; Valid 144173.0;

Variable Format: numeric

Notes: UNF:6:2ETIdEU7TKFPRO3B69EX2g==

atomSiteCount

f11716982 Location:

Summary Statistics: Max. 2440800.0; StDev 20062.01180554317; Valid 144173.0; Mean 6674.830897604348; Min. 0.0

Variable Format: numeric

Notes: UNF:6:zEoexNa4R6SGwnHJ8I3Ayw==

pdbDoi

f11716982 Location:

Variable Format: character

Notes: UNF:6:tml4aX0dvUYEdjxFMtlOuw==

structureId

f11716993 Location:

Variable Format: character

Notes: UNF:6:WvhikHzLfuYITd91kcdBeA==

100 quantiles of stk_cites_nslf3

f11716993 Location:

Summary Statistics: Min. 1.0; StDev 29.826175806842322; Valid 12306.0; Mean 48.71046643913557; Max. 100.0;

Variable Format: numeric

Notes: UNF:6:ucuCLZCfGmwnBbAjpXuY2A==

Predicted values

f11716993 Location:

Summary Statistics: Min. 16.2842960357666; Mean 44.958374480159385; Valid 20434.0; StDev 13.567828882560503; Max. 100.0;

Variable Format: numeric

Notes: UNF:6:KZvMZxXHGJDVP/GY+3anwQ==

structureId

f11716981 Location:

Variable Format: character

Notes: UNF:6:82r46ocbTcgdHzlaYdiqCQ==

clusterId

f11716981 Location:

Variable Format: character

Notes: UNF:6:orG/Sx4CP7NfW4u+YCTXLQ==

releaseDate

f11716981 Location:

Variable Format: character

Notes: UNF:6:MY/PxEyxCZN6l5tG6hgFoA==

scoopCluster

f11716981 Location:

Summary Statistics: Valid 10410.0; StDev 18.565328740879924; Max. 100.0; Min. 50.0; Mean 82.12968299711812

Variable Format: numeric

Notes: UNF:6:8+Ql+8AX0e8KpB6QilUmMA==

scooped

f11716981 Location:

Summary Statistics: Valid 10410.0; Mean 0.5231508165225747; Max. 1.0; StDev 0.4994877434744961; Min. 0.0;

Variable Format: numeric

Notes: UNF:6:wzYUK4KEfAubejfxBAq2zw==

sampleBlind

f11716981 Location:

Summary Statistics: StDev 0.49538981019160017; Valid 10410.0; Min. 0.0; Mean 0.43208453410182723; Max. 1.0;

Variable Format: numeric

Notes: UNF:6:YjtcRPSpmgde/G74S8+56A==

StartDate

f11716979 Location:

Variable Format: character

Notes: UNF:6:Fq382ybCoi8fZ1I2mM3NhQ==

EndDate

f11716979 Location:

Variable Format: character

Notes: UNF:6:e8ewniTtIV04wpuymjC7OQ==

Status

f11716979 Location:

Variable Format: character

Notes: UNF:6:i8abDc7RsTMXu+XM1NWzHw==

Progress

f11716979 Location:

Summary Statistics: StDev 31.455044093234157; Min. 13.0; Mean 83.29231351760569; Max. 100.0; Valid 10564.0

Variable Format: numeric

Notes: UNF:6:nz5Qp3byclLDs/yk5khpRQ==

Duration (in seconds)

f11716979 Location:

Summary Statistics: Max. 1296175.0; Valid 10564.0; StDev 40987.48748777706; Min. 0.0; Mean 4654.944433927156;

Variable Format: numeric

Notes: UNF:6:Gk/8nGg3GOQVEgF5eWIPJg==

Finished

f11716979 Location:

Variable Format: character

Notes: UNF:6:sSj4xgIBNzuQvu7weC1O+A==

RecordedDate

f11716979 Location:

Variable Format: character

Notes: UNF:6:t90+R+sIHwSQ+w6HSEEryQ==

ResponseId

f11716979 Location:

Variable Format: character

Notes: UNF:6:pkpvWXR5hNiQ0QAvrfdrHw==

DistributionChannel

f11716979 Location:

Variable Format: character

Notes: UNF:6:V69H/UzPRZmeluqISbNCbA==

UserLanguage

f11716979 Location:

Variable Format: character

Notes: UNF:6:DK1fZsfUMbmT0AFns/jH+A==

Q_BallotBoxStuffing

f11716979 Location:

Variable Format: character

Notes: UNF:6:WEEH5CZZoPjvOGfRKSxVNQ==

Comp 1.1_1

f11716979 Location:

Summary Statistics: StDev 26.12742368286901; Valid 4585.0; Min. 0.0; Max. 100.0; Mean 58.889640130861466;

Variable Format: numeric

Notes: UNF:6:U71CtFSf4EV5dT9Mw96jpw==

Comp 2.1_1

f11716979 Location:

Summary Statistics: Mean 52.65955153083227; Min. 0.0; Valid 4638.0; StDev 26.051066420574223; Max. 100.0;

Variable Format: numeric

Notes: UNF:6:y19RdK5SvG64SbgOq1vCDA==

Qual 1.1_1

f11716979 Location:

Summary Statistics: Min. 0.03; Mean 12.919711286089239; StDev 6.82166241664357; Max. 24.0; Valid 4191.0

Variable Format: numeric

Notes: UNF:6:UEZmCv1sp5sjzSciLLQ8ZQ==

Qual 1.2_1

f11716979 Location:

Variable Format: character

Notes: UNF:6:PdrPcLlBLYJnkDk1/JDoxw==

Qual 1.2_2

f11716979 Location:

Variable Format: character

Notes: UNF:6:YB30WG/JXuuuyw9bSiV6wQ==

Qual 1.2_3

f11716979 Location:

Variable Format: character

Notes: UNF:6:uiTACON2yY2jy6lC4MinEA==

Qual 1.2_4

f11716979 Location:

Variable Format: character

Notes: UNF:6:cjQc6iT0SLqnmqv45Oz9bg==

Qual 1.2_5

f11716979 Location:

Variable Format: character

Notes: UNF:6:QLBKGCv8JnnamTHhjmqycg==

Qual 1.2_6

f11716979 Location:

Variable Format: character

Notes: UNF:6:ofKcoeKfuqGQ4lLPjEiw/w==

Qual 2.1_1

f11716979 Location:

Summary Statistics: Min. 0.0; Max. 24.0; StDev 6.526103958882243; Mean 9.935474074074072; Valid 4050.0

Variable Format: numeric

Notes: UNF:6:/60j05skv+H4+rcKZOWCew==

Qual 2.2_1

f11716979 Location:

Variable Format: character

Notes: UNF:6:I/K7A7Z7epJqudw3ojnPDQ==

Qual 2.2_2

f11716979 Location:

Variable Format: character

Notes: UNF:6:1d/wX5w+4iGRSYKO5rBjNg==

Qual 2.2_3

f11716979 Location:

Variable Format: character

Notes: UNF:6:kX2lvgfJai68OzGSbJmJlQ==

Qual 2.2_4

f11716979 Location:

Variable Format: character

Notes: UNF:6:80roXj8Kkl5FSx9xqpMSUA==

Qual 2.2_5

f11716979 Location:

Variable Format: character

Notes: UNF:6:y0x+ZvMdp7wR1PD7DewELg==

Qual 2.2_6

f11716979 Location:

Variable Format: character

Notes: UNF:6:YNdTqD8ItFWdrv9JSIbxcQ==

D1

f11716979 Location:

Variable Format: character

Notes: UNF:6:5XfCcA+nqh+fWoTeFWg90w==

D2

f11716979 Location:

Variable Format: character

Notes: UNF:6:oSX/OtAWifh6swatvLLrIw==

D3

f11716979 Location:

Variable Format: character

Notes: UNF:6:goItuaOO62BZrBAr+3zbWQ==

l0_l1

f11716979 Location:

Variable Format: character

Notes: UNF:6:Yu0JxnSNjkroo7vV4js91w==

pubmedId

f11716960 Location:

Summary Statistics: Valid 604016.0; Min. 1279434.0; Mean 1.5058693831709042E7; Max. 2.9735985E7; StDev 6093502.081918616

Variable Format: numeric

Notes: UNF:6:yzzR3xLpbrT8PXChGVztdw==

year

f11716960 Location:

Summary Statistics: Min. 1980.0; Mean 2010.8980242245045; Max. 2018.0; Valid 604016.0; StDev 5.900052985882172

Variable Format: numeric

Notes: UNF:6:t+0X6nblJRaHqThDr7DJKQ==

(sum) nbcites

f11716960 Location:

Summary Statistics: StDev 0.0; Valid 604016.0; Max. 0.0; Mean 0.0; Min. 0.0

Variable Format: numeric

Notes: UNF:6:zjA6dzErhaR0PeT/hnMWsA==

(sum) nbcites_nslf

f11716960 Location:

Summary Statistics: Max. 0.0; Min. 0.0; Valid 604016.0; Mean 0.0; StDev 0.0

Variable Format: numeric

Notes: UNF:6:zjA6dzErhaR0PeT/hnMWsA==

pubyear

f11716960 Location:

Summary Statistics: Max. 2018.0; Valid 604016.0; Min. 1980.0; StDev 6.847754723669283; Mean 2003.796048449024;

Variable Format: numeric

Notes: UNF:6:/E0pfR+M+YsSvv5Sc16+Xg==

stk_cites

f11716960 Location:

Summary Statistics: Mean 0.0; Min. 0.0; Max. 0.0; StDev 0.0; Valid 604016.0

Variable Format: numeric

Notes: UNF:6:zjA6dzErhaR0PeT/hnMWsA==

stk_cites_nslf

f11716960 Location:

Summary Statistics: Mean 0.0; StDev 0.0; Min. 0.0; Max. 0.0; Valid 604016.0;

Variable Format: numeric

Notes: UNF:6:zjA6dzErhaR0PeT/hnMWsA==

qntl_stkcites_t

f11716960 Location:

Summary Statistics: Mean 0.0; Valid 604016.0; StDev 0.0; Max. 0.0; Min. 0.0;

Variable Format: numeric

Notes: UNF:6:zjA6dzErhaR0PeT/hnMWsA==

qntl_stkcites_nslf_t

f11716960 Location:

Summary Statistics: Mean 0.0; Valid 604016.0; StDev 0.0; Max. 0.0; Min. 0.0;

Variable Format: numeric

Notes: UNF:6:zjA6dzErhaR0PeT/hnMWsA==

Other Study-Related Materials

Label:

01_build_pdb.do

Notes:

application/x-stata-syntax

Other Study-Related Materials

Label:

01_clean_summary.do

Notes:

application/x-stata-syntax

Other Study-Related Materials

Label:

01_summary_stats.do

Notes:

application/x-stata-syntax

Other Study-Related Materials

Label:

02_build_entities.do

Notes:

application/x-stata-syntax

Other Study-Related Materials

Label:

02_clean_citation.do

Notes:

application/x-stata-syntax

Other Study-Related Materials

Label:

02_potential_regressions.do

Notes:

application/x-stata-syntax

Other Study-Related Materials

Label:

03_build_structures.do

Notes:

application/x-stata-syntax

Other Study-Related Materials

Label:

03_clean_refine.do

Notes:

application/x-stata-syntax

Other Study-Related Materials

Label:

03_structural_genomics.do

Notes:

application/x-stata-syntax

Other Study-Related Materials

Label:

04_build_papers.do

Notes:

application/x-stata-syntax

Other Study-Related Materials

Label:

04_clean_collection.do

Notes:

application/x-stata-syntax

Other Study-Related Materials

Label:

04_competition_regressions.do

Notes:

application/x-stata-syntax

Other Study-Related Materials

Label:

05_clean_entity.do

Notes:

application/x-stata-syntax

Other Study-Related Materials

Label:

05_define_sample.do

Notes:

application/x-stata-syntax

Other Study-Related Materials

Label:

05_welfare.do

Notes:

application/x-stata-syntax

Other Study-Related Materials

Label:

06_clean_pubmed.do

Notes:

application/x-stata-syntax

Other Study-Related Materials

Label:

06_generate_p.do

Notes:

application/x-stata-syntax

Other Study-Related Materials

Label:

06_welfare_calculations.do

Notes:

application/x-stata-syntax

Other Study-Related Materials

Label:

07_appendix_misc.do

Notes:

application/x-stata-syntax

Other Study-Related Materials

Label:

07_clean_validation.do

Notes:

application/x-stata-syntax

Other Study-Related Materials

Label:

08_clean_drugbank.do

Notes:

application/x-stata-syntax

Other Study-Related Materials

Label:

08_phat_bootstrap.do

Notes:

application/x-stata-syntax

Other Study-Related Materials

Label:

09_clean_survey.do

Notes:

application/x-stata-syntax

Other Study-Related Materials

Label:

09_survey_analysis.do

Notes:

application/x-stata-syntax

Other Study-Related Materials

Label:

comp_qual_master.do

Notes:

application/x-stata-syntax

Other Study-Related Materials

Label:

pdbe_Validation.csv

Notes:

text/csv

Other Study-Related Materials

Label:

placeholder.csv

Notes:

text/csv

Other Study-Related Materials

Label:

README.pdf

Notes:

application/pdf

Other Study-Related Materials

Label:

Tables.xlsx

Notes:

application/vnd.openxmlformats-officedocument.spreadsheetml.sheet

Other Study-Related Materials

Label:

uniprot_pubmed.dta

Notes:

application/x-stata-14