A dataset of publication records for Nobel laureates (doi:10.7910/DVN/6NJ5RN)

View:

Part 1: Document Description
Part 2: Study Description
Part 3: Data Files Description
Part 4: Variable Description
Entire Codebook

Document Description

Citation

Title:

A dataset of publication records for Nobel laureates

Identification Number:

doi:10.7910/DVN/6NJ5RN

Distributor:

Harvard Dataverse

Date of Distribution:

2018-12-04

Version:

1

Bibliographic Citation:

Li, Jichao; Yin, Yian; Fortunato, Santo; Wang Dashun, 2018, "A dataset of publication records for Nobel laureates", https://doi.org/10.7910/DVN/6NJ5RN, Harvard Dataverse, V1, UNF:6:/Mr84aTKPhJytkmsz1tgZQ== [fileUNF]

Study Description

Citation

Title:

A dataset of publication records for Nobel laureates

Identification Number:

doi:10.7910/DVN/6NJ5RN

Authoring Entity:

Li, Jichao (Northwestern University)

Yin, Yian (Northwestern University)

Fortunato, Santo (Indiana University)

Wang Dashun (Northwestern University)

Distributor:

Harvard Dataverse

Access Authority:

Li, Jichao

Depositor:

Li, Jichao

Date of Deposit:

2018-12-03

Holdings Information:

https://doi.org/10.7910/DVN/6NJ5RN

Study Scope

Keywords:

Social Sciences

Abstract:

We constructed the publication records for almost all Nobel laureates in physics, chemistry, and physiology or medicine from 1900 to 2016 (545 out of 590, 92.4%). We first collected information manually from Nobel Prize official websites, their university websites, and Wikipedia. We then match it algorithmically with big data, tracing publication records from the MAG database.

Methodology and Processing

Sources Statement

Data Access

Notes:

<a href="http://creativecommons.org/publicdomain/zero/1.0">CC0 1.0</a>

Other Study Description Materials

File Description--f3323577

File: Chemistry publication record.tab

  • Number of cases: 42657

  • No. of variables per record: 10

  • Type of File: text/tab-separated-values

Notes:

UNF:6:jDl2/f7JsLkiAUUqUTMNjg==

File Description--f3323578

File: Medicine publication record.tab

  • Number of cases: 29233

  • No. of variables per record: 10

  • Type of File: text/tab-separated-values

Notes:

UNF:6:PBJJd/v8s3uFlW4c8ESsQA==

File Description--f3323579

File: Physics publication record.tab

  • Number of cases: 21504

  • No. of variables per record: 10

  • Type of File: text/tab-separated-values

Notes:

UNF:6:36Zajas6XhgOnXIZhWD9Pg==

File Description--f3323580

File: Prize-winning paper record.tab

  • Number of cases: 874

  • No. of variables per record: 8

  • Type of File: text/tab-separated-values

Notes:

UNF:6:1YD9MlOiXAr4IVFk3KPH+g==

Variable Description

List of Variables:

Variables

Laureate ID

f3323577 Location:

Summary Statistics: Min. 20001.0; Mean 20061.10943104299; Max. 20163.0; StDev 37.944124600626296; Valid 42657.0

Variable Format: numeric

Notes: UNF:6:JwG6puIzn/YqJbtE3GnLXw==

Laureate name

f3323577 Location:

Variable Format: character

Notes: UNF:6:DMRr/M6/gfm0iSir+y9zlg==

Prize year

f3323577 Location:

Summary Statistics: Valid 42657.0; Mean 1988.0807604848003; Min. 1908.0; StDev 21.377326266488055; Max. 2016.0

Variable Format: numeric

Notes: UNF:6:hxnhU3KUEFq39Vb3dXARKg==

Title

f3323577 Location:

Variable Format: character

Notes: UNF:6:qAx0TcZ3JvfcOMJQeKbsuQ==

Pub year

f3323577 Location:

Summary Statistics: Max. 2018.0; Min. 1886.0; Mean 1982.8235694024381; StDev 22.215752754841727; Valid 42657.0

Variable Format: numeric

Notes: UNF:6:4wczG3MF2fNd7/AwYlfW8w==

Paper ID

f3323577 Location:

Summary Statistics: Min. 46287.0; Valid 42651.0; Mean 2.0862953941851313E9; Max. 2.787385211E9; StDev 2.0731257696086302E8

Variable Format: numeric

Notes: UNF:6:4sEN+RKpymYdxz1jXJW50w==

DOI

f3323577 Location:

Variable Format: character

Notes: UNF:6:ueTGe6O7ufJRs1YvBnOWIA==

Journal

f3323577 Location:

Variable Format: character

Notes: UNF:6:5dOR850+EzaIyFGvB3fCKQ==

Affiliation

f3323577 Location:

Variable Format: character

Notes: UNF:6:9DDzuVAJUnp4Q64/sphjoQ==

Is prize-winning paper

f3323577 Location:

Variable Format: character

Notes: UNF:6:ywky8+WHwEdXygOmAN6opA==

Laureate ID

f3323578 Location:

Summary Statistics: Mean 30071.331064208258; Min. 30001.0; Max. 30189.0; StDev 46.55651069141344; Valid 29233.0

Variable Format: numeric

Notes: UNF:6:H7s5XzQojf8wKkbBwiewnA==

Laureate name

f3323578 Location:

Variable Format: character

Notes: UNF:6:P32ruEFLPoerdaFMEqbySA==

Prize year

f3323578 Location:

Summary Statistics: Mean 1985.6569288133337; Min. 1912.0; StDev 21.093513559414983; Valid 29233.0; Max. 2016.0;

Variable Format: numeric

Notes: UNF:6:HQlEIaQjvBCMVlKcbf0zkw==

Title

f3323578 Location:

Variable Format: character

Notes: UNF:6:Yr0dBbKgWyTn51V6BJCaPA==

Pub year

f3323578 Location:

Summary Statistics: StDev 22.62730580041046; Min. 1850.0; Valid 29233.0; Max. 2018.0; Mean 1983.0767283549455

Variable Format: numeric

Notes: UNF:6:f8ONn9YhRwfrW7nYKdaf1g==

Paper ID

f3323578 Location:

Summary Statistics: Valid 29224.0; Min. 107758.0; StDev 1.998962120844465E8; Max. 2.787058625E9; Mean 2.0639778978406794E9

Variable Format: numeric

Notes: UNF:6:R9vAwiKsQJ6/CIJ8jTXv9Q==

DOI

f3323578 Location:

Variable Format: character

Notes: UNF:6:jCrmpUEciIWs/epUq05oSg==

Journal

f3323578 Location:

Variable Format: character

Notes: UNF:6:FIxpmNNj2/zcPvuHeJLWaw==

Affiliation

f3323578 Location:

Variable Format: character

Notes: UNF:6:MMvEyA/3RGPYxxve68qmRg==

Is prize-winning paper

f3323578 Location:

Variable Format: character

Notes: UNF:6:HMG27I0Au8qqOrAF5ljr1Q==

Laureate ID

f3323579 Location:

Summary Statistics: StDev 47.078454191388076; Min. 10001.0; Valid 21504.0; Mean 10067.215494791655; Max. 10193.0

Variable Format: numeric

Notes: UNF:6:3S4JRTx9gOJZ7N3M9mDnfg==

Laureate name

f3323579 Location:

Variable Format: character

Notes: UNF:6:9bS6cekPADfRw6IdqUIkKw==

Prize year

f3323579 Location:

Summary Statistics: Max. 2016.0; Mean 1988.2034505208287; Valid 21504.0; Min. 1902.0; StDev 22.401811991111785

Variable Format: numeric

Notes: UNF:6:YwyrchUTVa1vT3tK6kYU/Q==

Title

f3323579 Location:

Variable Format: character

Notes: UNF:6:m8Ry+tlPMM2CAbuZ9nn+GA==

Pub year

f3323579 Location:

Summary Statistics: Max. 2018.0; Min. 1826.0; Mean 1984.9008975491827; StDev 22.81448622941681; Valid 21503.0;

Variable Format: numeric

Notes: UNF:6:Zh+SHRzCmblv6gLl9cNqYw==

Paper ID

f3323579 Location:

Summary Statistics: StDev 2.5215771088126832E8; Min. 1376972.0; Max. 2.788704602E9; Mean 2.0566379459808226E9; Valid 21486.0;

Variable Format: numeric

Notes: UNF:6:jPvpEt5omozJo76nI0eDyA==

DOI

f3323579 Location:

Variable Format: character

Notes: UNF:6:G2xrJHTIQU5ZH6Gd6AoGfw==

Journal

f3323579 Location:

Variable Format: character

Notes: UNF:6:SnpL1WAw02WyYlDvHkXRXg==

Affiliation

f3323579 Location:

Variable Format: character

Notes: UNF:6:vv2nC8wytXGqClNpsWa0Qg==

Is prize-winning paper

f3323579 Location:

Variable Format: character

Notes: UNF:6:QsUFT09JXlDfffVNcn1kqA==

Field

f3323580 Location:

Variable Format: character

Notes: UNF:6:wICAOS3xdZDN/M3wA8x/+w==

Laureate ID

f3323580 Location:

Summary Statistics: Valid 874.0; Min. 10001.0; StDev 8374.77131995743; Mean 20654.21510297486; Max. 30189.0

Variable Format: numeric

Notes: UNF:6:y8jwEo96oZtiUdGgGxIo7w==

Laureate name

f3323580 Location:

Variable Format: character

Notes: UNF:6:5oxxIZjHyONTLhKo9gom7Q==

Prize year

f3323580 Location:

Summary Statistics: Max. 2016.0; Mean 1973.0274599542333; StDev 28.073486877775455; Valid 874.0; Min. 1902.0

Variable Format: numeric

Notes: UNF:6:Wtg5b/NngpcTRBaObeo0zQ==

Title

f3323580 Location:

Variable Format: character

Notes: UNF:6:glUydqB3H+Hmb0uQZDSYaw==

Pub year

f3323580 Location:

Summary Statistics: Valid 874.0; Min. 1887.0; StDev 24.931493183275272; Max. 2010.0; Mean 1956.229977116705

Variable Format: numeric

Notes: UNF:6:shu6Ksptqyniz6zebZf7Cg==

Paper ID

f3323580 Location:

Summary Statistics: Mean 1.989629953020214E9; Max. 2.77746594E9; Valid 841.0; StDev 3.629530851157096E8; Min. 1.1476871E7

Variable Format: numeric

Notes: UNF:6:plBEUC/OYbXflhsTT8pQ4Q==

Additional information

f3323580 Location:

Variable Format: character

Notes: UNF:6:tvFSdkkWnxlRWPwwBP5iYg==