The Allen Ancient DNA Resource (AADR): A curated compendium of ancient human genomes (doi:10.7910/DVN/FFIDCW)

Document Description

Citation

Title:

The Allen Ancient DNA Resource (AADR): A curated compendium of ancient human genomes

Identification Number:

doi:10.7910/DVN/FFIDCW

Distributor:

Harvard Dataverse

Date of Distribution:

2023-04-01

Version:

9

Bibliographic Citation:

Mallick, Swapan; Reich, David, 2023, "The Allen Ancient DNA Resource (AADR): A curated compendium of ancient human genomes", https://doi.org/10.7910/DVN/FFIDCW, Harvard Dataverse, V9

Study Description

Citation

Title:

The Allen Ancient DNA Resource (AADR): A curated compendium of ancient human genomes

Identification Number:

doi:10.7910/DVN/FFIDCW

Authoring Entity:

Mallick, Swapan (Harvard University)

Reich, David (Harvard University)

Distributor:

Harvard Dataverse

Access Authority:

Mallick, Swapan

Depositor:

Mallick, Swapan

Date of Deposit:

2023-03-28

Holdings Information:

https://doi.org/10.7910/DVN/FFIDCW

Study Scope

Keywords:

Medicine, Health and Life Sciences

Abstract:

The Allen Ancient DNA Resource (AADR) seeks to provide a publicly available, uniformly curated dataset that is maximally useful for scientists carrying out analyses of population history and natural selection. The dataset consists of thousands of ancient and present-day individuals genotyped at up to 1.23 million positions in the genome (in hg19 coordinates).

The genotypes in the AADR are not a perfect match to those from the associated published papers. To make it easier to co-analyze datasets, we have started from bam or fastq files; trimmed the ends of sequences to reduce errors due to ancient DNA damage in a way that is largely uniform across datasets and may be slightly different from that used in the individual publications; and determined genotypes anew by sampling a random sequence to cover each position.

Researchers who wish to use this compilation should provide two citations. The first should be to the Dataverse page and the specific version of AADR they use as the basis of their analyses (e.g. version 9, the September 16 2024 release, as in the example below). The second should be to the manuscript describing AADR.

(1) "Swapan Mallick and David Reich: The Allen Ancient DNA Resource (AADR): A curated compendium of ancient human genomes, https://doi.org/10.7910/DVN/FFIDCW, Harvard Dataverse, V9 data release [September 16, 2024]."

(2) "Mallick S, Micco A, Mah M, Ringbauer H, Lazaridis I, Olalde I, Patterson N, Reich D (2024) The Allen Ancient DNA Resource (AADR) a curated compendium of ancient human genomes. Sci Data 11, 182."

Citing the AADR is not a substitute for citing the original papers that produced the component data, which must be specifically referenced in each publication that uses data from them.

We aim to update and enhance this resource every couple of months to make the releases maximally useful to the community. We rely on feedback from the user community to improve the AADR, so please write jointly to Swapan Mallick (swapan_mallick@hms.harvard.edu) and David Reich (reich@genetics.med.harvard.edu) if you identify errors or other issues.

The first version of AADR was made publicly available on February 22 2019 via the Reich laboratory website at Harvard Medical School, which hosted a total of six primary releases. All releases are now copied to Dataverse, which has the virtue of including a permanent digital object identifier (doi) that can be cited in a straightforward way, and data access not tied to the website of a Principal Investigator. Below is a translation from the versions on the Reich laboratory website to the Dataverse versions.

V62.0 (Dataverse 9.0) September 16 2024
V54.1.p1 (Dataverse 8.0) March 6 2023
V54.1 (Dataverse 7.0) Nov 16 2022
V52.2 (Dataverse 6.0) Aug 22 2022
V50.0.p1 (Dataverse 5.0) Aug 1 2022
V50.0 (Dataverse 4.0) Oct 10 2021
V44.3 (Dataverse 3.0) Jan 20 2021
V42.4 (Dataverse 2.0) Mar 25 2020
V37.2 (Dataverse 1.0) Feb 22 2019

We thank the John Templeton Foundation, a grant from the National Institutes of Health, the Howard Hughes Medical Institute, and the Allen Discovery Center program, a Paul G. Allen Frontiers Group advised program of the Paul G. Allen Family Foundation, for providing the resources needed to create and update this dataset.
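The genotype determination described in the abstract ("sampling a random sequence to cover each position") can be illustrated with a minimal Python sketch. This is not the AADR pipeline: the function name `pseudo_haploid_call`, its arguments, and the choice to ignore bases that match neither expected allele are all assumptions made here for illustration.

```python
import random


def pseudo_haploid_call(observed_bases, ref, alt, rng):
    """Return one randomly sampled allele at a site, or None if uncovered.

    observed_bases: bases seen at this position, one per overlapping sequence.
    ref, alt: the two alleles expected at the position.
    rng: a random.Random instance, passed in so results can be reproduced.

    Assumption (not from the source): bases matching neither allele are
    discarded before sampling.
    """
    usable = [base for base in observed_bases if base in (ref, alt)]
    if not usable:
        # No usable sequence covers the site, so the genotype is missing.
        return None
    # Pick a single sequence at random and report its allele.
    return rng.choice(usable)


if __name__ == "__main__":
    rng = random.Random(0)
    # Three sequences cover the site; one of their alleles is sampled.
    print(pseudo_haploid_call(["A", "G", "G"], ref="A", alt="G", rng=rng))
    # No coverage: the call is missing.
    print(pseudo_haploid_call([], ref="A", alt="G", rng=rng))
```

Because only one sequence is sampled per site, each individual carries a single allele at every covered position, regardless of sequencing depth.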

Methodology and Processing

Sources Statement

Data Access

Other Study Description Materials

Related Publications

Citation

Title:

Mallick S, Micco A, Mah M, Ringbauer H, Lazaridis I, Olalde I, Patterson N, Reich D (2024) The Allen Ancient DNA Resource (AADR) a curated compendium of ancient human genomes. Sci Data 11, 182.

Bibliographic Citation:

Mallick S, Micco A, Mah M, Ringbauer H, Lazaridis I, Olalde I, Patterson N, Reich D (2024) The Allen Ancient DNA Resource (AADR) a curated compendium of ancient human genomes. Sci Data 11, 182.

Other Study-Related Materials

Label:

aadr_v62.0__README.docx

Notes:

application/vnd.openxmlformats-officedocument.wordprocessingml.document

Other Study-Related Materials

Label:

aadr_v62.0__README_MT.docx

Notes:

application/vnd.openxmlformats-officedocument.wordprocessingml.document

Other Study-Related Materials

Label:

mtdna_uncompress_v3.py

Text:

Notes:

text/x-python

Other Study-Related Materials

Label:

v62.0_1240k_public.anno

Text:

Notes:

text/plain

Other Study-Related Materials

Label:

v62.0_1240k_public.geno

Text:

Notes:

application/octet-stream

Other Study-Related Materials

Label:

v62.0_1240k_public.ind

Text:

Notes:

text/plain

Other Study-Related Materials

Label:

v62.0_1240k_public.snp

Text:

Notes:

text/plain

Other Study-Related Materials

Label:

v62.0_1240k_public.xlsx

Text:

Notes:

application/vnd.openxmlformats-officedocument.spreadsheetml.sheet

Other Study-Related Materials

Label:

v62.0_HO_public.anno

Text:

Notes:

text/plain

Other Study-Related Materials

Label:

v62.0_HO_public.geno

Text:

Notes:

application/octet-stream

Other Study-Related Materials

Label:

v62.0_HO_public.ind

Text:

Notes:

text/plain

Other Study-Related Materials

Label:

v62.0_HO_public.snp

Text:

Notes:

text/plain

Other Study-Related Materials

Label:

v62.0_HO_public.xlsx

Text:

Notes:

application/vnd.openxmlformats-officedocument.spreadsheetml.sheet

Other Study-Related Materials

Label:

v62.0_MT.repo.fa.gz

Text:

Notes:

application/gzip