EvidenceNet Dataset (for HCC and CRC diseases) (doi:10.7910/DVN/649TSE)

Document Description

Citation

Title:

EvidenceNet Dataset (for HCC and CRC diseases)

Identification Number:

doi:10.7910/DVN/649TSE

Distributor:

Harvard Dataverse

Date of Distribution:

2026-03-31

Version:

1

Bibliographic Citation:

Zong, Chang, 2026, "EvidenceNet Dataset (for HCC and CRC diseases)", https://doi.org/10.7910/DVN/649TSE, Harvard Dataverse, V1

Study Description

Citation

Title:

EvidenceNet Dataset (for HCC and CRC diseases)

Identification Number:

doi:10.7910/DVN/649TSE

Authoring Entity:

Zong, Chang (Zhejiang University of Science and Technology)

Distributor:

Harvard Dataverse

Access Authority:

Zong, Chang

Depositor:

Zong, Chang

Date of Deposit:

2026-03-30

Holdings Information:

https://doi.org/10.7910/DVN/649TSE

Study Scope

Keywords:

Engineering, Medicine, Health and Life Sciences, Evidence Knowledge Graph, Information Extraction from Full-Text Literature, Biomedical Reasoning

Abstract:

This is the public derived-data release for EvidenceNet, a framework for constructing disease-specific, evidence-centric knowledge graphs from full-text biomedical literature published between 2010 and 2025 (around 500 articles per disease). The package contains two released resources: EvidenceNet-HCC and EvidenceNet-CRC. For each disease, it provides a record-level JSON file (evidence_nodes_hcc.json, evidence_nodes_crc.json) and a graph-level JSON file (evidence_graph_hcc.json, evidence_graph_crc.json). These files preserve structured evidence records, normalized biomedical entities, typed evidence-evidence relations, provenance metadata, and evidence quality scores. The package also includes selected derived evaluation outputs for component validation, question answering, future link prediction, and target prioritization. Raw source PDFs, large third-party knowledge resources, and local cache files are not included.
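Because this record does not document the internal schema of the JSON files, a first step after download is to inspect their top-level structure. The following is a minimal sketch, assuming only that each file is valid JSON; the helper name `summarize_json` is hypothetical and is not part of the release, and no field names from the actual files are assumed.

```python
import json


def summarize_json(path):
    """Return a small, schema-agnostic summary of a JSON file's top level."""
    with open(path, encoding="utf-8") as fh:
        data = json.load(fh)

    summary = {"type": type(data).__name__}

    if isinstance(data, dict):
        # Top-level object: report its keys and the size of each container value.
        summary["keys"] = sorted(data.keys())
        summary["sizes"] = {
            key: len(value)
            for key, value in data.items()
            if isinstance(value, (list, dict))
        }
    elif isinstance(data, list):
        # Top-level array: report its length and the field names seen
        # in the first few records (only records that are objects).
        summary["length"] = len(data)
        fields = set()
        for record in data[:5]:
            if isinstance(record, dict):
                fields.update(record.keys())
        summary["fields"] = sorted(fields)

    return summary
```

Calling `summarize_json("evidence_nodes_hcc.json")` on a downloaded copy would show whether the file is an array of records or a keyed object, which is enough to decide how to iterate over it.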

Methodology and Processing

Sources Statement

Data Access

Notes:

CC0 1.0 (http://creativecommons.org/publicdomain/zero/1.0)

Other Study Description Materials

Other Study-Related Materials

Label:

crc_future_links.json

Notes:

application/json

Other Study-Related Materials

Label:

crc_merged_dataset.json

Notes:

application/json

Other Study-Related Materials

Label:

evidence_graph_crc.json

Notes:

application/json

Other Study-Related Materials

Label:

evidence_graph_hcc.json

Notes:

application/json

Other Study-Related Materials

Label:

evidence_nodes_crc.json

Notes:

application/json

Other Study-Related Materials

Label:

evidence_nodes_hcc.json

Notes:

application/json

Other Study-Related Materials

Label:

generated_qa_crc.json

Notes:

application/json

Other Study-Related Materials

Label:

generated_qa_hcc.json

Notes:

application/json

Other Study-Related Materials

Label:

hcc_future_links.json

Notes:

application/json

Other Study-Related Materials

Label:

hcc_merged_dataset.json

Notes:

application/json

Other Study-Related Materials

Label:

README.md

Notes:

text/markdown
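The file labels above follow a naming pattern in which the disease code (hcc or crc) appears as an underscore-separated token. As a minimal sketch, assuming that pattern holds for all released files, the listing can be grouped by disease as follows; the names `RELEASED_FILES` and `group_by_disease` are hypothetical and used only for illustration.

```python
# File labels copied from the listing in this record.
RELEASED_FILES = [
    "crc_future_links.json",
    "crc_merged_dataset.json",
    "evidence_graph_crc.json",
    "evidence_graph_hcc.json",
    "evidence_nodes_crc.json",
    "evidence_nodes_hcc.json",
    "generated_qa_crc.json",
    "generated_qa_hcc.json",
    "hcc_future_links.json",
    "hcc_merged_dataset.json",
    "README.md",
]

DISEASES = ("hcc", "crc")


def group_by_disease(names):
    """Group file names by the disease token found in the file stem."""
    groups = {disease: [] for disease in DISEASES}
    groups["shared"] = []  # files with no disease token, e.g. README.md
    for name in names:
        stem = name.rsplit(".", 1)[0].lower()
        tokens = stem.split("_")
        match = next((d for d in DISEASES if d in tokens), None)
        groups[match or "shared"].append(name)
    return groups


if __name__ == "__main__":
    for disease, files in group_by_disease(RELEASED_FILES).items():
        print(disease, files)
```

Matching on whole tokens rather than substrings avoids accidental hits if a future file name happens to contain "hcc" or "crc" inside a longer word.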