Scientific Data

Papers
(The H4-Index of Scientific Data is 68. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-04-01 to 2024-04-01.)
ArticleCitations
Version 4 of the CRU TS monthly high-resolution gridded multivariate climate dataset2007
The FLUXNET2015 dataset and the ONEFlux processing pipeline for eddy covariance data618
County-level CO2 emissions and sequestration in China during 1997–2017429
PTB-XL, a large publicly available electrocardiography dataset352
A cross-country database of COVID-19 testing332
High resolution temporal profiles in the Emissions Database for Global Atmospheric Research293
A harmonized global nighttime light dataset 1992–2018235
Dynamic World, Near real-time global 10 m land use land cover mapping229
HyperKvasir, a comprehensive multi-class image and video dataset for gastrointestinal endoscopy207
MIMIC-IV, a freely accessible electronic health record dataset205
COVID-19 outbreak response, a dataset to assess mobility changes in Italy following national lockdown189
Materials Cloud, a platform for open computational science189
Holocene global mean surface temperature, a multi-method reconstruction approach181
The World Checklist of Vascular Plants, a continuously updated resource for exploring global plant diversity179
Bias-corrected climate projections for South Asia from Coupled Model Intercomparison Project-6168
A patient-centric dataset of images and metadata for identifying melanomas using clinical context164
InvaCost, a public database of the economic costs of biological invasions worldwide162
Harmonized global maps of above and belowground biomass carbon density in the year 2010159
The TRUST Principles for digital repositories156
The COUGHVID crowdsourcing dataset, a corpus for the study of large-scale cough analysis algorithms149
Highly accurate long-read HiFi sequencing data for five complex genomes149
Version 3 of the Global Aridity Index and Potential Evapotranspiration Database149
Outlining where humans live, the World Settlement Footprint 2015143
Systematic phenotyping and characterization of the 5xFAD mouse model of Alzheimer’s disease141
The 10-m crop type maps in Northeast China during 2017–2019139
AiiDA 1.0, a scalable computational infrastructure for automated reproducible workflows and data provenance138
The human O-GlcNAcome database and meta-analysis136
Multiscale dynamic human mobility flow dataset in the U.S. during the COVID-19 epidemic134
The International Bathymetric Chart of the Arctic Ocean Version 4.0125
A structured open dataset of government interventions in response to COVID-19124
A global-scale data set of mining areas116
NASA Global Daily Downscaled Projections, CMIP6111
A global database of Holocene paleotemperature records107
Carbon Monitor, a near-real-time daily dataset of global CO2 emission from fossil fuel and cement production107
COVID-CT-MD, COVID-19 computed tomography scan dataset applicable in machine learning and deep learning105
Data sharing practices and data availability upon request differ across scientific disciplines105
The ANI-1ccx and ANI-1x data sets, coupled-cluster and density functional theory properties for molecules101
Operationalizing the CARE and FAIR Principles for Indigenous data futures101
MedMNIST v2 - A large-scale lightweight benchmark for 2D and 3D biomedical image classification93
Global 1 km × 1 km gridded revised real gross domestic product and electricity consumption during 1992–2019 based on calibrated nighttime light data92
High-throughput screening platform for solid electrolytes combining hierarchical ion-transport prediction algorithms90
A global record of annual terrestrial Human Footprint dataset from 2000 to 201890
Kvasir-Capsule, a video capsule endoscopy dataset89
Global land use for 2015–2100 at 0.05° resolution under diverse socioeconomic and climate scenarios88
Building a PubMed knowledge graph88
The Building Data Genome Project 2, energy meter data from the ASHRAE Great Energy Predictor III competition86
A platinum standard pan-genome resource that represents the population structure of Asian rice85
Gridded daily weather data for North America with comprehensive uncertainty quantification84
A global map of terrestrial habitat types82
VinDr-CXR: An open dataset of chest X-rays with radiologist’s annotations80
A high-resolution in vivo magnetic resonance imaging atlas of the human hypothalamic region80
GlobalFungi, a global database of fungal occurrences from high-throughput-sequencing metabarcoding studies79
A database of battery materials auto-generated using ChemDataExtractor77
COVID-19 Disease Map, building a computational repository of SARS-CoV-2 virus-host interaction mechanisms74
High-resolution monthly precipitation and temperature time series from 2006 to 210073
Introducing the FAIR Principles for research software73
ERA5-based global meteorological wildfire danger maps73
Global quantitative analysis of the human brain proteome and phosphoproteome in Alzheimer’s disease73
Bias-corrected CMIP6 global dataset for dynamical downscaling of the historical and future climate (1979–2100)72
Geomorpho90m, empirical evaluation and accuracy assessment of global high-resolution geomorphometric layers72
Reactants, products, and transition states of elementary chemical reactions based on quantum chemistry71
Global terrestrial carbon fluxes of 1999–2019 estimated by upscaling eddy covariance data with a random forest71
COVIDiSTRESS Global Survey dataset on psychological and behavioural consequences of the COVID-19 outbreak70
Harmonised global datasets of wind and solar farm locations and power70
K-EmoCon, a multimodal sensor dataset for continuous emotion recognition in naturalistic conversations70
AusTraits, a curated plant trait database for the Australian flora70
PERSIANN-CCS-CDR, a 3-hourly 0.04° global precipitation climate data record for heavy precipitation studies69
Thermodynamic and transport properties of hydrogen containing streams68
0.054333925247192