Scientific Data

Papers
(The H4-Index of Scientific Data is 65. The table below lists the papers at or above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2021-04-01 to 2025-04-01.)
Article | Citations
A chromosome-level genome assembly of East Asia endemic minnow Zacco platypus | 869
Dataset for developing deep learning models to assess crack width and self-healing progress in concrete | 461
A travelable area boundary dataset for visual navigation of field robots | 404
A catalogue of land-based adaptation and mitigation solutions to tackle climate change | 329
Kinematics, kinetics, and muscle activations during human locomotion over compliant terrains | 300
Author Correction: Microbial Metagenomes Across a Complete Phytoplankton Bloom Cycle: High-Resolution Sampling Every 4 Hours Over 22 Days | 265
PCMMD: A Novel Dataset of Plasma Cells to Support the Diagnosis of Multiple Myeloma | 235
Recovery of nearly 3,000 archaeal genomes from 152 terrestrial geothermal spring metagenomes | 228
A large-scale dataset for Chinese historical document recognition and analysis | 223
The compositional behavior of the human T cell receptor repertoire in ovarian cancer compared to healthy donors | 211
Mapping of 10-km daily diffuse solar radiation across China from reanalysis data and a Machine-Learning method | 208
A dataset on formulation parameters and characteristics of drug-loaded PLGA microparticles | 199
Curated global occurrence dataset of the insect order Zoraptera | 194
Vis-NIR soil spectral library of the Hungarian Soil Degradation Observation System | 191
CESNET-TimeSeries24: Time Series Dataset for Network Traffic Anomaly Detection and Forecasting | 181
A high-quality chromosome-level genome assembly of Pacific whiteleg shrimp (Penaeus vannamei) | 178
A co-registered in-situ and ex-situ dataset from wire arc additive manufacturing process | 172
A standardized lexicon of body odor words crafted from 17 countries | 167
Time-dependent RNA transcriptional profiling of abomasal mucosa in cattle infected with Ostertagia ostertagi | 155
A global dataset of fossil fungi records from the Cenozoic | 148
EEG Dataset for the Recognition of Different Emotions Induced in Voice-User Interaction | 135
Shear modulus reduction and damping ratios curves joined with engineering geological units in Italy | 133
Single-cell assay for transposase-accessible chromatin sequencing of human clear cell renal cell carcinoma | 127
Exploring, walking, and interacting in virtual reality with simulated low vision: a living contextual dataset | 119
A neuroimaging dataset during sequential color qualia similarity judgments with and without reports | 118
Measuring Overwork in China Using Daily High-Resolution Nighttime Satellite Data | 109
Chromosome-level genome assembly and annotation of the prickly nightshade Solanum rostratum Dunal | 108
XyloDensMap: a georeferenced dataset for the wood density of 110,000 trees from 156 European species in France | 108
Human alterations of the global floodplains 1992–2019 | 106
Comprehensive energy demand and usage data for building automation | 97
Multi-environment field trials for wheat yield, stability and breeding progress in Germany | 92
Global daily 1 km land surface precipitation based on cloud cover-informed downscaling | 91
Effects of heat stress on 16S rDNA, metagenome and metabolome in Holstein cows at different growth stages | 90
A database of low-energy atomically precise nanoclusters | 89
A human lower-limb biomechanics and wearable sensors dataset during cyclic and non-cyclic activities | 87
Size-fractionated microbiome observed during an eight-month long sampling in Jiaozhou Bay and the Yellow Sea | 86
ODFM, an omics data resource from microorganisms associated with fermented foods | 86
Annual Impervious Surface Data from 2001–2020 for West African Countries: Ghana, Togo, Benin and Nigeria | 85
A database of chemical absorption in human skin with mechanistic modeling applications | 83
De novo transcriptome assembly and gene annotation for the toxic dinoflagellate Dinophysis | 83
The transcriptomic footprint of Mytella strigata: de novo transcriptome assembly of a major invasive species | 82
Investigating the Quality of DermaMNIST and Fitzpatrick17k Dermatological Image Datasets | 82
High-quality faba bean reference transcripts generated using PacBio and Illumina RNA-seq data | 82
High-fidelity annotated triploid genome of the quarantine root-knot nematode, Meloidogyne enterolobii | 80
The Bushland, Texas, maize evapotranspiration, growth, and yield dataset Collection | 78
Dynamic urban morphology mapping in Chinese cities based on local climate zone approach | 78
Multidimensional dataset for cognitive assessment, sMRI, and rsfMRI in common benign epileptic children | 78
An enhanced rainfall-induced landslide catalogue in Italy | 77
High-speed video recordings of metal powder pneumatic conveying in thin capillary pipes | 77
A time-varying index for agricultural suitability across Europe from 1500–2000 | 76
Chromosome-level genome assembly of tetraploid Chinese cherry (Prunus pseudocerasus) | 74
A high-quality chromosome-scale genome assembly of the Cherokee rose (Rosa laevigata) | 73
A human single-neuron dataset for object recognition | 71
Linking Research Data with Physically Preserved Research Materials in Chemistry | 71
Propithecus verreauxi demography spanning 40 years at Bezà Mahafaly Special Reserve, southwest Madagascar | 70
Australian automotive workers and community leaders interview dataset following 2017 assembly plant closures | 70
A National Synthetic Populations Dataset for the United States | 69
An Integrated Database for Exploring Alternative Promoters in Animals | 69
A knowledge graph for crop diseases and pests in China | 68
A synthetic building operation dataset | 68
Genome assembly of Hawaiian flower thrips Thrips hawaiiensis (Thysanoptera: Thripidae) | 68
Author Correction: First assessment of underwater sound levels in the Northern Adriatic Sea at the basin scale | 67
A chromosome-level genome assembly of the spider mite Tetranychus piercei McGregor | 67
Monitoring non-pharmaceutical public health interventions during the COVID-19 pandemic | 67
Chromosome-level genome assembly of the critically endangered Baer’s pochard (Aythya baeri) | 66
FastMRI Prostate: A public, biparametric MRI dataset to advance machine learning for prostate cancer imaging | 65
Single-cell RNA-sequencing of virus-specific cellular immune responses in chronic hepatitis B patients | 65
SOIL-WATERGRIDS, mapping dynamic changes in soil moisture and depth of water table from 1970 to 2014 | 65
The Avian Diet Database as a source of quantitative information on bird diets | 65
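
The H4-Index quoted above can be checked against the listed citation counts. The page does not define the index, so the sketch below assumes the usual h-index rule applied to the four-year window: the largest h such that at least h papers have h or more citations. The function name `h_index` and the variable `counts` are illustrative, not part of any CrossRef tooling.

```python
def h_index(citations):
    """Largest h such that at least h papers have >= h citations (assumed definition)."""
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank  # the paper at this rank still has enough citations
        else:
            break     # counts only fall from here, so h cannot grow further
    return h


# Citation counts copied from the list above, in listed order (69 papers).
counts = [
    869, 461, 404, 329, 300, 265, 235, 228, 223, 211,
    208, 199, 194, 191, 181, 178, 172, 167, 155, 148,
    135, 133, 127, 119, 118, 109, 108, 108, 106, 97,
    92, 91, 90, 89, 87, 86, 86, 85, 83, 83,
    82, 82, 82, 80, 78, 78, 78, 77, 77, 76,
    74, 73, 71, 71, 70, 70, 69, 69, 68, 68,
    68, 67, 67, 67, 66, 65, 65, 65, 65,
]

print(h_index(counts))  # prints 65
```

The list holds 69 papers rather than 65 because four papers are tied at exactly 65 citations and all of them are shown.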