Statistical Analysis and Data Mining

Papers
(The TQCC of Statistical Analysis and Data Mining is 2. The table below lists the papers at or above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2022-01-01 to 2026-01-01.)
| Article | Citations |
| --- | --- |
| Randomized multiarm bandits: An improved adaptive data collection method | 540 |
| Semi‐Parametric Least‐Area Linear‐Circular Regression Through Möbius Transformation | 23 |
| Deep Learning for Variable Selection in Censored Quantile Regression Models | 21 |
| Testing for the Important Components of Predictive Variance | 19 |
| | 15 |
| Model Averaging for Regression Kink Models | 14 |
| CLADAG 2021 special issue: Selected papers on classification and data analysis | 9 |
| BayesMultiomics: An R Package for Bayesian Shrinkage Models for Integration and Analysis of Multi‐Platform High‐Dimensional Genomics Data | 9 |
| Issue Information | 9 |
| Data Twinning | 8 |
| Some Bayesian biclustering methods: Modeling and inference | 7 |
| Kernel learning with nonconvex ramp loss | 7 |
| Optimal ratio for data splitting | 6 |
| Bayesian shrinkage models for integration and analysis of multiplatform high‐dimensional genomics data | 5 |
| Issue Information | 5 |
| Weighted AutoEncoding recommender system | 5 |
| | 5 |
| Robust Model‐Based Semi‐Supervised Clustering of Incomplete Records | 4 |
| Variational Autoencoder With Gamma Mixture for Clustering High‐Dimensional Right‐Skewed Data | 4 |
| Multivariate contaminated normal mixture regression modeling of longitudinal data based on joint mean‐covariance model | 4 |
| Integrative learning of structured high‐dimensional data from multiple datasets | 4 |
| Robust deep neural network surrogate models with uncertainty quantification via adversarial training | 4 |
| A tree‐based gene–environment interaction analysis with rare features | 4 |
| Issue Information | 4 |
| Rank‐Based Inference for Conditional Independence Graph With Missing Values | 4 |
| On difference‐based gradient estimation in nonparametric regression | 4 |
| Input‐response space‐filling designs incorporating response uncertainty | 4 |
| Nonparametric clustering of RNA‐sequencing data | 3 |
| Issue Information | 3 |
| Model‐Based Recursive Partitioning for Discrete Event Times | 3 |
| Extracting Genetically‐Imputed Causal Features From ECG Data | 3 |
| Sparse Bayesian variable selection in high‐dimensional logistic regression models with correlated priors | 3 |
| eRPCA: Robust Principal Component Analysis for Exponential Family Distributions | 3 |
| An Improved D2GAN‐based oversampling algorithm for imbalanced data classification | 3 |
| Bayesian inference for nonprobability samples with nonignorable missingness | 3 |
| Local influence analysis for the sliced average third‐moment estimation | 3 |
| The analysis of association rules: Latent class analysis | 3 |
| Robust and Differentially Private Principal Component Analysis | 3 |
| A finely tuned deep transfer learning algorithm to compare outsole images | 3 |
| Driving mode analysis—How uncertain functional inputs propagate to an output | 2 |
| Development and validation of models for two‐week mortality of inpatients with COVID‐19 infection: A large prospective cohort study | 2 |
| Issue Information | 2 |
| Adversarially robust subspace learning in the spiked covariance model | 2 |
| Online Updating Composite Quantile Regression for Streaming Data | 2 |
| Semiparametric detection of changepoints in location, scale, and copula | 2 |
| A Novel Approach for APT Detection Based on Ensemble Learning Model | 2 |
| Issue Information | 2 |
| Recovering the Number of Clusters From a Laplacian Matrix by Nuclear Norm Penalization | 2 |
| | 2 |
| Biclustering high‐frequency financial time series based on information theory | 2 |
| Adaptive Weighted Regularized QRGRU Algorithm and Its Application in Stock Price Prediction | 2 |
| Data‐driven stochastic model for quantifying the interplay between amyloid‐beta and calcium levels in Alzheimer's disease | 2 |
| Interaction Tests With Covariate‐Adaptive Randomization | 2 |
| A new formulation of sparse multiple kernel k‐means clustering and its applications | 2 |
| Low‐Dimensional Adaptive Neural Network Regression With Directional Change Detection via Nuclear Norm Penalization | 2 |
| | 2 |
| Bayesian Hybrid Model Search and Averaging for Sparse Gaussian Process Regression | 2 |
| A Conversational Assistant for Democratization of Data Visualization: A Comparative Study of Two Approaches of Interaction | 2 |
| Cost‐sensitive classification with time constraint on incomplete data | 2 |