Annals of Statistics

Papers
(The median citation count of Annals of Statistics is 3. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-08-01 to 2025-08-01.)
ArticleCitations
Estimation and inference for minimizer and minimum of convex functions: Optimality, adaptivity and uncertainty principles142
A sieve stochastic gradient descent estimator for online nonparametric regression in Sobolev ellipsoids84
Inference in Ising models on dense regular graphs74
Efficiency in local differential privacy72
Robust k-means clustering for distributions with two moments67
Scalable estimation and inference for censored quantile regression process53
Universal rank inference via residual subsampling with application to large networks42
Deep learning for the partially linear Cox model36
Foundations of structural causal models with cycles and latent variables36
Half-trek criterion for identifiability of latent variable models34
Parametric copula adjusted for non- and semiparametric regression33
Measuring dependence in the Wasserstein distance for Bayesian nonparametric models33
Online inference with multi-modal likelihood functions32
Learning sparse graphons and the generalized Kesten–Stigum threshold30
On high-dimensional Poisson models with measurement error: Hypothesis testing for nonlinear nonconvex optimization30
A general characterization of optimal tie-breaker designs29
Inference for low-rank tensors—no need to debias28
Uniform convergence of local Fréchet regression with applications to locating extrema and time warping for metric space valued trajectories28
Consistent inference for diffusions from low frequency measurements27
Community detection with dependent connectivity27
Asymptotic distributions of high-dimensional distance correlation inference26
Adaptive and robust multi-task learning26
Admissible ways of merging p-values under arbitrary dependence26
Asymptotic analysis of synchrosqueezing transform—toward statistical inference with nonlinear-type time-frequency analysis25
Inference for low-rank models23
Rate-optimal estimation of mixed semimartingales23
Environment invariant linear least squares22
Sharp optimality for high-dimensional covariance testing under sparse signals22
Nonparametric classification with missing data22
Refined Cramér-type moderate deviation theorems for general self-normalized sums with applications to dependent random variables and winsorized mean21
On posterior consistency of data assimilation with Gaussian process priors: The 2D-Navier–Stokes equations21
Supervised homogeneity fusion: A combinatorial approach21
Change-point inference in high-dimensional regression models under temporal dependence20
Spectral estimation of Hawkes processes from count data20
Limiting distributions for eigenvalues of sample correlation matrices from heavy-tailed populations20
Rank and factor loadings estimation in time series tensor factor model by pre-averaging20
Order-of-addition orthogonal arrays to study the effect of treatment ordering19
Is infinity that far? A Bayesian nonparametric perspective of finite mixture models19
Consistent order selection for ARFIMA processes19
Testing goodness-of-fit and conditional independence with approximate co-sufficient sampling19
Wilks’ theorem for semiparametric regressions with weakly dependent data19
Iterative algorithm for discrete structure recovery18
New Edgeworth-type expansions with finite sample guarantees18
The numeraire e-variable and reverse information projection17
Plugin estimation of smooth optimal transport maps17
On the convergence of coordinate ascent variational inference17
Spatial dependence and space–time trend in extreme events17
Asymptotic distribution of maximum likelihood estimator in generalized linear mixed models with crossed random effects17
Consistency of Bayesian inference for multivariate max-stable distributions17
A nonparametric test for elliptical distribution based on kernel embedding of probabilities16
Distributed nonparametric function estimation: Optimal rate of convergence and cost of adaptation16
Surprises in high-dimensional ridgeless least squares interpolation16
General spatio-temporal factor models for high-dimensional random fields on a lattice16
Projected state-action balancing weights for offline reinforcement learning16
Learning mixtures of permutations: Groups of pairwise comparisons and combinatorial method of moments16
Toward theoretical understandings of robust Markov decision processes: Sample complexity and asymptotics16
The Lasso with general Gaussian designs with applications to hypothesis testing16
Sup-norm adaptive drift estimation for multivariate nonreversible diffusions15
Wald tests when restrictions are locally singular15
Computational lower bounds for graphon estimation via low-degree polynomials15
Backfitting for large scale crossed random effects regressions15
Consistency of invariance-based randomization tests15
Minimax rate for multivariate data under componentwise local differential privacy constraints15
Time-uniform central limit theory and asymptotic confidence sequences14
Transfer learning for contextual multi-armed bandits14
Minimax rate of distribution estimation on unknown submanifolds under adversarial losses14
Edgeworth expansions for network moments14
Optimal rates for independence testing via U-statistic permutation tests14
Linear biomarker combination for constrained classification13
Minimax nonparametric estimation of pure quantum states13
Detecting multiple replicating signals using adaptive filtering procedures13
Testing for independence in high dimensions based on empirical copulas13
Testing nonparametric shape restrictions12
Noisy linear inverse problems under convex constraints: Exact risk asymptotics in high dimensions12
Sharp adaptive and pathwise stable similarity testing for scalar ergodic diffusions12
On least squares estimation under heteroscedastic and heavy-tailed errors12
Change acceleration and detection12
Confidence regions near singular information and boundary points with applications to mixed models12
Dispersal density estimation across scales11
Finite-sample complexity of sequential Monte Carlo estimators11
Dimension free ridge regression11
The Stein effect for Fréchet means11
Interactive versus noninteractive locally differentially private estimation: Two elbows for the quadratic functional11
Adaptive transfer learning11
A new approach to tests and confidence bands for distribution functions11
Approximate Message Passing algorithms for rotationally invariant matrices11
Testing community structure for hypergraphs11
Marginal singularity and the benefits of labels in covariate-shift10
On the sample complexity of entropic optimal transport10
Carving model-free inference10
Joint sequential detection and isolation for dependent data streams10
ℓ2 inference for change points in high-dimensional time series via a Two-Way MOSUM10
ARK: Robust knockoffs inference with coupling10
Nonlinear global Fréchet regression for random objects via weak conditional expectation10
Bridging factor and sparse models10
Ridge regression revisited: Debiasing, thresholding and bootstrap10
On universally consistent and fully distribution-free rank tests of vector independence10
Adaptive estimation in multivariate response regression with hidden variables10
Tensor clustering with planted structures: Statistical optimality and computational limits10
Correction note: “Asymptotic spectral theory for nonlinear time series”10
Multivariate trend filtering for lattice data9
Statistical inference for principal components of spiked covariance matrices9
Existence and uniqueness of the Kronecker covariance MLE9
Asymptotic distributions of largest Pearson correlation coefficients under dependent structures9
Heavy-tailed Bayesian nonparametric adaptation9
Ensemble projection pursuit for general nonparametric regression9
The online closure principle9
Conformal inference for random objects9
Matching recovery threshold for correlated random graphs9
Semiparametric inference based on adaptively collected data9
Analysis of generalized Bregman surrogate algorithms for nonsmooth nonconvex statistical learning9
Adaptive variational Bayes: Optimality, computation and applications9
Rerandomization with diminishing covariate imbalance and diverging number of covariates9
Testing high-dimensional regression coefficients in linear models9
Large-dimensional independent component analysis: Statistical optimality and computational tractability9
Local convexity of the TAP free energy and AMP convergence for Z2-synchronization9
A nonparametric doubly robust test for a continuous treatment effect9
Affine-equivariant inference for multivariate location under Lp loss functions9
On robustness and local differential privacy8
Embedding distributional data8
A general framework to quantify deviations from structural assumptions in the analysis of nonstationary function-valued processes8
Sparse high-dimensional linear regression. Estimating squared error and a phase transition8
Bootstrapping persistent Betti numbers and other stabilizing statistics8
Post-selection inference via algorithmic stability8
Global and individualized community detection in inhomogeneous multilayer networks8
Peskun–Tierney ordering for Markovian Monte Carlo: Beyond the reversible scenario8
Efficient estimation of the maximal association between multiple predictors and a survival outcome8
Inference for a two-stage enrichment design8
Local permutation tests for conditional independence8
Spectral analysis of gram matrices with missing at random observations: Convergence, central limit theorems, and applications in statistical inference8
Stereographic Markov chain Monte Carlo7
Optimal false discovery rate control for large scale multiple testing with auxiliary information7
Rates of estimation for high-dimensional multireference alignment7
Optimal signal detection in some spiked random matrix models: Likelihood ratio tests and linear spectral statistics7
Adaptive novelty detection with false discovery rate guarantee7
Deep neural networks for nonparametric interaction models with diverging dimension7
Two-level parallel flats designs7
Some theory about efficient dimension reduction regarding the interaction between two responses7
Bootstrap-assisted inference for generalized Grenander-type estimators7
Convex regression in multidimensions: Suboptimality of least squares estimators7
Self-normalized Cramér type moderate deviation theorem for Gaussian approximation7
Estimating a density near an unknown manifold: A Bayesian nonparametric approach7
General and feasible tests with multiply-imputed datasets7
Total positivity in multivariate extremes7
Improved covariance estimation: Optimal robustness and sub-Gaussian guarantees under heavy tails6
Approximation error from discretizations and its applications6
S-estimation in linear models with structured covariance matrices6
Estimation of the spectral measure from convex combinations of regularly varying random vectors6
A study of orthogonal array-based designs under a broad class of space-filling criteria6
Augmented minimax linear estimation6
Conditional calibration for false discovery rate control under dependence6
On the existence of powerful p-values and e-values for composite hypotheses6
Optimal subgroup selection6
On minimax optimality of sparse Bayes predictive density estimates6
Model selection in the space of Gaussian models invariant by symmetry6
Distributed adaptive Gaussian mean estimation with unknown variance: Interactive protocol helps adaptation6
Central limit theorem and bootstrap approximation in high dimensions: Near 1/n rates via implicit smoothing6
A new and flexible design construction for orthogonal arrays for modern applications6
Universal regression with adversarial responses6
Conditional predictive inference for stable algorithms6
Statistical complexity and optimal algorithms for nonlinear ridge bandits6
Grouped variable selection with discrete optimization: Computational and statistical perspectives6
Gaussian process regression in the flat limit6
Learning low-dimensional nonlinear structures from high-dimensional noisy data: An integral operator approach5
Skewed Bernstein–von Mises theorem and skew-modal approximations5
The curse of overparametrization in adversarial training: Precise analysis of robust generalization for random features regression5
Editorial: Memorial issue for Charles Stein5
Optimality of spectral clustering in the Gaussian mixture model5
One-step estimation of differentiable Hilbert-valued parameters5
Precise error rates for computationally efficient testing5
MARS via LASSO5
How do noise tails impact on deep ReLU networks?5
Stein’s method of normal approximation: Some recollections and reflections5
High-dimensional inference for dynamic treatment effects5
Variable selection consistency of Gaussian process regression5
Boosted nonparametric hazards with time-dependent covariates5
Optimal policy evaluation using kernel-based temporal difference methods5
The generalization error of max-margin linear classifiers: Benign overfitting and high dimensional asymptotics in the overparametrized regime5
A statistical framework of watermarks for large language models: Pivot, detection efficiency and optimal rules5
Spectral statistics of sample block correlation matrices5
A conformal test of linear models via permutation-augmented regressions5
Early stopping for L2-boosting in high-dimensional linear models5
Infinite-dimensional gradient-based descent for alpha-divergence minimisation5
Consistent nonparametric estimation for heavy-tailed sparse graphs5
Integrative methods for post-selection inference under convex constraints5
False discovery rate control with unknown null distribution: Is it possible to mimic the oracle?5
Higher criticism to compare two large frequency tables, with sensitivity to possible rare and weak differences5
Convergence of de Finetti’s mixing measure in latent structure models for observed exchangeable sequences5
Extreme value inference for heterogeneous power law data5
Rate-optimal robust estimation of high-dimensional vector autoregressive models5
Random graph asymptotics for treatment effect estimation under network interference4
Isotonic regression with unknown permutations: Statistics, computation and adaptation4
The edge of discovery: Controlling the local false discovery rate at the margin4
Statistical-computational trade-offs in tensor PCA and related problems via communication complexity4
Uniform consistency in nonparametric mixture models4
Gromov–Wasserstein distances: Entropic regularization, duality and sample complexity4
Adaptive learning rates for support vector machines working on data with low intrinsic dimension4
Set structured global empirical risk minimizers are rate optimal in general dimensions4
Empirical tail copulas for functional data4
Optimal difference-based variance estimators in time series: A general framework4
Statistical guarantees for Bayesian uncertainty quantification in nonlinear inverse problems with Gaussian process priors4
StarTrek: Combinatorial variable selection with false discovery rate control4
Dimension-free mixing times of Gibbs samplers for Bayesian hierarchical models4
On blockwise and reference panel-based estimators for genetic data prediction in high dimensions4
Exact minimax risk for linear least squares, and the lower tail of sample covariance matrices4
Stochastic continuum-armed bandits with additive models: Minimax regrets and adaptive algorithm4
Testing for practically significant dependencies in high dimensions via bootstrapping maxima of U-statistics4
Quantile processes and their applications in finite populations4
Deep approximate policy iteration4
Non-independent component analysis4
Concentration of discrepancy-based approximate Bayesian computation via Rademacher complexity4
On the statistical complexity of sample amplification4
Projective, sparse and learnable latent position network models4
Fundamental limits of low-rank matrix estimation with diverging aspect ratios4
Propriety of the reference posterior distribution in Gaussian process modeling4
Asymptotic normality and optimality in nonsmooth stochastic approximation4
Optimal change-point estimation in time series3
Unified algorithms for RL with Decision-Estimation Coefficients: PAC, reward-free, preference-based learning and beyond3
Correction note: “Statistical inference for the mean outcome under a possibly nonunique optimal treatment rule”3
Sparse anomaly detection across referentials: A rank-based higher criticism approach3
Evidence factors from multiple, possibly invalid, instrumental variables3
Online change-point detection for matrix-valued time series with latent two-way factor structure3
Metric statistics: Exploration and inference for random objects with distance profiles3
AutoRegressive approximations to nonstationary time series with inference and applications3
On an extension of the promotion time cure model3
Multiscale Bayesian survival analysis3
Extreme conditional expectile estimation in heavy-tailed heteroscedastic regression models3
Semiparametric optimal estimation with nonignorable nonresponse data3
Optimal heteroskedasticity testing in nonparametric regression3
Inference for extremal regression with dependent heavy-tailed data3
Testing equivalence of clustering3
Robust sub-Gaussian estimation of a mean vector in nearly linear time3
A statistical framework for analyzing shape in a time series of random geometric objects3
Complexity analysis of Bayesian learning of high-dimensional DAG models and their equivalence classes3
Sharp global convergence guarantees for iterative nonconvex optimization with random data3
Empirical partially Bayes multiple testing and compound χ2 decisions3
Optimization hierarchy for fair statistical decision problems3
Approximate kernel PCA: Computational versus statistical trade-off3
Choosing between persistent and stationary volatility3
Increasing dimension asymptotics for two-way crossed mixed effect models3
Tensor factor model estimation by iterative projection3
Variable selection, monotone likelihood ratio and group sparsity3
Functional sufficient dimension reduction through average Fréchet derivatives3
Generalization error bounds of dynamic treatment regimes in penalized regression-based learning3
Adaptive estimation of multivariate piecewise polynomials and bounded variation functions by optimal decision trees3
A cross-validation framework for signal denoising with applications to trend filtering, dyadic CART and beyond3
Parameter estimation in nonlinear multivariate stochastic differential equations based on splitting schemes3
Simplex quantile regression without crossing3
Dimension reduction for functional data based on weak conditional moments3
Covariance estimation under one-bit quantization3
0.16010117530823