Data Mining and Knowledge Discovery

Papers
(The TQCC of Data Mining and Knowledge Discovery is 7. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-05-01 to 2025-05-01.)
ArticleCitations
A probabilistic model for API contract specification retrieval focusing on the openAPI standard142
Counterfactual explanations as interventions in latent space106
Joint dynamic topic model for recognition of lead-lag relationship in two text corpora106
Who can receive the pass? A computational model for quantifying availability in soccer88
Hydra: competing convolutional kernels for fast and accurate time series classification63
Traffic forecasting on new roads using spatial contrastive pre-training (SCPT)57
Exploiting sensor data in professional road cycling: personalized data-driven approach for frequent fitness monitoring55
Discord-based counterfactual explanations for time series classification48
Representing ensembles of networks for fuzzy cluster analysis: a case study48
Thompson sampling-based recursive block elimination for dynamic assignment under limited budget in pure-exploration45
Correction: Marginal effects for non-linear prediction functions41
TCMI: a non-parametric mutual-dependence estimator for multivariate continuous distributions41
The grammar of interactive explanatory model analysis38
Extending greedy feature selection algorithms to multiple solutions37
Wisdom of the contexts: active ensemble learning for contextual anomaly detection35
VEM$$^2$$L: an easy but effective framework for fusing text and structure knowledge on sparse knowledge graph completion35
MMA: metadata supported multi-variate attention for onset detection and prediction34
Reflective-net: learning from explanations34
Dynamic cyber risk estimation with competitive quantile autoregression33
Neural content-aware collaborative filtering for cold-start music recommendation32
Correction: Bake off redux: a review and experimental evaluation of recent time series classification algorithms28
TenGAN: adversarially generating multiplex tensor graphs24
SALτ: efficiently stopping TAR by improving priors estimates21
Improving neural network’s robustness on tabular data with D-layers20
Approximation trees: statistical reproducibility in model distillation20
On computing exact means of time series using the move-split-merge metric20
Explainable decomposition of nested dense subgraphs19
Robust explainer recommendation for time series classification19
Interpretable representations in explainable AI: from theory to practice19
On the evaluation of outlier detection and one-class classification: a comparative study of algorithms, model selection, and ensembles17
Generalized core maintenance of dynamic bipartite graphs17
AA-forecast: anomaly-aware forecast for extreme events17
What do anomaly scores actually mean? Dynamic characteristics beyond accuracy16
MultiRocket: multiple pooling operators and transformations for fast and effective time series classification16
On GNN explainability with activation rules16
Correlations between random projections and the bivariate normal16
Multilayer horizontal visibility graphs for multivariate time series analysis16
Robust and sparse multinomial regression in high dimensions15
Contextualization of soccer analysis with tactical periodization and machine learning15
Topic change point detection using a mixed Bayesian model15
Explanatory artificial intelligence (YAI): human-centered explanations of explainable AI and complex data15
Exploiting second-order dissimilarity representations for hierarchical clustering and visualization15
Explainable and interpretable machine learning and data mining15
EmbAssi: embedding assignment costs for similarity search in large graph databases14
Coupled block diagonal regularization for multi-view subspace clustering14
Hyperbolic node embedding for temporal networks14
Efficient algorithms for fair clustering with a new notion of fairness14
Sky-signatures: detecting and characterizing recurrent behavior in sequential data14
A comprehensive taxonomy for explainable artificial intelligence: a systematic survey of surveys on methods and concepts14
Algorithmic fairness datasets: the story so far13
Random walks with variable restarts for negative-example-informed label propagation13
Mondrian forest for data stream classification under memory constraints13
Bounding the family-wise error rate in local causal discovery using Rademacher averages12
Unsupervised feature based algorithms for time series extrinsic regression12
BROCCOLI: overlapping and outlier-robust biclustering through proximal stochastic gradient descent12
Inferring tie strength in temporal networks11
Missing value replacement in strings and applications11
PAC-Bayesian lifelong learning for multi-armed bandits11
Hypercore decomposition for non-fragile hyperedges: concepts, algorithms, observations, and applications11
Unsupervised domain adaptation with non-stochastic missing data11
Randomnet: clustering time series using untrained deep neural networks11
NICE: an algorithm for nearest instance counterfactual explanations11
K-plex cover pooling for graph neural networks11
Link prediction in dynamic networks using random dot product graphs11
An eager splitting strategy for online decision trees in ensembles10
ClaSP: parameter-free time series segmentation10
Making clusterings fairer by post-processing: algorithms, complexity results and experiments10
When graph convolution meets double attention: online privacy disclosure detection with multi-label text classification10
Structural learning of simple staged trees10
Synwalk: community detection via random walk modelling10
Model-agnostic feature importance and effects with dependent features: a conditional subgroup approach10
Detach-ROCKET: sequential feature selection for time series classification with random convolutional kernels9
Grouped feature importance and combined features effect plot9
Temporal state change Bayesian networks for modeling of evolving multivariate state sequences: model, structure discovery and parameter estimation9
Dynamic self-paced sampling ensemble for highly imbalanced and class-overlapped data classification9
PETSC: pattern-based embedding for time series classification9
Intersectional fair ranking via subgroup divergence9
Central node identification via weighted kernel density estimation8
Sentiment analysis in tweets: an assessment study from classical to modern word representation models8
Relational Learning Analysis of Social Politics using Knowledge Graph Embedding8
Efficient set-valued prediction in multi-class classification8
Knowledge graph embedding closed under composition8
Robust subgroup discovery8
A Lagrangian-based score for assessing the quality of pairwise constraints in semi-supervised clustering8
One-shot relational learning for extrapolation reasoning on temporal knowledge graphs7
MrTF: model refinery for transductive federated learning7
Modelling event sequence data by type-wise neural point process7
Continuous treatment effect estimation via generative adversarial de-confounding7
Binary quantification and dataset shift: an experimental investigation7
Bake off redux: a review and experimental evaluation of recent time series classification algorithms7
Marginal effects for non-linear prediction functions7
An external stability audit framework to test the validity of personality prediction in AI hiring7
i-Align: an interpretable knowledge graph alignment model7
A tale of two roles: exploring topic-specific susceptibility and influence in cascade prediction7
Handling imbalance in hierarchical classification problems using local classifiers approaches7
Structural iterative lexicographic autoencoded node representation7
0.029692888259888