Data Mining and Knowledge Discovery

Papers
(The TQCC of Data Mining and Knowledge Discovery is 6. The table below lists the papers that meet or exceed that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2020-10-01 to 2024-10-01.)
Article | Citations
The great multivariate time series classification bake off: a review and experimental evaluation of recent algorithmic advances | 244
Counterfactual explanations and how to find them: literature review and benchmarking | 100
A survey of community detection methods in multilayer networks | 93
A comprehensive taxonomy for explainable artificial intelligence: a systematic survey of surveys on methods and concepts | 80
MultiRocket: multiple pooling operators and transformations for fast and effective time series classification | 77
Time series extrinsic regression | 50
Deep graph similarity learning: a survey | 44
Fake review detection on online E-commerce platforms: a systematic literature review | 41
Forecast evaluation for data scientists: common pitfalls and best practices | 38
Time series motifs discovery under DTW allows more robust discovery of conserved structure | 34
Improving embedded knowledge graph multi-hop question answering by introducing relational chain reasoning | 34
Benchmarking and survey of explanation methods for black box models | 34
Relational Learning Analysis of Social Politics using Knowledge Graph Embedding | 34
End-to-end deep representation learning for time series clustering: a comparative study | 33
Smoothed dilated convolutions for improved dense prediction | 32
XEM: An explainable-by-design ensemble method for multivariate time series classification | 32
Algorithmic fairness datasets: the story so far | 31
Word-class embeddings for multiclass text classification | 26
Data-driven detection of counterpressing in professional football | 25
Graph convolutional networks for traffic forecasting with missing values | 22
Improving position encoding of transformers for multivariate time series classification | 20
The area under the ROC curve as a measure of clustering quality | 19
Hydra: competing convolutional kernels for fast and accurate time series classification | 19
Dataset2Vec: learning dataset meta-features | 19
Model-agnostic feature importance and effects with dependent features: a conditional subgroup approach | 18
Grouped feature importance and combined features effect plot | 18
A framework for deep constrained clustering | 18
Multi-label learning with missing and completely unobserved labels | 18
Efficient set-valued prediction in multi-class classification | 17
User preference and embedding learning with implicit feedback for recommender systems | 16
Boosting house price predictions using geo-spatial network embedding | 16
Hierarchical message-passing graph neural networks | 15
VFC-SMOTE: very fast continuous synthetic minority oversampling for evolving data streams | 15
Cost-sensitive ensemble learning: a unifying framework | 15
A survey of deep network techniques all classifiers can adopt | 15
Detecting virtual concept drift of regressors without ground truth values | 15
INK: knowledge graph embeddings for node classification | 15
Expected passes | 14
Sequence graph transform (SGT): a feature embedding function for sequence data mining | 13
Explanatory artificial intelligence (YAI): human-centered explanations of explainable AI and complex data | 12
Extending greedy feature selection algorithms to multiple solutions | 12
Simplification of genetic programs: a literature survey | 12
Feature extraction from unequal length heterogeneous EHR time series via dynamic time warping and tensor decomposition | 12
Bake off redux: a review and experimental evaluation of recent time series classification algorithms | 12
Knowledge graph embedding methods for entity alignment: experimental review | 12
Sequential recommendation with metric models based on frequent sequences | 12
A deep multimodal model for bug localization | 11
Who can receive the pass? A computational model for quantifying availability in soccer | 11
An efficient procedure for mining egocentric temporal motifs | 11
POI recommendation with queuing time and user interest awareness | 11
Interpretability, personalization and reliability of a machine learning based clinical decision support system | 10
PETSC: pattern-based embedding for time series classification | 10
Chebyshev approaches for imbalanced data streams regression models | 10
Controlling hallucinations at word level in data-to-text generation | 10
Robust subgroup discovery | 10
BROCCOLI: overlapping and outlier-robust biclustering through proximal stochastic gradient descent | 10
Stable and actionable explanations of black-box models through factual and counterfactual rules | 9
Sufficient dimension reduction for average causal effect estimation | 9
Predictive modeling of infant mortality | 9
What’s in a name? – gender classification of names with character based machine learning models | 9
Novel features for time series analysis: a complex networks approach | 9
SPEck: mining statistically-significant sequential patterns efficiently with exact sampling | 9
The minimum description length principle for pattern mining: a survey | 9
Informative pseudo-labeling for graph neural networks with few labels | 9
Early abandoning and pruning for elastic distances including dynamic time warping | 9
Detecting singleton spams in reviews via learning deep anomalous temporal aspect-sentiment patterns | 9
A recurrent neural network architecture to model physical activity energy expenditure in older people | 9
Synwalk: community detection via random walk modelling | 9
ClaSP: parameter-free time series segmentation | 9
Mining full, inner and tail periodic patterns with perfect, imperfect and asynchronous periodicity simultaneously | 8
Natural language techniques supporting decision modelers | 8
NICE: an algorithm for nearest instance counterfactual explanations | 8
Mining communities and their descriptions on attributed graphs: a survey | 8
SMILE: a feature-based temporal abstraction framework for event-interval sequence classification | 8
The grammar of interactive explanatory model analysis | 8
Adversarial balancing-based representation learning for causal effect inference with observational data | 8
On GNN explainability with activation rules | 8
Neural content-aware collaborative filtering for cold-start music recommendation | 8
Fast and robust video-based exercise classification via body pose tracking and scalable multivariate time series classifiers | 8
Time series clustering in linear time complexity | 8
Interpreting deep learning models with marginal attribution by conditioning on quantiles | 7
Sentiment analysis in tweets: an assessment study from classical to modern word representation models | 7
An external stability audit framework to test the validity of personality prediction in AI hiring | 7
An overlap sensitive neural network for class imbalanced data | 7
An adaptive meta-heuristic for music plagiarism detection based on text similarity and clustering | 7
Recurring concept memory management in data streams: exploiting data stream concept evolution to improve performance and transparency | 7
Individualized passenger travel pattern multi-clustering based on graph regularized tensor latent dirichlet allocation | 7
Continuous treatment effect estimation via generative adversarial de-confounding | 6
Federated singular value decomposition for high-dimensional data | 6
Fair detection of poisoning attacks in federated learning on non-i.i.d. data | 6
Implicit consensus clustering from multiple graphs | 6
The network-untangling problem: from interactions to activity timelines | 6
Sparse randomized shortest paths routing with Tsallis divergence regularization | 6
Isolation kernel: the X factor in efficient and effective large scale online kernel learning | 6
Parameterizing the cost function of dynamic time warping with application to time series classification | 6
Conclusive local interpretation rules for random forests | 6
Better trees: an empirical study on hyperparameter tuning of classification decision tree induction algorithms | 6