Data Mining and Knowledge Discovery

Papers
(The TQCC of Data Mining and Knowledge Discovery is 6. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-02-01 to 2024-02-01.)
ArticleCitations
InceptionTime: Finding AlexNet for time series classification503
ROCKET: exceptionally fast and accurate time series classification using random convolutional kernels336
The great multivariate time series classification bake off: a review and experimental evaluation of recent algorithmic advances176
TS-CHIEF: a scalable and accurate forest algorithm for time series classification108
A survey of community detection methods in multilayer networks67
Challenges in benchmarking stream learning algorithms with real-world data55
Counterfactual explanations and how to find them: literature review and benchmarking47
MultiRocket: multiple pooling operators and transformations for fast and effective time series classification40
An efficient K-means clustering algorithm for tall data36
Deep soccer analytics: learning an action-value function for evaluating soccer players35
Matrix profile goes MAD: variable-length motif and discord discovery in data series34
Deep graph similarity learning: a survey32
TEASER: early and accurate time series classification30
Time series extrinsic regression30
A comprehensive taxonomy for explainable artificial intelligence: a systematic survey of surveys on methods and concepts30
Relational Learning Analysis of Social Politics using Knowledge Graph Embedding28
Smoothed dilated convolutions for improved dense prediction27
Fake review detection on online E-commerce platforms: a systematic literature review27
ColluEagle: collusive review spammer detection using Markov random fields26
Time series motifs discovery under DTW allows more robust discovery of conserved structure26
Scalable attack on graph data by injecting vicious nodes23
Comparison of novelty detection methods for multispectral images in rover-based planetary exploration missions23
Word-class embeddings for multiclass text classification22
Data-driven detection of counterpressing in professional football20
End-to-end deep representation learning for time series clustering: a comparative study20
The Swiss army knife of time series data mining: ten useful things you can do with the matrix profile and ten lines of code19
XEM: An explainable-by-design ensemble method for multivariate time series classification18
Forecast evaluation for data scientists: common pitfalls and best practices17
Improving embedded knowledge graph multi-hop question answering by introducing relational chain reasoning17
Treant: training evasion-aware decision trees16
Active learning for hierarchical multi-label classification16
Gaussian bandwidth selection for manifold learning and classification15
A framework for deep constrained clustering15
Multi-label learning with missing and completely unobserved labels15
Efficient mining of the most significant patterns with permutation testing15
ABBA: adaptive Brownian bridge-based symbolic aggregation of time series15
INK: knowledge graph embeddings for node classification14
Algorithmic fairness datasets: the story so far14
MIDIA: exploring denoising autoencoders for missing data imputation14
A survey of deep network techniques all classifiers can adopt13
struc2gauss: Structural role preserving network embedding via Gaussian embedding13
The area under the ROC curve as a measure of clustering quality12
Dataset2Vec: learning dataset meta-features12
Grouped feature importance and combined features effect plot12
Sequential recommendation with metric models based on frequent sequences12
User preference and embedding learning with implicit feedback for recommender systems12
Graph convolutional networks for traffic forecasting with missing values11
Efficient set-valued prediction in multi-class classification11
VFC-SMOTE: very fast continuous synthetic minority oversampling for evolving data streams10
Cost-sensitive ensemble learning: a unifying framework10
Extending greedy feature selection algorithms to multiple solutions10
Detecting virtual concept drift of regressors without ground truth values10
Fair-by-design matching9
BROCCOLI: overlapping and outlier-robust biclustering through proximal stochastic gradient descent9
For real: a thorough look at numeric attributes in subgroup discovery9
An ultra-fast time series distance measure to allow data mining in more complex real-world deployments9
A deep multimodal model for bug localization9
Boosting house price predictions using geo-spatial network embedding9
PETSC: pattern-based embedding for time series classification8
Feature extraction from unequal length heterogeneous EHR time series via dynamic time warping and tensor decomposition8
Discrete-time survival forests with Hellinger distance decision trees8
Large-scale network motif analysis using compression8
Guided sampling for large graphs8
Who can receive the pass? A computational model for quantifying availability in soccer8
Controlling hallucinations at word level in data-to-text generation8
Robust subgroup discovery8
Bayesian mean-parameterized nonnegative binary matrix factorization8
Expected passes8
Introducing time series snippets: a new primitive for summarizing long time series7
Simplification of genetic programs: a literature survey7
Natural language techniques supporting decision modelers7
Mining communities and their descriptions on attributed graphs: a survey7
Benchmarking and survey of explanation methods for black box models7
Chebyshev approaches for imbalanced data streams regression models7
Early abandoning and pruning for elastic distances including dynamic time warping7
An efficient procedure for mining egocentric temporal motifs7
Computing exact P-values for community detection7
Model-agnostic feature importance and effects with dependent features: a conditional subgroup approach6
Simple and effective neural-free soft-cluster embeddings for item cold-start recommendations6
Detecting singleton spams in reviews via learning deep anomalous temporal aspect-sentiment patterns6
The minimum description length principle for pattern mining: a survey6
SMILE: a feature-based temporal abstraction framework for event-interval sequence classification6
What’s in a name? – gender classification of names with character based machine learning models6
Recurring concept memory management in data streams: exploiting data stream concept evolution to improve performance and transparency6
Sufficient dimension reduction for average causal effect estimation6
POI recommendation with queuing time and user interest awareness6
Hydra: competing convolutional kernels for fast and accurate time series classification6
TEAGS: time-aware text embedding approach to generate subgraphs6
Interpretability, personalization and reliability of a machine learning based clinical decision support system6
An overlap sensitive neural network for class imbalanced data6
SPEck: mining statistically-significant sequential patterns efficiently with exact sampling6
Neural content-aware collaborative filtering for cold-start music recommendation6
0.038684129714966