VLDB Journal

Papers
(The TQCC of VLDB Journal is 7. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-11-01 to 2024-11-01.)
ArticleCitations
Data collection and quality challenges in deep learning: a data-centric AI perspective138
Fast-adapting and privacy-preserving federated recommender system52
Managing bias and unfairness in data for decision support: a survey of machine learning and data engineering approaches to identify and mitigate bias and unfairness within data management and analytic48
Fairness in rankings and recommendations: an overview45
A survey of RDF stores & SPARQL engines for querying knowledge graphs41
A model and query language for temporal graph databases37
MDDE: multitasking distributed differential evolution for privacy-preserving database fragmentation35
A survey on outlier explanations32
A benchmark and comprehensive survey on knowledge graph entity alignment via representation learning32
A survey on deep learning approaches for text-to-SQL30
Unsupervised and scalable subsequence anomaly detection in large data series29
LineageChain: a fine-grained, secure and efficient data provenance system for blockchains29
Data dependencies for query optimization: a survey27
Location- and keyword-based querying of geo-textual data: a survey26
Cross-chain deals and adversarial commerce23
Towards efficient solutions of bitruss decomposition for large-scale bipartite graphs23
Memory-aware framework for fast and scalable second-order random walk over billion-edge natural graphs22
Tidy Tuples and Flying Start: fast compilation and fast execution of relational queries in Umbra22
Distributed temporal graph analytics with GRADOOP19
Privacy and efficiency guaranteed social subgraph matching17
Dragoon: a hybrid and efficient big trajectory management system for offline and online analytics16
Visually aware recommendation with aesthetic features15
A survey on semantic schema discovery15
PrefixFPM: a parallel framework for general-purpose mining of frequent and closed patterns14
Answering reachability and K-reach queries on large graphs with label constraints14
eRiskCom: an e-commerce risky community detection platform14
On entity alignment at scale14
$$\hbox {CDBTune}^{+}$$: An efficient deep reinforcement learning-based automatic cloud database tuning system14
A dataspace-based framework for OLAP analyses in a high-variety multistore13
A survey on the evolution of stream processing systems13
GeoSparkViz: a cluster computing system for visualizing massive-scale geospatial data13
I/O efficient k-truss community search in massive graphs13
In-Memory Interval Joins13
Data distribution debugging in machine learning pipelines12
HFUL: a hybrid framework for user account linkage across location-aware social networks12
Efficient kNN query for moving objects on time-dependent road networks12
Fast subgraph query processing and subgraph matching via static and dynamic equivalences12
Efficient and effective ER with progressive blocking11
Parallel mining of large maximal quasi-cliques10
A cost model for random access queries in document stores10
Efficient Hop-constrained s-t Simple Path Enumeration10
ByShard: sharding in a Byzantine environment10
Algorithms for the discovery of embedded functional dependencies10
Fast data series indexing for in-memory data9
Pivot selection algorithms in metric spaces: a survey and experimental study9
G-thinker: a general distributed framework for finding qualified subgraphs in a big graph with load balancing9
Better database cost/performance via batched I/O on programmable SSD9
Leveraging range joins for the computation of overlap joins8
Accelerated butterfly counting with vertex priority on bipartite graphs8
Unified route representation learning for multi-modal transportation recommendation with spatiotemporal pre-training8
Effective entity matching with transformers8
Model averaging in distributed machine learning: a case study with Apache Spark8
VolcanoML: speeding up end-to-end AutoML via scalable search space decomposition8
Semantic embedding for regions of interest7
Efficient distributed discovery of bidirectional order dependencies7
(p,q)-biclique counting and enumeration for large sparse bipartite graphs7
A design space for RDF data representations7
Cache-efficient sweeping-based interval joins for extended Allen relation predicates7
ProS: data series progressive k-NN similarity search and classification with probabilistic quality guarantees7
0.10585498809814