VLDB Journal

Papers
(The median citation count of VLDB Journal is 3. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-06-01 to 2026-06-01.)
ArticleCitations
Threshold queries in theory and in the wild451
An efficient and scalable graph database with built-in temporal support136
Beyond influence: voting theory for opinion maximization118
Generating highly customizable python code for data processing with large language models90
Efficiently Counting Four-Node Motifs in Large-Scale Temporal Graphs35
Transactional panorama: a conceptual framework for user perception in analytical visual interfaces (extended version)34
Optimizing navigational graph queries34
Missing Value Imputation in Tabular Data Lakes Unleashed: A Hybrid Approach29
Reverse spatial top-k keyword queries28
Efficient and robust active learning methods for interactive database exploration26
FOSS: A learned doctor for query optimization25
BioGITOM: Matching Biomedical Ontologies with Graph Isomorphism Transformer21
Third and Boyce–Codd normal form for property graphs20
Model reusability in Reinforcement Learning20
Hu-Fu: efficient and secure spatial queries over data federation20
On efficient 3D object retrieval19
Hyper-distance oracles in hypergraphs19
A new window Clause for SQL++17
PTSSP: privacy-preserving top-k spatial keyword similarity query with priority matching16
In-database query optimization on SQL with ML predicates16
Learned sketch for subgraph counting: a holistic approach16
Special issue on the best papers of DaMoN 202015
Fast subgraph query processing and subgraph matching via static and dynamic equivalences15
Efficient graph embedding at scale: optimizing CPU-GPU-SSD integration15
Discovering critical vertices for reinforcement of large-scale bipartite networks14
GPU-based butterfly counting14
SQUID: subtrajectory query in trillion-scale GPS database13
An update-intensive LSM-based R-tree index13
LEON+: towards robust ML-aided query optimization12
Efficient top-k spatial-range-constrained approximate nearest neighbor search on geo-tagged high-dimensional vectors12
On Querying Historical Connectivity in Large-scale Temporal Graphs11
The Status-Quo in nested data processing for high-energy physics11
Multi-constraint shortest path using forest hop labeling11
ByShard: sharding in a Byzantine environment10
DBSP: automatic incremental view maintenance for rich query languages10
DB-BERT: making database tuning tools “read” the manual10
P$$^2$$CG: a privacy preserving collaborative graph neural network training framework10
Correction to: BugDoc Iterative debugging and explanation of pipeline executions10
Efficient detection of multivariate correlations with different correlation measures9
LIST: learning to index spatio-textual data for embedding based spatial keyword queries9
Anytime bottom-up rule learning for large-scale knowledge graph completion9
Towards flexibility and robustness of LSM trees9
DIST: Efficient k-Clique Listing via Induced Subgraph Trie9
Efficient and effective algorithms for densest subgraph discovery and maintenance9
Privacy-Utility Balanced Cooperative Online Matching in Spatial Crowdsourcing9
Incremental discovery of denial constraints8
Generating adversarial SQL queries for evaluating cardinality estimators8
DumpyOS: A data-adaptive multi-ary index for scalable data series similarity search8
Accelerating directed densest subgraph queries with software and hardware approaches8
A graph pattern mining framework for large graphs on GPU8
Eris: efficiently measuring discord in multidimensional sources8
Efficient and scalable huge embedding model training via distributed cache management8
AutoML in heavily constrained applications7
Efficient discovery of co-movement patterns from video data7
Tiered-Indexing: Optimizing Access Methods for Skew7
A survey on deep learning approaches for text-to-SQL6
Special issue: modern hardware6
A survey on the evolution of stream processing systems6
Performant almost-latch-free data structures using epoch protection in more depth6
Assisted design of data science pipelines6
Survey of window types for aggregation in stream processing systems6
Efficient Algorithms for Uncertain Restricted Skyline Query Processing5
Lamba: A pretrained model for latency prediction over distributed databases5
How good are machine learning clouds? Benchmarking two snapshots over 5 years5
HPCache: memory-efficient OLAP through proportional caching revisited5
Editorial for Special Issue: VLDB 20225
Morphtree: a polymorphic main-memory learned index for dynamic workloads5
xDBTagger: explainable natural language interface to databases using keyword mappings and schema graph5
Tee-based key-value stores: a survey5
A multi-facet analysis of BERT-based entity matching models5
ICS-GNN$$^+$$: lightweight interactive community search via graph neural network5
Scalable decoupling graph neural network with feature-oriented optimization5
HINT: a hierarchical interval index for Allen relationships5
BatchHL$$^{+}$$: batch dynamic labelling for distance queries on large-scale networks5
MSAD: A deep dive into model selection for time series anomaly detection4
A generic framework for efficient computation of top-k diverse results4
FlexpushdownDB: rethinking computation pushdown for cloud OLAP DBMSs4
BQSched$$^{+}$$: A generalizable RL-based scheduler for varying batch concurrent queries4
VUS: effective and efficient accuracy measures for time-series anomaly detection4
Join optimization revisited: a novel DP algorithm for join&sort order selection4
Highly distributed and privacy-preserving queries on personal data management systems4
Netherite: efficient execution of serverless workflows4
Efficient Task Assignment for Multi-Workerset Crowdsourcing with Time and Expense Considerations4
Identifying similar-bicliques in bipartite graphs4
An Evaluation of B-tree Compression Techniques4
Time-topology analysis on temporal graphs4
Data collection and quality challenges in deep learning: a data-centric AI perspective4
Editorial: Special Issue for Selected Papers of VLDB 20214
A near-optimal approach to edge connectivity-based hierarchical graph decomposition4
Scalable lighting-fast temporal indexing4
C5: cloned concurrency control that always keeps up3
Leveraging user itinerary to improve personalized deep matching at Fliggy3
Finding Locally Densest Subgraphs: Convex Programming with Edge and Triangle Density3
Ingress: an automated incremental graph processing system3
Density decomposition on large static and dynamic graphs: algorithms and applications3
Raster interval object approximations for spatial intersection joins3
Enhancing domain-aware multi-truth data fusion using copy-based source authority and value similarity3
Temporal graph patterns by timed automata3
SWOOP: top-k similarity joins over set streams3
Similarity-driven and task-driven models for diversity of opinion in crowdsourcing markets3
Hypergraph motifs and their extensions beyond binary3
Table integration in data lakes unleashed: pairwise integrability judgment, integrable set discovery, and multi-tuple conflict resolution3
Efficient kNN query for moving objects on time-dependent road networks3
Cardinality estimation using normalizing flow3
A powerful reducing framework for accelerating set intersections over graphs3
Detecting rumours with latency guarantees using massive streaming data3
Efficient algorithms for reachability and path queries on temporal bipartite graphs3
Measuring approximate functional dependencies: a comparative study3
Butterfly counting and bitruss decomposition on uncertain bipartite graphs3
A systematic evaluation of machine learning on serverless infrastructure3
MinJoin++: a fast algorithm for string similarity joins under edit distance3
AutoCTS++: zero-shot joint neural architecture and hyperparameter search for correlated time series forecasting3
HERMES: data placement and schema optimization for enterprise knowledge bases3
Flexible grouping of linear segments for highly accurate lossy compression of time series data3
Efficient indexing and searching of constrained core in hypergraphs3
Reliability evaluation of individual predictions: a data-centric approach3
HMI: hierarchical knowledge management for efficient multi-tenant inference in pretrained language models3
Accelerating maximum biplex search over large bipartite graphs3
1.2678730487823