Proceedings of the Vldb Endowment

Papers
(The median citation count of Proceedings of the Vldb Endowment is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-02-01 to 2025-02-01.)
ArticleCitations
SHiFT263
An experimental evaluation and guideline for path finding in weighted dynamic network210
BICE: Exploring Compact Search Space by Using Bipartite Matching and Cell-Wide Verification87
XGNN: Boosting Multi-GPU GNN Training via Global GNN Memory Store72
DuckPGQ: Bringing SQL/PGQ to DuckDB71
Evolution of a compiling query engine58
LANNS58
PRUC55
Representing Paths in Graph Database Pattern Matching51
Towards a polyglot framework for factorized ML51
FILM48
Can Learned Models Replace Hash Functions?48
Semi-Oblivious Chase Termination for Linear Existential Rules: An Experimental Study47
Parallel training of knowledge graph embedding models45
High-Dimensional Data Cubes42
Learning to be a statistician42
A study of database performance sensitivity to experiment settings40
Flexible rule-based decomposition and metadata independence in modin38
Accelerating recommendation system training by leveraging popular choices38
An intermediate representation for hybrid database and machine learning workloads38
Errata for "Cerebro: a data system for optimized deep learning model selection"37
FCBench: Cross-Domain Benchmarking of Lossless Compression for Floating-Point Data37
Towards plug-and-play visual graph query interfaces37
Towards event prediction in temporal graphs37
AutoDI36
Towards Designing and Learning Piecewise Space-Filling Curves36
Two Birds With One Stone: Designing a Hybrid Cloud Storage Engine for HTAP35
DynaHB: A Communication-Avoiding Asynchronous Distributed Framework with Hybrid Batches for Dynamic GNN Training34
Agile-Ant: Self-Managing Distributed Cache Management for Cost Optimization of Big Data Applications34
Influential Community Search over Large Heterogeneous Information Networks33
LLM-PBE: Assessing Data Privacy in Large Language Models33
Texera: A System for Collaborative and Interactive Data Analytics Using Workflows33
Aleph Filter: To Infinity in Constant Time33
Blueprinting the Cloud: Unifying and Automatically Optimizing Cloud Data Infrastructures with BRAD33
Spooky32
MITra: A Framework for Multi-Instance Graph Traversal32
Discovering related data at scale32
Opportunities for Quantum Acceleration of Databases: Optimization of Queries and Transaction Schedules31
Efficient maximum k -plex computation over large sparse graphs31
A Comparative Study and Component Analysis of Query Plan Representation Techniques in ML4DB Studies31
Exathlon31
ZIP: Lazy Imputation during Query Processing31
Parallel Colorful h -Star Core Maintenance in Dynamic Graphs31
DeepJoin: Joinable Table Discovery with Pre-Trained Language Models31
Algorithmic Complexity Attacks on Dynamic Learned Indexes30
PerfGuard30
Reliable community search in dynamic networks30
Fries30
Cloud Analytics Benchmark29
Towards distribution-aware query answering in data markets29
Towards General and Efficient Online Tuning for Spark29
Symmetric continuous subgraph matching with bidirectional dynamic programming29
Catch a blowfish alive28
Frost28
IsoBugView28
Demonstration of accelerating machine learning inference queries with correlative proxy models28
Watermarks in stream processing systems28
ReMac28
Demo of marius28
Sigma workbook27
Database technology for the masses27
Anser: Adaptive Information Sharing Framework of AnalyticDB27
Pipemizer27
Data and AI Model Markets: Opportunities for Data and Model Sharing, Discovery, and Integration27
Keep CALM and CRDT On26
WebMILE26
Hu-fu25
The LDBC Social Network Benchmark25
ZeroEA: A Zero-Training Entity Alignment Framework via Pre-Trained Language Model25
Density Personalized Group Query25
Hu-Fu25
The FastLanes Compression Layout: Decoding > 100 Billion Integers per Second with Scalar Code25
On repairing timestamps for regular interval time series25
LES 325
OmniSketch: Efficient Multi-Dimensional High-Velocity Stream Analytics with Arbitrary Predicates24
No Repetition24
Efficient Regular Simple Path Queries under Transitive Restricted Expressions24
Spatial and temporal constrained ranked retrieval over videos24
ADF & TransApp: A Transformer-Based Framework for Appliance Detection Using Smart Meter Consumption Series24
The power of summarization in graph mining and learning24
A Blockchain System for Clustered Federated Learning with Peer-to-Peer Knowledge Transfer23
Timestamp as a Service, Not an Oracle23
Low-latency compilation of SQL queries to machine code23
Data-Driven Insight Synthesis for Multi-Dimensional Data23
ADOPT: Adaptively Optimizing Attribute Orders for Worst-Case Optimal Join Algorithms via Reinforcement Learning23
REmatch: A Novel Regex Engine for Finding All Matches22
DILI: A Distribution-Driven Learned Index22
Transactional Panorama: A Conceptual Framework for User Perception in Analytical Visual Interfaces22
VeriBench: Analyzing the Performance of Database Systems with Verifiability22
Hippo22
Triangular Stability Maximization by Influence Spread over Social Networks21
Pantheon21
Demonstration of panda21
Effective and Efficient Route Planning Using Historical Trajectories on Road Networks21
In-network support for transaction triaging21
Generalized supervised meta-blocking20
Algorithm and system co-design for efficient subgraph-based graph representation learning20
Motiflets20
DARLING20
Fast detection of denial constraint violations20
Effective indexing for dynamic structural graph clustering20
RPT19
EPICGen19
Approximating probabilistic group steiner trees in graphs19
GeCo19
ParaX19
Towards Efficient Index Construction and Approximate Nearest Neighbor Search in High-Dimensional Spaces19
SAND in action19
QO-Insight: Inspecting Steered Query Optimizers19
Demonstrating ADOPT: Adaptively Optimizing Attribute Orders for Worst-Case Optimal Joins via Reinforcement Learning19
DBOS18
Analysis of influence contribution in social advertising18
LightDiC: A Simple Yet Effective Approach for Large-Scale Digraph Representation Learning18
The end of Moore's law and the rise of the data processor18
Mixed Covers of Keys and Functional Dependencies for Maintaining the Integrity of Data under Updates18
ShadowAQP: Efficient Approximate Group-by and Join Query via Attribute-Oriented Sample Size Allocation and Data Generation18
POLAR: Adaptive and Non-invasive Join Order Selection via Plans of Least Resistance18
Managing ML pipelines18
SmartLite: A DBMS-Based Serving System for DNN Inference in Resource-Constrained Environments18
Flexible Resource Allocation for Relational Database-as-a-Service18
How Do Categorical Duplicates Affect ML? A New Benchmark and Empirical Analyses18
The Composable Data Management System Manifesto18
Achieving high throughput and elasticity in a larger-than-memory store18
Finding locally densest subgraphs17
Performance-Based Pricing for Federated Learning via Auction17
In-network leaderless replication for distributed data stores17
BP-Tree: Overcoming the Point-Range Operation Tradeoff for In-Memory B-Trees17
Valentine in action17
Lindorm TSDB: A Cloud-Native Time-Series Database for Large-Scale Monitoring Systems17
BigST: Linear Complexity Spatio-Temporal Graph Neural Network for Traffic Forecasting on Large-Scale Road Networks17
Efficient Framework for Operating on Data Sketches16
AnyOLAP16
FedTSC16
SpaceSaving ±16
gCore: Exploring Cross-Layer Cohesiveness in Multi-Layer Graphs16
WebArrayDB16
Detecting Metadata-Related Logic Bugs in Database Systems via Raw Database Construction16
Epistemic Parity: Reproducibility as an Evaluation Metric for Differential Privacy16
Catalyst: Optimizing Cache Management for Large In-memory Key-value Systems16
The art of balance16
SDPipe: A Semi-Decentralized Framework for Heterogeneity-Aware Pipeline-parallel Training15
BASE: Bridging the Gap between Cost and Latency for Query Optimization15
Longshot: Indexing Growing Databases Using MPC and Differential Privacy15
Efficient Influence Minimization via Node Blocking15
In-page shadowing and two-version timestamp ordering for mobile DBMSs15
LANCET15
Quasi-Stable Coloring for Graph Compression15
ZKSQL: Verifiable and Efficient Query Evaluation with Zero-Knowledge Proofs15
BYO: A Unified Framework for Benchmarking Large-Scale Graph Containers15
Solver-In-The-Loop Cluster Resource Management for Database-as-a-Service15
Efficient Distributed Transaction Processing in Heterogeneous Networks15
G-tran15
Optimal Matrix Sketching over Sliding Windows15
DET-LSH: A Locality-Sensitive Hashing Scheme with Dynamic Encoding Tree for Approximate Nearest Neighbor Search15
Wikinegata14
Accelerating large scale real-time GNN inference using channel pruning14
DyHealth14
KG-Roar: Interactive Datalog-Based Reasoning on Virtual Knowledge Graphs14
Out-of-Order Sliding-Window Aggregation with Efficient Bulk Evictions and Insertions14
How divergent is your data?14
Netherite14
DuckDB-wasm14
Hazelcast jet14
Weakly Guided Adaptation for Robust Time Series Forecasting14
Online Ridesharing with Meeting Points14
Efficient secure and verifiable location-based skyline queries over encrypted data14
Procedural extensions of SQL14
Projection-compliant database generation14
SAND13
Spatial Query Optimization With Learning13
FusionQuery: On-demand Fusion Queries over Multi-source Heterogeneous Data13
HMAB13
Billion-Scale Bipartite Graph Embedding: A Global-Local Induced Approach13
Don't be a tattle-tale13
Marigold: Efficientk-Means Clustering in High Dimensions13
GaussDB: A Cloud-Native Multi-Primary Database with Compute-Memory-Storage Disaggregation13
MLOS in Action: Bridging the Gap Between Experimentation and Auto-Tuning in the Cloud13
CGgraph: An Ultra-Fast Graph Processing System on Modern Commodity CPU-GPU Co-processor13
Kamino13
Towards communication-efficient vertical federated learning training via cache-enabled local updates13
MDTP13
Are we ready for learned cardinality estimation?13
QPJVis Demo: Quality-Boost Progressive Join Query Processing System13
CatSQL : Towards Real World Natural Language to SQL Applications13
Differentially Private Stream Processing at Scale13
Communication Efficient and Provable Federated Unlearning13
Accelerating Similarity Search for Elastic Measures: A Study and New Generalization of Lower Bounding Distances13
Galvatron13
RetClean: Retrieval-Based Data Cleaning Using LLMs and Data Lakes12
APEX12
COMET12
Efficient k -Clique Count Estimation with Accuracy Guarantee12
Scabbard12
Neighborhood-Based Hypergraph Core Decomposition12
ModsNet: Performance-Aware Top- k Model Search Using Exemplar Datasets12
LLM for Data Management12
Transparent Migration from Datastore to Firestore12
OptScaler: A Collaborative Framework for Robust Autoscaling in the Cloud12
High-Performance Spatial Data Analytics: Systematic R&D for Scale-Out and Scale-Up Solutions from the Past to Now12
Spectrum: Speedy and Strictly-Deterministic Smart Contract Transactions for Blockchain Ledgers12
Napa12
A queueing-theoretic framework for vehicle dispatching in dynamic car-hailing12
UPLIFT12
Intelligent Agents for Data Exploration12
Apache TsFile: An IoT-Native Time Series File Format12
Demonstration of the VeriEQL Equivalence Checker for Complex SQL Queries12
A Reproducible Tutorial on Reproducibility in Database Systems Research12
Finding group Steiner trees in graphs with both vertex and edge weights12
Efficient Maximal Frequent Group Enumeration in Temporal Bipartite Graphs12
TFB: Towards Comprehensive and Fair Benchmarking of Time Series Forecasting Methods12
From BERT to GPT-3 codex12
Spade: A Real-Time Fraud Detection Framework12
Mach: Firefighting Time-Critical Issues in Complex Systems Using High-Frequency Telemetry12
TDSQL: Tencent Distributed Database System12
Distributed Shortest Distance Labeling on Large-Scale Graphs11
Breathing New Life into an Old Tree: Resolving Logging Dilemma of B + -tree on Modern Computational Storage Drives11
Secure Shapley Value for Cross-Silo Federated Learning11
MagicScaler: Uncertainty-Aware, Predictive Autoscaling11
Towards scalable online machine learning collaborations with OpenML11
Interactive demonstration of SQLCheck11
Trajectory Similarity Measurement: An Efficiency Perspective11
Tair-PMem11
Dual-objective fine-tuning of BERT for entity matching11
GEDet11
ADESIT11
DatAgent11
Language-agnostic integrated queries in a managed polyglot runtime11
Viper11
PetPS: Supporting Huge Embedding Models with Persistent Memory11
On Efficient Approximate Queries over Machine Learning Models11
Autonomously Computable Information Extraction11
CDI-E11
Mixer10
CBench10
Robustness against read committed for transaction templates10
Sage10
Auto-pipeline10
Waffle10
ThunderRW10
ByteGraph10
CIVET: Exploring Compact Index for Variable-Length Subsequence Matching on Time Series10
Modern techniques for querying graph-structured relations10
DADER10
Analyzing how BERT performs entity matching10
SKT10
Automatic data acquisition for deep learning10
DORIAN in action10
A Tutorial on Visual Representations of Relational Queries10
Excalibur10
JoinBoost: Grow Trees over Normalized Data Using Only SQL10
0.059651851654053