Proceedings of the Vldb Endowment

Papers
(The TQCC of Proceedings of the Vldb Endowment is 8. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-11-01 to 2024-11-01.)
ArticleCitations
TranAD240
Anomaly detection in time series162
Decoupled dynamic spatial-temporal graph neural network for traffic forecasting87
A comprehensive survey and experimental comparison of graph-based approximate nearest neighbor search72
SlimChain66
TURL55
Are we ready for learned cardinality estimation?54
Viper52
TSB-UAD51
SAND51
Updatable learned index with precise positions51
openGauss48
Analyzing and mitigating data stalls in DNN training46
An inquiry into machine learning-based automatic configuration tuning services on real-world database management systems45
FACE42
Can Foundation Models Wrangle Your Data?41
Learned cardinality estimation40
ByteGNN40
ICS-GNN39
Deep learning for blocking in entity matching38
ByShard38
TRACE38
Fauce37
FLAT37
Exathlon37
Cardinality estimation in DBMS37
Dual-objective fine-tuning of BERT for entity matching36
Understanding the idiosyncrasies of real persistent memory36
FINEdex35
Hierarchical core maintenance on large dynamic graphs34
CGPTuner34
Auctus34
Volume under the surface33
GeCo33
tf.data33
Dealer33
Large graph convolutional network training with GPU-oriented data communication architecture33
HET32
RPT32
CodexDB31
OceanBase31
AutoCTS31
Missing value imputation on multidimensional time series31
FederatedScope: A Flexible Federated Learning Platform for Heterogeneity30
A learned query rewrite system using Monte Carlo tree search30
TGL30
Constructing and analyzing the LSM compaction design space30
Selective data acquisition in the wild for model charging29
Flow-loss29
QueryFormer29
Building enclave-native storage engines for practical encrypted databases29
GraphScope29
Multi-modal transportation recommendation with unified route representation learning29
Hybrid blockchain database systems28
Efficient size-bounded community search over large networks28
Data management in microservices28
Efficiently answering reachability and path queries on temporal bipartite graphs28
Randomized error removal for online spread estimation in data streaming28
Frequency estimation under local differential privacy28
Sancus28
Query driven-graph neural networks for community search28
SQLite28
PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel27
Cost-Based or Learning-Based?27
FREDE27
Data synthesis via differentially private markov random fields27
MDTP27
APEX26
Real-world trajectory sharing with local differential privacy26
Elle26
Optimizing inference serving on serverless platforms25
Efficient maximal biclique enumeration for large sparse bipartite graphs25
The art of balance25
METRO25
GRAIN25
Facilitating database tuning with hyper-parameter optimization25
FlexPushdownDB25
UDO24
Projected federated averaging with heterogeneous differential privacy24
Lux24
Epoch-based commit and replication in distributed OLTP databases24
Towards cost-effective and elastic cloud database deployment via memory disaggregation24
Hu-Fu24
Unsupervised time series outlier detection with diversity-driven convolutional ensembles23
Moneyball23
Stingy sketch23
Efficient bi-triangle counting for large bipartite networks23
Data acquisition for improving machine learning models23
NeuChain23
Butterfly-core community search over labeled graphs22
Persistent memory hash indexes22
Decomposed bounded floats for fast compression and queries22
Efficient label-constrained shortest path queries on road networks22
Answering multi-dimensional range queries under local differential privacy22
Accelerating large scale real-time GNN inference using channel pruning22
RONIN21
Netherite21
Chimp21
NBTree21
Robust Query Driven Cardinality Estimation under Changing Workloads21
Distributed hop-constrained s-t simple path enumeration at billion scale21
Revisiting the design of LSM-tree Based OLTP storage engine with persistent memory21
Integrating Data Lake Tables20
Massively parallel algorithms for personalized pagerank20
ConnectIt20
From natural language processing to neural databases20
CGM20
Watermarks in stream processing systems20
Machine learning for databases20
Capturing and querying fine-grained provenance of preprocessing pipelines in data science20
Refiner19
Velox19
Butterfly counting on uncertain bipartite graphs19
Tailoring data source distributions for fairness-aware data integration19
Manu19
Are updatable learned indexes ready?19
Make your database system dream of electric sheep19
Distributed deep learning on data systems19
Accelerating recommendation system training by leveraging popular choices19
DSB18
KDV-explorer18
Fine-grained lineage for safer notebook interactions18
(p,q)-biclique counting and enumeration for large sparse bipartite graphs18
xFraud18
VolcanoML18
Semantics-Aware Dataset Discovery from Data Lakes with Contextualized Column-Based Representation Learning18
Horizon18
PLIN18
LlamaTune18
Efficient join algorithms for large database tables in a multi-GPU environment18
Zero-shot cost models for out-of-the-box learned cost prediction18
Nearest neighbor classifiers over incomplete information18
How Large Language Models Will Disrupt Data Management17
Towards instance-optimized data systems17
An experimental evaluation and guideline for path finding in weighted dynamic network17
Lero: A Learning-to-Rank Query Optimizer17
Teseo and the analysis of structural dynamic graphs17
Automated feature engineering for algorithmic fairness17
Federated matrix factorization with privacy guarantee17
Galvatron17
DeepTEA16
Bagua16
GriDB: Scaling Blockchain Database via Sharding and Off-Chain Cross-Shard Mechanism16
On analyzing graphs with motif-paths16
Zen16
SChain16
Stacked filters16
ThunderRW16
On querying historical k-cores16
The LDBC Social Network Benchmark16
Astrid16
New trends in high-D vector similarity search16
CALYPSO16
Learned Index: A Comprehensive Experimental Evaluation16
From BERT to GPT-3 codex16
Influence maximization in real-world closed social networks15
Learned Index Benefits15
Towards crowd-aware indoor path planning15
The case for NLP-enhanced database tuning15
Effective community search over large star-schema heterogeneous information networks15
PR-sketch15
Babelfish15
Spooky15
Efficient and effective data imputation with influence functions15
DLCR15
Local algorithms for distance-generalized core decomposition over large dynamic graphs15
Crystal15
Distributed learning of fully connected neural networks using independent subnet training15
DBMind14
Cosine14
Scalable mining of maximal quasi-cliques14
LOGER: A Learned Optimizer Towards Generating Efficient and Robust Query Execution Plans14
Diversified top- k route planning in road network14
Auto-pipeline14
Finding group Steiner trees in graphs with both vertex and edge weights14
Time series data encoding for efficient storage14
Managing ML pipelines14
Tensor relational algebra for distributed machine learning system design14
Comprehensive and efficient workload compression14
Efficient streaming subgraph isomorphism with graph neural networks14
COMET14
SNARF14
Symmetric continuous subgraph matching with bidirectional dynamic programming14
Tensors14
A demonstration of KGLac13
HVS13
Ginex13
The case for distributed shared-memory databases with RDMA-enabled memory disaggregation13
Redy13
Adaptive data augmentation for supervised learning over missing data13
Optimizing in-memory database engine for AI-powered on-line decision augmentation using persistent memory13
Hazelcast jet13
CARMI13
PromptEM13
TreeLine13
Hardware acceleration of compression and encryption in SAP HANA13
Shortest paths and centrality in uncertain networks13
OpBoost13
KLL ± approximate quantile sketches over dynamic datasets13
Parallel discrepancy detection and incremental detection13
FastFlow: Accelerating Deep Learning Model Training with Smart Offloading of Input Data Pipeline13
LargeEA13
Ananke13
NFL13
RapidFlow13
Kamino13
Designing an open framework for query optimization and compilation12
Scaling replicated state machines with compartmentalization12
Locater12
YeSQL12
A critical re-evaluation of neural methods for entity alignment12
SetSketch12
Automating incremental graph processing with flexible memoization12
Automatic data acquisition for deep learning12
Reliable community search in dynamic networks12
Mainlining databases12
Optimizing Video Analytics with Declarative Model Relationships12
Deep indexed active learning for matching heterogeneous entity representations12
The evolution of Amazon redshift12
Orchestrating data placement and query execution in heterogeneous CPU-GPU DBMS12
Operon12
LDPTrace: Locally Differentially Private Trajectory Synthesis12
Analyzing how BERT performs entity matching12
Parallel training of knowledge graph embedding models12
MT-teql12
ByteHTAP12
Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes12
Towards cost-optimal query processing in the cloud12
Accelerating exact constrained shortest paths on GPUs12
Zebra: When Temporal Graph Neural Networks Meet Temporal Personalized PageRank12
A neural database for differentially private spatial range queries12
Extraction of Validating Shapes from Very Large Knowledge Graphs11
Discovering association rules from big graphs11
Towards communication-efficient vertical federated learning training via cache-enabled local updates11
Influential Community Search over Large Heterogeneous Information Networks11
An in-depth study of continuous subgraph matching11
Query processing on tensor computation runtimes11
DICE11
Software-defined data protection11
TimeEval11
Efficient secure and verifiable location-based skyline queries over encrypted data11
A critical analysis of recursive model indexes11
Collective influence maximization for multiple competing products with an awareness-to-influence model11
QARTA11
DINOMO11
Scabbard11
CoroBase11
DISTILL11
SWS11
Near-data processing in database systems on native computational storage under HTAP workloads11
0.080583095550537