Proceedings of the Vldb Endowment

Papers
(The TQCC of Proceedings of the Vldb Endowment is 8. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-02-01 to 2024-02-01.)
ArticleCitations
PyTorch distributed131
TranAD119
Deep entity matching with pre-trained language models117
Cloudburst109
TiDB108
The PGM-index98
A benchmarking study of embedding-based entity alignment for knowledge graphs86
Anomaly detection in time series78
DeepDB76
Dash74
LB+Trees71
Effective and efficient community search over large heterogeneous information networks71
Privacy preserving vertical federated learning for tree-based models70
Delta lake67
NeuroCard61
Data market platforms57
Benchmarking learned indexes53
uTree49
ResilientDB49
Maximum biclique search at billion scale48
MyRocks45
LedgerDB44
Sato44
Diagnosing root causes of intermittent slow queries in cloud databases44
Responsible data management44
Series2Graph43
Data collection and quality challenges for deep learning43
A comprehensive survey and experimental comparison of graph-based approximate nearest neighbor search43
Effectively learning spatial indices42
Query performance prediction for concurrent queries using graph embedding42
Apache IoTDB41
Analyzing and mitigating data stalls in DNN training40
Tsunami39
Viper39
Magic mirror in my hand, which is the best in the land?39
Pangolin38
Are we ready for learned cardinality estimation?38
SlimChain38
openGauss37
SAND37
Decoupled dynamic spatial-temporal graph neural network for traffic forecasting37
Towards scalable dataframe systems37
Answering billion-scale label-constrained reachability queries within microsecond37
TURL36
Atomic commitment across blockchains36
Cerebro35
AGL35
Aria34
Single machine graph analytics on massive datasets using Intel optane DC persistent memory34
Natural language to SQL34
An inquiry into machine learning-based automatic configuration tuning services on real-world database management systems34
Traversing large graphs on GPUs with unified memory33
Hypergraph motifs32
Compression of uncertain trajectories in road networks32
Incrementalization of graph partitioning algorithms32
TSB-UAD31
Baran31
ATHENA++30
Updatable learned index with precise positions30
Quantifying TPC-H choke points and their optimizations30
Efficient algorithms for budgeted influence maximization on massive social networks30
ARDA29
Demand-aware route planning for shared mobility services29
Rank aggregation algorithms for fair consensus28
ByShard28
Understanding the idiosyncrasies of real persistent memory28
Fair task assignment in spatial crowdsourcing28
FACE27
Leaper26
TRACE26
Deep learning for blocking in entity matching25
tf.data25
HET25
LiveGraph25
Learned cardinality estimation25
SPORES24
Multi-modal transportation recommendation with unified route representation learning24
Identifying insufficient data coverage in databases with multiple relations24
Exathlon24
ICS-GNN24
GeCo24
Dremel23
An analysis of concurrency control protocols for in-memory databases with CCBench23
Dual-objective fine-tuning of BERT for entity matching23
Dealer23
Auctus23
CGPTuner23
KClist++23
Randomized error removal for online spread estimation in data streaming23
VHP23
Hierarchical core maintenance on large dynamic graphs23
FREDE22
AnalyticDB-V22
A learned query rewrite system using Monte Carlo tree search22
RapidMatch22
Fauce22
Accelerating truss decomposition on heterogeneous processors22
Building enclave-native storage engines for practical encrypted databases22
Large graph convolutional network training with GPU-oriented data communication architecture22
KBPearl22
G 322
Efficient size-bounded community search over large networks21
Adopting worst-case optimal joins in relational database systems21
RPT21
Missing value imputation on multidimensional time series21
Data management in microservices21
F1 lightning21
CodexDB21
Real-time distance-based outlier detection in data streams21
DeepTRANS21
Improving reproducibility of data science pipelines through transparent provenance capture20
Anytime stochastic routing with hybrid learning20
Scrutinizer20
SAQE20
Understanding the effect of data center resource disaggregation on production DBMSs20
Efficiently approximating selectivity functions using low overhead regression models20
On-off sketch20
Oracle AutoML20
Automated generation of materialized views in Oracle20
Understanding and benchmarking the impact of GDPR on database systems20
Efficient bi-triangle counting for large bipartite networks19
Relational data synthesis using generative adversarial networks19
Flow-loss19
ByteGNN19
Frequency estimation under local differential privacy19
GraphAn19
ADnEV19
Jointly optimizing preprocessing and inference for DNN-based visual analytics19
Constructing and analyzing the LSM compaction design space18
Real-world trajectory sharing with local differential privacy18
FlexPushdownDB18
Persistent memory hash indexes18
Hu-Fu18
TGL18
Spitz18
Distributed hop-constrained s-t simple path enumeration at billion scale18
MDTP18
Hybrid blockchain database systems18
Lux18
EMOGI18
Effective and efficient relational community detection and search in large dynamic heterogeneous information networks18
Fairly evaluating and scoring items in a data set18
Efficiently answering reachability and path queries on temporal bipartite graphs17
Selective data acquisition in the wild for model charging17
POLARIS17
FLAT17
The computation of optimal subset repairs17
The simpler the better17
Pricing influential nodes in online social networks17
DeepTrack: Monitoring and exploring spatio-temporal data17
RONIN17
Machine learning for databases16
APEX16
The art of balance16
Efficient oblivious database joins16
Building high throughput permissioned blockchain fabrics16
From natural language processing to neural databases16
Capturing associations in graphs16
Scaling attributed network embedding to massive graphs16
AutoCTS16
GraphScope16
Efficient join algorithms for large database tables in a multi-GPU environment16
Epoch-based commit and replication in distributed OLTP databases16
Efficient and effective similar subtrajectory search with deep reinforcement learning16
UDO16
Fine-grained lineage for safer notebook interactions16
Ordering heuristics for k -clique listing15
Data synthesis via differentially private markov random fields15
Volume under the surface15
Towards cost-effective and elastic cloud database deployment via memory disaggregation15
On the efficiency of K-means clustering15
Watermarks in stream processing systems15
Data-driven domain discovery for structured datasets15
Elle15
Topic-based community search over spatial-social networks15
VolcanoML15
Stable learned bloom filters for data streams15
FSST15
xFraud15
Efficient maximal biclique enumeration for large sparse bipartite graphs15
Capturing and querying fine-grained provenance of preprocessing pipelines in data science15
Automated feature engineering for algorithmic fairness14
NeuChain14
Massively parallel algorithms for personalized pagerank14
Distributed subgraph counting14
On detecting cherry-picked trendlines14
Answering multi-dimensional range queries under local differential privacy14
Butterfly-core community search over labeled graphs14
Zen14
Data migration using datalog program synthesis14
KDV-explorer14
Revisiting the design of LSM-tree Based OLTP storage engine with persistent memory14
Efficient label-constrained shortest path queries on road networks14
Data-parallel query processing on non-uniform data13
Unsupervised time series outlier detection with diversity-driven convolutional ensembles13
Cardinality estimation in DBMS13
CGM13
Data acquisition for improving machine learning models13
Towards crowd-aware indoor path planning13
Accelerating large scale real-time GNN inference using channel pruning13
Obi-Wan13
Stingy sketch13
New trends in high-D vector similarity search13
Query driven-graph neural networks for community search13
Conversational BI13
Tensors13
Approximate denial constraints13
ConnectIt13
Enabling low tail latency on multicore key-value stores13
Cost-Based or Learning-Based?13
Refiner13
Sancus13
Tensor relational algebra for distributed machine learning system design13
Realtime index-free single source SimRank processing on web-scale graphs13
Distributed deep learning on data systems13
(p,q)-biclique counting and enumeration for large sparse bipartite graphs13
Accelerating recommendation system training by leveraging popular choices13
DeepTEA13
SChain13
Tailoring data source distributions for fairness-aware data integration12
CALYPSO12
OceanBase12
PIDS12
AutoToken12
Towards instance-optimized data systems12
ThunderRW12
Demand-based sensor data gathering with multi-query optimization12
Fast subtrajectory similarity search in road networks under weighted edit distance constraints12
HydraList12
Projected federated averaging with heterogeneous differential privacy12
Shortest paths and centrality in uncertain networks12
Can Foundation Models Wrangle Your Data?12
Seagull12
Moneyball12
Sage12
On analyzing graphs with motif-paths12
Finding large diverse communities on networks12
Guided exploration of user groups12
Pytheas12
Decomposed bounded floats for fast compression and queries12
MorphStore12
Evaluating memory-hard proof-of-work algorithms on three processors12
NBTree12
Crystal12
Micro-architectural analysis of OLAP12
The relational data borg is learning12
Nearest neighbor classifiers over incomplete information12
Netherite12
Astrid12
Helios11
0.043928861618042