ACM SIGMOD Conference 2016: San Francisco, CA, USA
- Fatma Özcan, Georgia Koutrika, Sam Madden:
Proceedings of the 2016 International Conference on Management of Data, SIGMOD Conference 2016, San Francisco, CA, USA, June 26 - July 01, 2016. ACM 2016, ISBN 978-1-4503-3531-7
Keynote - Jeff Dean
- Jeff Dean:
Building Machine Learning Systems that Understand. 1
Session 1 - Scalable Analytics and Machine Learning
- Maximilian Schleich, Dan Olteanu, Radu Ciucanu:
Learning Linear Regression Models over Factorized Joins. 3-18
- Arun Kumar, Jeffrey F. Naughton, Jignesh M. Patel, Xiaojin Zhu:
To Join or Not to Join?: Thinking Twice about Joins before Feature Selection. 19-34
- Yanxiang Huang, Bin Cui, Jie Jiang, Kunqian Hong, Wenyu Zhang, Yiran Xie:
Real-time Video Recommendation Exploration. 35-46
- Akash Das Sarma, Aditya G. Parameswaran, Jennifer Widom:
Towards Globally Optimal Crowdsourcing Quality Management: The Uniform Worker Setting. 47-62
- Jeff LeFevre, Rui Liu, Cornelio Inigo, Lupita Paz, Edward Ma, Malú Castellanos, Meichun Hsu:
Building the Enterprise Fabric for Big Data with Vertica and Spark Integration. 63-75
- Xin Huang, Wei Lu, Laks V. S. Lakshmanan:
Truss Decomposition of Probabilistic Graphs: Semantics and Algorithms. 77-90
- Rong-Hua Li, Lu Qin, Jeffrey Xu Yu, Rui Mao:
Efficient and Progressive Group Steiner Tree Search. 91-106
Session 2 - Privacy and Security
- Zach Jorgensen, Ting Yu, Graham Cormode:
Publishing Attributed Social Graphs with Formal Privacy Guarantees. 107-122
- Wei-Yen Day, Ninghui Li, Min Lyu:
Publishing Graph Degree Distribution with Node Differential Privacy. 123-138
- Michael Hay, Ashwin Machanavajjhala, Gerome Miklau, Yan Chen, Dan Zhang:
Principled Evaluation of Differentially Private Algorithms using DPBench. 139-154
- Jun Zhang, Xiaokui Xiao, Xing Xie:
PrivTree: A Differentially Private Algorithm for Hierarchical Decompositions. 155-170
- Panagiotis Karras, Artyom Nikitin, Muhammad Saad, Rudrika Bhatt, Denis Antyukhov, Stratos Idreos:
Adaptive Indexing over Encrypted Numeric Data. 171-183
- Ioannis Demertzis, Stavros Papadopoulos, Odysseas Papapetrou, Antonios Deligiannakis, Minos N. Garofalakis:
Practical Private Range Search Revisited. 185-198
- Zhao Chang, Lei Zou, Feifei Li:
Privacy Preserving Subgraph Matching on Large Graphs in Cloud. 199-213
Session 3 - Logical and Physical Database Design
- Benoît Dageville, Thierry Cruanes, Marcin Zukowski, Vadim Antonov, Artin Avanes, Jon Bock, Jonathan Claybaugh, Daniel Engovatov, Martin Hentschel, Jiansheng Huang, Allison W. Lee, Ashish Motivala, Abdul Q. Munir, Steven Pelley, Peter Povinec, Greg Rahn, Spyridon Triantafyllis, Philipp Unterbrunner:
The Snowflake Elastic Data Warehouse. 215-226
- Zhen Hua Liu, Beda Christoph Hammerschmidt, Douglas Mcmahon, Ying Liu, Hui Joe Chang:
Closing the functional and Performance Gap between SQL and NoSQL. 227-238
- Dipti Borkar, Ravi Mayuram, Gerald Sangudi, Michael J. Carey:
Have Your Data and Query It Too: From Key-Value Caching to Big Data Management. 239-251
- Shadi A. Noghabi, Sriram Subramanian, Priyesh Narayanan, Sivabalan Narayanan, Gopalakrishna Holla, Mammad Zadeh, Tianwei Li, Indranil Gupta, Roy H. Campbell:
Ambry: LinkedIn's Scalable Geo-Distributed Object Store. 253-265
- Henning Köhler, Sebastian Link:
SQL Schema Design: Foundations, Normal Forms, and Normalization. 267-279
- Shrainik Jain, Dominik Moritz, Daniel Halperin, Bill Howe, Ed Lazowska:
SQLShare: Results from a Multi-Year SQL-as-a-Service Experiment. 281-293
- Michael DiScala, Daniel J. Abadi:
Automatic Generation of Normalized Relational Schemas from Nested Key-Value Data. 295-310
Session 4 - New Storage and Network Architectures
- Harald Lang, Tobias Mühlbauer, Florian Funke, Peter A. Boncz, Thomas Neumann, Alfons Kemper:
Data Blocks: Hybrid OLTP and OLAP on Compressed Storage using both Vectorization and Compilation. 311-326
- Niv Dayan, Philippe Bonnet, Stratos Idreos:
GeckoFTL: Scalable Flash Translation Techniques For Very Large Flash Devices. 327-342
- Gihwan Oh, Chiyoung Seo, Ravi Mayuram, Yang-Suk Kee, Sang-Won Lee:
SHARE Interface in Flash Storage for Relational and NoSQL Databases. 343-354
- Feng Li, Sudipto Das, Manoj Syamala, Vivek R. Narasayya:
Accelerating Relational Databases by Leveraging Remote Memory and RDMA. 355-370
- Ismail Oukid, Johan Lasperas, Anisoara Nica, Thomas Willhalm, Wolfgang Lehner:
FPTree: A Hybrid SCM-DRAM Persistent and Concurrent B-Tree for Storage Class Memory. 371-386
- Utku Sirin, Pinar Tözün, Danica Porobic, Anastasia Ailamaki:
Micro-architectural Analysis of In-memory OLTP. 387-402
Session 5 - Graphs 1: Infrastructure and Processing on Modern Hardware
- Hang Liu, H. Howie Huang, Yang Hu:
iBFS: Concurrent Breadth-First Search on GPUs. 403-416
- Xiaogang Shi, Bin Cui, Yingxia Shao, Yunhai Tong:
Tornado: A System For Real-Time Iterative Analysis Over Evolving Data. 417-430
- Christopher R. Aberger, Susan Tu, Kunle Olukotun, Christopher Ré:
EmptyHeaded: A Relational Engine for Graph Processing. 431-446
- Min-Soo Kim, Kyuhyeon An, Himchan Park, Hyunseok Seo, Jinwook Kim:
GTS: A Fast and Scalable Graph Processing Method based on Streaming Topology to GPUs. 447-461
- Zechao Shang, Feifei Li, Jeffrey Xu Yu, Zhiwei Zhang, Hong Cheng:
Graph Analytics Through Fine-Grained Parallelism. 463-478
- Zhigang Wang, Yu Gu, Yubin Bao, Ge Yu, Jeffrey Xu Yu:
Hybrid Pulling/Pushing for I/O-Efficient Distributed and Iterative Graph Computing. 479-494
Session 6 - Streaming 1: Systems and Outlier Detection
- Medhabi Ray, Chuan Lei, Elke A. Rundensteiner:
Scalable Pattern Sharing on Event Streams. 495-510
- Milos Nikolic, Mohammad Dashti, Christoph Koch:
How to Win a Hot Dog Eating Contest: Distributed Incremental View Maintenance with Batch Updates. 511-526
- Lei Cao, Jiayuan Wang, Elke A. Rundensteiner:
Sharing-Aware Outlier Analytics over High-Volume Data Streams. 527-540
- Evangelia Kalyvianaki, Marco Fiscato, Theodoros Salonidis, Peter R. Pietzuch:
THEMIS: Fairness in Federated Stream Processing under Overload. 541-553
- Alexandros Koliousis, Matthias Weidlich, Raul Castro Fernandez, Alexander L. Wolf, Paolo Costa, Peter R. Pietzuch:
SABER: Window-Based Hybrid Stream Processing for Heterogeneous Architectures. 555-569
- Miao Qiao, Junhao Gan, Yufei Tao:
Range Thresholding on Streams. 571-582
Session 7 - Approximate Query Processing
- Joy Arulraj, Andrew Pavlo, Prashanth Menon:
Bridging the Archipelago between Row-Stores and Column-Stores for Hybrid Workloads. 583-598
- Yang Cao, Wenfei Fan:
An Effective Syntax for Bounded Relational Queries. 599-614
- Feifei Li, Bin Wu, Ke Yi, Zhuoyue Zhao:
Wander Join: Online Aggregation via Random Walks. 615-629
- Srikanth Kandula, Anil Shanbhag, Aleksandar Vitorovic, Matthaios Olma, Robert Grandl, Surajit Chaudhuri, Bolin Ding:
Quickr: Lazily Approximating Complex AdHoc Queries in BigData Clusters. 631-646
- Shuang Chen, Shunning Jiang, Bingsheng He, Xueyan Tang:
A Study of Sorting Algorithms on Approximate Memory. 647-662
- Ioannis Mytilinis, Dimitrios Tsoumakos, Nectarios Koziris:
Distributed Wavelet Thresholding for Maximum Error Metrics. 663-677
- Bolin Ding, Silu Huang, Surajit Chaudhuri, Kaushik Chakrabarti, Chi Wang:
Sample + Seek: Approximating Aggregates with Distribution Precision Guarantee. 679-694
Session 8 - Networks and the Web
- Hung T. Nguyen, My T. Thai, Thang N. Dinh:
Stop-and-Stare: Optimal Sampling Algorithms for Viral Marketing in Billion-scale Networks. 695-710
- Yasir Mehmood, Francesco Bonchi, David García-Soriano:
Spheres of Influence for More Effective Viral Marketing. 711-726
- Yu Yang, Xiangbo Mao, Jian Pei, Xiaofei He:
Continuous Influence Maximization: What Discounts Should We Offer to Social Network Users? 727-741
- Sainyam Galhotra, Akhil Arora, Shourya Roy:
Holistic Influence Maximization: Combining Scalability and Efficiency with Opinion-Aware Models. 743-758
- Astrid Rheinländer, Mario Lehmann, Anja Kunkel, Jörg Meier, Ulf Leser:
Potential and Pitfalls of Domain-Specific Information Extraction at Web Scale. 759-771
- Tim Furche, Jinsong Guo, Sebastian Maneth, Christian Schallhart:
Robust and Noise Resistant Wrapper Induction. 773-784
Session 9 - Data Discovery and Extraction
- Alon Y. Halevy, Flip Korn, Natalya Fridman Noy, Christopher Olston, Neoklis Polyzotis, Sudip Roy, Steven Euijong Whang:
Goods: Organizing Google's Datasets. 795-806
- Tomer Sagi, Avigdor Gal, Omer Barkol, Ruth Bergman, Alexander Avram:
Multi-Source Uncertain Entity Resolution at Yad Vashem: Transforming Holocaust Victim Reports into People. 807-819
- Thorsten Papenbrock, Felix Naumann:
A Hybrid Approach to Functional Dependency Discovery. 821-833
- Yang Chen, Sean Goldberg, Daisy Zhe Wang, Soumitra Siddharth Johri:
Ontological Pathfinding. 835-846
- Ce Zhang, Jaeho Shin, Christopher Ré, Michael J. Cafarella, Feng Niu:
Extracting Databases from Dark Data with DeepDive. 847-859
- Yeounoh Chung, Michael Lind Mortensen, Carsten Binnig, Tim Kraska:
Estimating the Impact of Unknown Unknowns on Aggregate Query Results. 861-876
Session 10 - Data Integration / Cleaning
- Shaoxu Song, Han Zhu, Jianmin Wang:
Constraint-Variance Tolerant Data Repairing. 877-892
- Jian He, Enzo Veltri, Donatello Santoro, Guoliang Li, Giansalvatore Mecca, Paolo Papotti, Nan Tang:
Interactive and Deterministic Data Cleaning. 893-907
- Aoqian Zhang, Shaoxu Song, Jianmin Wang:
Sequential Data Cleaning: A Statistical Approach. 909-924
- Asif Iqbal Baba, Manfred Jaeger, Hua Lu, Torben Bach Pedersen, Wei-Shinn Ku, Xike Xie:
Learning-Based Cleansing for Indoor RFID Data. 925-936
- Sanjay Krishnan, Jiannan Wang, Michael J. Franklin, Ken Goldberg, Tim Kraska:
PrivateClean: Data Cleaning and Differential Privacy. 937-951
- Sebastian Kruse, Anja Jentzsch, Thorsten Papenbrock, Zoi Kaoudi, Jorge-Arnulfo Quiané-Ruiz, Felix Naumann:
RDFind: Scalable Conditional Inclusion Dependency Discovery in RDF Datasets. 953-967
- Chengliang Chai, Guoliang Li, Jian Li, Dong Deng, Jianhua Feng:
Cost-Effective Crowdsourced Entity Resolution: A Partial-Order Approach. 969-984
Session 11 - Spatio / Temporal Databases
- Kaiqi Zhao, Lisi Chen, Gao Cong:
Topic Exploration in Spatio-Temporal Document Collections. 985-998
- Markus Pilman, Martin Kaufmann, Florian Köhl, Donald Kossmann, Damien Profeta:
ParTime: Parallel Temporal Aggregation. 999-1010
- Fernando Chirigati, Harish Doraiswamy, Theodoros Damoulas, Juliana Freire:
Data Polygamy: The Many-Many Relationships among Urban Spatio-Temporal Data Sets. 1011-1025
- Julien Pilourdault, Vincent Leroy, Sihem Amer-Yahia:
Distributed Evaluation of Top-k Temporal Joins. 1027-1039
- Peter Ogden, David B. Thomas, Peter R. Pietzuch:
AT-GIS: Highly Parallel Spatial Query Processing with Associative Transducers. 1041-1054
- Kaiyu Feng, Gao Cong, Sourav S. Bhowmick, Wen-Chih Peng, Chunyan Miao:
Towards Best Region Search for Data Exploration. 1055-1070
- Dong Xie, Feifei Li, Bin Yao, Gefei Li, Liang Zhou, Minyi Guo:
Simba: Efficient In-Memory Spatial Analytics. 1071-1085
Session 12 - Distributed Data Processing
- Guoqiang Jerry Chen, Janet L. Wiener, Shridhar Iyer, Anshul Jaiswal, Ran Lei, Nikhil Simha, Wei Wang, Kevin Wilfong, Tim Williamson, Serhat Yilmaz:
Realtime Data Processing at Facebook. 1087-1098
- Shivaram Venkataraman, Zongheng Yang, Davies Liu, Eric Liang, Hossein Falaki, Xiangrui Meng, Reynold Xin, Ali Ghodsi, Michael J. Franklin, Ion Stoica, Matei Zaharia:
SparkR: Scaling R Programs with Spark. 1099-1104
- Andrei Costea, Adrian Ionescu, Bogdan Raducanu, Michal Switakowski, Cristian Bârca, Juliusz Sompolski, Alicja Luszczak, Michal Szafranski, Giel de Nijs, Peter A. Boncz:
VectorH: Taking SQL-on-Hadoop to the Next Level. 1105-1117
- Chang Yao, Divyakant Agrawal, Gang Chen, Beng Chin Ooi, Sai Wu:
Adaptive Logging: Optimizing Logging and Recovery Costs in Distributed In-memory Databases. 1119-1134
- Alexander Shkapsky, Mohan Yang, Matteo Interlandi, Hsuan Chiu, Tyson Condie, Carlo Zaniolo:
Big Data Analytics with Datalog Queries on Spark. 1135-1149
- Tova Milo, Eyal Altshuler:
An Efficient MapReduce Cube Algorithm for Varied Data Distributions. 1151-1165
Session 13 - Graphs 2: Subgraph-based Optimization Techniques
- Zhengwei Yang, Ada Wai-Chee Fu, Ruifeng Liu:
Diversified Top-k Subgraph Querying in a Large Graph. 1167-1182
- Mohamed S. Hassan, Walid G. Aref, Ahmed M. Aly:
Graph Indexing for Shortest-Path Finding over Dynamic Sub-Graphs. 1183-1197
- Fei Bi, Lijun Chang, Xuemin Lin, Lu Qin, Wenjie Zhang:
Efficient Subgraph Matching by Postponing Cartesian Products. 1199-1214
- Wenfei Fan, Yinghui Wu, Jingbo Xu:
Adding Counting Quantifiers to Graph Patterns. 1215-1230
- Hyeonji Kim, Juneyoung Lee, Sourav S. Bhowmick, Wook-Shin Han, Jeong-Hoon Lee, Seongyun Ko, Moath H. A. Jarrah:
DUALSIM: Parallel Subgraph Enumeration in a Massive Graph on a Single Machine. 1231-1245
- Sairam Gurajada, Martin Theobald:
Distributed Set Reachability. 1247-1261
Session 14 - Main Memory Analytics
- Wenjian Xu, Ziqiang Feng, Eric Lo:
Fast Multi-Column Sorting in Main-Memory Column-Stores. 1263-1278
- Li Wang, Minqi Zhou, Zhenjie Zhang, Yin Yang, Aoying Zhou, Dina Bitton:
Elastic Pipelining in an In-Memory Database Cluster. 1279-1294
- Reza Sherkat, Colin Florendo, Mihnea Andrei, Anil K. Goel, Anisoara Nica, Peter Bumbulis, Ivan Schreter, Günter Radestock, Christian Bensberg, Daniel Booss, Heiko Gerwens:
Page As You Go: Piecewise Columnar Access In SAP HANA. 1295-1306
- Juchang Lee, Hyungyu Shin, Chang Gyoo Park, Seongyun Ko, Jaeyun Noh, Yongjae Chuh, Wolfgang Stephan, Wook-Shin Han:
Hybrid Garbage Collection for Multi-Version Concurrency Control in SAP HANA. 1307-1318
- Manos Athanassoulis, Zheng Yan, Stratos Idreos:
UpBit: Scalable In-Memory Updatable Bitmap Indexing. 1319-1332
Session 15 - Interactive Analytics
- Roee Ebenstein, Niranjan Kamat, Arnab Nandi:
FluxQuery: An Execution Framework for Highly Interactive Query Workloads. 1333-1345
- Kai Zeng, Sameer Agarwal, Ion Stoica:
iOLAP: Managing Uncertainty for Efficient Incremental OLAP. 1347-1361
- Leilani Battle, Remco Chang, Michael Stonebraker:
Dynamic Prefetching of Data Tiles for Interactive Visualization. 1363-1375
- Eirik Bakke, David R. Karger:
Expressive Query Construction through Direct Manipulation of Nested Relational Results. 1377-1392
- Gokul Nath Babu Manoharan, Stephan Ellner, Karl Schnaitter, Sridatta Chegu, Alejandro Estrella-Balderrama, Stephan Gudmundson, Apurv Gupta, Ben Handy, Bart Samwel, Chad Whipkey, Larysa Aharkava, Himani Apte, Nitin Gangahar, Jun Xu, Shivakumar Venkataraman, Divyakant Agrawal, Jeffrey D. Ullman:
Shasta: Interactive Reporting At Scale. 1393-1404
- Lyublena Antova, Rhonda Baldwin, Derrick Bryant, Tuan Cao, Michael Duller, John Eshleman, Zhongxian Gu, Entong Shen, Mohamed A. Soliman, F. Michael Waas:
Datometry Hyper-Q: Bridging the Gap Between Real-Time and Historical Analytics. 1405-1416
Session 16 - Streaming 2: Sketches
- Anshumali Shrivastava, Arnd Christian König, Mikhail Bilenko:
Time Adaptive Sketches (Ada-Sketches) for Summarizing Data Streams. 1417-1432
- Di Chen, Qin Zhang:
Streaming Algorithms for Robust Distinct Elements. 1433-1447
- Pratanu Roy, Arijit Khan, Gustavo Alonso:
Augmented Sketch: Faster and More Accurate Stream Processing. 1449-1463
- Zhewei Wei, Xuancheng Liu, Feifei Li, Shuo Shang, Xiaoyong Du, Ji-Rong Wen:
Matrix Sketching Over Sliding Windows. 1465-1480
- Nan Tang, Qing Chen, Prasenjit Mitra:
Graph Stream Summarization: From Big Bang to Big Crunch. 1481-1496
- Nikos Giatrakos, Antonios Deligiannakis, Minos N. Garofalakis:
Scalable Approximate Query Tracking over Highly Distributed Data Streams. 1497-1512
Session 17 - Transaction Processing
- Amirhesam Shahvarani, Hans-Arno Jacobsen:
A Hybrid B+-tree as Solution for In-Memory Indexing on CPU-GPU Heterogeneous Computing Platforms. 1523-1538
- Kun Ren, Thaddeus Diamond, Daniel J. Abadi, Alexander Thomson:
Low-Overhead Asynchronous Checkpointing in Main-Memory Database Systems. 1539-1551
- Shan-Hung Wu, Tsai-Yu Feng, Meng-Kai Liao, Shao-Kan Pi, Yu-Shan Lin:
T-Part: Partitioning of Transactions for Forward-Pushing in Deterministic Database Systems. 1553-1565
- Huanchen Zhang, David G. Andersen, Andrew Pavlo, Michael Kaminsky, Lin Ma, Rui Shen:
Reducing the Storage Overhead of Main-Memory OLTP Databases with Hybrid Indexes. 1567-1581
- Kun Ren, Jose M. Faleiro, Daniel J. Abadi:
Design Principles for Scaling Multi-core OLTP Under High Contention. 1583-1598
- Dong Young Yoon, Ning Niu, Barzan Mozafari:
DBSherlock: A Performance Diagnostic Tool for Transactional Databases. 1599-1614
Session 18 - Transactions and Consistency
- Natacha Crooks, Youer Pu, Nancy Estrada, Trinabh Gupta, Lorenzo Alvisi, Allen Clement:
TARDiS: A Branch-and-Merge Approach To Weak Consistency. 1615-1628
- Xiangyao Yu, Andrew Pavlo, Daniel Sánchez, Srinivas Devadas:
TicToc: Time Traveling Optimistic Concurrency Control. 1629-1642
- Zhaoguo Wang, Shuai Mu, Yang Cui, Han Yi, Haibo Chen, Jinyang Li:
Scaling Multicore Databases via Constrained Parallel Execution. 1643-1658
- Qian Lin, Pengfei Chang, Gang Chen, Beng Chin Ooi, Kian-Lee Tan, Zhengkui Wang:
Towards a Non-2PC Transaction Management in Distributed Database Systems. 1659-1674
- Kangnyeon Kim, Tianzheng Wang, Ryan Johnson, Ippokratis Pandis:
ERMIA: Fast Memory-Optimized Database System for Heterogeneous Workloads. 1675-1687
- Yingjun Wu, Chee Yong Chan, Kian-Lee Tan:
Transaction Healing: Scaling Optimistic Concurrency Control on Multicores. 1689-1704
Session 19 - Query Optimization
- Mengmeng Liu, Zachary G. Ives, Boon Thau Loo:
Enabling Incremental Query Re-Optimization. 1705-1720
- Wentao Wu, Jeffrey F. Naughton, Harneet Singh:
Sampling-Based Query Re-Optimization. 1721-1736
- Immanuel Trummer, Christoph Koch:
A Fast Randomized Algorithm for Multi-Objective Query Optimization. 1737-1752
- Kukjin Lee, Arnd Christian König, Vivek R. Narasayya, Bolin Ding, Surajit Chaudhuri, Brent Ellwein, Alexey Eksarevskiy, Manbeen Kohli, Jacob Wyant, Praneeta Prakash, Rimma V. Nehme, Jiexing Li, Jeffrey F. Naughton:
Operator and Query Progress Estimation in Microsoft SQL Server Live Query Statistics. 1753-1764
- Jürgen Hölsch, Michael Grossniklaus, Marc H. Scholl:
Optimization of Nested Queries using the NF2 Algebra. 1765-1780
- K. Venkatesh Emani, Karthik Ramachandra, Subhro Bhattacharya, S. Sudarshan:
Extracting Equivalent SQL from Imperative Code in Database Applications. 1781-1796
Session 20 - Graphs 3: Potpourri
- Ning Yan, Sona Hasani, Abolfazl Asudeh, Chengkai Li:
Generating Preview Tables for Entity Graphs. 1797-1811
- Hao Wei, Jeffrey Xu Yu, Can Lu, Xuemin Lin:
Speedup Graph Processing by Graph Ordering. 1813-1828
- Ali Hadian, Sadegh Nobari, Behrouz Minaei-Bidgoli, Qiang Qu:
ROLL: Fast In-Memory Generation of Gigantic Scale-free Networks. 1829-1842
- Wenfei Fan, Yinghui Wu, Jingbo Xu:
Functional Dependencies for Graphs. 1843-1857
- Boyu Tian, Xiaokui Xiao:
SLING: A Near-Optimal Index Structure for SimRank. 1859-1874
- Nikolay Yakovets, Parke Godfrey, Jarek Gryz:
Query Planning for Evaluating SPARQL Property Paths. 1875-1889
Session 21 - Hardware Acceleration and Query Compilation
- Sebastian Breß, Henning Funke, Jens Teubner:
Robust Query Processing in Co-Processor-accelerated Databases. 1891-1906
- Amir Shaikhha, Yannis Klonatos, Lionel Parreaux, Lewis Brown, Mohammad Dashti, Christoph Koch:
How to Architect a Query Compiler. 1907-1922
- Sudipto Das, Feng Li, Vivek R. Narasayya, Arnd Christian König:
Automated Demand-driven Resource Scaling in Relational Database-as-a-Service. 1923-1934
- Johns Paul, Jiong He, Bingsheng He:
GPL: A GPU-based Pipelined Query Processing Engine. 1935-1950
- Sina Meraji, Berni Schiefer, Lan Pham, Lee Chu, Peter Kokosielis, Adam J. Storm, Wayne Young, Chang Ge, Geoffrey Ng, Kajan Kanagaratnam:
Towards a Hybrid Design for Fast Query Processing in DB2 with BLU Acceleration Using Graphical Processing Units: A Technology Demonstration. 1951-1960
- Stefan Schuh, Xiao Chen, Jens Dittrich:
An Experimental Comparison of Thirteen Relational Equi-Joins in Main Memory. 1961-1976
Session 22 - Nearest Neighbors and Similarity Search
- Jieming Shi, Dingming Wu, Nikos Mamoulis:
Top-k Relevant Semantic Place Retrieval on Spatial RDF Data. 1977-1990
- Pei Wang, Chuan Xiao, Jianbin Qin, Wei Wang, Xiaoyang Zhang, Yoshiharu Ishikawa:
Local Similarity Search for Unstructured Text. 1991-2005
- Weijie Zhao, Florin Rusu, Bin Dong, Kesheng Wu:
Similarity Join over Array Data. 2007-2022
- Yuxin Zheng, Qi Guo, Anthony K. H. Tung, Sai Wu:
LazyLSH: Approximate Nearest Neighbor Search for Multiple Distance Functions with a Single Index. 2023-2037
- Jinglin Peng, Hongzhi Wang, Jianzhong Li, Hong Gao:
Set-based Similarity Search for Time Series. 2039-2052
- Huaijie Zhu, Xiaochun Yang, Bin Wang, Wang-Chien Lee:
Range-based Obstructed Nearest Neighbor Queries. 2053-2068
Session 23 - Demonstrations
- Divy Agrawal, Mouhamadou Lamine Ba, Laure Berti-Équille, Sanjay Chawla, Ahmed K. Elmagarmid, Hossam Hammady, Yasser Idris, Zoi Kaoudi, Zuhair Khayyat, Sebastian Kruse, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Mohammed J. Zaki:
Rheem: Enabling Multi-Platform Task Execution. 2069-2072
- Alexander Alexandrov, Andreas Salzmann, Georgi Krastev, Asterios Katsifodimos, Volker Markl:
Emma in Action: Declarative Dataflows for Scalable Data Analysis. 2073-2076
- Ronald Barber, Matt Huras, Guy M. Lohman, C. Mohan, René Müller, Fatma Özcan, Hamid Pirahesh, Vijayshankar Raman, Richard Sidle, Oleg Sidorkin, Adam J. Storm, Yuanyuan Tian, Pinar Tözün:
Wildfire: Concurrent Blazing Data Ingest and Analytics. 2077-2080
- Xuntao Cheng, Bingsheng He, Mian Lu, Chiew Tong Lau, Huynh Phung Huynh, Rick Siow Mong Goh:
Efficient Query Processing on Many-core Architectures: A Case Study with Intel Xeon Phi Processor. 2081-2084
- Fernando Chirigati, Rémi Rampin, Dennis E. Shasha, Juliana Freire:
ReproZip: Computational Reproducibility With Ease. 2085-2088
- Mina H. Farid, Alexandra Roatis, Ihab F. Ilyas, Hella-Franziska Hoffmann, Xu Chu:
CLAMS: Bringing Quality to Data Lakes. 2089-2092
- Ioannis Flouris, Vasiliki Manikaki, Nikos Giatrakos, Antonios Deligiannakis, Minos N. Garofalakis, Michael Mock, Sebastian Bothe, Inna Skarbovsky, Fabiana Fournier, Marko Stajcer, Tomislav Krizan, Jonathan Yom-Tov, Taji Curin:
FERARI: A Prototype for Complex Event Processing over Streaming Multi-cloud Platforms. 2093-2096
- Rihan Hai, Sandra Geisler, Christoph Quix:
Constance: An Intelligent Data Lake System. 2097-2100
- Michael Hay, Ashwin Machanavajjhala, Gerome Miklau, Yan Chen, Dan Zhang, George Bissias:
Exploring Privacy-Accuracy Tradeoffs using DPComp. 2101-2104
- Alexander Kalinin, Ugur Çetintemel, Stan Zdonik:
Interactive Search and Exploration of Waveform Data with Searchlight. 2105-2108
- Evgeny Kharlamov, Sebastian Brandt, Ernesto Jiménez-Ruiz, Yannis Kotidis, Steffen Lamparter, Theofilos Mailis, Christian Neuenstadt, Özgür L. Özçep, Christoph Pinkel, Christoforos Svingos, Dmitriy Zheleznyakov, Ian Horrocks, Yannis E. Ioannidis, Ralf Möller:
Ontology-Based Integration of Streaming and Static Relational Data with Optique. 2109-2112
- Boyan Kolev, Carlyna Bondiombouy, Patrick Valduriez, Ricardo Jiménez-Peris, Raquel Pau, José Pereira:
The CloudMdsQL Multistore System. 2113-2116
- Sanjay Krishnan, Michael J. Franklin, Ken Goldberg, Jiannan Wang, Eugene Wu:
ActiveClean: An Interactive Data Cleaning Framework For Modern Machine Learning. 2117-2120
- Feifei Li, Bin Wu, Ke Yi, Zhuoyue Zhao:
Wander Join: Online Aggregation for Joins. 2121-2124
- Yaguang Li, Han Su, Ugur Demiryurek, Bolong Zheng, Kai Zeng, Cyrus Shahabi:
PerNav: A Route Summarization Framework for Personalized Navigation. 2125-2128
- Gabriel Lyons, Vinh Tran, Carsten Binnig, Ugur Çetintemel, Tim Kraska:
Making the Case for Query-by-Voice with EchoQuery. 2129-2132
- Antonio Maccioni, Edoardo Basili, Riccardo Torlone:
QUEPA: QUerying and Exploring a Polystore by Augmentation. 2133-2136
- Tova Milo, Amit Somech:
REACT: Context-Sensitive Recommendations for Data Analysis. 2137-2140
- Jennifer Ortiz, Brendan Lee, Magdalena Balazinska:
PerfEnforce Demonstration: Data Analytics with Performance Guarantees. 2141-2144
- Varun Pandey, Andreas Kipf, Dimitri Vorona, Tobias Mühlbauer, Thomas Neumann, Alfons Kemper:
High-Performance Geospatial Analytics in HyPerSpace. 2145-2148
- Holger Pirk, Oscar R. Moll, Sam Madden:
What Makes a Good Physical plan?: Experiencing Hardware-Conscious Query Optimization with Candomblé. 2149-2152
- Jags Ramnarayan, Barzan Mozafari, Sumedh Wale, Sudhir Menon, Neeraj Kumar, Hemant Bhanawat, Soubhik Chakraborty, Yogesh Mahajan, Rishitesh Mishra, Kishor Bachhav:
SnappyData: A Hybrid Transactional Analytical Store Built On Spark. 2153-2156
- Theodoros Rekatsinas, Amol Deshpande, Xin Luna Dong, Lise Getoor, Divesh Srivastava:
SourceSight: Enabling Effective Source Selection. 2157-2160
- Donatello Santoro, Patricia C. Arocena, Boris Glavic, Giansalvatore Mecca, Renée J. Miller, Paolo Papotti:
BART in Action: Error Generation and Empirical Evaluations of Data-Cleaning Systems. 2161-2164
- Youying Shi, Abdeltawab M. Hendawi, Hossam Fattah, Mohamed H. Ali:
RxSpatial: Reactive Spatial Library for Real-Time Location Tracking and Processing. 2165-2168
- Robert Ulbricht, Claudio Hartmann, Martin Hahmann, Hilko Donker, Wolfgang Lehner:
Web-based Benchmarks for Forecasting Systems: The ECAST Platform. 2169-2172
- Annett Ungethüm, Thomas Kissinger, Willi-Wolfram Mentzel, Dirk Habich, Wolfgang Lehner:
Energy Elasticity on Heterogeneous Hardware using Adaptive Resource Reconfiguration LIVE. 2173-2176
- Xiaolan Wang, Alexandra Meliou, Eugene Wu:
QFix: Demonstrating Error Diagnosis in Query Histories. 2177-2180
- Xiang Ying, Chaokun Wang, Meng Wang, Jeffrey Xu Yu, Jun Zhang:
CoDAR: Revealing the Generalized Procedure & Recommending Algorithms of Community Detection. 2181-2184
- Victor Zakhary, Faisal Nawab, Divyakant Agrawal, Amr El Abbadi:
DB-Risk: The Game of Global Database Placement. 2185-2188
- Qizhen Zhang, Da Yan, James Cheng:
Quegel: A General-Purpose System for Querying Big Graphs. 2189-2192
Session 24 - Tutorials
- Michael Armbrust, Doug Bateman, Reynold Xin, Matei Zaharia:
Introduction to Spark 2.0 for Database Researchers. 2193-2194
- Manos Athanassoulis, Stratos Idreos:
Design Tradeoffs of Data Access Methods. 2195-2200
- Xu Chu, Ihab F. Ilyas, Sanjay Krishnan, Jiannan Wang:
Data Cleaning: Overview and Emerging Challenges. 2201-2206
- Gao Cong, Christian S. Jensen:
Querying Geo-Textual Data: Spatial Keyword Queries and Beyond. 2207-2212
- Melanie Herschel, Marcel Hlawatsch:
Provenance: On and Behind the Screens. 2213-2217
- Mohamed F. Mokbel, Amr Magdy:
Microblogs Data Management Systems: Querying, Analysis, and Visualization. 2219-2222
- Faisal Nawab, Divyakant Agrawal, Amr El Abbadi:
The Challenges of Global-scale Data Management. 2223-2227
- Yannis Papakonstantinou:
Semistructured Models, Queries and Algebras in the Big Data Era: Tutorial Summary. 2229-2233
- Xiang Ren, Ahmed El-Kishky, Heng Ji, Jiawei Han:
Automatic Entity Recognition and Typing in Massive Text Data. 2235-2239
- Da Yan, Yingyi Bu, Yuanyuan Tian, Amol Deshpande, James Cheng:
Big Graph Analytics Systems. 2241-2243
Session 25 - Undergraduate Student Abstracts
- Kaleb Alway, Anisoara Nica:
Constructing Join Histograms from Histograms with q-error Guarantees. 2245-2246
- Colin Biafore, Faisal Nawab:
Graph Summarization for Geo-correlated Trends Detection in Social Networks. 2247-2248
- Dezhi Fang, Duen Horng Chau:
M3: Scaling Up Machine Learning via Memory Mapping. 2249-2250
- Valentin D. Grigorev, George A. Chernishev:
K-means Split Revisited: Well-grounded Approach and Experimental Evaluation. 2251-2252
- Zezhou Liu, Stratos Idreos:
Main Memory Adaptive Denormalization. 2253-2254
- Wilson Qin, Stratos Idreos:
Adaptive Data Skipping in Main-Memory Systems. 2255-2256
- BiChen Rao, Erkang Zhu:
Searching Web Data using MinHash LSH. 2257-2258
- Lais M. A. Rocha, Mirella M. Moro:
Research Contribution as a Measure of Influence. 2259-2260
- Panagiotis Sioulas, Anastasia Ailamaki:
Vectorizing an In Situ Query Engine. 2261-2262
- Larry Xu:
Exploring Visualization of Data Transforms. 2263-2264
- Sepanta Zeighami, Raymond Chi-Wing Wong:
Minimizing Average Regret Ratio in Database. 2265-2266