Stop Thinking, Just Do!

Sung-Soo Kim's Blog

SIGMOD 2015 Papers

tagsTags

31 May 2015


Article Source


SIGMOD ‘15- Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data

Digital Library logoFull Citation in the ACM Digital Library

SESSION: Keynote 1

From Data to Insights @ Bare Metal Speed

  • Jignesh M. Patel

SESSION: Research Session 1 - Cloud: Parallel Execution

Distributed Outlier Detection using Compressive Sensing

  • Ying Yan
  • Jiaxing Zhang
  • Bojun Huang
  • Xuzhan Sun
  • Jiaqi Mu
  • Zheng Zhang
  • Thomas Moscibroda

Locality-aware Partitioning in Parallel Database Systems

  • Erfan Zamanian
  • Carsten Binnig
  • Abdallah Salama

ByteSlice: Pushing the Envelop of Main Memory Data Processing with a New Storage Layout

  • Ziqiang Feng
  • Eric Lo
  • Ben Kao
  • Wenjian Xu

Implicit Parallelism through Deep Language Embedding

  • Alexander Alexandrov
  • Andreas Kunft
  • Asterios Katsifodimos
  • Felix Schüler
  • Lauritz Thamsen
  • Odej Kao
  • Tobias Herb
  • Volker Markl

From Theory to Practice: Efficient Join Query Evaluation in a Parallel Database System

  • Shumo Chu
  • Magdalena Balazinska
  • Dan Suciu

SESSION: Research Session 2 - Matrix and Array Computations

sPCA: Scalable Principal Component Analysis for Big Data on Distributed Platforms

  • Tarek Elgamal
  • Maysam Yabandeh
  • Ashraf Aboulnaga
  • Waleed Mustafa
  • Mohamed Hefeeda

Exploiting Matrix Dependency for Efficient Distributed Matrix Computation

  • Lele Yu
  • Yingxia Shao
  • Bin Cui

LEMP: Fast Retrieval of Large Entries in a Matrix Product

  • Christina Teflioudi
  • Rainer Gemulla
  • Olga Mykytiuk

Skew-Aware Join Optimization for Array Databases

  • Jennie Duggan
  • Olga Papaemmanouil
  • Leilani Battle
  • Michael Stonebraker

Resource Elasticity for Large-Scale Machine Learning

  • Botong Huang
  • Matthias Boehm
  • Yuanyuan Tian
  • Berthold Reinwald
  • Shirish Tatikonda
  • Frederick R. Reiss

SESSION: Research Session 3 - Security and Access Control

SEMROD: Secure and Efficient MapReduce Over HybriD Clouds

  • Kerim Yasin Oktay
  • Sharad Mehrotra
  • Vaibhav Khadilkar
  • Murat Kantarcioglu

Authenticated Online Data Integration Services

  • Qian Chen
  • Haibo Hu
  • Jianliang Xu

ENKI: Access Control for Encrypted Query Processing

  • Isabelle Hang
  • Florian Kerschbaum
  • Ernesto Damiani

Collaborative Access Control in WebdamLog

  • Vera Zaychik Moffitt
  • Julia Stoyanovich
  • Serge Abiteboul
  • Gerome Miklau

Automatic Enforcement of Data Use Policies with DataLawyer

  • Prasang Upadhyaya
  • Magdalena Balazinska
  • Dan Suciu

SESSION: Industry Session 1 - Streaming/Real-Time/Active

TencentRec: Real-time Stream Recommendation in Practice

  • Yanxiang Huang
  • Bin Cui
  • Wenyu Zhang
  • Jie Jiang
  • Ying Xu

Twitter Heron: Stream Processing at Scale

  • Sanjeev Kulkarni
  • Nikunj Bhagat
  • Masong Fu
  • Vikas Kedigehalli
  • Christopher Kellogg
  • Sailesh Mittal
  • Jignesh M. Patel
  • Karthik Ramasamy
  • Siddarth Taneja

Analytics in Motion: High Performance Event-Processing AND Real-Time Analytics in the Same Database

  • Lucas Braun
  • Thomas Etter
  • Georgios Gasparis
  • Martin Kaufmann
  • Donald Kossmann
  • Daniel Widmer
  • Aharon Avitzur
  • Anthony Iliopoulos
  • Eliezer Levy
  • Ning Liang

Why Big Data Industrial Systems Need Rules and What We Can Do About It

  • Paul Suganthan G.C.
  • Chong Sun
  • Krishna Gayatri K.
  • Haojun Zhang
  • Frank Yang
  • Narasimhan Rampalli
  • Shishir Prasad
  • Esteban Arcaute
  • Ganesh Krishnan
  • Rohit Deep
  • Vijay Raghavendra
  • AnHai Doan

TUTORIAL SESSION: Tutorial 1

Overview of Data Exploration Techniques

  • Stratos Idreos
  • Olga Papaemmanouil
  • Surajit Chaudhuri

PANEL SESSION: Panel

Machine Learning and Databases: The Sound of Things to Come or a Cacophony of Hype?

  • Christopher Ré
  • Divy Agrawal
  • Magdalena Balazinska
  • Michael Cafarella
  • Michael Jordan
  • Tim Kraska
  • Raghu Ramakrishnan

SESSION: Research Session 4 - Cloud: Fault Tolerance, Reconfiguration

Cost-based Fault-tolerance for Parallel Data Processing

  • Abdallah Salama
  • Carsten Binnig
  • Tim Kraska
  • Erfan Zamanian

Squall: Fine-Grained Live Reconfiguration for Partitioned Main Memory Databases

  • Aaron J. Elmore
  • Vaibhav Arora
  • Rebecca Taft
  • Andrew Pavlo
  • Divyakant Agrawal
  • Amr El Abbadi

Madeus: Database Live Migration Middleware under Heavy Workloads for Cloud Environment

  • Takeshi Mishima
  • Yasuhiro Fujiwara

Lineage-driven Fault Injection

  • Peter Alvaro
  • Joshua Rosen
  • Joseph M. Hellerstein

SESSION: Research Session 5 - Keyword Search and Text

Diversity-Aware Top-k Publish/Subscribe for Text Stream

  • Lisi Chen
  • Gao Cong
  • Georgios Fakas
  • Zhi Cai
  • Nikos Mamoulis

Local Filtering: Improving the Performance of Approximate Queries on String Collections

  • Xiaochun Yang
  • Yaoshu Wang
  • Bin Wang
  • Wei Wang

Exact Top-k Nearest Keyword Search in Large Networks

  • Minhao Jiang
  • Ada Wai-Chee Fu
  • Raymond Chi-Wing Wong

Efficient Algorithms for Answering the m-Closest Keywords Query

  • Tao Guo
  • Xin Cao
  • Gao Cong

SESSION: Research Session 6 - Graph Primitives

Minimum Spanning Trees in Temporal Graphs

  • Silu Huang
  • Ada Wai-Chee Fu
  • Ruifeng Liu

Efficient Enumeration of Maximal k-Plexes

  • Devora Berlowitz
  • Sara Cohen
  • Benny Kimelfeld
  • Zhiwei Zhang
  • Jeffrey Xu Yu
  • Lu Qin
  • Zechao Shang

Index-based Optimal Algorithms for Computing Steiner Components with Maximum Connectivity

  • Lijun Chang
  • Xuemin Lin
  • Lu Qin
  • Jeffrey Xu Yu
  • Wenjie Zhang

SESSION: Research Session 7 - Data Mining

COMMIT: A Scalable Approach to Mining Communication Motifs from Dynamic Networks

  • Saket Gurukar
  • Sayan Ranu
  • Balaraman Ravindran

LASH: Large-Scale Sequence Mining with Hierarchies

  • Kaustubh Beedkar
  • Rainer Gemulla

Twister Tries: Approximate Hierarchical Agglomerative Clustering for Average Distance in Linear Time

  • Michael Cochez
  • Hao Mou

DBSCAN Revisited: Mis-Claim, Un-Fixability, and Approximation

  • Junhao Gan
  • Yufei Tao

The TagAdvisor: Luring the Lurkers to Review Web Items

  • Azade Nazi
  • Mahashweta Das
  • Gautam Das

SESSION: Research Session 8 - Uncertainty and Linking

Supporting Data Uncertainty in Array Databases

  • Liping Peng
  • Yanlei Diao

Identifying the Extent of Completeness of Query Answers over Partially Complete Databases

  • Simon Razniewski
  • Flip Korn
  • Werner Nutt
  • Divesh Srivastava

k-Hit Query: Top-k Query with Probabilistic Utility Function

  • Peng Peng
  • Raymong Chi-Wing Wong

Linking Temporal Records for Profiling Entities

  • Furong Li
  • Mong Li Lee
  • Wynne Hsu
  • Wang-Chiew Tan

SESSION: Industry Session 2 - Applications

Telco Churn Prediction with Big Data

  • Yiqing Huang
  • Fangzhou Zhu
  • Mingxuan Yuan
  • Ke Deng
  • Yanhua Li
  • Bing Ni
  • Wenyuan Dai
  • Qiang Yang
  • Jia Zeng

The LDBC Social Network Benchmark: Interactive Workload

  • Orri Erling
  • Alex Averbuch
  • Josep Larriba-Pey
  • Hassan Chafi
  • Andrey Gubichev
  • Arnau Prat
  • Minh-Duc Pham
  • Peter Boncz

Rethinking Data-Intensive Science Using Scalable Analytics Systems

  • Frank Austin Nothaft
  • Matt Massie
  • Timothy Danford
  • Zhao Zhang
  • Uri Laserson
  • Carl Yeksigian
  • Jey Kottalam
  • Arun Ahuja
  • Jeff Hammerbacher
  • Michael Linderman
  • Michael J. Franklin
  • Anthony D. Joseph
  • David A. Patterson

QMapper for Smart Grid: Migrating SQL-based Application to Hive

  • Yue Wang
  • Yingzhong Xu
  • Yue Liu
  • Jian Chen
  • Songlin Hu

SESSION: ACM-W Athena Lecturer Award

Three Favorite Results

  • Jennifer Widom

SESSION: Keynote 2

The Power Behind the Throne: Information Integration in the Age of Data-Driven Discovery

  • Laura M. Haas

SESSION: Research Session 9 - Transactional Architectures

On the Design and Scalability of Distributed Shared-Data Databases

  • Simon Loesing
  • Markus Pilman
  • Thomas Etter
  • Donald Kossmann

Fast Serializable Multi-Version Concurrency Control for Main-Memory Database Systems

  • Thomas Neumann
  • Tobias Mühlbauer
  • Alfons Kemper

FOEDUS: OLTP Engine for a Thousand Cores and NVRAM

  • Hideaki Kimura

Let’s Talk About Storage & Recovery Methods for Non-Volatile Memory Database Systems

  • Joy Arulraj
  • Andrew Pavlo
  • Subramanya R. Dulloor

SESSION: Research Session 10 - Privacy

Private Release of Graph Statistics using Ladder Functions

  • Jun Zhang
  • Graham Cormode
  • Cecilia M. Procopiuc
  • Divesh Srivastava
  • Xiaokui Xiao

Bayesian Differential Privacy on Correlated Data

  • Bin Yang
  • Issei Sato
  • Hiroshi Nakagawa

Modular Order-Preserving Encryption, Revisited

  • Charalampos Mavroforakis
  • Nathan Chenette
  • Adam O’Neill
  • George Kollios
  • Ran Canetti

Chiaroscuro: Transparency and Privacy for Massive Personal Time-Series Clustering

  • Tristan Allard
  • Georges Hébrail
  • Florent Masseglia
  • Esther Pacitti

SESSION: Research Session 11 - Streams

Persistent Data Sketching

  • Zhewei Wei
  • Ge Luo
  • Ke Yi
  • Xiaoyong Du
  • Ji-Rong Wen

Scalable Distributed Stream Join Processing

  • Qian Lin
  • Beng Chin Ooi
  • Zhengkui Wang
  • Cui Yu

SCREEN: Stream Data Cleaning under Speed Constraints

  • Shaoxu Song
  • Aoqian Zhang
  • Jianmin Wang
  • Philip S. Yu

Location-Aware Pub/Sub System: When Continuous Moving Queries Meet Dynamic Event Streams

  • Long Guo
  • Dongxiang Zhang
  • Guoliang Li
  • Kian-Lee Tan
  • Zhifeng Bao

DEMONSTRATION SESSION: Demo A

CE-Storm: Confidential Elastic Processing of Data Streams

  • Nick R. Katsipoulakis
  • Cory Thoma
  • Eric A. Gratta
  • Alexandros Labrinidis
  • Adam J. Lee
  • Panos K. Chrysanthis

A SQL Debugger Built from Spare Parts: Turning a SQL: 1999 Database System into Its Own Debugger

  • Benjamin Dietrich
  • Torsten Grust

Exploratory Keyword Search with Interactive Input

  • Zhifeng Bao
  • Yong Zeng
  • H.V. Jagadish
  • Tok Wang Ling

QE3D: Interactive Visualization and Exploration of Complex, Distributed Query Plans

  • Daniel Scheibli
  • Christian Dinse
  • Alexander Boehm

DataXFormer: An Interactive Data Transformation Tool

  • John Morcos
  • Ziawasch Abedjan
  • Ihab Francis Ilyas
  • Mourad Ouzzani
  • Paolo Papotti
  • Michael Stonebraker

Quality-Driven Continuous Query Execution over Out-of-Order Data Streams

  • Yuanzhen Ji
  • Hongjin Zhou
  • Zbigniew Jerzak
  • Anisoara Nica
  • Gregor Hackenbroich
  • Christof Fetzer

MoDisSENSE: A Distributed Spatio-Temporal and Textual Processing Platform for Social Networking Services

  • Ioannis Mytilinis
  • Ioannis Giannakopoulos
  • Ioannis Konstantinou
  • Katerina Doka
  • Dimitrios Tsitsigkos
  • Manolis Terrovitis
  • Lampros Giampouras
  • Nectarios Koziris

DocRicher: An Automatic Annotation System for Text Documents Using Social Media

  • Qiang Hu
  • Qi Liu
  • Xiaoli Wang
  • Anthony K.H. Tung
  • Shubham Goyal
  • Jisong Yang

A Demonstration of Rubato DB: A Highly Scalable NewSQL Database System for OLTP and Big Data Applications

  • Li-Yan Yuan
  • Lengdong Wu
  • Jia-Huai You
  • Yan Chi

G-OLA: Generalized On-Line Aggregation for Interactive Analysis on Big Data

  • Kai Zeng
  • Sameer Agarwal
  • Ankur Dave
  • Michael Armbrust
  • Ion Stoica

TUTORIAL SESSION: Tutorial 2

Mining and Forecasting of Big Time-series Data

  • Yasushi Sakurai
  • Yasuko Matsubara
  • Christos Faloutsos

SESSION: Research Session 12 - Spatial data

Optimal Spatial Dominance: An Effective Search of Nearest Neighbor Candidates

  • Xiaoyang Wang
  • Ying Zhang
  • Wenjie Zhang
  • Xuemin Lin
  • Muhammad Aamir Cheema

THERMAL-JOIN: A Scalable Spatial Join for Dynamic Workloads

  • Farhan Tauheed
  • Thomas Heinis
  • Anastasia Ailamaki

Indexing Metric Uncertain Data for Range Queries

  • Lu Chen
  • Yunjun Gao
  • Xinhan Li
  • Christian S. Jensen
  • Gang Chen
  • Baihua Zheng

Efficient Route Planning on Public Transportation Networks: A Labelling Approach

  • Sibo Wang
  • Wenqing Lin
  • Yi Yang
  • Xiaokui Xiao
  • Shuigeng Zhou

SESSION: Research Session 13- Crowdsourcing

The Importance of Being Expert: Efficient Max-Finding in Crowdsourcing

  • Aris Anagnostopoulos
  • Luca Becchetti
  • Adriano Fazzone
  • Ida Mele
  • Matteo Riondato

Minimizing Efforts in Validating Crowd Answers

  • Nguyen Quoc Viet Hung
  • Duong Chi Thang
  • Matthias Weidlich
  • Karl Aberer

iCrowd: An Adaptive Crowdsourcing Framework

  • Ju Fan
  • Guoliang Li
  • Beng Chin Ooi
  • Kian-lee Tan
  • Jianhua Feng

QASCA: A Quality-Aware Task Assignment System for Crowdsourcing Applications

  • Yudian Zheng
  • Jiannan Wang
  • Guoliang Li
  • Reynold Cheng
  • Jianhua Feng

tDP: An Optimal-Latency Budget Allocation Strategy for Crowdsourced MAXIMUM Operations

  • Vasilis Verroios
  • Peter Lofgren
  • Hector Garcia-Molina

DEMONSTRATION SESSION: Demo B

Thrifty: Offering Parallel Database as a Service using the Shared-Process Approach

  • Petrie Wong
  • Zhian He
  • Ziqiang Feng
  • Wenjian Xu
  • Eric Lo

BenchPress: Dynamic Workload Control in the OLTP-Bench Testbed

  • Dana Van Aken
  • Djellel E. Difallah
  • Andrew Pavlo
  • Carlo Curino
  • Philippe Cudré-Mauroux
  • V.M. Megler
  • David Maier

Slider: An Efficient Incremental Reasoner

  • Jules Chevalier
  • Julien Subercaze
  • Christophe Gravier
  • Frédérique Laforest

WANalytics: Geo-Distributed Analytics for a Data Intensive World

  • Ashish Vulimiri
  • Carlo Curino
  • Philip Brighten Godfrey
  • Thomas Jungblut
  • Konstantinos Karanasos
  • Jitendra Padhye
  • George Varghese

FTT: A System for Finding and Tracking Tourists in Public Transport Services

  • Huayu Wu
  • Jo-Anne Tan
  • Wee Siong Ng
  • Mingqiang Xue
  • Wei Chen

SharkDB: An In-Memory Storage System for Massive Trajectory Data

  • Haozhou Wang
  • Kai Zheng
  • Xiaofang Zhou
  • Shazia Sadiq

Ringo: Interactive Graph Analytics on Big-Memory Machines

  • Yonathan Perez
  • Rok Sosič
  • Arijit Banerjee
  • Rohan Puttagunta
  • Martin Raison
  • Pararth Shah
  • Jure Leskovec

STORM: Spatio-Temporal Online Reasoning and Management of Large Spatio-Temporal Data

  • Robert Christensen
  • Lu Wang
  • Feifei Li
  • Ke Yi
  • Jun Tang
  • Natalee Villa

PAXQuery: Parallel Analytical XML Processing

  • Jesús Camacho-Rodríguez
  • Dario Colazzo
  • Ioana Manolescu
  • Juan A.M. Naranjo

SESSION: Research Session 14 - Indexing & Performance

Cache-Efficient Aggregation: Hashing Is Sorting

  • Ingo Müller
  • Peter Sanders
  • Arnaud Lacurie
  • Wolfgang Lehner
  • Franz Färber

Efficient Similarity Join and Search on Multi-Attribute Data

  • Guoliang Li
  • Jian He
  • Dong Deng
  • Jian Li

Holistic Indexing in Main-memory Column-stores

  • Eleni Petraki
  • Stratos Idreos
  • Stefan Manegold

CliffGuard: A Principled Framework for Finding Robust Database Designs

  • Barzan Mozafari
  • Eugene Zhen Ye Goh
  • Dong Young Yoon

Exploiting Correlations for Expensive Predicate Evaluation

  • Manas Joglekar
  • Hector Garcia-Molina
  • Aditya Parameswaran
  • Christopher Re

SESSION: Research Session 15 - Data Cleaning

Query-Oriented Data Cleaning with Oracles

  • Moria Bergman
  • Tova Milo
  • Slava Novgorodov
  • Wang-Chiew Tan

BigDansing: A System for Big Data Cleansing

  • Zuhair Khayyat
  • Ihab F. Ilyas
  • Alekh Jindal
  • Samuel Madden
  • Mourad Ouzzani
  • Paolo Papotti
  • Jorge-Arnulfo Quiané-Ruiz
  • Nan Tang
  • Si Yin

Data X-Ray: A Diagnostic Tool for Data Errors

  • Xiaolan Wang
  • Xin Luna Dong
  • Alexandra Meliou

KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing

  • Xu Chu
  • John Morcos
  • Ihab F. Ilyas
  • Mourad Ouzzani
  • Paolo Papotti
  • Nan Tang
  • Yin Ye

Crowd-Based Deduplication: An Adaptive Approach

  • Sibo Wang
  • Xiaokui Xiao
  • Chun-Hee Lee

SESSION: Research Session 16- Transactions

Minimizing Commit Latency of Transactions in Geo-Replicated Data Stores

  • Faisal Nawab
  • Vaibhav Arora
  • Divyakant Agrawal
  • Amr El Abbadi

Optimizing Optimistic Concurrency Control for Tree-Structured, Log-Structured Databases

  • Philip A. Bernstein
  • Sudipto Das
  • Bailu Ding
  • Markus Pilman

The Homeostasis Protocol: Avoiding Transaction Coordination Through Program Analysis

  • Sudip Roy
  • Lucja Kot
  • Gabriel Bender
  • Bailu Ding
  • Hossein Hojjat
  • Christoph Koch
  • Nate Foster
  • Johannes Gehrke

Feral Concurrency Control: An Empirical Investigation of Modern Application Integrity

  • Peter Bailis
  • Alan Fekete
  • Michael J. Franklin
  • Ali Ghodsi
  • Joseph M. Hellerstein
  • Ion Stoica

SESSION: Industry Session 3 - Novel Systems

REEF: Retainable Evaluator Execution Framework

  • Markus Weimer
  • Yingda Chen
  • Byung-Gon Chun
  • Tyson Condie
  • Carlo Curino
  • Chris Douglas
  • Yunseong Lee
  • Tony Majestro
  • Dahlia Malkhi
  • Sergiy Matusevych
  • Brandon Myers
  • Shravan Narayanamurthy
  • Raghu Ramakrishnan
  • Sriram Rao
  • Russel Sears
  • Beysim Sezgin
  • Julia Wang

Apache Tez: A Unifying Framework for Modeling and Building Data Processing Applications

  • Bikas Saha
  • Hitesh Shah
  • Siddharth Seth
  • Gopal Vijayaraghavan
  • Arun Murthy
  • Carlo Curino

Design and Implementation of the LogicBlox System

  • Molham Aref
  • Balder ten Cate
  • Todd J. Green
  • Benny Kimelfeld
  • Dan Olteanu
  • Emir Pasalic
  • Todd L. Veldhuizen
  • Geoffrey Washburn

Spark SQL: Relational Data Processing in Spark

  • Michael Armbrust
  • Reynold S. Xin
  • Cheng Lian
  • Yin Huai
  • Davies Liu
  • Joseph K. Bradley
  • Xiangrui Meng
  • Tomer Kaftan
  • Michael J. Franklin
  • Ali Ghodsi
  • Matei Zaharia

DEMONSTRATION SESSION: Demo C

Graft: A Debugging Tool For Apache Giraph

  • Semih Salihoglu
  • Jaeho Shin
  • Vikesh Khanna
  • Ba Quan Truong
  • Jennifer Widom

Even Metadata is Getting Big: Annotation Summarization using InsightNotes

  • Dongqing Xiao
  • Armir Bashllari
  • Tyler Menard
  • Mohamed Eltabakh

StoryPivot: Comparing and Contrasting Story Evolution

  • Anja Gruenheid
  • Donald Kossmann
  • Theodoros Rekatsinas
  • Divesh Srivastava

The Flatter, the Better: Query Compilation Based on the Flattening Transformation

  • Alexander Ulrich
  • Torsten Grust

D2WORM: A Management Infrastructure for Distributed Data-centric Workflows

  • Martin Jergler
  • Mohammad Sadoghi
  • Hans-Arno Jacobsen

NL~2~CM: A Natural Language Interface to Crowd Mining

  • Yael Amsterdamer
  • Anna Kukliansky
  • Tova Milo

Optimistic Recovery for Iterative Dataflows in Action

  • Sergey Dudoladov
  • Chen Xu
  • Sebastian Schelter
  • Asterios Katsifodimos
  • Stephan Ewen
  • Kostas Tzoumas
  • Volker Markl

A Secure Search Engine for the Personal Cloud

  • Saliha Lallali
  • Nicolas Anciaux
  • Iulian Sandu Popa
  • Philippe Pucheral

IReS: Intelligent, Multi-Engine Resource Scheduler for Big Data Analytics Workflows

  • Katerina Doka
  • Nikolaos Papailiou
  • Dimitrios Tsoumakos
  • Christos Mantas
  • Nectarios Koziris

Just can’t get enough: Synthesizing Big Data

  • Tilmann Rabl
  • Manuel Danisch
  • Michael Frank
  • Sebastian Schindler
  • Hans-Arno Jacobsen

SESSION: Research Session 17 - Hardware-Aware Query Processing

Rack-Scale In-Memory Join Processing using RDMA

  • Claude Barthels
  • Simon Loesing
  • Gustavo Alonso
  • Donald Kossmann

Self-Tuning, GPU-Accelerated Kernel Density Models for Multidimensional Selectivity Estimation

  • Max Heimel
  • Martin Kiefer
  • Volker Markl

Rethinking SIMD Vectorization for In-Memory Databases

  • Orestis Polychroniou
  • Arun Raghavan
  • Kenneth A. Ross

A Padded Encoding Scheme to Accelerate Scans by Leveraging Skew

  • Yinan Li
  • Craig Chasseur
  • Jignesh M. Patel

SESSION: Research Session 18 - Graph Propagation, Influence, Mining

GetReal: Towards Realistic Selection of Influence Maximization Strategies in Competitive Networks

  • Hui Li
  • Sourav S. Bhowmick
  • Jiangtao Cui
  • Yunjun Gao
  • Jianfeng Ma

Influence Maximization in Near-Linear Time: A Martingale Approach

  • Youze Tang
  • Yanchen Shi
  • Xiaokui Xiao

Community Level Diffusion Extraction

  • Zhiting Hu
  • Junjie Yao
  • Bin Cui
  • Eric Xing

BEAR: Block Elimination Approach for Random Walk with Restart on Large Graphs

  • Kijung Shin
  • Jinhong Jung
  • Sael Lee
  • U. Kang

The Minimum Wiener Connector Problem

  • Natali Ruchansky
  • Francesco Bonchi
  • David García-Soriano
  • Francesco Gullo
  • Nicolas Kourtellis

SESSION: Research Session 19 - Social Networks

From Group Recommendations to Group Formation

  • Senjuti Basu Roy
  • Laks V.S. Lakshmanan
  • Rui Liu

Real-Time Multi-Criteria Social Graph Partitioning: A Game Theoretic Approach

  • Nikos Armenatzoglou
  • Huy Pham
  • Vasilis Ntranos
  • Dimitris Papadias
  • Cyrus Shahabi

Utility-Aware Social Event-Participant Planning

  • Jieying She
  • Yongxin Tong
  • Lei Chen

Online Video Recommendation in Sharing Community

  • Xiangmin Zhou
  • Lei Chen
  • Yanchun Zhang
  • Longbing Cao
  • Guangyan Huang
  • Chen Wang

SESSION: Industry Session 4 - Performance

Large-scale Predictive Analytics in Vertica: Fast Data Transfer, Distributed Model Creation, and In-database Prediction

  • Shreya Prasad
  • Arash Fard
  • Vishrut Gupta
  • Jorge Martinez
  • Jeff LeFevre
  • Vincent Xu
  • Meichun Hsu
  • Indrajit Roy

Oracle Workload Intelligence

  • Quoc Trung Tran
  • Konstantinos Morfonios
  • Neoklis Polyzotis

Purity: Building Fast, Highly-Available Enterprise Flash Storage from Commodity Components

  • John Colgrove
  • John D. Davis
  • John Hayes
  • Ethan L. Miller
  • Cary Sandvig
  • Russell Sears
  • Ari Tamches
  • Neil Vachharajani
  • Feng Wang

On Improving User Response Times in Tableau

  • Pawel Terlecki
  • Fei Xu
  • Marianne Shaw
  • Valeri Kim
  • Richard Wesley

TUTORIAL SESSION: Tutorial 3

Data Management in Non-Volatile Memory

  • Stratis D. Viglas

SESSION: Research Session 20 - Information Extraction and Record Linking

TEGRA: Table Extraction by Global Record Alignment

  • Xu Chu
  • Yeye He
  • Kaushik Chakrabarti
  • Kris Ganjam

Mining Quality Phrases from Massive Text Corpora

  • Jialu Liu
  • Jingbo Shang
  • Chi Wang
  • Xiang Ren
  • Jiawei Han

Mining Subjective Properties on the Web

  • Immanuel Trummer
  • Alon Halevy
  • Hongrae Lee
  • Sunita Sarawagi
  • Rahul Gupta

Microblog Entity Linking with Social Temporal Context

  • Wen Hua
  • Kai Zheng
  • Xiaofang Zhou

SESSION: Research Session 21 - RDF and SPARQL

Graph-Aware, Workload-Adaptive SPARQL Query Caching

  • Nikolaos Papailiou
  • Dimitrios Tsoumakos
  • Panagiotis Karras
  • Nectarios Koziris

Left Bit Right: For SPARQL Join Queries with OPTIONAL Patterns (Left-outer-joins)

  • Medha Atre

How to Build Templates for RDF Question/Answering: An Uncertain Graph Similarity Join Approach

  • Weiguo Zheng
  • Lei Zou
  • Xiang Lian
  • Jeffrey Xu Yu
  • Shaoxu Song
  • Dongyan Zhao

RBench: Application-Specific RDF Benchmarking

  • Shi Qiao
  • Z. Meral Özsoyoğlu
  • Ahmed El-Roby
  • Ashraf Aboulnaga

SESSION: Research Session 22 - Time Series & Graph Processing

k-Shape: Efficient and Accurate Clustering of Time Series

  • John Paparrizos
  • Luis Gravano

SMiLer: A Semi-Lazy Time Series Prediction System for Sensors

  • Jingbo Zhou
  • Anthony K.H. Tung

SQLGraph: An Efficient Relational-Based Property Graph Store

  • Wen Sun
  • Achille Fokoue
  • Kavitha Srinivas
  • Anastasios Kementsietsidis
  • Gang Hu
  • Guotong Xie

Updating Graph Indices with a One-Pass Algorithm

  • Dayu Yuan
  • Prasenjit Mitra
  • Huiwen Yu
  • C. Lee Giles

SESSION: Industry Session 5 - Usability

Amazon Redshift and the Case for Simpler Data Warehouses

  • Anurag Gupta
  • Deepak Agarwal
  • Derek Tan
  • Jakub Kulesza
  • Rahul Pathak
  • Stefano Stefani
  • Vidhya Srinivasan

ShareInsights: An Unified Approach to Full-stack Data Processing

  • Mukund Deshpande
  • Dhruva Ray
  • Sameer Dixit
  • Avadhoot Agasti

SESSION: Research Session 23 - Advanced Query Processing

An Incremental Anytime Algorithm for Multi-Objective Query Optimization

  • Immanuel Trummer
  • Christoph Koch

Output-sensitive Evaluation of Prioritized Skyline Queries

  • Niccolo’ Meneghetti
  • Denis Mindolin
  • Paolo Ciaccia
  • Jan Chomicki

Learning Generalized Linear Models Over Normalized Data

  • Arun Kumar
  • Jeffrey Naughton
  • Jignesh M. Patel

Utilizing IDs to Accelerate Incremental View Maintenance

  • Yannis Katsis
  • Kian Win Ong
  • Yannis Papakonstantinou
  • Kevin Keliang Zhao

SESSION: Research Session 24 - New Models

S4: Top-k Spreadsheet-Style Search for Query Discovery

  • Fotis Psallidas
  • Bolin Ding
  • Kaushik Chakrabarti
  • Surajit Chaudhuri

Proactive Annotation Management in Relational Databases

  • Karim Ibrahim
  • Xiao Du
  • Mohamed Eltabakh

Weighted Coverage based Reviewer Assignment

  • Ngai Meng Kou
  • Leong Hou U.
  • Nikos Mamoulis
  • Zhiguo Gong

Distributed Online Tracking

  • Mingwang Tang
  • Feifei Li
  • Yufei Tao

TUTORIAL SESSION: Tutorial 4

Knowledge Curation and Knowledge Fusion: Challenges, Models and Applications

  • Xin Luna Dong
  • Divesh Srivastava

SESSION: Undergraduate Abstracts

Smooth Task Migration in Apache Storm

  • Mansheng Yang
  • Richard T.B. Ma

JAFAR: Near-Data Processing for Databases

  • Oreoluwatomiwa O. Babarinsa
  • Stratos Idreos

Job Scheduling with Minimizing Data Communication Costs

  • Trevor Clinkenbeard
  • Anisoara Nica

One Loop Does Not Fit All

  • Styliani Pantela
  • Stratos Idreos

DunceCap: Compiling Worst-Case Optimal Query Plans

  • Adam Perelman
  • Christopher Ré

DunceCap: Query Plans Using Generalized Hypertree Decompositions

  • Susan Tu
  • Christopher Ré

comments powered by Disqus