Article Source
- Title: SIGMOD ‘15- Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data
- Authors: Reynold Xin
SIGMOD ‘15- Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data
Full Citation in the ACM Digital Library
SESSION: Keynote 1
From Data to Insights @ Bare Metal Speed
- Jignesh M. Patel
SESSION: Research Session 1 - Cloud: Parallel Execution
Distributed Outlier Detection using Compressive Sensing
- Ying Yan
- Jiaxing Zhang
- Bojun Huang
- Xuzhan Sun
- Jiaqi Mu
- Zheng Zhang
- Thomas Moscibroda
Locality-aware Partitioning in Parallel Database Systems
- Erfan Zamanian
- Carsten Binnig
- Abdallah Salama
ByteSlice: Pushing the Envelop of Main Memory Data Processing with a New Storage Layout
- Ziqiang Feng
- Eric Lo
- Ben Kao
- Wenjian Xu
Implicit Parallelism through Deep Language Embedding
- Alexander Alexandrov
- Andreas Kunft
- Asterios Katsifodimos
- Felix Schüler
- Lauritz Thamsen
- Odej Kao
- Tobias Herb
- Volker Markl
From Theory to Practice: Efficient Join Query Evaluation in a Parallel Database System
- Shumo Chu
- Magdalena Balazinska
- Dan Suciu
SESSION: Research Session 2 - Matrix and Array Computations
sPCA: Scalable Principal Component Analysis for Big Data on Distributed Platforms
- Tarek Elgamal
- Maysam Yabandeh
- Ashraf Aboulnaga
- Waleed Mustafa
- Mohamed Hefeeda
Exploiting Matrix Dependency for Efficient Distributed Matrix Computation
- Lele Yu
- Yingxia Shao
- Bin Cui
LEMP: Fast Retrieval of Large Entries in a Matrix Product
- Christina Teflioudi
- Rainer Gemulla
- Olga Mykytiuk
Skew-Aware Join Optimization for Array Databases
- Jennie Duggan
- Olga Papaemmanouil
- Leilani Battle
- Michael Stonebraker
Resource Elasticity for Large-Scale Machine Learning
- Botong Huang
- Matthias Boehm
- Yuanyuan Tian
- Berthold Reinwald
- Shirish Tatikonda
- Frederick R. Reiss
SESSION: Research Session 3 - Security and Access Control
SEMROD: Secure and Efficient MapReduce Over HybriD Clouds
- Kerim Yasin Oktay
- Sharad Mehrotra
- Vaibhav Khadilkar
- Murat Kantarcioglu
Authenticated Online Data Integration Services
- Qian Chen
- Haibo Hu
- Jianliang Xu
ENKI: Access Control for Encrypted Query Processing
- Isabelle Hang
- Florian Kerschbaum
- Ernesto Damiani
Collaborative Access Control in WebdamLog
- Vera Zaychik Moffitt
- Julia Stoyanovich
- Serge Abiteboul
- Gerome Miklau
Automatic Enforcement of Data Use Policies with DataLawyer
- Prasang Upadhyaya
- Magdalena Balazinska
- Dan Suciu
SESSION: Industry Session 1 - Streaming/Real-Time/Active
TencentRec: Real-time Stream Recommendation in Practice
- Yanxiang Huang
- Bin Cui
- Wenyu Zhang
- Jie Jiang
- Ying Xu
Twitter Heron: Stream Processing at Scale
- Sanjeev Kulkarni
- Nikunj Bhagat
- Masong Fu
- Vikas Kedigehalli
- Christopher Kellogg
- Sailesh Mittal
- Jignesh M. Patel
- Karthik Ramasamy
- Siddarth Taneja
Analytics in Motion: High Performance Event-Processing AND Real-Time Analytics in the Same Database
- Lucas Braun
- Thomas Etter
- Georgios Gasparis
- Martin Kaufmann
- Donald Kossmann
- Daniel Widmer
- Aharon Avitzur
- Anthony Iliopoulos
- Eliezer Levy
- Ning Liang
Why Big Data Industrial Systems Need Rules and What We Can Do About It
- Paul Suganthan G.C.
- Chong Sun
- Krishna Gayatri K.
- Haojun Zhang
- Frank Yang
- Narasimhan Rampalli
- Shishir Prasad
- Esteban Arcaute
- Ganesh Krishnan
- Rohit Deep
- Vijay Raghavendra
- AnHai Doan
TUTORIAL SESSION: Tutorial 1
Overview of Data Exploration Techniques
- Stratos Idreos
- Olga Papaemmanouil
- Surajit Chaudhuri
PANEL SESSION: Panel
Machine Learning and Databases: The Sound of Things to Come or a Cacophony of Hype?
- Christopher Ré
- Divy Agrawal
- Magdalena Balazinska
- Michael Cafarella
- Michael Jordan
- Tim Kraska
- Raghu Ramakrishnan
SESSION: Research Session 4 - Cloud: Fault Tolerance, Reconfiguration
Cost-based Fault-tolerance for Parallel Data Processing
- Abdallah Salama
- Carsten Binnig
- Tim Kraska
- Erfan Zamanian
Squall: Fine-Grained Live Reconfiguration for Partitioned Main Memory Databases
- Aaron J. Elmore
- Vaibhav Arora
- Rebecca Taft
- Andrew Pavlo
- Divyakant Agrawal
- Amr El Abbadi
Madeus: Database Live Migration Middleware under Heavy Workloads for Cloud Environment
- Takeshi Mishima
- Yasuhiro Fujiwara
Lineage-driven Fault Injection
- Peter Alvaro
- Joshua Rosen
- Joseph M. Hellerstein
SESSION: Research Session 5 - Keyword Search and Text
Diversity-Aware Top-k Publish/Subscribe for Text Stream
- Lisi Chen
- Gao Cong
Diverse and Proportional Size-l Object Summaries for Keyword Search
- Georgios Fakas
- Zhi Cai
- Nikos Mamoulis
Local Filtering: Improving the Performance of Approximate Queries on String Collections
- Xiaochun Yang
- Yaoshu Wang
- Bin Wang
- Wei Wang
Exact Top-k Nearest Keyword Search in Large Networks
- Minhao Jiang
- Ada Wai-Chee Fu
- Raymond Chi-Wing Wong
Efficient Algorithms for Answering the m-Closest Keywords Query
- Tao Guo
- Xin Cao
- Gao Cong
SESSION: Research Session 6 - Graph Primitives
Minimum Spanning Trees in Temporal Graphs
- Silu Huang
- Ada Wai-Chee Fu
- Ruifeng Liu
Efficient Enumeration of Maximal k-Plexes
- Devora Berlowitz
- Sara Cohen
- Benny Kimelfeld
Divide & Conquer: I/O Efficient Depth-First Search
- Zhiwei Zhang
- Jeffrey Xu Yu
- Lu Qin
- Zechao Shang
Index-based Optimal Algorithms for Computing Steiner Components with Maximum Connectivity
- Lijun Chang
- Xuemin Lin
- Lu Qin
- Jeffrey Xu Yu
- Wenjie Zhang
SESSION: Research Session 7 - Data Mining
COMMIT: A Scalable Approach to Mining Communication Motifs from Dynamic Networks
- Saket Gurukar
- Sayan Ranu
- Balaraman Ravindran
LASH: Large-Scale Sequence Mining with Hierarchies
- Kaustubh Beedkar
- Rainer Gemulla
Twister Tries: Approximate Hierarchical Agglomerative Clustering for Average Distance in Linear Time
- Michael Cochez
- Hao Mou
DBSCAN Revisited: Mis-Claim, Un-Fixability, and Approximation
- Junhao Gan
- Yufei Tao
The TagAdvisor: Luring the Lurkers to Review Web Items
- Azade Nazi
- Mahashweta Das
- Gautam Das
SESSION: Research Session 8 - Uncertainty and Linking
Supporting Data Uncertainty in Array Databases
- Liping Peng
- Yanlei Diao
Identifying the Extent of Completeness of Query Answers over Partially Complete Databases
- Simon Razniewski
- Flip Korn
- Werner Nutt
- Divesh Srivastava
k-Hit Query: Top-k Query with Probabilistic Utility Function
- Peng Peng
- Raymong Chi-Wing Wong
Linking Temporal Records for Profiling Entities
- Furong Li
- Mong Li Lee
- Wynne Hsu
- Wang-Chiew Tan
SESSION: Industry Session 2 - Applications
Telco Churn Prediction with Big Data
- Yiqing Huang
- Fangzhou Zhu
- Mingxuan Yuan
- Ke Deng
- Yanhua Li
- Bing Ni
- Wenyuan Dai
- Qiang Yang
- Jia Zeng
The LDBC Social Network Benchmark: Interactive Workload
- Orri Erling
- Alex Averbuch
- Josep Larriba-Pey
- Hassan Chafi
- Andrey Gubichev
- Arnau Prat
- Minh-Duc Pham
- Peter Boncz
Rethinking Data-Intensive Science Using Scalable Analytics Systems
- Frank Austin Nothaft
- Matt Massie
- Timothy Danford
- Zhao Zhang
- Uri Laserson
- Carl Yeksigian
- Jey Kottalam
- Arun Ahuja
- Jeff Hammerbacher
- Michael Linderman
- Michael J. Franklin
- Anthony D. Joseph
- David A. Patterson
QMapper for Smart Grid: Migrating SQL-based Application to Hive
- Yue Wang
- Yingzhong Xu
- Yue Liu
- Jian Chen
- Songlin Hu
SESSION: ACM-W Athena Lecturer Award
Three Favorite Results
- Jennifer Widom
SESSION: Keynote 2
The Power Behind the Throne: Information Integration in the Age of Data-Driven Discovery
- Laura M. Haas
SESSION: Research Session 9 - Transactional Architectures
On the Design and Scalability of Distributed Shared-Data Databases
- Simon Loesing
- Markus Pilman
- Thomas Etter
- Donald Kossmann
Fast Serializable Multi-Version Concurrency Control for Main-Memory Database Systems
- Thomas Neumann
- Tobias Mühlbauer
- Alfons Kemper
FOEDUS: OLTP Engine for a Thousand Cores and NVRAM
- Hideaki Kimura
Let’s Talk About Storage & Recovery Methods for Non-Volatile Memory Database Systems
- Joy Arulraj
- Andrew Pavlo
- Subramanya R. Dulloor
SESSION: Research Session 10 - Privacy
Private Release of Graph Statistics using Ladder Functions
- Jun Zhang
- Graham Cormode
- Cecilia M. Procopiuc
- Divesh Srivastava
- Xiaokui Xiao
Bayesian Differential Privacy on Correlated Data
- Bin Yang
- Issei Sato
- Hiroshi Nakagawa
Modular Order-Preserving Encryption, Revisited
- Charalampos Mavroforakis
- Nathan Chenette
- Adam O’Neill
- George Kollios
- Ran Canetti
Chiaroscuro: Transparency and Privacy for Massive Personal Time-Series Clustering
- Tristan Allard
- Georges Hébrail
- Florent Masseglia
- Esther Pacitti
SESSION: Research Session 11 - Streams
Persistent Data Sketching
- Zhewei Wei
- Ge Luo
- Ke Yi
- Xiaoyong Du
- Ji-Rong Wen
Scalable Distributed Stream Join Processing
- Qian Lin
- Beng Chin Ooi
- Zhengkui Wang
- Cui Yu
SCREEN: Stream Data Cleaning under Speed Constraints
- Shaoxu Song
- Aoqian Zhang
- Jianmin Wang
- Philip S. Yu
Location-Aware Pub/Sub System: When Continuous Moving Queries Meet Dynamic Event Streams
- Long Guo
- Dongxiang Zhang
- Guoliang Li
- Kian-Lee Tan
- Zhifeng Bao
DEMONSTRATION SESSION: Demo A
CE-Storm: Confidential Elastic Processing of Data Streams
- Nick R. Katsipoulakis
- Cory Thoma
- Eric A. Gratta
- Alexandros Labrinidis
- Adam J. Lee
- Panos K. Chrysanthis
A SQL Debugger Built from Spare Parts: Turning a SQL: 1999 Database System into Its Own Debugger
- Benjamin Dietrich
- Torsten Grust
Exploratory Keyword Search with Interactive Input
- Zhifeng Bao
- Yong Zeng
- H.V. Jagadish
- Tok Wang Ling
QE3D: Interactive Visualization and Exploration of Complex, Distributed Query Plans
- Daniel Scheibli
- Christian Dinse
- Alexander Boehm
DataXFormer: An Interactive Data Transformation Tool
- John Morcos
- Ziawasch Abedjan
- Ihab Francis Ilyas
- Mourad Ouzzani
- Paolo Papotti
- Michael Stonebraker
Quality-Driven Continuous Query Execution over Out-of-Order Data Streams
- Yuanzhen Ji
- Hongjin Zhou
- Zbigniew Jerzak
- Anisoara Nica
- Gregor Hackenbroich
- Christof Fetzer
MoDisSENSE: A Distributed Spatio-Temporal and Textual Processing Platform for Social Networking Services
- Ioannis Mytilinis
- Ioannis Giannakopoulos
- Ioannis Konstantinou
- Katerina Doka
- Dimitrios Tsitsigkos
- Manolis Terrovitis
- Lampros Giampouras
- Nectarios Koziris
DocRicher: An Automatic Annotation System for Text Documents Using Social Media
- Qiang Hu
- Qi Liu
- Xiaoli Wang
- Anthony K.H. Tung
- Shubham Goyal
- Jisong Yang
A Demonstration of Rubato DB: A Highly Scalable NewSQL Database System for OLTP and Big Data Applications
- Li-Yan Yuan
- Lengdong Wu
- Jia-Huai You
- Yan Chi
G-OLA: Generalized On-Line Aggregation for Interactive Analysis on Big Data
- Kai Zeng
- Sameer Agarwal
- Ankur Dave
- Michael Armbrust
- Ion Stoica
TUTORIAL SESSION: Tutorial 2
Mining and Forecasting of Big Time-series Data
- Yasushi Sakurai
- Yasuko Matsubara
- Christos Faloutsos
SESSION: Research Session 12 - Spatial data
Optimal Spatial Dominance: An Effective Search of Nearest Neighbor Candidates
- Xiaoyang Wang
- Ying Zhang
- Wenjie Zhang
- Xuemin Lin
- Muhammad Aamir Cheema
THERMAL-JOIN: A Scalable Spatial Join for Dynamic Workloads
- Farhan Tauheed
- Thomas Heinis
- Anastasia Ailamaki
Indexing Metric Uncertain Data for Range Queries
- Lu Chen
- Yunjun Gao
- Xinhan Li
- Christian S. Jensen
- Gang Chen
- Baihua Zheng
Efficient Route Planning on Public Transportation Networks: A Labelling Approach
- Sibo Wang
- Wenqing Lin
- Yi Yang
- Xiaokui Xiao
- Shuigeng Zhou
SESSION: Research Session 13- Crowdsourcing
The Importance of Being Expert: Efficient Max-Finding in Crowdsourcing
- Aris Anagnostopoulos
- Luca Becchetti
- Adriano Fazzone
- Ida Mele
- Matteo Riondato
Minimizing Efforts in Validating Crowd Answers
- Nguyen Quoc Viet Hung
- Duong Chi Thang
- Matthias Weidlich
- Karl Aberer
iCrowd: An Adaptive Crowdsourcing Framework
- Ju Fan
- Guoliang Li
- Beng Chin Ooi
- Kian-lee Tan
- Jianhua Feng
QASCA: A Quality-Aware Task Assignment System for Crowdsourcing Applications
- Yudian Zheng
- Jiannan Wang
- Guoliang Li
- Reynold Cheng
- Jianhua Feng
tDP: An Optimal-Latency Budget Allocation Strategy for Crowdsourced MAXIMUM Operations
- Vasilis Verroios
- Peter Lofgren
- Hector Garcia-Molina
DEMONSTRATION SESSION: Demo B
Thrifty: Offering Parallel Database as a Service using the Shared-Process Approach
- Petrie Wong
- Zhian He
- Ziqiang Feng
- Wenjian Xu
- Eric Lo
BenchPress: Dynamic Workload Control in the OLTP-Bench Testbed
- Dana Van Aken
- Djellel E. Difallah
- Andrew Pavlo
- Carlo Curino
- Philippe Cudré-Mauroux
Demonstrating “Data Near Here”: Scientific Data Search
- V.M. Megler
- David Maier
Slider: An Efficient Incremental Reasoner
- Jules Chevalier
- Julien Subercaze
- Christophe Gravier
- Frédérique Laforest
WANalytics: Geo-Distributed Analytics for a Data Intensive World
- Ashish Vulimiri
- Carlo Curino
- Philip Brighten Godfrey
- Thomas Jungblut
- Konstantinos Karanasos
- Jitendra Padhye
- George Varghese
FTT: A System for Finding and Tracking Tourists in Public Transport Services
- Huayu Wu
- Jo-Anne Tan
- Wee Siong Ng
- Mingqiang Xue
- Wei Chen
SharkDB: An In-Memory Storage System for Massive Trajectory Data
- Haozhou Wang
- Kai Zheng
- Xiaofang Zhou
- Shazia Sadiq
Ringo: Interactive Graph Analytics on Big-Memory Machines
- Yonathan Perez
- Rok Sosič
- Arijit Banerjee
- Rohan Puttagunta
- Martin Raison
- Pararth Shah
- Jure Leskovec
STORM: Spatio-Temporal Online Reasoning and Management of Large Spatio-Temporal Data
- Robert Christensen
- Lu Wang
- Feifei Li
- Ke Yi
- Jun Tang
- Natalee Villa
PAXQuery: Parallel Analytical XML Processing
- Jesús Camacho-Rodríguez
- Dario Colazzo
- Ioana Manolescu
- Juan A.M. Naranjo
SESSION: Research Session 14 - Indexing & Performance
Cache-Efficient Aggregation: Hashing Is Sorting
- Ingo Müller
- Peter Sanders
- Arnaud Lacurie
- Wolfgang Lehner
- Franz Färber
Efficient Similarity Join and Search on Multi-Attribute Data
- Guoliang Li
- Jian He
- Dong Deng
- Jian Li
Holistic Indexing in Main-memory Column-stores
- Eleni Petraki
- Stratos Idreos
- Stefan Manegold
CliffGuard: A Principled Framework for Finding Robust Database Designs
- Barzan Mozafari
- Eugene Zhen Ye Goh
- Dong Young Yoon
Exploiting Correlations for Expensive Predicate Evaluation
- Manas Joglekar
- Hector Garcia-Molina
- Aditya Parameswaran
- Christopher Re
SESSION: Research Session 15 - Data Cleaning
Query-Oriented Data Cleaning with Oracles
- Moria Bergman
- Tova Milo
- Slava Novgorodov
- Wang-Chiew Tan
BigDansing: A System for Big Data Cleansing
- Zuhair Khayyat
- Ihab F. Ilyas
- Alekh Jindal
- Samuel Madden
- Mourad Ouzzani
- Paolo Papotti
- Jorge-Arnulfo Quiané-Ruiz
- Nan Tang
- Si Yin
Data X-Ray: A Diagnostic Tool for Data Errors
- Xiaolan Wang
- Xin Luna Dong
- Alexandra Meliou
KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing
- Xu Chu
- John Morcos
- Ihab F. Ilyas
- Mourad Ouzzani
- Paolo Papotti
- Nan Tang
- Yin Ye
Crowd-Based Deduplication: An Adaptive Approach
- Sibo Wang
- Xiaokui Xiao
- Chun-Hee Lee
SESSION: Research Session 16- Transactions
Minimizing Commit Latency of Transactions in Geo-Replicated Data Stores
- Faisal Nawab
- Vaibhav Arora
- Divyakant Agrawal
- Amr El Abbadi
Optimizing Optimistic Concurrency Control for Tree-Structured, Log-Structured Databases
- Philip A. Bernstein
- Sudipto Das
- Bailu Ding
- Markus Pilman
The Homeostasis Protocol: Avoiding Transaction Coordination Through Program Analysis
- Sudip Roy
- Lucja Kot
- Gabriel Bender
- Bailu Ding
- Hossein Hojjat
- Christoph Koch
- Nate Foster
- Johannes Gehrke
Feral Concurrency Control: An Empirical Investigation of Modern Application Integrity
- Peter Bailis
- Alan Fekete
- Michael J. Franklin
- Ali Ghodsi
- Joseph M. Hellerstein
- Ion Stoica
SESSION: Industry Session 3 - Novel Systems
REEF: Retainable Evaluator Execution Framework
- Markus Weimer
- Yingda Chen
- Byung-Gon Chun
- Tyson Condie
- Carlo Curino
- Chris Douglas
- Yunseong Lee
- Tony Majestro
- Dahlia Malkhi
- Sergiy Matusevych
- Brandon Myers
- Shravan Narayanamurthy
- Raghu Ramakrishnan
- Sriram Rao
- Russel Sears
- Beysim Sezgin
- Julia Wang
Apache Tez: A Unifying Framework for Modeling and Building Data Processing Applications
- Bikas Saha
- Hitesh Shah
- Siddharth Seth
- Gopal Vijayaraghavan
- Arun Murthy
- Carlo Curino
Design and Implementation of the LogicBlox System
- Molham Aref
- Balder ten Cate
- Todd J. Green
- Benny Kimelfeld
- Dan Olteanu
- Emir Pasalic
- Todd L. Veldhuizen
- Geoffrey Washburn
Spark SQL: Relational Data Processing in Spark
- Michael Armbrust
- Reynold S. Xin
- Cheng Lian
- Yin Huai
- Davies Liu
- Joseph K. Bradley
- Xiangrui Meng
- Tomer Kaftan
- Michael J. Franklin
- Ali Ghodsi
- Matei Zaharia
DEMONSTRATION SESSION: Demo C
Graft: A Debugging Tool For Apache Giraph
- Semih Salihoglu
- Jaeho Shin
- Vikesh Khanna
- Ba Quan Truong
- Jennifer Widom
Even Metadata is Getting Big: Annotation Summarization using InsightNotes
- Dongqing Xiao
- Armir Bashllari
- Tyler Menard
- Mohamed Eltabakh
StoryPivot: Comparing and Contrasting Story Evolution
- Anja Gruenheid
- Donald Kossmann
- Theodoros Rekatsinas
- Divesh Srivastava
The Flatter, the Better: Query Compilation Based on the Flattening Transformation
- Alexander Ulrich
- Torsten Grust
D2WORM: A Management Infrastructure for Distributed Data-centric Workflows
- Martin Jergler
- Mohammad Sadoghi
- Hans-Arno Jacobsen
NL~2~CM: A Natural Language Interface to Crowd Mining
- Yael Amsterdamer
- Anna Kukliansky
- Tova Milo
Optimistic Recovery for Iterative Dataflows in Action
- Sergey Dudoladov
- Chen Xu
- Sebastian Schelter
- Asterios Katsifodimos
- Stephan Ewen
- Kostas Tzoumas
- Volker Markl
A Secure Search Engine for the Personal Cloud
- Saliha Lallali
- Nicolas Anciaux
- Iulian Sandu Popa
- Philippe Pucheral
IReS: Intelligent, Multi-Engine Resource Scheduler for Big Data Analytics Workflows
- Katerina Doka
- Nikolaos Papailiou
- Dimitrios Tsoumakos
- Christos Mantas
- Nectarios Koziris
Just can’t get enough: Synthesizing Big Data
- Tilmann Rabl
- Manuel Danisch
- Michael Frank
- Sebastian Schindler
- Hans-Arno Jacobsen
SESSION: Research Session 17 - Hardware-Aware Query Processing
Rack-Scale In-Memory Join Processing using RDMA
- Claude Barthels
- Simon Loesing
- Gustavo Alonso
- Donald Kossmann
Self-Tuning, GPU-Accelerated Kernel Density Models for Multidimensional Selectivity Estimation
- Max Heimel
- Martin Kiefer
- Volker Markl
Rethinking SIMD Vectorization for In-Memory Databases
- Orestis Polychroniou
- Arun Raghavan
- Kenneth A. Ross
A Padded Encoding Scheme to Accelerate Scans by Leveraging Skew
- Yinan Li
- Craig Chasseur
- Jignesh M. Patel
SESSION: Research Session 18 - Graph Propagation, Influence, Mining
GetReal: Towards Realistic Selection of Influence Maximization Strategies in Competitive Networks
- Hui Li
- Sourav S. Bhowmick
- Jiangtao Cui
- Yunjun Gao
- Jianfeng Ma
Influence Maximization in Near-Linear Time: A Martingale Approach
- Youze Tang
- Yanchen Shi
- Xiaokui Xiao
Community Level Diffusion Extraction
- Zhiting Hu
- Junjie Yao
- Bin Cui
- Eric Xing
BEAR: Block Elimination Approach for Random Walk with Restart on Large Graphs
- Kijung Shin
- Jinhong Jung
- Sael Lee
- U. Kang
The Minimum Wiener Connector Problem
- Natali Ruchansky
- Francesco Bonchi
- David García-Soriano
- Francesco Gullo
- Nicolas Kourtellis
SESSION: Research Session 19 - Social Networks
From Group Recommendations to Group Formation
- Senjuti Basu Roy
- Laks V.S. Lakshmanan
- Rui Liu
Real-Time Multi-Criteria Social Graph Partitioning: A Game Theoretic Approach
- Nikos Armenatzoglou
- Huy Pham
- Vasilis Ntranos
- Dimitris Papadias
- Cyrus Shahabi
Utility-Aware Social Event-Participant Planning
- Jieying She
- Yongxin Tong
- Lei Chen
Online Video Recommendation in Sharing Community
- Xiangmin Zhou
- Lei Chen
- Yanchun Zhang
- Longbing Cao
- Guangyan Huang
- Chen Wang
SESSION: Industry Session 4 - Performance
Large-scale Predictive Analytics in Vertica: Fast Data Transfer, Distributed Model Creation, and In-database Prediction
- Shreya Prasad
- Arash Fard
- Vishrut Gupta
- Jorge Martinez
- Jeff LeFevre
- Vincent Xu
- Meichun Hsu
- Indrajit Roy
Oracle Workload Intelligence
- Quoc Trung Tran
- Konstantinos Morfonios
- Neoklis Polyzotis
Purity: Building Fast, Highly-Available Enterprise Flash Storage from Commodity Components
- John Colgrove
- John D. Davis
- John Hayes
- Ethan L. Miller
- Cary Sandvig
- Russell Sears
- Ari Tamches
- Neil Vachharajani
- Feng Wang
On Improving User Response Times in Tableau
- Pawel Terlecki
- Fei Xu
- Marianne Shaw
- Valeri Kim
- Richard Wesley
TUTORIAL SESSION: Tutorial 3
Data Management in Non-Volatile Memory
- Stratis D. Viglas
SESSION: Research Session 20 - Information Extraction and Record Linking
TEGRA: Table Extraction by Global Record Alignment
- Xu Chu
- Yeye He
- Kaushik Chakrabarti
- Kris Ganjam
Mining Quality Phrases from Massive Text Corpora
- Jialu Liu
- Jingbo Shang
- Chi Wang
- Xiang Ren
- Jiawei Han
Mining Subjective Properties on the Web
- Immanuel Trummer
- Alon Halevy
- Hongrae Lee
- Sunita Sarawagi
- Rahul Gupta
Microblog Entity Linking with Social Temporal Context
- Wen Hua
- Kai Zheng
- Xiaofang Zhou
SESSION: Research Session 21 - RDF and SPARQL
Graph-Aware, Workload-Adaptive SPARQL Query Caching
- Nikolaos Papailiou
- Dimitrios Tsoumakos
- Panagiotis Karras
- Nectarios Koziris
Left Bit Right: For SPARQL Join Queries with OPTIONAL Patterns (Left-outer-joins)
- Medha Atre
How to Build Templates for RDF Question/Answering: An Uncertain Graph Similarity Join Approach
- Weiguo Zheng
- Lei Zou
- Xiang Lian
- Jeffrey Xu Yu
- Shaoxu Song
- Dongyan Zhao
RBench: Application-Specific RDF Benchmarking
- Shi Qiao
- Z. Meral Özsoyoğlu
ALEX: Automatic Link Exploration in Linked Data
- Ahmed El-Roby
- Ashraf Aboulnaga
SESSION: Research Session 22 - Time Series & Graph Processing
k-Shape: Efficient and Accurate Clustering of Time Series
- John Paparrizos
- Luis Gravano
SMiLer: A Semi-Lazy Time Series Prediction System for Sensors
- Jingbo Zhou
- Anthony K.H. Tung
SQLGraph: An Efficient Relational-Based Property Graph Store
- Wen Sun
- Achille Fokoue
- Kavitha Srinivas
- Anastasios Kementsietsidis
- Gang Hu
- Guotong Xie
Updating Graph Indices with a One-Pass Algorithm
- Dayu Yuan
- Prasenjit Mitra
- Huiwen Yu
- C. Lee Giles
SESSION: Industry Session 5 - Usability
Amazon Redshift and the Case for Simpler Data Warehouses
- Anurag Gupta
- Deepak Agarwal
- Derek Tan
- Jakub Kulesza
- Rahul Pathak
- Stefano Stefani
- Vidhya Srinivasan
ShareInsights: An Unified Approach to Full-stack Data Processing
- Mukund Deshpande
- Dhruva Ray
- Sameer Dixit
- Avadhoot Agasti
SESSION: Research Session 23 - Advanced Query Processing
An Incremental Anytime Algorithm for Multi-Objective Query Optimization
- Immanuel Trummer
- Christoph Koch
Output-sensitive Evaluation of Prioritized Skyline Queries
- Niccolo’ Meneghetti
- Denis Mindolin
- Paolo Ciaccia
- Jan Chomicki
Learning Generalized Linear Models Over Normalized Data
- Arun Kumar
- Jeffrey Naughton
- Jignesh M. Patel
Utilizing IDs to Accelerate Incremental View Maintenance
- Yannis Katsis
- Kian Win Ong
- Yannis Papakonstantinou
- Kevin Keliang Zhao
SESSION: Research Session 24 - New Models
S4: Top-k Spreadsheet-Style Search for Query Discovery
- Fotis Psallidas
- Bolin Ding
- Kaushik Chakrabarti
- Surajit Chaudhuri
Proactive Annotation Management in Relational Databases
- Karim Ibrahim
- Xiao Du
- Mohamed Eltabakh
Weighted Coverage based Reviewer Assignment
- Ngai Meng Kou
- Leong Hou U.
- Nikos Mamoulis
- Zhiguo Gong
Distributed Online Tracking
- Mingwang Tang
- Feifei Li
- Yufei Tao
TUTORIAL SESSION: Tutorial 4
Knowledge Curation and Knowledge Fusion: Challenges, Models and Applications
- Xin Luna Dong
- Divesh Srivastava
SESSION: Undergraduate Abstracts
Smooth Task Migration in Apache Storm
- Mansheng Yang
- Richard T.B. Ma
JAFAR: Near-Data Processing for Databases
- Oreoluwatomiwa O. Babarinsa
- Stratos Idreos
Job Scheduling with Minimizing Data Communication Costs
- Trevor Clinkenbeard
- Anisoara Nica
One Loop Does Not Fit All
- Styliani Pantela
- Stratos Idreos
DunceCap: Compiling Worst-Case Optimal Query Plans
- Adam Perelman
- Christopher Ré
DunceCap: Query Plans Using Generalized Hypertree Decompositions
- Susan Tu
- Christopher Ré