## Article Source

- Title: Volume 48: Proceedings of The 33rd International Conference on Machine Learning
- Editors: Maria Florina Balcan, Kilian Q. Weinberger

# Proceedings of The 33rd International Conference on Machine Learning

**Editors: Maria Florina Balcan, Kilian Q. Weinberger**

## Accepted Papers

No Oops, You Won’t Do It Again: Mechanisms for Self-correction in Crowdsourcing

Nihar Shah, Dengyong Zhou

[abs] [pdf] [supplementary]

Stochastically Transitive Models for Pairwise Comparisons: Statistical and Computational Issues

Nihar Shah, Sivaraman Balakrishnan, Aditya Guntuboyina, Martin Wainwright

[abs] [pdf] [supplementary]

Uprooting and Rerooting Graphical Models

Adrian Weller

[abs] [pdf] [supplementary]

A Deep Learning Approach to Unsupervised Ensemble Learning

Uri Shaham, Xiuyuan Cheng, Omer Dror, Ariel Jaffe, Boaz Nadler, Joseph Chang, Yuval Kluger

[abs] [pdf] [supplementary]

Revisiting Semi-Supervised Learning with Graph Embeddings

Zhilin Yang, William Cohen, Ruslan Salakhudinov

Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization

Chelsea Finn, Sergey Levine, Pieter Abbeel

[abs] [pdf] [supplementary]

Diversity-Promoting Bayesian Learning of Latent Variable Models

Pengtao Xie, Jun Zhu, Eric Xing

[abs] [pdf] [supplementary]

Additive Approximations in High Dimensional Nonparametric Regression via the SALSA

Kirthevasan Kandasamy, Yaoliang Yu

[abs] [pdf] [supplementary]

Hawkes Processes with Stochastic Excitations

Young Lee, Kar Wai Lim, Cheng Soon Ong

[abs] [pdf] [supplementary]

Data-driven Rank Breaking for Efficient Rank Aggregation

Ashish Khetan, Sewoong Oh

[abs] [pdf] [supplementary]

Dropout distillation

Samuel Rota Bulò, Lorenzo Porzi, Peter Kontschieder

[abs] [pdf] [supplementary]

Metadata-conscious anonymous messaging

Giulia Fanti, Peter Kairouz, Sewoong Oh, Kannan Ramchandran, Pramod Viswanath

[abs] [pdf] [supplementary]

The Teaching Dimension of Linear Learners

Ji Liu, Xiaojin Zhu, Hrag Ohannessian

[abs] [pdf] [supplementary]

Truthful Univariate Estimators

Ioannis Caragiannis, Ariel Procaccia, Nisarg Shah

[abs] [pdf] [supplementary]

Why Regularized Auto-Encoders learn Sparse Representation?

Devansh Arpit, Yingbo Zhou, Hung Ngo, Venu Govindaraju

[abs] [pdf] [supplementary]

k-variates++: more pluses in the k-means++

Richard Nock, Raphael Canyasse, Roksana Boreli, Frank Nielsen

[abs] [pdf] [supplementary]

Multi-Player Bandits – a Musical Chairs Approach

Jonathan Rosenski, Ohad Shamir, Liran Szlak

[abs] [pdf] [supplementary]

The Information Sieve

Greg Ver Steeg, Aram Galstyan

[abs] [pdf] [supplementary]

Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin

Dario Amodei, Rishita Anubhai, Eric Battenberg, Carl Case, Jared Casper, Bryan Catanzaro, JingDong Chen, Mike Chrzanowski, Adam Coates, Greg Diamos, Erich Elsen, Jesse Engel, Linxi Fan, Christopher Fougner, Awni Hannun, Billy Jun, Tony Han, Patrick LeGresley, Xiangang Li, Libby Lin, Sharan Narang, Andrew Ng, Sherjil Ozair, Ryan Prenger, Sheng Qian, Jonathan Raiman, Sanjeev Satheesh, David Seetapun, Shubho Sengupta, Chong Wang, Yi Wang, Zhiqian Wang, Bo Xiao, Yan Xie, Dani Yogatama, Jun Zhan, Zhenyao Zhu

On the Consistency of Feature Selection With Lasso for Non-linear Targets

Yue Zhang, Weihong Guo, Soumya Ray

Minimum Regret Search for Single- and Multi-Task Optimization

Jan Hendrik Metzen

[abs] [pdf] [supplementary]

CryptoNets: Applying Neural Networks to Encrypted Data with High Throughput and Accuracy

Ran Gilad-Bachrach, Nathan Dowlin, Kim Laine, Kristin Lauter, Michael Naehrig, John Wernsing

The Variational Nystrom method for large-scale spectral problems

Max Vladymyrov, Miguel Carreira-Perpinan

[abs] [pdf] [supplementary]

Multi-Bias Non-linear Activation in Deep Neural Networks

Hongyang Li, Wanli Ouyang, Xiaogang Wang

Asymmetric Multi-task Learning Based on Task Relatedness and Loss

Giwoong Lee, Eunho Yang, Sung ju Hwang

Accurate Robust and Efficient Error Estimation for Decision Trees

Lixin Fan

Fast Stochastic Algorithms for SVD and PCA: Convergence Properties and Convexity

Ohad Shamir

[abs] [pdf] [supplementary]

Convergence of Stochastic Gradient Descent for PCA

Ohad Shamir

[abs] [pdf] [supplementary]

Dealbreaker: A Nonlinear Latent Variable Model for Educational Data

Andrew Lan, Tom Goldstein, Richard Baraniuk, Christoph Studer

A Kernelized Stein Discrepancy for Goodness-of-fit Tests

Qiang Liu, Jason Lee, Michael Jordan

[abs] [pdf] [supplementary]

Variable Elimination in the Fourier Domain

Yexiang Xue, Stefano Ermon, Ronan Le Bras, Carla, Bart Selman

[abs] [pdf] [supplementary]

Low-Rank Matrix Approximation with Stability

Dongsheng Li, Chao Chen, Qin Lv, Junchi Yan, Li Shang, Stephen Chu

Linking losses for density ratio and class-probability estimation

Aditya Menon, Cheng Soon Ong

[abs] [pdf] [supplementary]

Stochastic Variance Reduction for Nonconvex Optimization

Sashank J. Reddi, Ahmed Hefny, Suvrit Sra, Barnabas Poczos, Alex Smola

[abs] [pdf] [supplementary]

Hierarchical Variational Models

Rajesh Ranganath, Dustin Tran, David Blei

[abs] [pdf] [supplementary]

Hierarchical Span-Based Conditional Random Fields for Labeling and Segmenting Events in Wearable Sensor Data Streams

Roy Adams, Nazir Saleheen, Edison Thomaz, Abhinav Parate, Santosh Kumar, Benjamin Marlin

Binary embeddings with structured hashed projections

Anna Choromanska, Krzysztof Choromanski, Mariusz Bojarski, Tony Jebara, Sanjiv Kumar, Yann LeCun

[abs] [pdf] [supplementary]

A Variational Analysis of Stochastic Gradient Algorithms

Stephan Mandt, Matthew Hoffman, David Blei

[abs] [pdf] [supplementary]

Adaptive Sampling for SGD by Exploiting Side Information

Siddharth Gopal

Learning from Multiway Data: Simple and Efficient Tensor Regression

Rose Yu, Yan Liu

A Distributed Variational Inference Framework for Unifying Parallel Sparse Gaussian Process Regression Models

Trong Nghia Hoang, Quang Minh Hoang, Bryan Kian Hsiang Low

[abs] [pdf] [supplementary]

Online Stochastic Linear Optimization under One-bit Feedback

Lijun Zhang, Tianbao Yang, Rong Jin, Yichi Xiao, Zhi-hua Zhou

[abs] [pdf] [supplementary]

Adaptive Algorithms for Online Convex Optimization with Long-term Constraints

Rodolphe Jenatton, Jim Huang, Cedric Archambeau

Actively Learning Hemimetrics with Applications to Eliciting User Preferences

Adish Singla, Sebastian Tschiatschek, Andreas Krause

Learning Simple Algorithms from Examples

Wojciech Zaremba, Tomas Mikolov, Armand Joulin, Rob Fergus

Learning Physical Intuition of Block Towers by Example

Adam Lerer, Sam Gross, Rob Fergus

Structure Learning of Partitioned Markov Networks

Song Liu, Taiji Suzuki, Masashi Sugiyama, Kenji Fukumizu

[abs] [pdf] [supplementary]

Tracking Slowly Moving Clairvoyant: Optimal Dynamic Regret of Online Learning with True and Noisy Gradient

Tianbao Yang, Lijun Zhang, Rong Jin, Jinfeng Yi

[abs] [pdf] [supplementary]

Beyond CCA: Moment Matching for Multi-View Models

Anastasia Podosinnikova, Francis Bach, Simon Lacoste-Julien

[abs] [pdf] [supplementary]

Fast methods for estimating the Numerical rank of large matrices

Shashanka Ubaru, Yousef Saad

[abs] [pdf] [supplementary]

Unsupervised Deep Embedding for Clustering Analysis

Junyuan Xie, Ross Girshick, Ali Farhadi

Efficient Private Empirical Risk Minimization for High-dimensional Learning

Shiva Prasad Kasiviswanathan, Hongxia Jin

[abs] [pdf] [supplementary]

Parameter Estimation for Generalized Thurstone Choice Models

Milan Vojnovic, Seyoung Yun

[abs] [pdf] [supplementary]

Large-Margin Softmax Loss for Convolutional Neural Networks

Weiyang Liu, Yandong Wen, Zhiding Yu, Meng Yang

A Random Matrix Approach to Echo-State Neural Networks

Romain Couillet, Gilles Wainrib, Hafiz Tiomoko Ali, Harry Sevi

Supervised and Semi-Supervised Text Categorization using LSTM for Region Embeddings

Rie Johnson, Tong Zhang

Optimality of Belief Propagation for Crowdsourced Classification

Jungseul Ok, Sewoong Oh, Jinwoo Shin, Yung Yi

[abs] [pdf] [supplementary]

Stability of Controllers for Gaussian Process Forward Models

Julia Vinogradska, Bastian Bischoff, Duy Nguyen-Tuong, Anne Romer, Henner Schmidt, Jan Peters

Learning privately from multiparty data

Jihun Hamm, Yingjun Cao, Mikhail Belkin

[abs] [pdf] [supplementary]

Network Morphism

Tao Wei, Changhu Wang, Yong Rui, Chang Wen Chen

A Kronecker-factored approximate Fisher matrix for convolution layers

Roger Grosse, James Martens

[abs] [pdf] [supplementary]

Experimental Design on a Budget for Sparse Linear Models and Applications

Sathya Narayanan Ravi, Vamsi Ithapu, Sterling Johnson, Vikas Singh

Minding the Gaps for Block Frank-Wolfe Optimization of Structured SVMs

Anton Osokin, Jean-Baptiste Alayrac, Isabella Lukasewitz, Puneet Dokania, Simon Lacoste-Julien

[abs] [pdf] [supplementary]

Exact Exponent in Optimal Rates for Crowdsourcing

Chao Gao, Yu Lu, Dengyong Zhou

[abs] [pdf] [supplementary]

Augmenting Supervised Neural Networks with Unsupervised Objectives for Large-scale Image Classification

Yuting Zhang, Kibok Lee, Honglak Lee

[abs] [pdf] [supplementary]

Online Low-Rank Subspace Clustering by Basis Dictionary Pursuit

Jie Shen, Ping Li, Huan Xu

[abs] [pdf] [supplementary]

A Self-Correcting Variable-Metric Algorithm for Stochastic Optimization

Frank Curtis

[abs] [pdf] [supplementary]

Stochastic Quasi-Newton Langevin Monte Carlo

Umut Simsekli, Roland Badeau, Taylan Cemgil, Gaël Richard

[abs] [pdf] [supplementary]

Doubly Robust Off-policy Value Evaluation for Reinforcement Learning

Nan Jiang, Lihong Li

[abs] [pdf] [supplementary]

Fast Rate Analysis of Some Stochastic Optimization Algorithms

Chao Qu, Huan Xu, Chong jin Ong

[abs] [pdf] [supplementary]

Fast k-Nearest Neighbour Search via Dynamic Continuous Indexing

Ke Li, Jitendra Malik

[abs] [pdf] [supplementary]

Smooth Imitation Learning for Online Sequence Prediction

Hoang Le, Andrew Kang, Yisong Yue, Peter Carr

[abs] [pdf] [supplementary]

Community Recovery in Graphs with Locality

Yuxin Chen, Govinda Kamath, Changho Suh, David Tse

Variance Reduction for Faster Non-Convex Optimization

Zeyuan Allen-Zhu, Elad Hazan

Loss factorization, weakly supervised learning and label noise robustness

Giorgio Patrini, Frank Nielsen, Richard Nock, Marcello Carioni

[abs] [pdf] [supplementary]

Analysis of Deep Neural Networks with Extended Data Jacobian Matrix

Shengjie Wang, Abdel-rahman Mohamed, Rich Caruana, Jeff Bilmes, Matthai Plilipose, Matthew Richardson, Krzysztof Geras, Gregor Urban, Ozlem Aslan

Doubly Decomposing Nonparametric Tensor Regression

Masaaki Imaizumi, Kohei Hayashi

[abs] [pdf] [supplementary]

Hyperparameter optimization with approximate gradient

Fabian Pedregosa

[abs] [pdf] [supplementary]

SDCA without Duality, Regularization, and Individual Convexity

Shai Shalev-Shwartz

Heteroscedastic Sequences: Beyond Gaussianity

Oren Anava, Shie Mannor

[abs] [pdf] [supplementary]

A Neural Autoregressive Approach to Collaborative Filtering

Yin Zheng, Bangsheng Tang, Wenkui Ding, Hanning Zhou

[abs] [pdf] [supplementary]

On the Quality of the Initial Basin in Overspecified Neural Networks

Itay Safran, Ohad Shamir

[abs] [pdf] [supplementary]

Primal-Dual Rates and Certificates

Celestine Dünner, Simone Forte, Martin Takac, Martin Jaggi

[abs] [pdf] [supplementary]

Minimizing the Maximal Loss: How and Why

Shai Shalev-Shwartz, Yonatan Wexler

[abs] [pdf] [supplementary]

The Information-Theoretic Requirements of Subspace Clustering with Missing Data

Daniel Pimentel-Alarcon, Robert Nowak

Online Learning with Feedback Graphs Without the Graphs

Alon Cohen, Tamir Hazan, Tomer Koren

PAC learning of Probabilistic Automaton based on the Method of Moments

Hadrien Glaude, Olivier Pietquin

[abs] [pdf] [supplementary]

Estimating Structured Vector Autoregressive Models

Igor Melnyk, Arindam Banerjee

[abs] [pdf] [supplementary]

Mixing Rates for the Alternating Gibbs Sampler over Restricted Boltzmann Machines and Friends

Christopher Tosh

[abs] [pdf] [supplementary]

Polynomial Networks and Factorization Machines: New Insights and Efficient Training Algorithms

Mathieu Blondel, Masakazu Ishihata, Akinori Fujino, Naonori Ueda

[abs] [pdf] [supplementary]

A New PAC-Bayesian Perspective on Domain Adaptation

Pascal Germain, Amaury Habrard, François Laviolette, Emilie Morvant

[abs] [pdf] [supplementary]

Correlation Clustering and Biclustering with Locally Bounded Errors

Gregory Puleo, Olgica Milenkovic

PAC Lower Bounds and Efficient Algorithms for The Max \(K\)-Armed Bandit Problem

Yahel David, Nahum Shimkin

[abs] [pdf] [supplementary]

A Comparative Analysis and Study of Multiview CNN Models for Joint Object Categorization and Pose Estimation

Mohamed Elhoseiny, Tarek El-Gaaly, Amr Bakry, Ahmed Elgammal

[abs] [pdf] [supplementary]

BASC: Applying Bayesian Optimization to the Search for Global Minima on Potential Energy Surfaces

Shane Carr, Roman Garnett, Cynthia Lo

On the Iteration Complexity of Oblivious First-Order Optimization Algorithms

Yossi Arjevani, Ohad Shamir

[abs] [pdf] [supplementary]

Stochastic Variance Reduced Optimization for Nonconvex Sparse Learning

Xingguo Li, Tuo Zhao, Raman Arora, Han Liu, Jarvis Haupt

[abs] [pdf] [supplementary]

Analysis of Variational Bayesian Factorizations for Sparse and Low-Rank Estimation

David Wipf

Fast k-means with accurate bounds

James Newling, Francois Fleuret

[abs] [pdf] [supplementary]

Boolean Matrix Factorization and Noisy Completion via Message Passing

Siamak Ravanbakhsh, Barnabas Poczos, Russell Greiner

[abs] [pdf] [supplementary]

Convolutional Rectifier Networks as Generalized Tensor Decompositions

Nadav Cohen, Amnon Shashua

[abs] [pdf] [supplementary]

Low-rank Solutions of Linear Matrix Equations via Procrustes Flow

Stephen Tu, Ross Boczar, Max Simchowitz, Mahdi Soltanolkotabi, Ben Recht

Anytime Exploration for Multi-armed Bandits using Confidence Information

Kwang-Sung Jun, Robert Nowak

[abs] [pdf] [supplementary]

Structured Prediction Energy Networks

David Belanger, Andrew McCallum

[abs] [pdf] [supplementary]

L1-regularized Neural Networks are Improperly Learnable in Polynomial Time

Yuchen Zhang, Jason D. Lee, Michael I. Jordan

Compressive Spectral Clustering

Nicolas Tremblay, Gilles Puy, Remi Gribonval, Pierre Vandergheynst

[abs] [pdf] [supplementary]

Low-rank tensor completion: a Riemannian manifold preconditioning approach

Hiroyuki Kasai, Bamdev Mishra

[abs] [pdf] [supplementary]

Provable Non-convex Phase Retrieval with Outliers: Median TruncatedWirtinger Flow

Huishuai Zhang, Yuejie Chi, Yingbin Liang

[abs] [pdf] [supplementary]

Estimating Maximum Expected Value through Gaussian Approximation

Carlo D’Eramo, Marcello Restelli, Alessandro Nuara

[abs] [pdf] [supplementary]

Representational Similarity Learning with Application to Brain Networks

Urvashi Oswal, Christopher Cox, Matthew Lambon-Ralph, Timothy Rogers, Robert Nowak

[abs] [pdf] [supplementary]

Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning

Yarin Gal, Zoubin Ghahramani

[abs] [pdf] [supplementary]

Generative Adversarial Text to Image Synthesis

Scott Reed, Zeynep Akata, Xinchen Yan, Lajanugen Logeswaran, Bernt Schiele, Honglak Lee

[abs] [pdf] [supplementary]

Dirichlet Process Mixture Model for Correcting Technical Variation in Single-Cell Gene Expression Data

Sandhya Prabhakaran, Elham Azizi, Ambrose Carr, Dana Pe’er

[abs] [pdf] [supplementary]

Improved SVRG for Non-Strongly-Convex or Sum-of-Non-Convex Objectives

Zeyuan Allen-Zhu, Yang Yuan

Sparse Parameter Recovery from Aggregated Data

Avradeep Bhowmik, Joydeep Ghosh, Oluwasanmi Koyejo

[abs] [pdf] [supplementary]

Deep Structured Energy Based Models for Anomaly Detection

Shuangfei Zhai, Yu Cheng, Weining Lu, Zhongfei Zhang

Even Faster Accelerated Coordinate Descent Using Non-Uniform Sampling

Zeyuan Allen-Zhu, Zheng Qu, Peter Richtarik, Yang Yuan

Unitary Evolution Recurrent Neural Networks

Martin Arjovsky, Amar Shah, Yoshua Bengio

Markov Latent Feature Models

Aonan Zhang, John Paisley

The Knowledge Gradient for Sequential Decision Making with Stochastic Binary Feedbacks

Yingfei Wang, Chu Wang, Warren Powell

[abs] [pdf] [supplementary]

A Simple and Provable Algorithm for Sparse Diagonal CCA

Megasthenis Asteris, Anastasios Kyrillidis, Oluwasanmi Koyejo, Russell Poldrack

[abs] [pdf] [supplementary]

Quadratic Optimization with Orthogonality Constraints: Explicit Lojasiewicz Exponent and Linear Convergence of Line-Search Methods

Huikang Liu, Weijie Wu, Anthony Man-Cho So

[abs] [pdf] [supplementary]

Normalization Propagation: A Parametric Technique for Removing Internal Covariate Shift in Deep Networks

Devansh Arpit, Yingbo Zhou, Bhargava Kota, Venu Govindaraju

[abs] [pdf] [supplementary]

Learning to Generate with Memory

Chongxuan Li, Jun Zhu, Bo Zhang

[abs] [pdf] [supplementary]

Learning End-to-end Video Classification with Rank-Pooling

Basura Fernando, Stephen Gould

Learning to Filter with Predictive State Inference Machines

Wen Sun, Arun Venkatraman, Byron Boots, J.Andrew Bagnell

[abs] [pdf] [supplementary]

A Subspace Learning Approach for High Dimensional Matrix Decomposition with Efficient Column/Row Sampling

Mostafa Rahmani, Geroge Atia

DCM Bandits: Learning to Rank with Multiple Clicks

Sumeet Katariya, Branislav Kveton, Csaba Szepesvari, Zheng Wen

[abs] [pdf] [supplementary]

Train faster, generalize better: Stability of stochastic gradient descent

Moritz Hardt, Ben Recht, Yoram Singer

Copeland Dueling Bandit Problem: Regret Lower Bound, Optimal Algorithm, and Computationally Efficient Algorithm

Junpei Komiyama, Junya Honda, Hiroshi Nakagawa

[abs] [pdf] [supplementary]

Contextual Combinatorial Cascading Bandits

Shuai Li, Baoxiang Wang, Shengyu Zhang, Wei Chen

[abs] [pdf] [supplementary]

Conservative Bandits

Yifan Wu, Roshan Shariff, Tor Lattimore, Csaba Szepesvari

[abs] [pdf] [supplementary]

Variance-Reduced and Projection-Free Stochastic Optimization

Elad Hazan, Haipeng Luo

[abs] [pdf] [supplementary]

Factored Temporal Sigmoid Belief Networks for Sequence Learning

Jiaming Song, Zhe Gan, Lawrence Carin

[abs] [pdf] [supplementary]

False Discovery Rate Control and Statistical Quality Assessment of Annotators in Crowdsourced Ranking

QianQian Xu, Jiechao Xiong, Xiaochun Cao, Yuan Yao

[abs] [pdf] [supplementary]

Strongly-Typed Recurrent Neural Networks

David Balduzzi, Muhammad Ghifary

Distributed Clustering of Linear Bandits in Peer to Peer Networks

Nathan Korda, Balazs Szorenyi, Shuai Li

[abs] [pdf] [supplementary]

Collapsed Variational Inference for Sum-Product Networks

Han Zhao, Tameem Adel, Geoff Gordon, Brandon Amos

[abs] [pdf] [supplementary]

On the Analysis of Complex Backup Strategies in Monte Carlo Tree Search

Piyush Khandelwal, Elad Liebman, Scott Niekum, Peter Stone

Benchmarking Deep Reinforcement Learning for Continuous Control

Yan Duan, Xi Chen, Rein Houthooft, John Schulman, Pieter Abbeel

[abs] [pdf] [supplementary]

\(K\)-Means Clustering with Distributed Dimensions

Hu Ding, Yu Liu, Lingxiao Huang, Jian Li

[abs] [pdf] [supplementary]

Texture Networks: Feed-forward Synthesis of Textures and Stylized Images

Dmitry Ulyanov, Vadim Lebedev, Andrea, Victor Lempitsky

[abs] [pdf] [supplementary]

Fast Constrained Submodular Maximization: Personalized Data Summarization

Baharan Mirzasoleiman, Ashwinkumar Badanidiyuru, Amin Karbasi

[abs] [pdf] [supplementary]

On the Statistical Limits of Convex Relaxations

Zhaoran Wang, Quanquan Gu, Han Liu

[abs] [pdf] [supplementary]

Ask Me Anything: Dynamic Memory Networks for Natural Language Processing

Ankit Kumar, Ozan Irsoy, Peter Ondruska, Mohit Iyyer, James Bradbury, Ishaan Gulrajani, Victor Zhong, Romain Paulus, Richard Socher

[abs] [pdf] [supplementary]

Gossip Dual Averaging for Decentralized Optimization of Pairwise Functions

Igor Colin, Aurelien Bellet, Joseph Salmon, Stéphan Clémençon

[abs] [pdf] [supplementary]

Solving Ridge Regression using Sketched Preconditioned SVRG

Alon Gonen, Francesco Orabona, Shai Shalev-Shwartz

[abs] [pdf] [supplementary]

Cumulative Prospect Theory Meets Reinforcement Learning: Prediction and Control

Prashanth L.A., Cheng Jie, Michael Fu, Steve Marcus, Csaba Szepesvari

Estimating Accuracy from Unlabeled Data: A Bayesian Approach

Emmanouil Antonios Platanios, Avinava Dubey, Tom Mitchell

[abs] [pdf] [supplementary]

Non-negative Matrix Factorization under Heavy Noise

Chiranjib Bhattacharya, Navin Goyal, Ravindran Kannan, Jagdeep Pani

[abs] [pdf] [supplementary]

Extreme F-measure Maximization using Sparse Probability Estimates

Kalina Jasinska, Krzysztof Dembczynski, Robert Busa-Fekete, Karlson Pfannschmidt, Timo Klerx, Eyke Hullermeier

[abs] [pdf] [supplementary]

Auxiliary Deep Generative Models

Lars Maaløe, Casper Kaae Sønderby, Søren Kaae Sønderby, Ole Winther

Importance Sampling Tree for Large-scale Empirical Expectation

Olivier Canevet, Cijo Jose, Francois Fleuret

[abs] [pdf] [supplementary]

Starting Small - Learning with Adaptive Sample Sizes

Hadi Daneshmand, Aurelien Lucchi, Thomas Hofmann

[abs] [pdf] [supplementary]

Deep Gaussian Processes for Regression using Approximate Expectation Propagation

Thang Bui, Daniel Hernandez-Lobato, Jose miguel Hernandez-Lobato, Yingzhen Li, Richard Turner

[abs] [pdf] [supplementary]

DR-ABC: Approximate Bayesian Computation with Kernel-Based Distribution Regression

Jovana Mitrovic, Dino Sejdinovic, Yee-Whye Teh

Predictive Entropy Search for Multi-objective Bayesian Optimization

Daniel Hernandez-Lobato, Jose miguel Hernandez-Lobato, Amar Shah, Ryan Adams

[abs] [pdf] [supplementary]

Rich Component Analysis

Rong Ge, James Zou

[abs] [pdf] [supplementary]

Black-Box Alpha Divergence Minimization

Jose miguel Hernandez-Lobato, Yingzhen Li, Mark Rowland, Thang Bui, Daniel Hernandez-Lobato, Richard Turner

[abs] [pdf] [supplementary]

One-Shot Generalization in Deep Generative Models

Danilo Rezende, Shakir, Ivo Danihelka, Karol Gregor, Daan Wierstra

Optimal Classification with Multivariate Losses

Nagarajan Natarajan, Oluwasanmi Koyejo, Pradeep Ravikumar, Inderjit Dhillon

[abs] [pdf] [supplementary]

A ranking approach to global optimization

Cedric Malherbe, Emile Contal, Nicolas Vayatis

[abs] [pdf] [supplementary]

Parallel and Distributed Block-Coordinate Frank-Wolfe Algorithms

Yu-Xiang Wang, Veeranjaneyulu Sadhanala, Wei Dai, Willie Neiswanger, Suvrit Sra, Eric Xing

[abs] [pdf] [supplementary]

Autoencoding beyond pixels using a learned similarity metric

Anders Boesen Lindbo Larsen, Søren Kaae Sønderby, Hugo Larochelle, Ole Winther

Ensuring Rapid Mixing and Low Bias for Asynchronous Gibbs Sampling

Christopher De Sa, Chris Re, Kunle Olukotun

[abs] [pdf] [supplementary]

Simultaneous Safe Screening of Features and Samples in Doubly Sparse Modeling

Atsushi Shibagaki, Masayuki Karasuyama, Kohei Hatano, Ichiro Takeuchi

[abs] [pdf] [supplementary]

Anytime optimal algorithms in stochastic multi-armed bandits

Rémy Degenne, Vianney Perchet

[abs] [pdf] [supplementary]

Bounded Off-Policy Evaluation with Missing Data for Course Recommendation and Curriculum Design

William Hoiles, Mihaela van der Schaar

[abs] [pdf] [supplementary]

On collapsed representation of hierarchical Completely Random Measures

Gaurav Pandey, Ambedkar Dukkipati

[abs] [pdf] [supplementary]

From Softmax to Sparsemax: A Sparse Model of Attention and Multi-Label Classification

Andre Martins, Ramon Astudillo

[abs] [pdf] [supplementary]

Black-box Optimization with a Politician

Sebastien Bubeck, Yin Tat Lee

Gaussian process nonparametric tensor estimator and its minimax optimality

Heishiro Kanagawa, Taiji Suzuki, Hayato Kobayashi, Nobuyuki Shimizu, Yukihiro Tagami

[abs] [pdf] [supplementary]

No-Regret Algorithms for Heavy-Tailed Linear Bandits

Andres Munoz Medina, Scott Yang

Extended and Unscented Kitchen Sinks

Edwin Bonilla, Daniel Steinberg, Alistair Reid

[abs] [pdf] [supplementary]

Matrix Eigen-decomposition via Doubly Stochastic Riemannian Optimization

Zhiqiang Xu, Peilin Zhao, Jianneng Cao, Xiaoli Li

[abs] [pdf] [supplementary]

Recommendations as Treatments: Debiasing Learning and Evaluation

Tobias Schnabel, Adith Swaminathan, Ashudeep Singh, Navin Chandak, Thorsten Joachims

[abs] [pdf] [supplementary]

ForecastICU: A Prognostic Decision Support System for Timely Prediction of Intensive Care Unit Admission

Jinsung Yoon, Ahmed Alaa, Scott Hu, Mihaela van der Schaar

[abs] [pdf] [supplementary]

An optimal algorithm for the Thresholding Bandit Problem

Andrea Locatelli, Maurilio Gutzeit, Alexandra Carpentier

[abs] [pdf] [supplementary]

Fast Parameter Inference in Nonlinear Dynamical Systems using Iterative Gradient Matching

Mu Niu, Simon Rogers, Maurizio Filippone, Dirk Husmeier

Structured and Efficient Variational Deep Learning with Matrix Gaussian Posteriors

Christos Louizos, Max Welling

[abs] [pdf] [supplementary]

Learning Granger Causality for Hawkes Processes

Hongteng Xu, Mehrdad Farajtabar, Hongyuan Zha

[abs] [pdf] [supplementary]

Neural Variational Inference for Text Processing

Yishu Miao, Lei Yu, Phil Blunsom

[abs] [pdf] [supplementary]

Dictionary Learning for Massive Matrix Factorization

Arthur Mensch, Julien Mairal, Bertrand Thirion, Gael Varoquaux

Pixel Recurrent Neural Networks

Aaron Van den Oord, Nal Kalchbrenner, Koray Kavukcuoglu

Why Most Decisions Are Easy in Tetris—And Perhaps in Other Sequential Decision Problems, As Well

Ozgur Simsek, Simon Algorta, Amit Kothiyal

Gaussian quadrature for matrix inverse forms with applications

Chengtao Li, Suvrit Sra, Stefanie Jegelka

[abs] [pdf] [supplementary]

Train and Test Tightness of LP Relaxations in Structured Prediction

Ofer Meshi, Mehrdad Mahdavi, Adrian Weller, David Sontag

[abs] [pdf] [supplementary]

Stochastic Optimization for Multiview Representation Learning using Partial Least Squares

Raman Arora, Poorya Mianjy, Teodor Marinov

[abs] [pdf] [supplementary]

Hierarchical Compound Poisson Factorization

Mehmet Basbug, Barbara Engelhardt

[abs] [pdf] [supplementary]

Opponent Modeling in Deep Reinforcement Learning

He He, Jordan Boyd-Graber, Kevin Kwok, Hal Daumé III

No penalty no tears: Least squares in high-dimensional linear models

Xiangyu Wang, David Dunson, Chenlei Leng

[abs] [pdf] [supplementary]

SDNA: Stochastic Dual Newton Ascent for Empirical Risk Minimization

Zheng Qu, Peter Richtarik, Martin Takac, Olivier Fercoq

[abs] [pdf] [supplementary]

On Graduated Optimization for Stochastic Non-Convex Problems

Elad Hazan, Kfir Yehuda Levy, Shai Shalev-Shwartz

[abs] [pdf] [supplementary]

Meta-Learning with Memory-Augmented Neural Networks

Adam Santoro, Sergey Bartunov, Matthew Botvinick, Daan Wierstra, Timothy Lillicrap

[abs] [pdf] [supplementary]

The knockoff filter for FDR control in group-sparse and multitask regression

Ran Dai, Rina Barber

Softened Approximate Policy Iteration for Markov Games

Julien Pérolat, Bilal Piot, Matthieu Geist, Bruno Scherrer, Olivier Pietquin

[abs] [pdf] [supplementary]

Stochastic Block BFGS: Squeezing More Curvature out of Data

Robert Gower, Donald Goldfarb, Peter Richtarik

Differential Geometric Regularization for Supervised Learning of Classifiers

Qinxun Bai, Steven Rosenberg, Zheng Wu, Stan Sclaroff

[abs] [pdf] [supplementary]

Exploiting Cyclic Symmetry in Convolutional Neural Networks

Sander Dieleman, Jeffrey De Fauw, Koray Kavukcuoglu

Graying the black box: Understanding DQNs

Tom Zahavy, Nir Ben-Zrihem, Shie Mannor

[abs] [pdf] [supplementary]

The Sum-Product Theorem: A Foundation for Learning Tractable Models

Abram Friesen, Pedro Domingos

[abs] [pdf] [supplementary]

Pareto Frontier Learning with Expensive Correlated Objectives

Amar Shah, Zoubin Ghahramani

[abs] [pdf] [supplementary]

Asynchronous Methods for Deep Reinforcement Learning

Volodymyr Mnih, Adria Puigdomenech Badia, Mehdi Mirza, Alex Graves, Timothy Lillicrap, Tim Harley, David Silver, Koray Kavukcuoglu

[abs] [pdf] [supplementary]

A Simple and Strongly-Local Flow-Based Method for Cut Improvement

Nate Veldt, David Gleich, Michael Mahoney

[abs] [pdf] [supplementary]

Nonlinear Statistical Learning with Truncated Gaussian Graphical Models

Qinliang Su, Xuejun Liao, Changyou Chen, Lawrence Carin

[abs] [pdf] [supplementary]

Barron and Cover’s Theory in Supervised Learning and its Application to Lasso

Masanori Kawakita, Jun’ichi Takeuchi

[abs] [pdf] [supplementary]

Nonparametric Canonical Correlation Analysis

Tomer Michaeli, Weiran Wang, Karen Livescu

[abs] [pdf] [supplementary]

BISTRO: An Efficient Relaxation-Based Method for Contextual Bandits

Alexander Rakhlin, Karthik Sridharan

[abs] [pdf] [supplementary]

Associative Long Short-Term Memory

Ivo Danihelka, Greg Wayne, Benigno Uria, Nal Kalchbrenner, Alex Graves

[abs] [pdf] [supplementary]

Dueling Network Architectures for Deep Reinforcement Learning

Ziyu Wang, Tom Schaul, Matteo Hessel, Hado van Hasselt, Marc Lanctot, Nando de Freitas

Persistence weighted Gaussian kernel for topological data analysis

Genki Kusano, Yasuaki Hiraoka, Kenji Fukumizu

[abs] [pdf] [supplementary]

Learning Convolutional Neural Networks for Graphs

Mathias Niepert, Mohamed Ahmed, Konstantin Kutzkov

Persistent RNNs: Stashing Recurrent Weights On-Chip

Greg Diamos, Shubho Sengupta, Bryan Catanzaro, Mike Chrzanowski, Adam Coates, Erich Elsen, Jesse Engel, Awni Hannun, Sanjeev Satheesh

Recurrent Orthogonal Networks and Long-Memory Tasks

Mikael Henaff, Arthur Szlam, Yann LeCun

The Arrow of Time in Multivariate Time Series

Stefan Bauer, Bernhard Schölkopf, Jonas Peters

[abs] [pdf] [supplementary]

Mixture Proportion Estimation via Kernel Embeddings of Distributions

Harish Ramaswamy, Clayton Scott, Ambuj Tewari

[abs] [pdf] [supplementary]

Fast DPP Sampling for Nystrom with Application to Kernel Methods

Chengtao Li, Stefanie Jegelka, Suvrit Sra

[abs] [pdf] [supplementary]

Complex Embeddings for Simple Link Prediction

Théo Trouillon, Johannes Welbl, Sebastian Riedel, Eric Gaussier, Guillaume Bouchard

[abs] [pdf] [supplementary]

Interactive Bayesian Hierarchical Clustering

Sharad Vikram, Sanjoy Dasgupta

[abs] [pdf] [supplementary]

A Convolutional Attention Network for Extreme Summarization of Source Code

Miltiadis Allamanis, Hao Peng, Charles Sutton

How to Fake Multiply by a Gaussian Matrix

Michael Kapralov, Vamsi Potluru, David Woodruff

[abs] [pdf] [supplementary]

Differentially Private Chi-Squared Hypothesis Testing: Goodness of Fit and Independence Testing

Marco Gaboardi, Hyun Lim, Ryan Rogers, Salil Vadhan

[abs] [pdf] [supplementary]

Pliable Rejection Sampling

Akram Erraqabi, Michal Valko, Alexandra Carpentier, Odalric Maillard

[abs] [pdf] [supplementary]

Differentially Private Policy Evaluation

Borja Balle, Maziar Gomrokchi, Doina Precup

[abs] [pdf] [supplementary]

Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning

Philip Thomas, Emma Brunskill

[abs] [pdf] [supplementary]

Discrete Deep Feature Extraction: A Theory and New Architectures

Thomas Wiatowski, Michael Tschannen, Aleksandar Stanic, Philipp Grohs, Helmut Boelcskei

[abs] [pdf] [supplementary]

Efficient Algorithms for Adversarial Contextual Learning

Vasilis Syrgkanis, Akshay Krishnamurthy, Robert Schapire

[abs] [pdf] [supplementary]

Training Deep Neural Networks via Direct Loss Minimization

Yang Song, Alexander Schwing, Richard, Raquel Urtasun

[abs] [pdf] [supplementary]

Sequence to Sequence Training of CTC-RNNs with Partial Windowing

Kyuyeon Hwang, Wonyong Sung

[abs] [pdf] [supplementary]

Variational Inference for Monte Carlo Objectives

Andriy Mnih, Danilo Rezende

[abs] [pdf] [supplementary]

Hierarchical Decision Making In Electricity Grid Management

Gal Dalal, Elad Gilboa, Shie Mannor

Learning Sparse Combinatorial Representations via Two-stage Submodular Maximization

Eric Balkanski, Baharan Mirzasoleiman, Andreas Krause, Yaron Singer

[abs] [pdf] [supplementary]

Understanding and Improving Convolutional Neural Networks via Concatenated Rectified Linear Units

Wenling Shang, Kihyuk Sohn, Diogo Almeida, Honglak Lee

Isotonic Hawkes Processes

Yichen Wang, Bo Xie, Nan Du, Le Song

[abs] [pdf] [supplementary]

Cross-Graph Learning of Multi-Relational Associations

Hanxiao Liu, Yiming Yang

Markov-modulated Marked Poisson Processes for Check-in Data

Jiangwei Pan, Vinayak Rao, Pankaj Agarwal, Alan Gelfand

Beyond Parity Constraints: Fourier Analysis of Hash Functions for Inference

Tudor Achim, Ashish Sabharwal, Stefano Ermon

[abs] [pdf] [supplementary]

On the Power and Limits of Distance-Based Learning

Periklis Papakonstantinou, Jia Xu, Guang Yang

[abs] [pdf] [supplementary]

A Convex Atomic-Norm Approach to Multiple Sequence Alignment and Motif Discovery

Ian En-Hsu Yen, Xin Lin, Jiong Zhang, Pradeep Ravikumar, Inderjit Dhillon

[abs] [pdf] [supplementary]

Generalized Direct Change Estimation in Ising Model Structure

Farideh Fazayeli, Arindam Banerjee

Robust Principal Component Analysis with Side Information

Kai-Yang Chiang, Cho-Jui Hsieh, Inderjit Dhillon

[abs] [pdf] [supplementary]

Towards Faster Rates and Oracle Property for Low-Rank Matrix Estimation

Huan Gui, Jiawei Han, Quanquan Gu

[abs] [pdf] [supplementary]

Early and Reliable Event Detection Using Proximity Space Representation

Maxime Sangnier, Jerome Gauthier, Alain Rakotomamonjy

Stratified Sampling Meets Machine Learning

Edo Liberty, Kevin Lang, Konstantin Shmakov

Efficient Multi-Instance Learning for Activity Recognition from Time Series Data Using an Auto-Regressive Hidden Markov Model

Xinze Guan, Raviv Raich, Weng-Keen Wong

[abs] [pdf] [supplementary]

Generalization Properties and Implicit Regularization for Multiple Passes SGM

Junhong Lin, Raffaello Camoriano, Lorenzo Rosasco

[abs] [pdf] [supplementary]

Principal Component Projection Without Principal Component Analysis

Roy Frostig, Cameron Musco, Christopher Musco, Aaron Sidford

[abs] [pdf] [supplementary]

Recovery guarantee of weighted low-rank approximation via alternating minimization

Yuanzhi Li, Yingyu Liang, Andrej Risteski

[abs] [pdf] [supplementary]

Deconstructing the Ladder Network Architecture

Mohammad Pezeshki, Linxi Fan, Philemon Brakel, Aaron Courville, Yoshua Bengio

[abs] [pdf] [supplementary]

Generalization and Exploration via Randomized Value Functions

Ian Osband, Benjamin Van Roy, Zheng Wen

[abs] [pdf] [supplementary]

Evasion and Hardening of Tree Ensemble Classifiers

Alex Kantchelian, J. D. Tygar, Anthony Joseph

[abs] [pdf] [supplementary]

Dynamic Memory Networks for Visual and Textual Question Answering

Caiming Xiong, Stephen Merity, Richard Socher

Estimating Cosmological Parameters from the Dark Matter Distribution

Siamak Ravanbakhsh, Junier Oliva, Sebastian Fromenteau, Layne Price, Shirley Ho, Jeff Schneider, Barnabas Poczos

[abs] [pdf] [supplementary]

Learning Population-Level Diffusions with Generative RNNs

Tatsunori Hashimoto, David Gifford, Tommi Jaakkola

[abs] [pdf] [supplementary]

Expressiveness of Rectifier Networks

Xingyuan Pan, Vivek Srikumar

[abs] [pdf] [supplementary]

Discrete Distribution Estimation under Local Privacy

Peter Kairouz, Keith Bonawitz, Daniel Ramage

[abs] [pdf] [supplementary]

Square Root Graphical Models: Multivariate Generalizations of Univariate Exponential Families that Permit Positive Dependencies

David Inouye, Pradeep Ravikumar, Inderjit Dhillon

[abs] [pdf] [supplementary]

A Box-Constrained Approach for Hard Permutation Problems

Cong Han Lim, Steve Wright

[abs] [pdf] [supplementary]

Geometric Mean Metric Learning

Pourya Zadeh, Reshad Hosseini, Suvrit Sra

Sparse Nonlinear Regression: Parameter Estimation under Nonconvexity

Zhuoran Yang, Zhaoran Wang, Han Liu, Yonina Eldar, Tong Zhang

[abs] [pdf] [supplementary]

Conditional Bernoulli Mixtures for Multi-label Classification

Cheng Li, Bingyu Wang, Virgil Pavlu, Javed Aslam

[abs] [pdf] [supplementary]

Scalable Discrete Sampling as a Multi-Armed Bandit Problem

Yutian Chen, Zoubin Ghahramani

[abs] [pdf] [supplementary]

Recycling Randomness with Structure for Sublinear time Kernel Expansions

Krzysztof Choromanski, Vikas Sindhwani

[abs] [pdf] [supplementary]

Bidirectional Helmholtz Machines

Jorg Bornschein, Samira Shabanian, Asja Fischer, Yoshua Bengio

Faster Convex Optimization: Simulated Annealing with an Efficient Universal Barrier

Jacob Abernethy, Elad Hazan

[abs] [pdf] [supplementary]

Preconditioning Kernel Matrices

Kurt Cutajar, Michael Osborne, John Cunningham, Maurizio Filippone

[abs] [pdf] [supplementary]

Greedy Column Subset Selection: New Bounds and Distributed Algorithms

Jason Altschuler, Aditya Bhaskara, Gang Fu, Vahab Mirrokni, Afshin Rostamizadeh, Morteza Zadimoghaddam

[abs] [pdf] [supplementary]

Dynamic Capacity Networks

Amjad Almahairi, Nicolas Ballas, Tim Cooijmans, Yin Zheng, Hugo Larochelle, Aaron Courville

Pricing a Low-regret Seller

Hoda Heidari, Mohammad Mahdian, Umar Syed, Sergei Vassilvitskii, Sadra Yazdanbod

Estimation from Indirect Supervision with Linear Moments

Aditi Raghunathan, Roy Frostig, John Duchi, Percy Liang

[abs] [pdf] [supplementary]

Speeding up k-means by approximating Euclidean distances via block vectors

Thomas Bottesch, Thomas Bühler, Markus Kächele

Learning and Inference via Maximum Inner Product Search

Stephen Mussmann, Stefano Ermon

[abs] [pdf] [supplementary]

A Superlinearly-Convergent Proximal Newton-type Method for the Optimization of Finite Sums

Anton Rodomanov, Dmitry Kropotov

[abs] [pdf] [supplementary]

A Kernel Test of Goodness of Fit

Kacper Chwialkowski, Heiko Strathmann, Arthur Gretton

[abs] [pdf] [supplementary]

Interacting Particle Markov Chain Monte Carlo

Tom Rainforth, Christian Naesseth, Fredrik Lindsten, Brooks Paige, Jan-Willem Vandemeent, Arnaud Doucet, Frank Wood

[abs] [pdf] [supplementary]

Faster Eigenvector Computation via Shift-and-Invert Preconditioning

Dan Garber, Elad Hazan, Chi Jin, Sham, Cameron Musco, Praneeth Netrapalli, Aaron Sidford

[abs] [pdf] [supplementary]

A Theory of Generative ConvNet

Jianwen Xie, Yang Lu, Song-Chun Zhu, Yingnian Wu

Efficient Learning with a Family of Nonconvex Regularizers by Redistributing Nonconvexity

Quanming Yao, James Kwok

Computationally Efficient Nyström Approximation using Fast Transforms

Si Si, Cho-Jui Hsieh, Inderjit Dhillon

[abs] [pdf] [supplementary]

Gromov-Wasserstein Averaging of Kernel and Distance Matrices

Gabriel Peyré, Marco Cuturi, Justin Solomon

Robust Monte Carlo Sampling using Riemannian Nosé-Poincaré Hamiltonian Dynamics

Anirban Roychowdhury, Brian Kulis, Srinivasan Parthasarathy

[abs] [pdf] [supplementary]

The Segmented iHMM: A Simple, Efficient Hierarchical Infinite HMM

Ardavan Saeedi, Matthew Hoffman, Matthew Johnson, Ryan Adams

[abs] [pdf] [supplementary]

Meta–Gradient Boosted Decision Tree Model for Weight and Target Learning

Yury Ustinovskiy, Valentina Fedorova, Gleb Gusev, Pavel Serdyukov

Discriminative Embeddings of Latent Variable Models for Structured Data

Hanjun Dai, Bo Dai, Le Song

[abs] [pdf] [supplementary]

Robust Random Cut Forest Based Anomaly Detection on Streams

Sudipto Guha, Nina Mishra, Gourav Roy, Okke Schrijvers

[abs] [pdf] [supplementary]

Training Neural Networks Without Gradients: A Scalable ADMM Approach

Gavin Taylor, Ryan Burmeister, Zheng Xu, Bharat Singh, Ankit Patel, Tom Goldstein

Clustering High Dimensional Categorical Data via Topographical Features

Chao Chen, Novi Quadrianto

Efficient Algorithms for Large-scale Generalized Eigenvector Computation and Canonical Correlation Analysis

Rong Ge, Chi Jin, Sham, Praneeth Netrapalli, Aaron Sidford

[abs] [pdf] [supplementary]

Algorithms for Optimizing the Ratio of Submodular Functions

Wenruo Bai, Rishabh Iyer, Kai Wei, Jeff Bilmes

Model-Free Imitation Learning with Policy Optimization

Jonathan Ho, Jayesh Gupta, Stefano Ermon

[abs] [pdf] [supplementary]

ADIOS: Architectures Deep In Output Space

Moustapha Cisse, Maruan Al-Shedivat, Samy Bengio

Conditional Dependence via Shannon Capacity: Axioms, Estimators and Applications

Weihao Gao, Sreeram Kannan, Sewoong Oh, Pramod Viswanath

Control of Memory, Active Perception, and Action in Minecraft

Junhyuk Oh, Valliappa Chockalingam, Satinder, Honglak Lee

[abs] [pdf] [supplementary]

The Label Complexity of Mixed-Initiative Classifier Training

Jina Suh, Xiaojin Zhu, Saleema Amershi

[abs] [pdf] [supplementary]

Bayesian Poisson Tucker Decomposition for Learning the Structure of International Relations

Aaron Schein, Mingyuan Zhou, David Blei, Hanna Wallach

[abs] [pdf] [supplementary]

Tensor Decomposition via Joint Matrix Schur Decomposition

Nicolo Colombo, Nikos Vlassis

Continuous Deep Q-Learning with Model-based Acceleration

Shixiang Gu, Timothy Lillicrap, Ilya Sutskever, Sergey Levine

[abs] [pdf] [supplementary]

Domain Adaptation with Conditional Transferable Components

Mingming Gong, Kun Zhang, Tongliang Liu, Dacheng Tao, Clark Glymour, Bernhard Schölkopf

[abs] [pdf] [supplementary]

Fixed Point Quantization of Deep Convolutional Networks

Darryl Lin, Sachin Talathi, Sreekanth Annapureddy

Provable Algorithms for Inference in Topic Models

Sanjeev Arora, Rong Ge, Frederic Koehler, Tengyu Ma, Ankur Moitra

Epigraph projections for fast general convex programming

Po-Wei Wang, Matt Wytock, Zico Kolter

[abs] [pdf] [supplementary]

Fast Algorithms for Segmented Regression

Jayadev Acharya, Ilias Diakonikolas, Jerry Li, Ludwig Schmidt

[abs] [pdf] [supplementary]

Energetic Natural Gradient Descent

Philip Thomas, Bruno Castro da Silva, Christoph Dann, Emma Brunskill

[abs] [pdf] [supplementary]

Partition Functions from Rao-Blackwellized Tempered Sampling

David Carlson, Patrick Stinson, Ari Pakman, Liam Paninski

[abs] [pdf] [supplementary]

Learning Mixtures of Plackett-Luce Models

Zhibing Zhao, Peter Piech, Lirong Xia

[abs] [pdf] [supplementary]

Near Optimal Behavior via Approximate State Abstraction

David Abel, David Hershkowitz, Michael Littman

Power of Ordered Hypothesis Testing

Lihua Lei, William Fithian

[abs] [pdf] [supplementary]

PHOG: Probabilistic Model for Code

Pavol Bielik, Veselin Raychev, Martin Vechev

[abs] [pdf] [supplementary]

Shifting Regret, Mirror Descent, and Matrices

Andras Gyorgy, Csaba Szepesvari

Scalable Gradient-Based Tuning of Continuous Regularization Hyperparameters

Jelena Luketina, Tapani Raiko, Mathias Berglund, Klaus Greff

Model-Free Trajectory Optimization for Reinforcement Learning

Riad Akrour, Gerhard Neumann, Hany Abdulsamad, Abbas Abdolmaleki

[abs] [pdf] [supplementary]

Controlling the distance to a Kemeny consensus without computing it

Yunlong Jiao, Anna Korba, Eric Sibony

[abs] [pdf] [supplementary]

Horizontally Scalable Submodular Maximization

Mario Lucic, Olivier Bachem, Morteza Zadimoghaddam, Andreas Krause

[abs] [pdf] [supplementary]

Group Equivariant Convolutional Networks

Taco Cohen, Max Welling

[abs] [pdf] [supplementary]

Stochastic Discrete Clenshaw-Curtis Quadrature

Nico Piatkowski, Katharina Morik

Correcting Forecasts with Multifactor Neural Attention

Matthew Riemer, Aditya Vempaty, Flavio Calmon, Fenno Heath, Richard Hull, Elham Khabiri

Learning Representations for Counterfactual Inference

Fredrik Johansson, Uri Shalit, David Sontag

[abs] [pdf] [supplementary]

Automatic Construction of Nonparametric Relational Regression Models for Multiple Time Series

Yunseong Hwang, Anh Tong, Jaesik Choi

[abs] [pdf] [supplementary]

Inference Networks for Sequential Monte Carlo in Graphical Models

Brooks Paige, Frank Wood

[abs] [pdf] [supplementary]

Slice Sampling on Hamiltonian Trajectories

Benjamin Bloem-Reddy, John Cunningham

[abs] [pdf] [supplementary]

Noisy Activation Functions

Caglar Gulcehre, Marcin Moczulski, Misha Denil, Yoshua Bengio

PD-Sparse : A Primal and Dual Sparse Approach to Extreme Multiclass and Multilabel Classification

Ian En-Hsu Yen, Xiangru Huang, Pradeep Ravikumar, Kai Zhong, Inderjit Dhillon

[abs] [pdf] [supplementary]