Stop Thinking, Just Do!
Sungsoo Kim's Blog
Categories
computer science
- What's the difference between programming and coding
- Cross-Platform Data Lineage with OpenLineage
- Robust and Trustworthy Deep Learning
- PFNs; Use neural networks for 100x faster Bayesian predictions
- Multimodal Deep Learning for Protein Engineering
- Topological Deep Learning; Going Beyond Graph Data
- Is Distance Matrix Enough for Geometric Deep Learning?
- Sign and Basis Invariant Networks for Spectral Graph Representation Learning
- Distributed and Multiagent Reinforcement Learning
- A Distributional Multi-agent Reinforcement Learning Approach
- Deploy and Scale Models with BentoML
- Tool Agnostic MLOps with ZenML
- The Four Pillars of Machine Learning
- When to use Kubernetes natively over Kubeflow for ML
- Deploy ZenML + Kubeflow + MLflow + Minio
- MLOps Pipeline Tutorial with ZenML and Kubeflow
- ML models with ZenML and BentoML
- Recent Advances in Vision Foundation Models
- CVPR 2023 Tutorials
- Model Registry and Deployment with MLflow
- How to Deploy ML Models in Production with BentoML
- Data Science–A Systematic Treatment
- LEARN ENGLISH with STEVE JOBS
- DuckDB Internals
- How to Master the Art of Leadership
- Getting Started with ZenML
- Generative Models for Language and Vision
- ChatGPT, LLMs & Generative AI; What Your Business Needs to Know
- Differential Privacy & Variants
- Towards Learned Database Systems
- Deep Lake; a Lakehouse for Deep Learning
- Towards Natural Language Query Answering
- Do We Still Need People To Write Database Systems?
- Generative AI and Databases
- Contrastive Learning in PyTorch
- Self-Supervised Learning; Self-Prediction and Contrastive Learning
- Discovering Dynamic Salient Regions for Spatio-Temporal Graph Neural Networks
- Bayesian Neural Network
- Study Less Study Smart
- Kubeflow v1.7 Release Overview
- Amazon Redshift Internals
- ChatGPT Course – Use The OpenAI API to Code 5 Projects
- Installing and Using NVIDIA Docker
- Hypertree Decompositions
- What is Platform Engineering?
- Databricks Spark SQL / Photon
- Google BigQuery / Dremel
- Personalized Image Generation
- Kubeflow Pipelines 2.0
- Learning to Transfer Knowledge Through Embedding Spaces
- Traffic Prediction with Transfer Learning
- First-Passage Percolation and Related Models
- Sparse Fourier Transform Algorithm for Real-Time Applications
- The Frontier of Deep Learning for Robotics
- Microsoft's Products Will Soon Access Open AI Tools Like ChatGPT
- LLaMA & Alpaca; "ChatGPT" On Your Local Computer
- Multi-Objective Recommender Systems
- Hands-on Explainable Recommender Systems with Knowledge Graphs
- Improving Recommender Systems with Human in the Loop
- Open-Source Systems for Federated Learning
- An Introduction to Federated Computation
- Data Governance and Sharing on Lakehouse
- Data Mesh and Lakehouse
- Master Data Management
- Setting up Big Data Fabric
- On Future of the Modern Data Stack
- Data Virtualization in Data Fabric
- Recommender Systems using Graph Neural Networks
- Activate Your Modern MetaData Stack
- Databricks Lakehouse makes payments ingestion and analytics simple
- The Essential Guide to Data Lineage
- Data Governance Explained in 5 Minutes
- Automated Data Lineage with Unity Catalog
- Active Metadata - Understanding the magic behind the Data Fabric
- Introduction to Data Mesh
- PostgreSQL vs MySQL
- Activate Your Metadata to Empower Innovation
- Accelerating Hybrid Data Mesh Implementation
- Deep-Dive into Delta Lake
- Delta Lake 2.0 Overview
- Advancing Spark - The Photon Whitepaper
- Column-Level Lineage and Active Metadata
- Five Things to Consider About Data Mesh and Data Governance
- Data Mesh Implementation Patterns
- Intro to Databricks Lakehouse Platform Architecture and Security
- Meshing About with Databricks
- Enterprise Data Fabric
- Converging Advances to Accelerate Molecular Simulation
- Learning Delaunay Surface Elements for Mesh Reconstruction
- Data Fabric for Self-Driving Cars
- Knowledge Graphs Seminar
- Data Cataloging with Knowledge Graphs
- The Foundation of a Data Fabric
- Data Mesh, Data Fabric, Data Lakehouse
- Tractable Probabilistic Circuits
- Distributed Analytical Database Systems
- GraphZeppelin - Streaming Graph Connectivity at Scale
- Understanding Graph Data Representations in Triplestores
- Graph-Powered Data Exploration
- Google BigQuery
- Conformal Prediction Tutorial
- Traffic Prediction Paper Collection
- End-to-end Optimization of Machine Learning Prediction Queries
- Compute Complex Temporal Join Queries Efficiently
- Making Learned Query Optimization Practical
- Universal Database Optimizer
- Machine Learning for Query Optimization
- ML Explainability
- Towards Generalizable Autonomy
- Building an AI and ML ready Modern Data Platform
- Diffusion Probabilistic Modelling of Protein Backbones in 3D
- ICCV Authors Guidelines
- Diffusion Probabilistic Models
- Stable Diffusion - What, Why, How?
- Deep Energy-Based Learning
- Incident Prediction in Spatio-Temporal Road Graph Networks
- Diffusion probabilistic modelling of protein backbones in 3D
- Contrastive Learning in PyTorch
- Neural Rendering
- Anatomy of a Research Paper
- Geometric Deep Learning for Drug Discovery
- Deploy Custom Python Model Server
- Contrastive Learning in PyTorch
- InferenceService using a Custom Torchserve Image
- From Machine Learning to Autonomous Intelligence
- Towards a Learned Index Structure for ANN Search
- Graph Processing; from Academic Research to Industrial Application
- Katana Graph; A Graph Intelligence Platform
- Machine Learning for Scientific Discovery
- Graph Neural Networks as Gradient Flows
- Ab-Initio Potential Energy Surfaces
- Reinforcement Learning via Sequence Modeling
- Recipe for a General, Powerful, Scalable Graph Transformer
- Spatial Hash Grids & Tales from Game Development
- Cardinality Estimation Benchmark
- Spatial Indexing in PostGIS
- Data-driven Learned Metric Index; an Unsupervised Approach
- Synopses - Samples, Histogram, Wavelets and Sketches
- Query Execution in MonetDB
- Low overhead self-optimizing storage for compression
- A Database Tuning Tool that "Reads the Manual"
- Rethinking Graph Transformers with Spectral Attention
- Introduction to TorchServe
- MicroK8s Installation & Simple Commands
- Diffusion Models from Scratch in PyTorch
- Deep Learning Theory Lectures
- Kubeflow Fairing
- Kubeflow Fairing, Pipelines, and Training
- On Recoverability of Graph Neural Network Representation
- Deploy Model to KServe
- TorchScript and PyTorch JIT
- Spectra of Graphs and Hypergraphs
- How to Serve PyTorch Models with TorchServe
- A Novel Data Set for Information Retrieval on the Basis of Subgraph Matching
- Graph Embedding in Vector Spaces Using Matching-Graphs
- Google Cloud BigQuery ML Using SQL
- Open World Lifelong Learning
- On Machines that can Learn Continually
- Introduction to Continual Learning
- Introducing OpenAPI Generator
- Kubeflow Development Environment
- KFServing Deep Dive
- Protein structure prediction with AlphaFold
- Deep Learning for Scientific Computation
- Started with AI in Drug Discovery
- Istio Simplified
- Knative and Microk8s with Multipass
- KServe in Kubenetes
- Graph Ordering Attention Networks
- Introduction to KServe
- Position-Aware Graph Neural Network using Reachability Estimations
- Isomorphism Using Adjacency Matrix
- Efficient Probabilistic Truss Indexing on Uncertain Graphs
- Toward Verified Artificial Intelligence
- Modern Data Stack
- PyTorchJobClient
- KServe (Kubeflow KFServing) Live Coding Session
- Workflow for ML Projects — MLOps
- Kubeflow Setup
- Introduction to Data Mesh
- Theory of Graph Neural Networks
- Representational Power of Graph Nerual Networks
- Task structure and generalization in graph neural networks
- Geometry Processing with Neural Fields
- Trends in AI
- RelationalAI Knowledge Graph Management System
- Topological Graph Neural Networks
- Causality and (Graph) Neural Networks
- Using Graph Neural Networks for Multi-Node Representation Learning
- Epistemic Uncertainty Estimation for Efficient Search of Drug Candidates
- Toward Neuro Causality
- Connect to MySQL Database from Visual Studio Code
- Data-Efficient Graph Grammar Learning for Molecular Generation
- Rethinking Graph Transformers with Spectral Attention
- How GNNs and Symmetries can help to solve PDEs
- Neural diffusion PDEs, differential geometry, and graph neural networks
- BigQuery ML; Machine Learning with Standard SQL
- GNN with Learnable Structural and Positional Presentations
- Istio Service Mesh 101
- Graph Neural Networks and Diffusion PDEs
- Recommender Systems using Graph Neural Networks
- Manage Multi-tenant ML Workloads Using Istio
- Graph Neural Networks as Neural Diffusion PDEs
- Exploring ML Model Serving with KServe
- Kubeflow Tutorial | Model Serving
- Using AI to accelerate scientific discovery
- Java to MySQL using Visual Studio Code
- Developing the Kubernetes Python Client
- Kubernetes Patterns
- Using Visual Studio Code for Java Maven Project
- Mapping External Services
- MiniKF; Kubeflow on your laptop
- Gateway Routing Pattern
- Kubeflow v1.5 Release Overview
- TensorFlow Distributed Training on Kubeflow
- Sidecar Pattern
- YAML
- Kubeflow on macOS
- Why is Kubernetes called K8s?
- Kubernetes Crash Course for Absolute Beginners
- Marrying Top k with Skyline Queries
- Spatial Skyline Queries on Triangulated Irregular Networks
- Challenges and Solutions of Using Kubernetes for Blockchain Applications
- PyTorch Training (PyTorchJob)
- Kubeflow Architecture
- Istio and Service Mesh
- Kubernetes Tutorial for Beginners
- Serving Machine Learning Models at Scale Using KServe
- Advanced Model Inferencing Leveraging Kubeflow Serving
- PyTorch via SQL Commands
- Scaling Shortest Path Graph Queries on Very Large Networks
- Hierarchical Core Maintenance on Large Dynamic Graphs
- Shortest Paths and Centrality in Uncertain Networks
- Bag of Tricks for Node Classification with Graph Neural Networks
- The Case for Learned Index Structures
- Deep Learning for Graph Similarity Search
- HyperSPNs; Compact and Expressive Probabilistic Circuits
- Learned Index Structures for Dynamic and Multi-Dimensional Data
- Blame the Data, Not the System
- Hyperbolic Embeddings in Machine Learning and Deep Learning
- What are Temporal Databases?
- Git Commit Message Style Guide
- What is PrestoDB?
- Spatial, High Dimensional, Temporal Data Indexing and Querying
- A Learned Spatial Index for Range and kNN Queries
- An Introduction to Spatial Data and its Applications
- Robustness/Interpretability in Vision & Language Models
- Visual Data Analysis; How? When? Why?
- SageDB; A Self-Assembling Database System
- Towards Zero-Shot Learning for Databases
- From Workload-Driven to Zero-Shot Learning
- From Notebook to Kubeflow Pipelines to KFServing
- Improving Transfer and Robustness of Supervised Contrastive Learning
- Approximate Query Processing
- What are Probabilistic Data Structures?
- Approximation Algorithms for Large Scale Data Analysis
- Modeling Relational Data with Graph Convolution Network
- Open-source Change Data Capture With Debezium
- Finding Approximately Repeated Patterns in Time Series
- Looper; An End-to-End ML Platform for Product Decisions
- Learned Indexing and Sampling for Improving Query Performance
- Principles of Good Machine Learning Systems Design
- Deep Learning Design Patterns
- Tips for Thinking Like a Machine Learning Architect
- Machine Learning Design Patterns
- Introduction to Kubeflow
- Martingale Theory
- Making Architecture Matter
- Bayesian Inference for Big Data
- Architecture and Cardinality Estimations for Graph Queries
- What causes congestion?
- Optimization problems in graphs with locational uncertainty
- Deep Implicit Layers
- Convex Functions
- Quantum Deep Learning
- Uncertainty and Out-of-Distribution Robustness in Deep Learning
- Probabilistic Methods for Increased Robustness in Machine Learning
- Probabilistic Circuits - Representations, Inference, Learning and Theory
- FLAT-Fast, Lightweight and Accurate Method for Cardinality Estimation
- Learned Query Scheduling
- A 10 Minute Introduction to Kubeflow
- Ethics of Artificial Intelligence
- What is AI Ethics?
- What is the Cardinality Estimator?
- Qd-tree - Learning Data Layouts for Big Data Analytics
- Sum-Product Networks
- Towards Zero-Shot Learning for Databases
- Machine Learning Design Patterns
- Learned DBMS Components 2.0
- Deep Multi-task and Meta Learning
- Machine Learning Design Patterns - Rebalancing
- Calcite Tutorial
- NeuroCard - One Cardinality Estimator for All Tables
- GNN based Recommender Systems
- On the Nature of Data Science
- Breaking the Limit of Graph Neural Networks using Local Mixing Patterns
- Performance Adaptive Sampling Strategy Towards Fast and Accurate Graph Neural Networks
- Data Fabrics for Enterprise Data Management?
- Query Optimization and Acceleration at Dremio
- AI4DB & DB4AI-Papers
- Permissioned Blockchains
- Apache Spark, Data Science + Machine Learning
- Introduction to Temporal Graph Neural Networks
- Traffic Forecasting with Pytorch Geometric Temporal
- Geometric Deep Learning
- What Is NFT? - Non Fungible Token
- Deep Recommender Systems
- Weisfeiler and Lehman Go Cellular; CW Networks
- Matrix Multiplication, and the Asymptotic Spectrum Of Tensors
- Traffic Prediction for Intelligent Transport System
- Revisiting Knowledge Graph Completion From a Practical Perspective
- Applying Machine Learning-based Database Tuning in Production
- Cloud-Native Ledger Graph Database
- A Probabilistic Model for Joint Inference from Differential Equations
- Optimization Frameworks for Graph Clustering
- Neuralising a Computer Scientist
- Preparing For Your Dissertation Defense
- Oral Dissertation Defense Preparation
- How To Defend Your Thesis? Top 10 Tips For Success
- How to Get a PhD in Computer Science
- 100 Thesis Defense Questions in 3 Categories
- 10 Mistakes to Avoid When Defending Your Thesis
- How to prepare for your PhD thesis defence
- Geometric Deep Learning-Past, Present, And Future Optimization
- Optimal Gradient-based Algorithms for Non-concave Bandit Optimization
- How to write a literature review - my simple 5 step process!
- Geometric Deep Learning - from Euclid to drug design
- Graph Isomorphism in Quasipolynomial Time
- Geometric Deep Learning Blueprint
- Geometric Deep Learning-The Erlangen Programme of ML
- KDD Cup OGB Large-Scale Challenge
- Theoretical Foundations of Data-Driven Algorithm Design
- Relational Inductive Biases, Deep Learning, and Graph Networks
- The Future Is Big Graphs
- The Universal Approximation Theorem
- Equivariant Networks and Natural Graph Networks
- An Introduction to Hamiltonian Monte Carlo Method for Sampling
- Recurrent Neural Networks for Cognitive Neuroscience
- Permutation Equivariance of Graph Filters
- Graph Convolutional and Isomorphism Networks
- Combinatorial Properties of the Weisfeiler-Leman Algorithm
- Advances in Self-Supervised Learning
- Contrastive Loss, Deep Metric Learning
- Self-Supervised Learning
- Contrastive Learning; A General Self-supervised Learning Approach
- Supervised Contrastive Learning
- What is Noise-Contrastive Estimation?
- GNNs with Learnable Structural and Positional Representation
- Knowledge Graph Representation
- Temporal Graph Networks (TGN)
- Building a Recommender System using Graph Neural Networks
- Junction Tree Variational Autoencoder for Molecular Graph Generation
- Understanding Graph Attention Networks
- GVAE Training and Adjacency Reconstruction
- Graph Representation Learning for Drug Discovery
- Graph Neural Networks for Traffic Prediction
- Visual Exploration of Trajectory Data
- GPS Data Analysis with Python
- 22 Python libraries for Geospatial Data Analysis
- Clustering Trajectories
- Towards Causal Representation Learning
- Random Sum Product Networks
- Understanding & Generating Source Code with Graph Neural Networks
- Sum-Product Networks; The Next Generation of Deep Models
- Deep Learning to Discover Coordinates for Dynamics
- The Power of Graph Signal Processing
- Some Mathematical Problems in Graph Signal Processing
- On Laplacian Eigenmaps for Dimensionality Reduction
- Overview of Graph Embeddings
- Sum-Product Networks; Powerful Models with Tractable Inference
- Sum-Product Networks
- Geometric Deep Learning and Reinforcement Learning
- Discovering Genetic Medicines with our AI Drug Discovery Platform
- Proof of Euler's Formula
- GNNs and RL in Traffic Optimization in Cities
- Graph Neural Networks and Applications to Deep Reinforcement Learning
- Writing Scientific Papers
- Drug Discovery with GANs
- PyTorch nn.Embedding()
- Reproducing Kernel Hilbert Spaces
- Dynamical Systems and Machine Learning
- An Introduction to Hilbert Spaces
- Graph Neural Networks in Computational Biology
- What is Euler's Number 'e'?
- Euler's Formula
- The Characteristic Equation and Eigenvalues
- Spanning Trees and Kirchhoff's Theorem
- Acceptance Rate of AI Conferences
- Deep Graph Learning Foundations, Advances and Applications
- Learning Graphs from Data; A Signal Processing Perspective
- Making Graphs Compact by Lossless Contraction
- A Short Course in Spectral Graph Theory
- An Introduction to Spectral Graph Theory
- Spectral Graph Theory; Enter Linear Algebra
- Spectral Graph Theory; The Standard Random Walk
- Spectral Graph Theory; The Quadratic Form
- The Laplacian Matrices of Graphs; Algorithms and Applications
- Pragmatic Ridge Spectral Sparsification for Large-Scale Graph Learning
- Knowledge Graph Embeddings Tutorial; From Theory to Practice
- How to use edge features in Graph Neural Networks
- Graph Representation Learning
- Papers on Graph Neural Networks
- A Unified Deep Model of Learning from both Data and Queries for Cardinality Estimation
- Super-Constant-Pass Streaming Lower Bounds for Reachability
- Awesone Neural ODE
- Neural Differential Equations
- Generative Adversarial Networks and TF-GAN
- Fairness and Bias in Algorithmic Decision-Making
- A Learned Sketch for Subgraph Counting
- Learning over sets, subgraphs, and streams
- Categorical Reparameterization with Gumbel-Softmax
- A Deterministic Parallel APSP Algorithm and its Applications
- How Do You Generate Synthetic Data?
- Synthetic Tabular Data Generation
- Decremental All-Pairs Shortest Paths in Deterministic Near-Linear Time
- Streaming Lower Bounds for Reachability
- Learning Deep Matrix Factorizations Via Gradient Descent
- NVIDIA Omniverse and a Future of Shared Worlds
- Introduction to Graph Neural Networks
- CVPR 2021 Workshop on Autonomous Vehicles
- Pytorch-BigGraph; A Large Scale Graph Embedding System
- Unsupervised Intelligent Agents
- Uncertainty, causality and generalization
- Illustrated Guide to RNN, LSTM, and Transformers
- Banach Space Representer Theorems for Neural Networks
- Next-Generation Recurrent Network Models
- Graph Representation Learning for Drug Discovery
- PostgreSQL Optimizer Methodology
- PPRGo-Scaling Graph Neural Networks with Approximate PageRank
- The Energy-Based Learning Model
- But how does bitcoin actually work?
- Explainable AI Cheat Sheet - Five Key Categories
- Digital Twins and Geospatial Data
- Trends in Traffic Prediction
- Understanding Competencies
- Scaling Up Graph Neural Networks to Large Graphs
- Inductive Logic Programming
- Useful Resources for Traffic Prediction
- A Video Game That Looks Like Reality!
- Spatio-Temporal Graph Transformer Networks for Pedestrian Trajectory Prediction
- New lower and upper bounds for quantile summary algorithms
- Data Summarization for Machine Learning
- Position aware Graph Neural Networks
- Instacart Data Description
- Fast Counts on Big Data Systems
- MonetDB; Scale Up Before You Scale Out
- Kronecker Graphs, Data Generation, and Performance
- AI+VR; The Kayfabe Life
- Efficient Query Processing Using Machine Learning
- How to use mathcha math online editor
- Uncertainty estimation in supervised learning
- Approximately Bayesian Ensembling
- Efficient Data Processing on Modern Hardware
- AI Bias and Fairness
- Generative Models for Graphs
- Evidential Deep Learning and Uncertainty
- Learned Approximate Query Processing
- Meta Learning Tutorial
- Deep Learning Outline
- Transfer Learning
- Approximate nearest neighbor search in high dimensions
- Learning-Based Sketching Algorithms
- Deep Learning Under Distribution Shift
- Principles for Tackling Distribution Shift
- Scalable Graph Neural Networks with Deep Graph Library
- Deep InfoMax
- Introduction to Bandits in Recommender Systems
- Our paper has been published in IEEE Access!
- Constant Girth Approximation for Directed Graphs in Subquadratic Time
- Conditionally optimal approximation algorithms
- Sparsification of Graphs and Matrices
- How to Write a Great Research Paper
- Vision Transformer - Keras Code Examples
- Vision Transformer
- MLP-Mixer; An all-MLP Architecture for Vision
- From AlphaGo to MuZero
- Moral Machines; How culture changes values?
- Generative Adversarial Networks
- Hardness of Approximation Between P and NP
- GPT-3 = Generation Pre-trained Transformer 3
- Graph Attention Networks Implementation
- Introduction to Pytorch Geometric
- Deep Learning for Graphs
- Geometric Deep Learning
- Graph Neural Networks - Models and Applications
- Matching of Matching Graphs
- Graph Classification using Structural Attention
- Approximate Personalized PageRank on Dynamic Graphs
- GCNs for Web-Scale Recommender Systems
- Inductive Representation Learning on Large Graphs
- Write Research Paper Outline
- How to Respond to Reviewer Comments
- Limitations of Graph Neural Networks
- Choice of Graph Representation
- Inductive Representation Learning on Temporal Graphs
- Applications of Graph ML
- Machine Learning with Graphs
- How to Review a Research Paper
- Approximate standing queries on Stream Processing
- Learning Mesh-Based Simulation with Graph Networks
- Temporal Graph Networks for Deep Learning on Dynamic Graphs
- Graph Neural Networks and Knowledge Graph Completion
- Parallel and Distributed Computing in Python with Dask
- Reducing Communication in Graph Neural Network Training
- Mathematical Foundations of the GraphBLAS and Big Data
- Graph Signal Processing - Accounting for Geometry in Data
- Graph Signal Processing for Machine Learning Applications
- Papers on Recommendation Systems
- Representational Power of Graph Neural Networks
- Tunable Stream Graph Embeddings at Scale
- Graph Neural Networks
- Graph Convolutional Networks using only NumPy
- Temporal Graph Networks for Dynamic Graphs
- Graph Algorithms as Matrix Vector Products
- Distributed Machine Learning with Python
- Deep Networks Are Kernel Machines
- How the Quest for the Ultimate Learning Machine Will Remake Our World
- Fast Distributed Algorithms for Girth, Cycles and Small Subgraphs
- Graph Sketching, Streaming, and Sampling
- Theoretical Foundations of Graph Neural Networks
- Efficiently Answering Span Reachability Queries
- Probabilistic Reachability for MDPs
- How computer science can help fight COVID-19
- Enable Digital Transformation with Big Spatial Data & Analytics
- Online Approximation Techniques for Spatial Data
- Sparse Matrices Beyond Solvers
- Research Resources for Approximate Query Processing
- Research Directions for Sketching Algorithms
- The Forward-Backward Embedding of Directed Graphs
- The Dynamics of Active Sensing in Social Networks
- Large-scale Graph Representation Learning
- Graph Embedding
- Individual and Collective Graph Mining
- New Machine Learning Tool Tracks Urban Traffic Congestion
- Network Analysis
- Getting Started With Network Datasets
- Bourgain's Embedding
- High Accuracy Protein Structure Prediction Using Deep Learning
- Testing Correlation of Unlabeled Random Graphs
- Designing Effective Scientific Presentations
- Quick & Simple Call Graphs in Python
- Latent Network Summarization
- Asymptotic Spectral Distributions for Strongly Regular Graphs
- Graph Convolutional Neural Networks for Molecule Generation
- Deep Learning on Graphs
- Principles and Applications of Relational Inductive Biases
- Traffic Prediction with Advanced Graph Neural Networks
- Papers on Graph Neural Networks
- Self-Driving Databases
- Network Representation Learning
- Non-Volatile Memory for Database Management Systems
- Related Papers on Graph Neural Networks
- Finding a New Knowledge
- Beautiful Elastic Simulations, Now Much Faster!
- Approximate Nearest Neighbor Search in High Dimensions
- Billion-scale Approximate Nearest Neighbor Search
- Spatio-Temporal Traffic Prediction Using Deep Learning
- Machine Learning Dissertations
- Research Day
- CS Theory Toolkit
- Deep Learning with PyTorch
- Graph Signal Processing
- Deep Learning Lectures
- Convolutional LSTMs for Video Prediction
- Research Proposal
- Nonlinear Dimensionality Reduction
- What Data Platforms Can Do for Machine Learning Workloads
- AI for Neuroscience and Neuroscience for AI
- Optimal Join Algorithms meet Top-k
- Machine Learning Projects Against COVID-19
- Antistrong Digraphs
- Reflections from the Turing Award Winners
- Recent Progress in High-Dimensional Learning
- Dynamics of Real Networks - Patterns and Algorithms
- Chasing the COVID-19 Pandemic through Modeling
- This AI Removes Shadows From Your Photos!
- Deep Generative Models for Brain Reading
- Setting Up Your Kubeflow ML Environment
- Statistical Learning Theory for Modern Machine Learning
- Deep Learning for Computer Vision
- Big Ideas in Causality and Machine Learning
- AI for physics & physics for AI
- A Blueprint of Standardized and Composable Machine Learning
- Random Walks on Directed Graphs
- Neural Ordinary Differential Equations
- A Spatio-Temporal U-Network for Graph-structured Time Series Modeling
- Variational Graph Recurrent Neural Networks
- Influence Analytics in Graphs
- Learning from Limited Labeled Data
- State of the Industry of Machine Learning Platforms
- Fundamentals of NumPy
- Fundamentals of PyTorch
- Towards Lifelong Learning Machines (L2L)
- Graph Attention Networks
- Learning Equivariant and Hybrid Message Passing on Graphs
- Adversarial Examples and Human-ML Alignment
- Graph Nets; The Next Generation
- Meta-learning
- Meta-learning of Optimizers and Update Rules
- Efficient Deep Learning with Humans in the Loop
- Generalizable Autonomy for Robot Manipulation
- Conditional Channel Gated Networks for Task-Aware Continual Learning
- Transfer/Low-Shot/Semi/Unsupervised Learning
- Mathematical Modeling of Epidemics
- How Behavior Spreads
- Network Diffusion & Contagion
- Explainable AI - Methods, Applications & Recent Developments
- Graph Representation Learning
- The Feature Store
- An Introduction to Meta Learning
- Explainable AI
- Design of BigQuery ML
- Improving Generalization by Self-Training & Self Distillation
- Neurosymbolic AI
- Task dependent adaptive metric for improved few-shot learning
- Variational Continual Learning
- Lifelong and Continual Learning
- The Power Of BI (Business Intelligence) in Digital Marketing
- Urban Computing
- Urban Computing with Advanced Visualization
- IEEE ITEC Best Presentation Award
- Transfer Learning - Part II
- Transfer Learning - Part I
- DeepWalk-Turning Graphs Into Features via Network Embeddings
- Graph Embeddings
- Deep Learning on Graphs
- Hybrid Grains
- Graph Signal Processing
- Learning to Learn
- Exploring And Attacking Neural Networks With Activation Atlases
- Miracles of Algebraic Graph Theory
- A History of Spectral Graph Theory and its Applications
- The JPEG Compression Algorithm
- Transfer Learning
- Geometric Deep Learning Papers
- Controlling Traffic with Reinforcement Learning
- Ian Goodfellow-Generative Adversarial Networks (GANs)
- Notes on Graph Theory
- Metric Learning and Manifolds
- Awesome Knowledge Distillation
- The Power of Self-Learning Systems
- Variational Inference-Foundations and Modern Methods
- 3D Deep Learning
- Machine Learning on Geometrical Data
- Hough Transform for Lines
- Deep Compression
- Model Compression Papers
- Deep RL Bootcamp
- Natural Policy Gradients, TRPO, PPO
- Deep Learning for Recommender Systems
- Awesome Knowledge Distillation
- Knowledge Distillation
- Distill and transfer learning for robust multitask RL
- Geometric Deep Learning
- Visualizing and Understanding Generative Adversarial Networks
- Trust Region Policy Optimization
- Generative Adversarial Imitation Learning
- Google AI's Take on How To Fix Peer Review
- Towards Generalization and Efficiency in Reinforcement Learning
- Acceptance Rate of AI Conferences
- PyTorch Geometric
- SQL Query Optimization Meets Deep Reinforcement Learning
- Time Series Forecasting using Statistical and Machine Learning Models
- BMM Summer Course 2018 Slides
- A Style-Based Generator Architecture for GAN
- Multi-Agent Reinforcement Learning
- What Makes a Good Image Generator AI?
- Automatic Machine Learning
- Generalizing Convolutions for Deep Learning
- Unsupervised Deep Learning
- Manifold Learning Yields Insight into Complex Biological State Space
- Project Malmo – a platform for fundamental AI research
- Keras vs. PyTorch
- Variational Inference - Foundations and Innovations
- The GAN Zoo
- Beautiful Layered Materials, Instantly
- Speed at Scale - Using GPUs to Accelerate Analytics for Extreme Use Cases
- SpatialHadoop - A MapReduce Framework for Big Spatial Data
- From Deep Learning of Disentangled Representations to Higher-level Cognition
- NVIDIA - AI for Generating High-Resolution Images
- Neural Image Stitching And Morphing
- Anomaly Detection for Real-World Systems
- Anomaly Detection
- What Explains Perception in the Brain?
- Applied Time Series Econometrics in Python and R
- Time Series Analysis with Python Intermediate
- Two Effective Algorithms for Time Series Forecasting
- AI Learns Painterly Harmonization
- Deep Learning with Ensembles of Neocortical Microcircuits
- A Neural Network Model That Can Reason
- 누구나 할 수 있는 엄청난 몰입력의 5가지 단계
- This AI Reproduces Human Perception
- 미루는 버릇을 완전히 없애주는 '2분짜리 간단한 습관'
- From Generative Models to Generative Agents
- World Models
- What Can Machine Learning Do? Workforce Implications
- Introduction to AI for Video Games
- 3D Generative-Adversarial Modeling
- Generative Models
- The Design of Traffic Flow Prediction System Based on Multimodal Data
- AI Photo Translation
- Cluster Analysis
- Deepmind AlphaZero - Mastering Games Without Human Knowledge
- NIPS Paper Evaluation Criteria
- Tutorial on GANs
- Equilibrium points in n-person games
- Machine Learning Reports
- Gaussian Material Synthesis
- Sequence Generative Adversarial Nets with Policy Gradient
- The Robot Revolution - The New Age of Manufacturing
- Evolutionary Generative Adversarial Networks
- Time Series Generation with Recurrent Conditional GANs
- Predicting Car Prices using Neural Networks
- Predicting Car Prices - Linear Regression
- Deep Reinforcement Learning
- Brains, Minds and Machines Summer Course
- OpenAI Meta Learning and Self Play
- Time Series Analysis
- Using Support Vector Machines as Flower Finders
- SARSA vs Q-learning
- Building a Neural Network Using the Iris Data Set
- Classification using Decision Trees in R
- The Long-term of AI & Temporal-Difference Learning
- Leeds annual traffic growth
- One pixel attack for fooling deep neural networks
- Visual Studio Code, CMake and LLDB
- Extreme Team
- Visual Studio Code - C++ Development
- Learning From Scratch
- Cellular Automata
- Radial basis function (RBFs)
- Karush-Kuhn-Tucker Condition
- Boundary Equilibrium Generative Adversarial Networks
- GAN Tutorials
- How To Read a Scientific Research Paper
- UML Tools for Python
- Introduction to gRPC
- Design Patterns in Python
- How to Write a Research Plan
- A Modern Toolkit for Microservice Communication
- All about the GANs
- Predicting Congestion with Tensorflow
- Sequential Implementation of Mesoscopic Simulation
- Processing 100 Billion IoT Events
- Predicting congestion on London's roads with TensorFlow
- Improving Traffic Prediction Using Weather Data
- Limitations of Macroscopic and Mesoscopic Simulation
- Simulation Loop
- Real-Time Data Processing with Apache Flink
- Mesoscopic Simulation
- Mesos Installation
- Building Mesos
- Traffic Prediction
- An Introduction to Protobufs
- Taking the I out of PaaS with Apache Mesos
- Software Projects Built on Mesos
- Distributed Deep Learning on Mesos
- Apache Mesos and GPUs
- Apache Mesos
- Distributed Systems
- High-Performance Mesoscopic Traffic Simulation
- Microscopic Simulation
- Macroscopic Simulation
- Traffic Simulation Background
- Pruning Makes Faster and Smaller Neural Networks
- NIPS 2017 Proceedings
- Distributed TensorFlow
- Using Apache Arrow, Calcite and Parquet to build a Relational Cache
- Data Vocalization
- T tree
- This Autonomous Robot Models Your House Interior
- Reproducibility in Deep Reinforcement Learning
- Google's Text Reader AI
- Conferences 2018
- What Will We Do With Quantum Computing?
- Road Transport Systems
- SLAC Dataset From MIT and Facebook
- OSPRay
- Streaming Through Caches
- GPU Architectures and New Programming Model Features
- Computer Architecture Essentials
- DeepMind Control Suite
- Cellular Automata
- Data Mining Lecture Slides
- What is Quantum Computing?
- AI Research Trends
- Data-Driven Mesoscopic Simulation Models
- Introduction to Geometric Deep Learning
- Geometric Deep Learning Paper and Code
- Geometric Deep Learning
- Neuromorphic Chip
- Pair Programming Tips
- Building Machines That Learn and Think Like People
- TensorTraffic
- Graph Partitioning
- Universal Neural Style Transfer
- Testing Software in the Age of Machine Learning
- Effective TensorFlow for Non-Experts
- Gartner Top 10 Strategic Technology Trends 2018
- Transferring AI To The Real World (OpenAI)
- Natural Language Processing with Deep Learning
- Practical Machine Learning Tutorial with Python
- DeepMind's AI Learns Superhuman Relational Reasoning
- Self-Illuminating Explosions
- Multi-Agent Reinforcement Learning
- Knowledge Based Artificial Intelligence Part 3 ~ 5
- Introduction to Graph Thinking
- Knowledge Based Artificial Intelligence Part 2
- Knowledge Based Artificial Intelligence Part 1
- Advanced Analytics with Spark
- AI Learns to Synthesize Pictures of Animals
- Latent Dirichlet Allocation Concept
- Mining Massive Data Sets
- An Efficient Scattering Material Representation
- Leveraging Activation Sparsity for Training Deep Neural Networks
- Computability, Complexity, and Theory - Georgia Tech
- Deep Photo Style Transfer
- Reinforcement Learning Course - Georgia Tech
- Digital Transformation in a Connected World
- What is Graph Analytics?
- Gunrock - GPU Graph Analytics
- Graphs in Machine Learning
- AI Creates 3D Models From Faces
- Social and Information Network Analysis
- Structural Analysis and Visualization of Networks
- Learning Local Geometric Descriptors from RGB-D Reconstructions
- Machine Learning APIs by Example
- Distributed Optimization via Alternating Direction Method of Multipliers
- Learning and Adaptation
- A modular GPU raytracer using OpenCL for non-interactive graphics
- Semantic Scene Completion from a Single Depth Image
- Latent Space Model for Road Networks to Predict Time-Varying Traffic
- Reinforcement Learning
- Deep Reinforcement Learning
- Analysis and Prediction of Spatiotemporal Traffic Congestion
- Reinforcement Learning Papers
- Technological Singularity
- 학교와 직장의 차이점
- Three Applications of Deep Learning in Big Data Analytics
- Optimization for Machine Learning
- Unsupervised Representation Learning
- Generalization and Equilibrium in Generative Adversarial Networks
- How to train a GAN
- Generative Adversarial Networks (GANs)
- Real-Time Oil Painting on Mobile Hardware
- Deep Advances in Generative Modeling
- A Guide to Edge IoT Analytics
- Edge Accelerated Web Platform
- Edge Computing - IoT and Thinking Machine
- Top AI and Machine Learning Conferences in 2017
- Fog Computing vs. Edge Computing
- Tell Me About Yourself - A Good Answer to This Interview Question
- From Big to Now - The Changing Face of Data
- Fast Data at Internet Scale with Amazon ElastiCache for Redis
- Fast Data, Fast Monitoring
- Geometric Detail Transfer
- Generative Adversarial Networks - Part III
- Generative Adversarial Networks - Part II
- Image Editing with Generative Adversarial Networks
- Generative Adversarial Networks - Part I
- Can we build AI without losing control over it?
- The Review Process for NIPS 2016
- PhD Interview Questions
- Guidelines for Writing a Good NIPS Paper
- Graph Streams and Sketches
- CNNs in Practice
- Proceedings of The ICML 2016
- Acceptance Ratio Statistics for Computational Intelligence Conferences
- Machine Learning Papers in AAAI Conference 2017
- Three things you need to know about machine learning
- Artificial Intelligence 3.0 – Third Wave of AI
- Wide & Deep Learning; Memorization + Generalization with TensorFlow
- Keynote - TensorFlow Dev Summit 2017
- Recent Results in Cloud Robotics
- Cloud of Things Overview
- Theory, Practice, and Standardization of Eye-tracking Technology
- Google Developer Day Keynote
- Understanding 3D Shapes With AI
- Occam's Razor
- Blockchain Explained
- Massive Exploration of Neural Machine Translation Architectures
- 5 Technology Trends in 2017 that Change Your Way of Living
- The Evolution of Artificial Intelligence
- Space-Time Video Completion
- How Deep Neural Networks Work
- Stable Neural Style Transfer
- GPU accelerated population annealing algorithm
- Learning Sound Representations from Unlabeled Video
- How to Improve your Clarity of Thought
- Modeling Human Communication Dynamics
- Improving the Neural GPU Architecture for Algorithm Learning
- How to Build an Intelligent Machine
- Introduction to Deep Learning
- Ten Data Acquisition Strategies for Startups
- Automatic Creation of Sketch Tutorials
- Deep Voice; Real-time Neural Text-to-Speech
- Billion-scale similarity search with GPUs
- AI Makes Stunning Photos From Your Drawings
- Data, not algorithms, is key to machine learning success
- The current state of machine intelligence 3.0
- Optimizing Deep CNN-Based Queries over Video Streams at Scale
- Gibbs Sampling-Probabilistic Graphical Models
- Real-time Fiber-level Cloth Rendering
- AI; The New Electricity
- Large-Scale Stochastic Learning using GPUs
- Shape and Material from Video
- Artificial Intelligence
- How to Prepare and Deliver a Great TED talk
- Learning to Fill Holes in Images
- Calendar of Computer Graphics Events
- Proceedings of the VLDB Endowment
- Journals and Conferences related to Artificial Intelligence
- AI Builds 3D Models From Images With a Twist
- Major Conferences and Journals about Machine Learning
- Most Cited Deep Learning Papers
- Data Preparation for Machine Learning
- How to Prepare Data For Machine Learning
- IoT, Azure, Machine Learning and more
- How Do Hollywood Movies Render Smoke?
- IBM Watson IoT Platform
- Revisiting Reuse in Main Memory Database Systems
- What's New with NewSQL?
- Spark-GPU; An Accelerated In-Memory Data Processing Engine on Clusters
- Fast Photorealistic Fur and Hair With Cone Tracing
- Big Data and Smart Data
- Deep Natural Language Processing
- What is Artificial Intelligence?
- Universe
- Game AI Development with OpenAI Universe
- Deep Reinforcement Learning - Pong from Pixels
- GPU SQL Query Accelerator
- Multi-core Parallelism in a Column-store
- How Intelligent Machines Learn to Make Sense of the World
- Enhance Super Resolution From Google
- Large-Scale Fluid Simulations On Your Graphics Card
- AI Makes 3D Models From Photos
- Attention Transfer
- Text Style Transfer
- IBM Watson-The Power of Artificial Intelligence
- Adaptive Machine Learning for Real-Time Streaming
- AI and Deep Learning at Amazon
- Datasets for Deep Learning
- DataSets and Machine Learning
- Open Data for Deep Learning
- Deep Learning Program Hallucinates Videos
- Reinforcement Learning
- Deep Reinforcement Learning
- Introduction to Deep Learning and Self-Driving Cars
- Deep Learning Resources
- Transfer Learning and Fine-tuning Deep Convolutional Neural Networks
- Amazing Slow Motion Videos With Optical Flow
- Machine Learning Algorithms
- Gunrock
- Neural Network Learns The Physics of Fluids and Smoke
- Recurrent Neural Networks, Image Captioning, LSTM
- The Unreasonable Effectiveness of Recurrent Neural Networks
- Understanding LSTM Networks
- Self-Driving Database Management Systems
- Stunning Video Game Graphics With Voxel Cone Tracing
- Papers that have used pbrt
- Mathematical Methods for Robotics, Vision, and Graphics
- Image Synthesis From Text With Deep Learning
- PBRT Installation on macOS
- Water Wave Simulation with Dispersion Kernels
- Crumpling Sound Synthesis
- Introduction to "Physically Based Shading in Theory and Practice"
- Multiphase Fluid Simulations
- Graph Mining for Log Data
- Sound Propagation With Bidirectional Path Tracing
- Understanding Multitenancy and the Architecture of the Salesforce Platform
- 3D Printing Acoustic Filters
- Probabilistic Ranking Problem
- Data Blocks; Hybrid OLTP and OLAP on Compressed Storage using both Vectorization and Compilation
- Human Pose Estimation with Deep Learning
- WaveNet by Google DeepMind
- SIGGRAPH 2016 Papers on the Web
- PVLDB Papers
- Progressive skyline computation in database systems
- Proceedings of the VLDB Endowment, Volume 10
- IBM Watson Health
- Large Scale Machine Learning
- In-core computation with HyperBall
- 소프트웨어 개발의 이해를 돕기 위한 비유
- 5 Ways to Make Your Hive Queries Run Faster
- TensorFlow Introduction
- Vectorized Executor
- Skyline Operator
- Fundamentals of Deep Learning
- TensorFlow Tutorial
- TensorFlow Examples
- Deep Learning Reading List
- Modeling Colliding and Merging Fluids
- Convolutional Neural Networks for Visual Recognition
- Surface-Only Liquids
- Top 10 Strategic Technology Trends 2016
- IELTS & TOEFL Writing - 5 Common Mistakes
- How to read the genome and build a human being
- The Art of Neural Networks
- Multithreaded Python with OpenMP
- Artistic Style Transfer For Videos
- SIGGRAPH 2016 – Technical Papers Trailer
- Deep Reinforcement Terrain Learning
- Faster In Memory Computing with No Deadlocks
- The Intel® Deep Learning Framework
- Self-Taught Learning and Unsupervised Feature Learning
- Compiling CUDA C/C++ with LLVM
- Separable Subsurface Scattering
- Deep Learning - Intelligence from Big Data
- The Emergence of Converged Data Platforms and the Role of In Memory Computing
- Real-Time Shading With Area Light Sources
- Parallel Data Processing
- Apache Ignite In-Memory Data Fabric
- Smart Data Discovery
- Creating an SPMD Vectorizer for OpenCL with LLVM
- How DeepMind's AlphaGo Defeated Lee Sedol
- The Next Step in GPU-Accelerated Deep Learning
- Convolutional Neural Networks
- Face2Face; Real-Time Facial Reenactment
- Embree; High Performance Ray Tracing Kernels
- Awesome Deep Learning
- Training Deep Neural Networks With Dropout
- Advanced Operating Systems
- Machine Learning Courses
- Deep Learning Terms
- Narrow Band Liquid Simulations
- Introduction to HDFS Erasure Coding in Apache Hadoop
- Deep Learning for Database Performance Optimization
- No Such Thing As Artificial Intelligence
- Introduction to LLVM
- Min-cost Flow Network Algorithms
- Elasticity with YARN
- Breakthrough Processor and System Design with SPARC M7
- Thread Local Storage
- Life Beyond Column-Stores - Exploiting intra-cycle parallelism
- gpucc; an open-source GPGPU compiler
- Knowledge Curation and Knowledge Fusion
- OLTPBench
- CrowdSky; Skyline Computation with Crowdsourcing
- Photos @ EDBT 2016
- 3D Depth From a Single Photograph
- Automatic Source Transformation from C++ to CUDA using Clang/LLVM
- Efficient Computation of Containment and Complementarity in RDF Data Cubess
- Data Processing in Modern Hardware
- Self-Curating Databases
- Verification of Evolving Graph-structured Data under Expressive Path Constraints
- Top-k Indexes Made Small and Sweet
- Dynamic Graph Queries
- Worst-Case Optimal Algorithms for Parallel Query Processing
- Declarative Probabilistic Programming with Datalog
- Lightning Fast Social Media Analytics
- Research Papers in Computer Graphics
- Statistics-Driven OLAP Acceleration using Query Column Sets
- Readings in Physically Based Rendering
- 9 Cool Deep Learning Applications
- Concurrency Control and Recovery for Search Trees
- Metropolis Light Transport
- How DeepMind Conquered Go With Deep Learning
- 자유가 없는 나라의 공부
- Skip List
- Transactional Memory in Practice
- DB2 for z/OS Best Practice; Locks and Latches
- Database Recovery - Basic Concepts
- Latches
- Apache Calcite Overview
- A Better B-Tree in Hekaton
- The Bw-Tree; A B-tree for New Hardware
- Hekaton Research
- Distributed Transactions
- ACID Isolation Level
- Notes About Memory Allocation Redesign
- Postgres Modifications
- The Databaseology Lectures
- C++ Arena Allocation Guide
- Region-based Memory Management
- Redmine Plugin Installation
- PostgreSQL
- Zeppelin Tajo Interpreter
- Data Visualization
- What is VoltDB?
- Release Notes - Tajo - Version 0.11.1
- Peloton Architecture
- Vagrant Tutorial
- Introduction to Vagrant
- Insert-Only Implementation Strategies
- A Hands-free Adaptive Store
- The Bw-Tree Key-Value Store
- Bitmap Indices
- IBM Cognos Analytics
- Understanding the SQL Server Columnstore Index
- Main Memory Databases
- Dictionary-based Order-preserving String Compression
- Hekaton Breaks Through
- The Merits of Data Compression
- DBMS Components
- Compensating Transaction
- In-Memory Column Store Architecture Overview
- Advantages & Disadvantages of an In-Memory Database
- Latches, Spinlocks, and Lock Free Data Structures
- Research Topics in Database Management
- Main Memory DBMS
- HYBRID WORKLOADS AND HTAP
- Peloton
- Database Systems Course
- Readings in Parallel Computer Architecture
- Lecture 0 - Course Description and Logistics
- Parallel Computer Architecture Course
- Gartner Top 10 Tech Trends for 2016
- CES 2016 Keynote
- MVCC Overview
- SAP HANA
- Work Breakdown Structure (WBS)
- HYRISE
- Main Compensation and Delta Compensation
- Aggregate Cache
- MonetDB Source compilation on OS X
- SDN and Big Data
- IEEE NetSoft 2016 Call for Demonstrations
- Connected Components in MapReduce and Beyond
- Analytics in Motion (AIM)
- Big Data is (at least) Four Different Problems
- Papers in Parallel and Distributed Data Management
- Real-time Big Data Analytics
- Wearable Technology Trends in 2016
- Telling a Story
- The Scope of a Paper
- Queries on Compressed RDDs
- HiBench Suite
- Map-D
- Clang MapReduce
- Rethinking the Role of CPUs in Modern Computers
- MultiQx-GPU
- Concurrent Analytical Query Processing with GPUs
- Where are the GPU based SQL databases?
- Comparing Relational Databases, Memory Cache, and NoSQL
- Oracle In-Memory Database Cache Overview
- MapGraph
- Central Office Re-architected as a Datacenter (CORD)
- Multiprocessor Synchronization
- Software Engineering Principle 3
- Software Engineering Principle 2
- Software Engineering Principle 1
- In-Memory Accelerator for Hadoop
- Main Memory and Streaming Databases
- Building Postgresql on OS X
- Think Fast, Talk Smart
- Unum Computing
- Introducing TPCx-HS Benchmark for Big Data
- Building LuxRender on OS X
- Calendar of Computer Graphics Events
- TPC-H
- LaTeX Fonts on Mac OS X
- Setting Up An Organized Folder Structure
- TPC-DS
- Proceedings of the VLDB Endowment, Volume 9
- In-Memory Data Management Course
- Research Posters 101
- Machine Driven, Human Guided Data Curation for Decision Advantage and Action
- A GPU Parallelized Database Accelerator
- Twitter Analysis with Apache Spark and IBM Watson
- Rethinking SQL for Big Data with Apache Drill
- Organizational Charts
- Databaseology Lectures
- KIWI Installation
- IntelliJ IDEA 15
- Maven Support in IntelliJ IDEA
- Zeppelin Deployment
- Zeppelin Structure
- The Pushdown of Everything
- Machine Learning as the Key to Personalized Curation
- 22nd ACM SIGKDD Conference
- What is Zeppelin Interpreter
- Writing-up Qualitative Research
- Zeppelin Installation
- MySQL Installation on Ubuntu
- PVLDB 2015 Vol. 9
- Data Engineering Course
- Apache Spark vs. MapReduce
- Database Reading List
- Big Data, Fast Data The Need for In Memory Database Technology
- Arrary DBMS
- Arrary Databases
- Michael Stonebraker's Talk
- Running Hadoop MapReduce on Tachyon
- CUDA LLVM Compiler
- Stanford Compilers Course
- Extreme Scale Computing
- Unit Test using specs
- Scala to Java
- Introduction to LLVM
- MySQL Installation on Mac OS X
- Computer Architecture Recommended Readings
- Introduction to Computer Architecture
- What is HTAP?
- A Taste of Random Decision Forests on Apache Spark
- ADAM Project
- Privacy as a Practice in Mobile App Development
- Lambda Expression in C++
- Spark - The Ultimate Scala Collections
- BDAS Software Stack
- Basic ONOS Tutorial
- Spark Cluster Configuration
- Legion Bootcamp
- Sample Data
- Building Tachyon Master Branch
- Creating Applications using Spark Streaming
- Creating a Spark Applications
- Working with RDD Operations
- How To Read A Paper
- RAMCloud Presentations
- Teaching creative computer science
- Paper Writing Strategy
- Paper Writing
- BlinkDB Review
- RAMCloud Training
- Kudu - New Apache Hadoop Storage for Fast Analytics on Fast Data
- GridGain In-Memory Data Fabric
- Discardable Memory and Materialized Queries
- 2014 ACM A.M. Turing Lecture
- What is the Future of Hadoop?
- Sparrow Paper Review
- Spark Architecture Shuffle
- Explore In-Memory Data Store Tachyon
- Tachyon Overview
- ACM TODS Manuscript Preparation
- Perform SQL Selects on R Data Frames
- R Graphics
- Google's R Style Guide
- Approximate Aggregation Queries in Presto
- Off-heap Memory in Apache Flink and the curious JIT compiler
- AMP Camp 6
- Spark 1.5 DataFrame API Highlights
- Fundamentals of Parallel Computing with the GPU
- Text Mining and Knowledge Graphs in the Cloud
- Knowledge Graph and Cognitive Computing
- Using Presto to combine data from Hive and MySQL
- IBM Watson at Work
- Using HyPer with PostgreSQL Drivers
- Knowledge Graph
- Big Data and Machine Learning - Keys to AI Progress
- Distributed Computing Conferences
- IEEE Transactions on Knowledge and Data Engineering
- Project Tungsten
- Advantages of Functional Programming
- Building Good Docker Images
- Optimizing Big Data Through Curation
- SparkR Setting for RStudio
- SparkR
- Cognitive Computing
- Spark Lab 3
- Spark Lab 2
- Diving into Spark Streaming's Execution Model
- SAP S4 HANA Technical White Board
- Using Electronic Health Records for Better Care
- 6 Big Data Analytics Use Cases for Healthcare IT
- Four Use Cases for Healthcare Predictive Analytics, Big Data
- Five open source Big Data projects to watch
- Reservoir Sampling in MapReduce
- Clinical Data Repository Versus a Data Warehouse
- Top 50 Data Science Resources
- High-throughput, low-latency, and exactly-once stream processing with Apache Flink
- Hyper
- From Big Data to Fast Data
- How Google Works
- SAP Hana In-Memory Explained In Nine Minutes
- Business unIntelligence
- The Challenges of Data Science
- How Data Science Driven Software is Eating the Connected World
- Data Curation as Publishing for the Digital Humanities
- How WellCare Accelerated Big Data Delivery to Improve Analytics
- Sharing Experiences in Cloud Adoption
- IBM Hybrid Transaction Analytic Processing HTAP
- What is Hybrid Transaction/Analytical Processing (HTAP)?
- CIOs and the Ongoing Data Persistence Struggle
- Eight Big Trends in Big Data Analytics
- Hybrid Trasansaction/Analytical Processing (HTAP)
- Hybrid Trasansactional/Analytical Processing (HTAP)
- HTAP and The Future of Data
- Does ethical content curation exist? A data-driven answer
- Stonebraker points to data curation as next integration step
- Three Approaches to Scalable Data Curation
- Summarizing
- The Landscape of Big Data, Curation
- Data Curation
- Tackling Data Curation
- OpenMOLE
- Advanced Vector Extension
- Potential Novel Applications of Open SDN
- Hybrid Unified Communications and Collaboration
- Mega-KV, A Case for GPUs to Maximize the Throughput of In-Memory Key-Value Stores
- SDN Note
- My Top 10 Assertions about Data Warehouses
- RAMCloud Overview
- RAMCloud
- Open Network Operating System
- The Virginian Database
- Function Composition
- Scala Collection
- The Analytical Bootstrap
- DAQ Introduction
- A New Paradigm for Approximate Query Processing
- Understanding the Differences between SDN and Server Virtualization
- Future of big data analytics
- Five most popular similarity measures implementation in python
- Massive Throughput Database Queries with LLVM on GPUs
- Open Daylight Technical Overview
- Hadoop + GPU Architecture
- Overview of the Apache Flink
- Apache Flink Seminar
- Lambda Architecture
- Apache MRQL Installation Instructions
- Common Misconceptions about SQL on Hadoop
- Apache MRQL
- Common Lisp using LLVM and C++ for Molecular Metaprogramming
- Big Data Challenges and Opportunities
- MapD (Massively Parallel Database) Overview
- Data Science using GPUs
- Apache Drill Introduction
- IBM Juices Hadoop With Java On Tesla GPUs
- Apache Tajo Internal
- Broadcast Join in Tajo
- Tajo Cluster Architecture
- Apache Hadoop vs. IBM Platform Symphony and Infosphere BigInsights
- How HPC is Hacking Hadoop
- High-Performance Computing Cluster
- Zeppelin Tutorial
- Teradata Bets Big on Presto for Hadoop SQL
- Introduction to Data Science with Apache Spark
- How to Read a Technical Paper
- The Internet of Things Transforms Transactions with Apache Spark
- Improving Spark for Data Pipelines with Native YARN Integration
- BlinkDB User Guide
- Rethinking SIMD Vectorization for In-Memory Databases
- Introduction to Docker
- Scala idiom - Prefer immutable data structures
- Scala - Tools and Frameworks
- Scala - Pattern Matching
- Scala - Functional/Collection
- Why Scala?
- Introduction to Scala
- Scala - Object Oriented Features
- Scala Installation
- Scala Basics
- Totally Lazy
- A SQL Query Compiler
- SIGMOD 2015 Papers
- Recent performance improvements in Apache Spark
- Bringing Spark Closer to Bare Metal
- Best SQL-on-hadoop Tool
- A well-planned kickoff meeting sets the tone for a successful project
- SBT (Simple Build Tool)
- Seven Key Elements for a Successful Kickoff Meeting Agenda
- Data Exploration Using BlinkDB
- Programming With SampleClean
- SampleClean
- Running BlinkDB Locally
- Cornell Notes
- SIGGRAPH 2015 papers on the web
- Large Graph-Mining - Power Tools and a Practitioner's Guide
- Hadoop-RDMA
- SIGGRAPH 2015 - Technical Papers Trailer
- Apache Spark - Caching and Checkpointing Under the Hood
- Tensor Decomposition
- Higher-order Singular Value Decomposition
- Smoothed Analysis of Tensor Decompositions
- Spark for Unconventional Cores
- Aparapi for Unconventional Cores
- Apache Phoenix
- Immutability
- Parallelism
- Column Orientation
- Composing and Scaling Data Platforms
- Dependency Injection
- Elements of Scale
- What is Mesos?
- Data Governance Initiative
- Requirements for Cluster Analysis
- What is Cluster Analysis?
- Cluster Analysis
- FAQ for the REEF
- Introduction to BlinkDB
- Introduction to REEF
- Readings in Data Management
- Spark with Python Notebook on Mac
- MapReduce for C - Run Native Code in Hadoop
- Spark Notebook
- Spark-GPU Cluster Development in a Notebook
- 5 Ways to Make Your Hive Queries Run Faster
- 39 Data Visualization Tools for Big Data
- How to Write a Paper Efficiently
- HIVE 0.14 Cost Based Optimizer (CBO) Technical Overview
- Setup a JavaCL Maven Project
- Deep Dive into Spark SQL’s Catalyst Optimizer
- GNUPlot Basics
- Introducing DataFrames in Spark for Large Scale Data Science
- Twitter Scalding
- Apache Zeppelin
- Transaction Processing on Confidential Data using Cipherbase
- Introduction to the Spark
- Big Spatial Data Processing using Spark
- SpatialSpark
- New Directions for Spark in 2015
- The Spark SQL Optimizer and External Data Sources API
- DataFrame and SQL Operations
- Design Patterns for using foreachRDD
- Output Operations on DStreams
- Transformations on DStreams
- DStream Sources
- Discretized Streams
- Spark Streaming Basic Concepts
- Spark Streaming Quick Example
- Spark Streaming Overview
- Useful Research Related to Apache Drill
- Google Protocol Buffer Basics - Java
- Apache Spark
- An Introduction To Software Defined Networking
- 10 ways to query Hadoop with SQL
- The Bulk Synchronous Parallel Model
- Application Interaction with the ResourceManager
- Resource Manager
- Build YARN Apps on Hadoop with Apache Slider
- Analytic Processing Environment (APE)
- Analytics vs. OLTP
- Layers of Query Processing
- Database Workloads
- Workload Types
- Detailed Characteristics of an Analytic Workload
- Characteristics of an Analytic Workload
- Workload Management for Big Data Analytics
- 11 Core Big Data Workload Design Patterns
- SDN과 NFV와 네트워크 가상화의 차이점 이해하기
- Apache Hadoop YARN Architecture
- DryadLINQ
- Database Management Systems
- Planning the Approach
- Dryad
- Join Optimization
- Map-Side Join Vs. Join
- What is map side join and reduce side join?
- Hive Architecture
- Translate from MapReduce to Apache Spark
- Some Common Parallel Patterns
- An Insight into Hadoop YARN NodeManager
- Data-based parallelism
- How to fix “System program problem detected” error on Ubuntu
- Task-based parallelism
- Concurrency
- Serial/Parallel Problems
- Multinode Computing
- Hadoop-GIS Spatial Queries
- Running a cluster of virtual machines with Hadoop
- Research skills
- Database Management Systems
- Readings in Database Systems
- SIX sparkling features of Apache Spark!
- Query Optimization Bibliography
- Use Parquet with Impala, Hive, Pig, and MapReduce
- Actian Unleashes SQL Users on Hadoop with Easy Access, Faster Performance
- Presto as a Service
- HIVE as a Service
- Understanding the world of SQL on Hadoop.
- How to Develop Hadoop Tests
- MRUnit Tutorial
- Serialization
- Keep Your Crises Small
- Creativity, Inc. Interview
- HIVE TPC-DS Benchmark
- SIGGRAPH 2014 Technical Papers
- Distributed Computing with Spark
- PostgreSQL
- Advantages and Drawbacks of Using Stored Procedures for Processing Data
- Why Oracle Big Data SQL Potentially Solves a Big Issue with Hadoop Security
- Samsung Developer Day 2014 at MWC - Samsung MultiScreen SDK
- Apache Tez Presentation
- Announcing Apache Pig 0.13.0
- Benchmarking Apache Hive 13 for Enterprise Hadoop
- In-Stream Big Data Processing
- Apache Hadoop YARN ResourceManager
- Introduction To Pivotal's Next Generation Analytic Data Lake
- War of the Hadoop SQL engines. And the winner is …?
- Exploring Pivotal HD Single Node (VM)
- How Pivotal Works
- Parquet - Columnar Storage for the People
- Pivotal Big Data Suite
- Pivotal HD Introduction & Demo
- Special "Use Cases" Big Data & Brews - Hortonworks, Pivotal, MapR
- What is Apache Drill?
- Drilling into Big Data with Apache Drill
- Fun with TPC-H (Part 1)
- Implementing Drill Math Functions
- Presto By Facebook – A SQL Engine
- Big Data - The Hadoop Data Warehouse
- Building a Data Warehouse with Hadoop
- Why does MapReduce + GPU Computing?
- Why NoSQL?
- Apache Spark Documentation
- SIGGRAPH 2014 Papers
- Google Cloud Dataflow, a Cloud-native data processing service
- Datawarehouse
- Selecting the right SQL-on-Hadoop engine to access big data
- Big Data Benchmark
- Apache Spark
- Optiq
- Adding Acid Transactions in Hive
- Yarn The Key to Overcoming the Challenges of Broad based Hadoop Adoption
- Challenges of Implementing an Advanced SQL Engine on Hadoop
- SparkR Enabling Interactive Data Science at Scale on Hadoop
- Hbase Read High Availability Using Eventually Consistent Region Replicas
- How Hadoop and Hive Change the Data Warehousing Game Forever
- Big Data Management System Smart SQL Processing
- Query Optimization Techniques and JIT based Vectorized Engine
- Pig on Tez - Low Latency ETL with Big Data
- Cost Based Query Optimization in Hive
- Low Latency SQL on Hadoop
- Building a Unified Data Pipeline in Apache Spark
- Big Data Architecture
- Comparing Information Architecture Operational Paradigms
- Big Data Architecture Capabilities
- Grid Architecture and Service Modeling
- How to prepare for SDN
- Features of Cloud and Grid Platforms
- What is SQL-on-Hadoop?
- Apache Storm Basics
- What does “100 times faster than Hive” actually mean?
- GraphX Programming Guide
- TPC Benchmark
- M7 - Native Storage for MapR Tables
- Hadoop 2.4.0 Installation on Mac OS X Mavericks
- Software-Defined Networks
- Future of the Network Documentary
- A Leap Forward With SQL on Hadoop
- Zero-Copy
- Remote Direct Memory Access
- Special "How it Works" Big Data & Brews
- Choosing distribution models
- GPU Programming for the Data Sciences
- 10 undeniable reasons people hate Big Data
- Physically Based Shading in Theory and Practice
- The Forgotten History Of CGI
- Hadoop Sector will Have Annual Growth of 58% for 2013-2020
- Why Impala Continues to Lead?
- Apache Spark Resource Management and YARN App Models
- Measures of Query Cost
- Query Processing
- Query Transformation
- Database Conferences
- Query Optimizer
- Unit Test
- Research Topics in Database Systems
- Statistics, Data Mining, Machine Learning and Artificial Intelligence
- ACID Properties of Transactions
- GTC On-Demand
- Google Spanner
- Open Source Analysis Tips
- Functional Specifications
- Column-Family Stores
- Key-Value and Document Data Models
- Introduction to Distributed Database Management Systems
- Development Tools for Big Data
- Introduction to Data Warehousing
- Apache Tajo JDBC Driver
- Date's Twelve Rules for a DDBMS
- Apache Tajo HCatalog Integration
- Spark for Data Science; A Case Study
- Intel® Xeon Phi™ Coprocessor - the Architecture
- Apache Tajo Backup and Restore
- Apache Tajo Table Partitioning
- An Insight into Hadoop Yarn Resource Manager
- Regression Analysis
- Apache Tajo SequenceFile
- 5 technologies that will help big data cross the chasm
- Apache Tajo Parquet
- How Apache Hadoop YARN HA Works
- Apache Tajo RCFile
- Apache Tajo Table Management
- Apache Tajo CSV (TextFile)
- Papers to Read
- Burndown Chart
- Apache Tajo Functions
- Apache Tajo SQL Language
- Apache Tajo Shell
- Apache Tajo™ 0.8.0 Configuration
- Apache Tajo™ 0.8.0 Released!
- Apache Tajo™ 0.8.0 Documentation
- The Stinger Initiative From HortonWorks With Hadoop
- GPU Cluster
- Bigtable
- GPU Accelerated Computing with C and C++
- Chubby
- Big Data & Brews - Ari Zilka, CTO of Hortonworks' Talk
- The Google File System (GFS)
- Stinger Intiative - Making Apache Hive 100 Times Faster
- Stinger - Interactive Query for Hive
- PageRank Algorithm
- Overview of Data Storage and Coordination Services
- Simple YARN Application
- Underlying Communication Paradigms
- PageRank History
- Overall Architecture and Design Philosophy
- Comparing Apache Tez and Microsoft Dryad
- Apache Spark and Apache Tez
- Introducing the case study - Google
- Integration Tajo with Hive
- Choosing the Right SQL-on-Hadoop
- How to Run a Simple Apache Spark App in CDH 5
- Resource Management Fundamentals
- NewSQL Vs. SQL on Hadoop
- Deep Learning
- Presto as a Service
- Apache Hadoop YARN – Related Work
- Advanced and Distributed Operating Systems
- Adding ACID to Apache Hive
- Eight SQL on Hadoop Challenges
- Computer Science @ University of Cambridge
- 11 SQL-on-Hadoop Tools
- Healthcare Does Hadoop
- Bloom Filter
- Processing streaming data in Hadoop with Apache Storm
- Interactive Query for Hadoop with Apache Hive on Apache Tez
- Introducing Apache Tez 0.4
- Big Data Genomics Sequencing
- Failures in YARN
- Failures in Classic MapReduce
- YARN Framework and Fault Tolerance
- Smart Things vs. Many Experienced Things
- Apache Hadoop YARN – Node Manager (NM)
- Hardening Hadoop for Healthcare with Project Rhino
- Apache Tajo
- Build Instructions for Apache Tajo
- Apache Hadoop YARN – Application Master (AM)
- Apache Hadoop in 2013 - The State of the Platform
- Grid Computing
- Design Objectives of Computer Clusters
- Clustering for Massive Parallelism
- Apache Hadoop YARN – Architecture Overview
- Apache Hadoop YARN – Resource Manager (RM)
- Apache Hadoop YARN – History and Rationale, Shared Clusters
- Functional Programming
- Why Functional Programming?
- Parallel Computing
- Implementing Graph-Based Applications
- Designing Graph-Based Applications
- Depth of Knowledge
- Data-intensive Computing
- Mapping and Folding and the Map-Reduce Paradigm
- Codd's 12 Rules
- OLAP vs. OLTP
- Network Topology and Hadoop
- Comparison - Ad-hoc vs Stored Procedure vs Dynamic SQL
- Ensuring the Validity, Veracity, and Volatility of Big Data
- Steve Jobs at SIGGRAPH 1995
- Operationalizing Big Data
- Modern Oil & Gas Architectures Built with Hadoop
- Using Avro in MapReduce Jobs with Pig
- Using Avro in MapReduce Jobs with Java
- Using Avro in MapReduce Jobs with Hive
- Using Avro in MapReduce Jobs with Hadoop Streaming
- MapReduce and Hadoop Papers in the VLDB
- NoSQL Databases
- MapReduce and Hadoop Algorithmic in Academic Papers
- Visualization for Data-Intensive Science
- Stanford Network Analysis Platform(SNAP)
- Gray’s Laws Database-centric Computing in Science
- Connecting to the Scientists
- Big Data Dilemma
- Conduct Research Survey
- Polyglot Persistence
- Functional versus Procedural Programming Models
- The Data Science Process
- Statistical Thinking in the Age of Big Data
- Populations and Samples
- Thought Experiment - Meta-Definition
- A Data Science Profile
- The Role of the Social Scientist in Data Science
- What is Data Science? - The Current Landscape
- Community Detection - Clustering Coefficient
- What is Data Science?
- Stream Data Processing for Performance Evaluation of the Equipment in an Offshore Plant
- Hive on Tez
- Functional Requirements of Hive on Tez Phase I
- Query Planning of Hive on Tez
- Design of Hive on Tez
- Apache Tez 0.3 Released!
- Predictive Methodologies - Analysis of Time and Space
- Predictive Methodologies - Analysis of Space
- Predictive Methodologies - Analysis of Social Networks
- Theoretical Foundations of Data Mining and Predictive Analytics
- Data Used in Predictive Policing
- Operational Challenges of Predictive Policing
- Forecast Vs. Prediction
- Recommendations
- Predictive Policing Myths and Pitfalls
- Prediction-Led Policing Process and Prevention Methods
- Facebook Presto
- Apache Drill
- Predictive Policing
- A Taxonomy of Predictive Methods
- Apache Hadoop YARN – History and Rationale
- “Big Data” Promises a Revolution
- As MapReduce fades, Apache Spark is now a top-level project
- Apache Hadoop YARN – ResourceManager
- Apache Hadoop YARN – Yet Another Resource Negotiator
- Apache Hadoop YARN – NodeManager
- Apache Hadoop YARN – Concepts and Applications
- Apache Hadoop YARN – Background and Overview
- Papers about MapReduce
- Introducing Apache Hadoop YARN
- T-Shaped Data Scientists
- Adapting to Change
- Agile Big Data
- Scales of Big Data
- The Variety of Data Scientists
- The Real-Time Big Data Architecture (RTDBA) Stack
- The Five Phases of Real Time
- Case Studies of Analyzing Data
- FINDING INFORMATION ON THE WEB
- Data Science, Engineering, and Data-Driven Decision Making
- Clustering Data Scientists
- Advanced Metasearch Engine Technology
- PredictionIO
- TDD Best practices
- A Material Point Method For Snow Simulation
- Test-Driven Development in a nutshell
- Reflections on Image-Based Modeling and Rendering
- Key Points from NoSQL Distilled
- Scrum Events
- Multi-tenant Database Systems
- Big Data in Motion
- Tez Design - Introduction
- Introduction to the Maven
- Building a Hybrid Modern Data Architecture for Apache Hadoop
- Continuous Quries Languages
- Semantics of Relations in Continuous Queries
- Continuous Quries as Views
- Native C/C++ Like Performance For Java Object Serialization
- Building the Health Information Infrastructure for the Modern ePatient
- Using ByteBuffer in Java
- Solving the Wanamaker Problem for Healthcare
- Continuous Query Semantics and Operators (Part II)
- Effective Team Membership
- Stream Windows
- Continuous Query Semantics and Operators (Part I)
- Parallel R
- Writing Input And Output Adapters
- Stream Models
- Understanding the Types of Big Data Problems
- Esper Relational Database Adapter
- Event Rendering to XML and JSON
- Esper Socket Adapter
- Esper Adapter Concept
- Confluent and Functional Persistence
- Full Persistence
- Partial Persistence
- Persistent Data Structures
- Writing a Tez Input, Processor and Output
- Synchronization among Heterogeneous Distributed Objects
- Runtime API in Apache Tez
- Re-Using Containers in Apache Tez
- Introducing Tez Sessions
- Data Processing API in Apache Tez
- Apache Tez Dynamic Graph Reconfiguration
- Apache Tez A New Chapter in Hadoop Data Processing
- Introduction to the Tez
- Data Stream Management
- In-Memory Caches
- Pattern Matching Techniques
- Sequence Diagram
- Computational Thinking
- Introduction to Transactional Memory
- PRODML Data Schema and API Specifications
- The Invention of Stream Computing
- Real-time Operational Intelligence
- JAXB Class Generation From XML Schema File using JAXB Tool
- Explaining Agile
- Introduction to Esper for Java
- The Emergence of NoSQL
- Concurrent Bug Patterns and How to Test Them
- Introduction to Bug Patterns in Java
- Real-Time Data Stream Manager (RTDSM) Interface Specification
- Modelling in Big Data
- Analyzing Data in Motion A Real-World View
- Understanding trade-offs with Brewer’s CAP theorem
- Foundational data architecture patterns
- Applications using graph representation
- N-Screen Application Development Manual
- YARN (MapReduce 2)
- Android WebView Application using Naver Open API
- Stream Data Schema Design
- Introduction to Stream Computing
- Streams Application Design
- Filter pattern
- Analytics vs. Predictive Analytics
- Streams design patterns
- The evolution of analytics
- Extreme Programming
- Characteristics of the Smart Devices
- Scrum Agile Development Process
- Agile Project Management
- Logical Communication-based Middleware for Collaboration among Smart Apps in P2P Network Environments
- VLDB 2013
- Impedance Mismatch
- The Value of Relational Databases
- Agile Processes and Scrum
- What is NoSQL?
- Korean Computer Graphics Conference 2013
- Neo4j
- Graph Databases
- NoSQL History
- The Presentation Secrets of Steve Jobs
- Paper Organization
arts and music
mathematical science
my profile
literature
evolutionary biology
liberal arts
artificial intelligence
- From Multimodal LLM to Human-level AI
- 6G Networks - A New Era of AI and Machine Learning
- Making Temporal Betweenness Computation Faster and Restless
- Multimodal Machine Learning
- A Brief Introduction to Causal Inference
- Intro to Quantum Gibbs Samplers - Algorithmic Ingredients
- Long Range Propagation on Continuous-Time Dynamic Graphs
- Graph Neural Networks and Spectral Graph Theory
- Maximum Entropy Method
- What is Agentic AI?
- Moravec's Paradox
- Invariant Learning
- How Could Machines Reach Human-Level Intelligence?
- Bayesian Inference - Parametric Inference, Predictive Inference
- Decoding on Graphs; Empower LLMs with KGs
- Data Heterogeneity Analysis for Out-of-Distribution Generalization
- Generative Retrieval
- Change of Variable Rule
- Chernoff and Hoeffding bounds
- Spectral Graph Theory
- Multimodal Large Language Models
- RL Loses Against Multi-Agent Intelligence
- DuckDB in the wild
- How to transform tabular data into a graph
- Isoperimetric Number
- Converting a Tabular Dataset to a Graph Dataset for GNNs
- Towards Neuro-Symbolic AI with Knowledge Graphs and LLMs
- Toward Understanding In-context Learning
- On large language models and transformers
- Understanding and Improving Efficient Language Models
- The Hierarchy of Needs for Training Dataset Development
- Uplift Modeling; From Causal Inference to Personalizations
- Unifying Graph Neural Networks
- LLMOps and FMOps for e-commerce
- What Are Drifts and How to Detect Them?
- Deep Learning of Dynamics and Coordinates with SINDy Autoencoders
- Introduction to Llama-agents Framework
- Building Trusted AI with LLMs
- How Domain-Specific AI Agents Will Shape the Industrial World
- Harvard Presents NEW Knowledge-Graph AGENT (MedAI)
- ParadeDB – Postgres for Search and Analytics
- Re-thinking Transformers
- XAI for LLMs; looking under the hood of Large Language Models
- Physics-informed Machine Learning for Inverse Problems
- Efficient Gaussian Splatting
- Evaluating AI models for structure-based drug design
- Discrete Flow Matching
- Understanding Large Action Model (LAM)
- How I Understand Flow Matching
- Toward a better Deep Learning
- Geometric Probabilistic Models
- Fine-tuning and Merging LLMs
- Normalizing Flows
- Improving and Generalizing Flow-Based Generative Models
- Flow Matching; Simplifying and Generalizing Diffusion Models
- A Fast, Embeddable, Modular Analytic Query Engine
- Improving equality and productivity with AI-driven tax policies
- Scaling Limits of Neural Networks
- Economic Policy Design using DRL and AI Simulations
- AI Economist - How can AI FIX taxes?
- Economics and Probabilistic Machine Learning
- Rethinking Physics Informed Neural Networks
- Differential Equations and Dynamical Systems
- High Level Overview of AI and ML in Science and Engineering
- A Trip Down the Generative Graphics Pipeline
- Decoding Google Gemini with Jeff Dean
- A Brief Introduction to Neural Radiance Fields
- SGD and Weight Decay Secretly Compress Your Neural Network
- Morris Approximate Counter
- Towards Understanding the Implicit Regularization Effect of SGD
- Language Models as World Models
- Building LLMs from the Ground Up
- Marginal-based Methods for Differentially Private Synthetic Data
- Differentially Private Synthetic Data via Foundation Model APIs
- GraphRAG; The Marriage of Knowledge Graphs and RAG
- Decoding the Human Brain – A Neurosurgeon’s Experience
- ML for High-Performance Climate and Earth Virtualization Engines
- How To Read AI Research Papers Effectively
- Building Large Language Models (LLMs)
- DuckDB – Overview and latest developments
- Unreasonably Effective AI with Demis Hassabis
- The COMPLETE TRUTH About AI Agents
- The Future of Knowledge Assistants
- Responsible AI development
- The Reparameterization Trick
- How to Read Machine Learning Research Papers
- A tutorial on on neurosymbolic techniques
- Analysis by Synthesis
- The tip and the iceberg; deep learning and embodiment
- Generating Synthetic Tabular Data That’s Differentially Private
- TableDiffusion; Generative AI for private tabular data
- DeepJoin Joinable Table Discovery with Pretrained Language Models
- Synthetic data generation with CTGAN
- Retrieval and Re-Reranking
- What Does Machine Learning Have to Offer Mathematics?
- What do tech pioneers think about the AI revolution?
- The Age of AI, Eric Schmidt
- Decoding the Decoder LLM without de code
- How to improve your writing
- The Potential for AI in Science and Mathematics
- Introduction to Self-Driving
- Towards Foundation Models for Graphs
- Energy Based Models
- Awesome-Graph-LLM
- Spark Tutorial For Beginners
- Meta AI, Limits of LLMs, AGI & the Future of AI
- Analyze Data with LLMs and Python
- DuckDB Spatial; Supercharged Geospatial SQL
- Robust Distortion-free Watermarks for Language Models
- Space is a latent sequence; A theory of hippocampus and PFC
- Privacy in Web Advertising - Analytics and Modeling
- Mining Temporal Networks
- Multimodal Pretraining and Generation for Recommendation
- Text Attributed Graph Representation Learning
- Curriculum Learning Theories, Approaches, Applications
- Neural Contextual Bandits for Personalized Recommendation
- Large Language Models for Recommendation
- Data Drift, Quality, Bias and Explainability
- Mining Exploratory Queries for Conversational Search
- Recent Advances in Image Generative Foundation Models
- Sublinear time algorithms in learning
- A Bi-metric Framework for Fast Similarity Search
- Vector Search and Embeddings in Organizations
- Modality Alignment for Multimodal Perception
- Multimodal Large Language Model Tutorial
- Probability for Computer Scientists
- AGI Levels by OpenAI, Grok 2 and Neuralink Update
- Large Multimodal Models
- Geometric Deep Learning; From Euclid to Drug Design
- Algorithmic Discrepancy Theory and Randomized Controlled Trials
- Continual Learning Beyond Catastrophic Forgetting
- Project Kuiper; Fast, Affordable Broadband
- AI for 5G Advanced Toward 6G
- AI & Machine Learning for Satellite Communications with SES
- AI for 3D Generation
- AI for Content Creation
- Make self-adjusting data structures concurrent
- Privacy Preserving ML with Fully Homomorphic Encryption
- Workshop on Urban Scene Modeling
- KDB.AI | Introduction to Vector Search
- Multimodal Learning on Graphs
- From Multimodal LLM to Human level AI
- Foundation Models for Autonomous Systems
- Using LLM Agents with Llama 3, LangGraph and Milvus
- Building Intelligent RAG Applications
- Learn PostgreSQL fast! | PostgreSQL for beginners
- The Influence of Computation on Inference
- Machine learning models of differential gene expression
- Recent Advances in Vision Foundation Models
- Illuminating protein space with generative models
- On the Generalization Capability of Temporal Graph Learning Algorithms
- Spectral Graph Neural Networks
- Audio-visual self-supervised baby learning
- From Higher-Order Graph Structures to Link Prediction and Beyond
- History and Future of AI at Data + AI Summit 2024
- How Can Privacy Exist in a Data-Driven World?
- Can Learned Indexes be Built Efficiently?
- The Platonic Representation Hypothesis
- programming ≠ coding
- SIGMOD 2024 - Alibaba Cloud PolarDB-MP
- Python RAG Tutorial with Local LLMs
- Learn RAG From Scratch
- PostgreSQL as a Vector Database
- Kernels for Graphs
- Milvus 101- Most Advanced Vector Database
- Vector Similarity Search & Indexing Methods
- Combine Your Data with LLMs with Advanced Search
- Learn How to Build Multimodal Search and RAG
- Many shades of Machine Learning Acceleration
- Charm++ and Adaptive MPI
- GenAI in Life Sciences Summit
- Grounding for Gemini with Vertex AI Search and DIY RAG
- HPC and AI/ML; A Synergistic Relationship
- Multimodal Retrieval-Augmented Generation with Gemini
- What is LangChain?
- PyTorch on Apple Silicon
- MLP vs KAN, Math, B-Splines, Universal Approximation Theorem
- Kolmogorov-Arnold Networks VS Regular Deep Learning
- Kolmogorov-Arnold Networks
- A Causal Inference Framework for Combinatorial Interventions
- Introducing GPT-4o with Large Language Models
- Zero-Shot Relational Learning on Temporal Knowledge Graphs with LLMs
- Continuous-time Graph Representation with Sequential Survival Process
- Topological Representations for Temporal Graph Property Prediction
- LangGraph 101; it's better than LangChain
- Deep Learning Under Distribution Shift
- Automating Enterprises With Foundation Models
- Score Based Models
- Quantum Computing in Practice
- DRO and Its Applications in Communication and Networking
- Data-driven and Distributionally Robust Optimization and Applications
- Mathematical Foundations of Robust and DRO
- Introduction to Distributionally Robust Optimization
- Fundamentals of Intelligent Reflecting Surfaces
- Multimodal language models for mapping the genotype-phenotype relationship
- Vector Embeddings Tutorial
- Vector Database 101 - A Crash Course
- LLM and Vector Databases - Concepts, Architectures, and Examples
- Vector Databases Explained
- Vector Database Fundamentals
- Indexing in Vector Databases
- Complete Tutorial on Vector Database
- Uncovering and Inducing Interpretable Causal Structure
- GNNs in Computational Biology
- EVO - DNA Foundation Models
- Bayesian Causal Inference & Propensity Scores
- Understanding Bayesian Statistics Without Frequentist Language
- Prof. Chris Bishop's NEW Deep Learning Textbook!
- Towards Conversational Diagnostic AI
- Anomaly Detection in Continuous-Time Temporal Provenance Graphs
- Gemini Full Breakdown + AlphaCode 2 Bombshell
- Googles GEMINI Just SHOCKED The ENTIRE INDUSTRY!
- Multimodal Prompting with a 44-minute Movie
- Approximate Maximum Matching
- Prompt Engineering, RAG, and Fine-tuning
- Building Production-Ready RAG Applications
- Entropy & Mutual Information in Machine Learning
- Towards Federated Analytics with Local Differential Privacy
- The Science of LLM Benchmarks - Methods, Metrics, and Meanings
- Formal Methods and Machine Learning
- Formal Methods and Deep Learning
- HyperPCM-Robust Task-Conditioned Modeling of Drug-Target Interactions
- Objective-Driven AI
- Explainability of Graph Neural Networks
- Why should you care about DuckDB?
- Dynamic Graph Information Bottleneck
- Diffusion Models on Sampling Rare Events
- Deep Probabilistic Models with Tractable Inference
- Nuggets of Shannon Information Theory
- Large Language Models as Copilots for Theorem Proving
- Application Development using Large Language Models
- A New 3D Digital Human Creator For All!
- 나의 문화유산답사기 30주년
- The Fifth Paradigm of Scientific Discovery
- Bringing analytical SQL directly to your Python shell
- Decoding the Information Bottleneck in Self Supervised Learning
- Information Bottleneck Theory of Deep Learning
- MultiDiffusion; Fusing Diffusion Paths for Controlled Image Generation
- In-Process Analytical Data Management with DuckDB
- Generalization in diffusion models
- Gemini Multimodal RAG Applications with LangChain
- Exciting Trends in Machine Learning
- How to Build a Metadata Driven Data Pipelines with Delta Live Tables
- Learn How to Build Real-Time Warehouses on Lakehouse
- Databricks Lakehouse Overview Training
- Simplify ETL pipelines on the Databricks Lakehouse
- Delta Live Tables A to Z; Best Practices for Modern Data
- Building Reliable ETL Pipelines with Azure Databricks
- Graph-based Virtual Sensing
- Online A/B Testing of Real-Time Event Detection Systems
- Machine Unlearning of Features and Labels
- Machine Unlearning Papers
- Ecommerce Marketing Analytics; Top 8 Metrics EXPLAINED
- Differential Privacy and Machine Unlearning
- Foundation Models on Consumer Devices
- Lakehouse with Delta Lake Deep Dive Training
- Large Language Models for Program Optimization
- Multimodal Reasoning; PaLM-E & Gemini
- Text2SQL; The Dream versus Reality
- Data Lakehouse and Data Mesh—Two Sides of the Same Coin
- Pathways Language Model and Model Scaling
- The Information Bottleneck Approaches in Deep Neural Networks
- On Mutual Information Estimation
- A Deeper Understanding of Deep Learning
- The Information Bottleneck Theory of Deep Learning
- Information Theory of Deep Learning
- DeepMind Gemini 1.5 - An AI That Remembers!
- Graph Machine Learning for 2024
- Unlock Data Insights Faster with AI-Generated Sample Data
- Graph Dashboards With LLM Powered Natural Language Queries
- Approximate Nearest Neighbor and Product Quantizer
- Choosing the Right Embedding Model, Chunking Strategy, and More
- Document Curation for Retrieval Augmented Generation
- Data Federation with Unity Catalog
- The Data Intelligence Platform
- Cross-Platform Data Lineage with OpenLineage
- Mitigating LLM Hallucinations
- Towards Foundation Models for Knowledge Graph Reasoning
- Unveiling the Power of Multimodal RAG for Images and Text
- Image Similarity Search with Vector Databases
- Building a Music Recommender with Vector Databases
- Document Semantic Search with Vector Databases
- Sentiment Analysis with Vector Databases
- Overview of RAG Approaches with Vector Databases
- Graph-Based Approximate Nearest Neighbors (ANN) and HNSW
- Enhancing Vector Database Queries with Metadata Filtering
- Guiding Graph Embeddings using Path-Ranking Methods
- Evaluating RAG Performance with Vector Databases
- Chunking Best Practices for RAG Applications
- Product quantization in Faiss and from scratch
- Ambient Computing, Multimodal AI & Edge Computing
- Composite AI, the Future of Computing and Human Capital
- From Delta Sharing to Clean Rooms
- What is a vector database?
- Advanced RAG; Chunking, Embeddings, and Vector Databases
- Unity Catalog Overview
- How to Build LLMs on Your Company’s Data While on a Budget
- A Technical Deep Dive into Unity Catalog's Practitioner Playbook
- Which is the most reliable estimate?
- Building an LLMOps Stack for Large Language Models
- Introduction to large language models
- Cost Optimization use cases with a Data Catalog
- 5 Explosive Digital Marketing Strategies
- Introduction to Vector Search
- Atlan AI for Documentation -- LIVE Demo
- LLMOps; Everything You Need to Know to Manage LLMs
- How to Train Your Own Large Language Models
- The future of generative AI
- Reset KOREA!, The 1st Seminar
- Intro to Large Language Models
- Metadata Use Cases Demo in Atlan
- The Anatomy of an Active Metadata Platform
- 3 Keys for Collective Intelligence
- Towards Self-Service Analytics
- Self-designing Data Systems for the AI Era
- Active Metadata Management
- Active Metadata with Data Quality
- The Next Generation of Active Metadata Platforms
- Knowledge Graph Reasoning with Graph Neural Networks
- The Modern Metadata Platform
- Metadata Management & Data Catalog
- Temporal Graph Benchmark for Machine Learning on Temporal Graphs
- Safely Using Gen AI With Active Metadata and Data Fabric
- A General Graph Neural Network Framework for Link Prediction
- Neural Bellman-Ford Networks
- Towards Interpretable Data Science
- Differential Privacy And The Complexity Of Simple Queries
- Privacy Preserving AI
- The Promise of Differential Privacy
- A Survey on Graph Neural Networks for Time Series
- Towards Foundation Models for Knowledge Graph Reasoning
- Stylish Hierarchical Navigable Small World Indexes
- Retrieval-Augmented Language Model Pre-Training
- Domain Knowledge Augmented AI for Databases
- AND/OR Search Spaces for Anytime Probabilistic Reasoning
- Generative AI And Programming
- Gartner's Top 10 Tech Trends for 2024
- Making Retrieval Augmented Generation Better
- Emerging architectures for LLM applications
- Tree of Thought Prompting
- Building a managed database service with Kubernetes Operators
- A Purpose-Built Vector Data Management System
- Best Practices for Query Performance Testing
- OpenAI Embeddings and Vector Databases Crash Course
- Long-Range Genomic Sequence Modeling
- How to Choose a Vector Database
- Awesome Vector Database
- Vector Databases simply explained! (Embeddings & Indexes)
- OpenAI DevDay, Opening Keynote
- Knowledge Distillation
- Vector Database for Large Language Models in Production
- Embeddings, Vector Databases, and Search
- Fast, Approximate Vector Queries on Very Large Unstructured Datasets
- How I make beautiful GRAPHS and PLOTS using LaTeX
- Transformers for Multimodal Self Supervised Learning
- Lance; A Modern Columnar Data Format
- Feature Store Architectures & Technical Challenges
- Machine Learning for Multi-Scale Molecular Simulation and Design
- Reasoning Algorithmically; from Toy Experiments to AGI Modules
- Table Representation Learning
- Language Models and Applications in Data Management
- Categories for AI 1; Why Category Theory?
- Estimating Separable Matching Models
- Action Matching; Learning Stochastic Dynamics from Samples
- Categories, Graphs, Reasoning
- Heterogeneous Continual Learning
- Tractable Control for Autoregressive Language Generation
- Approximating the TV Distance Between Two Product Distributions
- The Rise of Tractable Circuits
- Tractable Bounding of Counterfactual Queries
- A Tutorial on Causal Representation Learning
- Explainable Data Drift for NLP
- Open-Source Spotlight - Deepchecks
- Develop test suites for machine learning models and data
- Kolmogorov-Smirnov Test Explained
- L-Infinity Distance Explained
- Wasserstein Distance Explained
- Jensen-Shannon Distance Explained
- Comparison of Data Drift Detection Methods
- Sketching and Projecting
- Randomized Embeddings and Neural Networks
- Memory-Regret Tradeoff for Online Learning
- Sketching Techniques for Kernel Density Estimation
- Fast Attention Requires Bounded Entries
- From Machine Learning to Autonomous Intelligence
- Less is More with PostgresML
- The Power of Graph Learning
- AI-Powered Database Optimization as a Service
- Open Source Vector Search Engine and Vector Database
- How to evaluate and explore data drift in ML systems
- Hugging Face Tutorial
- Python and PyTorch Tutorial
- Data Drift & Model Drift Detection Using Python
- Easiest way to Detect Data Drift Using Evidently
- Data Drift Monitoring in Azure ML
- Drifting Away; Testing ML Models in Production
- Multimodal Deep Learning
- Automating Model Monitoring and Drift Detection
- Monitoring Model Quality in Streaming Machine Learning Applications
- ML Drift; Identifying Issues Before You Have a Problem
- Dynamic Graph Algorithms; What We Know and What We Don’t
- How to Understand Clearly
- Gentle Introduction to Causal Inference
- Awesome Incremental Learning / Lifelong learning
- Temporal Distributional Shifts
- A Brief Introduction to Causal Inference
- Explainable Deep Learning for Structured Data
- Leveraging Privacy-Enhancing Technologies with LLMs
- Probabilistic and Logical Circuits for Tractable Causal Reasoning
- Probabilistic Reasoning and Learning for Trustworthy AI
- Learned Query Optimiser At the Forefront of AI Driven Databases
- Internals of Permissioned Blockchains
- On the Role of Neural Collapse in Few-Shot Transfer Learning
- Introduction to Data Structures and Optimization for Fast Algorithms
- Attention Approximates Sparse Distributed Memory
- Fine-Grained Complexity for Dynamic Graph Problems
- Sketching
- Graph Data Structures
- Graph Linear Algebra
- Kubeflow vs MLFlow
- Language Models as Statisticians, and as Adapted Organisms
- Differentially Private Sampling from Distributions
- HNSW for Vector Search Explained and Implemented with Faiss
- Incremental View Maintenance
- Preserving Data Privacy in Federated Learning
- On Localization in Language Models
- Meaning in the age of large language models
- Large Language Models Meet Copyright Law
- Logic and Probabilistic Circuits
- Prompt Engineering
- Statistical Learning Theory and Neural Networks
- LangChain "Advanced Retrieval" Webinar
- Challenges and Opportunities in Differentially Private Federated Learning
- LlamaIndex Overview & Use cases
- In-Context Learning
- 9 tips to help you PROVE MATH THEOREMS
- Are LLMs the Beginning or End of NLP?
- Bayesian Principles for Machine Learning
- How to Speak
- Differentially Private GNNs with Aggregation Perturbation
- Recent Advances in Causality-based Recommendation
- Interpretability via Symbolic Distillation
- Synthesizing Relational Data with Differential Privacy
- PrivSyn - Differentially Private Data Synthesis
- Basics of Neurosymbolic Architectures
- The Theory and Practice of Differential Privacy
- Towards Reliable Use of Large Language Models
- Cartel Screening and Machine Learning
- Local Differential Privacy in Practice
- Survey on Privacy/Security in FL
- NVIDIA Keynote at SIGGRAPH 2023
- Trustworthy AI - A Computational Perspective
- High-Dimensional Bayesian Optimization
- Self-Supervised Learning in Recommendation
- Keeping AI under control through mechanistic interpretability
- Towards Machines that can Learn, Reason, and Plan
- Diffusion and Score-Based Generative Models
- Differentially Private Diffusion Models
- Foundation Models and Fair Use
- Spatial Data Science Methods for Improving Models
- Predictive Whittle Networks for Time Series
- There's no such thing as MIRACLE
- Locally Private Graph Neural Network
- What is a Vector Database?
- How to write Tree of Thoughts Prompts
- Prompt Engineering Overview
- A New Upper Bound for the Heilbronn Triangle Problem
- AI and the Future of Humanity
- Two Paths to Intelligence
- Large Multimodal Models
- ChatGPT for Data Analysts
- Visual Prompting Livestream With Andrew Ng
- Quantifying and Understanding Memorization in Deep Neural Networks
- Continuous Training and Deployment with ZenML and Seldon
- CVPR Highlights 2023
comments powered by