
SF '18 Schedule

Watch this page for the latest updates on our 3-track conference schedule. Please note that the schedule is subject to change by the organizer.

 

8:00 - 9:00AM Registration and Breakfast
9:00 - 9:15AM Welcome, announcements & track host intros
9:15 - 9:50AM Keynote #1: Data Science: Past, Present, Future (Jocelyn Goldfein, Shubha Nabar, Omoju Miller, Jennifer Prendki)
10:00 - 10:40AM

Functional Data Engineering - A Set of Best Practices - Max Beauchemin (Lyft)

10:45 - 11:25AM

Scaling a Relational Database for the Cloud-age - Sumedh Pathak (Citus Data)

Office Hours - Max Beauchemin, Lyft

11:30 - 12:10PM

Cloud Data Warehouse Benchmark: Redshift vs Snowflake vs BigQuery - George Fraser (Fivetran)

Office Hours - Sumedh Pathak, Citus Data

12:15 - 1:15PM Lunch
1:15 - 1:55PM

Democratizing Data with the Clover Transform Framework - Chris Hartfield (Clover Health)

Office Hours - George Fraser, Fivetran

2:00 - 2:40PM

Effective Management of High Volume Numeric Data with Histograms - Fred Moyer (Circonus)

Office Hours - Chris Hartfield, Clover Health

2:45 - 3:15PM Coffee Break
3:15 - 3:55PM

Lazy Beats Smart and Fast - Julian Hyde (Looker)

Office Hours - Fred Moyer, Circonus

4:00 - 4:45PM

Machine Learning from Development to Production at Instacart - Montana Low (Instacart)

Office Hours - Julian Hyde, Looker

5:00 - 7:00PM DATA COMMUNITY PARTY @ Holiday Inn Golden Gateway
8:00 - 9:00AM Registration and Breakfast
9:00 - 9:15AM Welcome, announcements & track host intros
9:15 - 9:50AM Keynote #1: Data Science: Past, Present, Future (Jocelyn Goldfein, Shubha Nabar, Omoju Miller, Jennifer Prendki)
10:00 - 10:40AM

Taming the Deep Learning Workflow - Evan Sparks (Determined AI)

10:45 - 11:25AM

How to do Segmentation Right - A Practical Guide for Data Scientists - Ruben Kogel (VSCO)

Office Hours: Evan Sparks, Determined AI

11:30 - 12:10PM

Fast & Effective: Natural Language Understanding - Mike Conover (Workday)

Office Hours: Ruben Kogel, VSCO

12:15 - 1:15PM Lunch
1:15 - 1:55PM

Weld: Accelerating Data Science by 100x - Shoumik Palkar (Stanford)

Office Hours: Mike Conover, Workday

2:00 - 2:40PM

Safely Streamlining Healthcare Policy Management using Ideas from Structured Natural Language Processing (SNLP) - Asif Khalak & Sergio Martinez-Ortuno (Collective Health)

Office Hours: Shoumik Palkar, Stanford

2:45 - 3:15PM Coffee Break
3:15 - 3:55PM

Data Access for Data Science - Jacques Nadeau (Dremio)

Office Hours: Asif Khalak & Sergio Martinez-Ortuno, Collective Health

4:00 - 4:45PM

Enabling Full Stack Data Scientists at Stitch Fix - Juliet Hougland (Stitch Fix)

Office Hours: Jacques Nadeau, Dremio

5:00 - 7:00PM DATA COMMUNITY PARTY @ Holiday Inn Golden Gateway
8:00 - 9:00AM Registration and Breakfast
9:00 - 9:15AM Welcome, announcements & track host intros
9:15 - 9:50AM Keynote #1: Data Science: Past, Present, Future (Jocelyn Goldfein, Shubha Nabar, Omoju Miller, Jennifer Prendki)
10:10 - 10:30AM The Streaming Data Framework that Startups are Adopting (Vid Jain, Wallaroo Labs)
10:35 - 10:55AM

Packaging, Deploying and Running Spark Applications in Production at Mapbox (Saba El-Hilo, Mapbox)

11:00 - 11:20AM

How Wootric Uses NLP and ML to Make Sense of Hundreds of Thousands of Surveys (Prabhat Jha, Wootric)

11:25 - 11:45AM Building Bots and Conversational AI using Deep Learning (Mitul Tiwari, Passage AI)
11:50 - 12:10PM

How to Leverage Multiple Analytics Engines and Not Lose Track of Your Data (Raghu Murthy, Datacoral)

12:15 - 1:15PM

Lunch

1:15 - 1:35PM

A Serverless Approach to Adding Notifications Features to Any Analytics Application (Paul Lappas, Intermix)

1:40 - 2:00PM

Hyper-Parameter Tuning Across Your ENTIRE Pipeline: From Model Training to Model Inference (Chris Fregly, Pipeline AI)

2:05 - 2:25PM Starting from Scratch in a World Where Data is Everything (Simon Kozlov, Instrumental)
2:30 - 2:50PM ETL vs ELT for Big Data (Artyom Keydunov, Statsbot)
2:50 - 3:15PM Coffee Break
3:15 - 3:35PM Adventures (and Misadventures) in Automated Insight Discovery (Mike Kim, Outlier)
3:40 - 4:00PM How to Monitor and Get Insights From Your Blockchain (Shawn Douglass, Amber Data)
4:10 - 5:00PM VC Panel (Lisha Li, Leo Polovets & other guests)
5:00 - 7:00PM DATA COMMUNITY PARTY @ Holiday Inn Golden Gateway
8:00 - 9:00AM Registration and Breakfast
9:00 - 9:15AM Day 2 - Welcome, announcements & track host intros
9:15 - 9:50AM Keynote #2: The Design of Systems for Real-time Prediction Serving - Joseph Gonzalez (RISE Lab, UC Berkeley)
10:00 - 10:40AM

Real-Time Data Pipelines Made Easy with Structured Streaming in Apache Spark - Tathagata Das (Databricks)

Office Hours: Joseph Gonzalez (RISE Lab, UC Berkeley)

10:45 - 11:25AM

Uber’s Data Journey: 100+PB with Minute Latency - Reza Shiftehfar (Uber)

Office Hours: Tathagata Das, Databricks

11:30 - 12:10PM

From Flat Files to Deconstructed Database: The Evolution and Future of the Big Data Ecosystem - Julien Le Dem (WeWork)

Office Hours: Reza Shiftehfar, Uber

12:15 - 1:15PM Lunch
1:15 - 1:55PM

A Trillion Rows Per Second as a Foundation for Interactive Analytics - Eric Hanson (MemSQL)

Office Hours: Julien Le Dem, WeWork

2:00 - 2:40PM

Efficiently Storing and Calculating Engagement Metrics At Massive Scale - Corey Bort (Facebook)

Office Hours: Eric Hanson, MemSQL

2:45 - 3:15PM Coffee Break
3:15 - 3:55PM

What the Heck is an In-Memory Data Grid? - Addison Huddy (Pivotal)

Office Hours: Corey Bort, Facebook

4:00 - 4:45PM

Keynote #3: Leveling Up Your Career in Data (Guy Bayes, Jasmine Tsai, Noelle Sio Saldana, Aline Lerner)

5:00PM Conference End :(
8:00 - 9:00AM Registration and Breakfast
9:00 - 9:15AM Day 2 - Welcome, announcements & track host intros
9:15 - 9:50AM Keynote #2: The Design of Systems for Real-time Prediction Serving - Joseph Gonzalez (RISE Lab, UC Berkeley)
10:00 - 10:40AM

A Multi-Armed Bandit Framework for Recommendations at Netflix - Jaya Kawale & Elliot Chow (Netflix)

Office Hours: Joseph Gonzalez (RISE Lab, UC Berkeley)

10:45 - 11:25AM

AutoML: The Assembly Line of Machine Learning - Mayukh Bhaowal (Salesforce)

Office Hours: Jaya Kawale & Elliot Chow, Netflix

11:30 - 12:10PM

Democratizing Metric Definition and Discovery at Airbnb - Lauren Chircus (Airbnb)

Office Hours: Mayukh Bhaowal, Salesforce

12:15 - 1:15PM Lunch
1:15 - 1:55PM

Define Once, Evaluate Anywhere: Building Repeatable and Correct Features at Stripe - Kelley Rivoire (Stripe)

Office Hours: Lauren Chircus, Airbnb

2:00 - 2:40PM

Marketplace Optimization at Uber - Christopher Wilkins (Uber)

Office Hours: Kelley Rivoire, Stripe

2:45 - 3:15PM Coffee Break
3:15 - 3:55PM

Hazardous Models and Risk Mitigation in Real Estate - Xinlu Huang & David Lundgren (Opendoor)

Office Hours: Christopher Wilkins, Uber

4:00 - 4:45PM

Keynote #3: Leveling Up Your Career in Data (Guy Bayes, Jasmine Tsai, Noelle Sio Saldana, Aline Lerner)

5:00PM Conference End :(
8:00 - 9:00AM Registration and Breakfast
9:00 - 9:15AM Day 2 - Welcome, announcements & track host intros
9:15 - 9:50AM Keynote #2: The Design of Systems for Real-time Prediction Serving - Joseph Gonzalez (RISE Lab, UC Berkeley)
10:10 - 10:30AM Building Big Communities with Big Queries (Paul Burt, CoreOS)
10:35 - 10:55AM

Datasets not Dashboards (Sameer Al-Sakran, Metabase)

11:00 - 11:20AM

NuCypher's Proxy Re-Encryption for Distributed Systems - Managing Private Data on Public Blockchains (MacLane Wilkison, NuCypher)

11:25 - 11:45AM Dead Simple Search A/B Testing with Scala and Spark (Sean Quigley, GIPHY)
11:50 - 12:10PM

Compliant Data Management and Machine Learning at Scale (Daniel Whitenack, Pachyderm)

12:15 - 1:15PM

Lunch

1:15 - 1:35PM

Actionable and Interpretable Predictions from a Stacked Model (Austen Head, Halo Technologies)

1:40 - 2:00PM

KISS - Keep it SQL, Stupid (Connor McArthur, DBT)

2:05 - 2:25PM Lessons Learned Deploying Machine Learning and Deep Learning Models in Production (Jerry Xu, Datatron)
2:30 - 2:50PM Real Time Text Matching at Scale (Shayan Mohanty, Watchful)
2:50 - 3:15PM Coffee Break
3:15 - 3:35PM Three Weird Tips for High Performance Analytics Applications (Gian Merlino, Imply)
3:40 - 4:00PM Version and Deploy Datasets at Scale (Kevin Moore, Quilt)
4:00 - 5:00PM Keynote #3: Leveling Up Your Career in Data (Guy Bayes, Jasmine Tsai, Noelle Sio Saldana, Aline Lerner)