
SF 17 Schedule

To view the schedule for each day, click on the tabs below. Speakers and talks are being added daily, so be sure to check back!

 

 

WORKSHOP DAY

* Schedule is subject to change by organizer.

 

TIME & TRACKS

TRACK 1: SPARK 2.0.1

TRACK 2: SPARK INTERNALS

TRACK 3: PANDAS 0.20.0

TRACK 4: SCIKIT-LEARN

TRACK 5: DEEP LEARNING

TRACK 6: APACHE AIRFLOW

8:30am - 9:00am  CHECK-IN & COFFEE/TEA BREAK

9:00am - 12:30pm

TRACK 1
Workshop: Intro to Spark
Level: Beginner - Intermediate
Trainer: Austin Ouyang, Insight Data Science

TRACK 2
Workshop: Spark Internals
Level: Intermediate - Advanced
Trainer: Ronak Nathani, Insight Data Science

TRACK 3
Workshop: Coming Soon
Level: Beginner - Intermediate
Trainer: TBA (Stay Tuned!)

TRACK 4
Workshop: Intro to Scikit-learn
Level: Beginner - Intermediate
Trainer: Francesco Mosconi, DataWeekends

TRACK 5
Workshop: TensorFlow
Level: Beginner - Intermediate
Trainer: TBA, Metis

TRACK 6
Workshop: Up & Running with Airflow
Level: Beginner - Intermediate
Trainer: Arthur Wiedmer, Airbnb

12:30pm - 1:30pm  LUNCH BREAK
1:30pm - 5:00pm

TRACK 1
Workshop: Using Spark APIs
Level: Intermediate - Advanced
Trainer: Austin Ouyang, Insight Data Science

TRACK 2
Workshop: Spark Streaming & Kafka
Level: Intermediate - Advanced
Trainer: Ronak Nathani, Insight Data Science

TRACK 3
Workshop: Coming Soon
Level: Intermediate - Advanced
Trainer: TBA (Stay Tuned!)

TRACK 4
Workshop: Intermediate Scikit-learn
Level: Intermediate - Advanced
Trainer: Francesco Mosconi, DataWeekends

TRACK 5
Workshop: Coming Soon
Level: Intermediate - Advanced
Trainer: TBA (Stay Tuned!)

TRACK 6
Workshop: Airflow Use Cases
Level: Intermediate - Advanced
Trainer: Arthur Wiedmer, Airbnb

 

CONFERENCE DAY 1 & AFTER-PARTY


 

  Data Engineering Track Data Science Track
8:00 - 9:00am Registration and Breakfast
9:00 - 9:15am Welcome, Announcements & Track Host Introductions
9:15 - 9:50am Opening Keynote

10:00 - 10:40am
Data Engineering Track: Sid Anand (Agari), "Cloud Native Data Pipelines"
Data Science Track: Soups Ranjan (Coinbase), "Payment Fraud in Digital Currency"

10:45 - 11:25am
Data Engineering Track: Maxime Beauchemin (Airbnb), "How Superset and Druid Power Real-Time Analytics at Airbnb"
Data Science Track: Laura Pruitt (Netflix), "Anomaly Detection for Data Quality and Metric Shifts at Netflix"

11:30 - 12:10pm
Data Engineering Track: Chris Hartfield (Clover Health), "How Healthcare Data Pushed Us to the Limit"
Data Science Track: Daniel Galron (eBay), "Why, When, How: Lessons Learned in Applying Deep Learning to Real-World Problems"

12:15 - 1:15pm Lunch Break

1:15 - 1:55pm
Data Engineering Track: To Be Announced
Data Science Track: Sharath Rao (Instacart), "Practical Lessons for Building Machine Learning Models in Production"

2:00 - 2:40pm
Data Engineering Track: Jeff Chao (Heroku), "Beyond 100,000 Partitions: How Heroku Pushes the Limits of Kafka at Scale"
Data Science Track: To Be Announced

2:45 - 3:15pm Coffee Break

3:15 - 3:55pm
Data Engineering Track: Paul Dix (InfluxData), "InfluxDB Storage Engine Internals"
Data Science Track: To Be Announced

4:00 - 4:55pm
Data Engineering Track: To Be Announced
Data Science Track: Investor Panel, "How Engineer-Angels Evaluate Data-Backed Startups"

5:00 - 5:45pm [KEYNOTE PANEL]

The Right Stuff: Lessons Learned from a Decade of Data Engineering

Mike Driscoll | VJ Gill | Sam Shah

Metamarkets | Salesforce | NewCo

6:00 - 8:00pm Conference & Community After-Party  

 

CONFERENCE DAY 2


 

  Data Engineering Track Data Science Track
8:00 - 9:00am Registration and Breakfast  
 9:00 - 9:15am Welcome, Announcements & Track Host Introductions  
 9:15 - 9:50am Day 2 Keynote  
10:00 - 10:40am
Data Engineering Track: Fangjin Yang (Imply), "Druid: A High Performance, Distributed, Column Store"
Data Science Track: Nelson Ray (Opendoor), "Simulation-Based Inference: Advantages Over A/B Testing in the Real Estate Domain"

10:45 - 11:25am
Data Engineering Track: Silvia Oliveros-Torres & Stephen O'Sullivan (Silicon Valley Data Science), "Format Wars: From VHS and Beta to Avro and Parquet"
Data Science Track: Sean Anderson (Cloudera), "Data Science in the Enterprise"

11:30 - 12:10pm
Data Engineering Track: To Be Announced
Data Science Track: To Be Announced

12:15 - 1:15pm Lunch Break

1:15 - 1:55pm
Data Engineering Track: To Be Announced
Data Science Track: Kenneth Sanford (Dataiku), "A Nation of Immigrants: The Data Sciences"

2:00 - 2:40pm
Data Engineering Track: Shuojie Wang (Facebook), "Scaling Up Spark at Facebook: A 60TB Production Use Case"
Data Science Track: To Be Announced

2:45 - 3:15pm Coffee Break

3:15 - 3:55pm
Data Engineering Track: To Be Announced
Data Science Track: To Be Announced

4:00 - 4:55pm Closing Keynote: To Be Announced

5:00pm Conference Ends