Data Council Blog

Data Council Blog

Open Source Highlight: Apache Superset

Community, Metadata Management, and More: Top 10 Links From Across the Web

Open Source Highlight: PostHog

dbt at Shopify, Active Learning, and More: Top 10 Links From Across the Web

Open Source Highlight: OpenLineage

Storing Cold Metadata, Snowflake Data Cloud, and More: Top 10 Links From Across the Web

Open Source Highlight: Orchest

The Modern Data Stack, Metadata Architectures, and More: Top 10 Links From Across the Web

Open Source Highlight: Klio

NLP Heroes, Pinot, Data Testing, and More: Top 10 Links From Across the Web

Open Source Highlight: DataHub

State of AI, Data Quality, and More: Top 10 Links From Across the Web

Open Source Highlight: n8n

Hot Data Tools pt. 2, End-to-End Data Scientists, and More: Top 10 Links From Across the Web

Large Datasets, Are Dashboards Dead, and More: Top 10 Links From Across the Web

Open Source Highlight: Apache Hudi

Apache Airflow, Beyond Spreadsheets, and More: Top 10 Links From Across the Web

Open Source Highlight: Apache Iceberg

AGI, Dask, Feature Stores, and More: Top 10 Links From Across the Web

Emerging Data Roles: The Analytics Engineer

Open Source Highlight: Cube.js

What Data Tools DON’T Do, CD4ML and NoSQL: Top 10 Links from Across the Web

25 Hot New Data Tools and What They DON’T Do

Open Source Highlight: Streamlit

Data Science, Data Analytics, Data Engineering and Artificial Intelligence: 11 Online Courses You Should Check Out

PyTorch Lightning, ksqlDB and More: Top 10 Links from Across the Web

Data Engineer Salaries Around The World (2019)

Should Datacoral Power Your New Data Infrastructure?

How Histograms Can Help Improve Your Ops Monitoring

Amberdata - Featured Startup SF '18

Intermix - Featured Startup SF '18

How to "Democratize" the Responsibility for Data Quality Across your Organization

Shattering the Trillion-Rows-Per-Second Barrier With MemSQL

NuCypher - Featured Startup SF '18

Wootric - Featured Startup SF '18

The Future of Distributed Databases is Relational

Halo Tech - Featured Startup SF '18

PipelineAI - Featured Startup SF '18

Instrumental - Featured Startup SF '18

Pachyderm - Featured Startup SF '18

Redshift versus Snowflake versus BigQuery / Part 1: Performance

Functional Data Engineering — a modern paradigm for batch data processing

ETL and the Question of Happiness

Data Science in the Media

How Data Has Evolved at The New York Times

How Dremio Uses Apache Arrow to Increase the Performance

Introducing our Data Startups Track

To Shard or Not to Shard (PostgreSQL)

Rolling Your Own Distributed Column Store

How Big Data Can Help Improve the Meteorological Risk Models That Are Out of Date

A Day in the Life: What's it like Being an Engineer at Stripe?

Rebuilding Open Source Analytics @ Airbnb

Pushing Kafka to the Limit at Heroku

Fighting Fraud in Cryptocurrency using Machine Learning

Building a Column-Oriented, Distributed Data Store for Analytics - The Story of Druid

How to Build a Data Pipeline That Handles Hundreds of Different Inputs

10 Unique Gift Ideas for Data Scientists and Engineers

Open Source Software Wins $2K in Lieu of Conference Swag