I Can't Believe It's Not Real Data! An Intro to Synthetic Data


Introduction

Easy access to relevant, safe data is a major bottleneck for developers and data scientists. The data could be insufficient, biased, contain private information, etc. making it unusable. With synthetic data we can generate data that’s statistically accurate, privacy-protected, and safe to share.

This talk was given at:

  • PyCon US 2022 - Lightning Talk
  • PyRVA 6/8/2022 - Full Talk
  • PyOhio - Thunder (15 min) Talk
  • PyCon Latam 8/26/2022 - Full Talk
  • DigitalOcean Deploy - Full Talk

Lightning Talk
Full Talk
DigitalOcean Deploy Slides

Relational DB Notebook