Product team changed their schema without notifying you, which led to your pipelines/serving breaking!?
Tired of messy JSON data with no form or structure hitting your Kafka or Pub/Sub stream?
Want to make sure data types and fields are consistent when you are receiving or sending API responses?
Documentation is always an afterthought?
Data scientist code is so hard to read and collaborate on!
Having difficulty validating data from other parties?
Heard good things about Docker but not sure where to get started?
Docker tutorials out there not useful for a data scientist / analyst?
Developing within a Docker environment! (Remote IDE)
What is the deal with testing? How do you even test data?
Testing is only for software engineers! I am a data scientist!
How should a data scientist view testing?
Data Build Tool (dbt) by Fishtown Analytics!
Having difficulty managing SQL scripts? Too many nested queries? Or figuring out the execution graph? Finding yourself running scripts manually?
How do you even test ETL jobs or queries!?
Write better pandas code as a data scientist so that your engineer will love you! (Or make your code easier to productionize)
Frustrated by constant reassignment and/or hard referencing when creating new columns with pandas?
Like the Spark DataFrame API or R's dplyr API, or chaining / piping your code in general? (Some sort of functional programming)
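To give a taste of what chaining looks like, here is a minimal sketch that contrasts the reassignment style with a chained version using `assign`, `pipe` and `query`. The `orders` data and the `add_revenue` helper are made up for illustration and are not taken from the post itself.

```python
import pandas as pd

# Hypothetical order data, made up for illustration.
orders = pd.DataFrame({"price": [10.0, 20.0, 5.0], "quantity": [2, 1, 3]})


def add_revenue(df: pd.DataFrame) -> pd.DataFrame:
    """Return a copy of df with a revenue column (hypothetical helper)."""
    return df.assign(revenue=lambda d: d["price"] * d["quantity"])


# Reassignment style: every step mutates and hard-references `orders_v1`.
orders_v1 = orders.copy()
orders_v1["revenue"] = orders_v1["price"] * orders_v1["quantity"]
orders_v1 = orders_v1[orders_v1["revenue"] >= 20]

# Chained style: each step receives the previous result,
# so there are no intermediate names and `orders` is left untouched.
orders_v2 = (
    orders
    .pipe(add_revenue)
    .query("revenue >= 20")
    .assign(is_bulk=lambda d: d["quantity"] >= 2)
)
```

The lambdas inside `assign` refer to the frame as it exists at that step of the chain, which is what removes the need to hard-reference a variable name.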
Getting started with metrics?
Having difficulty remembering metrics?
Sensitivity, power, recall, precision, ...
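As a memory aid, here is a small sketch that derives a few of these metrics from the four confusion-matrix counts. The labels are made up for illustration, and the function names are my own. Note that sensitivity and recall are two names for the same ratio, TP / (TP + FN), and power in hypothesis testing is the analogous quantity (the chance of detecting an effect that is really there).

```python
def confusion_counts(y_true, y_pred):
    """Count TP, FP, FN, TN for binary labels where 1 is the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    return tp, fp, fn, tn


# Hypothetical labels, made up for illustration.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 0, 1, 0, 1, 0, 0, 0]

tp, fp, fn, tn = confusion_counts(y_true, y_pred)

precision = tp / (tp + fp)    # of everything flagged positive, how much was right
recall = tp / (tp + fn)       # a.k.a. sensitivity: of the real positives, how many were found
specificity = tn / (tn + fp)  # of the real negatives, how many were left alone
```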
Quick Introduction to hypothesis testing
Data Science is not only about Machine Learning!
Interested to know more about A/B testing?
Understanding p-value, power, sample size.
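To make the link between p-value, power and sample size concrete, here is a rough sketch using only the Python standard library. It assumes a two-sided, two-sample z-test with equal group sizes and a known, shared standard deviation, which is a simplification; the function names and the example numbers are my own and not from the post.

```python
import math
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal: mean 0, standard deviation 1


def two_sided_p_value(z: float) -> float:
    """Probability of a z statistic at least this extreme under the null."""
    return 2 * (1 - _STD_NORMAL.cdf(abs(z)))


def sample_size_per_group(effect: float, sigma: float,
                          alpha: float = 0.05, power: float = 0.8) -> int:
    """Approximate n per group needed to detect a mean difference `effect`.

    Normal-approximation formula: n = 2 * ((z_alpha + z_power) * sigma / effect)^2
    """
    z_alpha = _STD_NORMAL.inv_cdf(1 - alpha / 2)  # critical value for the test
    z_power = _STD_NORMAL.inv_cdf(power)          # quantile for the desired power
    n = 2 * ((z_alpha + z_power) * sigma / effect) ** 2
    return math.ceil(n)


p = two_sided_p_value(1.96)
n_half_sigma = sample_size_per_group(effect=0.5, sigma=1.0)
n_quarter_sigma = sample_size_per_group(effect=0.25, sigma=1.0)
```

Because the effect size is squared in the denominator, halving the effect you want to detect roughly quadruples the sample size you need.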
My current setup and tools
For those who are curious about what tools I use.
How I set up my machine.
Deciding what to learn!
My personal pitfalls, and how to avoid my mistakes.
How do I decide what to learn?
How do I stay motivated?
Podcast with Symbolic Connection
An hour-long podcast on how I got started in my career.
How did I make the switch from a "dashboard data scientist" to putting models in production?
And other questions such as Python or R?
Work in Progress
asyncio (or ways to run multiple I/O-bound processes at once)
A/B tests, ANOVA