#66DaysOfData Day 10

Student’s t-test as a hypothesis test

#66DaysOfData Day 9


  • discrete — defined on discrete state spaces where the data can take only certain values;
  • continuous — defined on continuous state spaces where the data can take any value in a predefined range.
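
As a minimal sketch of the distinction above, the following compares a discrete distribution (a binomial, whose possible values are the integers 0..n) with a continuous one (a uniform on a fixed range). The function names and the chosen parameters are illustrative assumptions, not taken from the original post.

```python
from math import comb


def binomial_pmf(k: int, n: int, p: float) -> float:
    """P(X = k) for a binomial variable: k successes in n independent trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def uniform_interval_prob(lo: float, hi: float, a: float, b: float) -> float:
    """P(lo <= X <= hi) for X uniformly distributed on [a, b]."""
    lo, hi = max(lo, a), min(hi, b)  # clip the interval to the support
    return max(hi - lo, 0.0) / (b - a)


# Discrete: only the values 0..n are possible, and their probabilities sum to 1.
n, p = 4, 0.5
pmf = [binomial_pmf(k, n, p) for k in range(n + 1)]
total = sum(pmf)

# Continuous: any value in [0, 10] is possible, so probability is assigned
# to intervals rather than to individual points.
prob_2_to_5 = uniform_interval_prob(2.0, 5.0, 0.0, 10.0)
```

In the discrete case each individual value carries a probability; in the continuous case a single point has probability zero and only intervals carry probability.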

Binomial and Bernoulli distributions

#66DaysOfData Day 7


  • point estimation: the population parameter is estimated by a single value computed from the current sample, regardless of new samples that may be added in the future;
  • confidence interval estimation: the unknown population…

Predictive modeling cycle:

  1. Data cleaning
  2. Feature engineering
  3. Model building
  4. Model deployment
  5. Model updating
  6. Repeat steps 3–4

#66DaysOfData Day 5

Cumulative distribution function (CDF)

Survival function
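
The two functions named above are complementary: the CDF gives F(x) = P(X ≤ x), and the survival function gives S(x) = P(X > x) = 1 − F(x). A small sketch using empirical (sample-based) versions follows; the helper names and the sample values are hypothetical and chosen only for illustration.

```python
def empirical_cdf(sample, x):
    """Fraction of observations less than or equal to x."""
    return sum(1 for v in sample if v <= x) / len(sample)


def empirical_survival(sample, x):
    """Fraction of observations strictly greater than x, i.e. 1 - F(x)."""
    return 1.0 - empirical_cdf(sample, x)


sample = [1, 2, 2, 3, 5, 8]
f_at_2 = empirical_cdf(sample, 2)       # three of the six values are <= 2
s_at_2 = empirical_survival(sample, 2)  # the remaining three are > 2
```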

#66DaysOfData Day 4

Descriptive and inferential statistics

Descriptive statistics

  • describes the data by summarizing data set characteristics
  • does not posit any hypothesis
  • first step of statistical analysis
  • aims to detect outliers
  • precursor to data preparation and feature engineering
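
A rough sketch of the points above, using only Python's standard library: it summarizes a small sample and flags potential outliers. The 1.5 × IQR rule used here is one common convention and an assumption on my part; the post does not say which outlier method it has in mind. The sample values and helper names are also hypothetical.

```python
import statistics


def describe(values):
    """Summarize a sample with a few common descriptive statistics."""
    q1, q2, q3 = statistics.quantiles(values, n=4)  # quartiles
    return {
        "mean": statistics.mean(values),
        "median": q2,
        "stdev": statistics.stdev(values),
        "q1": q1,
        "q3": q3,
    }


def iqr_outliers(values, k=1.5):
    """Return values outside [Q1 - k*IQR, Q3 + k*IQR] (assumed convention)."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


sample = [12, 14, 15, 15, 16, 17, 18, 19, 20, 95]
summary = describe(sample)
outliers = iqr_outliers(sample)
```

Note how the single extreme value pulls the mean well above the median, which is the kind of signal a descriptive pass is meant to surface before data preparation.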

#66DaysOfData Day 3


Viktoria Karamysheva

Software Engineer

