Math Behind Data Science
Statistics is the study of collecting, analyzing, and interpreting data.
Descriptive → Summarize data Inferential → Make predictions
mean = sum(values) / n
Middle value after sorting
Most frequent value
Measures spread of data
sqrt(variance)
Probability = favorable / total
import numpy as np data = [1,2,3,4,5] print(np.mean(data)) print(np.median(data)) print(np.std(data))