Probability & Statistics
Probability axioms, random variables, standard distributions, limit theorems, and statistical inference for GATE preparation with full proofs and worked examples.
The axiomatic foundation of probability theory and Bayes' theorem for updating beliefs based on evidence.
- \(P(A) \ge 0\) for all \(A \in \mathcal{F}\)
- \(P(\Omega) = 1\)
- For pairwise disjoint \(A_1, A_2, \ldots\): \(P\!\left(\bigcup_{i=1}^{\infty} A_i\right) = \sum_{i=1}^{\infty} P(A_i)\)
\(P(+) = P(+|D)P(D) + P(+|D^c)P(D^c) = 0.99(0.01) + 0.05(0.99) = 0.0099 + 0.0495 = 0.0594\)
\[P(D|+) = \frac{0.0099}{0.0594} \approx 0.1667 \approx 16.7\%\] Despite the accurate test, only about 1 in 6 positive results is a true positive, due to the low base rate.
Discrete and continuous random variables, their distributions, expectations, and variances.
\(E[X^2] = \int_0^1 x^2 \cdot 2x\,dx = 2\int_0^1 x^3\,dx = \frac{1}{2}\).
\(\text{Var}(X) = \frac{1}{2} - \left(\frac{2}{3}\right)^2 = \frac{1}{2} - \frac{4}{9} = \frac{1}{18}\).
The most important probability distributions for GATE: Binomial, Poisson, Normal, Exponential, and Uniform.
The Law of Large Numbers and Central Limit Theorem are the cornerstones of statistical theory, justifying the use of sample means as estimators.
By CLT: \(P(190 \le S \le 210) = P\!\left(\frac{190-200}{10} \le Z \le \frac{210-200}{10}\right) = P(-1 \le Z \le 1)\)
\(= \Phi(1) - \Phi(-1) = 2\Phi(1) - 1 \approx 2(0.8413) - 1 = 0.6827 \approx 68.3\%\).
Estimation and hypothesis testing: the core tools of statistical inference including MLE, method of moments, confidence intervals, and regression.
Setting \(\ell'(\lambda) = -n + \frac{\sum x_i}{\lambda} = 0\) gives \(\hat{\lambda}_{\text{MLE}} = \bar{x}\).
- Null hypothesis \(H_0\) vs. alternative \(H_1\)
- Type I error (reject true \(H_0\)): probability \(\alpha\) (significance level)
- Type II error (fail to reject false \(H_0\)): probability \(\beta\); Power = \(1 - \beta\)
- p-value: smallest \(\alpha\) at which \(H_0\) would be rejected
- Bayes' theorem inverts conditional probabilities: base rate matters significantly.
- For Poisson, mean equals variance; for Exponential, the memoryless property is unique among continuous distributions.
- CLT justifies normal approximation for sample means regardless of the underlying distribution.
- MLE is asymptotically efficient and consistent under regularity conditions.
- Confidence intervals and hypothesis tests are dual: reject \(H_0: \mu = \mu_0\) iff \(\mu_0\) lies outside the CI.