Hacker theme

Hacker is a theme for GitHub Pages.

Download as .zip Download as .tar.gz View on GitHub
8 August 2025

Mind on Statistics (6th. Ed) Chapter 10 - Estimating Proportions with Confidence

by Arpon Sarker

Introduction

Overview of Confidence Intervals

Confidence interval: is an interval of values computed from sample data that is likely to include the unknown value of a population parameter

\[\textrm{sample statistic } \pm \textrm{ multiplier } \cdot \textrm{ standard error}\]

The sammple size, confidence level, and natural variability among units affect the width of the interval. The first two are the only ones controllable.

Confidence Interval for Population Proportion

Conditions (arise when data are collected similar to binomial experiment):

The first condition changes if parameter $p$ is a long-run probability.

alternative condition: The sample proportion $\hat{p}$ is the proportion of times a specified outcome occurs in n repeated independent trials with fixed probability $p$.

Margin of Error: describes value of multiplier x s.e. in a 95% confidence interval estimate of a population proportion

95% Margin of Error (2 standard errors): \(= 2\sqrt{\frac{\hat{p}(1-\hat{p})}{n}}\) Margin of error is the same as conservative margin of error when $\hat{p}$ is 0.5.

Confidence Interval: \(\hat{p} \pm z*\sqrt{\frac{\hat{p}(1-\hat{p})}{n}}\)

Confidence Interval for Difference in Two Population Proportions

Confidence Interval:

\[\hat{p}_1 - \hat{p}_2 \pm z*\sqrt{\frac{\hat{p}_1(1-\hat{p}_1)}{n_1} + \frac{\hat{p}_2(1-\hat{p}_2)}{n_2}}\]

Using Simulation to Calculate Confidence Intervals: Bootstrapping

A sampling distribution is constructed by assuming we can draw an infinite number of samples of size $n$ from a population, and compute the sample statistic from each sample. The distribution of the sample statistics for large number of random samples is the bootstrap sampling distribution or we use resampling.

Using Confidence Intervals to Guide Decisions

tags: mathematics - statistics