Number of observations
Count every recorded value once.
Statistics + Probability Formula Sheet
A searchable K-12 statistics and probability formula bank for data, averages, spread, probability, combinatorics, distributions, inference, regression, sampling, and exam-style helpers.
Use the table of contents to jump to the unit you are studying.
Choose a grade band to narrow the bank to your course level.
Type a formula name, symbol, distribution, or keyword.
Use the note to confirm the formula context before applying it.
Unit 1
Count every recorded value once.
Used in tally charts and frequency tables.
Add all class or category frequencies.
Count all observations in a category.
Used in simple bar-chart comparison.
Used in pictographs and simple tables.
Assume \(a>b\).
Assume \(b>a\).
Example: 1 icon represents 5 students.
Fraction of the total.
Used in tables and bar charts.
Running total up to class \(k\).
Running proportion.
Running percent.
Used for circle graphs.
Percent for one category.
Reads scaled bar charts.
Each mark represents one value.
Used before calculating a mean from a frequency table.
Used for histograms with unequal widths.
Area of a histogram bar represents frequency.
Upper class boundary minus lower class boundary.
Used for grouped-data estimates.
Total observations in grouped data.
Approximate total using midpoints.
Plot class midpoint against frequency.
Plot midpoint against relative frequency.
Plot upper boundary against cumulative frequency.
Used to estimate percentiles.
Each leaf usually represents one observation.
Preferred for continuous grouped data.
Used with scaled graphs.
General measurement-statistics link.
Alternative symbol for histogram density.
Distribution share for class \(i\).
Used in grouped tables.
Used in table completion.
Weighted sum from values and frequencies.
Approximation used for grouped statistics.
Locates median, quartile, or percentile class.
Cumulative count below upper boundary.
Cumulative count at or above lower boundary.
Recover count from a pie-chart angle.
Unit 2
Average of raw data.
Elementary form.
Useful for missing-value problems.
Add value \(a\).
Remove value \(a\).
If \(n\) is odd, this is one data position.
After sorting.
After sorting.
Can be none, one, or multiple.
Sometimes used in early data work.
Mean from frequency table.
Use class midpoints \(m_i\).
Weights may be credits, marks, or frequencies.
For two groups.
For several groups.
One missing value.
One missing frequency at value \(x_m\).
Grouped-data median class.
Grouped-data mode class.
\(L\)=lower boundary, \(CF\)=cumulative frequency before median class.
\(f_1\)=modal class frequency.
Approximate relation for moderately skewed data.
\(d_i=x_i-A\).
\(u_i=\frac{x_i-A}{h}\).
Remove \(k\) smallest and \(k\) largest values.
Extreme values are capped.
For positive values, growth factors, rates.
Positive values only.
Useful for average rates.
Positive values only.
Positive values only.
Population parameter.
Sample statistic.
Discrete random variable.
Continuous random variable.
Deviations from the mean sum to zero.
Used in grouped or ANOVA settings.
Equivalent to combined mean.
Unit 3
Lowest observation.
Highest observation.
One common school convention.
One common school convention.
One common convention.
One common convention.
Middle quartile.
Middle 50 percent spread.
Also called quartile deviation.
Used for box plots.
Box-plot outlier rule.
Box-plot outlier rule.
Extreme outlier rule.
Extreme outlier rule.
For \(k=1,2,3\).
For \(k=1,\dots,9\).
For \(k=1,\dots,99\).
Common rank formula.
Another school convention.
Population standard score.
Uses sample mean and sample standard deviation.
Common standardized score scale.
Example standard-score scale.
Usually rounded and bounded from 1 to 9.
\(\Phi\) is standard normal CDF.
Reverse standardization.
Reverse standardization.
Transforms to mean 0 and standard deviation 1.
For \(k>1\).
For \(k>0\).
Approximately normal data.
Approximately normal data.
Approximately normal data.
For a normal distribution.
For a normal distribution.
Unit 4
Basic spread.
Individual difference from mean.
Distance from mean.
Average absolute distance from mean.
Frequency-table version.
Uses class midpoints.
Population parameter.
Population spread.
Unbiased sample variance.
Sample spread.
Frequency table.
Frequency table sample version.
Shortcut formula.
Shortcut formula.
Population-style denominator.
Grouped sample estimate.
\(d_i=x_i-A\).
Population-style.
\(u_i=(x_i-A)/h\).
Sometimes used in applied statistics.
Relative quartile spread.
Average may be mean or median.
Sample version.
Population version.
Equivalent to sample CV.
Equal-variance two-sample procedures.
Random variable.
Population random-variable spread.
Adding constant does not change variance.
Scaling by \(a\).
Scaling rule.
General linear transformation.
Transformation of center.
For independent \(X,Y\).
For independent \(X,Y\).
General rule.
General rule.
Population covariance.
Shortcut.
Sample covariance.
Population covariance.
Unit 5
Every value becomes \(x+c\).
Every value becomes \(x+c\).
Spread unchanged.
Every value becomes \(ax\).
For \(a>0\).
Spread scales by \(\lvert a\rvert\).
For \(y=ax+b\).
For \(y=ax+b\).
For \(y=ax+b\).
Standardized values have mean 0 and SD 1.
For standardizing by sample mean and SD.
For standardizing by sample mean and SD.
Maps \(x\) scale to new scale.
Maps values to 0–1 range.
Used for marks and scores.
Base usually equals 100.
Decimal change.
Percent change.
Unit 6
One bivariate observation.
Observed minus predicted.
Simple linear prediction.
Used for trend lines.
For line through a known point.
Predict \(y\) from \(x\).
Least-squares slope.
Line passes through \((\bar{x},\bar{y})\).
Pearson correlation.
Sample correlation.
Population parameter.
Sample version.
Regression of \(y\) on \(x\).
Simple linear regression.
Total variation in \(y\).
Unexplained variation.
Explained variation.
For regression with intercept.
Regression fit measure.
Simple linear regression.
Typical residual size.
Prediction error metric.
Prediction error metric.
Requires nonzero \(y_i\).
Simple school-level form.
Inference for regression slope.
Inference for intercept.
Usually test \(H_0:\beta=0\).
Simple linear regression.
Simple linear regression.
CI for mean response.
PI for individual response.
At \(x=x_0\).
At \(x=x_0\).
No tied ranks formula.
\(C\)=concordant pairs, \(D\)=discordant pairs.
For \(2\times2\) table.
\(k=\min(r,c)\).
\(p\)=number of predictors.
Unit 7
Elementary probability.
All probabilities lie between 0 and 1.
Never occurs.
Sample space occurs.
Probability of not \(A\).
Convert probability to percent.
Experimental probability.
Often written as a ratio.
Often written as a ratio.
Odds in favour \(a:b\).
General addition rule.
When \(A\cap B=\varnothing\).
Independent events.
Requires \(P(B)>0\).
General rule.
General rule.
Definition of independence.
Not \(A\) or \(B\).
Very common shortcut.
Two-event case.
Counting version.
Counting version.
Inclusion-exclusion.
Counting version.
Partition \(B,B^c\).
\(\{B_i\}\) is a partition.
Common diagnostic form.
General partition form.
Set identity.
Set identity.
Not both.
Given \(B\).
If \(A\) and \(B\) are independent.
Two-way table.
Row or column total over grand total.
Restrict denominator to \(B\).
Expected count from probability.
Used in applied statistics.
For \(2\times2\) table with cells \(a,b,c,d\).
Mutual independence.
General multiplication rule.
Lower bound.
Boole's inequality.
Boole's inequality.
Given event \(C\).
Unit 8
For disjoint choices.
For sequential choices.
With \(0!=1\).
All objects used.
Order matters.
Order does not matter.
Choose \(r\) or leave \(n-r\).
Pascal identity.
\(r\) choices from \(n\) options each time.
Stars and bars.
Repeated types.
Rotations considered identical.
For reversible necklaces, \(n>2\).
Same as \(P(N,n)\).
Simple random samples.
Repeated choices allowed.
Multisets.
All subsets of an \(n\)-element set.
Excludes the full set.
Number of ways to choose success positions.
Counts category allocations.
Used in binomial probabilities.
Sum over \(n_1+\cdots+n_k=n\).
General finite version.
Unit 9
Weighted average of outcomes.
Population variance of \(X\).
Equivalent form.
Spread of random variable.
Discrete case.
Always true.
Always true.
Always true.
Requires independence.
Single success/failure trial.
Success probability.
Also \(pq\).
\(X\sim B(n,p)\).
\(X\sim B(n,p)\).
\(X\sim B(n,p)\).
\(X\sim B(n,p)\).
At least one success.
CDF form.
Upper-tail probability.
Trials until first success.
Trials until first success.
Trials until first success.
First success by trial \(k\).
No success in first \(k\) trials.
Trial of \(r\)th success is \(k\).
Trials until \(r\) successes.
Trials until \(r\) successes.
Sampling without replacement.
Population successes \(K\), population size \(N\).
Includes finite population correction.
Counts in fixed interval.
\(X\sim Pois(\lambda)\).
\(X\sim Pois(\lambda)\).
\(X\sim Pois(\lambda)\).
No events.
At least one event.
Rate \(r\) over time/space \(t\).
Counts across \(k\) categories.
For category \(i\).
For category \(i\).
For \(i e j\).
Unit 10
Area under density.
Valid PDF.
Mean.
Variance.
With \(E(X^2)=\int x^2f(x)\,dx\).
Cumulative distribution function.
Where differentiable.
Continuous variables.
\(X\sim U(a,b)\).
Continuous uniform.
Continuous uniform.
Continuous uniform.
\(X\sim N(\mu,\sigma^2)\).
\(Z\sim N(0,1)\).
Use standard normal tables/calculator.
Normal interval probability.
Upper tail.
Lower tail.
Where \(\Phi(z_p)=p\).
Waiting-time model.
\(x\ge0\).
Right-tail probability.
Average waiting time.
Waiting-time variance.
For exponential \(X\).
Advanced K–12/AP extension.
Rate parameterization.
Rate parameterization.
Shape-scale view.
\(Z\sim N(0,1)\), \(V\sim\chi^2_ u\).
\(U,V\) independent chi-square variables.
Unit 11
\(x\)=success count.
Percent form.
Population success fraction.
Sample size over population size.
For sampling without replacement.
Unbiased estimator.
Known population SD.
Unknown population SD.
For normal population or large \(n\).
Known \(\sigma\).
Unbiased estimator.
Uses true \(p\).
Uses sample \(\hat{p}\).
Large-sample approximation.
For tests.
For \(X\sim B(n,p)\).
Linearity.
Independent variables.
Known \(\sigma\).
Two independent samples.
Known population SDs.
Unknown population SDs.
Two independent samples.
For confidence intervals with true values.
For confidence intervals.
For two-proportion test under \(H_0:p_1=p_2\).
For two-proportion z test.
Sampling without replacement.
Round up.
Use \(p=0.5\) if unknown.
Survey sampling.
Survey sampling.
Survey quality metric.
Use decimal response rate.
Unit 12
Core inference structure.
Half-width of interval.
Lower endpoint.
Upper endpoint.
Known \(\sigma\).
Unknown \(\sigma\).
For one mean.
Large-sample interval.
Approximate interval adjustment.
Approximate adjusted interval.
Known SDs.
Unknown SDs.
Approximate \(df\).
Assumes equal variances.
Equal-variance methods.
Equal-variance method.
Differences within pairs.
Number of pairs minus 1.
Independent samples.
For normal population.
Square-root variance interval.
Transform \(r\).
Correlation interval.
Convert back to \(r\).
From \(z\) scale to correlation.
Approximate simulation-based interval.
\(C\)=confidence level.
Standard normal.
Upper one-sided interval context.
Unit 13
Core idea.
Right-tail test.
Left-tail test.
For symmetric test statistics.
Significance level.
Missed detection.
Probability of correctly rejecting false \(H_0\).
Known \(\sigma\).
Unknown \(\sigma\).
One mean.
Use null value in SE.
Known SDs.
Welch version.
Equal variances.
Matched pairs.
Uses pooled estimate for \(H_0:p_1=p_2\).
Categorical distribution test.
\(m\)=estimated parameters.
Expected count under \(H_0\).
Two-way table.
Independence test.
\(r\) rows, \(c\) columns.
Same as independence.
Between-group over within-group variation.
ANOVA.
ANOVA.
ANOVA total variation.
One-way ANOVA.
\(k\)=number of groups.
\(N\)=total observations.
Between groups.
Within groups.
Often put larger variance on top.
F test.
Test \(H_0:\rho=0\).
Pearson correlation.
Simple linear regression.
Simple linear regression.
Under median null, no ties.
Paired categorical data.
Approximate correction.
Unit 14
General rate formula.
Basic categorical percentage.
Applied school statistics.
Applied school statistics.
True positive rate.
True negative rate.
Equals \(1-\text{specificity}\).
Equals \(1-\text{sensitivity}\).
Precision.
Negative prediction accuracy.
Overall correct rate.
Overall incorrect rate.
Same as PPV.
Same as sensitivity.
Classification metric.
Difference in proportions.
Ratio of risks.
Probability to odds.
Odds to probability.
Comparing two groups.
Used in logistic models.
Basic logistic model.
Applied inference.
Categorical applications.
Unit 15
Change in value.
Decimal change.
If new value is greater.
If new value is smaller.
\(k\)-period moving average.
Used for even seasonal periods.
Ratio form.
Average compound growth.
Base period \(0\).
Individual item index.
Unweighted.
Weights \(w_i\).
Base quantities.
Current quantities.
Geometric mean of Laspeyres and Paasche.
Base prices.
Current prices.
Seasonality as percentage.
Multiplicative model.
Trend, seasonal, cyclical, irregular.
Product model.
\(0<\alpha<1\).
Actual minus forecast.
Bias metric.
Error metric.
Unit 16
Sampling fraction.
Share assigned to group \(i\).
Random allocation expectation.
Proportional allocation.
Choose every \(k\)th member.
Total sampled individuals.
Monte Carlo estimate.
\(R\)=simulation repetitions.
Compare groups.
Advanced stratified allocation.
Survey weights.
\(W_h=N_h/N\).
Estimator quality.
Estimator quality.
Simulation inference.
Randomization distribution.
Bootstrap resampling.
SD of bootstrap statistics.
Unit 17
Measurement error.
Dimensionless error.
Percent error.
Rounded to nearest unit \(u\).
Rounded to nearest unit \(u\).
Rounded to nearest unit \(u\).
Forecast/approximation error.
Difference between two series.
Measurement repeated trials.
Independent uncertainties.
Independent uncertainties.
For \(z=xy\), independent.
For \(z=x/y\), independent.
Prediction or measurement errors.
Average squared error.
Average measurement error.
Lower \(s\) means higher precision.
Basic form.
Repeated-measurement agreement.
Agreement analysis extension.
Unit 18
Raw moment.
Moment about mean.
Shape measure.
Common computational form.
Shape measure.
Normal distribution has 0 excess kurtosis.
Advanced extension.
If MGF exists.
If MGF exists.
Discrete nonnegative integer variables.
If PGF exists.
If PGF exists.
Advanced probability.
Advanced probability.
For nonnegative \(X\).
When conditions are suitable.
For normal approximation.
For normal approximation.
When \(n\) large and \(p\) small.
When \(\lambda\) is large.
Waiting time to first Poisson event.
Advanced extension.
Beta distribution.
Beta distribution.
Advanced modeling extension.
Sample proportion.
Sample mean.
Model comparison extension.
Model comparison extension.
Unit 19
Weighted score calculation.
If \(s_i\) are proportions.
School data use.
Education statistics.
Percent form.
Education data.
Education data.
School data.
Convert to desired scale.
Weighted components.
Simple version.
Lower may be better depending context.
Compound growth.
Rule of 70.
For exponential decay percent rate.
Compound growth.
Compound decay.
Advanced data applications.
Data applications.
Data applications.
Standard deviation of returns.
\(m\)=periods per year.
It includes 597 K-12 statistics and probability formulas covering data displays, averages, spread, probability rules, combinatorics, distributions, inference, regression, sampling, errors, time series, and senior secondary extensions.
Yes. The bank is organized for broad global K-12 revision and includes formulas commonly seen across AP Statistics, IB Mathematics, GCSE and IGCSE, CBSE or NCERT, Common Core, and senior secondary pathways.
Some advanced formulas appear in AP, IB, A-Level-style courses, optional senior secondary statistics topics, or extension work. The grade-band labels help students choose the right level.
No. Students should follow their teacher, syllabus, and exam-board formula sheet. This page is designed as a searchable revision and reference bank.