What is the G-test (likelihood-ratio test)?

The G-test is a likelihood-ratio test for categorical data. It compares observed counts with expected counts under a null hypothesis (such as a theoretical distribution or independence in a contingency table). The test statistic G = 2 * sum(O * ln(O/E)) is approximately chi-square distributed with appropriate degrees of freedom when sample sizes are not too small.

When should I use the G-test instead of the chi-square test?

Both the G-test and the Pearson chi-square test are asymptotically equivalent for large samples. The G-test is derived from likelihood theory and is often preferred in log-linear modeling or when likelihood methods are used elsewhere in the analysis. For small samples or cells with very low expected counts, exact or simulation-based methods may be more appropriate than either test.

What are the assumptions of the G-test?

Assumptions include: (1) observations are independent, (2) counts represent frequencies in mutually exclusive categories, and (3) expected cell counts are sufficiently large for the chi-square approximation to be reliable (a common rule of thumb is that most expected counts should be at least 5).

How do I interpret the p-value of a G-test?

The p-value is the probability, under the null hypothesis, of obtaining a G statistic at least as extreme as the one observed. A small p-value (e.g. below 0.05) suggests that the observed pattern of counts is unlikely under the null hypothesis and provides evidence against it. The exact threshold and interpretation should follow your study design and field standards.

Is this G-test related to prenatal screening tests called G-test?

No. This calculator implements the statistical G-test (likelihood-ratio test) for categorical data. It is not a medical or prenatal screening test and does not provide any clinical diagnosis or risk estimate.

G-test Calculator (Likelihood-Ratio Test)

G-test (likelihood-ratio) calculator for categorical data. Compute G statistic, degrees of freedom and p-value for goodness-of-fit and contingency tables, with step-by-step explanation and assumptions.

Full original guide (expanded)

G-test Calculator (Likelihood-Ratio Test)

This calculator implements the statistical G-test (likelihood-ratio test) for categorical data. It is not related to medical or prenatal screening products sometimes called “G-test”.

Use it to compute the G statistic, degrees of freedom and p-value for goodness-of-fit problems and contingency tables (tests of independence).

G-test calculator

Test type

All inputs are counts (frequencies). Use comma or dot as decimal separator; counts are rounded to non-negative numbers.

Number of categories

Between 2 and 10 categories.

Expected pattern under H₀

Equal probabilities (uniform) Custom expected counts

For “custom expected”, the calculator rescales your expected counts to match the total sample size.

Observed counts

Parameters estimated from data

df = k − 1 − parameters. For many textbook problems this is 0.

Context / variable (optional)

Output uses the chi-square approximation to obtain the p-value.

G-test formula (likelihood-ratio test)

The G-test compares observed counts with expected counts under a null hypothesis. The test statistic is

\( G = 2 \sum_i O_i \ln\left(\dfrac{O_i}{E_i}\right) \)

\( O_i \) = observed count in cell i
\( E_i \) = expected count in cell i under the null hypothesis

For contingency tables, the sum runs over all cells \((i, j)\) and the same formula applies:

\( G = 2 \sum_{i,j} O_{ij} \ln\left(\dfrac{O_{ij}}{E_{ij}}\right) \)

Under standard conditions (independent observations, not-too-small expected counts), G is approximately chi-square distributed with the same degrees of freedom as the corresponding chi-square test.

G-test for goodness-of-fit

In a goodness-of-fit setting you test whether observed frequencies follow a specific theoretical distribution (uniform, binomial, Poisson, etc.) or specified probabilities. You provide:

observed counts \( O_1, \dots, O_k \)
expected pattern under the null \( E_1, \dots, E_k \) (or probabilities)

The degrees of freedom are typically \( k - 1 - m \), where \( k \) is the number of categories and \( m \) is the number of parameters estimated from the data (for example, estimating a mean or rate).

G-test of independence (contingency table)

For an \( r \times c \) contingency table of counts, the null hypothesis is that the row and column variables are independent. The expected count in cell \((i,j)\) is

\( E_{ij} = \dfrac{(\text{row total}_i) \cdot (\text{column total}_j)}{\text{grand total}} \)

The degrees of freedom are \( \text{df} = (r - 1)(c - 1) \), exactly as for the Pearson chi-square test of independence.

G-test vs chi-square test

The Pearson chi-square test uses the statistic \( \chi^2 = \sum (O - E)^2 / E \), whereas the G-test uses the likelihood-ratio statistic \( G = 2 \sum O \ln(O/E) \). For large samples they are asymptotically equivalent and usually lead to very similar p-values.

The G-test arises naturally from likelihood-ratio principles and is convenient in log-linear models and generalized linear modeling.
The chi-square test is more familiar and historically widespread in introductory courses.
For small samples, both tests rely on approximations; exact or Monte Carlo methods are often recommended.

Assumptions and common pitfalls

Independence: Each observation contributes to exactly one cell and does not depend on other observations.
Expected counts: Expected cell counts should not be too small (rules of thumb often require most \( E_i \) ≥ 5).
Sparse tables: Many structural zeros or very sparse tables may violate the chi-square approximation; consider exact tests or specialized modeling.

Important: This page concerns the statistical G-test for categorical data. It is not intended for medical diagnostics or prenatal screening and does not replace professional advice.

G-test – frequently asked questions

Audit: Complete

Formula (LaTeX) + variables + units

This section shows the formulas used by the calculator engine, plus variable definitions and units.

Formula (extracted LaTeX)

\[','\\]

','\

Formula (extracted text)

\( G = 2 \sum_i O_i \ln\left(\dfrac{O_i}{E_i}\right) \) \( O_i \) = observed count in cell i \( E_i \) = expected count in cell i under the null hypothesis

Formula (extracted text)

\( G = 2 \sum_{i,j} O_{ij} \ln\left(\dfrac{O_{ij}}{E_{ij}}\right) \)

Formula (extracted text)

\( E_{ij} = \dfrac{(\text{row total}_i) \cdot (\text{column total}_j)}{\text{grand total}} \)

Variables and units

No variables provided in audit spec.

Sources (authoritative):

NIST — Weights and measures — nist.gov · Accessed 2026-01-19
https://www.nist.gov/pml/weights-and-measures
FTC — Consumer advice — consumer.ftc.gov · Accessed 2026-01-19
https://consumer.ftc.gov/

Changelog

Version: 0.1.0-draft
Last code update: 2026-01-19

0.1.0-draft · 2026-01-19

Initial audit spec draft generated from HTML extraction (review required).
Verify formulas match the calculator engine and convert any text-only formulas to LaTeX.
Confirm sources are authoritative and relevant to the calculator methodology.

Verified by Ugo Candido on 2026-01-19
Profile · LinkedIn

Formulas

(Formulas preserved from original page content, if present.)

Version 0.1.0-draft

Citations

Add authoritative sources relevant to this calculator (standards bodies, manuals, official docs).

Changelog

0.1.0-draft — 2026-01-19: Initial draft (review required).

G-test Calculator (Likelihood-Ratio Test)

Full original guide (expanded)

G-test Calculator (Likelihood-Ratio Test)

G-test calculator

Test statistic

Details

Step-by-step calculation

G-test formula (likelihood-ratio test)

G-test for goodness-of-fit

G-test of independence (contingency table)

G-test vs chi-square test

Assumptions and common pitfalls

G-test – frequently asked questions