How do I interpret positive and negative lags?

If the maximum correlation occurs at a positive lag k, it means series Y tends to follow series X after k time steps. A negative lag means Y tends to lead X. A maximum at lag 0 indicates that the strongest relationship is contemporaneous.

What is the difference between cross-correlation and simple correlation?

Simple Pearson correlation only measures the linear association at lag 0: each observation in X is paired with the observation at the same time index in Y. Cross-correlation generalizes this by computing correlation at many lags, so you can detect delayed relationships between the series.

Can I use this calculator for financial trading signals?

This calculator is intended for educational and exploratory analysis. It does not incorporate statistical significance testing, non-stationarity diagnostics, or transaction costs. Always use appropriate statistical methods, risk management, and professional judgement before making trading or investment decisions.

Cross-Correlation Calculator – Time Series Correlation by Lag

Cross-correlation calculator for two time series. Paste your data, compute correlation by lag, find the maximum correlation and its lag, and get the zero-lag Pearson correlation coefficient.

Full original guide (expanded)

Cross-Correlation Calculator – Time Series Correlation by Lag

Paste two time series, choose a maximum lag, and compute the cross-correlation function to see how strongly they move together as one series is shifted in time.

Time series · Cross-correlation · Pearson r Signal processing · Econometrics · Forecasting

Cross-correlation calculator

Series X (reference)

Paste numbers separated by commas, spaces, tabs or line breaks. X is treated as the leading series.

Series Y (shifted)

Y is shifted forward/backward relative to X to compute correlation at each lag.

Maximum lag (|k| ≤)

Lags will run from −maxLag to +maxLag, limited by series length.

Normalization

Correlation coefficient (Pearson r, −1 to 1) Cross-covariance (sum over demeaned products)

Data options

Subtract mean (center each overlapping segment) Use sample standard deviation (n − 1)

Minimum 3 overlapping points per lag for a stable estimate.

Lag k	Overlapping pairs	r_XY(k)

What is cross-correlation?

Given two time series \( X_t \) and \( Y_t \), the cross-correlation function (CCF) tells you how strongly they move together when one series is shifted forward or backward in time.

For a lag \( k \), we correlate \( X_t \) with \( Y_{t+k} \). A positive lag means that Y follows X; a negative lag means that Y tends to lead X.

For a given lag \( k \), consider the overlapping pairs \((X_i, Y_{i+k})\) for which both indices are in range. Let there be \( n_k \) such pairs. The sample Pearson correlation at lag \( k \) is: \[ r_{XY}(k) = \frac{\sum_{i=1}^{n_k} (X_i - \bar X_k)(Y_{i+k} - \bar Y_k)} {\sqrt{\sum_{i=1}^{n_k} (X_i - \bar X_k)^2}\,\sqrt{\sum_{i=1}^{n_k} (Y_{i+k} - \bar Y_k)^2}} \] where \( \bar X_k \) and \( \bar Y_k \) are the means of the overlapping segments.

The calculator implements this sample correlation per lag, and also offers a cross-covariance mode where the numerator \(\sum (X_i - \bar X_k)(Y_{i+k} - \bar Y_k)\) is reported directly.

Zero-lag correlation vs. cross-correlation

The usual Pearson correlation coefficient between \( X \) and \( Y \) pairs each value at time \( t \) in the first series with the value at the same time in the second series. This is the lag 0 entry of the cross-correlation function.

Cross-correlation generalizes this idea by exploring a range of lags: \(-k_{\max}, \dots, -1, 0, 1, \dots, k_{\max}\). This is crucial when:

One series is a delayed response to the other (e.g., downstream sensor data).
You suspect lead–lag effects between markets or economic indicators.
You are aligning signals in engineering (vibration, audio, radar, seismology, etc.).

How to use this cross-correlation calculator

Paste your data. Put the first time series in the X box and the second in the Y box. They may have different lengths; the tool only uses overlapping indices for each lag.
Choose maximum lag. A conservative starting point is 10–20% of the shorter series length. Very large lags can produce noisy estimates with very few pairs.
Choose normalization. For most statistical applications, use correlation coefficient (Pearson r). Use cross-covariance if you are interested in absolute energy of the joint fluctuations.
Run the calculator and interpret. Look at the lag where the absolute value of the correlation is strongest, and compare it to the zero-lag correlation.

Interpreting the sign and lag

A positive correlation at lag \( k \) suggests that when X is above its mean, Y (shifted by \( k \)) also tends to be above its mean.
A negative correlation at lag \( k \) means that high values of X are associated with low values of Y at that lag (and vice versa).
A maximum at positive lag means X tends to lead Y (Y follows after some delay).
A maximum at negative lag means Y tends to lead X.

Remember that even a high correlation may still be due to chance or shared external drivers. Always combine cross-correlation analysis with domain knowledge, plots of the original series, and where appropriate, formal hypothesis testing.

FAQ – cross-correlation calculator

My correlation values are greater than 1 in magnitude. Is that possible?

That can happen only if you are viewing cross-covariance instead of correlation coefficients, since covariance is not bounded between −1 and 1. If you see values outside [−1, 1] while in correlation mode, check that you have enough overlapping points and that your data do not contain non-numeric values.

How many data points do I need for reliable cross-correlation?

There is no universal rule, but with very few overlapping points (e.g., n < 10 per lag) correlations become unstable and highly variable. This tool reports how many overlapping pairs are used at each lag so you can judge the robustness of each estimate.

Does this tool account for non-stationarity or trends?

No. It computes classical sample cross-correlation on the raw series segments. In serious time-series work you would normally inspect for non-stationarity, detrend or difference the data, and consider more advanced models (ARIMA, transfer functions, state-space models, etc.).

Are missing values supported?

The current implementation expects numeric values only. Any non-numeric entries will trigger an error. If your series contains missing data, remove or impute them before using the calculator.

Audit: Complete

Formula (LaTeX) + variables + units

This section shows the formulas used by the calculator engine, plus variable definitions and units.

Formula (extracted LaTeX)

\[r_{XY}(k) = \frac{\sum_{i=1}^{n_k} (X_i - \bar X_k)(Y_{i+k} - \bar Y_k)} {\sqrt{\sum_{i=1}^{n_k} (X_i - \bar X_k)^2}\,\sqrt{\sum_{i=1}^{n_k} (Y_{i+k} - \bar Y_k)^2}}\]

r_{XY}(k) = \frac{\sum_{i=1}^{n_k} (X_i - \bar X_k)(Y_{i+k} - \bar Y_k)} {\sqrt{\sum_{i=1}^{n_k} (X_i - \bar X_k)^2}\,\sqrt{\sum_{i=1}^{n_k} (Y_{i+k} - \bar Y_k)^2}}

Formula (extracted LaTeX)

\[','\\]

','\

Formula (extracted text)

For a given lag \( k \), consider the overlapping pairs \((X_i, Y_{i+k})\) for which both indices are in range. Let there be \( n_k \) such pairs. The sample Pearson correlation at lag \( k \) is: \[ r_{XY}(k) = \frac{\sum_{i=1}^{n_k} (X_i - \bar X_k)(Y_{i+k} - \bar Y_k)} {\sqrt{\sum_{i=1}^{n_k} (X_i - \bar X_k)^2}\,\sqrt{\sum_{i=1}^{n_k} (Y_{i+k} - \bar Y_k)^2}} \] where \( \bar X_k \) and \( \bar Y_k \) are the means of the overlapping segments.

Variables and units

No variables provided in audit spec.

Sources (authoritative):

NIST — Weights and measures — nist.gov · Accessed 2026-01-19
https://www.nist.gov/pml/weights-and-measures
FTC — Consumer advice — consumer.ftc.gov · Accessed 2026-01-19
https://consumer.ftc.gov/

Changelog

Version: 0.1.0-draft
Last code update: 2026-01-19

0.1.0-draft · 2026-01-19

Initial audit spec draft generated from HTML extraction (review required).
Verify formulas match the calculator engine and convert any text-only formulas to LaTeX.
Confirm sources are authoritative and relevant to the calculator methodology.

Verified by Ugo Candido on 2026-01-19
Profile · LinkedIn

Best practices for cross-correlation analysis

Visualize both raw series first to check for obvious trends, seasonality, and outliers.
Consider detrending or differencing non-stationary series before interpreting the CCF.
Limit the maximum lag so that each lag still has enough overlapping points.
Use cross-correlation as a screening tool, then confirm findings with formal models and tests.
In engineering applications, compare delays from cross-correlation with known physical system timing.

Formulas

(Formulas preserved from original page content, if present.)

Version 0.1.0-draft

Citations

Add authoritative sources relevant to this calculator (standards bodies, manuals, official docs).

Changelog

0.1.0-draft — 2026-01-19: Initial draft (review required).