What is a good Deflated Sharpe Ratio?

Above 0.95, meaning there is less than 5% probability that the observed Sharpe is explained by selection bias alone. This is analogous to p < 0.05 in classical hypothesis testing.

Why does the number of strategies tested matter for the Sharpe Ratio?

If you test 100 strategies with zero true alpha, the best one will still have a positive Sharpe by pure chance. The more strategies tested, the higher the expected maximum SR under the null. The Deflated Sharpe Ratio adjusts for this inflation.

How do skewness and kurtosis affect the Deflated Sharpe Ratio?

Negative skewness (left-tail risk) and excess kurtosis (fat tails) increase the standard error of the Sharpe Ratio estimator. This makes the DSR more conservative, because non-normal returns inflate traditional Sharpe calculations.

What is the difference between PSR and DSR?

The Probabilistic Sharpe Ratio (PSR) tests whether the observed SR exceeds any fixed threshold. The Deflated Sharpe Ratio (DSR) is a special case that sets the threshold to the expected maximum SR under the null hypothesis of zero skill across all trials tested.

How many trading days are needed for a reliable Deflated Sharpe Ratio?

Bailey and López de Prado recommend at least 2 years of daily data (approximately 500 trading days) for robust inference. With fewer than 100 days, even a Sharpe of 2.0 may not reach significance if many strategies were tested.

QUANTITATIVE TOOL

Deflated Sharpe Ratio

Name: Deflated Sharpe Ratio Calculator
Author: AuditZK

Bailey & López de Prado (2014). Corrects the observed Sharpe Ratio for multiple testing, non-normal returns, and short samples. The metric that separates real alpha from backtest overfitting.

All tools

— WHY IT MATTERS

Most Sharpe Ratios are inflated

When you test 20 strategies and pick the best one, the winner's Sharpe Ratio is biased upward. The more strategies you test, the higher the expected maximum Sharpe — even if all strategies have zero true skill. The Deflated Sharpe Ratio quantifies this bias and tells you the probability that your observed Sharpe is genuinely above what you'd expect by chance.

— CALCULATOR

Compute your Deflated Sharpe Ratio

Your best Sharpe Ratio (annualized)

The annualized Sharpe of the strategy you picked. This is usually your best-performing backtest or live result.

Return frequency

Sample length (trading days)

Total trading days in the backtest. E.g. 1 year ≈ 252, 2 years ≈ 504.

How many strategies did you try?

Count every strategy, parameter set, or variation you tested before picking this one.

Return asymmetry (skewness)

0 = symmetric. Negative = occasional large losses. Most strategies: between −1 and 0.

Tail risk (kurtosis)

3 = normal distribution. Above 3 = more extreme days than expected. Most strategies: between 3 and 6.

How similar are your strategies?

0 = completely different strategies. 1 = all basically the same. If unsure, 0.2–0.5 is a reasonable guess.

— RESULTS

Deflated Sharpe Ratio

Probability that the true Sharpe exceeds SR₀ (the expected maximum under null). Above 0.95 = statistically significant.

73.31%

0%95% threshold100%

Verdict

Not significant. The observed Sharpe is likely explained by multiple testing alone.

Threshold SR₀

1.0461

Expected maximum Sharpe across N trials if all strategies have zero true skill.

Effective independent trials

8.2

Number of strategies adjusted for correlation.

Z-score

0.6221

Standard deviations above the threshold.

SR standard error

0.7296

Estimation uncertainty of the Sharpe Ratio.

— METHODOLOGY

How it works

Adjust for correlation

Correlated strategies are not independent tests. We compute the effective number of trials: N_eff = N(1 − ρ̄) + ρ̄. Perfectly correlated strategies collapse to a single trial.

Expected maximum under null

Using the Euler-Mascheroni correction, we compute SR₀ — the Sharpe you'd expect from the best of N_eff strategies, assuming all have zero true skill: SR₀ ≈ √(1/(T−1)) × [(1−γ)Φ⁻¹(1−1/N) + γΦ⁻¹(1−1/(Ne))].

Probabilistic Sharpe Ratio

The PSR incorporates skewness and kurtosis into the standard error of the SR estimator, then computes the probability of exceeding the threshold: DSR = Φ[(ŜR − SR₀)√(T−1) / √(1 − γ₃ŜR + ((γ₄−1)/4)ŜR²)].

— FORMULAS

Mathematical definitions

Deflated Sharpe Ratio

DSR = Φ[(ŜR − SR₀)√(T−1) / √(1 − γ₃ŜR + ((γ₄−1)/4)ŜR²)]

Probability that the true SR exceeds the selection-bias threshold SR₀. Values above 0.95 indicate genuine skill at 95% confidence.

Threshold SR₀

SR₀ ≈ √V[ŜR] × [(1−γ)Φ⁻¹(1−1/N) + γΦ⁻¹(1−1/(Ne))]

Expected maximum Sharpe from N independent trials under null. γ = 0.5772 (Euler-Mascheroni). Grows as √(2·ln(N)) for large N.

SR Standard Error

σ(ŜR) = √[(1 − γ₃ŜR + ((γ₄−1)/4)ŜR²) / (T−1)]

Accounts for non-normality. Negative skewness and fat tails increase estimation uncertainty.

Effective Trials

N_eff = N(1 − ρ̄) + ρ̄

Adjusts for correlation between strategies. ρ̄ = 0 gives N independent trials; ρ̄ = 1 collapses to 1 trial.

— REFERENCE

Source paper

Bailey, D.H. & López de Prado, M. (2014). "The Deflated Sharpe Ratio: Correcting for Selection Bias, Backtest Overfitting, and Non-Normality." The Journal of Portfolio Management, 40(5), 94–107.

— FAQ

Frequently asked questions

Get started

Automate your performance analytics

AuditZK computes Sharpe, Sortino, drawdown, VaR, Monte Carlo, and rolling risk metrics from verified exchange data. No manual input. No self-reported numbers.

Get started free See methodology