What Is Signal Discovery and Why It Should Come Before Position Sizing and Risk Optimisation

Why the sequence matters — and why every mainstream platform gets it backwards

Student One Research · · 7 min read

signal discoveryposition sizingrisk optimisationquant methodology

Signal discovery is the process of determining whether a specific indicator configuration — a particular RSI period, a particular MACD parameter set, a particular Bollinger Band width — detects a real market inefficiency or is merely fitting to noise. It is the first step in quantitative research, and it must be completed before you touch position sizing or risk optimisation. Every mainstream backtesting platform inverts this order, and that inversion is why most algo strategies fail.

Definition: What Signal Discovery Actually Is

Signal discovery answers one question: "Does this configuration have predictive power, or would random data produce similar results?"

This is a hypothesis test. The null hypothesis is that the observed performance is consistent with chance. The alternative hypothesis is that the configuration detects a genuine, exploitable pattern in price data.

To answer this question rigorously, you need:

  • Exhaustive enumeration — testing every configuration in the parameter space, not a gradient-descent search for the "best" one
  • A null distribution — typically built by permutation testing (shuffling returns to destroy temporal structure while preserving marginal distribution)
  • Multiple-testing correction — because when you test 100,000 configurations, some will "work" by chance alone. Benjamini-Hochberg FDR or similar corrections are mandatory.
  • Walk-forward validation — the configuration must survive on data it was never trained on
  • Concentration analysis — returns must not come from a single lucky trade or a single regime

Why Signal Discovery Must Come First

Signal discovery must happen before position sizing and before risk optimisation because:

1. Position sizing contaminates the hypothesis test

If you test a signal with Kelly sizing or fixed-fractional allocation, the performance metric is a function of both the signal quality and the sizing model. A weak signal with aggressive sizing can look identical to a strong signal with conservative sizing over finite samples. You cannot distinguish between them.

2. Stop losses create survivorship bias in parameter selection

A tight stop loss will kill many configurations that have genuine predictive power but express it through volatile entry timing. A loose stop allows configurations to "survive" that have no real edge but happen to avoid being stopped out on the specific historical path. Neither tells you about the signal.

3. Exit rules add degrees of freedom that dilute statistical power

Every exit parameter you add (take-profit distance, trailing stop factor, time-based exit) multiplies the number of hypotheses being simultaneously tested. Your FDR correction must account for all of them. When signal discovery and exit optimization happen together, the effective number of comparisons explodes and nothing passes the corrected threshold.

4. The research question changes

When you combine all three, you're no longer asking "does RSI(14) detect mean reversion in EURUSD on 1h bars?" You're asking "does RSI(14) with 2% stop, 1:2 RR, and Kelly sizing produce positive PnL on EURUSD 1h from Jan 2020 to Dec 2025?" The second question is useless — it's testing one specific historical path, not a market property.

The Correct Sequence

StageQuestionOutput
1. Signal DiscoveryDoes this configuration detect a real edge?Surviving configurations with statistical evidence
2. Position SizingHow much capital per trade?Allocation model calibrated to confirmed edge
3. Risk ManagementHow do we bound drawdowns?Stop/exit framework built around known signal properties

Each stage takes the output of the previous stage as input. Signal discovery produces candidates. Position sizing produces allocation. Risk management produces the execution plan. Doing them in parallel or in reverse order is not "efficient" — it's methodologically invalid.

What Exhaustive Enumeration Looks Like

Student One's approach: take a single indicator family (e.g., RSI) across its full parameter range (periods 2 through 14,000), on a canonical 1-minute axis, across the full date range. Test every single configuration. Subject each to walk-forward survival, permutation null with FDR correction, concentration gates, and automatic OOS split. Only configurations that survive every gate simultaneously are declared "discovered signals."

This is computationally expensive — millions of permutation tests per sweep. That's why Student One offers 10 million free permutation tests per month: because the correct methodology shouldn't be gated behind a $500/month platform subscription.

Key Takeaway

If you're "backtesting a strategy" and adjusting entries, exits, stops, and sizing in the same loop, you're not doing signal discovery. You're curve-fitting. Signal discovery is a distinct, isolated statistical process that must complete before anything else begins. The platforms that combine everything into one pass aren't giving you a shortcut — they're preventing you from ever finding a real edge.