How do you know if forecasts are any good? We built performance tracking into the core of Nordict, not as an afterthought, but as the foundation of trust.
Performance Dashboard
Last 30 days • Model v2.4.1
Directional Accuracy
68%
+2.1%Calibration Score
0.92
+0.04Avg Confidence
71%
-1.3%Forecasts Made
12,847
+847Rolling Accuracy
7-day MA
Evaluation metrics
We track multiple metrics because no single number tells the whole story. Each measures a different aspect of forecast quality.
Walk-forward validation
Walk-forward validation is the gold standard for evaluating time-series forecasts. It simulates how the model would have performed if deployed in the past.
The model never sees validation data during training. This mimics real conditions where you can't peek at tomorrow's prices.
Instead of one backtest, we run many across different market conditions bull, bear, sideways, volatile.
We include transaction costs, slippage estimates, and data delays. Results aren't idealized.
Performance is broken down by market regime so you know where the model excels and where it struggles.
Historical performance views
Performance isn't static. See how forecasts have performed over different time periods and market conditions.
Rolling 30-day accuracy vs 50% baseline
Performance varies by conditions
Trending Up
3,247 forecasts
74%
Trending Down
2,891 forecasts
71%
Ranging
4,102 forecasts
62%
High Volatility
1,847 forecasts
58%
Performance is strongest in trending markets and weaker during high volatility. This is expected—and honestly reported.
Live vs backtest tracking
Backtest results and live performance are tracked separately, and clearly labeled. No mixing, no confusion about what's real.
Directional Accuracy
Backtest
69%
Live
68%
Delta
-1%Calibration Score
Backtest
0.91
Live
0.92
Delta
+0.01Brier Score
Backtest
0.17
Live
0.18
Delta
+0.0175% Band Coverage
Backtest
74%
Live
76%
Delta
+2%Live performance aligned with backtest
All metrics within expected variance
Backtest
Complete historical data, no gaps
Live
Real-time feeds with occasional delays
Backtest
Known regimes, can stratify analysis
Live
Unknown future, regime shifts possible
Backtest
Assumed perfect execution
Live
Actual latency and slippage