A strategy that works on SPY, but not quite QQQ, overfitting?

ShimingHe · June 26, 2023, 9:25am

Say that I have developed an intraday strategy. And I found that it achieves a Sharpe ratio of 1.7 on SPY but only 0.5 on QQQ. What do you call this? Overfit?

I can further tune some parameters, so that the Sharpe of SPY goes to 1.1 and QQQ goes to 1.0. Is it now better or worse than before?

I think the reason the algo performed better on SPY is that sector rotation often causes mean reversion so that you can short at a high point and long at a low point. QQQ is one sector. QQQ goes one direction intraday and will not look back. The performance is related to the rotation and timing behavior I’m trying to model after.

Monte Carlo is not going to help because the traits I’m trying to identify only exist in real data.

What options do I have for validating this algo and calling it good?

GardCap · June 26, 2023, 10:04am

Does it work on DIA or IWM?

ShimingHe · June 26, 2023, 10:13am

Haven’t got the minute bars to test yet. But according to my past experience, DIA is generally similar to SPY. IWM on the other hand may have a worse Sharpe than QQQ.

QuantTiger · June 26, 2023, 2:32pm

What about trying walk forward testing on just SPY? What is a Walk-Forward Optimization and How to Run It? - AlgoTrading101 Blog

ShimingHe · June 26, 2023, 2:45pm

I don’t have a technical implementation of walk-forward. But I noticed that the Sharpe stay relatively stable (except for Mar 2020, during which the return is high) in random one-year windows.

It’s not machine learning, I couldn’t really take out any rules. Parameters also haven’t been optimized to the extremes. The histograms of daily low/highs have been stable over the years. Daily lows are more likely to happen in the first half of the day. Which is the clue for me to guess the entry. Only problem is that QQQ is more volatile and less trendy than SPY.

TradeNow · June 26, 2023, 6:00pm

If your system is profitable then the Sharpe ratio is a rather meaningless metric, because it penalizes both the downside and, unfortunately, the upside volatility.

The Adjusted Sortino Ratio.is much better and will give you the real value of your system, try it.

ShimingHe · June 27, 2023, 3:44am

Thanks for the idea. Sortino is good, although in my particular case, it’s not that different from Sharpe considering the strategy is in fact performing poorer in QQQ compared to SPY.

AlgoSystems · June 29, 2023, 7:12pm

Never trust backtested results. I’ve seen hundreds (if not thousands) of systems blown up after they started to test it live claiming fabulous results.

If its never forward tested you will never know if it really works.

ShimingHe · June 30, 2023, 12:51am

I figured “blow ups” are mostly due to leverage. If some strategy seemingly provides stable returns at a very high leverage, there is usually some tail risk event causing it to blow up.

But I don’t agree that “walk forward” is the remedy here. In my experience, if I try hard enough, I can definitely overfit out-of-sample data. And life is short. You don’t have endless time for testing. You certainly won’t run an algo thousands of years like a Monte Carlo test. Any strategy could work better in a certain market condition than others. I think it’s a judgment call and everyone is entitled to their own opinion.

QuantTiger · June 30, 2023, 2:56am

One thing you can do is just do the past year as walk-forward, and see if the results are still good. Maybe use 6 years ago to 1 year ago as your data, and fiddle around with the model until it produces good results. The run the model from data from 1 year ago to now. Maybe also fit model to QQQ, and see how it performs on SPY.

ShimingHe · June 30, 2023, 3:06am

But how much confidence do I have for 1 year walk forward results? 2023 feels drastically different from 2022. I have a hunch that the best strats in 2022 would not work in 2023, and vice versa. I feel like a mediocre return for both symbols is less overfitting.

GardCap · June 30, 2023, 3:11am

I like to look at 1 year rolling returns, 1 year rolling profits pet trade and maximum number of weeks before system makes new highs to test for consistency and how likely I am to stick with it.

Fabi · June 30, 2023, 6:28am

Many years help to form many different market conditions. A backtest from 1999-2020 had to go through many different market phases. Then a forward test from 2021-today, so that a backtest can be validated well.

ShimingHe · June 30, 2023, 6:34am

Thank you for all your input. Although it seems that the only options available: backtest and walk forward are still pretty limited to give a full picture.

QuantTiger · June 30, 2023, 2:28pm

1 year walk forward test will reflect how someone who make the model on 6/30/2022 would have done the following year. “2023 feels drastically different from 2022” this is why people with good backtests fail when implementing system.

AlgoSystems · June 30, 2023, 3:35pm

Basically you have to ‘know’ when your system works well and when it doesnt work well. Why does it work on the spy and not the qqq for example.

Go thru the results and find out when the system is making good trades and when the system is making bad trades.

For example a moving average system works great in trending markets but works terribly in ranging markets or whipsaw markets.

Hope this helps.

Topic		Replies	Views
AI TQQQ SQQQ Swing - Rank #2 on C2 C2	12	1155	December 9, 2020
"TQQQ Daytrader": 15.1% YTD (hypothetical), ranked #4 in C2 Trading and Markets	5	597	May 1, 2022
11 Sharpe Mean Reversion Algo C2 Software Development	9	1162	September 2, 2017
Compare Tech Savvy vs Ai TQQQSQQQ C2	15	1084	February 12, 2022
Volatility reduced SPY Trading and Markets	5	514	April 24, 2019

A strategy that works on SPY, but not quite QQQ, overfitting?

Related topics