Comment by wenc
2 days ago
I wonder how this would perform on the M4 Makridakis competitions (time series competitions)
https://github.com/Mcompetitions/M4-methods
https://en.wikipedia.org/wiki/Makridakis_Competitions
Makridakis' conclusion remained true for many years: "statistically sophisticated and complex methods do not necessarily provide more accurate forecasts than simpler ones."
Maybe things have changed?
(side: Nixtla showed a simple ensemble outperforming Chronos, and the Chronos team responded, but there's some back and forth in the comments: https://www.linkedin.com/pulse/extended-comparison-chronos-a...)
This looks like a great benchmark! We've been thinking of doing a better and more detailed follow-up and this seems like the perfect dataset to do that with. Thanks!
When I worked in Demand prediction (multivariate), it was lgbm that was outperformong across the board.