Sagemaker Debugger and forecasting

0

How to detect overfitting with Amazon Forecast. Does SageMaker Debugger work with Amazon Forecast?

专家
已提问 3 年前322 查看次数
1 回答
0
已接受的回答

Answer #1: Based on the same technology used for time-series forecasting at Amazon.com, Forecast provides state-of-the-art algorithms to predict future time-series data based on historical data, and requires no machine learning experience. Amazon SageMaker Debugger profiles and debugs your training jobs to improve the performance of machine learning models on compute resource utilization and model predictions. So Debugger is available for your SageMaker training jobs but not for Amazon Forecast which is a fully managed service of its own.

Answer #2 - The way to detect overfitting with forecasting is using "backtests", which are the "train/valid" equivalent of traditional machine learning. The reason Backtesting is used in forecasting is to keep the time order of the train/valid data. With careful analysis of backtest windows, you can have more confidence the model will generalize well with unseen data.

Amazon Forecast now supports export of backtest forecasts as .csv files. Customer can take the backtest forecasts and calculate whatever metric they want and/or visualize.

One common fear is backtesting will increase overfitting - it does not since each validation dataset is different from the previous backtest validation dataset. To have more certainty about robustness of the model, use more than 1 backtest window. If the Forecast Horizon is long, you may get by with just 1-2 backtest windows; otherwise use the maximum which is 5 backtest windows.

Another concern is the increased training time since each backtest window is in fact another train/forecast iteration. It's a trade-off, if you want more model validation or not.

Ideally, the wQL's of each backtest window are similar. If you see all backtest forecasts with similar metrics (except maybe a few windows where underlying data had anomalies), that is good sign your model is not overfit.

Both under-fitting and over-fitting for now are left to human judgement to examine backtest forecasts and decide whether model is acceptable or not.

已回答 3 年前

您未登录。 登录 发布回答。

一个好的回答可以清楚地解答问题和提供建设性反馈,并能促进提问者的职业发展。

回答问题的准则