Why are my Amazon Forecast results based on location inaccurate?

We get excellent results when we run separate Amazon Forecast analyses on two different data sets, one per city. However, when we merge the two data sets and add the location as a new dimension in the target data set, the results are no longer accurate. What's the reason for this inaccuracy? We added the location as categorical (string) data. Should we add the location as one-hot encoded data instead?

AWS
asked 4 years ago, 234 views
1 Answer
Accepted Answer

The inaccurate results are most likely explained by how the models are built. When you process the data for each city individually, Amazon Forecast builds one model for the first city, another model for the second city, and so on. When you process the merged data, Amazon Forecast builds a single model that tries to minimize the forecasting error across both cities. If the merged data sets are very different, the result can be an "average" model that fits neither city well. This explanation is oversimplified. However, understanding the statistical distribution of your data is critical for deciding whether a global model can be applied to all cities or whether individual models are needed to get proper predictions.

This is an instance of the familiar debate between "one global model for every situation" and "N specific models, one for each situation". The issue doesn't have a one-size-fits-all solution.
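
To make the "average" model effect concrete, here is a minimal toy sketch in plain Python. It does not use Amazon Forecast and does not reproduce its algorithms. It assumes the simplest possible model (a constant forecast equal to the mean), and the two city series are made-up numbers chosen only to have very different levels. The pooled model here ignores location completely, which is a worst case. A real global model that receives location as a feature may do considerably better.

```python
from statistics import mean


def fit_constant(series):
    """Toy 'model': the constant that minimizes squared error, i.e. the mean."""
    return mean(series)


def mae(series, prediction):
    """Mean absolute error of a constant prediction against a series."""
    return mean(abs(y - prediction) for y in series)


# Hypothetical demand histories for two cities with very different levels.
city_a = [100, 102, 98, 101, 99]
city_b = [10, 12, 8, 11, 9]

# Option 1: one model per city.
model_a = fit_constant(city_a)
model_b = fit_constant(city_b)

# Option 2: a single pooled model fitted on the merged data,
# with no information about which city a point belongs to.
pooled = fit_constant(city_a + city_b)

per_city_error = (mae(city_a, model_a) + mae(city_b, model_b)) / 2
pooled_error = (mae(city_a, pooled) + mae(city_b, pooled)) / 2

print("per-city models:", model_a, model_b)
print("pooled model:   ", pooled)
print("per-city MAE:   ", per_city_error)
print("pooled MAE:     ", pooled_error)
```

The pooled model lands between the two cities and is far from both. A similar per-city comparison of level and spread on your own data is a quick way to judge how different the series are before deciding between a global model and individual ones.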

AWS
answered 4 years ago
