Deploy ML Timeseries models effectively

0

Hi team ! I need to deploy a ton of Machine Learning Models (Timeseries models) and I'm seeking a way that is effective.

In details, the problem is to build a platform capable of serving many time series with different frequencies from 5s to 10m (maybe beyond this, but that's it for the time being). The ML models of the system are different in terms of the ML framework. There are about 1000 ML models. ML models sizes are from 2MB to 2GB. In which, the most popular size range is 2GB. Then how should I design the serving model system to most effective with optimal cost?

질문됨 일 년 전588회 조회
1개 답변
1
수락된 답변

Hello Quan Dang !

The following link refers to SageMaker Model Deployment and Deployment Recommendation: https://docs.aws.amazon.com/sagemaker/latest/dg/deploy-model.html#deploy-model-options

For your problem, for each model, the processing time is not long, request payload is not large, and it’s kind of real-time latency requirement, and there are about 1000 deep learning models and each’s size is ~2GB. Therefore, we eliminate following options: async inference, serverless, batch transform and leaving only 1 option left : real-time inference. In Real-time inference, there are 4 options :

So, we narrow it down to only 3 options for deployment, you can create a survey about your ML Models deployment details (a statistics for the following information of each model : framework, inference latency, GPU usage type).

For those models are frequently accessed (inference latency <~60s) , you can choose “Host Single Model” ; Otherwise, for those aren’t frequently accessed, if they use the same ML framework, choose “Host multi models in 1 container behind 1 endpoint”, if they use different ML framework, choose “Host multi models which use different containers behind 1 endpoint”.

AWS
답변함 일 년 전
  • Great point ! Will do some experiments and let you know the result !

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인

관련 콘텐츠