Skip to content

Articles tagged with Amazon SageMaker Deployment

Amazon SageMaker provides a broad selection of machine learning (ML) infrastructure and model deployment options to help meet your needs, whether real time or batch. Once you deploy a model, SageMaker creates persistent endpoints to integrate into your applications to make ML predictions (also known as inference). It supports the entire spectrum of inference, from low latency (a few milliseconds) and high throughput (hundreds of thousands of inference requests per second) to long-running inference for use cases such as natural language processing (NLP) and computer vision (CV). Whether you bring your own models and containers or use those provided by AWS, you can implement MLOps best practices using SageMaker to reduce the operational burden of managing ML models at scale.

Content language: English

Filter articles
Select tags to filter
Sort by
Sort by most recent

Browse through articles or filter your results using the tools displayed.

2 results
I want to use a SageMaker Inference Toolkit to create a inference docker image. I want to use this inference docker image to deploy a SageMaker endpoint using Bring Your Own Container (BYOC).
This article helps you understand and resolve the SageMaker AI endpoint deployment error that occurs when the requested instance type isn’t available in enough Availability Zones overlapping with your...
  • 1
  • Page size
    12 / page