AsyncInferenceConfig takes different parameters

0

I would like to deploy an async endpoint in SageMaker. However when trying to deploy it I get the following error: ParamValidationError: Parameter validation failed: Unknown parameter in input: "AsyncInferenceConfig"

This is the code I tried for deploying the endpoint

from sagemaker.async_inference.async_inference_config import AsyncInferenceConfig

async_config = AsyncInferenceConfig(
    output_path="s3://poembucketus/async_inference/output",
    max_concurrent_invocations_per_instance=4,
)

async_predictor = huggingface_estimator.deploy(
    initial_instance_count=1,
    instance_type="ml.m5.xlarge",
    async_inference_config=async_config,
)

Thanks!

1개 답변
0

From the format of parameter, this error should be thrown from Boto3. If AsyncInferenceConfig is not recognized as a valid parameter, this may mean the version of boto3 is too old and released before the async feature.

Could you please confirm the version of boto3 and try updating it to the latest if possible ?

AWS
답변함 일 년 전
profile pictureAWS
전문가
Tasio
검토됨 일 년 전

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인

관련 콘텐츠