Data quality check violation report: error in data type check


While running a data quality check between the baseline and the current run data, I get a data type violation. The column has a string data type, but a few of its values are numbers stored as strings. The data quality check treats those values as integers and reports the violation "Expected data type: String, Expected match: 100.0%. Observed: Only 73.44% of data is String." How can I resolve this issue?

Asked 8 months ago, 219 views
1 Answer
Accepted Answer

Hi,

Thank you for using Amazon SageMaker.

I understand that when Amazon SageMaker Model Monitor compared the current run against the baseline during the data quality check, it detected the following data type constraint violation:

"Expected data type: String, Expected match: 100.0%. Observed: Only 73.44% of data is String."

What you are experiencing is known as data drift, which can occur when the serving data has shifted away from the data distribution that was used for training.
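One way to check whether the column's mix of values actually changed is to measure, for both the baseline and the current data, what share of values would not parse as a number. The sketch below is only an approximation: it assumes Model Monitor counts a numeric-looking string such as "1045" as a number rather than a String, which is what the report in the question suggests. The helper names and the sample values are hypothetical.

```python
def looks_numeric(value: str) -> bool:
    """Return True if the text can be parsed as a number."""
    try:
        float(value.strip())
        return True
    except ValueError:
        return False


def percent_string(values) -> float:
    """Percentage of values that do NOT parse as a number."""
    values = list(values)
    if not values:
        return 0.0
    non_numeric = sum(1 for v in values if not looks_numeric(str(v)))
    return 100.0 * non_numeric / len(values)


# Hypothetical sample values standing in for one column of each dataset.
baseline_column = ["A12", "B07", "C33", "D90"]
current_column = ["A12", "1045", "C33", "2210"]

print("baseline:", percent_string(baseline_column))
print("current: ", percent_string(current_column))
```

If the two percentages differ noticeably, the current data contains more numeric-looking values than the baseline did, which would be consistent with the reported violation.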

Any violations detected relative to the baseline are written to the violations report, and you can list them with the following code:

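import pandas as pd

# my_default_monitor is the model monitor object already set up for the endpoint
violations = my_default_monitor.latest_monitoring_constraint_violations()
pd.set_option("display.max_colwidth", None)  # show full violation descriptions (-1 is deprecated)
# pd.json_normalize replaces the deprecated pd.io.json.json_normalize
constraints_df = pd.json_normalize(violations.body_dict["violations"])
constraints_df.head(10)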
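To focus on the data type violations only, the entries in the report can be filtered by check type. This is a minimal sketch, assuming the report follows the documented layout in which each violation has `feature_name`, `constraint_check_type`, and `description` fields. The sample dictionary stands in for `violations.body_dict`, and the column names in it are hypothetical.

```python
def data_type_violations(report: dict) -> list:
    """Return only the violations raised by the data type check."""
    return [
        v
        for v in report.get("violations", [])
        if v.get("constraint_check_type") == "data_type_check"
    ]


# Hypothetical stand-in for violations.body_dict; column names are made up.
sample_report = {
    "violations": [
        {
            "feature_name": "product_code",
            "constraint_check_type": "data_type_check",
            "description": (
                "Expected data type: String, Expected match: 100.0%. "
                "Observed: Only 73.44% of data is String."
            ),
        },
        {
            "feature_name": "price",
            "constraint_check_type": "baseline_drift_check",
            "description": "Example drift entry (hypothetical).",
        },
    ]
}

for violation in data_type_violations(sample_report):
    print(violation["feature_name"], "->", violation["description"])
```

With a real report, pass `violations.body_dict` instead of `sample_report`.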

Here is an open-source notebook which demonstrates this as well:

See 'Violations report' section here - https://github.com/aws-samples/amazon-sagemaker-immersion-day/blob/master/SageMaker-ModelMonitoring.ipynb

This information will help you identify which columns to check when troubleshooting the violations.

Hope this information helps.

Because I have limited visibility into your setup, I recommend reaching out to AWS Support by creating a support case [2] so that an engineer can investigate further and help you resolve the issue.

References:

[1] Violations Report - https://docs.aws.amazon.com/sagemaker/latest/dg/model-monitor-interpreting-results.html#w570aac27c18c20c31c11

[2] Open a support case with AWS - https://console.aws.amazon.com/support/home?#/case/create

AWS
Answered 8 months ago
