- 最新
- 最多得票
- 最多評論
Hi,
Thank you for using AWS Sagemaker.
I understand that you have a data monitoring job configured to monitor data captured by an endpoint. However, the monitoring job throws the following error:
"There are missing columns in current dataset. Number of columns in current dataset: 1, Number of columns in baseline constraints: x"
This can happen if in your 'data-capture.jsonl' file, in the outputs, there is a '\n' character for every outputs.
Basically what you need to do here is to remove all the '\n' character from your outputs in the 'data-capture.jsonl' file for the model monitor to work.
There are 2 ways to fix this:
-
As the '\n' character is most likely added during the inference where the inference results are generated, you can modify your inference script so that all the outputs won't have a '\n' after it.
-
Add a preprocessing[1] script for your Model Monitor, this script aims to remove all the '\n' character from your captured data after inference.
You can also refer the sample Model Monitor notebook[2] , modify as per your use case and test at your end.
You can also access this notebook sample from a notebook instance under the SageMaker Examples -> Sagemaker Model Monitor -> 'SageMaker-ModelMonitoring.ipynb'.
To further understand the issue more in depth as I have limited visibility on your setup, I'd recommend you to reachout to AWS Support by creating a support case[+] so that an engineer can investigate further and help you overcome the issue. You can also share the notebook file, scripts,dataset etc which you used , so that engineer can replicate at their end and provide you with better assistance as per your use case.
Reference: ——————
[+]Open a support case with AWS using the link: https://console.aws.amazon.com/support/home?#/case/create [+]https://aws.amazon.com/premiumsupport/faqs/
相關內容
- 已提問 1 年前
- AWS 官方已更新 1 年前
- AWS 官方已更新 2 年前