Provide Autopilot own data

0

Is there any way to provide your own (already labeled) data to sagemaker ground truth?
My plan is to use sagemaker autopilot with my own data that I have already labeled such that the folder structure represents the labels (rekognition provides such functionality).
The problem is that AutoPilot can only accept manifest files or csv,txt, ect. Well I have images with labels, so my only option is to use a manifest file. HOWEVER GroundTruth create manifest file option only accepts images from only 1 folder without any option to give it labels by default, therefore forcing me to repeat the labeling on an already labeled dataset.

Any idea how to use already labeled data as input???

lant
已提問 4 年前檢視次數 232 次
1 個回答
0

Hello... How are your images labeled? What format are the labels in?

If you have multiple folders each containing images, and the folder is the label then you could create a quick python (etc) script that will create a manifest file, using the folders as labels. You can then upload all the folders and the manifest file to S3 and your set.

Groundtruth is a labeling pipeline tool. Its designed to organize people with domain knowledge to label data. Also its pretty pricey so you definitely don't want to use it to re-label anything thats already labeled.

Hope that helps.

已回答 4 年前

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南