Delete Images in Custom Labels Datasets

0

Hello!

Is it possible to delete images that have been uploaded to a custom labels dataset?

If I delete all label annotations from an image will it be removed from training or will it still be used in the training process?

Thanks for the help!

Matt

asked 4 years ago564 views
7 Answers
1

Is there a reason removing images is not supported? Removing annotations and leaving unused images in datasets creates noise when managing large datasets. Hoping this feature will be added soon.

answered 3 years ago
0

Hey Matt

If you remove all label annotations from an image, it will be ignored during the training.

Note that your training might fail in some cases if removing label annotations caused validation rules to fail.
E.g. if you have only 2 labels and you removed all annotations corresponding to 1 label, your training will fail since at least 2 labels are required.

Validation list can be found here (see section "Semantics") - https://docs.aws.amazon.com/rekognition/latest/customlabels-dg/cd-manifest-files-validation-rules.html

Hope this helps!

AWS
answered 4 years ago
0

Thank you, that helps a lot!

If I wanted to remove images that I mistakenly uploaded from a dataset so my model is not trained on them, would you suggest just removing all annotations as they are then essentially ignored from training?

Appreciate the help!

Matt

answered 4 years ago
0

Matt

Yes you can remove the annotations to avoid them being used for training.

If you have access to the manifest file, you can manually delete the offending rows.

AWS
answered 4 years ago
0

Hey grajamey,
Thank you for your feedback. Deleting images from datasets is on our roadmap. I have also passed on your feedback to the product team to help in the prioritization of this feature.

answered 3 years ago
0

Hi,

I am unable to delete images from Amazon Rekognition dataset.
Is there any option to do it myself or would you be able to delete it?

Thanks,
Prathyusha

answered 3 years ago
0

Hi Prathyusha,

Custom labels dataset is managed by a dataset manifest file. If you have access to the manifest file, you can manually delete the offending rows. After deletion you can create a new dataset by importing the updated manifest file.
The manifest file can be accessed through a bucket which name starts with custom-labels-console-*. Within the bucket, you can locate the file with key datasets/<data set name>/manifests/output/output.manifest.
We have an item on our roadmap to support deleting images from dataset and I have also passed on your feedback to the team.

Thanks

AWS
answered 3 years ago

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.

Guidelines for Answering Questions