SageMaker Ground Truth PDF annotation tool not rendering anything

Hello, I followed these docs, https://docs.aws.amazon.com/comprehend/latest/dg/cer-annotation-pdf.html, and got to the point where I have created the annotation task and uploaded several PDFs to an S3 bucket, so that I can create a Comprehend model. I added myself and a co-worker as annotators to verify that the task is set up properly, and I uploaded only 37 PDFs. However, when either of us logs in and starts the task, the webpage loads as the instructions describe, but no PDF is rendered (though I think I see it flash briefly on the screen before it goes blank). There are also no entities to select in the UI, unlike the tool shown in the documentation. I am trying to do named entity recognition and created this task once with the full 25 entities I want to label and once with only 5 entities. Both times the native PDF annotation feature showed the same problem.

2 Answers

Dear Customer,

Thank you for reaching out to us. I understand that you followed the Amazon Comprehend documentation for annotating PDFs in order to annotate your training PDFs in SageMaker Ground Truth, and that you and your co-worker are acting as private annotators to verify that the task works. However, when you log in to the annotation page in SageMaker Ground Truth, the webpage does not show any PDFs and the page is blank, so you are looking for guidance on resolving this issue.

Thank you for providing the details.

To help us assist you further, please create a support ticket with AWS. The links below explain how to create one:

https://docs.aws.amazon.com/awssupport/latest/user/case-management.html
https://console.aws.amazon.com/support/home#/case/create

When creating the support ticket, please provide the following information:

  1. Use-case description.
  2. Ground Truth job ARN.
  3. Screenshots of the issue you are facing.
  4. Log files for the Ground Truth job (logs from your labeling jobs appear in Amazon CloudWatch under the /aws/sagemaker/LabelingJobs log group).
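For item 4, the log messages could be collected with a short script before opening the ticket. The following is only a sketch: it assumes a boto3 CloudWatch Logs client is passed in, and the `stream_prefix` filter (for example, the labeling job name) is an assumption about how the streams are named, not something stated in this thread. The function name is hypothetical.

```python
LOG_GROUP = "/aws/sagemaker/LabelingJobs"  # log group named in the answer above


def collect_labeling_job_logs(logs_client, stream_prefix=None, max_events=200):
    """Return up to max_events log messages from the labeling-jobs log group.

    logs_client is expected to behave like boto3.client("logs").
    stream_prefix is optional and is an assumption about stream naming.
    """
    kwargs = {"logGroupName": LOG_GROUP}
    if stream_prefix:
        kwargs["logStreamNamePrefix"] = stream_prefix

    messages = []
    while len(messages) < max_events:
        page = logs_client.filter_log_events(**kwargs)
        messages.extend(event["message"] for event in page.get("events", []))
        token = page.get("nextToken")
        if not token:
            break  # no more pages
        kwargs["nextToken"] = token
    return messages[:max_events]
```

A possible call would be `collect_labeling_job_logs(boto3.client("logs"), stream_prefix="my-labeling-job")`, where `my-labeling-job` is a placeholder; the returned messages can then be saved to a file and attached to the ticket.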

We ask for this because it helps us understand your use case better. In addition, if we need to dig deeper and access the job through our internal tools, the support ticket gives us more visibility.

Rest assured, we will do everything we can to assist you with this issue.

Thanks.

AWS
answered 2 years ago

I found that when this tutorial is carried out on Windows, the labeling portal UI is blank after the job is created. The cause is in the S3 bucket that the tutorial creates: if you open comprehend-semi-structured-docs-ui-template/ and then the folder for your job, you will see that the S3 folder names are malformed. Instead of '/', which represents a folder level, the names use '\', so the request fails. Renaming or rearranging the S3 folders so that they use '/' solves the issue.
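If the affected object keys do contain Windows-style backslashes (an assumption based on the description above), the renaming could be scripted rather than done by hand. The sketch below is not part of the tutorial: the function names are hypothetical, the client is expected to behave like `boto3.client("s3")`, and because S3 has no rename operation it copies each object to the corrected key and then deletes the original.

```python
def normalize_key(key):
    """Replace Windows-style backslashes with the '/' separator S3 consoles expect."""
    return key.replace("\\", "/")


def plan_renames(keys):
    """Return (old_key, new_key) pairs for keys that contain a backslash."""
    return [(key, normalize_key(key)) for key in keys if "\\" in key]


def fix_backslash_keys(s3_client, bucket, prefix):
    """Copy each malformed object to its corrected key, then delete the original.

    s3_client is expected to behave like boto3.client("s3").
    Returns the list of (old_key, new_key) pairs that were processed.
    """
    paginator = s3_client.get_paginator("list_objects_v2")
    keys = [
        obj["Key"]
        for page in paginator.paginate(Bucket=bucket, Prefix=prefix)
        for obj in page.get("Contents", [])
    ]
    renames = plan_renames(keys)
    for old_key, new_key in renames:
        s3_client.copy_object(
            Bucket=bucket,
            Key=new_key,
            CopySource={"Bucket": bucket, "Key": old_key},
        )
        s3_client.delete_object(Bucket=bucket, Key=old_key)
    return renames
```

It may be worth calling `plan_renames` on the listed keys first and reviewing the pairs before running `fix_backslash_keys`. A call might look like `fix_backslash_keys(boto3.client("s3"), "my-bucket", "comprehend-semi-structured-docs-ui-template")`, where `my-bucket` is a placeholder for your own bucket name.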

answered 10 months ago
