Optimize Amazon Comprehend Analysis Job use

I have an audio recording, and I want to extract custom entities from it using custom entity recognition. After some searching, I found that the Amazon Comprehend service can help with this.

To do this, I have to go through the following steps:

  1. Submit the file to the server and transcribe it with Amazon Transcribe. This takes about 15 to 20 seconds.
  2. Take the transcribed text and save it to an S3 bucket, because creating a Comprehend analysis job requires an input file.
  3. Create an analysis job with that input file.
  4. Wait until the job finishes. This is very slow (about 2 to 7 minutes).
  5. Once the job finishes, the output is a compressed file, output.tar.gz, not plain text.
  6. Download the archive to my local server, extract it, and read the content.
  7. Parse the file content as JSON.
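Steps 5 to 7 can probably be collapsed into one in-memory pass, so nothing has to be written to local disk. The sketch below uses only the Python standard library. It assumes that the archive contains one or more files of newline-delimited JSON, which is my understanding of the Comprehend job output but is worth checking against a real result. The function name `parse_comprehend_output` is hypothetical.

```python
import io
import json
import tarfile


def parse_comprehend_output(archive_bytes: bytes) -> list[dict]:
    """Parse every JSON line from every regular file in an in-memory .tar.gz.

    Assumption: each non-empty line in the archived file(s) is one JSON object.
    """
    records = []
    with tarfile.open(fileobj=io.BytesIO(archive_bytes), mode="r:gz") as tar:
        for member in tar:
            if not member.isfile():
                continue  # skip directories and other non-file entries
            handle = tar.extractfile(member)
            for raw_line in handle:
                line = raw_line.strip()
                if line:  # ignore blank lines
                    records.append(json.loads(line))
    return records
```

The bytes could come straight from S3, for example `boto3.client("s3").get_object(Bucket=..., Key=...)["Body"].read()`, with the bucket and key taken from the finished job's output location. This only trims the last few steps; it does not shorten the wait in step 4.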

The whole process takes about 5 to 15 minutes, and step 4 in particular takes most of that time. I want to optimize it as much as possible. Can you please help me?

  • Some questions to answer:

    1. Which Comprehend API are you planning on using?
    2. How large do you expect the input file to be? Depending on your answer to (1), you may be able to use the synchronous version of the API, which would reduce the total time spent in Comprehend.
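To illustrate the synchronous route mentioned in this comment: if the custom entity recognizer is deployed to a real-time endpoint, and the transcript fits within the synchronous request size limit, a single `detect_entities` call with `EndpointArn` could replace steps 2 to 7. This is a minimal sketch under those assumptions, not a confirmed fit for this workload. The helper name `detect_custom_entities` is hypothetical, the client is passed in so the function can be tried without AWS credentials, and a running endpoint is billed separately.

```python
def detect_custom_entities(client, text: str, endpoint_arn: str) -> list[dict]:
    """Call Comprehend synchronously and return a simplified entity list.

    `client` is expected to behave like boto3.client("comprehend").
    `endpoint_arn` must point to an endpoint serving the custom recognizer.
    """
    response = client.detect_entities(Text=text, EndpointArn=endpoint_arn)
    return [
        {
            "text": entity.get("Text"),
            "type": entity.get("Type"),
            "score": entity.get("Score"),
        }
        for entity in response.get("Entities", [])
    ]


# Usage (requires boto3, AWS credentials, and a deployed endpoint):
#   import boto3
#   comprehend = boto3.client("comprehend")
#   entities = detect_custom_entities(comprehend, transcript_text, endpoint_arn)
```

The result comes back in the response body as JSON, so there is no S3 upload, job polling, or archive extraction involved.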
Thomas
Asked 7 months ago · 101 views
No answers
