Which AWS service can be used to recognize animal sounds and convert them to text?

I need to know which service can recognize a specific animal sound and convert it into text, and which validations can be applied so that the resulting word can be used.

  • Can you elaborate on your question? Are you asking whether a dog's bark should be transcribed as "woof" or "bark"?

  • It can be any animal sound, e.g. a cat's "Meow", produced by a human either saying the word normally or mimicking a cat. Which service can capture that sound and return a result such as "Meow"?

Asked 1 year ago, viewed 389 times
1 Answer

Hi,

You can look into using Amazon Rekognition or Amazon SageMaker to try to detect animal audio events and create a transcription of animal sounds.

Rekognition

You can use Rekognition to identify sounds by converting the audio to a visual format.

The blog post linked below uses the example of detecting a fire alarm, which you may be able to use as a starting point for detecting animals. This solution has the limitation that it does not transcribe the audio.

Because Rekognition normally works with images, video, and text, you may need to convert your audio data into a visual representation such as a spectrogram.
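As a rough illustration of the audio-to-spectrogram step, here is a minimal sketch in pure Python. It is not taken from the blog post: the function names `synth_tone` and `spectrogram`, the frame size, and the hop length are all assumptions chosen for illustration, and a synthetic sine tone stands in for a real recording. In practice you would more likely use a library such as SciPy or librosa and save the result as an image.

```python
import cmath
import math


def synth_tone(freq_hz, sample_rate=8000, duration_s=0.25):
    """Generate a sine tone as a stand-in for a recorded animal sound."""
    n = int(sample_rate * duration_s)
    return [math.sin(2 * math.pi * freq_hz * t / sample_rate) for t in range(n)]


def spectrogram(samples, frame_size=256, hop=128):
    """Return a list of frames; each frame holds magnitudes for bins 0..frame_size//2."""
    # A Hann window reduces spectral leakage at the frame edges.
    window = [
        0.5 - 0.5 * math.cos(2 * math.pi * i / (frame_size - 1))
        for i in range(frame_size)
    ]
    frames = []
    for start in range(0, len(samples) - frame_size + 1, hop):
        frame = [samples[start + i] * window[i] for i in range(frame_size)]
        magnitudes = []
        for k in range(frame_size // 2 + 1):
            # Naive discrete Fourier transform for bin k (slow, but dependency-free).
            acc = sum(
                frame[i] * cmath.exp(-2j * math.pi * k * i / frame_size)
                for i in range(frame_size)
            )
            magnitudes.append(abs(acc))
        frames.append(magnitudes)
    return frames


if __name__ == "__main__":
    spec = spectrogram(synth_tone(1000))
    loudest_bin = max(range(len(spec[0])), key=lambda k: spec[0][k])
    print(len(spec), "frames; loudest bin in first frame:", loudest_bin)
```

Each row of the result is one time slice and each column one frequency bin, so plotting it as a grayscale or color image gives the kind of picture an image classifier can be trained on.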

The blog post covers assembling training and test data, looking for discrete sounds in a spectrogram, and an end-to-end architecture for sound detection.

Detect audio events with Amazon Rekognition - https://aws.amazon.com/blogs/machine-learning/detect-audio-events-with-amazon-rekognition/

What is Amazon Rekognition? - https://docs.aws.amazon.com/rekognition/latest/dg/what-is.html
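Once you have a spectrogram image, classifying it could look roughly like the sketch below. It assumes you have already trained and started a Rekognition Custom Labels model on spectrogram images; the helper name `top_sound_label`, the label names, and the ARN are hypothetical. The client is passed in as a parameter so the function can be tried without AWS credentials.

```python
def top_sound_label(client, model_arn, image_bytes, min_confidence=70.0):
    """Return (label_name, confidence) for the best match, or None if nothing matched.

    `client` is expected to behave like boto3.client("rekognition").
    `model_arn` is the ARN of a running Custom Labels model version (hypothetical here).
    """
    response = client.detect_custom_labels(
        ProjectVersionArn=model_arn,
        Image={"Bytes": image_bytes},
        MinConfidence=min_confidence,
    )
    labels = response.get("CustomLabels", [])
    if not labels:
        return None
    best = max(labels, key=lambda label: label["Confidence"])
    return best["Name"], best["Confidence"]


# Example usage against a real account (requires boto3 and a running model):
#   import boto3
#   client = boto3.client("rekognition")
#   with open("spectrogram.png", "rb") as f:
#       print(top_sound_label(client, "<your model version ARN>", f.read()))
```

Note that this only returns a class label such as a hypothetical `cat_meow`; it does not transcribe the audio.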

Sagemaker

You can look into using SageMaker to create a machine learning model that detects animals. In addition, you could create a model that processes the audio to produce a text interpretation of the sound. This second part may run into limits when converting animal sounds into a phonetic transcription.
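If a full phonetic transcription turns out to be impractical, one simpler option is to map the classifier's predicted label to a fixed word and only accept it above a confidence threshold. The sketch below is an assumption about how that validation step might look, not part of any AWS service: the `SOUND_WORDS` table, the `label_to_text` helper, and the 0.8 threshold are all hypothetical.

```python
# Hypothetical mapping from classifier labels to the word you want returned.
SOUND_WORDS = {"cat": "Meow", "dog": "Woof", "cow": "Moo"}


def label_to_text(scores, min_confidence=0.8):
    """Turn per-label scores (label -> probability) into a word, or None.

    Returns None when there are no scores, when the best score is below
    the threshold, or when the best label has no word in SOUND_WORDS.
    """
    if not scores:
        return None
    label, confidence = max(scores.items(), key=lambda item: item[1])
    if confidence < min_confidence:
        return None
    return SOUND_WORDS.get(label)


if __name__ == "__main__":
    print(label_to_text({"cat": 0.93, "dog": 0.05}))
    print(label_to_text({"cat": 0.50, "dog": 0.40}))
```

Returning None for low-confidence or unknown labels lets the calling code decide whether to reject the input or ask the user to try again.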

What Is Amazon SageMaker? - https://docs.aws.amazon.com/sagemaker/latest/dg/whatis.html

Below is an example of audio classification using SageMaker. It steps through how to set up a SageMaker training job, an inference endpoint, and custom container images.

https://github.com/aws-samples/amazon-sagemaker-audio-classification-pytorch
AWS
Chris_T
Answered 1 year ago
