Based on the information provided, it appears you're encountering an issue with overlapping transcriptions when processing a long video in segments using AWS services. However, the sources don't contain specific information about a "BDA API" with a 120-minute limitation or this particular overlapping transcription issue.
For processing long videos in AWS, you might consider these approaches:
- When segmenting videos, ensure that your segments have clear, non-overlapping boundaries. For example, if your first segment ends at 120:00, make sure your second segment starts exactly at 120:00, so that nothing is transcribed twice or skipped.
- Consider using Amazon Rekognition's segmentation capabilities to partition your video into distinct segments. This would help ensure that scene changes are identified and processed separately.
- After processing individual segments, you may need to implement custom post-processing to merge the results and remove any duplicated content. This could involve checking for overlapping timestamps and merging or trimming content as needed.
- For transcript generation specifically, you might explore using Amazon Transcribe, which has capabilities for handling longer content and includes features like speaker partitioning that might help with organizing your transcription output.
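The boundary and post-processing suggestions above can be sketched in plain Python. This is only an illustration under stated assumptions: the `Cue` type, `segment_bounds`, and `merge_segments` are hypothetical names, not part of any AWS SDK, and the sketch assumes each segment's transcript carries timestamps in seconds relative to the start of that segment. The rule for handling overlaps (extend on identical text, drop fully covered cues, trim the rest) is one possible choice, not a prescribed method.

```python
from dataclasses import dataclass


@dataclass
class Cue:
    """One transcript line (hypothetical shape; times are in seconds)."""
    start: float
    end: float
    text: str


def segment_bounds(total_s: float, max_len_s: float) -> list[tuple[float, float]]:
    """Split [0, total_s) into back-to-back ranges with no overlap and no gap."""
    bounds = []
    start = 0.0
    while start < total_s:
        end = min(start + max_len_s, total_s)
        bounds.append((start, end))
        start = end  # the next segment starts exactly where this one ends
    return bounds


def merge_segments(segments: list[tuple[float, list[Cue]]]) -> list[Cue]:
    """Merge per-segment transcripts onto one timeline.

    `segments` holds (offset_of_segment_in_full_video, cues) pairs, where
    cue times are relative to the start of their own segment.
    """
    # Shift every cue onto the full-video timeline, then order by time.
    absolute = [
        Cue(cue.start + offset, cue.end + offset, cue.text)
        for offset, cues in segments
        for cue in cues
    ]
    absolute.sort(key=lambda c: (c.start, c.end))

    merged: list[Cue] = []
    for cue in absolute:
        last = merged[-1] if merged else None
        if last is None or cue.start >= last.end:
            merged.append(cue)  # no overlap with what we already kept
        elif cue.text.strip().lower() == last.text.strip().lower():
            last.end = max(last.end, cue.end)  # same line seen twice: keep one
        elif cue.end > last.end:
            # Assumption: on partial overlap, trim the later cue's start.
            merged.append(Cue(last.end, cue.end, cue.text))
        # Otherwise the cue is fully covered by the previous one and is dropped.
    return merged
```

For example, `segment_bounds(300 * 60, 120 * 60)` yields three back-to-back ranges for a 300-minute video, and `merge_segments` can then be applied to the per-segment results even if the segments were cut with some overlap. Fuzzy text matching would likely be needed in practice, since the same speech transcribed in two segments may not produce identical text.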
Without more specific information about the "BDA API" you're referring to, I can't provide targeted guidance for that particular service's limitations.
Sources
Exploring accessible audio descriptions with Amazon Nova | Artificial Intelligence
Media2Cloud on AWS Guidance: Scene and ad-break detection and contextual understanding for advertising using generative AI | AWS for M&E Blog
Enabling speaker partitioning - Amazon Transcribe
