- Newest
- Most votes
- Most comments
Currently, Amazon Connect does not provide a native WebSocket-style interface for real-time audio streaming that would allow easy connection to multimodal LLMs or other external systems in the way you describe.
Amazon Connect primarily handles audio in the following ways:
-
Live Media Streaming: This allows you to capture customer audio during interactions and send it to a Kinesis video stream. However, this is more for capturing and storing the audio rather than real-time processing.
-
Audio Recording: Connect can record conversations and store them in Amazon S3, but again, this is not for real-time processing.
-
Integration with Amazon Lex: As you mentioned, Connect can send audio to Lex for intent-based logic, but this has limitations in terms of latency and complexity of processing.
-
Contact Lens: This feature provides real-time and post-call analytics, but it's not designed for connecting to external LLMs or audio processing systems.
To achieve what you're looking for - a low-latency, real-time connection of audio streams to external systems like multimodal LLMs - you would likely need to use an external service or create a custom solution. This might involve using a service like AWS Chime or another programmable voice service as an intermediary between the caller and Amazon Connect.
Such an external service could act as a voice gateway, allowing you to manipulate and process the audio in real-time before forwarding it to Amazon Connect. This approach would let you integrate with more advanced audio processing systems or LLMs while still leveraging Amazon Connect's contact center capabilities.
However, this type of inline audio processing is not currently a native feature of Amazon Connect, and there hasn't been any public announcement about such a capability coming soon. If this functionality is crucial for your use case, you might need to explore hybrid solutions or alternative contact center platforms that offer more flexible audio stream handling.
Sources
Ability to listen an audio in Amazon Connect | AWS re:Post
Best approach to adding an INLINE audio processing block between a user call and AWS Connect Agent? | AWS re:Post
Data handled by Amazon Connect - Amazon Connect
Relevant content
- asked 2 years ago
- asked 7 months ago
- AWS OFFICIALUpdated a year ago