Redshift as as Data Source for a REST API for Analytics Query

0

i have a customer who is considering using Redshift as Data Source for a REST API that will give external entities access to analytical queries over a "huge" (let's assume 100s ot TBs) data set. The API will be used by 3rd parties, and we should expect some degree of unpredictability on the workload with peaks and dips. The REST API will support only read operations.

The main questions I have are

  • What are the best practices for this type of Redshift use case?
  • Do we have any customer reference who has used Redshift this way?
  • Is Redshift the best service for this use case? At which point you would recommend Athena, Aurora, or ES?

Performance is their key priority and they want to minimise query latency to less than 10 seconds per API call.

AWS
Manos_S
已提问 5 年前595 查看次数
1 回答
0
已接受的回答

It depends on the query pattern and the SLA they want to offer to this API:

  • DynamoDB storing pre-calculated metrics is a good way to provide fast response time and high availability but the update logic is tricky to implement
  • Elasticsearch can support updates and live aggregations with fast response time and high availability but simple metrics and it's easy to overload an Elasticsearch cluster
  • Redshift/athena can be an option if query customization needs to be offered but isn't highly available and unpredictable queries are dangerous for concurrency. Compared to Athena, it can provides faster response time, SLA on query execution (no risk of contention on Athena clusters).
AWS
已回答 5 年前

您未登录。 登录 发布回答。

一个好的回答可以清楚地解答问题和提供建设性反馈,并能促进提问者的职业发展。

回答问题的准则