Redshift questions from AWS Customer

A customer is using S3, Lambda, and Redshift. They have questions about what to do with the data once it is loaded into Redshift (in a star/snowflake schema).

The customer is trying to decide how to approach reporting from the warehouse.

Here are some questions:

  1. Is there an equivalent to cubes in Redshift?
  2. Is there a better approach to moving data into Redshift (compared to our approach)? This is all the customer has given me so far on this question.
  3. Redshift queries seem slow when returning data, but loading data and returning aggregated data are fast. Is there a way to address the performance we are seeing?

Thank you.

AWS
Asked 5 years ago, viewed 256 times
1 answer
Accepted answer
  1. Amazon Redshift does not natively provide cube/slice/dice capabilities. These are normally provided by software products that sit "on top" of a database. For example, MicroStrategy allows you to build ROLAP cubes on top of data marts sourced from Redshift tables.

  2. Can you specify what "our approach" is? The best way to move data into an OLAP RDBMS is in bulk, to maximize parallel ingestion. Redshift provides the COPY command for bulk loading. The data is normally staged in an S3 bucket before the COPY command runs. ETL tools may hide this step, but behind the scenes it still happens.

  3. Query performance in an MPP solution like Redshift depends mainly on how the tables containing the data are distributed across the cluster and how the data in them is sorted. Distribution and sorting are simple mechanisms, but we have also added some automated features that may let developers worry less about them.
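
To make points 2 and 3 concrete, here is a minimal Python sketch that only builds the SQL text for a table with explicit distribution and sort keys and for a bulk COPY from S3. It does not connect to a cluster. The table name `sales_fact`, its columns, the bucket prefix, and the IAM role ARN are hypothetical placeholders, and the choice of `customer_id` and `sale_date` as keys is an assumption for illustration, not a recommendation for the customer's schema.

```python
def build_create_table(table, columns, dist_key, sort_keys):
    """Return Redshift DDL with an explicit distribution key and sort key."""
    unknown = [key for key in [dist_key, *sort_keys] if key not in columns]
    if unknown:
        raise ValueError(f"keys not defined as columns: {unknown}")
    column_sql = ",\n    ".join(
        f"{name} {sql_type}" for name, sql_type in columns.items()
    )
    return (
        f"CREATE TABLE {table} (\n    {column_sql}\n)\n"
        f"DISTSTYLE KEY\n"
        f"DISTKEY ({dist_key})\n"
        f"SORTKEY ({', '.join(sort_keys)});"
    )


def build_copy_statement(table, s3_prefix, iam_role_arn):
    """Return a COPY statement that loads every gzipped CSV file under an S3 prefix."""
    return (
        f"COPY {table}\n"
        f"FROM '{s3_prefix}'\n"
        f"IAM_ROLE '{iam_role_arn}'\n"
        f"FORMAT AS CSV\n"
        f"GZIP;"
    )


# Hypothetical fact table from a star schema (names are placeholders).
ddl = build_create_table(
    table="sales_fact",
    columns={
        "sale_id": "BIGINT",
        "customer_id": "BIGINT",
        "sale_date": "DATE",
        "amount": "DECIMAL(12,2)",
    },
    dist_key="customer_id",                 # assumed common join column
    sort_keys=["sale_date", "customer_id"],  # assumed common filter columns
)

# Hypothetical S3 prefix and role ARN; COPY loads all objects matching the prefix.
copy_sql = build_copy_statement(
    table="sales_fact",
    s3_prefix="s3://example-bucket/sales/2024-01-01/part-",
    iam_role_arn="arn:aws:iam::123456789012:role/ExampleRedshiftCopyRole",
)

print(ddl)
print(copy_sql)
```

Pointing COPY at a prefix that matches several files, rather than one large file, lets the load be spread across the cluster, which is the "bulk" parallel ingestion described in point 2. The generated statements would still need to be run through whatever SQL client the customer already uses.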

AWS
Answered 5 years ago
