It is possible to query, and for example join, data from the Redshift cluster and S3 in the same query. Redshift Spectrum external tables allow you to query the data in S3. Some points to consider when querying with Redshift Spectrum:
- Querying the data in place can save cost. The larger and more infrequently accessed your data set in S3 is, the more cost-efficient Spectrum becomes.
- Flexibility. Querying the data in place also means that the data in S3 stays easily accessible to other applications, such as ML or big data processing with EMR, without the need to integrate with a DW. That leaves the DW to do what it is supposed to do, i.e. reporting.
- On the downside, costs are less predictable, because Spectrum queries are billed over and above the Redshift cluster, currently at $5 per TB scanned.
- Spectrum might also be slower than other solutions, e.g. Athena or Redshift itself.
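
To make the cost point a bit more concrete, here is a rough Python sketch of the arithmetic. It assumes the flat $5 per TB scanned rate mentioned above, treats a TB as 1024^4 bytes, and ignores any per-query minimums or rounding that AWS may apply. The function names are made up for illustration, so check the current pricing page before relying on the numbers.

```python
# Rough estimate of Redshift Spectrum scan charges.
# Assumptions (not from AWS documentation): flat rate, binary terabytes,
# no per-query minimum charge and no rounding.
PRICE_PER_TB_USD = 5.0      # rate quoted in the answer above
BYTES_PER_TB = 1024 ** 4    # assumption: 1 TB = 1024^4 bytes


def spectrum_scan_cost(bytes_scanned: int, price_per_tb: float = PRICE_PER_TB_USD) -> float:
    """Return the estimated charge in USD for one query scanning `bytes_scanned` bytes."""
    if bytes_scanned < 0:
        raise ValueError("bytes_scanned must be non-negative")
    return bytes_scanned / BYTES_PER_TB * price_per_tb


def monthly_spectrum_cost(bytes_per_query: int, queries_per_month: int) -> float:
    """Return the estimated monthly charge if every query scans the same amount of data."""
    if queries_per_month < 0:
        raise ValueError("queries_per_month must be non-negative")
    return spectrum_scan_cost(bytes_per_query) * queries_per_month


# Example: 10 queries per month, each scanning half a terabyte.
print(f"{monthly_spectrum_cost(BYTES_PER_TB // 2, 10):.2f} USD")
```

Because the charge scales with bytes scanned rather than with cluster size, anything that reduces the scan (partitioning, columnar formats such as Parquet) lowers this number directly, while the cluster cost stays the same.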
Hope that helps!