Using Amazon EBS for Data Lake

Hi, I'd like to know whether it's possible to use Amazon EBS to store data (CSV, JSON, ...) for a data lake, access it for processing, and then store it in S3.

posix
Asked 2 years ago, 318 views
1 Answer
Accepted Answer

Hi,

While you could use EBS to implement parts of a data lake architecture (volumes for data analytics EC2 instances, for example), it is not a good choice for storing the actual data in a data lake. EBS volumes can typically be attached to only one instance at a time (Multi-Attach is the exception, and it requires a cluster-aware file system), so they should not be used for sharing data. Data on them is also not accessible to Glue, Athena, or OpenSearch. A better option is to store the shared data in S3, which is a key aspect of data lake design.
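
If the CSV/JSON files already sit on an EBS volume mounted on an EC2 instance, a small script can copy them into S3. The following is a minimal sketch under stated assumptions, not an official AWS procedure: the mount path `/mnt/data`, the bucket name `my-data-lake-bucket`, and the `raw` prefix are placeholders, and `plan_uploads` and `upload_all` are hypothetical helper names. The only AWS call it relies on is the boto3 S3 client's `upload_file(Filename, Bucket, Key)`.

```python
from pathlib import Path

# File types treated as data lake input (taken from the question: csv, json).
DATA_SUFFIXES = {".csv", ".json"}


def plan_uploads(local_root, prefix):
    """Map each data file under local_root to an S3 object key.

    local_root is the directory where the EBS volume is mounted (assumption).
    Returns a list of (local_path, s3_key) tuples.
    """
    root = Path(local_root)
    prefix = prefix.strip("/")
    plan = []
    for path in sorted(root.rglob("*")):
        if path.is_file() and path.suffix.lower() in DATA_SUFFIXES:
            relative = path.relative_to(root).as_posix()
            key = f"{prefix}/{relative}" if prefix else relative
            plan.append((path, key))
    return plan


def upload_all(s3_client, bucket, plan):
    """Upload every planned file; s3_client is expected to be a boto3 S3 client."""
    for path, key in plan:
        s3_client.upload_file(str(path), bucket, key)
    return len(plan)


# Example usage on the EC2 instance (requires boto3 and AWS credentials):
#   import boto3
#   plan = plan_uploads("/mnt/data", "raw")
#   upload_all(boto3.client("s3"), "my-data-lake-bucket", plan)
```

Once the files are in S3, the EBS volume only needs to serve as working storage for the instance that processes them.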

kdavyd (AWS)
Answered 2 years ago
Reviewed by an AWS Expert 2 years ago
  • Okay, I understand. Thank you.
