Using Amazon EBS for Data Lake

0

Hi, I'd like to know if it's possible to use Amazon EBS for storing data(csv,json,...) for Data Lake, and acces them for processing and store to s3 ?

posix
질문됨 3년 전425회 조회
1개 답변
0
수락된 답변

Hi,

While you could use EBS to implement portions of a data lake architecture (volumes for data analytics EC2 instances, for example), it is not a good choice for storing the actual data in a data lake. This is because EBS volumes are typically only accessible by one instance (excluding multi-attach, which requires a cluster-aware filesystem) and should not be used for sharing data. They will not be accessible by Glue, Athena or OpenSearch. Your better bet would be to store the shared data in S3, which is a key aspect of data lake design.

AWS
kdavyd
답변함 3년 전
AWS
전문가
검토됨 3년 전
  • Oley i understand. Thank you.

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인

관련 콘텐츠