Is it possible to use the native S3 SDK for storing Hive data with S3?

0

I use Glue to store Hive metadata and S3 to store Hive data, so can I just use S3's SDK to do write data without hadoop-fs?

질문됨 2년 전223회 조회
1개 답변
0

To add or change data in your S3 buckets, you should be able to use the SDK for S3.

If you want to use the Hive like capabilities to query your data from S3, you would define the metadata for the S3 data using Athena or Glue APIs to define the table definitions or execute the Glue crawler pointing to the file location so that the tables and partitions are defined for you in the Glue catalog. The data that was manipulated using S3 SDK/APIs would be available for querying in Athena as soon as the schemas are defined in the Glue catalog.

If you want to use EMR to manipulate the data, please check out the EMR File system https://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-fs.html

profile pictureAWS
답변함 일 년 전

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인

관련 콘텐츠