data quality on Dynamodb and Timestream

0

Hi everyone. what do you do for checking data quality on Dynamodb and Timestream? since in Timestream only dimensions need to be present,and in Dynamodb only Primary Key need to be present we might face with some data quality issue I would be grateful if give me your opinion

profile picture
gh02
질문됨 한 달 전53회 조회
1개 답변
0

DynamoDB does not provide an out of the box solution for schema enforcement or validation. You must enforce your schema and data on the client side. You can do that through higher level SDKs such as DynamoDB Mapper or Enhanced Client, or through third party SDK Clients.

If you need something like DynamoDB but with schema enforcement then you can consider Amazon Keyspaces.

profile pictureAWS
전문가
답변함 한 달 전
  • thanks. if i wanna checking data quality for Timestream and DDB with deequ or great expectations, can I export them and then use those tools? or can I use them directly?

  • I can't speak for Timestream, but DynamoDB allows you to export the table to S3 where you can run anything over the data and have no production impact on your table.

  • have you heard about using those data quality tools for ddb without exporting data? i mean connect to table directly

  • deequ is built into Glue, and Glue has connectors for DynamoDB so I assume its entirely possible: https://aws.amazon.com/blogs/big-data/test-data-quality-at-scale-with-deequ/

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인