Data Quality using PyDeequ

0

Hi, Does anyone use PyDeequ for large enterprises. I am exploring this library and have the below questions:

  1. Looking at the github repo it doesnt seem like it is actively udated. ALso, it supoorts Spark 3.0.0 but not later versions.
  2. Some of the apis didnt work(for complex examples). I dont know if there is any Amazon support.
  3. Also the scala version(deequ) is more up to date than the python version(PuDeequ). s is there a plan to sunset the PyDeequ version
  4. Should I use this for large enterprise data validation framework or there are any other alternate tools. Kindly advise.

Thank you!

質問済み 2年前669ビュー
1回答
0

Hi

To answer question '4' - I would recommend you take a look at AWS Glue DataBrew. Not only is it a fully managed service, but you'll also find that it has a better velocity of new features & updates as its supported by the AWS Glue team.

Thanks

Nick

AWS
Nick
回答済み 2年前

ログインしていません。 ログイン 回答を投稿する。

優れた回答とは、質問に明確に答え、建設的なフィードバックを提供し、質問者の専門分野におけるスキルの向上を促すものです。

質問に答えるためのガイドライン

関連するコンテンツ