Data Quality using PyDeequ

Hi, does anyone use PyDeequ in a large enterprise? I am exploring this library and have the questions below:

  1. Looking at the GitHub repo, it doesn't seem to be actively updated. Also, it supports Spark 3.0.0 but not later versions.
  2. Some of the APIs didn't work (for complex examples). I don't know if there is any Amazon support.
  3. Also, the Scala version (Deequ) is more up to date than the Python version (PyDeequ). Is there a plan to sunset PyDeequ?
  4. Should I use this for a large enterprise data validation framework, or are there alternative tools? Kindly advise.

Thank you!

asked 2 years ago · 670 views
1 Answer
Hi

To answer question 4: I would recommend you take a look at AWS Glue DataBrew. Not only is it a fully managed service, but you'll also find that it has a better velocity of new features and updates, as it's supported by the AWS Glue team.

Thanks

Nick

AWS
Nick
answered 2 years ago
