Automated Data Cleaning and Visualization

0

I have an API from which I can retrieve data and upload it to my s3 buckets. I plan to clean this data for missing values, duplicates, invalid values, and also remove outliers. After the data is cleaned, I want to visualize this data and use it as an API so the users can access my visualized charts.

I have tried different tools for cleaning processes such as AWS Glue DataBrew, Sagemaker Data Wrangler, and using python (pandas). However, I am unaware of what's the best method to go about especially if I want to automate the entire process so that the cleaning and visualization works as soon as I add a CSV file into the S3 bucket.

Keine Antworten

Du bist nicht angemeldet. Anmelden um eine Antwort zu veröffentlichen.

Eine gute Antwort beantwortet die Frage klar, gibt konstruktives Feedback und fördert die berufliche Weiterentwicklung des Fragenstellers.

Richtlinien für die Beantwortung von Fragen