How to further improve Sagemaker Canvas Model Score?

0

Hi,

I've started exploring and using Sage Maker canvas upon the pilot launch of the service a few months back. I wanted to ask if there's anyway I can further improve my model performance apart from what's available in the canvas console?

Janice

Janice
已提问 2 年前296 查看次数
1 回答
1
已接受的回答

Hi Janice,

Given that Canvas is a no-code tool that abstracts and automate most of the model training process. Model tuning and training performance are mostly limited to the amount of data + your features selection when you train the model.

Nonetheless, one key option you can adopt in improving your score is at the pre-processing stage (i.e feature engineering). Assuming you aren't a technical developer, you can explore the use of AWS Data Brew.

AWS Glue DataBrew is a no-code visual data preparation tool that makes it easy for data analysts and data scientists to clean and normalize data to prepare it for analytics and machine learning. You can choose from over 250 pre-built transformations to automate data preparation tasks, all without the need to write any code. You can automate filtering anomalies, converting data to standard formats, and correcting invalid values, and other tasks. After your data is ready, you can immediately use it for analytics and machine learning projects.

In your case, you can store your initial data in S3 and have databrew performs feature engineering on it before writing it back to S3 again. From there, you can import the processed data back into Canvas and build your model from there. This should gives you a better score compared to building your model directly using the initial raw data.

Data Brew Getting Started guide : Link

Cheers.

Vincent

AWS
已回答 2 年前
profile picture
专家
已审核 1 个月前

您未登录。 登录 发布回答。

一个好的回答可以清楚地解答问题和提供建设性反馈,并能促进提问者的职业发展。

回答问题的准则