Splitting data into test and train

0

Please guide me with a tutorial for splitting the data into train and test data. AWS is using Sagemaker for machine learning for the new customers.I am a beginner in machine learning.

Please throw some light

질문됨 일 년 전308회 조회
2개 답변
1
AWS
답변함 일 년 전
0

there are quite a few resources you can start with, one of the resource I recommend is the Sagemaker notebook's build-in example, you can access it via https://[yournotebook].notebook.[yourregion].sagemaker.aws/tree#examples or the forth tab of the notebook. you can also view some good examples here - https://github.com/aws/amazon-sagemaker-examples or https://sagemaker-examples.readthedocs.io/en/latest/ generally speaking, you need to use the standard ML library to split the training and test dataset such as: sklearn.model_selection's train_test_split such as https://sagemaker-examples.readthedocs.io/en/latest/sagemaker-python-sdk/scikit_learn_randomforest/Sklearn_on_SageMaker_end2end.html#Prepare-data. most of the examples needs skills on python pandas to clean and prepare your training data

another option I can think of is using AWS Data Wrangler such as from https://aws.amazon.com/blogs/machine-learning/create-train-test-and-validation-splits-on-your-data-for-machine-learning-with-amazon-sagemaker-data-wrangler/

hope that works.

rafaxu
답변함 일 년 전

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인

관련 콘텐츠