Verbose Information About Why Training Data Is Ineligible in AWS SageMaker Canvas

Hello,

I have been experimenting with SageMaker Canvas to build a text classification model. I built my first two models with the same approach and they worked fine, but when I tried to build another one with a different dataset, I got stuck at selecting the target column: the column is disabled and Canvas says it is ineligible. The message is too vague for me to work out how to proceed. Some notes:

  • The CSV file consists of 2 columns, reason and reason_label
  • The target is the reason_label column and the source is the reason column
  • Basically, given a text input, I want the model to produce the relevant label according to the training data
  • I made sure there are no empty strings in the CSV file; splitting the file into chunks (one chunk of 7k rows instead of all 70k rows in one go) did not help either
  • For additional information, reason_label in the training data has at most 250 unique values
  • I made sure there are no missing values in the dataset
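
For anyone who wants to reproduce the checks described in the list above, here is a minimal sketch of how the label column could be profiled locally before upload. It uses only the Python standard library. The function name `profile_label_column`, the `min_rows_per_label` threshold, and the sample rows are all made up for illustration; the threshold in particular is an assumption, not a documented SageMaker Canvas limit, so the output only hints at where to look.

```python
import csv
import io
from collections import Counter


def profile_label_column(csv_text, text_col="reason", label_col="reason_label",
                         min_rows_per_label=2):
    """Summarize a two-column text-classification CSV.

    min_rows_per_label is a hypothetical threshold for this sketch,
    not a documented SageMaker Canvas limit.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    counts = Counter()
    blank_text = blank_label = total = 0
    for row in reader:
        total += 1
        # Treat missing fields and whitespace-only cells as blank.
        text = (row.get(text_col) or "").strip()
        label = (row.get(label_col) or "").strip()
        if not text:
            blank_text += 1
        if not label:
            blank_label += 1
        else:
            counts[label] += 1
    # Labels that appear fewer times than the assumed threshold.
    rare = sorted(l for l, n in counts.items() if n < min_rows_per_label)
    return {
        "rows": total,
        "blank_text": blank_text,
        "blank_label": blank_label,
        "unique_labels": len(counts),
        "rare_labels": rare,
    }


# Invented sample data, only to show the shape of the report.
sample = """reason,reason_label
late delivery,shipping
package arrived late,shipping
wrong item sent,fulfilment
   ,shipping
refund not received,
"""
report = profile_label_column(sample)
print(report)
```

For a real file, the text could be read with `open(path, newline="", encoding="utf-8").read()` and passed in the same way. Whitespace-only cells are counted as blank on purpose, since they are easy to miss when checking only for empty strings.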

My questions are:

  1. What is causing this ineligibility? I can send a sample of the data if necessary
  2. Is there any way to make this message more verbose (e.g. ineligible due to too many distinct values, or some other reason), so we are not looking for a needle in a haystack when debugging this?

Thanks and best regards

esanto
Asked 4 months ago, 94 views
No answers
