Verbose Information About Why Training Data Ineligible in AWS Sagemaker Canvas

0

Hello,

I tried to play around with Sagemaker Canvas to build a text classification prediction model. I built first 2 model using same approach and it worked fine, but when It tried to build another one with different dataset, I stuck in selecting the target field as the target field is disabled (it says that it is ineligible) However, since the information is too vague, I am unable to continue on this. Note :

  • CSV file consist of 2 column, reason and reason_label
  • target will be reason_label column and source will be reason
  • basically I want the model to predict if I am giving such text input, it should produce relevant label as per trained data
  • I make sure that there are no empty string on the csv file, trying to chunk the file (instead of 70k row in one go, I tried with 1 chunk of file of 7k row) didn't work also
  • for additional information, reason_label training data will have 250 unique value at maximum
  • I make sure that no empty string and no missing value in the dataset

My question will be :

  1. What is causing this ineligible issue? I can send sample of data if necessary
  2. Is there any way to make this information more verbose? (e.g. ineligible due to too many variations/other thing) so we are not looking for needle in haystack when debugging this

Thanks and best regards

esanto
已提問 4 個月前檢視次數 94 次
沒有答案

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南