1回答
- 新しい順
- 投票が多い順
- コメントが多い順
0
Unfortunately parsing data by Bytes right now is not supported in glue. I observed your data is unstructured and the only way to parse that data is Grok SerDe or Regex SerDe but they parse data by identifying patterns so they are not feasible. I will recommend you to preprocess your data and then load them in glue.
unstructured data -> preprocessing using some Custom built parser function(csv) -> S3 -> crawl and createDatabase in glue.
Thank you for your reply.
関連するコンテンツ
- 質問済み 6ヶ月前
- 質問済み 9ヶ月前
- AWS公式更新しました 3年前
- AWS公式更新しました 3年前
- AWS公式更新しました 1年前
- AWS公式更新しました 2年前
If the numbers of characters are the same by record, I can use Grok pattern like "(?<col0>.{6})(?<col1>.{2})..." when making a data catalog using crawler. But in this case, numbers differ, and bytes are the same. Can anyone tell AWS Glue does support/not support data set like this?