1回答
- 新しい順
- 投票が多い順
- コメントが多い順
0
After checking the article mentioned by @Tasio I found out that the data that's coming to cloud watch is not formated properly (un-even spaces). Further digging I found the root cause to be cloudformation template, the format string for API Gw access logs had un-even spaces. While sorting out the un-even space issue I stumble upon another option to have API GW send access logs as csv. SO, I made that change and on Athena I set up the table with following properties:
InputFormat: "org.apache.hadoop.mapred.TextInputFormat"
OutputFormat: "org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat"
SerdeInfo:
Parameters: { "separatorChar" : "," }
SerializationLibrary: "org.apache.hadoop.hive.serde2.OpenCSVSerde"
Now everything is working and I can query the logs using Athena
回答済み 2年前
Glad you could fix it.
関連するコンテンツ
- AWS公式更新しました 3年前
Could you kindly share the pattern you're using? Also, did you check this article? https://aws.amazon.com/premiumsupport/knowledge-center/regexserde-error-athena-matching-groups/