flatten deeply nested JSON with Crawler

0

My customer wants to flatten deeply nested JSON object. They used Glue Crawler Classifier with $[*] (lift the array elements up one level, so that each JSON record gets loaded into its own row). When they ran Crawler and view results he saw some array type instead of struct.

I saw a previous response to similar but need to understand in more details how to fix that

AWS
已提问 4 年前2468 查看次数
1 回答
0
已接受的回答

They are seeing Arrays and Structs based on the schema of the JSON document.

{
   "event_params":[ {"key":"Value"}, {"Key","value"}],
   "geolocation": { "lat": 56.333333, "lng": 57.44333 }
}

If this is my JSON document, when I run the crawler it will result in this schema

event_params: Array
geolocation: Struct

The customer can use Custom Classifiers in Glue to extract the data further. Or, they can keep it as is, and run a glue job to format the data into a new table in columnar format.

已回答 4 年前

您未登录。 登录 发布回答。

一个好的回答可以清楚地解答问题和提供建设性反馈,并能促进提问者的职业发展。

回答问题的准则