2回答
- 新しい順
- 投票が多い順
- コメントが多い順
0
Thanks for the answer. So, in terms of maintainability it would be best to have one for each, but for cost saving parallel tasks would be better, right?
回答済み 7ヶ月前
0
Yes, in the code if you call the forEachLoop/await in a thread, you can start multiple streaming queries in the same cluster (Glue streaming job), for instance if using PySpark using a ThreadPool and tasks
This is complicate monitoring, tuning and operations in general but will save you cost significantly.
correct, parallel tasks can be challenging if you don't have prior experience maintaining streams