2개 답변
- 최신
- 최다 투표
- 가장 많은 댓글
0
Thanks for the answer. So, in terms of maintainability it would be best to have one for each, but for cost saving parallel tasks would be better, right?
답변함 7달 전
0
Yes, in the code if you call the forEachLoop/await in a thread, you can start multiple streaming queries in the same cluster (Glue streaming job), for instance if using PySpark using a ThreadPool and tasks
This is complicate monitoring, tuning and operations in general but will save you cost significantly.
관련 콘텐츠
- AWS 공식업데이트됨 2년 전
correct, parallel tasks can be challenging if you don't have prior experience maintaining streams