RDF format for aws Neptune

0

I have multiple csv I want to convert these csv in n-triple. I have used Rdflib library now when I am running that code it is taking very high time took 4-5 hours still running.... Suggest any other way to convert into n-triple my csv file has 20000 records.

已提問 9 個月前檢視次數 229 次
1 個回答
1
已接受的答案

I am not an RDFLIB expert, but as N-Triples format is highly parallelizable and assuming your CSV format is the same (e.g., every row is independent), I would suggest looking to parallelize your process. If you split the file into say 20 files of 1,000 and run them in parallel, whether as a Glue job, AWS Batch job, or maybe containers...or even kicking off 20 shell processes on an EC2 instance, beyond doing the transformation in parallel, it will also likely be faster because it will have less memory pressure.

AWS
AWS-BMO
已回答 9 個月前

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南