- Newest
- Most votes
- Most comments
Thank you for reaching out to us. Please find below ways to improve reindexing performance:
-
Reindexing without replicas is a valid strategy. Reducing your replica shards to 0 momentarily will improve performance and reduce the time to reindex. See the section titled "Change the replica count to zero" in the article referenced[1]. This article also contains different options for optimization worth looking into. 2. Using slicing in OpenSearch reindex operation is also valid. However, I do not recommend disabling replicas while using this strategy, as it will affect the availability of the shards being copied, and increase time. The OpenSearch documentation did not provide much information, but I found documentation from a third party source for ElasticSearch that confirms the slices correlated to the number of shards to copy[2]. The OpenSearch documentation you had linked also contained this article that confirmed slices were how many sub-tasks are involved with the reindex operation, and that it can be set to 'auto' to let OpenSearch decide[3].
- It seems that by default, the throttling is set to -1 in OpenSearch when reindexing[3]. Which means no throttling.
I hope I have answered your questions to your satisfaction. If you have any further queries or requests, please feel free to reach out and ask.
[1]How can I improve the indexing performance on my Amazon OpenSearch Service cluster? https://aws.amazon.com/premiumsupport/knowledge-center/opensearch-indexing-performance/
[2]Reindex API https://www.elastic.co/guide/en/elasticsearch/reference/current/docs-reindex.html
[3]Index document https://opensearch.org/docs/latest/opensearch/rest-api/document-apis/reindex/
However, for complete guidance, I request you to please open a support case AWS and we would help you out further.
Relevant content
- Accepted Answerasked 3 years ago
- AWS OFFICIALUpdated a year ago
- AWS OFFICIALUpdated 3 years ago
- AWS OFFICIALUpdated 3 years ago