What is the best way to rank external API data and internal data from OpenSearch?

0

Use Case:

  • I have an OpenSearch service instance with 'internal' documents
  • I have 'external' data that I pull in from an API, ie Slack
  • I need to combine the internal and external data and rank them for relevancy - preferably with OpenSearch

What is the best way to do this? I am considering indexing the data and then deleting the index immediately, but I worry that latency would be a concern.

asked 5 months ago191 views
1 Answer
0

There could be multiple ways using which you will get desired output. However, for the most relevant ranking you have to test out patterns. However, for Relevance-based score calculation You can use sort clauses with relevant scoring . Also, as OpenSearch assign relevance scores using keyword, document age, algorithm or any custom rule. You need to assign relevance score appropriately for both your internal and external data, you can then sort the combined results based on their relevance scores.

You can refer to the below documentation for detailed information on "Learning to Rank" best-practices for Amazon OpenSearch Service.

https://docs.aws.amazon.com/opensearch-service/latest/developerguide/learning-to-rank.html

AWS
SUPPORT ENGINEER
answered 4 months ago

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.

Guidelines for Answering Questions