- Newest
- Most votes
- Most comments
It sounds like you may be experiencing some limitations with the basic lexical search capabilities of Amazon Kendra, and you haven't yet enabled or configured some of the more advanced semantic search features.
By default, Kendra uses lexical search, which primarily matches exact words or phrases in the query to the indexed documents. This explains why you're only getting results for exact word matches and not seeing matches for related terms or concepts.
To improve your search results and leverage Kendra's more advanced capabilities, you might want to consider the following:
-
Semantic search: Kendra offers semantic search capabilities that can help understand the intent behind queries and return more relevant results, even when exact word matches aren't present. This feature uses machine learning models to encode text into vectors, allowing for similarity matching based on meaning rather than just words.
-
Custom synonyms: For cases like "Dino" and "Dinosaurier", you can set up custom synonyms in your Kendra index. This would allow Kendra to understand that these terms are related and return relevant results for both.
-
Language-specific analyzers: Ensure that you're using the appropriate German language analyzer for your index. This can help with things like stemming and understanding German-specific linguistic features.
-
Fine-tuning relevance: You may need to adjust the boosting of certain metadata fields or attributes to improve the ranking of your search results.
-
Query expansion: Kendra can expand queries to include related terms, which might help in cases where you're describing dinosaur features without using the exact word.
If you've confirmed these features are properly set up and you're still not seeing improvements, you may want to review your indexing configuration to ensure all relevant documents are being properly processed. Additionally, consider using Kendra's Retrieve API for more advanced retrieval augmented generation (RAG) use cases, which can provide more semantically relevant passages for your queries.
Remember that building an effective search solution often requires iterative refinement and tuning to get the best results for your specific use case and content.
Sources
Retrieving responses from indexes in Amazon Kendra - Amazon Kendra
Amazon OpenSearch Service search enhancements: 2023 roundup | AWS Big Data Blog
Troubleshooting document search results - Amazon Kendra
Relevant content
- asked a year ago
