- Newest
- Most votes
- Most comments
Based on your experience with Excel files in Amazon Bedrock Knowledge Base, I understand you're facing challenges getting accurate responses to queries about tabular data.
While Excel files (.xls/.xlsx) are indeed supported formats for Bedrock Knowledge Base, there are some best practices that might help improve your results:
-
Consider converting your Excel data to CSV format. CSV files are specifically mentioned as a best practice for including metadata in a data source, especially for large datasets. Using columns to designate content fields and metadata fields can improve data management and retrieval accuracy.
-
Add metadata filtering to your documents. This can significantly improve the relevancy of responses. Your Excel or CSV files should include document metadata attributes or fields that can be used for filtering.
-
For structured/tabular data like student marks, you might want to explore Bedrock Knowledge Base's structured data retrieval capabilities, as this type of query ("Show me all students whose marks are above 90") is more like a database query than a semantic search.
-
Make sure your files don't exceed the 50 MB size limit per file.
-
If you're using multiple sheets in Excel, consider whether the knowledge base is properly processing all sheets or if it might be more effective to split them into separate files.
For your specific use case with student marks, adding metadata for filtering (such as score ranges) might help the system better understand and retrieve the relevant information when queried about score thresholds.
Sources
Prerequisites for your Amazon Bedrock knowledge base data - Amazon Bedrock
Improve search results and accurate in Amazon Bedrock | AWS re:Post
Amazon Bedrock Knowledge Base Chunks Retrieval Issue | AWS re:Post
Thank you all for the support. After further attempts and additional research, I resolved the issue by switching to the Structured data store instead of the Vector store.
Initially, the following Amazon Q response led me to proceed with the vector store: "Amazon Bedrock Knowledge Base supports a variety of file types as input data when using a vector store. Here's a comprehensive list of the supported file formats: Plain text (.txt), Markdown (.md), HTML (.html), Microsoft Word (.doc/.docx), CSV (.csv), Microsoft Excel (.xls/.xlsx), PDF (.pdf), JPEG (.jpeg), PNG (.png)"
Relevant content
- asked 8 days ago
