1 Answer
I am not familiar with Tika, but I assume it retrieves the data it needs through the S3 APIs. You can do the same yourself with an AWS SDK. The full list of operations is in the Amazon S3 API Reference; the most relevant ones are GetObjectAttributes and HeadObject, and there may be others. Pick the SDK for your programming language to call them.
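As a rough illustration of the GetObjectAttributes call mentioned above, here is a minimal sketch using boto3. The function name `get_object_summary` and the optional `client` parameter are assumptions made for this example; the sketch also assumes AWS credentials are already configured and that the caller has permission to read the object's attributes.

```python
from typing import Any, Optional


def get_object_summary(bucket: str, key: str, client: Optional[Any] = None) -> dict:
    """Fetch a few object-level attributes with the S3 GetObjectAttributes operation.

    `client` is a hypothetical injection point so a stub can be passed in tests;
    when omitted, a real boto3 S3 client is created.
    """
    if client is None:
        import boto3  # imported lazily so the function can run against a stub

        client = boto3.client("s3")

    response = client.get_object_attributes(
        Bucket=bucket,
        Key=key,
        # Only the attributes listed here are returned in the response body.
        ObjectAttributes=["ETag", "ObjectSize", "StorageClass"],
    )
    return {
        "etag": response.get("ETag"),
        "size": response.get("ObjectSize"),
        "storage_class": response.get("StorageClass"),
        "last_modified": response.get("LastModified"),
    }
```

Note that these are attributes of the S3 object (size, ETag, storage class), not properties stored inside the file's contents.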
Well, I am referring to Apache Tika. I did check the list of S3 APIs but didn't find anything related to file metadata. For example, if you right-click a .docx file and open Properties, you see the number of pages, author, modified date, created date, and so on. That is the information I need, and it is not available through the S3 API.
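To illustrate the distinction drawn in this comment: properties such as author and page count live inside the .docx file itself, which is a ZIP archive containing `docProps/core.xml` and `docProps/app.xml`, so the file's bytes have to be read to get them. The following is a minimal sketch using only the Python standard library, not Tika; the function name `read_docx_properties` is made up for this example, and it assumes the object's bytes have already been downloaded (for instance with GetObject).

```python
import io
import zipfile
import xml.etree.ElementTree as ET

# XML namespaces used by the document-property parts of an Office Open XML file.
NS = {
    "dc": "http://purl.org/dc/elements/1.1/",
    "dcterms": "http://purl.org/dc/terms/",
    "ep": "http://schemas.openxmlformats.org/officeDocument/2006/extended-properties",
}


def read_docx_properties(data: bytes) -> dict:
    """Extract author, created/modified dates, and page count from .docx bytes.

    Values are returned as strings exactly as stored in the file; any part or
    element that is missing yields None or is omitted.
    """
    props = {}
    with zipfile.ZipFile(io.BytesIO(data)) as archive:
        names = set(archive.namelist())
        if "docProps/core.xml" in names:
            core = ET.fromstring(archive.read("docProps/core.xml"))
            props["author"] = core.findtext("dc:creator", namespaces=NS)
            props["created"] = core.findtext("dcterms:created", namespaces=NS)
            props["modified"] = core.findtext("dcterms:modified", namespaces=NS)
        if "docProps/app.xml" in names:
            app = ET.fromstring(archive.read("docProps/app.xml"))
            # The page count is written by the authoring application and may be stale.
            props["pages"] = app.findtext("ep:Pages", namespaces=NS)
    return props
```

This only covers .docx; a library like Tika handles many more formats, which is why it is often used for this task.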
As Uri mentioned, the metadata is returned by the HeadObject call and is not included in the event. Here is sample Python code for getting the metadata info.
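A minimal sketch of that approach using boto3 (not the commenter's original code): it takes the bucket and key from an S3 event notification and then calls HeadObject to fetch the metadata. The function names `head_metadata` and `metadata_from_event` and the optional `client` parameter are assumptions for this example, and it assumes the event has the standard S3 notification shape with at least one record.

```python
from typing import Any, Optional
from urllib.parse import unquote_plus


def head_metadata(bucket: str, key: str, client: Optional[Any] = None) -> dict:
    """Return system and user-defined metadata for one object via HeadObject."""
    if client is None:
        import boto3  # imported lazily so the function can run against a stub

        client = boto3.client("s3")

    response = client.head_object(Bucket=bucket, Key=key)
    return {
        "bucket": bucket,
        "key": key,
        "content_type": response.get("ContentType"),
        "content_length": response.get("ContentLength"),
        "last_modified": response.get("LastModified"),
        "etag": response.get("ETag"),
        # User-defined metadata (the x-amz-meta-* headers set at upload time).
        "user_metadata": response.get("Metadata", {}),
    }


def metadata_from_event(event: dict, client: Optional[Any] = None) -> dict:
    """Look up metadata for the first object named in an S3 event notification."""
    record = event["Records"][0]["s3"]
    bucket = record["bucket"]["name"]
    # Object keys arrive URL-encoded in the event (spaces become '+').
    key = unquote_plus(record["object"]["key"])
    return head_metadata(bucket, key, client)
```

In a Lambda function, `metadata_from_event(event)` could be called from the handler. HeadObject returns only S3-level metadata, so document properties such as author or page count will appear only if they were stored as user-defined metadata at upload time.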