Skip to content

Unable to query MIMIC III on Athena

0

I am trying to run basic queries on MIMIC III but I get the error. Query: SELECT * FROM "AwsDataCatalog"."mimiciii"."admissions" limit 10;

There is no issue of permission from my end for AWS user.

Error is com.amazonaws.services.s3.model.AmazonS3Exception: Access Denied (Service: Amazon S3; Status Code: 403; Error Code: AccessDenied; Request ID: D6GC37V7NH3VC6XZ; S3 Extended Request ID: #######; Proxy: null), S3 Extended Request ID: ########## (Bucket: mimic-iii-physionet, Key: parquet/ADMISSIONS/) This query ran against the "mimiciii" database, unless qualified by the query.

Error in athena for querying mimiciii

I am also unable to access the specified the s3 bucket in which the mimiciii data is stored. Table Properties

Error while accessing the bucket

1 Answer
1

The error you're encountering is an access denied issue when trying to query the MIMIC III dataset in Amazon Athena. This is happening because while you may have the necessary permissions for your AWS user to use Athena itself, you don't have access to the underlying S3 bucket where the MIMIC III data is stored.

Based on the error message, the data is stored in the "mimic-iii-physionet" S3 bucket, which appears to be a bucket that you don't own or have permissions to access. The MIMIC III dataset is a medical database that typically requires specific authorization to access due to its sensitive nature.

To resolve this issue:

  1. You need to obtain proper access to the MIMIC III dataset. This typically requires completing a training course and signing a data use agreement with PhysioNet, the organization that manages this dataset.

  2. If you've already completed the necessary authorization steps for MIMIC III, you may need to ensure that your AWS credentials or role has been granted access to the specific S3 bucket containing the data.

  3. If you're working within an organization that has set up this dataset, contact your administrator to ensure your IAM role or user has the appropriate S3 permissions to read from the mimic-iii-physionet bucket.

  4. Alternatively, if you're supposed to be working with a copy of the MIMIC III dataset in your own S3 bucket, make sure the data is properly loaded and that the Athena table definitions point to the correct location.

Remember that even with proper AWS permissions, accessing medical datasets like MIMIC III requires appropriate authorization due to the sensitive nature of the data.
Sources
S3 Tables - Unable to query using Athena | AWS re:Post
S3 LOCATION INvalid | AWS re:Post
Amazon Athena Features – Serverless Interactive Query Service – Amazon Web Services

answered 15 days ago
AWS
EXPERT
reviewed 12 days ago

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.