spark.sql not working on EMR (Serverless)

The following script does not create the table in the S3 location indicated by the query. I tested it locally and the Delta Json file is created and contains the information about the created table.

from pyspark.sql import SparkSession

spark = (SparkSession
    .builder
    .enableHiveSupport()
    .appName('omop_ddl')
    .getOrCreate()
    )


spark.sql(f"""
CREATE
OR REPLACE TABLE CONCEPT (
  CONCEPT_ID LONG,
  CONCEPT_NAME STRING,
  DOMAIN_ID STRING,
  VOCABULARY_ID STRING,
  CONCEPT_CLASS_ID STRING,
  STANDARD_CONCEPT STRING,
  CONCEPT_CODE STRING,
  VALID_START_DATE DATE,
  VALID_END_DATE DATE,
  INVALID_REASON STRING
) USING DELTA
LOCATION 's3a://ls-dl-mvp-s3deltalake/health_lakehouse/silver/concept';
""")

The configuration parameters are the following ones:

--conf spark.jars=s3a://ls-dl-mvp-s3development/spark_jars/delta-core_2.12-2.1.0.jar,s3a://ls-dl-mvp-s3development/spark_jars/delta-storage-2.1.0.jar 
--conf spark.executor.cores=1 
--conf spark.executor.memory=4g 
--conf spark.driver.cores=1 
--conf spark.driver.memory=4g 
--conf spark.executor.instances=1

I tried to modify the location in the query by inserting a non-existent bucket and the script did not go into error. Am I forgetting something? Thank you very much for your help

Topics

Analytics

Relevant content

Creating a Delta table from Spark using the Glue Catalog
Andrea Campolonghi
asked 9 months ago
What is the table IDs range for the temporary intermediate tables created by redshift during query execution?
Accepted Answer
Sumeet
asked 2 months ago
how to enable delta lake in EMR serverless.
Accepted Answer
Muthukumar
asked 4 months ago
Locate the json file which contains changes that we made on a newly created rekognition dataset
NomanM
asked 2 years ago
How do I troubleshoot the "Invalid S3 location" error when I try to save the Athena query results on an S3 bucket?
AWS OFFICIALUpdated 3 years ago
How do I resolve the "The provided key element does not match the schema" error when importing DynamoDB tables using Hive on Amazon EMR?
AWS OFFICIALUpdated a year ago
Why is my EventBridge rule not working in Regions other than the one I created it in?
AWS OFFICIALUpdated 7 months ago
Why does my Athena query fail with the error "HIVE_INVALID_METADATA: Hive metadata for table is invalid: Table descriptor contains duplicate columns"?
AWS OFFICIALUpdated 3 months ago
EMR Cluster failure with "On the master instance, application provisioning failed"
SUPPORT ENGINEER
Yokesh NK
published 9 days ago
EMR Serverless service principal is not authorized to perform: ECR:DescribeImages on resource
SUPPORT ENGINEER
Yokesh NK
published 3 days ago