AWS Glue crawlers not syncing column descriptions from Snowflake over a JDBC connection is expected default behavior. Crawlers reliably capture table metadata such as column names and data types, but by default they do not retrieve column descriptions or comments from the source database.
Enabling expanded metadata when creating the crawler, as you did, is the right first step. For JDBC targets, this option covers two kinds of additional information, which you request by listing 'RAWTYPES' and/or 'COMMENTS' in the target's `EnableAdditionalMetadata` setting in the crawler configuration. 'RAWTYPES' provides the native-level data type of each column, and 'COMMENTS' provides the comments associated with a column or table in the database.
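As a rough illustration of the configuration described above, the sketch below builds a JDBC target with both options enabled and passes it to the Glue `create_crawler` call. The helper names, the connection name, the include path, and the role ARN are all hypothetical; the Glue client is passed in (for example `boto3.client("glue")`) so the helpers themselves have no AWS dependency.

```python
ALLOWED_METADATA = {"COMMENTS", "RAWTYPES"}


def build_jdbc_target(connection_name, include_path, metadata=("COMMENTS", "RAWTYPES")):
    """Build one entry for Targets["JdbcTargets"] in a create_crawler request.

    connection_name and include_path are placeholders for your own Glue
    connection and Snowflake database/schema/table path.
    """
    unknown = set(metadata) - ALLOWED_METADATA
    if unknown:
        raise ValueError(f"Unsupported additional metadata values: {sorted(unknown)}")
    return {
        "ConnectionName": connection_name,
        "Path": include_path,
        "EnableAdditionalMetadata": list(metadata),
    }


def create_snowflake_crawler(glue_client, name, role_arn, database_name, target):
    """Create the crawler; glue_client is assumed to be boto3.client("glue")."""
    return glue_client.create_crawler(
        Name=name,
        Role=role_arn,
        DatabaseName=database_name,
        Targets={"JdbcTargets": [target]},
    )
```

A call might look like `create_snowflake_crawler(boto3.client("glue"), "snowflake-crawler", role_arn, "my_catalog_db", build_jdbc_target("snowflake-conn", "MYDB/PUBLIC/%"))`, where every name is an example value to replace with your own.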
Unfortunately, even with these options enabled, column descriptions from Snowflake may not be captured as you expect. A possible cause is a mismatch between how Snowflake stores and exposes column descriptions and how AWS Glue crawlers read that information over a JDBC connection.
Given this limitation, you may need to consider alternative approaches to ensure that column descriptions are available in AWS Glue and subsequently in AWS DataZone for your business users. Some potential workarounds include:
- Manually updating the column descriptions in the AWS Glue Data Catalog after the crawler has run.
- Developing a custom solution to extract column descriptions from Snowflake and update the AWS Glue Data Catalog programmatically.
- Exploring if there are any Snowflake-specific JDBC drivers or connection options that might expose column descriptions in a way that AWS Glue can interpret.
While these workarounds require additional effort, they may be necessary to achieve your goal of having up-to-date column descriptions available for your business users in AWS DataZone.
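The second workaround (a custom sync) could look roughly like the sketch below. It assumes you have already read the column comments from Snowflake into a plain dictionary, for instance from the `COLUMN_NAME` and `COMMENT` columns of `INFORMATION_SCHEMA.COLUMNS`; that extraction step is not shown. The function names and the set of keys copied into `TableInput` are illustrative choices rather than an official recipe, and `glue_client` is assumed to be `boto3.client("glue")`.

```python
import copy

# Keys from a get_table response that are copied into TableInput. Read-only
# fields such as CreateTime or DatabaseName are left out, because update_table
# rejects them. This subset is an assumption; extend it if your tables use more.
TABLE_INPUT_KEYS = {
    "Name", "Description", "Owner", "Retention", "StorageDescriptor",
    "PartitionKeys", "TableType", "Parameters",
}


def apply_column_comments(table, comments):
    """Return a TableInput dict with Snowflake comments merged into the columns.

    table    -- the "Table" dict from a Glue get_table response
    comments -- {column_name: comment}; matched case-insensitively because
                Snowflake usually reports upper-case names and Glue lower-case
    """
    lookup = {name.lower(): text for name, text in comments.items() if text}
    table_input = {
        key: copy.deepcopy(value)
        for key, value in table.items()
        if key in TABLE_INPUT_KEYS
    }
    for column in table_input.get("StorageDescriptor", {}).get("Columns", []):
        comment = lookup.get(column["Name"].lower())
        if comment:
            # Glue limits column comments to 255 bytes; truncating by
            # characters here is a simplification.
            column["Comment"] = comment[:255]
    return table_input


def sync_column_comments(glue_client, database, table_name, comments):
    """Fetch the catalog table, merge the comments, and write it back."""
    table = glue_client.get_table(DatabaseName=database, Name=table_name)["Table"]
    table_input = apply_column_comments(table, comments)
    glue_client.update_table(DatabaseName=database, TableInput=table_input)
    return table_input
```

Running this after each crawler run keeps the descriptions current. Keep in mind that a later crawl may overwrite manual catalog changes depending on the crawler's schema change policy, so it is worth checking that setting as well.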
Sources
Crawler API - AWS Glue
Crawler - AWS Glue
Overview of using AWS Glue - AWS Glue
For comments added to the table or view to come through to the Glue Catalog table, you have to choose COMMENTS and RAWTYPES for additional metadata, as described in Step 10 of the blog post [1]. This allows the crawler to harvest metadata related to comments and raw types such as geospatial columns. Could you please confirm whether this option was enabled, and whether the comments still do not come through to the Glue Catalog after enabling it?
[1] https://aws.amazon.com/blogs/big-data/announcing-aws-glue-crawler-support-for-snowflake/
