Questions tagged with Analytics
Content language: English
Select up to 5 tags to filter
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
Glue 4 Hudi supportlg...
I am trying to store a data stream from kafka using the hudi format. I am following this doc https://docs.aws.amazon.com/glue/latest/dg/aws-glue-programming-etl-format-hudi.html and I even tried to...
3
answers
0
votes
256
views
asked 9 months agolg...
We are getting a new error for some of our crawlers.
```
The number of s3 paths passed as input has exceeded the limit. Your account is limited to 100 accounts per crawler.
```
This shows up on a...
2
answers
1
votes
208
views
asked 9 months agolg...
Hello, I have been experimenting with Aws glue, and created some crawlers to crawl the data but the behavior wasn't what I expected,
Question 1) I had an S3 bucket
with 3...
1
answers
0
votes
285
views
asked 9 months agolg...
We are trying to use Glue to query and aggregate some Parquet files in S3.
We get this error related to schema mismatch:
```
An error occurred while calling o106.pyWriteDynamicFrame....
2
answers
0
votes
225
views
asked 9 months agolg...
How to run a Hadoop Jar file (a mapreduce job) on EMR cluster in the CLI mode?
I have already set the cluster and have a jar file. However, I don't know how to use Hadoop to run the Jar file.
any...
1
answers
0
votes
541
views
asked 9 months agolg...
I successfully executed the Python query to convert the JSON format to Parquet format. The conversion was completed without any issues, and I can confirm the presence of the Parquet file and its...
1
answers
0
votes
367
views
asked 9 months agolg...
I am looking to have metadata produced by crawlers during Glue Jobs record both source AND target information on the same data catalog table. From the research I have done, recording source metadata...
1
answers
0
votes
230
views
asked 9 months agolg...
We have multiple devices writing into AWS Timestream databases.
We have grafana querying timestream.
Problems are:
1. grafana doesnt do any caching
2. timestream is expensive to query, when doing it...
2
answers
0
votes
289
views
asked 9 months agolg...
In a AWS Glue job, I am getting issue when I am trying to execute sql query in below code:
```
source_dataset = glueContext.create_dynamic_frame.from_options(connection_type="oracle",...
2
answers
0
votes
1998
views
asked 9 months agolg...
I am using glue to move data from s3 and redshift into a data lake. I would like to use a combination of AWS glue and Athena to create a source to target mapping report. The goal is to make the report...
1
answers
0
votes
258
views
asked 9 months agolg...
Does Clickstream Analytics for Android fully support Jetpack Compose? Asking because our current Analytics provider Pendo does not and I could not find any mention in the AWS documentation
1
answers
0
votes
254
views
asked 9 months agolg...
Hello team,
We are planning to build a data lake in AWS that will contain regularly extracted data from an on-prem data warehouse. The purpose of this data lake is to serve the following purposes in...
2
answers
0
votes
326
views
asked 10 months agolg...