Return specific percentiles of an array in a table in Quicksight - Like Pandas Describe Function

0

Hello,

I've searched the AWS Quicksight documentation and I've not been able to find what I am looking for. I am working within Quicksight and I've created a table that provides the following information in 4 columns:

Model Serial Number, Minimum Hour Meter Reading, Maximum Hour Meter Reading, and Total Hours

This table has two filters. First I select a model that I am interested in. Then I selected the timespan that I am interested in. Therefore, the first column of the table (Model Serial Number) lists all of the serial numbers of the model which show up within the timespan that I've selected. The Min and Max hour meter readings are Calculated Fields within my data. I simply use min(hour_meter) and max(hour_meter). The min and max values which are returned are those that exist in the timespan filter. The last column - Total Hours, is another calculated field which simply subtracts the max reading from the min reading to give me the total usage or run time within the time period that I've selected.

The table is fine - but what I really want is a summary table that gives me: number of units, median value of Total Hours, 90th percentile of Total Hours, and 99th percentile of Total Hours.

This is very similar to the Describe function within Pandas. I tried creating calculated fields in my dataset using the percentileDisc function but I am either not using correctly or it doesn't do what I am looking for it to do.

Does anyone know how to create a simple table of the data that I want?

Thank you!

Best Regards, David

asked 2 years ago1140 views
1 Answer
0

Thank you for posting the question, you can use the the below functions to create calculated fields and add them in the table to get your use case:

  1. percentileDisc( {total hours} ,90)
  2. percentileDisc( {total hours} ,99)
  3. median(total hours)

This will display the percentile and median of each Model Serial Number in the selected time range in the existing table you have.

If you want just the percentile and median of total hours but not by each Model serial number, you can create a table without the Model serial number so they are not grouped based on the Model serial number.

you can go through the documentation of the functions to understand more about the usage:

PercentileDisc [Median] (https://docs.aws.amazon.com/quicksight/latest/user/median-function.html)

SUPPORT ENGINEER
answered 2 years ago
profile picture
EXPERT
reviewed 20 days ago
  • Thank you for the reply - I appreciate it. However; as I've stated in the initial post, the percentileDisc() function doesn't work on aggregated data. I get the following error:

    Nesting of aggregate functions like {{aggregateFunction1}} and {{aggregateFunction2}} is not allowed.

    There may not be a solution but I find it hard to believe that something like this isn't possible.

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.

Guidelines for Answering Questions