chatGPT visualisation

openAI published a working paper on how people use chatGPT. They include a visualisation that triggered my curiosity.

The width of the columns represents the cumulative proportion of the main category, and the height of each sub-category represents the proportion of the sub-category within the main category.

The first problem is that the percentages of the main categories do not add up to 100.

Main CategoryPercentage
Multimedia6.00
Other / Unknown4.60
Practical Guidance28.30
Seeking Information21.30
Self-Expression4.30
Technical Help7.50
Writing28.10
Grand Total100.10

The percentages of the sub-categories, however, do add up to 100. Next, I measured the width of the columns to test if they actually got it right.

I then (ironically) used chatGPT to extract the numerical values from the graph.

Next, I divided the width of the column by the percentage it is supposed to represent.

Main CategoryCalculated PercentageColumn WidthColumn Width Ratio
Multimedia5.943.07.3
Other / Unknown4.532.57.2
Practical Guidance28.3208.57.4
Seeking Information21.3157.07.4
Self-Expression4.330.07.0
Technical Help7.654.07.1
Writing28.1207.57.4

While the columns approximate the main category percentages, they vary considerably. So, how can we do this better? The answer is a good old area graph. It scales each box according the the percentage and groups them into the main categories by color.

A new type of visualization is not always a better visualization.

Leave a Reply

Your email address will not be published. Required fields are marked *