October 27, 2023

Data Exploration with GPT-4

 

Environment:  ChatGPT Plus, GPT-4 with Advanced Data Analysis

 

Data:  Superstore dataset.

 

Objective:  Analyze & visualize data with GPT-4.

 

In my previous post, I used GPT-4 to design a dashboard with multiple graphs.  This approach is applicable when you know the data so you can instruct GPT-4 to create visualizations the way you would like to interpret the data.

 

But when you are not familiar with the data set, you can prompt GPT-4 to explore and visualize the data.  This method can go in many different directions, however, it can yield insights that you might not have expected.

 

 

1)   Upload and Analyze Data:  When prompted to analyze data, GPT-4 proceeds to analyze data like sales analysis, customer analysis, product analysis.

 


 

The analysis will give the results in text and numbers, which are OK.  

 


 

 

2)   Visualize Data:  If you want to analyze data visually, the better prompt is to ask GPT-4 to visualize data, which created graphs with key insights.  

 

 








 

Verdict:  When you’re not familiar with a particular data set, use GPT-4 to explore data and provide initial analysis.  Then based on these initial analyses, you can do a deep data analysis in the direction you want.  With proper guidance, GPT-4 can be your helpful assistant.

 

Warning:  Artificial intelligence can sometimes give convincing but fictitious answers.  Make sure you always verify the results when using any LLM’s (ChatGPT, Bard, Claude…). 


October 20, 2023

Dashboard Design in GPT-4


Environment:  ChatGPT Plus, GPT-4 with Advanced Data Analysis.


Data:  Superstore dataset.


Objective:  Design a dashboard with multiple graphs.



1)  Create Graphs:  GPT-4 was asked to create a bar chart, line chart, big number, and donut chart.












2)  Design Dashboard:  GPT-4 was prompted to arrange the graphs into a dashboard.  






3)  Format Dashboard:  The final step is to format the dashboard to give it a more refined look.  It took more than 10 prompts to give the dashboard a more professional look.  Formatting a dashboard with attention to details is a time-consuming task whether it’s done in GPT-4 or Tableau.  I guess something never changes.  






Verdict:  One plus thing is GPT-4 can arrange multiple graphs into a dashboard to provide multiple metrics to users in a single place.  One minus thing is GPT-4 can’t create filters yet for users to drill down.



October 13, 2023

Advanced Data Analysis with GPT-4


Environment:  ChatGPT Plus, GPT-4 with Advanced Data Analysis.


Data:  Alternative fueling station stations in America.


Objective:  Use GPT-4 with advanced data analysis to work with file upload, analyze and visualize data. 

 


1)    Upload data file:  First, the file Alternative Fuel Stations 2021.csv was downloaded from data.gov web site, cleaned up, and saved. 

 

To upload the data file, I selected the GPT-4 box, chose ‘Advanced Data Analysis, then clicked on the plus icon which allowed me to upload.




 

After reading the data from the file, GPT-4 provided an overview of the columns’ description.




 

2)    Describe Fuel Type Code:  I instructed GPT-4 to describe the codes in the ‘Fuel Type Code’ column and provide count of the stations.  The description clarified what each code stands for.




 

3)    Describe Owner Type Code:  Again, GPT-4 can mine its own data  warehouse to provide description for the owner type codes.  




 

4)    Map Chart:  GPT-4 was asked to visualize the station locations in the US.  It turned out GPT-4 is not good at visualizing data on a map.  The visualization was not proportional and not that useful.




 

5)    Bar Chart:  If GPT-4 is prompted with specific instruction, it can produce a refined bar chart of the top 10 states with most stations.




 

6)    Calculate Ratio:  GPT-4 can mine its data warehouse and incorporate its data with the uploaded data.  Below, GPT-4 combined state population with number of stations to calculate the ratio of population vs number of stations for each state.  




 

Here, GPT-4 provides such a huge benefit and can save lots of time for the users.  If this were done in Tableau, users would have to search for a population data set, downloaded and added it to Tableau workbook.  The 2 data sets would have to be blended before the ratio can be calculated.  


 

7)    Export to PDF:  After generating a chart, GPT-4 can export chart to PDF file for downloading.




 

Verdict:  I’m impressed with how readily GPT-4 can interpret and understand the data.  The way it incorporates data from its data warehouse with the uploaded data set and seamlessly calculates the ratio is remarkable.  Note that users always need to verify the calculations as GPT-4 can hallucinate and provide incorrect results.  However, GPT-4 didn't seem to hallucinate in this run.