October 13, 2023

Advanced Data Analysis with GPT-4


Environment:  ChatGPT Plus, GPT-4 with Advanced Data Analysis.


Data:  Alternative fueling station stations in America.


Objective:  Use GPT-4 with advanced data analysis to work with file upload, analyze and visualize data. 

 


1)    Upload data file:  First, the file Alternative Fuel Stations 2021.csv was downloaded from data.gov web site, cleaned up, and saved. 

 

To upload the data file, I selected the GPT-4 box, chose ‘Advanced Data Analysis, then clicked on the plus icon which allowed me to upload.




 

After reading the data from the file, GPT-4 provided an overview of the columns’ description.




 

2)    Describe Fuel Type Code:  I instructed GPT-4 to describe the codes in the ‘Fuel Type Code’ column and provide count of the stations.  The description clarified what each code stands for.




 

3)    Describe Owner Type Code:  Again, GPT-4 can mine its own data  warehouse to provide description for the owner type codes.  




 

4)    Map Chart:  GPT-4 was asked to visualize the station locations in the US.  It turned out GPT-4 is not good at visualizing data on a map.  The visualization was not proportional and not that useful.




 

5)    Bar Chart:  If GPT-4 is prompted with specific instruction, it can produce a refined bar chart of the top 10 states with most stations.




 

6)    Calculate Ratio:  GPT-4 can mine its data warehouse and incorporate its data with the uploaded data.  Below, GPT-4 combined state population with number of stations to calculate the ratio of population vs number of stations for each state.  




 

Here, GPT-4 provides such a huge benefit and can save lots of time for the users.  If this were done in Tableau, users would have to search for a population data set, downloaded and added it to Tableau workbook.  The 2 data sets would have to be blended before the ratio can be calculated.  


 

7)    Export to PDF:  After generating a chart, GPT-4 can export chart to PDF file for downloading.




 

Verdict:  I’m impressed with how readily GPT-4 can interpret and understand the data.  The way it incorporates data from its data warehouse with the uploaded data set and seamlessly calculates the ratio is remarkable.  Note that users always need to verify the calculations as GPT-4 can hallucinate and provide incorrect results.  However, GPT-4 didn't seem to hallucinate in this run.



No comments:

Post a Comment