For over a decade, I’ve been dreaming of an analytics platform that would take my Excel file and give me actionable insights without any effort from me. The XLS file could have raw data on newsletter subscribers, sales pipeline, and so on. If you think it sounds like magic, you wouldn’t be wrong. But that was my fantasy. And, until ChatGPT came along, it remained a fantasy.
In 2017, I was very thrilled to hear that Gartner created a new subcategory of analytics called Smart Data Discovery, which “automatically finds, visualizes and narrates important findings such as correlations, exceptions, clusters, links and predictions in data that are relevant to users without requiring them to build models or write algorithms”. Smart Data Discovery seemed to come very close to fulfilling my dream.
But it didn’t. In the following years, I tried a dozen products including Excel Pivot Table, DataRobot, and a few SDD tools mentioned by Gartner.
None of them worked as advertised. After I uploaded my raw data, all of them asked me to define the fields in it. I thought that was dumb since this info was available in the header row of the XLS file I uploaded.
Over time, I got frustrated with my experience with these products, and added my dream platform to my laundry list of products that I should try and build myself when I had the bandwidth. I gave it the working name of MINER360 and spec’ced it as follows:
BACKGROUND: Virtually every business has some data, mostly in the form of Excel spreadsheets. There could be a considerable amount of insight locked in the data. To extract it, the company may use Excel Pivot Table or a more sophisticated BI software like COGNOS, DOMO, Tableau, etc. Some of them are too complex and the others, too expensive for the mainstream market. Besides, all of them require manual steps to describe the data and define the nature of insight expected. As Gartner notes in its latest BI and Analytics Platforms Magic Quadrant report, “current visual-based data discovery approach has accelerated data harmonization and the visual identification of patterns in data, as opposed to the previous IT-centric semantic-layer-based approach, (but) the tasks for creating insights are still largely manual and prone to bias.”PAIN: Difficulties in extracting insight from data lead to lost business opportunities.SOLUTION: A platform where the user simply uploads their XLS file and presses a button. The platform automatically performs analytics and outputs actionable insights. Whether it uses data mining or predictive analytics or some other sophisticated analytics technique is decided by the platform based on the nature of data uploaded and kept transparent to the user.
I didn’t find any such product.
That changed when ChatGPT launched 4o version in May 2024. You can find my first impressions of ChatGPT-4o here. Since then, 4o has come a long way since then and fulfilled my wish for MINER360. I’ve used it for analyzing data on several occasions since then.
Given below are a few examples of data analysis by ChatGPT.
1. Marketing Campaign Strategy
I uploaded an XLS file containing sales pipeline data, and entered the following prompt:
You’re a marketing analytics expert. We’re asking you to complete an analysis of the customers represented in this file. Please summarize the main insights we should take action on.
I got a pretty good response from ChatGPT, which is shown in the following exhibit.
In the above prompt, I followed the best practice of telling ChatGPT to assume the persona that’s fit-for-purpose for the given context, which is “marketing analytics expert” in this case.
2. ETF Composition
I received a spreadsheet from HDFC AMC with the list of holdings in its NIFTY 50 ETF. I uploaded this XLS file and asked ChatGPT if it contained any actionable insights.
ChatGPT analyzed the scrips, sectors and holdings, and gave me a bunch of insights. Some of them were obvious since the underlyings of an ETF, by definition, must reflect the stocks making up the index, NIFTY 50 in this case. But there were others that were new to me. You can find Chat’s full response here.
3. Is G7 Obsolete?
After seeing the moribund economy of Germany, UK, and a few other G7 countries on the one hand versus the growing economy of USA, another G7 country, on the other, I wondered if the notion of “West” had become obsolete. I decided to do a deep dive into this. Towards that end, I prompted ChatGPT as follows:
Plot the contribution of the GDP of USA and “G7 Minus USA” to Global GDP for the last 10 years.
While you find Chat’s full reponse here, the chart I got from it is given in the following exhibit.
This chart shows that the dominance of USA is rising whereas the status of the other western countries is falling. To that extent, it confirmed my suspicion that G7’s billing as a monolithic economic powerhouse is a misnomer.
4. ChatGPT And The Art Of Motorcar Maintenance
Here’s an interesting case where tweeple @devaiahPB uploaded a car repair estimate to ChatGPT and sought its help to negotiate a better price with his auto mechanic:
Gotta love chat gpt. Car service time. Estimate came in at 40k. I’m clueless about cars, parts, engines etc (although I’ve a degree in engineering, or maybe it’s because). Usually I just do basic talking and bring it down by 5-10% on a good day.
But this time, I decided to let… pic.twitter.com/5meSjrYgPf
— Devaiah Bopanna (@devaiahPB) November 9, 2024
tl;dr: ChatGPT helped @devaiahPB to reduce his repair cost by 14K.
As you can see in all the above examples, ChatGPT carries out data analysis without asking any questions. This contrasts sharply with the various analytics tools that I’d tried before. While they touted to be business intelligence software, they betrayed their dumbness by asking fairly basic questions that a human with room temperature IQ would not have. Another remarkable thing is that ChatGPT cleanses the data where required whereas the other tools expected me to carry out Extraction Transformation Loading (ETL) and feed them pristine data.
Once you start using ChatGPT for data analysis, you’ll find yourself reaching its rate limit quite quickly. Either you can upgrade to the paid version of ChatGPT or use it as an assistant to help you to do data analysis in ways that I described in my blog post titled Using ChatGPT as a Data Analysis Assistant.
Having gone through seven examples of data analysis by / with ChatGPT, you might have a few questions about hallucination, data sources, data quality, and accuracy of analysis.
That will be the topic of a follow-on post. Watch this space!