Data Trends 2024
Data Trends 2024
Data Trends 2024
3 The chatbot is on the rise. Single-text input LLM apps may be easier to
make, but they don’t allow refinement through natural conversation. For
that you need chatbots, and increasingly that’s what the devs are making.
From May 2023 through January 2024 in the Streamlit community,
chatbots went from 18% of LLM apps to 46%. And climbing.
Tremendous opportunities and challenges lie ahead, and as we analyzed use of the Snowflake Data Cloud to understand
the latest trends around data and technology, our chief interest was around how enterprises are preparing for an unfolding
era in which advanced AI accelerates and transforms how they do business.
The Snowflake Data Cloud encompasses data, models and applications from thousands of organizations across many
industries. Looking at how they work within the platform, including which features they use, paints a vivid picture of the
decisions being made to deal with current challenges and prepare for future success.
While the specific technologies around advanced AI—the algorithms and apps—are —JENNIFER BELISSENT
powerful, they don’t work alone. To be successful, a business must build the shiny, Principal Data Strategist, in Snowflake Data + AI Predictions 2024
new AI technology on top of a solid stack of organizational practices and technologies
to ensure a company’s data is available, secure and properly governed. In other words,
the LLM is the dessert, while a solid data infrastructure is the main course.
In our predictions report for 2024, our in-house experts advised that the proper
response to the new AI age is not to desperately create a new data strategy, but to
accelerate the same solid, thoughtful practices you were following before you ever
heard of ChatGPT.
When we look at how Snowflake users are working with their data, we see exactly
that: a focus on silo-busting, refining governance practices, and finally coming to grips
with the flood of unstructured data. For starters.
+131%
PYTHON SCALA JAVA
AI-friendly Python significantly outpaced Scala and
Java growth in the Data Cloud.
+123 %
and app developers. The suite of languages for
Extracting value from that data has been a tech
unstructured data processing became publicly
challenge for years, exacerbated by the near-
available in public preview or general availability
simultaneous arrivals of smartphones and social
on June 27, 2023.
media, and complicated by evolving regulatory regimes
and privacy practices that govern all of an enterprise’s Given that Python in particular is the language of
data, structured or not. That last point is important; choice for many developers, data engineers and data
even as automation and artificial intelligence help us scientists, its fast-growing adoption suggests that FROM JULY 2023 TO JAN. 2024
extract meaning from unstructured data, the actual these unstructured data workflows are not just for
management of it becomes more difficult. building data pipelines, but also involve AI applications
and ML models.
+675%
1. IDC White Paper, sponsored by Box, “Untapped Value: What Every Executive Needs to Know About Unstructured Data,” IDC #US51128223, Aug 2023
In last year’s trends report, we noted that with both data regulations and consumer
privacy concerns on the rise, we had seen increased adoption of data governance
features. In short, we saw that our users were applying more tags governing access
and use of their data, meaning that they were ensuring that necessary audiences
could make use of their data while restricting unauthorized user access. This year,
that trend continues and in fact deepens.
+142 %
2. ML-based functions evaluated for this report include anomaly detection, forecasting and contribution explorer, which all went into public preview on June 27, 2023.
Anomaly detection and forecasting were subsequently announced into general availability on Dec. 18, 2023.
20,076
fully materialized yet, but we’re definitely seeing a lot of effort to get us there ASAP.
• Within the Streamlit developer community, between April 27, 2023, and Jan. 31,
2024, we saw 20,076 unique developers work on 33,143 LLM-powered apps
(this includes apps that are still in development).
33,143
LLM projects were for work.
And it seems that these developers are steadily improving their creations. Vector
databases and vector search help improve the creativity and utility of an LLM app
by making connections between related concepts rather than requiring exact word
matches. The result is smarter, more accurate outputs, faster.
LLM-POWERED APPS IN
9 MONTHS
80% 28%
WEEKLY % OF THE TOTAL USAGE
19%
60%
40%
SKILLS: I’m still learning
20%
17%
0%
MAY JUN JUL AUG SEPT OCT NOV DEC JAN
2024
SINGLE TEXT INPUT CHATBOT
Some of the foundational trends we’re seeing apply directly to AI: robust, refined
governance; increased use of Python; coming to grips with the vast quantities of
unstructured data. Others speak to a general excellence and willingness to adopt new
practices to accelerate time to value, such as the growth of serverless computing.
As organizations progressively improve their foundation, they pave the way for
successful AI initiatives that will deliver reliable, ethical, secure and impactful results.
And the trends we’re seeing in the AI and applications spaces suggest progress is
being made.
Organizations are picking their models, creating more complex LLM applications,
making AI more available to a wider range of users, and reaping the benefits of a
unified data platform. There has been a lot of hype around the transformational
potential of AI, but judging from what we’re seeing in the Data Cloud, the frenzied
fanfare is beginning to materialize into concrete results.
SNOWPARK
Runtimes and libraries that securely deploy and process Python STREAMLIT IN SNOWFLAKE
and other programming languages in Snowflake. Turn data and ML models into interactive apps with Python—
now all in Snowflake.
LEARN MORE
LEARN MORE
18
APPENDIX:
METHODOLOGY
The Snowflake Data Trends Report 2024 is generated from fully aggregated,
anonymized data detailing usage of the Snowflake Data Cloud and its integrated
features and tools. In this report, we examine patterns and trends in data and AI
adoption across more than 9,000 global Snowflake accounts. The Snowflake Data
Cloud provides insight into the state of data and AI, including which technologies
are the fastest growing. Note that usage attributable to internal consumption, if
any, has been removed and is not reflected in any of the metrics contained herein.
The accounts and usage reflected in this report represent every major industry
and include both longtime Snowflake users and others who only recently joined
the Data Cloud.
Except where noted in the text, the data in this report compares monthly
averages from January 2024 (represented as “this year”) to averages in January
2023 (“last year”). When compared, this is depicted as “year over year” growth
to align with Snowflake’s fiscal year end, though the figures themselves are only
representative of January figures to calculate growth.
© 2024 Snowflake Inc. All rights reserved. Snowflake, the Snowflake logo, and all other Snowflake product, feature and service names mentioned herein
are registered trademarks or trademarks of Snowflake Inc. in the United States and other countries. All other brand names or logos mentioned or used
herein are for identification purposes only and may be the trademarks of their respective holder(s). Snowflake may not be associated with, or be
sponsored or endorsed by, any such holder(s).