More Database, Analytics Workloads Ran on Kubernetes in 2022

The percentage of organizations running databases on Kubernetes leaped 26 percentage points in 2022 compared to last year, according to a new survey by the Data on Kubernetes (DoK) Community.
More than three-quarters (76%) of survey participants now acknowledge the use of databases on Kubernetes, up from 50% just last year. Analytics workloads have also jumped significantly, the report states, going from 39% to 67%.
Actually running stateful applications (those including data saved to persistent disk storage) is not relatively common in the abstract. A year ago, 55% of the Cloud Native Computing Foundation’s 2021 user survey were doing this. Yet, based on the DoK report, the mix of application types that use data on Kubernetes appears to be growing.
The new report surveyed more than 500 Kubernetes users that run data workloads on Kubernetes. Consistency and ease of management are the leading factors behind running data workloads on Kubernetes, which are both critical to ensuring that widespread, production use of containers can be handled.
Notably, among those using data on Kubernetes, there was no increase in utilizing persistent storage, and an actual decline in streaming or messaging workloads.
Data on Kubernetes Is a Day 2 Operations Issue
The DoK report’s other key findings included:
- Most (72%) respondents started running data workloads on Kubernetes more than a year ago. With some experience handling Day 2 operations (production), survey participants indicated general satisfaction with the different types of stateful workloads being run on Kubernetes.
- Automating application provisioning and configuration management is the challenge people cite most often in managing data workloads on Kubernetes.
- Two out of three survey respondents (66%) are using operators to run data on Kubernetes, which can address some data management challenges, but only if they can also deal with other Day 2 issues like observability and managing the storage lifecycle.
Transformative Impact on Organizations
The survey revealed a consensus that running data workloads on Kubernetes has a transformative impact on organizations. Perception of value is high, yet may overestimate real benefits.
- One in three respondents (33%) believe running data on Kubernetes is having a transformative impact on productivity, with another 51% noting at least a significant positive impact. The numbers are only slightly lower when asked about the impact on revenue.
- Sixty-seven percent said their organization and/or developers are at least 50% more productive after adopting Kubernetes to manage data workloads. That’s up from 57% in last year’s DoK survey. However, this is not comparable to actual productivity benchmarks, at least not yet.
- Fifty-four percent claim that more than 10% of their organization’s revenue can be attributed to the ability to run data on Kubernetes. Taking a step back, we believe that it is tenuous at best to link Kubernetes to revenue this way. Fifty-two percent of organizations reported that more than half of their data workloads run on Kubernetes. But running production workloads with Kubernetes infrastructure is not the same thing as these workloads actually generating revenue.