The core idea behind a data mesh architecture
In the era of big data and AI, many organisations struggle to manage and govern the vast amounts of data they generate. Historically, the go-to solution was straightforward: create a centralised data team responsible for the entire company's data landscape — setting standards, enforcing common rules, and assuming full accountability across all data domains.
On paper, it made sense. In practice, it created something far more problematic: a bottleneck.
As businesses grew and data volumes exploded, this central team became the single point of contact for every data request, every pipeline fix, every governance question. Business users waited weeks for answers. And ironically, the team designed to make data accessible ended up making it harder to reach.
What is a Data Mesh? The Core Idea
Developed by Zhamak Dehghani in 2019, Data Mesh is not a technology or a tool you can simply install. It is a new way of thinking about how organisations own, manage, and share data at scale.
The central idea is simple: instead of funnelling all data responsibilities through a single central team, ownership is distributed to the domains that know the data best. The marketing team owns marketing data. The logistics team owns logistics data. Each domain becomes accountable for the quality, availability, and reliability of its own data.
Data Mesh lies on four foundational principles that go beyond decentralisation:
- Domain Ownership: Data is owned and managed by the business domain that generates it. The people closest to the data are best placed to understand it, maintain it, and make it useful
- Data as a Product: Each domain doesn't juste store data - it publishes as a product. That means treating datasets with the same rigour as software products. Instead of delivering in one-shots, it a data product comes with clear documentation, reliable quality, and a defined set of consumers in mind.
- Self-Serve Data Platform: To enable domains to work autonomously, the organisation provides a shared infrastructure - a platform that makes it easy for any domain to build, publish, and consume data products without needing deep engineering expertise
- Federated Computional Governance: Across all domains, a common set of standards and rules applies on security, privacy...
At its core, Data Mesh is a change in mindset before it is a change in architecture. It asks organisations to stop treating data as a byproduct of business operations — one that has owners, consumers, and a standard of quality to uphold.