Data Mesh Vs Data Lake
Data Analytics / September, 19 2021

Data Mesh vs Data Lake – Driving Business Insights at Scale

Data is now the soul of every digital business, and the pandemic has accelerated the adoption of Analytics and AI as a business function. Over the past few years, organizations had to rapidly move to new data technologies, modern data architectures, and infrastructure to drive innovations such as personalized product recommendations and predictive analytics. Despite such changes, collection, integration, and governance of data is still the main inhibitor to Analytics and AI success, says Deloitte Research.

The evolution of business insights platforms can be fragmented into three generations, as per Zhamak:

  • Organizations deployed traditional data warehouses in the first generation to generate reports as per the need. This was very expensive and lacked a centralized approach. 
  • As Big data and analytics gained popularity in the second generation, data warehouses were replaced by a central data lake. Though this became very popular, a few bottlenecks like data volumes, scalability, domain-specific data highlighted the need for a decentralized approach. 
  • The current third-generation platforms address a few gaps and garnered attention towards product thinking with data, self-service platforms design, and distributed domain-driven architecture. All these gave way to a Data Mesh architecture.

Data Mesh vs Data Lake – A Paradigm Shift

Companies no more operate on gut-driven decisions but tend to be data-driven. Data Lake was always at the core for any such organization, providing democratized access to all business functions.

Data mesh vs data lake – Rethinking data architecture

As the data lakes grew, the complexity of data management also changed. In a typical data lake architecture, data producers generate it and send it to the data consumers. In short, data producers are very tech-savvy while consumers are business savvy. Often, data consumers had to go back to data producers to understand the domain and intrinsic value of the data. The centralized data ownership created two main challenges for businesses: 

  1. Most of the data engineering team’s efforts are led towards fixing the issues and revalidating the data. An inherent difficulty in searching and interpreting the data is evident. 
  2. Data users are not aware of the source domain from where the data is extracted often leading to low data quality. 

The new architectural concept, data mesh resolves these big data issues by decentralization of data ownership and federated data governance. Data mesh claims that distributed domain-driven architecture can fuel big data innovations and resolve scalability challenges. The approach addresses the challenges with a shift in thinking at four levels:

  • At platform level – Data as a product and distributed domain-based data architecture. Data responsibilities are vested to domain owners and data as a product is shared between different domain owners. 
  • At technical level – Decentralized data ownership and shared data infrastructure with a conceptual separation between data ownership and infrastructure. Cross-function data is offered through APIs for other business needs. Data quality and data products are vested with domain owners. 
  • At team level – More collaboration from domain owners to bring in new data sources and further develop solutions as per the business needs. 
  • Expertise – Skills are broader for each team and enables easier shift across different data products 

Data Mesh vs Data Lake – Scalability to Drive Insights Faster

Data lakes have democratized access for all the business users but created siloed data and organization structure that does not scale up to deliver the promised value of a data-driven organization. In reality, we find ubiquitous data, disconnected data teams, and little access to consuming domain experts. Data mesh approach powers the next-gen enterprise data platform architecture in convergence of the following:

  • Distributed Domain-Driven Architecture – Microservices architecture has transitioned the business to domain-oriented capabilities, but we have disregarded the same with data. In data mesh, domains host and serve the data in a consumable way shifting the approach from centralized data lake architecture. Traditional Push and Ingest thinking are transformed to a Pull and Serve model across all the domains with the data mesh concept.

How does it alter insights?   

Cleansing, Preparing and Aggregating of data lies with the domain, and data pipelines are handled within the domain thereby resolving data quality issues. But yes, each domain data set must have a service level objective to ensure quality. 

  • Data Product Thinking – Over the past few years, operational teams evolved product thinking by creating the best developer experience for their APIs and making them understandable and discoverable. The same applies to data mesh architecture too by making the data product discoverable, addressable, trustworthy, and governance with global standards. Well-described semantics, syntax, centralized meta information, and data provenance and data lineage to aid in avoiding siloes and hyper-specialized ownership.

Do we need data as a product?  

The Marketing Manager of an online retailer would struggle to identify unnamed analytics solutions as per their need. But they would be interested to use a data-driven customer engagement platform. A well-defined identifiable data product leads to exceptional results as per the business context and drives decision-making at scale.

  • Self-Service Data Infrastructure as a Platform – Hiding all the complexities, keeping the data infrastructure domain agnostic, excluding domain-specific information remains the centerpiece in building data infrastructure as a platform.

Why such a centralized platform?  

Though the tools and techniques are not matured in the system, this infrastructure platform resolves duplication of efforts in setting up data pipeline engines, storage, and streaming infrastructure. This entire setup reduces the lead time to create a new data product and drives the automation efforts.

Treating data as a product alters the ownership responsibility, brings in more visibility, and makes it easier to consume the data. Therefore, the Data mesh concept avoids human knowledge siloes to create value and innovation from data for Machine Learning and AI experts.

Do you think data mesh is still a concept or interested in implementing it? A few use cases were mentioned in our previous data mesh blog and now let us look at a few implementations by the industry leaders. 

A Few Data Mesh Implementations

Data mesh is not about a specific code or tech stack that solves your problems with the click of a button. Many experts also argue that this approach is suited for large organizations. But the reality is that data is diversified and ubiquitous for businesses of any size and growth for the organization of any size lies in new-age data management solutions. A few industry leaders have already mitigated the risk of siloed data.

Europe’s leading fashion platform Zalando transitioned to a data mesh self-service architecture, all built within the centralized infrastructure layer of AWS data lake. The Insights team could access the needed data and the analytics solutions was scalable as per the business requirements. 

Netflix, the online streaming platform processes trillions of events every day. As data integration is core for their business operations, Netflix implemented data mesh architecture to optimize costs, improve performance, and mitigate operational risks. 

Data mesh is not specific to any industry, Financial giant JP Morgan implemented data mesh to facilitate data reuse and derive insights faster.  

Are you interested? Please connect with us for more information about our data mesh offering.

Get in Touch

Newsletter

Stay up-to-date with our latest news, updates, and promotions by subscribing to our newsletter.

Microsoft Solutions Partner - Infrastructure (Azure)
Microsoft Solutions Partner - Modern Work
Microsoft Solutions Partner - Data & AI (Azure)
Microsoft Solutions Partner - Business Applications
Microsoft Partner Azure Expert MSP

Copyright © 2008-2023 Saxon. All rights reserved | Privacy Policy

Address: 1320 Greenway Drive Suite # 660, Irving, TX 75038

Archana Aila

Archana Aila

Position Here

With 2 years of hands-on experience in Power Platform, I’ve excelled in developing and implementing solutions for businesses, harnessing the power of Power Apps, Power Automate, Power BI, and Power Virtual Agents to streamline processes and enhance productivity. My proficiency extends to crafting custom applications, automating workflows, generating data insights, and creating chatbots to aid operational efficiency and data-driven decision-making.

With an intermediate knowledge in Azure cognitive services, incorporating them into Power Platform use cases to innovate and solve complex challenges. My expertise in client engagement and requirements gathering, coupled with effective team coordination, ensures on-time, high-quality project deliveries. These efforts have yielded significant accomplishments, solidifying my role as a valuable asset in this field.

Palak Intodia

Palak Intodia

Position Here

I am a tech graduate with a strong passion for technology and innovation. With three years of experience in the IT industry, I’ve been on a continuous journey of professional growth and skill development. My expertise lies in Power Apps and Automate, where I’ve had the privilege of contributing to multiple successful projects.

I’m dedicated to delivering results that not only meet expectations but also drive the success of the projects I’m involved in. I’m committed to my ongoing professional development and the pursuit of excellence.

Roshan

Roshan Jaiswal

Position Here

With nearly 2 years of dedicated experience in Power Platform technology, my expertise lies in crafting customized business solutions using Power Apps and Power Automate. I excel in identifying intricate business requirements and translating them into innovative, user-friendly applications. My daily tasks involve meticulously deploying applications across diverse environments and harnessing the full potential of the Microsoft ecosystem within business applications.

I have proven my adaptability by consistently meeting the demands of creating responsive and scalable applications. Also seamlessly integrating complex workflows and data sources, ultimately enhancing operational efficiency and driving sustainable business growth.

Sugandha

Sugandha Chawla

Position Here

Sugandha is a seasoned technocrat and a full stack developer, manager, and lead. Having 8 years of industry experience, she has been able to build excellent working relationships with all her customers, successfully establishing repeat business, from almost all of them. She has worked with renowned giants like Infosys, Ernst & Young, Mindtree and Tech Mahindra.

She has very diverse and enriching work experience, having worked extensively on Microsoft Power Platform, .NET, Angular, Azure, Office 365, SQL. Her distinctiveness lies in the profound domain knowledge, managerial skills, and process mastery, that she additionally holds, as a result of possessing a customer facing role, working with different sectors, and managing and driving numerous critical executions, single-handedly, end to end.

Vibhuti Dandhich

Vibhuti Dadhich

Position Here

Vibhuti, a Power Platform technology evangelist, has passionately embraced the transformative potential of low-code development. With a background that includes experience at EY and Wipro, she’s been a trusted advisor for clients seeking innovative solutions. Her expertise in unraveling complex business challenges and crafting tailored solutions has propelled organizations to new heights.

Vibhuti’s commitment to staying at the forefront of technological advancements and her forward-thinking approach have solidified her as an industry thought leader. Her mission is to empower businesses to thrive in the digital age, revolutionizing operations through the Power Platform.

Ruturaj Kulkarni

Ruturaj Kulkarni

Position Here

With 8 years of dedicated expertise in the IT realm, I am a seasoned professional specializing in .NET technologies and Microsoft Azure Cloud. My journey encompasses a profound understanding of software development using the .NET framework and a robust command over Azure’s cloud ecosystem. Throughout my career, I’ve demonstrated a knack for crafting scalable and efficient solutions, leveraging the power of cloud computing.

My passion lies in staying at the forefront of technological advancements, ensuring that my skills align seamlessly with the dynamic landscape of IT. Ready to tackle challenges and drive innovation, I bring a wealth of experience to any project or team.