Cloudera Unifies Trino, SDX & Lineage: A New Era of AI-Ready Data Access

Written by Matthias Vallaey | Nov 27, 2025 2:17:02 PM

As organizations accelerate their AI journeys, one challenge continues to stand in the way of meaningful progress: data accessibility. Despite massive investments in modern data stacks, most enterprises still struggle to make their data discoverable, governed, consistent, and usable across environments.

A recent survey illustrates this clearly:

  • Only 9% of IT leaders say all their organizational data is accessible.

  • Just 38% claim that most of their data is usable for AI.

The rest remains locked in silos, hidden behind fragmented governance rules or trapped in legacy systems.

Cloudera’s latest platform update directly targets this challenge by integrating Trino, Cloudera Shared Data Experience (SDX), and Cloudera Octopai Data Lineage into a single, unified architecture — powered end-to-end by AI-driven automation.

The Power of a Unified, AI-Driven Data Fabric

Cloudera’s updated platform brings together three powerful capabilities to create a cohesive and intelligent data fabric:

1. Trino for Federated Querying Across Any Environment

Organizations can now query distributed datasets — on-premises, in any cloud, or across multiple systems — without moving data. Natural language interfaces allow business teams to explore data intuitively, while compute engines close to the data ensure performance and security.
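As a rough illustration of what querying without moving data can look like: Trino addresses tables by a three-part name, catalog.schema.table, so a single SQL statement can join tables that live in different systems. The sketch below only builds such a statement as a string. The catalog, schema, and table names (`hive`, `postgresql`, `sales.orders`, `crm.customers`) and both helper functions are hypothetical and are not taken from Cloudera's release.

```python
def qualified(catalog: str, schema: str, table: str) -> str:
    """Return a Trino-style three-part table name: catalog.schema.table."""
    return f"{catalog}.{schema}.{table}"


def federated_join_sql(orders_table: str, customers_table: str) -> str:
    """Build one SQL statement that joins tables from two different catalogs."""
    return (
        "SELECT c.region, SUM(o.amount) AS revenue\n"
        f"FROM {orders_table} AS o\n"
        f"JOIN {customers_table} AS c ON o.customer_id = c.id\n"
        "GROUP BY c.region"
    )


# Hypothetical catalogs: a data lake (hive) and an operational database (postgresql).
ORDERS = qualified("hive", "sales", "orders")
CUSTOMERS = qualified("postgresql", "crm", "customers")
QUERY = federated_join_sql(ORDERS, CUSTOMERS)
```

Submitted through a Trino client, a statement like `QUERY` would be resolved against both catalogs by the engine, so neither table has to be copied into the other system first.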

2. SDX for Unified Metadata, Governance & Access Control

Cloudera SDX consolidates metadata, policies, and permissions across the entire ecosystem.
This unified control plane ensures:

  • Consistent governance everywhere

  • Secure self-service access for all teams

  • Zero duplication of security or access policies

3. Octopai Data Lineage for End-to-End Transparency

Octopai tracks the full lifecycle and transformation journey of every dataset — even data originating outside Cloudera.
This delivers:

  • Full auditability

  • Trust in AI outcomes

  • Clear impact analysis for all data flows
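Impact analysis over lineage can be thought of as a graph traversal: starting from one dataset, follow every "feeds into" edge to find what a change would touch downstream. The sketch below is a generic breadth-first search over a toy lineage graph. It is not Octopai's implementation, and the dataset names and the `downstream_impact` helper are hypothetical.

```python
from collections import deque

# Hypothetical lineage edges: each key feeds the datasets listed as its value.
LINEAGE = {
    "crm.customers": ["staging.customers_clean"],
    "staging.customers_clean": ["marts.churn_features", "reports.customer_360"],
    "marts.churn_features": ["ml.churn_model"],
}


def downstream_impact(graph: dict[str, list[str]], source: str) -> list[str]:
    """Return every dataset reachable from `source`, in breadth-first order."""
    seen = {source}
    queue = deque([source])
    impacted = []
    while queue:
        node = queue.popleft()
        for child in graph.get(node, []):
            if child not in seen:  # guard against revisiting shared or cyclic nodes
                seen.add(child)
                impacted.append(child)
                queue.append(child)
    return impacted
```

Calling `downstream_impact(LINEAGE, "crm.customers")` lists the staging table, the two derived datasets, and the model that depends on them, which is the kind of answer an audit or a schema change review needs.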

AI at the Core: Automating the Data Fabric

The platform uses AI to streamline and automate essential data operations such as:

  • Data quality checks

  • Classification

  • Profiling

  • Metadata enrichment

This reduces manual work, accelerates data readiness, and empowers technical and non-technical teams alike.
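To make profiling and quality checks concrete, here is a minimal, rule-based sketch of the kind of work being automated. The source does not describe how the platform implements these steps, so the functions, the statistics chosen, and the 10% null-rate threshold are all assumptions for illustration, and nothing here involves AI.

```python
def profile_column(values: list) -> dict:
    """Compute simple profile statistics for one column of values."""
    total = len(values)
    non_null = [v for v in values if v is not None]
    return {
        "count": total,
        # Share of missing values; 0.0 for an empty column to avoid dividing by zero.
        "null_rate": (total - len(non_null)) / total if total else 0.0,
        "distinct": len(set(non_null)),
    }


def passes_quality_check(profile: dict, max_null_rate: float = 0.1) -> bool:
    """Hypothetical rule: a column passes if its null rate is within the threshold."""
    return profile["null_rate"] <= max_null_rate
```

A profile like this is the sort of metadata that could then be attached to a dataset so that both engineers and business users can see how complete a column is before relying on it.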

What Organizations Can Achieve Now

With this release, Cloudera enables enterprises to:

Increase Efficiency

Automate data preparation, quality processes, and governance workflows.

Democratize Data Access

Use natural language interfaces to make data available to everyone — not just engineers.

Boost Trust & Transparency

Leverage intelligent metadata and complete lineage for compliant, explainable data products and AI systems.

A Unified Platform for the AI Era

“Our mission at Cloudera has always been to help organizations make trusted data available for every AI initiative,” says Leo Brunnick, Chief Product Officer at Cloudera.
“With this release, we’re taking a major step forward by bringing AI-driven automation, governance and access together on one platform.”

By combining Trino federation, SDX governance, and Octopai lineage, Cloudera provides something enterprises have long needed: a single, secure, AI-powered data fabric that unifies access to all data, anywhere.