Simplifying Complex Data Workloads


Simplifying Complex Data Workloads for Core Operations and GenAI Aspirations

Subhead: Automation, built-in governance, data quality and storage optimization all enhanced in Pentaho’s latest platform update

Over the past few years, we’ve been working hard at Pentaho to reinforce and extend the power of Pentaho’s core ETL engine for today’s enterprise data needs. That work has focused on two key areas:

  • New capabilities such as data catalog, data quality and storage optimization – all areas we know data teams need help with when organizing, securing and making their data available for decision-making.
  • Integrating these capabilities into a core platform for a seamless end-to-end experience, while giving our customers the flexibility and agility to deploy what they need from Pentaho wherever they see fit.

Our latest release, Pentaho+ 10.2, is a significant step forward in our vision to simplify the management of a dynamic and ever-changing data landscape for our customers. This is especially timely as AI and GenAI continue to add complexity and increase data demands on already heavy workloads.

We’ve taken great pains to make the user experience as easy as possible while building out the key features we know customers need for day-in and day-out success. Pentaho+ is purposely designed to be enterprise-grade while also being lightweight wherever possible. This makes Pentaho+ faster and easier to deploy than competing platforms while delivering value where it counts.

Although our user interface is elegant in its simplicity, there’s a lot under the hood for today’s enterprise data needs. We’ve enhanced the performance of the entire platform, which includes Pentaho Data Integration and Analytics, Pentaho Data Catalog, Pentaho Data Quality and Pentaho Data Optimizer.

What’s new in Pentaho+ 10.2

Data pipeline templates: Our no-code experience enables any business user to easily build a data pipeline. For example, a data scientist can quickly create a pipeline to move data from the warehouse to the sandbox, or a data owner can build a pipeline to re-tier or archive data they own. The seamless integration of our platform lets the ETL tool carry the workload, while the experience stays simple enough to require only a few clicks in the UI and no code.

Expanded policy application: When making decisions on data lifecycles, you need to evaluate that data against a set of requirements. Pentaho+ allows organizations to use a policy hierarchy to express requirements on data. This covers a wide range of policies, from who should have access to the data, to where it should be stored, to what data quality is required for a given business purpose. Viewing data through this lens helps data stewards accelerate time to value and reduce total cost of ownership. The ability to determine whether data requirements are actually being met is a key benefit of the data governance framework Pentaho’s platform provides.
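To make the idea of a policy hierarchy concrete, here is a minimal, generic sketch in Python. It is not Pentaho's API: the `PolicyNode` class, the requirement names (`min_quality`, `storage_tier`, `allowed_roles`) and the sample dataset are all hypothetical, and the override rule (a more specific level wins over a more general one) is an assumption made for illustration.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class PolicyNode:
    """One level in a hypothetical policy hierarchy (e.g. organization -> domain)."""
    name: str
    requirements: dict = field(default_factory=dict)
    parent: Optional["PolicyNode"] = None

    def resolved(self) -> dict:
        # Merge requirements from the root down; the more specific level overrides.
        inherited = self.parent.resolved() if self.parent else {}
        return {**inherited, **self.requirements}


def unmet_requirements(policy: PolicyNode, dataset: dict) -> list:
    """Return the names of the requirements the dataset does not satisfy."""
    reqs = policy.resolved()
    failures = []
    if dataset["quality_score"] < reqs.get("min_quality", 0.0):
        failures.append("min_quality")
    if "storage_tier" in reqs and dataset["storage_tier"] != reqs["storage_tier"]:
        failures.append("storage_tier")
    allowed = reqs.get("allowed_roles")
    if allowed is not None and not set(dataset["reader_roles"]) <= set(allowed):
        failures.append("allowed_roles")
    return failures


# Hypothetical hierarchy: an organization-wide policy refined by a finance domain.
org = PolicyNode("organization", {"min_quality": 0.8, "allowed_roles": ["analyst", "steward"]})
finance = PolicyNode("finance", {"min_quality": 0.95, "storage_tier": "hot"}, parent=org)

# Hypothetical dataset metadata.
ledger = {"quality_score": 0.9, "storage_tier": "hot", "reader_roles": ["analyst"]}

print(unmet_requirements(org, ledger))      # checked against the general policy
print(unmet_requirements(finance, ledger))  # checked against the stricter domain policy
```

In this sketch the same dataset satisfies the organization-wide policy but falls short of the stricter quality threshold set at the finance level, which is the kind of gap a data steward would want surfaced.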

Decisions based on data relationships: Users can base decisions not just on the content of a data asset but also on its relationships, whether to other data assets, a business term, a governance standard, a reference dataset or an application. The new Galaxy View allows users to easily navigate and drill down on these relationships to evaluate the impact on downstream components and the dependency on upstream components when using data.
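Conceptually, evaluating downstream impact and upstream dependency amounts to walking a graph of relationships. The following is a rough, generic Python sketch of that idea, not how Galaxy View is implemented; the asset names and the `edges` structure are hypothetical.

```python
from collections import deque

# Hypothetical relationship graph: each asset maps to the assets that consume it.
edges = {
    "crm.customers": ["warehouse.dim_customer"],
    "warehouse.dim_customer": ["report.churn", "ml.features"],
    "ml.features": ["ml.churn_model"],
}


def downstream(graph: dict, start: str) -> list:
    """Breadth-first walk returning every asset reachable from `start`."""
    visited = {start}
    order = []
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nxt in graph.get(node, []):
            if nxt not in visited:
                visited.add(nxt)
                order.append(nxt)
                queue.append(nxt)
    return order


def upstream(graph: dict, target: str) -> list:
    """Reverse the edges, then reuse the same walk to find dependencies."""
    reverse = {}
    for src, dsts in graph.items():
        for dst in dsts:
            reverse.setdefault(dst, []).append(src)
    return downstream(reverse, target)


print(downstream(edges, "crm.customers"))  # assets affected by a change to the source
print(upstream(edges, "ml.churn_model"))   # assets the model depends on
```

A breadth-first walk is used here so that assets are listed nearest-first, which tends to match how a user would drill down from one asset to its neighbors.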

Expanding trust in data: Knowing how a report got its data, or how data came into an analytics tool, plays a significant role in building the user trust at the heart of data democratization and AI adoption. With Pentaho, users can see the quality, sensitivity, business context and usage of the data they are working with. Whether you use Pentaho for ETL or another technology such as dbt, you can rely on the Pentaho platform to help you visualize lineage and build confidence in the veracity of data.

Enhancing governance and performance: Reference data plays a significant role in creating efficiency and scaling data governance efforts. Whether it is organic data or purchased from an external vendor, the ability to keep track of reference data values, versions, ownership and validity is vital. The Pentaho platform provides reference data management and supports its use in data identification, data quality evaluation, data enrichment and data remediation.

Explore the power of Pentaho+ today

With the 10.2 release, Pentaho has evolved into a platform ideal for any organization looking to become more data fit, the most critical requirement for being AI ready. Connect with us here for a demo to try out any and all of the platform components.