HPE Ezmeral: Uncut
JoannStarke

Accelerate AI, ML, and analytics workloads with a unified platform

Learn how you can replace multiple stack complexity and accelerate AI/ML models into production with HPE Ezmeral Unified Analytics.

Data-driven organizations have been using analytics to unlock the value in their data for a few years. But now they want to go deeper by layering AI/ML workloads on top.

Today’s analytic platforms come with a good deal of promise, but enterprises continue to face challenges and struggle to achieve the full potential of existing platforms.[1] It’s not bad technology or bad people, but rather hurdles that are complicating any organization’s efforts.[2]

The top hurdles include:

Data. Today’s modern enterprise operates in hybrid environments, which means its data is distributed across multiple locations, formats, and types (in other words, data silos). Data engineers are the persona responsible for unifying this data, cleansing it, and then creating the data pipelines used to train models. It’s normal for them to run a gauntlet of multiple approval layers just to access hybrid data, so pipelines stall and data scientists put a bull’s-eye on the engineers’ backs.

Figure 1. Different personas that work on data along the pipeline

Technical challenges. Data moves along a "pipeline" from one persona to the next before delivering outcomes to data end users. Each of these personas has a preferred tool already in use and insists on continuing to use it. Data scientists create models in one environment, which rarely matches production. Machine learning engineers struggle to reproduce these models and frequently need to refactor the code to ensure productization and repeatability.

As you can imagine, there’s a lot of friction between the personas.

Open source software. Each persona has a preferred tool or framework, and increasingly that choice is open source tooling such as Apache Airflow, Superset, Kubeflow, and Spark. These tools are powerful but were not designed to integrate, even with one another, which hinders cross-team collaboration and model operationalization. As a result, outcomes to business owners are slow.

Solving these challenges requires an automated platform that reduces friction and increases collaboration and productivity. But with open source, the question becomes "Why would I buy a platform from a vendor when I can download the software and create my own platform?"

Building your own platform  

The breadth and depth of creating your own AI/ML platform is wildly underestimated. It isn’t as simple as cobbling together a couple of open source tools, and it is by no means a simple three-engineer, three-month project.

Figure 2. Sampling of open source components needed for AI/ML platform

To begin with, there is a very long list of components that need to be integrated, and there are easily five to 10 different choices for each component. A small sample of those components is shown in Figure 2.
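To get a feel for the scale, here is a rough back-of-the-envelope sketch in Python. The count of 12 components is purely a hypothetical figure chosen for illustration (the real list is longer), and the function names are made up for this example. It counts how many distinct stacks are possible with five or 10 choices per component, and how many component pairs could need integration work.

```python
from math import comb


def stack_combinations(num_components: int, choices_per_component: int) -> int:
    """Number of distinct stacks if every component is chosen independently."""
    return choices_per_component ** num_components


def pairwise_integrations(num_components: int) -> int:
    """Number of component pairs that may need to be wired together."""
    return comb(num_components, 2)


# Hypothetical number of components; not taken from Figure 2.
num_components = 12

low = stack_combinations(num_components, 5)    # five choices per component
high = stack_combinations(num_components, 10)  # 10 choices per component
pairs = pairwise_integrations(num_components)

print(f"Possible stacks: {low:,} to {high:,}")
print(f"Potential pairwise integrations: {pairs}")
```

Even under these modest assumptions the number of possible stacks runs into the hundreds of millions, which is why evaluating and integrating the options is rarely a small project.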

In the end you will discover:

  • Tools don’t integrate well with each other
  • Functionality overlaps in some areas while in others, gaps exist
  • Some tools are open source, some proprietary
  • Different tools target different personas

Bottom line:  A do-it-yourself AI/ML open source platform is difficult! 

But you have a dedicated team of crack DevOps, development, and site reliability engineers, so you push forward, only to discover that the next step is plumbing: networking and communications across external data lakes and warehouses, CI/CD processes, identity providers, infrastructure provisioning tools, tools to manage credentials and access controls, multi-tenancy, storage, and secrets. If you make it through all that, you’re ready to celebrate, right? Not so fast.

Every quarter, each component releases patches and upgrades on its own schedule, which means your team of crack experts needs to apply each update individually and then test to make sure it hasn’t broken the stability of the platform, disrupted productivity, or exposed the organization to risk.

Even though hybrid cloud environments have become the dominant deployment pattern for AI/ML and analytic workloads, there is no guarantee that open source software will function correctly across them. Enterprises building their own platform will need to “lift and shift” as well as rearchitect open source code to run in the cloud.[3]

Several sophisticated organizations with large platform and engineering teams have tried to accomplish this journey only to tell us that their platform wasn’t easy to use, couldn’t be replicated, and lacked end-to-end support from any vendor. 

A unified platform is the answer

The alternative is a unified platform that spreads a single stack across on-premises and cloud environments, an approach that empowers developers and data science professionals to work on multiple use cases from a consistent environment using existing open source tooling.

That is exactly what HPE Ezmeral Unified Analytics Software does. It’s a single solution stack for hybrid analytics and AI/ML workloads that empowers developers and data science professionals to work on multiple use cases using their preferred tools. The solution replaces multiple stack complexity and allows models to be created and tested in production-like environments. It also reduces complex hand-offs across the data pipeline with automation and converts work into containers, allowing pipelines to work across hybrid infrastructure without code refactoring.

Figure 3. From the top, managed ecosystem for data engineering, analytics, and data scientists included in HPE Ezmeral Unified Analytics

An import feature (green buttons in Figure 3) allows teams to ingest third-party and custom applications as needed. Connectors are available for Snowflake, Microsoft MySQL, Delta Lake, Teradata, and Oracle as well as popular structured and unstructured data sources.

Figure 4. Sampling of the data connectors available through HPE Ezmeral Unified Analytics

Organizations that adopt a unified analytics platform benefit from open source tools that integrate seamlessly, come with simplified management, and run smoothly across a hybrid cloud environment.

See how you can successfully deploy data science and analytic workloads with HPE Ezmeral Unified Analytics Software. Watch the video to see the solution in action, then learn more here.


Joann Starke
Hewlett Packard Enterprise

twitter.com/HPE_Ezmeral
linkedin.com/showcase/hpe-ezmeral
hpe.com/software

 

[1] “Unifying Analytics Across a Hybrid Cloud Environment,” S&P Global (formerly 451), May 2023.

[2] “Top 10 reasons why AI projects fail,” Cognilytica.

[3] “Unifying Analytics Across a Hybrid Cloud Environment,” S&P Global (formerly 451), May 2023.

About the Author

JoannStarke

Joann’s domain knowledge and technical expertise have contributed to the development and marketing of cloud, analytics, and automation solutions. She holds a B.S. in marketing and computer science. Currently she is the subject matter expert for HPE Ezmeral Data Fabric and HPE Ezmeral Unified Analytics.