Wednesday, March 26, 2025
HomeCyber SecurityAccelerating Analytics Workloads with Cloudera, NVIDIA, and Cisco

Accelerating Analytics Workloads with Cloudera, NVIDIA, and Cisco

[ad_1]

 

Co-Creator: Silesh Bijjahalli

As as we speak’s main corporations make the most of synthetic intelligence/machine studying (AI/ML) to find insights hidden in huge quantities of knowledge, many are realizing the advantages of deploying in a hybrid or non-public cloud surroundings, moderately than a public cloud. That is very true to be used instances with information units bigger than 2 TB or with particular compliance necessities.

In response, Cisco, Cloudera, and NVIDIA have partnered to ship an on-premises huge information answer that integrates Cloudera Knowledge Platform (CDP) with NVIDIA GPUs working on the Cisco Knowledge Intelligence Platform (CDIP).

Cisco Knowledge Intelligence Platform: a journey to hybrid cloud

The CDIP is a thoughtfully designed non-public cloud that helps information lake necessities. CDIP as a non-public cloud is predicated on the brand new Cisco UCS M6 household of servers that help NVIDIA GPUs and third-generation Intel Xeon Scalable household processors with PCIe fourth-generation capabilities.

CDIP helps data-intensive workloads on the CDP Personal Cloud Base. The CDP Personal Cloud Base offers storage and helps conventional information lake environments, together with Apache Ozone (a next-generation file system for information lake).

  • CDIP constructed with the Cisco UCS C240 M6 Server for storage (Apache Ozone and HDFS), which helps CDP Personal Cloud Base, extends the capabilities of the Cisco UCS rack server portfolio with third-generation Intel Xeon Scalable processors. It helps greater than 43 p.c extra cores per socket and 33 p.c extra reminiscence than the earlier era.

CDIP additionally helps compute-rich (AI/ML) and compute-intensive workloads with CDP Personal Cloud Experiences—all whereas offering storage consolidation with Apache Ozone on the Cisco UCS infrastructure. The CDP Personal Cloud Experiences present completely different experience- or persona-based processing of workloads—information analyst, information scientist, and information engineer, for instance—for information saved within the CDP Personal Cloud Base.

  • CDIP constructed with the Cisco UCS X-Collection for CDP Personal Cloud Experiences is a modular system that’s adaptable and future-ready, assembly the wants of recent purposes. The answer improves operational effectivity and agility at scale.

This CDIP answer is totally managed by means of Cisco Intersight. Cisco Intersight simplifies hybrid cloud administration, and, amongst different issues, strikes server administration from the community into the cloud.

Cisco additionally offers a number of Cisco Validated Designs (CVDs), which can be found to help in deploying this non-public cloud huge information answer.

Integrating a giant information answer to deal with AI/ML workloads

More and more, market-leading corporations are recognizing the true transformational potential of AI/ML educated by their information. Knowledge scientists are using information units on a magnitude and scale by no means seen earlier than, implementing use instances equivalent to remodeling provide chain fashions, responding to elevated ranges of fraud, predicting buyer churn, and growing new product strains. To achieve success, information scientists want the instruments and underlying processing energy to coach, consider, iterate, and retrain their fashions to acquire extremely correct outcomes.

On the software program facet of such an answer, many information scientists and engineers depend on the CDP to create and handle safe information lakes and supply the machine learning-derived companies wanted to deal with the commonest and necessary analytics workloads.

However to deploy the answer constructed with the CDP, IT additionally must resolve the place the underlying processing energy and storage ought to reside. If processing energy is just too gradual, the utility of the insights derived can diminish significantly. Alternatively, if prices are too excessive, the work is vulnerable to being cost-prohibitive and never funded on the outset.

Knowledge set measurement a significant consideration for giant information AI/ML deployments

The sheer measurement of the information to be processed and analyzed has a direct influence on the price and velocity at which corporations can prepare and function their AI/ML fashions. Knowledge set measurement may closely affect the place to deploy infrastructure—whether or not in a public, non-public, or hybrid cloud.

Take into account an autonomous driving use case for instance. Working with a significant car producer, the Cisco Knowledge Intelligence Platform ran a proof of idea (POC) that collects information from roughly 150 vehicles. Every automobile generates about 2 TB of knowledge per hour, which collectively provides as much as some 2 PB of knowledge ingested day-after-day and saved within the firm’s information lake. The associated fee to maneuver this information right into a public cloud could be staggering, and, due to this fact, an on-premises, non-public cloud possibility makes extra monetary sense.

Moreover, this information lake incorporates about 50 PB of sizzling information that’s saved for a month and tons of of petabytes of chilly information that should even be saved.

Contemplating infrastructure efficiency

As well as, the efficiency of the underlying infrastructure in lots of AI/ML deployments issues. In our autonomous driving use case instance, the POC requirement is to run greater than one million and a half simulations every day. To supply sufficient compute efficiency to satisfy this requirement takes a mixture of general-purpose CPU and GPU acceleration.

To satisfy this requirement, CDIP begins with top-of-the-line efficiency, as illustrated by means of TPC-xHS benchmarks. As well as, CDIP is obtainable with built-in NVIDIA GPUs, delivering a GPU-accelerated information heart to energy essentially the most demanding CDP workloads. To satisfy the efficiency necessities of this POC, 50,000 cores and accelerated compute nodes had been utilized, supplied by the CDIP answer deploying Cisco UCS rack servers.

Be taught extra in regards to the Cisco, Cloudera, and NVIDIA built-in answer

The Cisco, NVIDIA, and Cloudera partnership provides our joint clients a a lot richer information analytics expertise by means of answer know-how developments and validated designs—and all of it comes with full product help.

In case you have an AI/ML workload that may make sense to run in a non-public or hybrid cloud, study extra in regards to the CDP built-in with NVIDIA GPUs working on the CDIP.

And that will help you get began modernizing your infrastructure help, information lake, and AI/ML processes, check out CVDs.

 

 


We’d love to listen to what you suppose. Ask a Query, Remark Beneath, and Keep Linked with #CiscoPartners on social!

Cisco Companions Social Channels
Fb
Twitter
LinkedIn

Share:



[ad_2]

RELATED ARTICLES

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Most Popular

Recent Comments