In this second blog post in our series, we talk about Cloudera Data Platform for IBM Cloud Pak for Data. Much like IBM Cloud Pak for Data, the Cloudera Data Platform is a data and AI platform that can be installed on-premises. In fact, many IBM customers are also Cloudera customers. IBM Cloud Pak for Data is built on Red Hat OpenShift and breaks down silos to enable all of your data users to collaborate from a single, unified interface.
As with most modern platforms, installation involves much more than just unzipping a file or clicking a “next” button on a wizard. Luckily, the Cloudera team recently announced it would open source Ansible playbooks, which we leverage to make this whole process easier for our own purposes.
This blog post is intended to share our experience in using Ansible to install Cloudera Data Platform on IBM Cloud. It’s worth mentioning that the automation used is open source and follows the best practices recommended by the Cloudera Professional Services team.
The environment
We used Virtual Servers on IBM Cloud as the target for our Cloudera Data Platform installation. A total of 8 VMs were chosen, each with 32 vCPUs and 128 GB of RAM and running CentOS. We also had an additional Windows-based VM to run Active Directory, to best mimic what customers most often use in their environments. A single bastion node was provisioned to simplify communication between the user and the hosts. IBM Cloud Pak for Data was also provisioned, but the details of that are out of scope for this post.

Figure 1. List of virtual servers on IBM Cloud
When put together, the environment resembled the architecture diagram below.

Figure 2. An architecture diagram of the environment used for integrating Cloudera Data Platform and IBM Cloud Pak for Data
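To make the shape of this environment concrete, here is a minimal Python sketch that totals the cluster's resources and renders an INI-style Ansible inventory in which every host is reached through the bastion node. The node count, vCPU count, and RAM size come from the description above; the hostnames, the `cluster` group name, the bastion address, and the SSH user are hypothetical placeholders, and this is not the inventory used by the Cloudera playbooks.

```python
# Sketch only: hostnames, group name, bastion address, and SSH user are
# hypothetical placeholders, not values from the actual environment.

NODE_COUNT = 8          # from the post: 8 virtual servers
VCPUS_PER_NODE = 32     # from the post
RAM_GB_PER_NODE = 128   # from the post


def cluster_totals(nodes=NODE_COUNT, vcpus=VCPUS_PER_NODE, ram_gb=RAM_GB_PER_NODE):
    """Return the aggregate (vCPU count, RAM in GB) across all cluster nodes."""
    return nodes * vcpus, nodes * ram_gb


def build_inventory(nodes=NODE_COUNT, bastion="bastion.example.com", user="admin"):
    """Render an INI-style Ansible inventory whose hosts are reached via a bastion."""
    lines = ["[cluster]"]
    for i in range(1, nodes + 1):
        # Hypothetical naming scheme: cdp-node-01 ... cdp-node-08
        lines.append(f"cdp-node-{i:02d}.example.com")
    lines.append("")
    lines.append("[cluster:vars]")
    lines.append(f"ansible_user={user}")
    # The OpenSSH ProxyJump option routes each connection through the bastion.
    lines.append(f"ansible_ssh_common_args='-o ProxyJump={user}@{bastion}'")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    total_vcpus, total_ram_gb = cluster_totals()
    print(f"Cluster capacity: {total_vcpus} vCPU, {total_ram_gb} GB RAM")
    print(build_inventory())
```

Setting the jump host once as a group variable keeps the per-host entries short, which is one common way to work with a bastion; the real playbooks may expect a different inventory layout.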
The Ansible playbooks
As mentioned earlier, to install Cloudera Data Platform on IBM Cloud, we leveraged existing Ansible playbooks that had been open sourced.
The installation takes roughly 30-60 minutes to complete, depending on machine specs. The longest part is when the installer pulls down the necessary artifacts and pushes them to each host.

Figure 3. A screenshot of Cloudera Manager installing Cloudera Data Platform
Next steps
If you’re an IBMer looking to get your hands on Cloudera, or are interested in learning more about using Ansible playbooks to install Cloudera, check out the GitHub repo. If you enjoyed this, check out A technical deep-dive on integrating Cloudera Data Platform and IBM Cloud Pak for Data. You can also learn more about the Cloudera Data Platform for IBM Cloud Pak for Data joint offering.
