Installing Cloudera Data Services on premises using Cloudera Embedded Container Service
Install Cloudera Data Services on premises using the Cloudera Embedded Container Service.
- In the Cloudera Manager WebUI, the remote_repo_override_user and remote_repo_override_password parameters must contain valid credentials for archive.cloudera.com before any upgrades or installation of Cloudera Embedded Container Service.
- When deploying an Cloudera Embedded Container Service cluster, the batch size limitation for adding Cloudera Embedded Container Service agent nodes to Cloudera Embedded Container Service cluster is under 50. If the requirement is to deploy an Cloudera Embedded Container Service cluster with more than 50 nodes, Cloudera recommends starting the initial deployment with less than 50 nodes and incrementally add nodes to the cluster after the first installation succeeds.
- Before configuring Cluster IP Range (cluster-cidr) and Service IP Range (service-cidr), you must review best practices at the Suse website. After your cluster is deployed, these values cannot change. Any misconfiguration will require decommissioning the cluster and redeploying it to correct the settings.
If the installation fails, and you see the following error message in the
stderr output during the Install Longhorn
UI step, retry the installation by clicking the
Resume button:
++ openssl passwd -stdin -apr1 + echo 'cm-longhorn:$apr1$gp2nrbtq$1KYPGI0QNlFJ2lo5sV62l0' + kubectl -n longhorn-system create secret generic basic-auth --from-file=auth + rm -f auth + kubectl -n longhorn-system apply -f /opt/cloudera/cm-agent/service/ecs/longhorn-ingress.yaml Error from server (InternalError): error when creating "/opt/cloudera/cm-agent/service/ecs/longhorn-ingress.yaml":
Internal error occurred: failed calling webhook "validate.nginx.ingress.kubernetes.io": Post "https://rke2-ingress-nginx-controller-admission.kube-system.svc:443/networking/v1/ingresses?timeout=10s": x509: certificate signed by unknown authority
In Cloudera Data Services on premises deployments using RKE2,
container life cycle events, such as container mounts, start, or stop, are
logged to the /var/log/messages file using
systemd.
Because the Cloudera Data Services on premises installation is
not OS-integrated, no systemd, rsyslogd,
or logrotate configurations are delivered. As a result, these
verbose messages are flooding the /var/log/messages file,
creating disk space pressure on the /var directory, which might
lead to stability or availability issues.
/var/log/messages file, consider the following logging
strategies:- Configure the following OS-level logging options for Cloudera Data Services on premises, especially for Cloudera Embedded Container Service on RKE2:
-
A sample
rsyslog.dconfiguration or journald filters to redirect container lifecycle logs to a separate file. For example, to/var/log/rancher-container-events.log. -
Safety valve-based injection method, if feasible, or post-install script guidance for these OS-level logging configurations.
-
- Alternatively, filter or rate-limit verbose lifecycle logs at the container runtime layer, if possible.
- If you specified a custom certificate, select the Cloudera Embedded Container Service cluster in Cloudera Manager, then select . This action copies the
cert.pemandkey.pemfiles from the Cloudera Manager server host to the Cloudera Embedded Container Service Management Console host. - Click Open Cloudera on premises to launch your Cloudera Data Services on premises instance.
- Log in using the previously provided username and password.
- Set up external authentication using the URL of the LDAP server and a CA certificate of your secure LDAP. Set up external authentication using the URL of the SAML and upload SAML Identity Provider Metadata file. Follow the instructions on the Welcome to Cloudera on premises page to complete this step.
- Click Test Connection to ensure that you are able to connect to the configured LDAP server.
- Create your first Virtual Warehouse in the Cloudera Data Warehouse Data Service
- Provision an Cloudera AI Workbench in the Cloudera AI Data Service
- Add a Cloudera Data Engineering service in the Cloudera Data Engineering Data Service





