Storage requirements

Storage requirements for each of the Data Services: the storage type, the capacity required, and what the storage is used for.

Storage Requirements

| Data Services | Storage type | Storage required | Purpose |
| --- | --- | --- | --- |
| Cloudera Data Engineering | Block | 500 GB per Virtual Cluster in Embedded NFS | Stores all information related to virtual clusters. |
| Cloudera Data Warehouse | Local | 100 GB per executor in LITE mode and 600 GB per executor in FULL mode | Used for caching. |
| Cloudera Control Plane | Block | 118 GB total if using an External Database, 318 GB total if using the Embedded Database (SSD support only) | Storage for Cloudera infrastructure, including Fluentd logging, Prometheus monitoring, and Vault. Also the backing storage for the embedded database used for control plane configuration, if applicable. |
| Cloudera AI | Block | 600 GB per node (minimum), 4.5 TB (recommended) | Stores all Cloudera AI Workbench information. |
| Cloudera AI | External NFS or Block | 1 TB per node | Stores all user project files. VFS storage can either use the Longhorn NFS-provisioner on Longhorn or connect directly to your NFS. |
| MonitoringApp | Block | 30 GB + (number of environments x 100 GB) | Stores metrics collected by Prometheus. |
| Cloudera Data Catalog | No dedicated storage space; uses the Control Plane database | 100 GB extra in the Cloudera Control Plane database | Stores profiling metadata. |
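
The per-unit figures above can be combined into a rough capacity estimate. The following Python sketch is illustrative only: the function and constant names are hypothetical, the numbers are copied from the table, and Cloudera AI and Cloudera Data Catalog are left out because their totals depend on node counts and on the Control Plane database in use.

```python
# Rough storage estimate based on the figures in the table above.
# All names are hypothetical; adjust the constants if the requirements change.

CDE_GB_PER_VIRTUAL_CLUSTER = 500
CDW_GB_PER_EXECUTOR = {"LITE": 100, "FULL": 600}
CONTROL_PLANE_GB = {"external": 118, "embedded": 318}
MONITORING_BASE_GB = 30
MONITORING_GB_PER_ENVIRONMENT = 100


def monitoring_storage_gb(environments: int) -> int:
    """MonitoringApp: 30 GB + (number of environments x 100 GB)."""
    return MONITORING_BASE_GB + environments * MONITORING_GB_PER_ENVIRONMENT


def estimate_storage_gb(virtual_clusters: int, cdw_executors: int,
                        cdw_mode: str, database: str,
                        environments: int) -> dict:
    """Return the estimated storage in GB for each service."""
    if cdw_mode not in CDW_GB_PER_EXECUTOR:
        raise ValueError("cdw_mode must be 'LITE' or 'FULL'")
    if database not in CONTROL_PLANE_GB:
        raise ValueError("database must be 'external' or 'embedded'")
    return {
        "Cloudera Data Engineering (block)":
            virtual_clusters * CDE_GB_PER_VIRTUAL_CLUSTER,
        "Cloudera Data Warehouse (local)":
            cdw_executors * CDW_GB_PER_EXECUTOR[cdw_mode],
        "Cloudera Control Plane (block)": CONTROL_PLANE_GB[database],
        "MonitoringApp (block)": monitoring_storage_gb(environments),
    }


if __name__ == "__main__":
    # Example: 2 virtual clusters, 4 executors in FULL mode,
    # the embedded database, and 3 environments.
    estimate = estimate_storage_gb(2, 4, "FULL", "embedded", 3)
    for service, gigabytes in estimate.items():
        print(f"{service}: {gigabytes} GB")
```

For this example input, MonitoringApp needs 30 + (3 x 100) = 330 GB and Cloudera Data Warehouse needs 4 x 600 = 2400 GB. Treat the result as a planning aid, not a substitute for the requirements in the table.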