Storage requirements
Storage requirements for Data Services.
Storage Requirements
Data Services | Storage type | Storage required | Purpose |
---|---|---|---|
Cloudera Data Engineering | Block | 500GB per Virtual Cluster in Embedded NFS | Stores all information related to virtual clusters |
Cloudera Data Warehouse | Local | 100 GB per executor in LITE mode and 600 GB per executor in FULL mode | Used for caching |
Cloudera Control Plane | Block | 118 GB total if using an External Database, 318 GB total if using the Embedded Database (SSD support only) | Storage for Cloudera infrastructure including Fluentd logging, Prometheus monitoring, and Vault. Backing storage for an embedded DB for control plane configuration purpose, if applicable |
Cloudera AI | Block | 600 GB per node (minimum), 4.5 TB (recommended) | Stores all Cloudera AI Workbench information |
External NFS or Block | 1 TB per Node | Stores all user project files. VFS storage can either use Longhorn NFS-provisioner on Longhorn OR directly connect to your NFS. | |
MonitoringApp | Block | 30 GB + (Env cnt x 100 GB) | Stores metrics collected by Prometheus. |
Cloudera Data Catalog | Requires Control Plane database and not a dedicated storage space | 100 GB extra in Cloudera Control Plane database | Stores profiling metadata. |