January 29, 2025

Release notes and fixed issues for version 2.0.47-b302.

New Features / Improvements

Migrated Cloudera AI Workbench, Cloudera AI Registry, and Cloudera AI Inference service images to chainguard to address CVEs.
Added APIv2 support for Enhanced Group Sync.
Added support to create AMPs (Cloudera Accelerators for Machine Learning Projects) using APIv2. Previously, this option was available only using UI.
Added support for H100 GPU instances for Cloudera AI Inference service on Azure.
Added support for AKS workload identity.
Added support for AWS M7a, M7i, C7a, C7i, R7a, R7i instance families.
Added support for Cloudera AI Inference service on EU Control Plane.
Added support for EKS 1.30.
Added support for AKS 1.30.
Hugging Face support (Technical Preview): You can now import text-generating language models from Hugging Face and deploy them on Cloudera AI Inference service.
Added profiles for HuggingFace Models and multi-modal models in the Model Hub catalog.
Updated existing model manifests in the catalog after upgrading the NIM version in Cloudera AI Inference service.
Enhanced error messages related to model import failure in the Model Hub UI.
Carried enhancements in AI Registry to ensure that multi-modals can be supported.
Added runtime support for Llama 3.2 11B and 90B Vision Language Model NIMs to ensure that they can be deployed using AI Inference. Only model profiles optimized for the H100 GPU are supported for these two models in this release.
Llama 3 NIM is no longer supported since we now have both Llama 3.1 and Llama 3.2.
Added support for Diagnostic Bundles in Cloudera AI Inference service.
Upgraded text-generating and embedding NIMs.
Added Code Sample functionality for endpoints deployed using Cloudera AI Inference service.
Model endpoint replica events can now be viewed on the Model Endpoint details UI.You can now add numerous docker credentials using UI or API which can be used to enable Cloudera AI to fetch custom ML Runtimes from a secure repository. For more information, see Add Docker registry credentials and certificates.

Fixed Issues

Previously, some Cloudera AI Inference service clusters did not have the 'creationDate' field. This field is now added.(DSE-38817)
Previously, the deletion of backup for older workspaces was failing. This issue is now resolved. (DSE-41031)
Previously, deleting a workbench backup created by a deleted user displayed an error. This issue is now resolved. (DSE-41052)
Multiple UI improvements are made both in the Create, Read, Update, and Delete operations of Cloudera AI Inference service and while deploying or editing a model endpoint.
The model_name field is now displayed instead of model_id in the Endpoint Details UI. (DSE-38937)
Previously, the NIM model profile environment variable was only assigned for LLMs. Now support for Model Profile override is added for Embedding and Reranker NIMs. (DSE-40508)
Previously, there was an issue with rendering of existing instance type in the "Edit Endpoint" UI. This issue is now resolved. (DSE-40636)
Validated all node group (instance type) selection from UI. (DSE-40754)
Previously, NGC manifest components were missing from the download. This issue is now resolved. (DSE-41055)
The Create ML Serving application now enables the public load balancer. (DSE-41305)
The Instance Type field in the Edit Model Endpoint UI is no longer mandatory. (DSE-41278)
Added force delete option to delete the Cloudera AI Inference service using UI. (DSE-41035)
The Cloudera AI Inference service UI now displays optimization profile details. (DSE-40927)
You can now create, download, and delete log archives for Cloudera AI Inference service. (DSE-40921)
The Test Model UI now fails gracefully when the replica is scaled down to zero for a model deployed using Cloudera AI Inference service. (DSE-40957)
Previously, the Storage initializer had the wrong task values. This issue is now resolved. (DSE-41058)
Enabled storage initializer to now handle more than two directories for NIM artifacts. (DSE-40986)
Removed Llama 3 runtimes. (DSE-40956)
Addressed SQL injection issue in AI Registry that allowed non-authorized but authenticated users to perform Create, Read, Update, and Delete operations on AI Registry’s metadata tables. (DSE-41542)