May 20, 2025
Release notes and fixed issues for version 2.0.50-b68.
New Features / Improvements
Cloudera AI Workbench
- AI Studios (Technical Preview): Cloudera AI Studios is a comprehensive suite of low-code tools designed to simplify the development, customization, and deployment of generative AI solutions within enterprises. This suite empowers organizations to operationalize AI workflows quickly and efficiently by leveraging real-time enterprise data. For more information, see Managing AI Studios.
- Added APIs within the workbench to list Cloudera AI Inference service applications and their associated model endpoints.
Cloudera AI Platform
- Added Azure UDR support for Cloudera AI Inference service.
- Added Azure NTP support.
- Added API support to retry the creation of a Cloudera AI Inference service application upon failure.
Cloudera AI Registry
- Added a new set of models in the Model Hub, including Llama 3.3, DeepSeek-R1-Distill-Llama, StarCoder2, Llama-Nemotron-Nano, NeMo-Retriever-Parse, Llama 3.2 Embedding, and Llama 3.2 Encoder models. To access these models, you must upgrade your Cloudera AI Registries.
- Added support for nim-cli in the AI Registry to import the latest offerings from NVIDIA.
- Enhanced troubleshooting by surfacing underlying issues encountered during AI Registry installation in the Event logs.
- Provided the ability to upgrade the AI Registry directly through the UI, eliminating the reliance on the CLI.
- Implemented automatic redirection to the model import status page whenever a new model import is triggered.
Cloudera AI Inference service
- Users must upgrade their Cloudera AI Inference service applications to serve the latest optimized models from NVIDIA, including Llama 3.3, DeepSeek-R1-Distill-Llama, StarCoder2, Llama-Nemotron-Nano, NeMo-Retriever-Parse, Llama 3.2 Embedding, and Llama 3.2 Encoder models.
- Optimization profile details for deployed model endpoints are now surfaced in the UI for improved visibility.
- A user-friendly warning message will now be displayed when replicas of a deployed model scale down.
- Added an option in the UI to retry the creation of Cloudera AI Inference service applications.
- Users will be automatically redirected to the model endpoint page upon triggering the deployment of a new endpoint.
- Enhanced the UI with a variety of user-friendly tooltips for better usability.
- The metrics page for model endpoints will now refresh automatically every 15 seconds for real-time updates.
- GPU count is now auto-selected for NIM profiles when deploying a model endpoint.
- Ensured that dangling pods of deleted endpoints are terminated immediately instead of being left for garbage collection.
Fixed Issues
Cloudera AI Workbench
- Resolved an issue where duplicate machine user CRNs were preventing the catalog page backup from loading. (DSE-43729)
- Fixed an issue where the Cloudera AI Registry search filter within the workbench displayed an invalid error. (DSE-44401)
Cloudera AI Platform
- Resolved issues that caused retries of upgrade operations to fail. (DSE-44761)
- Resolved an issue where team synchronization removed all collaborators from a project created with an LDAP team when a member was removed from the UMS group mapped to that LDAP team. (DSE-43524)
Cloudera AI Inference service
- Resolved an issue where deleting a node group from the Cloudera AI Inference service UI removed the incorrect node group. (DSE-44981)
- Resolved an issue preventing the import of older models like Mixtral due to compatibility constraints. (DSE-44972)