NVIDIA and Cloudera, the renowned enterprise AI data company, today announced an expanded partnership. Cloudera Powered by NVIDIA will incorporate enterprise-grade NVIDIA NIM microservices, part of the NVIDIA AI Enterprise software platform, into Cloudera Machine Learning, a Cloudera Data Platform service for AI/ML workflows. The goal is to provide fast, secure, and simplified end-to-end generative AI workflows in production.
Enterprise data, combined with a full-stack platform optimized for large language models (LLMs), is essential for taking an organization’s generative AI applications from pilot to production. Using the NVIDIA NIM and NeMo Retriever APIs, developers can connect AI models to their business data, including text, images, and visualizations such as pie charts, bar graphs, and line plots, to produce highly accurate, contextually relevant responses. Developers using these microservices can deploy applications through NVIDIA AI Enterprise, which provides optimized runtimes for building, customizing, and deploying enterprise-grade LLMs. By using NVIDIA microservices, Cloudera Machine Learning will help customers unlock the value of the enterprise data managed by Cloudera, bringing high-performance AI workflows, AI platform software, and accelerated computing to the data, wherever it resides.
Cloudera plans to launch multiple integrations with NVIDIA microservices. Cloudera Machine Learning will integrate model and application serving powered by NVIDIA microservices to improve model inference performance across all workloads. With this new AI model-serving capability, customers can achieve fault tolerance, low-latency serving, and auto-scaling for models deployed anywhere, across both public and private clouds. Additionally, to make it easier to connect custom LLMs to enterprise data, Cloudera Machine Learning will provide integrated NVIDIA NeMo Retriever microservices. With this capability, customers will be able to build production-ready retrieval-augmented generation (RAG) applications.
Cloudera and NVIDIA previously collaborated to integrate the NVIDIA RAPIDS Accelerator for Apache Spark into the Cloudera Data Platform, bringing GPU-optimized data processing to the platform. With the planned addition of NVIDIA microservices and integration with NVIDIA AI Enterprise, Cloudera Data Platform will be able to deliver streamlined end-to-end hybrid AI pipelines.
Organizations across industries will be able to build, customize, and deploy the LLMs that underpin disruptive generative AI more quickly and easily. Examples include coding copilots that shorten development time, chatbots that automate customer interactions and services, text summarization apps that speed up document processing, streamlined contextual search, and much more. By simplifying and accelerating data and advanced AI workflows across the company, these technologies shorten time to business value while increasing revenue generation and optimizing costs.
“Cloudera is integrating NVIDIA NIM and CUDA-X microservices to power Cloudera Machine Learning, helping customers turn AI hype into business reality,” said Priyank Patel, Vice President of AI/ML Products at Cloudera. “In addition to delivering powerful generative AI capabilities and performance to customers, the outcomes of this integration will empower enterprises to make more accurate and timely decisions while also mitigating inaccuracies, hallucinations, and errors in predictions, all critical factors for navigating today’s data landscape.”
“Enterprises are eager to leverage their massive volumes of data for generative AI to build custom copilots and productivity tools,” said Justin Boitano, Vice President of Enterprise Products at NVIDIA. The integration of NVIDIA NIM microservices into the Cloudera Data Platform gives developers a simpler, more flexible way to deploy LLMs that drive business transformation.
Cloudera will demonstrate its latest AI capabilities at NVIDIA GTC, the developer conference for the AI era. The event runs March 18–21 at the San Jose McEnery Convention Center in San Jose, California, and brings together the organizations and entrepreneurs shaping the future of artificial intelligence and accelerated computing.