Better Predictions at Scale with NVIDIA Merlin

Summary: NVIDIA Merlin is an open-source library designed to accelerate recommender systems on NVIDIA GPUs. It provides a scalable and GPU-accelerated solution for building high-performing recommenders at scale. This article explores how NVIDIA Merlin can help data scientists, machine learning engineers, and researchers build effective recommender systems with better predictions at scale. Building Better Recommenders with NVIDIA Merlin Recommender systems play a crucial role in online activities, influencing a significant portion of shopping and entertainment choices....

September 4, 2024 · Pablo Escobar

Boost Multi-Omics Analysis with GPU-Acceleration and Generative AI

Unlocking the Power of Multi-Omics Analysis with GPU Acceleration and Generative AI Summary: The rapid growth of genomic data has created a significant challenge for bioinformatics analysis. To address this, NVIDIA has introduced Parabricks v4.3, a tool that leverages GPU acceleration and generative AI to accelerate multi-omics analysis. This article explores how Parabricks v4.3 can help researchers analyze DNA, RNA, methylation, single-cell, and spatial omics data at high speed and accuracy....

September 4, 2024 · Carl Corey

Boosting Performance on NVIDIA Ampere Architecture

Boosting Performance on NVIDIA Ampere Architecture: A Guide to Controlling Data Movement Summary The NVIDIA Ampere GPU architecture offers significant performance improvements over its predecessors, particularly in AI training and inference workloads. One key aspect of achieving these improvements is controlling data movement within the GPU. This article explores how developers can leverage the Ampere architecture’s features to optimize data movement and boost performance. Introduction The NVIDIA Ampere GPU architecture is designed to deliver faster performance for HPC, AI, and data analytics workloads....

September 4, 2024 · Tony Redgrave

Bridging the CUDA C++ Ecosystem and Python Developers with Numbast

Bridging the Gap Between CUDA C++ and Python Developers Summary: NVIDIA’s CUDA ecosystem has long been a cornerstone for high-performance computing, particularly in fields like deep learning and data analytics. However, the barrier between CUDA C++ and Python developers has often hindered the full potential of GPU acceleration. This article explores how Numbast, a tool that automatically generates Numba bindings for CUDA C++ libraries, is bridging this gap, making it easier for Python developers to leverage CUDA’s power....

September 4, 2024 · Carl Corey

Bringing Generative AI to the Edge with NVIDIA Metropolis Microservices for Jetson

Summary NVIDIA Metropolis Microservices for Jetson is revolutionizing the way we develop and deploy edge AI applications. By combining the power of generative AI with edge computing, developers can create sophisticated AI-powered applications that run in real-time, without relying on cloud infrastructure. This article explores how NVIDIA Metropolis Microservices for Jetson is bringing generative AI to the edge, and what this means for industries such as healthcare, manufacturing, and automotive....

September 4, 2024 · Tony Redgrave

Build a Digital Human Interface for AI Apps with an NVIDIA NIM Agent Blueprint

Summary Building a digital human interface for AI apps can significantly enhance customer service experiences. The NVIDIA NIM Agent Blueprint provides a comprehensive package to create and deploy AI-powered digital humans. This blueprint combines NVIDIA NIM microservices with reference code and documentation, enabling enterprises to build custom AI applications. The digital human interface offers a human-like interaction, leveraging retrieval-augmented generation (RAG) for smooth and accurate information retrieval. Creating Human-Like Customer Service with NVIDIA NIM Agent Blueprint The Need for Digital Human Interfaces Traditional text-based chatbots have limitations, particularly in multilingual support and accurate information retrieval....

September 4, 2024 · Tony Redgrave

Build and Deploy Powerful Robots with Isaac SDK 2019.2

Summary: NVIDIA’s Isaac SDK 2019.2 is a comprehensive toolbox designed to accelerate the development and deployment of AI-powered robots. This version brings numerous features and enhancements to the robotics community, focusing on AI perception and navigation. Here, we explore how the Isaac SDK can help developers build and deploy powerful robots. Building Powerful Robots with NVIDIA’s Isaac SDK 2019.2 NVIDIA has announced the release of Isaac SDK 2019.2, a significant update to its comprehensive toolbox for accelerating the development and deployment of AI-powered robots....

September 4, 2024 · Emmy Wolf

Build Custom Enterprise-Grade Generative AI with NVIDIA AI Foundation Models

Building Custom Enterprise-Grade Generative AI: A Comprehensive Guide Summary: Generative AI is revolutionizing how enterprises approach various tasks, from software development to product lifecycle management. However, building custom, enterprise-grade generative AI solutions requires expertise in data collection, infrastructure setup, and model optimization. This article explores how NVIDIA AI Foundation Models can help developers create custom generative AI models tailored to their enterprise needs. Understanding Generative AI in Enterprises Generative AI has become a critical tool for enterprises looking to enhance productivity, improve code quality, and streamline development processes....

September 4, 2024 · Pablo Escobar

Build Efficient Recommender Systems with Co-Visitation Matrices and RAPIDS cuDF

Building Personalized Recommender Systems with Co-Visitation Matrices and RAPIDS cuDF Summary: Recommender systems are crucial for personalizing user experiences across various platforms. They predict and suggest items based on past behaviors and preferences. This article explores how to build efficient recommender systems using co-visitation matrices and RAPIDS cuDF, a GPU DataFrame library that accelerates data processing. Understanding Co-Visitation Matrices Co-visitations are a key concept in recommender systems. The idea is simple: items that are frequently visited together in the past are likely to be visited together in the future....

September 4, 2024 · Pablo Escobar

Build Enterprise Retrieval-Augmented Generation Apps with NVIDIA Retrieval QA Embedding Model

Building Next-Generation Enterprise Apps with Retrieval-Augmented Generation Summary Retrieval-Augmented Generation (RAG) is a powerful technique that enhances the capabilities of large language models (LLMs) by integrating them with information retrieval systems. This approach allows enterprises to build applications that provide accurate, up-to-date responses to user queries by leveraging external knowledge sources. The NVIDIA Retrieval QA Embedding Model is a key component in building such RAG applications, offering state-of-the-art performance and customization options for various industries....

September 4, 2024 · Carl Corey