Overview - NVIDIA Docs

Cloud Functions is a serverless API for deploying and managing AI workloads on GPUs, providing security, scale, and reliability for those workloads. The API for accessing workloads is unopinionated and supports HTTP polling, HTTP streaming, and gRPC.
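As a rough illustration of the HTTP polling style, the sketch below shows a generic client-side polling loop. It is not the Cloud Functions API: the `submit` and `check` callables, the `poll_until_done` name, and the convention that a 202 status means "still pending" and 200 means "result ready" are all assumptions made for this example. Consult the API reference for the actual endpoints, headers, and status codes.

```python
import time
from typing import Any, Callable, Tuple

# Each callable returns (http_status_code, body). Both are hypothetical
# stand-ins for real HTTP requests made with a client of your choice.
Response = Tuple[int, Any]


def poll_until_done(
    submit: Callable[[], Response],
    check: Callable[[], Response],
    interval_s: float = 1.0,
    max_polls: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> Any:
    """Submit a request, then poll until a result is available.

    Assumed convention (not taken from the Cloud Functions docs):
    202 = still pending, 200 = finished, anything else = error.
    """
    status, body = submit()
    polls = 0
    while status == 202:
        if polls >= max_polls:
            raise TimeoutError(f"no result after {max_polls} polls")
        sleep(interval_s)          # wait before asking again
        status, body = check()     # ask the server for the current state
        polls += 1
    if status != 200:
        raise RuntimeError(f"request failed with status {status}")
    return body
```

Passing `submit`, `check`, and `sleep` as parameters keeps the loop independent of any particular HTTP library and makes it easy to exercise with fake responses.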

Cloud Functions is available via the NGC Portal. To gain access to Cloud Functions, talk to your NVIDIA Account Manager.

Cloud Functions is primarily suited for shorter running, preemptable workloads such as inferencing and fine-tuning.

See Function Lifecycle for key concepts and terminology.

Function Types

Cloud Functions supports three workload and deployment types for defining functions: “Model only”, “Container”, and “Helm Chart”.

Workloads are ephemeral and preemptable, so avoid running long tasks, such as those spanning several hours, unless you plan to save your work to the local disk.
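One common way to make a long task tolerate preemption is to checkpoint progress periodically and resume from the last checkpoint on restart. The following is a minimal sketch of that pattern, not something prescribed by Cloud Functions: the function names, the JSON state layout, the checkpoint interval, and the `stop_after` parameter (which only simulates a preemption) are all hypothetical. Which storage location actually survives a preemption is also an assumption you should verify for your deployment.

```python
import json
import os
import tempfile


def save_checkpoint(path: str, state: dict) -> None:
    """Write state to a temp file, then atomically swap it into place."""
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    with os.fdopen(fd, "w") as f:
        json.dump(state, f)
    os.replace(tmp_path, path)  # atomic rename: never a half-written file


def load_checkpoint(path: str) -> dict:
    """Return the saved state, or a fresh state if no checkpoint exists."""
    if not os.path.exists(path):
        return {"step": 0, "total": 0}
    with open(path) as f:
        return json.load(f)


def run(path: str, total_steps: int, every: int = 10, stop_after=None) -> dict:
    """Run a toy task (summing step indices), checkpointing every `every` steps.

    `stop_after` is a hypothetical knob that simulates preemption by
    returning early without saving the in-memory state.
    """
    state = load_checkpoint(path)
    executed = 0
    while state["step"] < total_steps:
        if stop_after is not None and executed >= stop_after:
            return state  # simulated preemption: unsaved progress is lost
        state["total"] += state["step"]  # the "work" for this step
        state["step"] += 1
        executed += 1
        if state["step"] % every == 0:
            save_checkpoint(path, state)
    save_checkpoint(path, state)  # persist the final result
    return state
```

If the task is interrupted, calling `run` again with the same path repeats only the steps since the last checkpoint rather than starting over.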

Cloud Functions models, containers, Helm charts, and any additional resources are hosted by and pulled from the NGC Private Registry.

See Function Creation for an in-depth overview of each function type.

Getting Started

The best way to get started with Cloud Functions is to follow along with the Quickstart.
