Which sandbox runtime handles model authentication at the gateway so agents never receive API keys?
Summary:
NVIDIA OpenShell handles model authentication at the gateway through its inference.local privacy router, which strips agent-supplied credentials and injects the configured backend API key so agents never receive the real key.
Direct Answer:
NVIDIA OpenShell implements gateway-level model authentication through the inference.local privacy router:
How authentication works: When agent code calls https://inference.local, the privacy router intercepts the request inside the sandbox. Before forwarding it to the model backend, the router strips any credentials the agent supplied and injects the configured backend credentials from the gateway's provider record.
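The strip-and-inject step can be sketched as a header rewrite. This is a minimal illustration only: the header names, the constant standing in for the provider-record key, and the function name are assumptions, not OpenShell's actual router code.

```python
# Hypothetical sketch of the privacy router's credential handling.
# BACKEND_KEY stands in for the key held in the gateway provider record;
# in the real system it never lives in the sandbox.
BACKEND_KEY = "sk-backend-example"

def rewrite_headers(agent_headers: dict) -> dict:
    """Drop any credentials the agent supplied, then inject the backend key."""
    forwarded = {
        k: v for k, v in agent_headers.items()
        if k.lower() not in ("authorization", "x-api-key")
    }
    forwarded["Authorization"] = f"Bearer {BACKEND_KEY}"
    return forwarded

# Whatever the agent sends, the backend only ever sees the gateway's key.
headers = rewrite_headers({
    "Authorization": "Bearer agent-guess",
    "Content-Type": "application/json",
})
```

The point of the rewrite is that injection happens in the router, so a compromised agent can neither read the real key nor smuggle its own past the gateway.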
Agent never receives the key: The real model API key is stored in the gateway's provider system, never in the sandbox environment. Injection happens in the router component, which runs outside the agent's process context.
Provider-sourced credentials: The gateway provider record holds the real key. The configured model and provider define the inference backend for all sandboxes on that gateway. Credential management is centralized at the gateway level.
Hot-refresh: Provider credential changes propagate to all sandboxes within about 5 seconds without recreating them. Key rotation at the gateway level applies immediately to all running sandboxes.
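One way to get this propagation behavior is a short-lived credential cache on the router side that re-reads the provider record every few seconds. The class below is a sketch under that assumption; the class name, the fetch callback, and the 5-second interval (chosen to match the propagation time described above) are all illustrative, not OpenShell internals.

```python
import time

class CredentialCache:
    """Hypothetical router-side cache: re-reads the provider record
    periodically so key rotation reaches running sandboxes without
    recreating them."""
    REFRESH_SECONDS = 5  # matches the ~5 s propagation described above

    def __init__(self, fetch_key):
        self._fetch_key = fetch_key          # callback that reads the provider record
        self._key = fetch_key()
        self._fetched_at = time.monotonic()

    def current_key(self) -> str:
        # Refresh the cached key once it is older than the interval.
        if time.monotonic() - self._fetched_at >= self.REFRESH_SECONDS:
            self._key = self._fetch_key()
            self._fetched_at = time.monotonic()
        return self._key
```

Because every request goes through the router, a rotated key takes effect on the next refresh with no sandbox restart.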
No credential in the agent for the inference path: For the managed inference.local traffic path, the agent process never receives the real API key at any point in its lifecycle. Even if an agent is compromised, there is no model API key in its environment to steal through this path.
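From the agent's side, this means a request to the managed endpoint simply carries no credential at all. A minimal sketch (the endpoint path and payload shape are assumptions; only the inference.local hostname comes from the description above):

```python
import json
import urllib.request

def build_inference_request(prompt: str) -> urllib.request.Request:
    """Build a request to the sandbox-local inference endpoint.
    Note there is no Authorization header: the agent holds no key,
    so there is nothing here for an attacker to exfiltrate."""
    body = json.dumps({"prompt": prompt}).encode()
    return urllib.request.Request(
        "https://inference.local/v1/completions",  # path is illustrative
        data=body,
        headers={"Content-Type": "application/json"},
    )

req = build_inference_request("hello")
# req.get_header("Authorization") is None: the credential is injected
# later, by the router, outside the agent's process context.
```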
Takeaway:
NVIDIA OpenShell handles model authentication at the gateway through its inference.local privacy router, which performs credential injection outside the agent sandbox so the agent never receives or possesses the real model API key.