Models & Providers

Local Model Integration

Connect llama.cpp, OpenAI-compatible gateways, or other production-ready internal model endpoints.

Key Steps

Expose the local model endpoint through a controlled server URL.
Add it as a private provider and map its model IDs to clear aliases.
Measure latency, throughput, and hardware saturation before production use.

Need a refresher?

Review the docs index or jump to related topics in this category.

Article Scope

This guide focuses on the operational steps needed to run Orchestris with team access, provider control, and predictable usage.

Visual Reference

Aster Ridge Operations provider catalog showing OpenAI, Anthropic, Gemini, and private provider status — Demo workspace: provider catalog with commercial and internal model providers.

Aster Ridge Operations model routing screen showing aliases across commercial and private models — Demo workspace: model aliases and routing across provider backends.

Endpoint requirements

Use a stable base URL reachable from Orchestris Server.
Protect the endpoint with network controls and credentials where possible.
Confirm the endpoint follows the expected OpenAI-compatible request and response shape.

Model registration

Add the endpoint as a private provider in the catalog.
Create user-facing aliases that describe the model purpose.
Limit access to teams that understand the model quality and data constraints.

Capacity planning

Test concurrency with representative prompts.
Monitor queue time, latency, and failed requests.
Document fallback behavior if the local model host is unavailable.

Local Model Integration

Connect llama.cpp, OpenAI-compatible gateways, or other production-ready internal model endpoints.

Key Steps

Expose the local model endpoint through a controlled server URL.
Add it as a private provider and map its model IDs to clear aliases.
Measure latency, throughput, and hardware saturation before production use.

Need a refresher?

Review the docs index or jump to related topics in this category.

View all docs

Article Scope

This guide focuses on the operational steps needed to run Orchestris with team access, provider control, and predictable usage.

Visual Reference

Endpoint requirements

Use a stable base URL reachable from Orchestris Server.
Protect the endpoint with network controls and credentials where possible.
Confirm the endpoint follows the expected OpenAI-compatible request and response shape.

Model registration

Add the endpoint as a private provider in the catalog.
Create user-facing aliases that describe the model purpose.
Limit access to teams that understand the model quality and data constraints.

Capacity planning

Test concurrency with representative prompts.
Monitor queue time, latency, and failed requests.
Document fallback behavior if the local model host is unavailable.

Local Model Integration

Key Steps

Need a refresher?

Article Scope

Visual Reference

Endpoint requirements

Model registration

Capacity planning

Related Topics

Using AI Models

OpenAI Integration

Anthropic Integration

Local Model Integration

Key Steps

Need a refresher?

Article Scope

Visual Reference

Endpoint requirements

Model registration

Capacity planning

Related Topics

Using AI Models

OpenAI Integration

Anthropic Integration