Overview
GoModel exposes failover through the fallback config block, environment
variables, and the Dashboard.
When a request fails, GoModel can retry it against alternate models.
Runtime failover uses manual rules only.
Manual Mappings
config/fallback.json is a JSON object in which each primary model entry
contains an ordered array of fallback models. The order of the top-level keys
is not significant. Models can be referenced by plain names such as gpt-4o or
by provider-qualified selectors such as azure/gpt-4o.
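
As a rough illustration of the mapping shape described above, the following Go sketch decodes a mapping object and reads one fallback list in order. It assumes each top-level key maps directly to an array of model names. The model names in exampleMappings and the helper parseMappings are hypothetical and are not part of GoModel.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// exampleMappings is a hypothetical config/fallback.json body. The model
// names are illustrative only, and the key-to-array shape is an assumption.
const exampleMappings = `{
  "gpt-4o": ["azure/gpt-4o", "gpt-4o-mini"],
  "azure/gpt-4o": ["gpt-4o"]
}`

// parseMappings (hypothetical helper) decodes the mapping object into a Go
// map. Key order carries no meaning, but each slice keeps the order of the
// JSON array it was decoded from.
func parseMappings(raw string) (map[string][]string, error) {
	var m map[string][]string
	if err := json.Unmarshal([]byte(raw), &m); err != nil {
		return nil, fmt.Errorf("parse fallback mappings: %w", err)
	}
	return m, nil
}

func main() {
	m, err := parseMappings(exampleMappings)
	if err != nil {
		panic(err)
	}
	// Walk the fallback list for one primary model in its configured order.
	for i, model := range m["gpt-4o"] {
		fmt.Printf("fallback %d for gpt-4o: %s\n", i+1, model)
	}
}
```

Decoding into a map with slice values mirrors the stated semantics: the map has no inherent order, while each slice preserves the fallback order.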
For env-only deployments, provide the mappings as inline JSON in an
environment variable.
fallback.default_mode and FEATURE_FALLBACK_MODE are deprecated
compatibility inputs. They are accepted but ignored by runtime failover.
The Dashboard can generate suggested mappings from model metadata, but
suggestions must be saved as manual mappings before they affect traffic.
When It Runs
Failover is attempted only after the primary request returns:
- 5xx
- 429
- model unavailable, unsupported, or not found style errors
Failover applies to /v1/chat/completions, /v1/responses,
and /v1/messages requests, not /v1/embeddings.
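
The conditions above can be sketched as a single predicate. This is a minimal illustration in Go, not GoModel's actual implementation: the function eligibleForFailover, the failoverPaths set, and the modelError flag (standing in for "model unavailable, unsupported, or not found" style errors) are all hypothetical names.

```go
package main

import (
	"fmt"
	"net/http"
)

// failoverPaths lists the endpoints the text names as covered by failover.
// The variable itself is a hypothetical construct for this sketch.
var failoverPaths = map[string]bool{
	"/v1/chat/completions": true,
	"/v1/responses":        true,
	"/v1/messages":         true,
}

// eligibleForFailover (hypothetical) reports whether a failed primary request
// would qualify for failover. modelError is an assumed flag that the caller
// sets when the error is of the model unavailable, unsupported, or not found
// kind.
func eligibleForFailover(path string, status int, modelError bool) bool {
	if !failoverPaths[path] {
		return false // e.g. /v1/embeddings is not covered
	}
	if status >= 500 && status <= 599 {
		return true // any 5xx response
	}
	if status == http.StatusTooManyRequests {
		return true // 429
	}
	return modelError
}

func main() {
	fmt.Println(eligibleForFailover("/v1/chat/completions", 503, false))
	fmt.Println(eligibleForFailover("/v1/messages", 429, false))
	fmt.Println(eligibleForFailover("/v1/embeddings", 503, false))
	fmt.Println(eligibleForFailover("/v1/responses", 400, false))
}
```

Checking the endpoint first keeps requests to uncovered routes out of failover regardless of the error they return.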