POST /v1/select-model
Select Model
curl --request POST \
  --url https://api.llmadaptive.uk/v1/select-model \
  --header 'Content-Type: application/json' \
  --data '{
  "models": [
    {}
  ],
  "prompt": "<string>",
  "cost_bias": 0.5,
  "model_router_cache": {
    "enabled": true,
    "semantic_threshold": 123
  }
}'
{
  "selected_model": {
    "id": 123,
    "author": "<string>",
    "model_name": "<string>",
    "display_name": "<string>",
    "description": "<string>",
    "context_length": 123,
    "pricing": {},
    "providers": [
      {}
    ]
  },
  "alternatives": [
    {}
  ],
  "cache_tier": "<string>"
}
Get Adaptive’s intelligent model selection without using our inference. The design is provider-agnostic: it works with any models, any providers, and any infrastructure.

Why Use This?

Use Adaptive’s intelligence, run inference wherever you want:
  • “I have my own OpenAI/Anthropic accounts” - Get optimal model selection, pay your providers directly
  • “I run models on-premise” - Get routing decisions for your local infrastructure
  • “I have enterprise contracts” - Use your existing provider relationships with intelligent routing
  • “I need data privacy” - Keep inference local while getting smart model selection

Request

The request format is provider-agnostic: send your available models and a prompt, and get a model selection back.
models (array, required)
Array of available model specifications in provider:model_name format. Adaptive automatically queries the Model Registry to fill in pricing, capabilities, and other details for known models.
prompt (string, required)
The prompt text to analyze for optimal model selection.
cost_bias (number)
Cost optimization preference, from 0.0 (cheapest) to 1.0 (best performance). Defaults to the server configuration. Override it to prioritize cost savings or performance for this specific selection.
model_router_cache (object)
Semantic cache configuration for this request.
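
The request fields above can be assembled as in the following minimal Python sketch, which uses only the standard library. The helper name build_select_model_body and the model identifiers in the usage note are hypothetical. The sketch assumes each models entry is a provider:model_name string, as the field description states, and it checks cost_bias against the documented 0.0 to 1.0 range. The contents of model_router_cache are passed through unchecked because the valid range of semantic_threshold is not stated here.

```python
import json


def build_select_model_body(models, prompt, cost_bias=None, cache=None):
    """Assemble a JSON body for POST /v1/select-model (hypothetical helper)."""
    if not models:
        raise ValueError("models is required and must not be empty")
    for spec in models:
        # Assumption: each entry is a "provider:model_name" string.
        provider, sep, name = spec.partition(":")
        if not (provider and sep and name):
            raise ValueError(f"expected provider:model_name, got {spec!r}")
    if not prompt:
        raise ValueError("prompt is required")

    body = {"models": list(models), "prompt": prompt}
    if cost_bias is not None:
        # Documented range: 0.0 = cheapest, 1.0 = best performance.
        if not 0.0 <= cost_bias <= 1.0:
            raise ValueError("cost_bias must be between 0.0 and 1.0")
        body["cost_bias"] = cost_bias
    if cache is not None:
        # Passed through as-is, e.g. {"enabled": True}.
        body["model_router_cache"] = cache
    return json.dumps(body)
```

For example, build_select_model_body(["openai:gpt-4o-mini"], "Summarize this report", cost_bias=0.2) returns a JSON string that omits model_router_cache, so the server's cache configuration would presumably apply.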

Response

selected_model (object)
Selected model details: complete model information for the chosen model.
alternatives (array)
Alternative models (optional). Fallback model options if the primary selection is unavailable. Each alternative is a complete RegistryModel object.
cache_tier (string)
Cache hit information. Indicates whether the selection came from cache: “semantic_exact”, “semantic_similar”, or empty if not cached.
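
A minimal Python sketch of reading these response fields follows. The helper name summarize_selection and the author/model_name label format are illustrative choices, not part of the API. The sketch assumes alternatives carry the same author and model_name fields shown for selected_model in the example response, and it treats a missing or empty cache_tier as "not cached".

```python
def summarize_selection(response):
    """Return (primary, fallbacks, cache_status) from a parsed response dict."""
    selected = response["selected_model"]
    primary = f'{selected["author"]}/{selected["model_name"]}'

    # alternatives is optional; treat a missing or null value as no fallbacks.
    fallbacks = [
        f'{alt["author"]}/{alt["model_name"]}'
        for alt in response.get("alternatives") or []
    ]

    # cache_tier is "semantic_exact", "semantic_similar", or empty.
    tier = response.get("cache_tier") or ""
    cache_status = {
        "semantic_exact": "exact cache hit",
        "semantic_similar": "similar-prompt cache hit",
        "": "not cached",
    }.get(tier, f"unknown tier: {tier}")
    return primary, fallbacks, cache_status
```

A caller running its own inference could try the primary model first and walk the fallbacks list in order if that model is unavailable.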

Authentication

Authentication is the same as for chat completions. Pass your API key as a bearer token:
-H "Authorization: Bearer apk_123456"
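
The same header can be attached from Python, as in this minimal sketch using the standard library's urllib.request. The helper name make_select_model_request is hypothetical, and the key apk_123456 is the placeholder from the line above. The function only builds the request object and does not send it.

```python
import json
import urllib.request

API_URL = "https://api.llmadaptive.uk/v1/select-model"


def make_select_model_request(api_key, body):
    """Build (but do not send) an authenticated POST request."""
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # Bearer token, as in the -H example above.
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )
```

Passing the returned object to urllib.request.urlopen would perform the actual call.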

No Inference = Fast & Cheap

This endpoint:
  • Fast - No LLM inference, just routing logic
  • Cheap - Doesn’t count against token usage
  • Accurate - Uses the exact same selection logic as real completions
Perfect for testing, debugging, and cost planning without burning through your budget.