Model Transparency
Every model ARIA routes to is declared here with context window, training cutoff, intended use cases, known limitations, and data-handling boundaries. Tenants can pin or forbid models per policy.
Claude Haiku 4.5
fastanthropic · ctx 200,000 · cutoff 2025-10
Use cases: fast-classification, short-replies, free-tier-default
Limitations: less-capable-on-multi-step-reasoning
Data boundaries: No customer data is used for model training. All inference is stateless.
Llama 3.1 70B (Groq)
fastgroq · ctx 131,072 · cutoff 2024-03
Use cases: low-latency-inference, open-weights-transparency
Limitations: reduced-instruction-following-vs-claude
Data boundaries: Inference-only; Groq does not retain prompts. Open-weights base model.
Claude Opus 4.7
premiumanthropic · ctx 200,000 · cutoff 2026-01
Use cases: complex-reasoning, peer-review, ethics-gate
Limitations: higher-latency; higher-cost
Data boundaries: No customer data is used for model training. All inference is stateless.
Claude Sonnet 4.6
standardanthropic · ctx 200,000 · cutoff 2026-01
Use cases: agent-dispatch, content-generation, summarization
Limitations: occasional-factual-errors-on-long-tail-topics
Data boundaries: No customer data is used for model training. All inference is stateless.
GPT-4o
standardopenai · ctx 128,000 · cutoff 2024-04
Use cases: bring-your-own-key-alt-provider
Limitations: requires-byok-subject-to-openai-terms
Data boundaries: Usage subject to OpenAI data-processing terms; configure opt-out in BYOK settings.