Pricing#
Billing Model#
AIone uses pay-as-you-go billing based on actual token consumption.Input Tokens: Content sent to the model (system prompt + user message + conversation history)
Output Tokens: Content generated by the model
Billing Unit: Per million tokens by default
Currency: All prices are displayed in USD in the console
Commonly Available Models#
Models evolve rapidly, so this document does not label any specific model as "the latest." The table below lists commonly available model families as a reference. For current availability, pricing, and any applicable discounts, always refer to the console.| Family | Commonly Available Models |
|---|
| Claude | claude-opus-4-1-20250805, claude-opus-4-20250514, claude-sonnet-4-20250514, claude-3-7-sonnet-20250219, claude-3-5-haiku-20241022 |
| GPT | gpt-5.4, gpt-5.2, gpt-5, gpt-5-mini, gpt-5-nano |
| Gemini | gemini-3.1-pro, gemini-3.1-flash-lite-preview, gemini-2.5-pro, gemini-2.5-flash, gemini-2.5-flash-lite |
The model list in this document is not a fixed price sheet. Models and pricing may change based on upstream capabilities, platform strategy, and enterprise discounts.
Enterprise Discounts#
Enterprise customers receive a unified discount rate. Discounted prices are visible on the My Pricing page in the console.For custom quotes, payment terms, or higher quotas, please contact our sales team through a support ticket.Cost Optimization Tips#
For high-frequency, standardized tasks, prefer gpt-5-nano, gpt-5-mini, gemini-2.5-flash-lite, or gemini-2.5-flash
For code generation and complex analysis where quality matters, consider claude-sonnet-4-20250514, claude-3-7-sonnet-20250219, or gpt-5.2
For deep reasoning or mission-critical workloads, consider claude-opus-4-1-20250805, claude-opus-4-20250514, gpt-5.4, or gemini-3.1-pro
Combine Prompt Caching with appropriate max_tokens settings to further reduce per-task costs
Viewing Usage#
Modified at 2026-04-04 16:02:45