AIone API (English)

    04 - Pricing

    Pricing

    Billing Model

    AIone uses pay-as-you-go billing based on actual token consumption.
    Input Tokens: Content sent to the model (system prompt + user message + conversation history)
    Output Tokens: Content generated by the model
    Billing Unit: Per million tokens by default
    Currency: All prices are displayed in USD in the console
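As a rough illustration of per-million-token billing, the sketch below estimates the cost of a single request. It assumes input and output tokens are priced separately, which this page does not state explicitly. The function name `estimate_cost_usd` is hypothetical, and the prices are placeholders rather than AIone rates. Take real rates from the My Pricing page in the console.

```python
from decimal import Decimal

# Billing unit described above: prices are quoted per million tokens.
TOKENS_PER_UNIT = Decimal(1_000_000)


def estimate_cost_usd(input_tokens, output_tokens,
                      input_price_per_m, output_price_per_m):
    """Estimate the USD cost of one request.

    Prices are passed as strings (e.g. "2.00") so Decimal keeps them exact.
    Separate input/output prices are an assumption of this sketch.
    """
    input_cost = Decimal(input_tokens) / TOKENS_PER_UNIT * Decimal(str(input_price_per_m))
    output_cost = Decimal(output_tokens) / TOKENS_PER_UNIT * Decimal(str(output_price_per_m))
    return input_cost + output_cost


# Placeholder prices, NOT actual AIone pricing.
cost = estimate_cost_usd(
    input_tokens=12_000,   # system prompt + user message + conversation history
    output_tokens=800,     # content generated by the model
    input_price_per_m="2.00",
    output_price_per_m="8.00",
)
print(f"Estimated cost: ${cost:.6f}")
```

Because conversation history counts as input, the input side of this estimate grows with every turn of a long conversation.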

    Commonly Available Models

    Models evolve rapidly, so this document does not label any specific model as "the latest." The table below lists commonly available model families as a reference. For current availability, pricing, and any applicable discounts, always refer to the console.
    Family    Commonly Available Models
    Claude    claude-opus-4-1-20250805, claude-opus-4-20250514, claude-sonnet-4-20250514, claude-3-7-sonnet-20250219, claude-3-5-haiku-20241022
    GPT       gpt-5.4, gpt-5.2, gpt-5, gpt-5-mini, gpt-5-nano
    Gemini    gemini-3.1-pro, gemini-3.1-flash-lite-preview, gemini-2.5-pro, gemini-2.5-flash, gemini-2.5-flash-lite
    For real-time pricing, visit the My Pricing page.
    The model list in this document is not a fixed price sheet. Models and pricing may change based on upstream capabilities, platform strategy, and enterprise discounts.

    Enterprise Discounts

    Enterprise customers receive a unified discount rate. Discounted prices are visible on the My Pricing page in the console.
    For custom quotes, payment terms, or higher quotas, please contact our sales team through a support ticket.

    Cost Optimization Tips

    For high-frequency, standardized tasks, prefer gpt-5-nano, gpt-5-mini, gemini-2.5-flash-lite, or gemini-2.5-flash
    For code generation and complex analysis where quality matters, consider claude-sonnet-4-20250514, claude-3-7-sonnet-20250219, or gpt-5.2
    For deep reasoning or mission-critical workloads, consider claude-opus-4-1-20250805, claude-opus-4-20250514, gpt-5.4, or gemini-3.1-pro
    Combine Prompt Caching with appropriate max_tokens settings to further reduce per-task costs
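One way to apply the tips above is to route each task to a workload tier and pick the first model from that tier that the account can use. The sketch below is only an illustration: the tier labels and the `pick_model` helper are hypothetical, and the model names are copied from the tips. Check availability against the console.

```python
# Tier labels are hypothetical; model names come from the tips above,
# in the order they are listed there.
MODEL_TIERS = {
    "high_frequency": [
        "gpt-5-nano", "gpt-5-mini",
        "gemini-2.5-flash-lite", "gemini-2.5-flash",
    ],
    "quality": [
        "claude-sonnet-4-20250514", "claude-3-7-sonnet-20250219", "gpt-5.2",
    ],
    "deep_reasoning": [
        "claude-opus-4-1-20250805", "claude-opus-4-20250514",
        "gpt-5.4", "gemini-3.1-pro",
    ],
}


def pick_model(tier, available=None):
    """Return the first model in the tier, optionally limited to `available`.

    `available` is a set of model names the caller knows to be usable;
    when omitted, the first model listed for the tier is returned.
    """
    candidates = MODEL_TIERS[tier]
    if available is None:
        return candidates[0]
    for model in candidates:
        if model in available:
            return model
    raise LookupError(f"no available model for tier {tier!r}")


print(pick_model("high_frequency"))
print(pick_model("quality", available={"gpt-5.2"}))
```

Keeping the mapping in one place makes it easy to update when the console shows that availability or pricing has changed.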

    Viewing Usage

    Console > Usage Statistics: Detailed breakdowns by model, key, and date
    Console > Dashboard: Real-time KPI overview
    Modified at 2026-04-04 16:02:45