ShadAI Framework
  1. Pricing
ShadAI Framework
  • Index
  • README
  • Pricing
    • Pricing Plan
  • advanced
    • Best Practices
    • Tool Orchestration
    • Performance Optimization
    • Custom Tools
  • examples
    • Advanced Patterns
    • Custom Agent Examples
    • Market Research Examples
    • Multi-Document Analysis
    • Basic Query Examples
  • api-reference
    • Exceptions Reference
    • Engine Tool API
    • Shadai Client API Reference
    • Query Tool API
    • Summarize Tool API
    • Agent Tool API
    • Web Search Tool API
  • use-cases
    • Knowledge Synthesis
    • Research Assistant
    • Custom Workflows
    • Document Q&A
  • core-concepts
    • Architecture
    • Intelligent Agent
    • RAG System
    • Tools Overview
  • guides
    • Memory & Context
    • Streaming Responses
    • File Ingestion
    • Error Handling
    • Session Management
  • getting-started
    • Authentication
    • Your First Query
    • Quick Start
    • Installation
  1. Pricing

Pricing Plan

šŸ’³ Monthly Suscription Pricing#

Pay $500 USD / 500K credits — build and scale fast

šŸ“Š Pricing Components at a Glance#

ComponentWhat You Pay ForRate / UnitBilling Frequency
InfrastructureCompute time while your job is running on our clustersper secondInclude in your suscription
LLM TokensCombined input + output tokens processed by the modelprovider list price Ɨ tokensOn completion of each job
StoragePersisted docs & vector embeddings leveraged for retrievalprovider list price x sizeIt is usually monthly
ShadAI APICalls that trigger agent actions500K credits / monthInclude in your suscription
Illustration showing metered compute, token usage, storage, and API requests

šŸ” Detailed Breakdown#

1. Infrastructure#

Every time an agent runs, it spins up secure, auto‑scaling compute.

2. LLM Tokens#

Agents call best‑in‑class LLMs. You’re charged for both directions:
Input tokens (your prompt + system context)
Output tokens (model’s response)
We simply pass through each provider’s published token price so you can choose the model that fits your budget/performance profile.

3. Storage#

Documents and vectors ingested during sessions fuel semantic search. You can configure your own Postgres database.

4. ShadAI API Usage#

Using the ShadAI Client directly or via agents. Each call (sync or async) counts toward your quota.
$500 USD → 500K credits  monthly
We do not charge anything extra for infrastructure. We charge exactly the value of the suscription. In the case of storage and tokens you should manage the cost with the provider.

Recharge in USD credits#

In billing page you can recharge credits to use ShadAI. Only you need to press ADD CREDITS button and follow next steps. The minimum amount allowed is 500 USD.
You can see that your available credits increase with new recharge of credits and decrease with each purchase in the subscription marketplace.
image.png

šŸ’” Tips to Optimize Cost#

1.
Choose the right model — use smaller/faster models for high‑volume tasks.
2.
Keep sessions alive when possible.
3.
Don't ingest multiple times same data to reduce storage.
Need more clarity? Contact support and we’ll walk you through a sample bill.

šŸ’°Ā LLM Token Pricing Cheat‑Sheet (MayĀ 2025)#

All prices are USD per token. Multiply byĀ 1 000 to estimate cost per K. Always verify with vendor docs before production use.

šŸ”µ GoogleĀ Ā·Ā Gemini
ModelInĀ ($)OutĀ ($)
GEMINI_2_0_FLASH0.000000700.00000040
GEMINI_2_0_FLASH_LITE0.000000080.00000008
GEMINI_1_5_FLASH0.000000150.00000060
GEMINI_1_5_PRO0.000002500.00000100
GEMINI_1_5_FLASH_8B0.000000080.00000030
🧠 Anthropic · Claude 3
ModelInĀ ($)OutĀ ($)
CLAUDE_3_7_SONNET0.000003000.00001500
CLAUDE_3_5_SONNET_2024_10_220.000003000.00001500
CLAUDE_3_5_SONNET0.000003000.00001500
CLAUDE_3_5_HAIKU0.000001000.00000400
CLAUDE_3_OPUS0.000015000.00007500
CLAUDE_3_SONNET0.000003000.00001500
CLAUDE_3_HAIKU0.000000300.00000400
🧮 DeepSeek · R series
ModelInĀ ($)OutĀ ($)
DEEPSEEK_R1_V10.000001350.00000540
šŸ¦™ MetaĀ Ā·Ā Llama
ModelInĀ ($)OutĀ ($)
LLAMA_4_MAVERICK_17B_INSTRUCT0.000000240.00000097
LLAMA_4_SCOUT_17B_INSTRUCT0.000000170.00000066
LLAMA_3_3_70B_INSTRUCT0.000000700.00000072
LLAMA_3_2_1B_INSTRUCT0.000000100.00000010
LLAMA_3_1_70B_INSTRUCT0.000000700.00000072
LLAMA_3_1_8B_INSTRUCT0.000000200.00000022
šŸŒ€ CohereĀ Ā·Ā Command
ModelInĀ ($)OutĀ ($)
COHERE_COMMAND_PLUS0.000003000.00001500
COHERE_COMMAND_R0.000000500.00000150
COHERE_COMMAND_LIGHT0.000000300.00000060
ā˜ļø AmazonĀ Ā·Ā Nova
ModelInĀ ($)OutĀ ($)
AMAZON_NOVA_PRO0.000000800.00000320
AMAZON_NOVA_LITE0.000000060.00000024
AMAZON_NOVA_MICRO0.000000040.00000014

šŸ’”Ā Tips#

Use Flash Lite / Haiku for prototypes, then scale up as accuracy demands.
Double-check whether prices include input and output tokens separately (some dashboards only report combined cost).
Remember storage & inference infra are billed separately.

How to configure models#

Modified atĀ 2025-09-30 05:04:50
Previous
README
Next
Best Practices
Built with