Flat plans for predictable, production-ready AI
Choose the option that best fits your long-term needs. Both plans allow you to use any available model, with token-based billing under the hood and a simple, fixed monthly price on your invoice.
Core Plan
Ideal for getting started with Regolo and running consistent AI workloads without upfront commitments.
Includes a 70% discount for the first 3 months.
Start Free 30‑Day Trial
No credit card needed. Get the full service for 30 days, risk‑free.
Access to core AI models
20 million tokens per day
Email support included
Boost Plan
Unleash your AI's full potential with a more powerful plan designed for higher-volume, production workloads.
Flat monthly price for greater capacity.
Start Free 30‑Day Trial
No credit card needed. Get the full service for 30 days, risk‑free.
Access to core AI models
50 million tokens per day
Priority throughput and support
Two pricing models, one simple token-based foundation
Every Regolo plan uses the same foundation: tokens. You can use any supported model and track your consumption in real time from the Regolo dashboard. Choose the pricing model that best matches how you work.
For developer teams who need flexibility
Access your favourite AI models every day with fully elastic, token-based pricing. Only pay for the tokens you consume — no upfront commitments or fixed capacity.
- Perfect for teams who are prototyping, testing, or running variable workloads.
- Take control of your usage via the Regolo dashboard, with real-time tracking and cost visibility.
- Scale up and down freely, paying strictly for the tokens actually consumed.
For companies that need predictable pricing
Secure a fixed monthly price with guaranteed daily capacity. Ideal for teams running production workloads that must stay within a defined budget.
- Core Plan — ideal for consistent usage with up to 20 million tokens per day.
- Boost Plan — designed for higher-volume workloads with up to 50 million tokens per day.
- Enjoy transparent, predictable invoices with no overage surprises when operating within plan limits.
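As a rough illustration of how the two daily limits above map to a plan choice, here is a minimal sketch. The function name is hypothetical, and the code assumes that input and output tokens both count toward the daily limit, which this page does not state. It also does not model what happens beyond the Boost limit.

```python
# Daily token limits taken from the Core and Boost plan descriptions.
CORE_DAILY_LIMIT = 20_000_000
BOOST_DAILY_LIMIT = 50_000_000


def smallest_fitting_plan(expected_daily_tokens):
    """Return the smallest flat plan whose daily limit covers the workload.

    Returns None when the workload exceeds both limits.
    Assumption: all tokens (input and output) count toward the limit.
    """
    if expected_daily_tokens <= CORE_DAILY_LIMIT:
        return "Core"
    if expected_daily_tokens <= BOOST_DAILY_LIMIT:
        return "Boost"
    return None


print(smallest_fitting_plan(12_000_000))
print(smallest_fitting_plan(35_000_000))
print(smallest_fitting_plan(80_000_000))
```

A workload that returns None here would need to be discussed with Regolo or run on pay-as-you-go pricing.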
When you subscribe to Regolo — whether you choose pay-as-you-go or a flat plan — your first 30 days of usage are completely free of charge.
How token-based billing works
Under both pricing models, Regolo bills usage in tokens. Each model in our library has its own token price, so you always know how much you spend on prompts and on generated responses.
1. Choose your model
Select any LLM from the Regolo model library. You can freely mix and match models based on your use case and performance requirements.
2. Consume tokens
Each request consumes tokens depending on prompt size and response length. The cost per token depends on the specific model you are using.
3. Track in real time
Use the Regolo dashboard to monitor token usage, costs and limits in real time, keeping your team fully in control of consumption.
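The three steps above boil down to simple arithmetic: input tokens times the model's input price, plus output tokens times its output price. The sketch below illustrates this with two prices copied from the Large Language Models table on this page. The function and dictionary names are hypothetical, and the token counts are made-up example values, not measurements.

```python
from decimal import Decimal

# Per-token prices in EUR as (input, output), copied from the pricing table.
PRICES = {
    "Llama-3.3-70B-Instruct": (Decimal("0.0000006"), Decimal("0.0000027")),
    "Qwen3-8B": (Decimal("0.00000007"), Decimal("0.00000035")),
}


def request_cost(model, input_tokens, output_tokens):
    """Return the EUR cost of one request for the given model."""
    input_price, output_price = PRICES[model]
    return input_tokens * input_price + output_tokens * output_price


# Example: a 1,200-token prompt that produces a 300-token response.
for model in PRICES:
    print(model, request_cost(model, 1_200, 300))
```

Decimal is used instead of float so that very small per-token prices do not pick up binary rounding noise when summed over many requests.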
Model library pricing
Here you can explore the full list of supported models, together with their token pricing and limits. Use this table to plan your workloads and choose the most cost‑efficient models for each task.
Large Language Models
Text and multimodal chat models priced per token (input and output).
| Model name | Input cost per token | Output cost per token |
|---|---|---|
| deepseek-r1-70b | €0.0000006 | €0.0000027 |
| gemma-3-27b-it | €0.00000095 | €0.0000055 |
| gpt-oss-120b | €0.000001 | €0.0000042 |
| iQuest-coder-v1-40b | €0.0000008 | €0.0000028 |
| Llama-3.1-8B-Instruct | €0.00000005 | €0.00000025 |
| Llama-3.3-70B-Instruct | €0.0000006 | €0.0000027 |
| maestrale-chat-v0.4-beta | €0.00000005 | €0.00000025 |
| mistral-small3.2 | €0.0000005 | €0.0000022 |
| qwen3-30b | €0.0000005 | €0.0000018 |
| Qwen3-8B | €0.00000007 | €0.00000035 |
| qwen3-coder-30b | €0.0000005 | €0.000002 |
| qwen3-vl-32b | €0.0000005 | €0.0000025 |
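Per-token prices this small are easier to compare when expressed per million tokens. The following sketch converts a few rows of the table above. The helper name is hypothetical, and the conversion is plain multiplication rather than an official price list.

```python
from decimal import Decimal

# (input, output) prices in EUR per token, copied from the table above.
PER_TOKEN_EUR = {
    "Llama-3.1-8B-Instruct": ("0.00000005", "0.00000025"),
    "gemma-3-27b-it": ("0.00000095", "0.0000055"),
    "gpt-oss-120b": ("0.000001", "0.0000042"),
}


def per_million(price_per_token):
    """Convert a per-token EUR price (given as a string) to EUR per 1M tokens."""
    return Decimal(price_per_token) * 1_000_000


for model, (inp, out) in PER_TOKEN_EUR.items():
    print(f"{model}: input €{per_million(inp):.2f}, "
          f"output €{per_million(out):.2f} per 1M tokens")
```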
Embedding
Semantic embedding models priced per request.
| Model name | Cost per request |
|---|---|
| gte-Qwen2 | €0.001 |
| Qwen3-Embedding-8B | €0.001 |
Speech-To-Text
Audio transcription models priced per second of audio.
| Model name | Cost per second |
|---|---|
| faster-whisper-large-v3 | €0.00015 |
Image Generation
Image generation models priced per pixel.
| Model name | Cost per pixel |
|---|---|
| Qwen-Image | €0.0000000005 |
OCR
Optical character recognition models priced per request.
| Model name | Cost per request |
|---|---|
| deepseek-ocr | €0.02 |
Rerank
Reranking models priced per query.
| Model name | Cost per query |
|---|---|
| Qwen3-Reranker-4B | €0.01 |
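For the models that are not priced per token, the same kind of estimate works with the unit shown in each table. The sketch below covers the per-second and per-pixel cases. The function names are hypothetical, and the code assumes billing on the exact number of seconds or pixels with no rounding or minimum charge, which this page does not specify.

```python
from decimal import Decimal

# Unit prices in EUR, copied from the tables above.
AUDIO_EUR_PER_SECOND = Decimal("0.00015")      # faster-whisper-large-v3
IMAGE_EUR_PER_PIXEL = Decimal("0.0000000005")  # Qwen-Image


def transcription_cost(audio_seconds):
    """EUR cost of transcribing audio of the given length in seconds."""
    return audio_seconds * AUDIO_EUR_PER_SECOND


def image_cost(width, height):
    """EUR cost of generating one image of width x height pixels."""
    return width * height * IMAGE_EUR_PER_PIXEL


# Example inputs: one hour of audio and a single 1024 x 1024 image.
print(transcription_cost(3600))
print(image_cost(1024, 1024))
```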