Flat plans for predictable, production-ready AI

Choose the option that best fits your long-term needs. Both plans allow you to use any available model, with token-based billing under the hood and a simple, fixed monthly price on your invoice.

30 days free
Test Regolo with full access for the first 30 days, at no charge.
Best for steady workloads

Core Plan

Ideal for getting started with Regolo and running consistent AI workloads without upfront commitments.

€39
per month

With a 70% discount on the first 3 months.

Start Free 30‑Day Trial

No credit card needed. Get the full service for 30 days, risk‑free.


Access to core AI models

20 million tokens per day

Email support included

Most popular

Boost Plan

Unleash your AI's full potential with a more powerful plan designed for higher-volume, production workloads.

€89
per month

Flat monthly price for greater capacity.

Start Free 30‑Day Trial

No credit card needed. Get the full service for 30 days, risk‑free.


Access to core AI models

50 million tokens per day

Priority throughput and support

30‑day free trial — no upfront charge

Two pricing models, one token-based foundation

Every Regolo plan uses the same foundation: tokens. You can use any supported model and track your consumption in real time from the Regolo dashboard. Choose the pricing model that best matches how you work.

Pay-as-you-go

For developer teams who need flexibility

Access your favourite AI models every day with fully elastic, token-based pricing. Only pay for the tokens you consume — no upfront commitments or fixed capacity.

  • Perfect for teams who are prototyping, testing, or running variable workloads.
  • Take control of your usage via the Regolo dashboard, with real-time tracking and cost visibility.
  • Scale up and down freely, paying strictly for the tokens actually consumed.
Under pay-as-you-go, the price depends on which AI model you call and how many tokens you use. See the table below for detailed token costs.

Flat plans

For companies that need predictable pricing

Secure a fixed monthly price with guaranteed daily capacity. Ideal for teams running production workloads that must stay within a defined budget.

  • Core Plan — ideal for consistent usage with up to 20 million tokens per day.
  • Boost Plan — designed for higher-volume workloads with up to 50 million tokens per day.
  • Enjoy transparent, predictable invoices with no overage surprises when operating within plan limits.
You still benefit from token-based pricing internally, but your external cost remains a simple, flat monthly fee.

When you subscribe to Regolo — whether you choose pay-as-you-go or a flat plan — your first 30 days of usage are completely free of charge.
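
To weigh the two pricing models above against each other for a given workload, the comparison can be sketched roughly as follows. This is only an illustration under stated assumptions: a steady daily volume, a 30-day month, and a daily plan allowance that counts input plus output tokens (this page does not say how the allowance is counted). It ignores the free trial and the Core plan's introductory discount. The function names are hypothetical, and the per-token prices in the usage lines are the ones listed for gpt-oss-120b in the pricing table further down this page.

```python
from decimal import Decimal

# Flat plans as described above: (monthly price in euros, tokens per day).
FLAT_PLANS = {
    "Core": (Decimal("39"), 20_000_000),
    "Boost": (Decimal("89"), 50_000_000),
}


def monthly_payg_cost(daily_input_tokens, daily_output_tokens,
                      input_price, output_price, days=30):
    """Pay-as-you-go estimate for a steady workload (assumes a 30-day month)."""
    daily = daily_input_tokens * input_price + daily_output_tokens * output_price
    return daily * days


def cheapest_option(daily_input_tokens, daily_output_tokens,
                    input_price, output_price):
    """Return (name, monthly cost) of the cheapest option that fits the workload.

    Assumption: a plan's daily allowance counts input plus output tokens.
    """
    options = {
        "Pay-as-you-go": monthly_payg_cost(
            daily_input_tokens, daily_output_tokens, input_price, output_price
        )
    }
    total_daily = daily_input_tokens + daily_output_tokens
    for name, (price, daily_cap) in FLAT_PLANS.items():
        if total_daily <= daily_cap:  # skip plans the workload would not fit in
            options[name] = price
    best = min(options, key=options.get)
    return best, options[best]


# gpt-oss-120b per-token prices (input, output) from the pricing table.
IN_PRICE, OUT_PRICE = Decimal("0.000001"), Decimal("0.0000042")

# Hypothetical workloads: a small prototype and a steady production service.
print(cheapest_option(100_000, 20_000, IN_PRICE, OUT_PRICE))
print(cheapest_option(4_000_000, 1_000_000, IN_PRICE, OUT_PRICE))
```

Under these assumptions the small workload comes out cheaper on pay-as-you-go (about €5.52 per month), while the larger one fits within the Core plan's daily allowance and costs €39 instead of roughly €246.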

How token-based billing works

Under both pricing models, Regolo bills usage in tokens. Each model in our library has its own token price, so you always know how much you spend on prompts and on generated responses.

1. Choose your model

Select any LLM from the Regolo model library. You can freely mix and match models based on your use case and performance requirements.

2. Consume tokens

Each request consumes tokens depending on prompt size and response length. The cost per token depends on the specific model you are using.

3. Track in real time

Use the Regolo dashboard to monitor token usage, costs and limits in real time, keeping your team fully in control of consumption.
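
As a rough illustration of step 2, the cost of a single request is the input tokens times the input price plus the output tokens times the output price. The sketch below is not Regolo's billing code. The function name and the token counts are hypothetical, and the prices are those listed for Llama-3.3-70B-Instruct in the pricing table on this page. `Decimal` is used so that very small per-token prices do not pick up floating-point rounding noise.

```python
from decimal import Decimal


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: Decimal, output_price: Decimal) -> Decimal:
    """Cost in euros of one request, given per-token prices."""
    return input_tokens * input_price + output_tokens * output_price


# Llama-3.3-70B-Instruct prices as listed in the pricing table.
LLAMA_70B_INPUT = Decimal("0.0000006")
LLAMA_70B_OUTPUT = Decimal("0.0000027")

# Hypothetical request: 1,200 prompt tokens and 300 generated tokens.
cost = request_cost(1200, 300, LLAMA_70B_INPUT, LLAMA_70B_OUTPUT)
print(f"€{cost:.5f}")  # 1200 * 0.0000006 + 300 * 0.0000027 = 0.00153
```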

Below you’ll find a detailed table with all available models and their respective pricing per token. Use it to estimate your workloads and compare options across vendors and capabilities.

Models library pricing

Here you can explore the full list of supported models, together with their token pricing and limits. Use this table to plan your workloads and choose the most cost‑efficient models for each task.

Large Language Models

Text and multimodal chat models priced per token (input and output).

Model name | Input cost per token | Output cost per token
deepseek-r1-70b | €0.0000006 | €0.0000027
gemma-3-27b-it | €0.00000095 | €0.0000055
gpt-oss-120b | €0.000001 | €0.0000042
iQuest-coder-v1-40b | €0.0000008 | €0.0000028
Llama-3.1-8B-Instruct | €0.00000005 | €0.00000025
Llama-3.3-70B-Instruct | €0.0000006 | €0.0000027
maestrale-chat-v0.4-beta | €0.00000005 | €0.00000025
mistral-small3.2 | €0.0000005 | €0.0000022
qwen3-30b | €0.0000005 | €0.0000018
Qwen3-8B | €0.00000007 | €0.00000035
qwen3-coder-30b | €0.0000005 | €0.000002
qwen3-vl-32b | €0.0000005 | €0.0000025
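
Per-token prices with many leading zeros are hard to compare at a glance, so it can help to convert them to a price per one million tokens. The sketch below does this for three of the models listed above. The helper name `per_million` is hypothetical, and the prices are copied from the table.

```python
from decimal import Decimal

# Per-token prices (input, output) in euros, copied from the table above.
# Only three of the listed models are included to keep the sketch short.
PRICES = {
    "Llama-3.1-8B-Instruct": (Decimal("0.00000005"), Decimal("0.00000025")),
    "qwen3-30b": (Decimal("0.0000005"), Decimal("0.0000018")),
    "gpt-oss-120b": (Decimal("0.000001"), Decimal("0.0000042")),
}

MILLION = 1_000_000


def per_million(price_per_token: Decimal) -> Decimal:
    """Convert a per-token price into a price per one million tokens."""
    return price_per_token * MILLION


for model, (inp, out) in PRICES.items():
    print(f"{model}: €{per_million(inp):.2f} in / "
          f"€{per_million(out):.2f} out per 1M tokens")
```

For these three models the conversion gives €0.05 / €0.25, €0.50 / €1.80 and €1.00 / €4.20 per million input / output tokens respectively.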

Embedding

Semantic embedding models priced per request.

Model name | Cost per request
gte-Qwen2 | €0.001
Qwen3-Embedding-8B | €0.001

Speech-To-Text

Audio transcription models priced per second of audio.

Model name | Cost per second
faster-whisper-large-v3 | €0.00015

Image Generation

Image generation models priced per pixel.

Model name | Cost per pixel
Qwen-Image | €0.0000000005

OCR

Optical character recognition models priced per request.

Model name | Cost per request
deepseek-ocr | €0.02

Rerank

Reranking models priced per query.

Model name | Cost per query
Qwen3-Reranker-4B | €0.01
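
The non-token models above are billed in other units (seconds, pixels, requests, queries), but the estimate works the same way: units consumed times the unit price. The sketch below is a minimal illustration with hypothetical function names. It assumes that the billed pixel count for an image is simply width times height, which this page does not state explicitly.

```python
from decimal import Decimal

# Unit prices in euros, copied from the tables above.
WHISPER_PER_SECOND = Decimal("0.00015")         # faster-whisper-large-v3
QWEN_IMAGE_PER_PIXEL = Decimal("0.0000000005")  # Qwen-Image
OCR_PER_REQUEST = Decimal("0.02")               # deepseek-ocr
RERANK_PER_QUERY = Decimal("0.01")              # Qwen3-Reranker-4B


def transcription_cost(audio_seconds: int) -> Decimal:
    """Speech-to-text cost for a given audio duration in seconds."""
    return audio_seconds * WHISPER_PER_SECOND


def image_cost(width_px: int, height_px: int) -> Decimal:
    """Image generation cost.

    Assumption: billed pixels = width x height of the generated image.
    """
    return width_px * height_px * QWEN_IMAGE_PER_PIXEL


def per_unit_cost(units: int, unit_price: Decimal) -> Decimal:
    """Generic helper for per-request and per-query pricing."""
    return units * unit_price


print(f"€{transcription_cost(3600):.2f}")                # one hour of audio
print(f"€{image_cost(1024, 1024):.6f}")                  # one 1024 x 1024 image
print(f"€{per_unit_cost(500, OCR_PER_REQUEST):.2f}")     # 500 OCR requests
print(f"€{per_unit_cost(1000, RERANK_PER_QUERY):.2f}")   # 1,000 rerank queries
```

With the listed prices, an hour of audio comes to €0.54, 500 OCR requests to €10.00, and 1,000 rerank queries to €10.00. A single 1024 x 1024 image costs about €0.0005 under the width-times-height assumption.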