30 days free • Test Regolo with full access for the first 30 days, no charge.

Choose the best option that fits your needs
Zero Data Retention, Fast and Secure AI

Choose a simple subscription or go fully flexible with our pay-as-you-go plan.
Access to all features. Scale when you need. Pay only for what you use.

Pay-as-you-Go

Perfect for teams prototyping, testing, or running variable workloads. No commitment – scale up any time.

€0 / month

Pay only for tokens you use

Start with 30 Days Free + UNLIMITED tokens

No credit card needed. Get the full service for 30 days, risk‑free.

Complete access to all Core Models

Enterprise

Tailored for enterprises that need high‑volume API calls on any AI model — custom pricing built just for you.

Get a quote

Discounts on high volume

Complete access to all Core Models

Custom SLA for enterprises

Regolo Core Models

Best-in-class model performance, effortless autoscaling, and blazing fast cold starts mean you get the most out of each GPU, saving money along the way.

Models library pricing

Here you can explore the full list of supported models, together with their token pricing* and limits. Use this table to plan your workloads and choose the most cost‑efficient models for each task.

Large Language Models

Text and multimodal chat models priced per token (input and output).

| Model | Pay-as-you-go: input cost per token | Pay-as-you-go: output cost per token | Subscription: Core / Boost |
|---|---|---|---|
| gpt-oss-120b | €0.000001 | €0.0000042 | Included |
| gpt-oss-20b | €0.0000001 | €0.00000042 | Included |
| Llama-3.1-8B-Instruct | €0.00000005 | €0.00000025 | Included |
| Llama-3.3-70B-Instruct | €0.0000006 | €0.0000027 | Included |
| mistral-small3.2 | €0.0000005 | €0.0000022 | Included |
| qwen3-8b | €0.00000007 | €0.00000035 | Included |
| qwen3-coder-next | €0.0000005 | €0.000002 | Included |
| qwen3-vl-32b | €0.0000005 | €0.0000025 | Included |
| qwen3.5-122b | €0.000001 | €0.0000042 | Included |

Enterprise pricing may apply for high-scale or custom deployments.

Talk with our engineers to tailor an offer to your needs.

*Prices exclude VAT, which is applied only in Italy.
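
To plan a workload against the per-token prices listed above, a rough estimate can be sketched in a few lines of Python. This is an illustration only, not an official Regolo tool: the `PRICES_EUR` dictionary and the `estimate_cost` helper are hypothetical names, only a few models are copied in, and the result excludes VAT and any enterprise discount.

```python
from decimal import Decimal

# Per-token prices in EUR as (input, output), copied from the pricing list above.
# VAT is excluded. Only a subset of models is shown; add others as needed.
PRICES_EUR = {
    "gpt-oss-120b": (Decimal("0.000001"), Decimal("0.0000042")),
    "gpt-oss-20b": (Decimal("0.0000001"), Decimal("0.00000042")),
    "Llama-3.1-8B-Instruct": (Decimal("0.00000005"), Decimal("0.00000025")),
    "Llama-3.3-70B-Instruct": (Decimal("0.0000006"), Decimal("0.0000027")),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated pay-as-you-go cost in EUR for one workload."""
    input_price, output_price = PRICES_EUR[model]
    return input_tokens * input_price + output_tokens * output_price


if __name__ == "__main__":
    # Example workload: 2M input tokens and 500k output tokens.
    cost = estimate_cost("Llama-3.3-70B-Instruct", 2_000_000, 500_000)
    print(f"Estimated cost: EUR {cost:.2f}")
```

`Decimal` is used instead of floats so that very small per-token prices add up without rounding drift.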

Deploy models from Hugging Face

Only pay for the compute you use, down to the minute. Deploy using Custom Models.

| GPU Instance | VRAM | vCPU | RAM | Hourly Price |
|---|---|---|---|---|
| NVIDIA RTX6000 | 24 GB | 8 cores | 32 GB | €0.29 |
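
Because compute is billed down to the minute, the cost of a custom deployment can be approximated from its runtime and the hourly price above. The sketch below is illustrative only: `estimate_gpu_cost` is a hypothetical helper, and it assumes that a partial minute is rounded up to a full billed minute, which the pricing page does not state.

```python
from decimal import Decimal, ROUND_HALF_UP

# Hourly price in EUR for the NVIDIA RTX6000 instance, from the pricing above.
HOURLY_PRICE_EUR = Decimal("0.29")


def estimate_gpu_cost(runtime_seconds: int) -> Decimal:
    """Estimate the cost in EUR of running one instance for runtime_seconds."""
    # Assumption: partial minutes are rounded up to the next whole minute.
    billed_minutes = (runtime_seconds + 59) // 60
    cost = HOURLY_PRICE_EUR * billed_minutes / 60
    # Round to four decimal places for display.
    return cost.quantize(Decimal("0.0001"), rounding=ROUND_HALF_UP)


if __name__ == "__main__":
    # Example: a deployment that runs for 90 minutes.
    print(f"Estimated cost: EUR {estimate_gpu_cost(90 * 60)}")
```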
