30 days free • Test Regolo with full access for the first 30 days, no charge.

Choose the option that best fits your needs
Zero Data Retention, Fast and Secure AI

Choose a simple subscription or go fully flexible with our pay-as-you-go plan.
Access to all features. Scale when you need. Pay only for what you use.

Pay-as-you-Go

Perfect for teams prototyping, testing, or running variable workloads. No commitment – scale up any time.

€0 / month

Pay only for tokens you use

Start with 30 Days Free + UNLIMITED tokens

No credit card needed. Get the full service for 30 days, risk‑free.

Complete access to all Core Models

Enterprise

Tailored for enterprises that need high‑volume API calls on any AI model, with custom pricing built just for you.

Get a quote

Discounts on high volume

Complete access to all Core Models

Custom SLA for enterprises

Regolo Core Models

Best-in-class model performance, effortless autoscaling, and blazing fast cold starts mean you get the most out of each GPU, saving money along the way.

Models library pricing

Here you can explore the full list of supported models, together with their token pricing* and limits. Use this table to plan your workloads and choose the most cost‑efficient models for each task.

Large Language Models

Text and multimodal chat models priced per million tokens (input and output).

                          Pay-as-you-go                                            Subscription
Model                     Input cost per 1M tokens    Output cost per 1M tokens    Core | Boost
apertus-70b               €0.40                       €2.10                        Included
gemma4-31b                €0.40                       €2.10                        Included
gpt-oss-120b              €1.00                       €4.20                        Included
gpt-oss-20b               €0.10                       €0.42                        Included
Llama-3.1-8B-Instruct     €0.05                       €0.25                        Included
Llama-3.3-70B-Instruct    €0.60                       €2.70                        Included
minimax-m2.5              €0.60                       €3.80                        Included
mistral-small-4-119b      €0.50                       €2.10                        Included
mistral-small3.2          €0.50                       €2.20                        Included
qwen3-coder-next          €0.50                       €2.00                        Included
qwen3.5-122b              €1.00                       €4.20                        Included
qwen3.5-9b                €0.07                       €0.35                        Included

Enterprise pricing may apply for high-scale or custom deployments.

Talk with our engineers to fit a special offer for your custom needs.

*Prices exclude VAT, which is applied only in Italy.
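
To help plan a workload from the pay-as-you-go prices listed above, here is a minimal Python sketch of the arithmetic: tokens multiplied by the per-1M-token price, summed over input and output. The helper name `estimate_cost_eur` and the `PRICES_EUR_PER_1M` dictionary are illustrative assumptions and not part of any Regolo SDK. Only three models are copied in as examples, and the result excludes VAT.

```python
from decimal import Decimal

# Pay-as-you-go prices in EUR per 1M tokens as (input, output),
# copied from the pricing table. Only a subset is shown here.
PRICES_EUR_PER_1M = {
    "Llama-3.1-8B-Instruct": (Decimal("0.05"), Decimal("0.25")),
    "gpt-oss-20b": (Decimal("0.10"), Decimal("0.42")),
    "gpt-oss-120b": (Decimal("1.00"), Decimal("4.20")),
}

ONE_MILLION = Decimal(1_000_000)


def estimate_cost_eur(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Hypothetical helper: estimated pay-as-you-go cost in EUR, excluding VAT."""
    input_price, output_price = PRICES_EUR_PER_1M[model]
    total = input_tokens * input_price + output_tokens * output_price
    return total / ONE_MILLION
```

For example, `estimate_cost_eur("gpt-oss-120b", 2_000_000, 500_000)` adds €2.00 for input and €2.10 for output, giving €4.10. `Decimal` is used instead of floats so that small per-token amounts do not accumulate rounding error.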

Deploy models from Hugging Face

Only pay for the compute you use, down to the minute. Deploy using Custom Models.

GPU Instance      VRAM     vCPU       RAM      Hourly Price
NVIDIA RTX6000    24 GB    8 cores    32 GB    €0.29
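
As a rough illustration of per-minute billing at the hourly price listed above, the following Python sketch estimates the cost of a deployment from its runtime. The function name `estimate_gpu_cost_eur` is hypothetical, and rounding partial minutes up to the next whole minute is an assumption: the page only states that billing is "down to the minute".

```python
import math
from decimal import Decimal

# Hourly price for the NVIDIA RTX6000 instance, from the pricing table.
HOURLY_PRICE_EUR = Decimal("0.29")


def estimate_gpu_cost_eur(runtime_seconds: int) -> Decimal:
    """Hypothetical helper: estimated compute cost in EUR for one instance."""
    # Assumption: a partially used minute is billed as a full minute.
    billed_minutes = math.ceil(runtime_seconds / 60)
    return billed_minutes * HOURLY_PRICE_EUR / 60
```

Under these assumptions, a 90-minute run (`estimate_gpu_cost_eur(5400)`) costs €0.435, and a 61-second run is billed the same as a 120-second run.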
