
Flat plans for predictable, production-ready AI

Choose the option that best fits your long-term needs. Both plans allow you to use any Large Language Model, with token-based billing under the hood and a simple, fixed monthly price on your invoice.

30 days free: test Regolo with full access for the first 30 days, at no charge.
Best for steady workloads

Core Plan

Ideal for getting started with Regolo and running consistent AI workloads without upfront commitments.

€39*
per month

With a 70% discount for the first 3 months.

Start Free 30‑Day Trial

No credit card needed. Get the full service for 30 days, risk‑free.


Access to core Large Language Models

20 million tokens per day

Email support included

Most popular

Boost Plan

Unleash your AI's full potential with a more powerful plan designed for higher-volume, production workloads.

€89*
per month

Flat monthly price for greater capacity.

Start Free 30‑Day Trial

No credit card needed. Get the full service for 30 days, risk‑free.


Access to core Large Language Models

50 million tokens per day

Priority throughput and support

30‑day free trial — no upfront charge

Two pricing models, one simple token-based foundation

Every Regolo plan uses the same foundation: tokens. You can use any Large Language Model and track your consumption in real time from the Regolo dashboard. Choose the pricing model that best matches how you work.

Pay-as-you-go

For developer teams who need flexibility

Access all Regolo AI models every day with fully elastic, token-based pricing. Only pay for the tokens you consume — no upfront commitments or fixed capacity.

  • Perfect for teams who are prototyping, testing, or running variable workloads.
  • Take control of your usage via the Regolo dashboard, with real-time tracking and cost visibility.
  • Scale up and down freely, paying strictly for the tokens actually consumed.
  • Tap into the full Regolo.ai stack: not just LLMs, but also speech‑to‑text, image generation, rerank, embeddings and more, all under the same elastic model.
Under this model, pricing varies per model and usage. See the table below for detailed token costs.

Flat plans

For companies that need predictable pricing

Secure a fixed monthly price with guaranteed daily capacity. Ideal for teams running production workloads that must stay within a defined budget.

  • Core Plan — ideal for consistent usage with up to 20 million tokens per day.
  • Boost Plan — designed for higher-volume workloads with up to 50 million tokens per day.
  • Enjoy transparent, predictable invoices with no overage surprises when operating within plan limits.
  • Unlock maximum value per token: when you fully use your plan, the effective cost per token is lower than pay‑as‑you‑go, turning predictable spend into a built‑in discount.
You still benefit from token-based pricing internally, but your external cost remains a simple, flat monthly fee.
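
As a rough illustration of the "effective cost per token" point above, the sketch below divides each flat monthly price by the tokens available at full daily usage. It assumes a 30-day month and the list prices and daily capacities shown on this page (before any discount or VAT). The function and dictionary names are illustrative and not part of any Regolo API.

```python
from decimal import Decimal

# List prices and daily capacities as stated on this page (before discounts and VAT).
PLANS = {
    "Core": {"monthly_eur": Decimal("39"), "tokens_per_day": 20_000_000},
    "Boost": {"monthly_eur": Decimal("89"), "tokens_per_day": 50_000_000},
}


def effective_cost_per_million(plan: str, days_in_month: int = 30) -> Decimal:
    """Flat monthly price divided by the tokens available at full usage,
    expressed in EUR per 1 million tokens.

    Assumption: the full daily capacity is consumed on every day of the month.
    """
    p = PLANS[plan]
    monthly_tokens = p["tokens_per_day"] * days_in_month
    return p["monthly_eur"] / Decimal(monthly_tokens) * Decimal(1_000_000)


for name in PLANS:
    rate = effective_cost_per_million(name)
    print(f"{name}: about €{rate:.4f} per 1M tokens at full usage")
```

At partial usage the effective rate rises proportionally, so a plan used at half capacity costs twice as much per token as these figures suggest. Multiplying a model's per-token price by 1,000,000 gives a number directly comparable with these results.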

When you subscribe to Regolo — whether you choose pay-as-you-go or a flat plan — your first 30 days of usage are completely free of charge.

How token-based billing works

Under both pricing models, Regolo bills usage in tokens. Each model in our library has its own token price, so you always know how much you spend on prompts and on generated responses.

1. Choose your model

Select any LLM from the Regolo model library. You can freely mix and match models based on your use case and performance requirements.

2. Consume tokens

Each request consumes tokens depending on prompt size and response length. The cost per token depends on the specific model you are using.

3. Track in real time

Use the Regolo dashboard to monitor token usage, costs and limits in real time, keeping your team fully in control of consumption.
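
The three steps above boil down to simple arithmetic: input tokens times the model's input price, plus output tokens times its output price. The sketch below shows that calculation using the gpt-oss-120b prices listed on this page. The token counts are made-up example values, and the function name is illustrative rather than part of any Regolo SDK.

```python
from decimal import Decimal


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: Decimal, output_price: Decimal) -> Decimal:
    """Cost in EUR of one request, given per-token input and output prices."""
    return input_tokens * input_price + output_tokens * output_price


# gpt-oss-120b per-token prices as listed on this page.
GPT_OSS_120B_INPUT = Decimal("0.000001")
GPT_OSS_120B_OUTPUT = Decimal("0.0000042")

# Hypothetical request: a 1,200-token prompt and a 300-token response.
cost = request_cost(1_200, 300, GPT_OSS_120B_INPUT, GPT_OSS_120B_OUTPUT)
print(f"Estimated cost: €{cost}")
```

`Decimal` is used instead of floating point because per-token prices are very small and rounding errors would otherwise accumulate over many requests.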

Below you’ll find a detailed table with all available models and their respective pricing per token. Use it to estimate your workloads and compare options across vendors and capabilities.

Models library pricing

Here you can explore the full list of supported models, together with their token pricing* and limits. Use this table to plan your workloads and choose the most cost‑efficient models for each task.

Large Language Models

Text and multimodal chat models priced per token (input and output).

Model name             | Input cost per token | Output cost per token
deepseek-r1-70b        | €0.0000006           | €0.0000027
gemma-3-27b-it         | €0.00000095          | €0.0000055
gpt-oss-120b           | €0.000001            | €0.0000042
gpt-oss-20b            | €0.0000001           | €0.00000042
Llama-3.1-8B-Instruct  | €0.00000005          | €0.00000025
Llama-3.3-70B-Instruct | €0.0000006           | €0.0000027
mistral-small3.2       | €0.0000005           | €0.0000022
Qwen3-8B               | €0.00000007          | €0.00000035
qwen3-coder-next       | €0.0000005           | €0.000002
qwen3-vl-32b           | €0.0000005           | €0.0000025
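
To plan a workload with these prices, one option is to project a daily token volume over a month for a few candidate models. The sketch below does this for three models from the table above. The workload (8 million input and 2 million output tokens per day) and the 30-day month are assumptions chosen only for illustration, and the names are not part of any Regolo API.

```python
from decimal import Decimal

# (input, output) price per token in EUR, copied from the table above.
PRICES = {
    "Llama-3.1-8B-Instruct": (Decimal("0.00000005"), Decimal("0.00000025")),
    "gpt-oss-20b": (Decimal("0.0000001"), Decimal("0.00000042")),
    "Llama-3.3-70B-Instruct": (Decimal("0.0000006"), Decimal("0.0000027")),
}


def monthly_cost(model: str, input_per_day: int, output_per_day: int,
                 days: int = 30) -> Decimal:
    """Projected pay-as-you-go cost in EUR for a steady daily workload."""
    input_price, output_price = PRICES[model]
    daily = input_per_day * input_price + output_per_day * output_price
    return daily * days


# Hypothetical workload: 8M input tokens and 2M output tokens per day.
for model in PRICES:
    total = monthly_cost(model, 8_000_000, 2_000_000)
    print(f"{model}: €{total:.2f} per month")
```

Comparing these projections with a flat plan's monthly price is one way to decide which pricing model suits a given workload.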

Embedding

Semantic embedding models priced per request.

Model name         | Cost per request
gte-Qwen2          | €0.001
Qwen3-Embedding-8B | €0.001

Speech-To-Text

Audio transcription models priced per second of audio.

Model name              | Cost per second
faster-whisper-large-v3 | €0.00015
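
Per-second pricing is easy to translate into a per-recording estimate. The sketch below multiplies audio duration by the faster-whisper-large-v3 price listed on this page. It assumes billing on the exact duration, with no rounding or minimum charge, which this page does not specify. The function name is illustrative.

```python
from decimal import Decimal

# faster-whisper-large-v3 price per second of audio, as listed on this page.
PRICE_PER_SECOND = Decimal("0.00015")


def transcription_cost(hours: int = 0, minutes: int = 0, seconds: int = 0) -> Decimal:
    """Estimated cost in EUR of transcribing audio of the given duration.

    Assumption: billing follows the exact duration, without rounding or minimums.
    """
    total_seconds = hours * 3600 + minutes * 60 + seconds
    return total_seconds * PRICE_PER_SECOND


print(f"1 hour of audio: €{transcription_cost(hours=1)}")
print(f"90 seconds of audio: €{transcription_cost(seconds=90)}")
```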

Image Generation

Image generation models priced per pixel.

Model name | Cost per pixel
Qwen-Image | €0.0000000005
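
Because the per-pixel price is so small, it helps to convert it into a per-image figure. The sketch below multiplies width, height and image count by the Qwen-Image price listed on this page. The resolutions are example values, and the function name is illustrative rather than part of any Regolo API.

```python
from decimal import Decimal

# Qwen-Image price per generated pixel, as listed on this page.
PRICE_PER_PIXEL = Decimal("0.0000000005")


def image_cost(width: int, height: int, count: int = 1) -> Decimal:
    """Estimated cost in EUR of generating `count` images of width x height pixels."""
    return width * height * count * PRICE_PER_PIXEL


# Example resolutions chosen only for illustration.
print(f"One 1024x1024 image: €{image_cost(1024, 1024)}")
print(f"1,000 images at 1024x1024: €{image_cost(1024, 1024, count=1000)}")
```

Cost grows with the pixel count, so doubling both width and height quadruples the price of an image.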

OCR

Optical character recognition models priced per request.

Model name   | Cost per request
deepseek-ocr | €0.02

Rerank

Reranking models priced per query.

Model name        | Cost per query
Qwen3-Reranker-4B | €0.01

*Prices exclude VAT, which is applied only in Italy.