Self‑Hosting & DevOps
3 min read

Drop-In OpenAI Replacement: Swap base_url to EU-Hosted Regolo

Replacing OpenAI with a European, GDPR-compliant inference provider does not require rewriting your application. Because we provide an OpenAI-compatible endpoint, you only need to…

Alex Genovese
Self‑Hosting & DevOps
10 min read

DFlash: 3x LLM inference speed – guide and code

DFlash is a new block-diffusion-based speculative decoding technique that speeds up large language model (LLM) inference by predicting multiple tokens in parallel. Unlike…

Alex Genovese