Skip to content

Bring your own models

Typillar generates code with a large language model, but it never runs that model on its own servers. Inference happens on your side — either through Cloudflare Workers AI or through a model provider key you supply. This page explains why, and what you can choose.

If you connect Cloudflare, you can run inference on Workers AI — models hosted on Cloudflare’s network, billed through your Cloudflare account. There’s nothing extra to configure beyond the Cloudflare connection itself.

Prefer a specific model or vendor? Supply your own API key and Typillar will use that provider for inference. Your key is stored encrypted and used only to run your builds. See Models & API keys for how keys are handled.

  • Cost transparency. You see and pay for inference directly, on your own account — no markup, no proxy, no surprise line item.
  • Control. You choose the model and provider, and you can change or revoke the key at any time.
  • A clean trust boundary. Because Typillar never runs the model, your prompts and generated code don’t pass through our inference servers. See Zero inference on our servers.
If you want…Use
The simplest setupCloudflare Workers AI
A specific model or vendorYour own provider key
Everything billed in one placeCloudflare Workers AI
To reuse an existing model contractYour own provider key

You can start with Workers AI and switch to your own key later — it doesn’t change how the loop works, only where the tokens are generated.