Skip to main content
Progressical

For indie builders

You shipped it on a weekend.The harness needs more than a weekend.

We engineer the layer between your LLM and your application, without making you hire someone you don't want to hire.

The shape of this problem

You know what we're talking about.

You built it in a weekend with Lovable, or Bolt, or Cursor, or v0, or just Claude in a browser tab. It worked. It was magic. You shipped it. People started paying for it.

Then the support DMs started. The screenshots of weird outputs. The user who got something that didn't look right and posted about it on X. The metric that should be going up linearly is instead going up sawtooth. Every fifth interaction makes a new user churn.

The collective name for the thing that's broken is the harness: the layer between your LLM and your application. You don't have one. Or, more accurately, you have one, but it was assembled by accident and you didn't know it counted as engineering.

You're a designer who can ship. You shipped a real product. You don't want to hire a senior AI engineer for a year. You want to keep shipping. You want the rough edges off without slowing down.

That's what we do.

What you get

Three things. Nothing extra.

01

A hardened harness, same vibe

Same feature, same look, same product. Just with the failure modes engineered out: structured outputs where they need to be structured, retries when the model fumbles, fallbacks that degrade gracefully instead of embarrassingly, prompts that don't break when users put weird things in them.

02

A test suite that runs in your CI

We write a small evaluation set drawn from your real users' interactions, and a script that runs it. Next time you want to change the prompt, push to a new model, or add a feature, you'll know in five minutes whether you broke anything.

03

A short writeup of what we changed and why

You'll be able to read it. We'll explain it in language that doesn't require an ML PhD. The point is for you to understand the harness you're now running, so future changes are yours, not ours.

Indie pricing

Scaled for your stage.

One week

Indie Audit

$5,000

Diagnostic on your existing setup, prioritized fix list. Same diagnostic methodology as the standard audit, scoped down to one feature.

Start here

Two weeks

Most common

Indie Rebuild

$15,000

Audit plus rebuilt harness as a PR or in-place edits to your codebase. Eval set included. Money back if it doesn't fix the failure modes you flagged.

Start here

Ongoing

Indie Operations

$1,000/month

Monitoring on the rebuilt harness. Alerts when something regresses. Quarterly tune-up included.

Start here

This pricing is for products with under $1M ARR or fewer than five people on the team. Above that, see the standard tiers.

Concrete failure modes we fix

The things you've been Googling at 11pm.

Outputs that fail to parse.

Your code expects JSON, the model occasionally returns markdown-wrapped JSON, your app crashes. We fix the prompt, add structured output enforcement, add a parser fallback. Failures drop to near zero.

Magic that works for some users and not others.

The output is great when inputs are short and clean, breaks when they're long or messy. Usually a retrieval or context-window issue addressed by smarter chunking and better prompt structure.

Quality drift after a model swap.

You moved from one model to another to save money or get speed, and quality silently dropped. We build the eval set you should have had, identify what regressed, and tune the harness to recover the lost quality.

Hallucinated facts in confident voice.

The model is making up information and presenting it confidently. Usually a retrieval problem, or the lack of retrieval entirely. We add or fix the retrieval layer and tune the prompt to ground in retrieved content.

Conversation memory that loses the thread.

Multi-turn interactions forget what was said three turns ago, or worse, hallucinate things that were never said. Usually a memory-management problem in the harness, fixable with better summarization strategy plus selective retention of raw turns.

If you're nodding at any of these, you're our customer.

Who this is for

You shipped an AI product. You have paying users. You're seeing predictable failure modes you don't have time or skill to fix. You'd rather spend $15K and two weeks of mostly-no-effort than spend three months learning to be your own AI engineer.

Who this isn't for

You haven't shipped. You're trying to learn AI engineering yourself and want a tutor. Your product doesn't actually use generative AI in a meaningful way and you came here from a Twitter thread.

Send us your Lovable URL, your Bolt project,or just a screenshot.

Twenty minutes on a call and we'll tell you whether we can help. No obligation, no sales pitch.