Moonborn — Developers

Eval checklist

A 30-minute decision matrix for product teams evaluating Moonborn. What to test, what's a no-go, what's a "later.

A practical 30-minute matrix. Go in this order; stop at the first no-go.

Must-haves (15 minutes)

Voice fingerprint + drift detection produces a usable score on your sample replies. Test on 10 of your last support transcripts.
OpenAI-compat endpoint slot-fits where you currently call OpenAI (model field accepts persona ID).
Free tier covers a 50-persona, 200-chat-session pilot.
Data residency option matches your compliance posture (US or EU).
Webhook delivery + signature verification is well-documented.

Should-haves (10 minutes)

Provocation test suite passes ≥ 0.85 on a persona that represents your domain.
Audit verdict aligns with your editorial judgment on 5 sample personas.
SDK exists for your stack (TS, Python, Go, Ruby, Rust, Elixir).
Pricing tier matches your projected volume.

Nice-to-haves (5 minutes)

MCP server is available for your IDE.
Marketplace listing UX is one you'd want your team to use.
Lineage + fork tree maps onto your brand variant strategy.

No-go signals

If any of these are true, stop here:

You need on-prem deployment (Moonborn is hosted only; ADR 0012).
Your compliance regime requires HIPAA BAA but you're on a non- Enterprise tier.
Your voice work needs Soul-level edits that contradict the four- layer model.
You need a region that isn't US or EU (Asia Pacific not in v1).

"Later" signals

These aren't blockers but slow rollout — note them now:

Marketplace commerce (paid listings) is Enterprise only — not blocking for a pilot, but blocks creator monetization.
Custom moderation classifiers are Enterprise — start with defaults.
More SDK languages — start with TS or Python; the others may lag for new APIs.

Output

Run this matrix and you'll have a Go / No-Go / Conditional in 30 minutes. Combine with the ROI calculation for the business case.

Related