Synthetic Customer Research: When LLM Personas Beat Real Interviews

The classic ICP-discovery loop is broken: pre-launch, you don't have customers to interview. Post-launch, you don't have time. Synthetic customer research — LLM-generated personas that simulate ICP responses — has become a real tool in the early product and positioning loop. It's also the most over-claimed technique in marketing right now.

Here's where it actually works, where it actively misleads, and how I structure it for the teams I work with.

Where it works

Synthetic research is directionally useful for three jobs:

1. Positioning stress tests. Generate 8-12 personas across your target segments, hand them three positioning statements, and ask which resonates and why. You'll get back 40 pages of plausible reactions in 20 minutes. Most of it isn't real signal — but the clustering (which positioning got the most "I don't get it" responses, which got "but how is this different from X?") is genuinely useful.

2. Objection mapping. Before you write the FAQ, before you train SDRs, before you ship the pricing page — run the messaging past a panel of "skeptical buyer" personas. The objections they raise are 70-80% predictive of the real ones. The 20-30% gap is where the surprises (and the actual insights from real interviews) live.

3. Naming and message screening. Generate 50 personas and ask each one what they think a product called "X" does. If 40 of them are wrong, your name is wrong. This screen is cheap and works.

Where it actively misleads

Synthetic panels are dangerous for:

Willingness-to-pay. Personas will happily tell you they'd pay $99/month for anything. There is no budget in the simulation. Trust nothing they say about pricing.

Behavioral predictions. "Would you use this daily?" Yes, says the persona. No, says reality. LLMs predict stated preferences, which barely correlate with revealed behavior even in real research.

Novel use cases. A synthetic ICP doesn't know about that weird workflow your real customers invented. Real interviews are still the only way to surface the use cases you didn't think to ask about.

Confidence calibration. The model writes with the same confident tone whether it's giving you a real insight or a hallucinated one. There's no "I don't know" in the default voice. You have to build that in.

How I structure a panel

A useful synthetic research session has four moves:

Step 1: Persona generation with constraint. Don't ask for "10 personas of marketing leaders." Specify: company stage, team size, tooling, prior failure modes, current OKRs. The more constrained, the less generic the output.

Step 2: Independent runs, not group chat. Run each persona in a fresh context. If you batch them in one prompt, the model converges them toward each other and you lose variance — which was the whole point.

Step 3: Adversarial roles. Always include 2-3 "skeptical buyer" or "evaluator-who-prefers-the-incumbent" personas. Without these, every panel turns into a yes-fest.

Step 4: Score the agreement, not the answers. What you're looking for is consensus across personas, not any individual response. If 9 of 12 raise the same objection, that's a signal. If one says something brilliant, it's probably just a vivid hallucination.

The integration that matters

Synthetic research isn't a replacement for real research. It's a prefilter that makes the real research dramatically more efficient.

The new loop:

Run a synthetic panel to surface the top 5 hypotheses
Book 6 real interviews to test those specific hypotheses
Compare. The deltas between synthetic and real are themselves the most interesting finding.

Teams that use this loop ship positioning faster, with sharper messaging, and with fewer "we should have caught that" moments after launch. Teams that replace real research with synthetic eventually ship a product nobody wants.

The honest part

Synthetic customer research is a hammer. The temptation when you have it is to treat everything as a nail — to skip the customer call, ship the campaign on simulated approval, and let the panel be the focus group. Don't.

It's a fast, cheap, surprisingly useful prefilter. It is not the research. Treat it as the first 30% of the job and the last 30% will go much better.

Where it works

Where it actively misleads

How I structure a panel

The integration that matters

The honest part

Enjoyed this? Get more like it weekly.