Some approaches in the market present digital twins and synthetic users as mutually exclusive methodologies, arguing that one fundamentally supersedes the other. In reality, both are digital twins of humans—they simply operate at different levels of fidelity and serve complementary purposes.
Think of it like fabrics: synthetic users are like synthetic tissue—cheaper, more stretchy, easier to scale and manipulate. Real digital twins are like natural fabric—authentic, preserving the original texture and behavior, less flexible but truer to the source. You wouldn't say synthetic fabric makes natural fabric obsolete; you'd recognize they serve different purposes. The same principle applies here.
The false dichotomy
The Misconception
Some methodologies champion the ability to "model the space of likely behaviors rather than individuals" and emphasize generalization across populations. This is presented as inherently superior. However, in innovation contexts, this representativeness is not a feature—it's a fundamental limitation.
Innovation research requires an idiographic approach that focuses on understanding individuals in their uniqueness rather than nomothetic methods based on statistical analysis of aggregated samples.
The innovation paradox: when generalization becomes a bug, not a feature
1. Breakthrough insights live at the edges, not in the average
When you optimize for "representative behavior" and "real-world distributions," you're explicitly designing to find the mean. But innovation—especially disruptive innovation—emerges from understanding outliers, edge cases, and the unique combinations of needs that don't yet have market presence.
The Idiographic Advantage
The idiographic approach enables discovery of fundamental insights that escape pre-established categories and the ability to anticipate emerging trends before they become statistically significant.
Consider: Netflix didn't emerge from understanding the "average video rental customer." It came from deeply understanding specific frustrations (late fees, limited selection, the trip to the store) that a traditional market research panel would have averaged away as acceptable friction.
2. The combinatorial "explosion" is actually rich territory
Some argue that the combinatorial complexity of real human variation creates an "explosion" that makes digital twins impractical—that attempting to model all possible combinations of attributes means you end up "collecting twins, not understanding people."
"People" vs. Individuals
This fundamentally misunderstands what "understanding people" means in an innovation context. "People" is a dangerous abstraction. What we call "people" is actually a collection of unique individuals who happen to share common sensitivities to specific problems and value propositions. When you're designing products, services, or experiences, you're not designing for "people"—you're designing for someone.
The goal of a real digital twin is not to be generalized. It's precisely the opposite. Each real twin represents an individual with specific history, motivations, constraints, and decision-making patterns—and those specifics are where the innovation insights live. A synthetic user, by design, smooths over these specifics to achieve scalability—sacrificing individual authenticity for operational flexibility.
The "combinatorial explosion" some fear is actually the rich, high-dimensional space where real human needs exist. Yes, it's complex. That's because humans are complex. Averaging them into tractable distributions doesn't eliminate that complexity—it just hides it from view, along with the opportunities it contains.
Different levels of representativeness: different jobs to be done
Here's where the complementarity becomes clear. Both approaches create digital representations of humans, but with different fidelity-to-individual versus scalability trade-offs:
Synthetic Users Excel At
- Validating concepts across demographic segments
- Stress-testing messaging for broad appeal
- Identifying patterns across large populations
- Scaling rapid iteration on known design spaces
- Market sizing and segmentation
Trade-off: Lower individual representativeness, higher flexibility and scale
Real Digital Twins Excel At
- Discovering unarticulated needs
- Understanding decision-making processes in depth
- Identifying non-obvious value propositions
- Exploring how specific contexts shape choices
- Innovation that requires individual-level fidelity
Trade-off: Higher individual representativeness, more authentic but less flexible
85%
Real Digital Twins
Accuracy
70%
Synthetic Users
Accuracy
Real twins based on real conversations with specific individuals can achieve up to 85% accuracy—equivalent to the accuracy with which real participants replicate their own responses after two weeks. This level of fidelity matters when you're trying to understand why someone makes a choice, not just what choice they're likely to make.
Synthetic users, constructed from aggregated patterns and demographic profiles, typically achieve up to 70% accuracy. That's still valuable—it's just a different tool optimized for different outcomes.
The drift argument reveals the real trade-off
Some argue that digital twins "need constant re-training per individual or segment" and that this creates cost and latency problems. This is true—but it's not a bug, it's revealing the actual trade-off:
The Trade-off
- Synthetic users optimize for operational efficiency at the cost of individual fidelity.
- If your goal is to run 10,000 simulated user tests this quarter, synthetic users are the right choice.
- If your goal is to deeply understand why your early adopters are churning despite loving your product, you need the depth and specificity that only real twins can provide.
The "drift" challenge is real, but modern digital twin platforms implement advanced quality control systems including real-time quality indicators, cross-agent validation for critical results, and scheduled quality assurance protocols that maintain accuracy over time.
The fabric analogy: natural vs. synthetic
Both synthetic users and real digital twins are digital representations—digital twins—of human behavior. The distinction is in their construction and properties:
Synthetic Users = Synthetic Fabric
- Manufactured from aggregated patterns and demographic data
- Highly stretchy and adaptable to different scenarios
- Cheaper and faster to produce at scale
- Consistent and predictable behavior
- Great for mass production and broad applications
- Some loss of natural texture and authenticity
Real Digital Twins = Natural Fabric
- Created from actual conversations with real individuals
- Preserves authentic behavioral patterns and decision-making
- More expensive and time-intensive to create
- Less flexible but truer to the original source
- Essential when authenticity and individual fidelity matter
- Retains the complexity and uniqueness of the individual
You wouldn't argue that polyester makes cotton obsolete, or vice versa. The fashion industry uses both strategically. The same applies to user research.
Where they work together
The most sophisticated research programs don't choose between these approaches—they sequence them:
Discovery Phase
Real Digital Twins: Conduct deep, idiographic research with real twins to uncover novel insights, unmet needs, and surprising decision patterns
Validation Phase
Synthetic Users: Use synthetic approaches to test whether discovered patterns hold across broader populations
Optimization Phase
Both: Return to specific digital twins for refinement while using synthetic users for rapid iteration
The privacy consideration
Some suggest that digital twins require "sensitive, granular data" with "heavy governance" and "questionable ROI." Modern digital twin platforms implement GDPR-compliant processing with sophisticated real-time output filters that prevent revelation of sensitive information regardless of conversational context.
Privacy and Governance
Meanwhile, synthetic users—precisely because they're generalized—may actually pose greater privacy risks in aggregate, as they can inadvertently encode biases and patterns from training data that reveal sensitive population-level information.
Conclusion: complementary, not competing
Some perspectives position this as a binary choice: scale with synthetic users or fail with digital twins. The reality is more nuanced. Both approaches are digital twins operating at different levels of individual representativeness, and the best research programs use both strategically.
Breadth & Flexibility
Synthetic Users
Depth & Authenticity
Real Digital Twins
When you need to understand the market, test at scale, and optimize for known patterns: synthetic digital twins are powerful and efficient.
When you need to discover what you don't yet know, understand deep causality, and find innovation opportunities: real digital twins provide irreplaceable fidelity.
The Winning Strategy
The question isn't which approach scales better—it's which level of individual representativeness your research objective requires. Innovation requires both breadth and depth. Synthetic users give you breadth and flexibility. Real digital twins give you depth and authenticity.
The companies that win won't be the ones who chose the "right" methodology. They'll be the ones who knew when to use each.