The Onboarding Data Your Avatar Actually Needs
Most people over-collect reference material and under-specify the things that actually determine output quality — here's the input framework that matters.

I run the creative side of our agency, and one of the things that surprised me most when we started building Kyndrify was how often poor avatar outputs trace back to poor inputs — not to the model itself. People assume the model is the variable. In reality, the model is largely a constant for a given quality tier; what varies enormously is the quality and structure of the data you bring to it at the start.
There's a common belief that more reference material means better results. Collect more photos, write a longer description, give the model everything you have. In practice, this often backfires. Models don't get better at the edges when you throw volume at them — they get confused. The art is knowing exactly what data matters and providing it in a structure the model can actually use.
The Four Data Categories That Actually Drive Output Quality
After building and iterating on our own avatar tooling, I've identified four categories of input data that consistently predict output quality. Everything else is either redundant or noise.
Appearance anchors: 3–5 high-quality reference images in consistent lighting, showing the subject clearly at close range
Style context: explicit descriptors for visual tone — editorial, documentary, illustrative, cinematic — not aesthetic adjectives like "professional" or "modern"
Deployment context: where the avatar will appear (video thumbnail, web hero, social profile, presentation slide) — this shapes framing and composition decisions
Constraint list: what the output must NOT include — specific background elements, color palettes to avoid, stylistic treatments that conflict with brand guidelines
What Most People Over-Provide
The most common over-provision I see is reference images — specifically, low-quality or inconsistent photos that the model can't reliably use as anchors. Ten mediocre photos produce worse results than three excellent ones. The model isn't averaging your references; it's finding patterns in them, and inconsistent lighting, framing, or quality across references produces inconsistent output patterns.
Too many low-quality reference images dilute the signal
Overly detailed personality descriptions add no value to image generation models
Style references that conflict with each other (e.g., "natural lighting" + "high-fashion editorial") produce unpredictable blended outputs
What Most People Under-Provide
Under-provision tends to happen in the constraint list. Most people tell models what they want; very few explicitly tell them what to avoid. But constraints are often more decisive than positive direction, because they rule out entire categories of failure mode. An avatar that reliably avoids the wrong things is more useful than one that sometimes hits the right ones.
How Kyndrify Structures the Onboarding Inputs
This input framework is baked directly into Kyndrify's onboarding flow. Rather than asking for open-ended descriptions, the platform guides you through each of the four categories with structured options — style context is selected from defined choices rather than typed freeform, deployment context shapes the generation parameters behind the scenes, and constraint handling is built into the option structure. The result is that you naturally provide the right data in the right format without needing to know what the model wants.
The Takeaway on Input Quality
Better onboarding data produces better avatars more reliably than any other single variable. Three strong reference images beat ten mediocre ones. Explicit constraints beat lengthy positive descriptions. Deployment context — which most people skip entirely — shapes output in ways that matter more the further along the production process you go. Get the inputs right before you touch the generation settings.
Sources
TTGC / Kyndrify — patterns from building AI avatar tooling. kyndrify.com
MIT Media Lab — research on computational representation and reference-based generation systems. media.mit.edu


