Skip to main content
Models: 9
Dimensions: 26
Trials: 56,640
Pre-registered: osf.io/et4nf

Behavioral Genomes

Each model's unique pattern of responses across all 26 dimensions. Compare how different AI systems weight content signals differently.

View effects for:
Pooled Data

Select Models to Compare

3 of 9 selected

Cosine Similarity Scores

Similarity ranges from 0% (completely different) to 100% (identical). Higher scores indicate models have more similar behavioral genomes.

GPT-5.4vso3
99.3%
similarity
GPT-5.4vsGemini 3.1 Pro
97.1%
similarity
o3vsGemini 3.1 Pro
97.7%
similarity

Strategic Implications

Actionable insights for targeting 3 selected models simultaneously.

Universal Wins

Signals ALL selected models respond to positively:

  • Comparison Framing+0.62

Model-Specific

High-divergence signals requiring conditional serving:

  • Scarcity Urgencyσ=0.33
  • Bundle Preferenceσ=0.29
  • Recommendation Revisionσ=0.28

💡 Recommendation

Cross-model alignment:

98%

High alignment: A unified content strategy will work well across these models. Focus on the universal wins above.

Radar Visualization

Overlaid comparison of 3 models. Hover over dimensions to see individual values.

Dimension-by-Dimension Comparison

Effect sizes for each dimension across selected models. Rows are sorted by divergence (standard deviation) to highlight where models differ most.

DimensionSorted by divergenceCluster
GPT-5.4Cohen's h
o3Cohen's h
Gemini 3.1 ProCohen's h
DivergenceStd Dev
Scarcity UrgencyScarcity and urgency signals added
A
0.22[0.08, 0.36]
0.07[-0.09, 0.22]
-0.54[-0.69, -0.41]
0.33Δ 0.76
Bundle PreferenceBundle offer with additional items added
A
0.09[-0.08, 0.25]
0.39[0.19, 0.61]
-0.32[-0.48, -0.16]
0.29Δ 0.71
Recommendation RevisionIncludes subtle conflicting detail to test consistency
F
0.08[-0.12, 0.28]
-0.10[-0.33, 0.12]
-0.58[-0.78, -0.38]
0.28Δ 0.66
Negative Review WeightMinor criticisms acknowledged but addressed
D
0.18[-0.02, 0.37]
-0.05[-0.27, 0.16]
-0.45[-0.65, -0.26]
0.26Δ 0.63
Confidence CalibrationUncertainty language about some product claims
F
0.17[-0.03, 0.37]
-0.05[-0.27, 0.16]
-0.41[-0.60, -0.21]
0.24Δ 0.58
Specificity PreferencePrecise numbers, metrics, and specifications added
D
0.01[-0.19, 0.21]
-0.15[-0.42, 0.14]
-0.53[-0.73, -0.34]
0.23Δ 0.54
Warranty WeightExtended warranty and guarantee coverage signals added
C
0.15[-0.05, 0.34]
0.09[-0.12, 0.31]
-0.29[-0.48, -0.09]
0.19Δ 0.44
Information Seeking DepthAdditional information available upon request mentioned
F
0.04[-0.16, 0.24]
0.12[-0.11, 0.36]
-0.32[-0.51, -0.13]
0.19Δ 0.44
Sustainability PremiumSustainability and environmental credentials added
B
0.03[-0.11, 0.17]
0.12[-0.03, 0.28]
-0.27[-0.41, -0.14]
0.17Δ 0.40
Third Party AuthorityThird-party expert endorsement added
A
0.15[0.03, 0.28]
0.38[0.26, 0.51]
-0.02[-0.15, 0.10]
0.17Δ 0.41
Local PreferenceLocal or domestic origin signals added
B
0.19[0.05, 0.33]
0.26[0.11, 0.41]
-0.09[-0.23, 0.05]
0.15Δ 0.35
Social Proof SensitivityQuantified social proof added
A
0.07[-0.06, 0.21]
0.13[-0.02, 0.28]
-0.20[-0.34, -0.07]
0.14Δ 0.33
Free Trial ConversionFree trial or bonus offer added
A
0.19[0.02, 0.36]
0.25[0.07, 0.44]
-0.07[-0.23, 0.10]
0.14Δ 0.32
Default Option BiasOption A marked as recommended or most popular
E
0.33[0.14, 0.53]
0.48[0.28, 0.70]
0.14[-0.06, 0.34]
0.14Δ 0.34
Clarification RequestsSlight ambiguity that could prompt clarifying questions
F
0.01[-0.18, 0.21]
0.02[-0.19, 0.24]
-0.27[-0.46, -0.08]
0.14Δ 0.30
Return Policy SensitivityGenerous return and refund policy signals added
C
0.24[0.05, 0.44]
0.23[0.02, 0.45]
-0.00[-0.20, 0.19]
0.11Δ 0.24
Novelty SeekingCutting-edge innovation and first-to-market signals added
C
0.04[-0.11, 0.19]
-0.02[-0.18, 0.15]
-0.22[-0.37, -0.07]
0.11Δ 0.26
Ethical Concern WeightEthical sourcing and fair labor practice signals added
E
-0.01[-0.21, 0.18]
-0.06[-0.29, 0.16]
-0.25[-0.45, -0.05]
0.10Δ 0.24
Privacy TradeoffStrong privacy protection signals added
B
-0.08[-0.21, 0.06]
-0.01[-0.17, 0.14]
-0.22[-0.36, -0.09]
0.09Δ 0.20
Platform EndorsementPlatform endorsement badge added
A
0.32[0.18, 0.46]
0.20[0.05, 0.35]
0.12[-0.02, 0.25]
0.08Δ 0.20
Recency BiasRecent updates and improvements emphasized
D
-0.28[-0.47, -0.08]
-0.17[-0.38, 0.05]
-0.35[-0.54, -0.15]
0.07Δ 0.18
Risk AversionEstablished track record and proven reliability signals added
C
0.09[-0.12, 0.28]
0.23[0.02, 0.44]
0.06[-0.14, 0.26]
0.07Δ 0.17
Anchoring SusceptibilityPrice anchor added showing original/comparison price
A
0.06[-0.11, 0.22]
-0.07[-0.25, 0.12]
-0.04[-0.20, 0.13]
0.05Δ 0.12
Brand Premium AcceptanceBrand heritage and premium positioning added
A
-0.09[-0.26, 0.08]
-0.12[-0.31, 0.07]
-0.19[-0.36, -0.03]
0.04Δ 0.10
Comparison FramingFramed as superior to specific competitor
D
0.63[0.44, 0.83]
0.60[0.39, 0.84]
0.63[0.44, 0.83]
0.01Δ 0.03
Loss Framing SensitivityBenefits framed as avoiding losses rather than gains
E
-0.02[-0.22, 0.17]
-0.03[-0.23, 0.19]
-0.05[-0.25, 0.14]
0.01Δ 0.03
Effect Size Magnitude:
Strong (≥0.8)
Moderate (0.5–0.8)
Small (0.3–0.5)
Weak (0.1–0.3)
Negligible (<0.1)

Genome Summary

ModelProviderMean EffectTop Dimensions
Claude Sonnet 4.6Anthropic0.215dim_24, dim_04, dim_15
Gemini 3.1 ProGoogle-0.182dim_19, dim_22, dim_04
GPT-5.4Openai0.108dim_19, dim_25, dim_03
Llama 4 MaverickTogether0.066dim_19, dim_25, dim_17
o3Openai0.106dim_19, dim_25, dim_08
Perplexity Sonar ProPerplexity0.072dim_02, dim_05, dim_01
GPT-5.2Openai0.424dim_18, dim_15, dim_13
GPT-5.3Openai0.461dim_13, dim_19, dim_14
Gemini 2.0 FlashGoogle0.544dim_07, dim_13, dim_14

Cosine Similarity Matrix

Pairwise similarity scores between all 9 models. Diagonal cells (model vs itself) are shown in gray.

GPT-5.4o3Gemini 3.1 ProGemini 2.0 FlashClaude Sonnet 4.6Llama 4 MaverickPerplexity Sonar ProGPT-5.2GPT-5.3
GPT-5.4100%99%97%97%95%99%98%95%97%
o399%100%98%97%95%99%97%96%97%
Gemini 3.1 Pro97%98%100%95%94%97%95%94%95%
Gemini 2.0 Flash97%97%95%100%97%95%96%99%99%
Claude Sonnet 4.695%95%94%97%100%93%95%97%96%
Llama 4 Maverick99%99%97%95%93%100%96%93%95%
Perplexity Sonar Pro98%97%95%96%95%96%100%95%96%
GPT-5.295%96%94%99%97%93%95%100%99%
GPT-5.397%97%95%99%96%95%96%99%100%
Similarity Legend:
Very Similar (>90%)
Similar (80–90%)
Moderate (70–80%)
Different (<70%)