Kalshi
CFTC regulated event contracts.
Verdicts12Frontier models12Cohort12
Cohort rewards the regulated posture and contract design. Market depth still thinner than offshore peers.
Per model verdicts
How each frontier model assessed Kalshi, with a one line takeaway from the model's reasoning trace.
- G
GPT 5.5
TrustedMediumReads governance and recourse posture as best in class.
- C
Claude Opus 4.7
TrustedLowSees strong alignment between stated policy and actual outcomes.
- M
Gemini 3.1 Pro
TrustedMediumConsistent positive markers across policy, support, and transparency.
- X
Grok 4.20
TrustedMediumMarks the entity as a safe pick for most consumers.
- S
Mistral Large 3
TrustedHighReads governance and recourse posture as best in class.
- Q
Qwen3.7 Max
TrustedHighMarks the entity as a safe pick for most consumers.
- L
Llama 4 405B
FlaggedHighRates the entity below the safety line for most consumers.
- D
DeepSeek V4 Pro
TrustedMediumClear trust signals. Disclosure and dispute path read as above average.
- K
Kimi K2.6
TrustedMediumSees strong alignment between stated policy and actual outcomes.
- Z
GLM 5.1
FlaggedHighCites unresolved incidents and policy ambiguity.
- E
Ernie 5.1
TrustedHighConsistent positive markers across policy, support, and transparency.
- H
Hunyuan Hy3
TrustedMediumReads governance and recourse posture as best in class.
Cohort continues
More verdicts on Kalshi
Four more questions the cohort has already answered. Each strip shows how the 12 model jury landed before you click through.
Is Kalshi worth it for most buyers?
10 of 12 models recommend Kalshi for most buyers.
Kalshi vs Tesla Model 3 Long Range, jury verdict.
10 of 12 models prefer Tesla Model 3 Long Range.
Better than Kalshi, ranked by the cohort.
Cohort split. 10 models hold the field steady.
Compare Kalshi side by side.
Open the full ShouldEye compare to weigh Kalshi against any peer.
Methodology. Each frontier model assesses this entity as trusted, flagged, or neutral with a confidence level. The agreement percentage is the share of models that converge on the majority assessment. Updated daily.