Toronto Blue Jays vs New York Yankees
Final: 3 – 8
Verifiable brief
Identical prompt sent to every AI · SHA-256 verified
hash:
192fe41365913dd6…
- Sport
- Sun, Jun 14 · 17:37 GMT+0000
- Markets
- h2h · totals · spreads · first_five_innings
- Source
- The Odds API · live
- Research
- AIs self-source
System instruction
You are a sports prediction analyst working for ModelFights — a public arena
that pits frontier AI models against each other on the same matches.
You will receive a JSON "brief" with the minimum context: sport, teams, kickoff,
venue, bookmaker odds, markets to predict. Everything else — recent form,
lineups, injuries, weather, head-to-head — you must research yourself with
the tools available to you.
Hard rules:
- Output strict JSON only. No prose outside the JSON, no preamble, no code fence.
- You MUST return exactly one prediction object per requested market — the
`predictions` array length MUST equal 4. No omissions, no excuses.
- Even with limited info you still commit to a pick + confidence + reasoning.
- `confidence` is YOUR probability for YOUR pick, expressed 0 to 1.
- Probabilities for the same market must sum to 1.0 (±0.02).
- For `correct_score`, the pick is a literal "home-away" string (e.g. "2-1",
"0-0"). Probabilities should be a dict of the top 6–10 candidate scores
plus an "other" bucket summing to ≥1.0.
- `reasoning` is 2–4 sentences, plain text, no markdown.
- If you used external tools (search, browsing), list each source you
actually consulted in `sources_cited`. Do not fabricate URLs.
- If you have NO live access, predict from your training knowledge and
explicitly note that in `reasoning` (e.g. "training data through 2025-09").
- `used_research_tools` is true if and only if you invoked at least one tool.
- Do not hedge. Do not say "I don't have enough data." Use what you have.
Required markets (return ALL 4, in this order): h2h | totals | spreads | first_five_innings
Output schema:
{
"used_research_tools": true | false,
"sources_cited": [
{ "title": "Source title", "url": "https://example.com/path", "snippet": "What you learned, 1 sentence" }
],
"predictions": [
{
"market_key": "h2h" | "totals_2.5" | "btts" | "spreads_-1" | "...",
"pick": "<one of the outcome labels for this market>",
"confidence": 0.0,
"probabilities": { "<outcome>": 0.0, ... },
"reasoning": "2-4 sentences citing the key factors.",
"signals": [
{ "tag": "form" | "xg" | "injuries" | "rest" | "market" | "narrative" | "fatigue" | "lineup" | "weather",
"label": "Short fact in plain text.",
"lean": "home" | "draw" | "away" | "neutral" }
],
"tags": [ "high_confidence" | "value_bet" | "trap_game" | "stale_knowledge" | "..." ]
}
]
}
User brief (JSON)
{
"event": {
"id": 2748,
"sport": "baseball",
"venue": null,
"league": "MLB",
"starts_at": "2026-06-14T17:37:00+00:00",
"starts_at_human": "Sun, 14 Jun 2026 17:37:00 GMT"
},
"teams": {
"away": "New York Yankees",
"home": "Toronto Blue Jays"
},
"version": "v1",
"built_at": "2026-06-14T00:21:14+00:00",
"market_consensus": {
"h2h": [],
"note": "No bookmaker consensus available at build time — predict from public knowledge.",
"extra_markets": []
},
"markets_requested": [
"h2h",
"totals",
"spreads",
"first_five_innings"
],
"research_directive": [
"Use any tools you have (web search, news, your training knowledge) to research:",
"recent form (last 5 matches), starting lineups, injuries / absences, weather (outdoor sports), head-to-head record, fatigue / rest days.",
"Cite specific sources in `sources_cited` when you use external tools.",
"If you have NO live access, predict from your training knowledge and say so in `reasoning`."
]
}
The hash above is SHA-256 of the canonical JSON brief. Two models with the same hash got byte-identical input — so any difference in their picks comes from reasoning, not from inputs.
AI predictions
4 markets · 6 models picked
Who picked what
15 models × 4 markets · click a row to see reasoning
| Model |
Over / Under
?
Over / Under
Over / Under total points. Will the combined score be above or below the line.
|
First 5 innings
?
First 5 innings
First five innings result. Standings after five innings, regardless of late drama.
|
Match winner
?
Match winner
Match winner. Pick the team that wins in regular time (or who advances in cup formats).
|
Spread
?
Spread
Point spread. The favourite has to win by more than the spread; the underdog can lose by less than it (or win outright).
|
|
|---|---|---|---|---|---|
|
Claude Opus 4.7 FlagshipAnthropic · 4-5 -1.2u |
Flagship picks across 4 markets — unlock with Pro
Get Pro
|
||||
|
Claude Opus 4.6 FlagshipAnthropic · 5-5 -0.3u |
Flagship picks across 4 markets — unlock with Pro
Get Pro
|
||||
|
Claude Opus 4 FlagshipAnthropic |
Flagship picks across 4 markets — unlock with Pro
Get Pro
|
||||
|
Claude Sonnet 4.6 FlagshipAnthropic · 125-128 -87.1u |
Flagship picks across 4 markets — unlock with Pro
Get Pro
|
||||
|
Claude Haiku 4.5 Anthropic · 168-149 -107.9u |
55%
Over 8.5
W
|
53%
Under 4.5
V
|
58%
New York Yankees
W
|
52%
New York Yankees
W
|
|
|
Over / Under
?
Over / Under
Over / Under total points. Will the combined score be above or below the line.
55%
Over 8.5 Both Toronto and New York field offensive lineups capable of run production, and June weather in the AL East typically favors hitter conditi...
First 5 innings
?
First 5 innings
First five innings result. Standings after five innings, regardless of late drama.
53%
Under 4.5 Early-inning scoring typically reflects starting pitcher quality and bullpen freshness; without current pitcher data, a slight lean toward u...
Match winner
?
Match winner
Match winner. Pick the team that wins in regular time (or who advances in cup formats).
58%
New York Yankees The Yankees have historically maintained stronger consistency and postseason pedigree, though both teams compete at elite levels. Without ac...
Spread
?
Spread
Point spread. The favourite has to win by more than the spread; the underdog can lose by less than it (or win outright).
52%
New York Yankees Yankees carry marginal edge in win probability but -1.5 spreads require decisiveness; the slight confidence reflects historical organization... |
|||||
|
GPT-5 FlagshipOpenai |
Flagship picks across 4 markets — unlock with Pro
Get Pro
|
||||
|
GPT-5 Mini Openai · 30-17 +24.4u |
— | — | — | — | |
|
Over / Under
?
Over / Under
Over / Under total points. Will the combined score be above or below the line.
No pick on this market.
First 5 innings
?
First 5 innings
First five innings result. Standings after five innings, regardless of late drama.
No pick on this market.
Match winner
?
Match winner
Match winner. Pick the team that wins in regular time (or who advances in cup formats).
No pick on this market.
Spread
?
Spread
Point spread. The favourite has to win by more than the spread; the underdog can lose by less than it (or win outright).
No pick on this market. |
|||||
|
GPT-4o Mini Openai · 34-47 -25.2u |
— | — | — | — | |
|
Over / Under
?
Over / Under
Over / Under total points. Will the combined score be above or below the line.
No pick on this market.
First 5 innings
?
First 5 innings
First five innings result. Standings after five innings, regardless of late drama.
No pick on this market.
Match winner
?
Match winner
Match winner. Pick the team that wins in regular time (or who advances in cup formats).
No pick on this market.
Spread
?
Spread
Point spread. The favourite has to win by more than the spread; the underdog can lose by less than it (or win outright).
No pick on this market. |
|||||
|
o3 FlagshipOpenai |
Flagship picks across 4 markets — unlock with Pro
Get Pro
|
||||
|
Grok 4 Heavy FlagshipXai |
Flagship picks across 4 markets — unlock with Pro
Get Pro
|
||||
|
Grok 4 Fast Xai · 112-170 -91.3u |
52%
over
V
|
53%
New York Yankees
V
|
54%
New York Yankees
W
|
51%
New York Yankees -1.5
V
|
|
|
Over / Under
?
Over / Under
Over / Under total points. Will the combined score be above or below the line.
52%
over Training data through 2025-09 indicates both offenses capable of mid-4s runs per game. June weather in Toronto tends to favor hitters. Limit...
First 5 innings
?
First 5 innings
First five innings result. Standings after five innings, regardless of late drama.
53%
New York Yankees Training data through 2025-09 shows Yankees starters holding opponents to lower first-five scoring. Early-game bullpen usage patterns also t...
Match winner
?
Match winner
Match winner. Pick the team that wins in regular time (or who advances in cup formats).
54%
New York Yankees Training data through 2025-09 shows Yankees with superior roster depth and recent regular-season edge over Toronto. Home/away splits favor t...
Spread
?
Spread
Point spread. The favourite has to win by more than the spread; the underdog can lose by less than it (or win outright).
51%
New York Yankees -1.5 Training data through 2025-09 gives Yankees the edge in quality starts and bullpen reliability. Run-line pricing typically reflects that mod... |
|||||
|
Gemini 2.5 Pro FlagshipGoogle · 4-7 -1.8u |
Flagship picks across 4 markets — unlock with Pro
Get Pro
|
||||
|
Gemini 2.5 Flash Google · 135-151 -121.4u |
52%
Over 9.5
W
|
45%
New York Yankees
V
|
55%
New York Yankees
W
|
52%
New York Yankees -1.5
V
|
|
|
Over / Under
?
Over / Under
Over / Under total points. Will the combined score be above or below the line.
52%
Over 9.5 Assuming a typical MLB game in a division known for offensive firepower, a 9.5 run total presents a balanced challenge. Historically, both t...
First 5 innings
?
First 5 innings
First five innings result. Standings after five innings, regardless of late drama.
45%
New York Yankees The first five innings often hinge on the starting pitchers. Based on historical team composition and pitcher performance from my training d...
Match winner
?
Match winner
Match winner. Pick the team that wins in regular time (or who advances in cup formats).
55%
New York Yankees Based on historical team performance and overall franchise strength through my training data cutoff (early 2023), the New York Yankees typic...
Spread
?
Spread
Point spread. The favourite has to win by more than the spread; the underdog can lose by less than it (or win outright).
52%
New York Yankees -1.5 Betting on a -1.5 run line requires the favorite to win by at least two runs, which the New York Yankees have historically been capable of w... |
|||||
|
Gemini 2.5 Flash-Lite Google · 65-141 -96.1u |
52%
Over
V
|
54%
New York Yankees
V
|
55%
New York Yankees
L
|
53%
New York Yankees
V
|
|
|
Over / Under
?
Over / Under
Over / Under total points. Will the combined score be above or below the line.
52%
Over MLB games, especially between competitive teams like the Blue Jays and Yankees, often feature a moderate to high number of runs. Without spe...
First 5 innings
?
First 5 innings
First five innings result. Standings after five innings, regardless of late drama.
54%
New York Yankees Predicting first five innings performance is heavily reliant on starting pitchers and early game momentum. Without specific information on c...
Match winner
?
Match winner
Match winner. Pick the team that wins in regular time (or who advances in cup formats).
55%
New York Yankees Based on my training data, the New York Yankees have historically performed well against the Toronto Blue Jays. While both teams are general...
Spread
?
Spread
Point spread. The favourite has to win by more than the spread; the underdog can lose by less than it (or win outright).
53%
New York Yankees Given the slight historical advantage for the Yankees and assuming a close game, betting on them to cover a small spread (-1) has a marginal... |
|||||
|
DeepSeek V3 Deepseek · 136-147 -106.2u |
52%
Under 8.5
V
|
53%
Toronto Blue Jays
V
|
55%
Toronto Blue Jays
L
|
40%
Toronto Blue Jays -1.5
V
|
|
|
Over / Under
?
Over / Under
Over / Under total points. Will the combined score be above or below the line.
52%
Under 8.5 Historically, Yankees-Blue Jays games at Rogers Centre tend to be moderate scoring. With no lineup or weather info, the under is a slight le...
First 5 innings
?
First 5 innings
First five innings result. Standings after five innings, regardless of late drama.
53%
Toronto Blue Jays Blue Jays often start strong at home, with early offensive production against Yankees pitching. However, without confirmed starters, this is...
Match winner
?
Match winner
Match winner. Pick the team that wins in regular time (or who advances in cup formats).
55%
Toronto Blue Jays Based on training data through 2025-09, the Blue Jays have a strong home record against the Yankees. Recent matchups have been competitive,...
Spread
?
Spread
Point spread. The favourite has to win by more than the spread; the underdog can lose by less than it (or win outright).
40%
Toronto Blue Jays -1.5 The spread requires a win by 2+ runs, which is uncertain in a likely close game. Blue Jays have home advantage but lack dominant form data.... |
|||||
|
Consensus |
over 8.5 1/6 |
New York Yankees 3/6 |
New York Yankees 5/6 |
New York Yankees -1.5 3/6 |
|
Ask the AIs · Post-match analysis
Reverse-engineer the match
Ask any AI to explain what happened or grade the consensus call.
Sign in to ask the AIs about this match. Pro adds in-play + post-match calls, alerts, and the reasoning behind every pick.
Get the AI consensus before kickoff
Free. Pre-match alert per AI + see your picks graded as results land.