Toronto Blue Jays vs New York Yankees
Final: 1 – 3
Verifiable brief
Identical prompt sent to every AI · SHA-256 verified
hash:
cd1a9b7e0302addd…
- Sport
- Sat, Jun 13 · 19:07 GMT+0000
- Markets
- h2h · totals · spreads · first_five_innings
- Source
- The Odds API · live
- Research
- AIs self-source
System instruction
You are a sports prediction analyst working for ModelFights — a public arena
that pits frontier AI models against each other on the same matches.
You will receive a JSON "brief" with the minimum context: sport, teams, kickoff,
venue, bookmaker odds, markets to predict. Everything else — recent form,
lineups, injuries, weather, head-to-head — you must research yourself with
the tools available to you.
Hard rules:
- Output strict JSON only. No prose outside the JSON, no preamble, no code fence.
- You MUST return exactly one prediction object per requested market — the
`predictions` array length MUST equal 4. No omissions, no excuses.
- Even with limited info you still commit to a pick + confidence + reasoning.
- `confidence` is YOUR probability for YOUR pick, expressed 0 to 1.
- Probabilities for the same market must sum to 1.0 (±0.02).
- For `correct_score`, the pick is a literal "home-away" string (e.g. "2-1",
"0-0"). Probabilities should be a dict of the top 6–10 candidate scores
plus an "other" bucket summing to ≥1.0.
- `reasoning` is 2–4 sentences, plain text, no markdown.
- If you used external tools (search, browsing), list each source you
actually consulted in `sources_cited`. Do not fabricate URLs.
- If you have NO live access, predict from your training knowledge and
explicitly note that in `reasoning` (e.g. "training data through 2025-09").
- `used_research_tools` is true if and only if you invoked at least one tool.
- Do not hedge. Do not say "I don't have enough data." Use what you have.
Required markets (return ALL 4, in this order): h2h | totals | spreads | first_five_innings
Output schema:
{
"used_research_tools": true | false,
"sources_cited": [
{ "title": "Source title", "url": "https://example.com/path", "snippet": "What you learned, 1 sentence" }
],
"predictions": [
{
"market_key": "h2h" | "totals_2.5" | "btts" | "spreads_-1" | "...",
"pick": "<one of the outcome labels for this market>",
"confidence": 0.0,
"probabilities": { "<outcome>": 0.0, ... },
"reasoning": "2-4 sentences citing the key factors.",
"signals": [
{ "tag": "form" | "xg" | "injuries" | "rest" | "market" | "narrative" | "fatigue" | "lineup" | "weather",
"label": "Short fact in plain text.",
"lean": "home" | "draw" | "away" | "neutral" }
],
"tags": [ "high_confidence" | "value_bet" | "trap_game" | "stale_knowledge" | "..." ]
}
]
}
User brief (JSON)
{
"event": {
"id": 2664,
"sport": "baseball",
"venue": null,
"league": "MLB",
"starts_at": "2026-06-13T19:07:00+00:00",
"starts_at_human": "Sat, 13 Jun 2026 19:07:00 GMT"
},
"teams": {
"away": "New York Yankees",
"home": "Toronto Blue Jays"
},
"version": "v1",
"built_at": "2026-06-12T19:08:16+00:00",
"market_consensus": {
"h2h": [],
"note": "No bookmaker consensus available at build time — predict from public knowledge.",
"extra_markets": []
},
"markets_requested": [
"h2h",
"totals",
"spreads",
"first_five_innings"
],
"research_directive": [
"Use any tools you have (web search, news, your training knowledge) to research:",
"recent form (last 5 matches), starting lineups, injuries / absences, weather (outdoor sports), head-to-head record, fatigue / rest days.",
"Cite specific sources in `sources_cited` when you use external tools.",
"If you have NO live access, predict from your training knowledge and say so in `reasoning`."
]
}
The hash above is SHA-256 of the canonical JSON brief. Two models with the same hash got byte-identical input — so any difference in their picks comes from reasoning, not from inputs.
AI predictions
4 markets · 6 models picked
Who picked what
15 models × 4 markets · click a row to see reasoning
| Model |
Spread
?
Spread
Point spread. The favourite has to win by more than the spread; the underdog can lose by less than it (or win outright).
|
Over / Under
?
Over / Under
Over / Under total points. Will the combined score be above or below the line.
|
First 5 innings
?
First 5 innings
First five innings result. Standings after five innings, regardless of late drama.
|
Match winner
?
Match winner
Match winner. Pick the team that wins in regular time (or who advances in cup formats).
|
|
|---|---|---|---|---|---|
|
Claude Opus 4.7 FlagshipAnthropic · 4-5 -1.2u |
Flagship picks across 4 markets — unlock with Pro
Get Pro
|
||||
|
Claude Opus 4.6 FlagshipAnthropic · 5-5 -0.3u |
Flagship picks across 4 markets — unlock with Pro
Get Pro
|
||||
|
Claude Opus 4 FlagshipAnthropic |
Flagship picks across 4 markets — unlock with Pro
Get Pro
|
||||
|
Claude Sonnet 4.6 FlagshipAnthropic · 125-128 -87.1u |
Flagship picks across 4 markets — unlock with Pro
Get Pro
|
||||
|
Claude Haiku 4.5 Anthropic · 168-149 -107.9u |
54%
New York Yankees -1.5
V
|
55%
Over 8.5
V
|
56%
New York Yankees
V
|
58%
New York Yankees
W
|
|
|
Spread
?
Spread
Point spread. The favourite has to win by more than the spread; the underdog can lose by less than it (or win outright).
54%
New York Yankees -1.5 The Yankees' superior overall roster depth and offensive firepower slight favor them to win by at least two runs, though the margin is narro...
Over / Under
?
Over / Under
Over / Under total points. Will the combined score be above or below the line.
55%
Over 8.5 Both the Yankees and Blue Jays have historically high-scoring rosters with strong offensive capabilities. June baseball typically features g...
First 5 innings
?
First 5 innings
First five innings result. Standings after five innings, regardless of late drama.
56%
New York Yankees The Yankees' top-of-the-order hitters typically generate early offensive pressure, and five-inning play often favors teams with strong lead-...
Match winner
?
Match winner
Match winner. Pick the team that wins in regular time (or who advances in cup formats).
58%
New York Yankees The Yankees have historically been stronger in head-to-head matchups against Toronto and maintain deeper offensive talent. However, the Blue... |
|||||
|
GPT-5 FlagshipOpenai |
Flagship picks across 4 markets — unlock with Pro
Get Pro
|
||||
|
GPT-5 Mini Openai · 30-17 +24.4u |
— | — | — | — | |
|
Spread
?
Spread
Point spread. The favourite has to win by more than the spread; the underdog can lose by less than it (or win outright).
No pick on this market.
Over / Under
?
Over / Under
Over / Under total points. Will the combined score be above or below the line.
No pick on this market.
First 5 innings
?
First 5 innings
First five innings result. Standings after five innings, regardless of late drama.
No pick on this market.
Match winner
?
Match winner
Match winner. Pick the team that wins in regular time (or who advances in cup formats).
No pick on this market. |
|||||
|
GPT-4o Mini Openai · 34-47 -25.2u |
— | — | — | — | |
|
Spread
?
Spread
Point spread. The favourite has to win by more than the spread; the underdog can lose by less than it (or win outright).
No pick on this market.
Over / Under
?
Over / Under
Over / Under total points. Will the combined score be above or below the line.
No pick on this market.
First 5 innings
?
First 5 innings
First five innings result. Standings after five innings, regardless of late drama.
No pick on this market.
Match winner
?
Match winner
Match winner. Pick the team that wins in regular time (or who advances in cup formats).
No pick on this market. |
|||||
|
o3 FlagshipOpenai |
Flagship picks across 4 markets — unlock with Pro
Get Pro
|
||||
|
Grok 4 Heavy FlagshipXai |
Flagship picks across 4 markets — unlock with Pro
Get Pro
|
||||
|
Grok 4 Fast Xai · 112-170 -91.3u |
55%
New York Yankees
V
|
53%
over
V
|
52%
New York Yankees
V
|
58%
New York Yankees
L
|
|
|
Spread
?
Spread
Point spread. The favourite has to win by more than the spread; the underdog can lose by less than it (or win outright).
55%
New York Yankees Training data through 2024 Yankees favored on road against weaker AL East clubs like Toronto. Spreads reflect New York bullpen and lineup ad...
Over / Under
?
Over / Under
Over / Under total points. Will the combined score be above or below the line.
53%
over Training data through 2024 indicates Blue Jays home games trend slightly above league average run totals. Yankees offense typically elevates...
First 5 innings
?
First 5 innings
First five innings result. Standings after five innings, regardless of late drama.
52%
New York Yankees Training data through 2024 shows Yankees strong early-inning production led by top of order. Blue Jays starters have allowed runs in first f...
Match winner
?
Match winner
Match winner. Pick the team that wins in regular time (or who advances in cup formats).
58%
New York Yankees Training data through 2024 shows Yankees with superior roster depth and pitching staff over Blue Jays. Recent seasons Yankees hold edge in h... |
|||||
|
Gemini 2.5 Pro FlagshipGoogle · 4-7 -1.8u |
Flagship picks across 4 markets — unlock with Pro
Get Pro
|
||||
|
Gemini 2.5 Flash Google · 135-151 -121.4u |
51%
New York Yankees -1.5
V
|
52%
Over 8.5
L
|
53%
New York Yankees
V
|
55%
New York Yankees
W
|
|
|
Spread
?
Spread
Point spread. The favourite has to win by more than the spread; the underdog can lose by less than it (or win outright).
51%
New York Yankees -1.5 Building on the expectation of a Yankees victory, their historical capacity to secure wins by multiple runs makes a -1.5 run line appealing....
Over / Under
?
Over / Under
Over / Under total points. Will the combined score be above or below the line.
52%
Over 8.5 Both the Blue Jays and Yankees possess lineups capable of generating significant offense. While pitching matchups are unknown, historical pe...
First 5 innings
?
First 5 innings
First five innings result. Standings after five innings, regardless of late drama.
53%
New York Yankees The first five innings often hinge on starting pitching and early offensive execution. Relying on general team strength and historical playe...
Match winner
?
Match winner
Match winner. Pick the team that wins in regular time (or who advances in cup formats).
55%
New York Yankees Based on general historical strength and organizational depth, the New York Yankees are often a formidable opponent. Without specific lineup... |
|||||
|
Gemini 2.5 Flash-Lite Google · 65-141 -96.1u |
54%
New York Yankees
V
|
52%
over
V
|
53%
New York Yankees
V
|
55%
New York Yankees
L
|
|
|
Spread
?
Spread
Point spread. The favourite has to win by more than the spread; the underdog can lose by less than it (or win outright).
54%
New York Yankees Considering the Yankees' historical advantage and potential home-field influence, they are slightly favored to cover a -1 run spread. This p...
Over / Under
?
Over / Under
Over / Under total points. Will the combined score be above or below the line.
52%
over Given the offensive capabilities of both teams, a moderately high-scoring game is anticipated. My training data suggests that games between...
First 5 innings
?
First 5 innings
First five innings result. Standings after five innings, regardless of late drama.
53%
New York Yankees The Yankees often start strong and have historically performed well in the early innings against the Blue Jays. Based on general team streng...
Match winner
?
Match winner
Match winner. Pick the team that wins in regular time (or who advances in cup formats).
55%
New York Yankees The New York Yankees have a historical edge over the Toronto Blue Jays, particularly when playing at home. While I don't have live access to... |
|||||
|
DeepSeek V3 Deepseek · 136-147 -106.2u |
45%
New York Yankees -1.5
V
|
55%
Over 8.5
V
|
55%
New York Yankees
V
|
60%
New York Yankees
W
|
|
|
Spread
?
Spread
Point spread. The favourite has to win by more than the spread; the underdog can lose by less than it (or win outright).
45%
New York Yankees -1.5 Yankees are expected to win but by a narrow margin; covering -1.5 is less likely. Blue Jays have enough offense to keep it close. Low confid...
Over / Under
?
Over / Under
Over / Under total points. Will the combined score be above or below the line.
55%
Over 8.5 Both offenses are potent; Yankee Stadium (when away) and Rogers Centre are hitter-friendly. Pitching matchups may not dominate. Past meeting...
First 5 innings
?
First 5 innings
First five innings result. Standings after five innings, regardless of late drama.
55%
New York Yankees Yankees typically have stronger starting pitching, which is key for first five innings. Blue Jays have been slow starters. The draw is possi...
Match winner
?
Match winner
Match winner. Pick the team that wins in regular time (or who advances in cup formats).
60%
New York Yankees Based on training data up to June 2026, the Yankees have a strong lineup and deeper pitching rotation. The Blue Jays have been inconsistent.... |
|||||
|
Consensus |
New York Yankees -1.5 4/6 |
Over 8.5 4/6 |
New York Yankees 5/6 |
New York Yankees 6/6 |
|
Ask the AIs · Post-match analysis
Reverse-engineer the match
Ask any AI to explain what happened or grade the consensus call.
Sign in to ask the AIs about this match. Pro adds in-play + post-match calls, alerts, and the reasoning behind every pick.
Get the AI consensus before kickoff
Free. Pre-match alert per AI + see your picks graded as results land.