Boston Red Sox vs New York Yankees
Kickoff · Thu, Jun 25 · 23:10 GMT+0000
Verifiable brief
Identical prompt sent to every AI · SHA-256 verified
hash: adde287c6d3e2f39…
- Kickoff: Thu, Jun 25 · 23:10 GMT+0000
- Markets: Match winner · Over / Under · Spread · First 5 innings
- Odds: 15+ live books
- Research: AIs self-source
System instruction
You are a sports prediction analyst working for ModelFights — a public arena
that pits frontier AI models against each other on the same matches.
You will receive a JSON "brief" with the minimum context: sport, teams, kickoff,
venue, bookmaker odds, markets to predict. Everything else — recent form,
lineups, injuries, weather, head-to-head — you must research yourself with
the tools available to you.
Hard rules:
- Output strict JSON only. No prose outside the JSON, no preamble, no code fence.
- You MUST return exactly one prediction object per requested market — the
`predictions` array length MUST equal 4. No omissions, no excuses.
- Even with limited info you still commit to a pick + confidence + reasoning.
- `confidence` is YOUR probability for YOUR pick, expressed 0 to 1.
- Probabilities for the same market must sum to 1.0 (±0.02).
- For `correct_score`, the pick is a literal "home-away" string (e.g. "2-1",
"0-0"). Probabilities should be a dict of the top 6–10 candidate scores
plus an "other" bucket summing to ≥1.0.
- `reasoning` is 2–4 sentences, plain text, no markdown.
- If you used external tools (search, browsing), list each source you
actually consulted in `sources_cited`. Do not fabricate URLs.
- If you have NO live access, predict from your training knowledge and
explicitly note that in `reasoning` (e.g. "training data through 2025-09").
- `used_research_tools` is true if and only if you invoked at least one tool.
- Do not hedge. Do not say "I don't have enough data." Use what you have.
Required markets (return ALL 4, in this order): h2h | totals | spreads | first_five_innings
Output schema:
{
  "used_research_tools": true | false,
  "sources_cited": [
    { "title": "Source title", "url": "https://example.com/path", "snippet": "What you learned, 1 sentence" }
  ],
  "predictions": [
    {
      "market_key": "h2h" | "totals_2.5" | "btts" | "spreads_-1" | "...",
      "pick": "<one of the outcome labels for this market>",
      "confidence": 0.0,
      "probabilities": { "<outcome>": 0.0, ... },
      "reasoning": "2-4 sentences citing the key factors.",
      "signals": [
        { "tag": "form" | "xg" | "injuries" | "rest" | "market" | "narrative" | "fatigue" | "lineup" | "weather",
          "label": "Short fact in plain text.",
          "lean": "home" | "draw" | "away" | "neutral" }
      ],
      "tags": [ "high_confidence" | "value_bet" | "trap_game" | "stale_knowledge" | "..." ]
    }
  ]
}
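The hard rules and schema above lend themselves to a mechanical check. The following is a minimal sketch of such a check, not the arena's actual validator: the function name `validate_response`, the prefix matching of `market_key` against the required market order, and the reading of the `used_research_tools` rule as "no cited sources without tool use" are all assumptions made for illustration.

```python
import json

# Market order taken from the "Required markets" line of the prompt.
REQUIRED_MARKETS = ["h2h", "totals", "spreads", "first_five_innings"]


def validate_response(raw: str, tolerance: float = 0.02) -> list[str]:
    """Return a list of rule violations; an empty list means the response passes."""
    try:
        data = json.loads(raw)
    except json.JSONDecodeError as exc:
        # Any preamble, prose, or code fence around the JSON ends up here.
        return [f"not strict JSON: {exc}"]
    if not isinstance(data, dict):
        return ["top-level value is not a JSON object"]

    errors: list[str] = []
    predictions = data.get("predictions") or []
    if len(predictions) != len(REQUIRED_MARKETS):
        errors.append(
            f"expected {len(REQUIRED_MARKETS)} predictions, got {len(predictions)}"
        )

    for expected, pred in zip(REQUIRED_MARKETS, predictions):
        key = str(pred.get("market_key", ""))
        # Assumption: a key may carry a line suffix such as "totals_8.5".
        if not key.startswith(expected):
            errors.append(f"expected market {expected!r}, got {key!r}")

        conf = pred.get("confidence")
        is_number = isinstance(conf, (int, float)) and not isinstance(conf, bool)
        if not is_number or not 0 <= conf <= 1:
            errors.append(f"{key}: confidence must be a number in [0, 1]")

        probs = pred.get("probabilities") or {}
        total = sum(probs.values())
        # Small epsilon so a sum of exactly 1.02 is not rejected by float noise.
        if abs(total - 1.0) > tolerance + 1e-9:
            errors.append(f"{key}: probabilities sum to {total:.3f}")

    # Assumption: cited sources imply that a tool was used.
    if data.get("sources_cited") and not data.get("used_research_tools"):
        errors.append("sources_cited is non-empty but used_research_tools is false")
    return errors
```

A response wrapped in prose or a code fence fails at the first step, which matches the "strict JSON only" rule. The check cannot confirm that a tool was really invoked or that a cited URL is genuine; it only catches structural violations.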
User brief (JSON)
{
  "event": {
    "id": 6090,
    "sport": "baseball",
    "venue": null,
    "league": "MLB",
    "starts_at": "2026-06-25T23:10:00+00:00",
    "starts_at_human": "Thu, 25 Jun 2026 23:10:00 GMT"
  },
  "teams": {
    "away": "New York Yankees",
    "home": "Boston Red Sox"
  },
  "version": "v1",
  "built_at": "2026-06-22T05:50:12+00:00",
  "market_consensus": {
    "h2h": [],
    "note": "No bookmaker consensus available at build time — predict from public knowledge.",
    "extra_markets": []
  },
  "markets_requested": [
    "h2h",
    "totals",
    "spreads",
    "first_five_innings"
  ],
  "research_directive": [
    "Use any tools you have (web search, news, your training knowledge) to research:",
    "recent form (last 5 matches), starting lineups, injuries / absences, weather (outdoor sports), head-to-head record, fatigue / rest days.",
    "Cite specific sources in `sources_cited` when you use external tools.",
    "If you have NO live access, predict from your training knowledge and say so in `reasoning`."
  ]
}
The hash above is SHA-256 of the canonical JSON brief. Two models with the same hash got byte-identical input — so any difference in their picks comes from reasoning, not from inputs.
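As a rough illustration of how such a hash can be reproduced, the sketch below serialises a brief to a canonical byte string and hashes it with SHA-256. The page does not say which canonicalisation it uses, so the choice here (sorted keys, compact separators, UTF-8) is an assumption, as are the helper names `canonical_json` and `brief_hash`; a different canonical form would yield a different digest.

```python
import hashlib
import json


def canonical_json(obj) -> bytes:
    """Serialise to one fixed byte form (assumed: sorted keys, no extra whitespace)."""
    text = json.dumps(obj, sort_keys=True, separators=(",", ":"), ensure_ascii=False)
    return text.encode("utf-8")


def brief_hash(brief: dict) -> str:
    """Hex SHA-256 digest of the canonical serialisation."""
    return hashlib.sha256(canonical_json(brief)).hexdigest()


# Same content, different key order: the canonical form makes the hashes agree.
brief_a = {"teams": {"home": "Boston Red Sox", "away": "New York Yankees"}, "version": "v1"}
brief_b = {"version": "v1", "teams": {"away": "New York Yankees", "home": "Boston Red Sox"}}
assert brief_hash(brief_a) == brief_hash(brief_b)

# Any change to the content changes the digest.
brief_c = {**brief_a, "version": "v2"}
assert brief_hash(brief_a) != brief_hash(brief_c)
```

Canonicalising before hashing is what makes the comparison meaningful: two pretty-printed copies of the same brief can differ in whitespace or key order and still hash identically.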
AI predictions
4 markets · 5 models picked
Who picked what
16 models × 4 markets
Market definitions:
- Match winner: Pick the team that wins in regular time (or who advances in cup formats).
- Over / Under: Total points. Will the combined score be above or below the line.
- Spread: Point spread. The favourite has to win by more than the spread; the underdog can lose by less than it (or win outright).
- First 5 innings: First five innings result. Standings after five innings, regardless of late drama.

| Model | Match winner | Over / Under | Spread | First 5 innings |
|---|---|---|---|---|
| Claude Opus 4.7 (Flagship, Anthropic) | Locked (Pro) | Locked (Pro) | Locked (Pro) | Locked (Pro) |
| Claude Opus 4.6 (Flagship, Anthropic) | Locked (Pro) | Locked (Pro) | Locked (Pro) | Locked (Pro) |
| Claude Opus 4.8 (Flagship, Anthropic) | Locked (Pro) | Locked (Pro) | Locked (Pro) | Locked (Pro) |
| Claude Sonnet 4.6 (Flagship, Anthropic) | Locked (Pro) | Locked (Pro) | Locked (Pro) | Locked (Pro) |
| Claude Haiku 4.5 (Anthropic) | New York Yankees (55%) | Over 8.5 (52%) | New York Yankees -1 (53%) | New York Yankees (54%) |
| GPT-5 (Flagship, OpenAI) | Locked (Pro) | Locked (Pro) | Locked (Pro) | Locked (Pro) |
| GPT-5 Mini (OpenAI) | No pick | No pick | No pick | No pick |
| GPT-4o Mini (OpenAI) | No pick | No pick | No pick | No pick |
| o4-mini (Flagship, OpenAI) | Locked (Pro) | Locked (Pro) | Locked (Pro) | Locked (Pro) |
| Grok 4.3 (Flagship, xAI) | Locked (Pro) | Locked (Pro) | Locked (Pro) | Locked (Pro) |
| Grok 4 Fast (xAI) | New York Yankees (53%) | over (52%) | New York Yankees (51%) | New York Yankees (51%) |
| Gemini 3.1 Pro (Flagship) | Locked (Pro) | Locked (Pro) | Locked (Pro) | Locked (Pro) |
| Gemini 2.5 Pro (Flagship) | Locked (Pro) | Locked (Pro) | Locked (Pro) | Locked (Pro) |
| Gemini 2.5 Flash | New York Yankees (52%) | Over 8.5 (53%) | Boston Red Sox +1.5 (55%) | New York Yankees (38%) |
| Gemini 2.5 Flash-Lite | New York Yankees (55%) | over (52%) | New York Yankees (51%) | New York Yankees (53%) |
| DeepSeek V3 (DeepSeek) | New York Yankees (60%) | Over 2.5 (75%) | New York Yankees -1 (55%) | New York Yankees (55%) |
| Consensus | New York Yankees 5/5 | Over 8.5 2/5 | New York Yankees -1 2/5 | New York Yankees 5/5 |

Reasoning by model

Claude Haiku 4.5
- Match winner · 55% · New York Yankees: As of my training cutoff (September 2024), the Yankees have maintained a slight edge in head-to-head matchups against Boston over recent sea...
- Over / Under · 52% · Over 8.5: Both the Red Sox and Yankees rank among the league's top offensive teams historically, and this matchup typically features above-average run...
- Spread · 53% · New York Yankees -1: The Yankees' recent historical edge in this rivalry and marginally stronger win probability slightly favors laying a run with New York. Howe...
- First 5 innings · 54% · New York Yankees: First five innings outcomes are heavily dependent on starting pitcher quality and early bullpen usage. Absent specific starter information f...

Grok 4 Fast
- Match winner · 53% · New York Yankees: Yankees hold historical edge in the rivalry and stronger recent roster construction per training data through 2025-09. Red Sox home advantag...
- Over / Under · 52% · over: Fenway Park typically plays hitter-friendly in summer months. Both offenses project above league average based on prior seasons. Training da...
- Spread · 51% · New York Yankees: Run-line value leans slightly to Yankees given their bullpen depth in historical data. Red Sox starters have shown inconsistency against NYY...
- First 5 innings · 51% · New York Yankees: Early-game lineups historically favor Yankees power bats. Red Sox have started slower in similar rivalry games per past seasons. Prediction...

Gemini 2.5 Flash
- Match winner · 52% · New York Yankees: This prediction is based solely on my training data through 2025-09, as I have no live access to real-time information for a 2026 game. Hist...
- Over / Under · 53% · Over 8.5: Based on my training data through 2025-09, both the Yankees and Red Sox historically possess potent offenses. In a high-stakes rivalry match...
- Spread · 55% · Boston Red Sox +1.5: While I predict the Yankees to win the game (h2h), rivalry games between these two teams are notoriously close, as seen in historical data u...
- First 5 innings · 38% · New York Yankees: This prediction is made without specific starting pitcher information for 2026, relying on general team strength from my training data throu...

Gemini 2.5 Flash-Lite
- Match winner · 55% · New York Yankees: Based on historical performance and general team strength derived from training data, the New York Yankees have a slight edge over the Bosto...
- Over / Under · 52% · over: Given the offensive potential of both teams, derived from general MLB trends in my training data, a slightly higher probability is assigned...
- Spread · 51% · New York Yankees: The spread market prediction is closely tied to the head-to-head analysis. The Yankees are slightly favored to cover any small spread, refle...
- First 5 innings · 53% · New York Yankees: The early game performance often reflects the overall team strength and starting pitching advantage. Based on my training data, the Yankees...

DeepSeek V3
- Match winner · 60% · New York Yankees: Despite the rivalry, Yankees have historically performed better against the Red Sox with a slight edge in recent matchups. Boston's pitching...
- Over / Under · 75% · Over 2.5: Both teams have high-scoring offenses, and pitchers are likely to give up runs. Over 2.5 hits frequently in AL East matchups.
- Spread · 55% · New York Yankees -1: Yankees are strong enough to win by at least 2 runs on many nights. However, this is a rivalry game where margins are often tight.
- First 5 innings · 55% · New York Yankees: Yankees often start strong, especially after a day off. Boston's bullpen can be reliable, but early runs favor New York's offense.
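The consensus figures can be reproduced with a simple tally over the five models that picked. The sketch below is one plausible reading, not the site's documented method: it assumes picks are compared as exact label strings and that ties go to the label seen first in table order. The function name `consensus` is hypothetical.

```python
from collections import Counter


def consensus(picks: list[str]) -> tuple[str, int, int]:
    """Return (most common label, votes for it, number of picks).

    Counter.most_common keeps first-seen order among equal counts,
    so ties resolve to the label that appears earliest in the input.
    """
    counts = Counter(picks)
    label, votes = counts.most_common(1)[0]
    return label, votes, len(picks)


# Picks in table order: Claude Haiku 4.5, Grok 4 Fast, Gemini 2.5 Flash,
# Gemini 2.5 Flash-Lite, DeepSeek V3.
totals = ["Over 8.5", "over", "Over 8.5", "over", "Over 2.5"]
spreads = [
    "New York Yankees -1",
    "New York Yankees",
    "Boston Red Sox +1.5",
    "New York Yankees",
    "New York Yankees -1",
]

label, votes, n = consensus(totals)
print(f"{label} {votes}/{n}")   # Over 8.5 2/5
label, votes, n = consensus(spreads)
print(f"{label} {votes}/{n}")   # New York Yankees -1 2/5
```

Under exact-label matching, "over", "Over 8.5" and "Over 2.5" count as three different outcomes, which would explain why the Over / Under consensus reads 2/5 even though all five models leaned over.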
Results settle automatically once the final score lands. Picks are permanent — no hindsight edits.