TUTORIAL

World Cup 2026 Predictions: Build an ML Model from Historical Data

Build a machine-learning model to predict the 2026 FIFA World Cup. Python + scikit-learn + historical World Cup data via API.

Published May 9, 2026Updated May 13, 20267 min read

Every World Cup brings out the predictors. Banks, betting markets, FiveThirtyEight-style stats sites, individual data scientists on Twitter — everyone publishes their bracket and inevitably gets ~half of it wrong. The math is hard, the sample sizes are small, and there's a lot of noise.

This tutorial walks through building a credible (not perfect) ML-based predictor for the 2026 FIFA World Cup using Python, scikit-learn, and historical World Cup data via REST API.

What we're predicting

Three things at increasing difficulty:

Per-match result (home win / draw / away win) — a multi-class classification problem
Per-match goals (home goals, away goals) — a Poisson regression problem
Tournament winner — Monte Carlo simulation built on top of the per-match model

We'll focus on (1) and use the same model to bootstrap (2) and (3).

Step 1: Pull historical World Cup data

import requests
import pandas as pd

API = 'https://api.thestatsapi.com/api/football/matches'
HEADERS = {'Authorization': 'Bearer YOUR_API_KEY'}

# Available modern World Cup seasons from /football/competitions/{id}/seasons
all_matches = []
season_ids = ["sn_29792", "sn_64936", "sn_6107", "sn_20655", "sn_844385", "sn_624440", "sn_326766"]

for season_id in season_ids:
    r = requests.get(
        API,
        params={'competition_id': COMPETITION_ID, 'season_id': season_id, 'status': 'finished', 'per_page': 100},
        headers=HEADERS,
    )
    all_matches.extend(r.json()['data'])

df = pd.DataFrame([{
    'season_id': m['season_id'],
    'home_team': m['home_team']['name'],
    'away_team': m['away_team']['name'],
    'home_team_id': m['home_team']['id'],
    'away_team_id': m['away_team']['id'],
    'home_score': m['score']['home'],
    'away_score': m['score']['away'],
    'stage': m.get('stage_name'),
} for m in all_matches])

print(f"Loaded {len(df)} historical matches")
# ~520 matches across 9 World Cups

Step 2: Build features

Football match prediction features fall into a few buckets:

Team strength — FIFA rank, Elo rating, recent form
Squad quality — average market value of the starting XI
Stage — group-stage games have different dynamics from knockouts
Rest — days since last match (fatigue matters in compressed tournaments)
Geography — distance travelled, time zone differences from home

For a first model, FIFA rank + recent form is enough:

# Add external Elo or FIFA rank snapshots from your own dataset.
# TheStatsAPI supplies match/team IDs; keep the rank source separate.
ratings = pd.read_csv('team_ratings_by_date.csv')
df = df.merge(ratings, left_on=['season_id', 'home_team_id'], right_on=['season_id', 'team_id'], how='left')
df = df.rename(columns={'rating': 'home_rating'}).drop(columns=['team_id'])
df = df.merge(ratings, left_on=['season_id', 'away_team_id'], right_on=['season_id', 'team_id'], how='left')
df = df.rename(columns={'rating': 'away_rating'}).drop(columns=['team_id'])

df['rating_diff'] = df['home_rating'] - df['away_rating']
df['result'] = df.apply(lambda r: 'H' if r['home_score'] > r['away_score'] else 'A' if r['home_score'] < r['away_score'] else 'D', axis=1)

Step 3: Train a classifier

from sklearn.model_selection import train_test_split
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import classification_report, log_loss

features = ['rating_diff', 'home_rating', 'away_rating']
X = df[features].dropna()
y = df.loc[X.index, 'result']

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

model = GradientBoostingClassifier(n_estimators=200, max_depth=3, random_state=42)
model.fit(X_train, y_train)

y_pred = model.predict(X_test)
y_proba = model.predict_proba(X_test)
print(classification_report(y_test, y_pred))
print(f"Log loss: {log_loss(y_test, y_proba):.3f}")

A simple model like this typically gets to ~50% accuracy (vs ~33% baseline for "always home win") and log loss around 1.0. That's enough to outperform random guessing, but won't beat the closing line at Pinnacle.

Step 4: Add xG-based features

The biggest single improvement: include each team's xG form from recent matches in competitions you cover. Use /football/matches to find recent matches by team_id, then call /football/matches/{match_id}/stats for matches where xg_available is true:

def recent_xg_form(team_id, date_from, date_to):
    matches = requests.get(
        API,
        params={'team_id': team_id, 'date_from': date_from, 'date_to': date_to, 'status': 'finished', 'per_page': 20},
        headers=HEADERS,
    ).json()['data']

    xg_for = []
    xg_against = []
    for match in matches:
        if not match.get('xg_available'):
            continue
        stats = requests.get(f"{API}/{match['id']}/stats", headers=HEADERS).json()['data']
        is_home = match['home_team']['id'] == team_id
        xg = stats['overview']['expected_goals']['all']
        xg_for.append(xg['home'] if is_home else xg['away'])
        xg_against.append(xg['away'] if is_home else xg['home'])

    avg_xg = sum(xg_for) / len(xg_for)
    avg_xg_against = sum(xg_against) / len(xg_against)
    return avg_xg, avg_xg_against

# Add as features for your prediction date window
df['home_recent_xg'], df['home_recent_xga'] = zip(*df.apply(
    lambda r: recent_xg_form(r['home_team_id'], '2025-01-01', '2026-06-01'), axis=1
))

Now retrain. Expect a 5-10% accuracy boost.

Step 5: Predict the 2026 tournament

Pull the 2026 fixtures (with placeholders for knockout matches):

fixtures_2026 = requests.get(
    API,
    params={'competition_id': COMPETITION_ID, 'season_id': SEASON_ID_2026, 'per_page': 100},
    headers=HEADERS,
).json()['data']

# For each group-stage match, predict win/draw/away probability
group_matches = [f for f in fixtures_2026 if f.get('group_label')]

predictions = []
for f in group_matches:
    home = f['home_team']['id']
    away = f['away_team']['id']
    home_rating = current_team_rating(home)
    away_rating = current_team_rating(away)

    features = pd.DataFrame([{
        'rating_diff': home_rating - away_rating,
        'home_rating': home_rating,
        'away_rating': away_rating,
    }])
    proba = model.predict_proba(features)[0]
    predictions.append({
        'match': f"{f['home_team']['name']} vs {f['away_team']['name']}",
        'p_home': proba[list(model.classes_).index('H')],
        'p_draw': proba[list(model.classes_).index('D')],
        'p_away': proba[list(model.classes_).index('A')],
    })

print(pd.DataFrame(predictions).head(10))

Step 6: Monte Carlo the tournament

Once you can predict per-match win probabilities, simulate the entire tournament 10,000 times:

import random
import numpy as np

def simulate_tournament(model, fixtures, n_simulations=10000):
    champion_counts = {}
    for _ in range(n_simulations):
        # Simulate group stage → standings → R32 → ... → Final
        # Track who wins the final
        winner = simulate_one_run(model, fixtures)
        champion_counts[winner] = champion_counts.get(winner, 0) + 1

    return {team: count / n_simulations for team, count in champion_counts.items()}

odds = simulate_tournament(model, fixtures_2026)
for team, prob in sorted(odds.items(), key=lambda x: -x[1])[:10]:
    print(f"{team}: {prob:.1%}")

Step 7: Validate against bookmaker odds

The acid test for any prediction model is whether it beats the closing line at Pinnacle:

def compare_to_pinnacle(predictions, fixture_id):
    r = requests.get(f'https://api.thestatsapi.com/api/football/matches/{fixture_id}/odds', headers=HEADERS)
    pinnacle = next((b for b in r.json()['data']['bookmakers'] if b['bookmaker'] == 'Pinnacle'), None)
    if not pinnacle: return None

    match_odds = pinnacle['markets']['match_odds']
    p_market = {
        'H': 1 / float(match_odds['home']['last_seen']),
        'D': 1 / float(match_odds['draw']['last_seen']),
        'A': 1 / float(match_odds['away']['last_seen']),
    }
    # Remove vig
    overround = sum(p_market.values())
    p_market = {k: v / overround for k, v in p_market.items()}

    return {
        'model': predictions,
        'market': p_market,
        'edge_home': predictions['p_home'] - p_market['H'],
    }

If your model consistently disagrees with Pinnacle by more than a few percentage points and you're right more often than them, you've built something genuinely sharp. Most models are not — and that's fine for a fun side project.

What our free predictor tool does

Our free Poisson Score Predictor tool implements a much simpler version of this: input two teams' expected goals, get a score distribution out. It's pure client-side, takes 10 seconds to use, and is great for one-off questions.

Honest expectations

Random baseline: ~33% accuracy
"Always pick the higher-ranked team": ~45% accuracy
Decent ML model with FIFA rank + xG features: ~50-55% accuracy
Pinnacle closing line implied: ~55-58% accuracy
Beating Pinnacle: very hard

Tournament predictions are noisier than per-match predictions. Even a good model will get the winner wrong 60-70% of the time — the underdog factor is real.

Choose the Right Prediction Data API

If you are comparing providers before building your model, read the best football API for prediction models guide. It covers raw-data APIs versus prediction APIs, historical results, xG, shotmaps, odds, live odds movement, and explainability trade-offs.

Frequently Asked Questions

What's the minimum dataset I need?

About 200+ historical matches is enough to fit a simple gradient-boosted classifier without overfitting. The World Cup has ~520 matches since 1990, plus thousands of qualifying and friendly matches that can be used to learn team-strength priors.

Should I use neural networks?

For football prediction with this much data, gradient-boosted trees (XGBoost, LightGBM) typically beat neural networks. Football outcomes are noisy and small datasets favour simpler models with strong regularisation.

How do I handle teams that haven't played each other?

Use FIFA rank, Elo rating, or xG-based team strength as bridging features. The model doesn't need a direct head-to-head history — relative strength is enough.

Can I get betting markets back-tested?

Yes — historical Pinnacle closing lines are available via the API for matches going back several years. Compare your model's predictions against the closing line on each match and see whether you've achieved CLV.

Is this a get-rich-quick scheme?

No. Even good models that occasionally beat the line lose money to the vig over time. Treat this as a fun analytics project, not a guaranteed income.

What features matter most?

In our testing: relative team strength (Elo or FIFA rank), recent xG form (last 10 matches), squad market value, days of rest, and travel distance. Stage of tournament also matters — knockout games have different draw probabilities than group games.

How do I evaluate my model fairly?

Use temporal cross-validation: train on World Cups 1990-2014, validate on 2018, test on 2022. Don't use random splits — that leaks information across years.

Start building today

Ready to Power Your Sports App?

Start your 7-day free trial. All endpoints included on every plan.

Start Your Free Trial View Pricing

Cancel anytime

7-day free trial

Setup in 5 minutes