r/sportsanalytics

How Norway is secretly using AI and centralized biometric data to rewrite player development

Hey guys,

I just finished a tactical deep dive into Norway’s Centralized Intelligence Platform (NCIP) and wanted to share some thoughts on how they are changing youth development.

Instead of coaches blindly guessing player fatigue, they’ve synchronized biometric data from academies all the way up to the national team. It tracks wearable sensors, exhaustion levels, and even tactical heatmaps in real-time to prevent burnout and maximize talent like Haaland and Ødegaard.

I made a full video breakdown covering how this ecosystem functions and why it might be the future blueprint for smaller footballing nations.

Would love to hear your thoughts on this. Is relying this heavily on data ruining the natural instinct of the game, or is it a necessary evolution?

Full video breakdown: https://youtu.be/RosyLxn1fZc

u/IBosuke — 2 hours ago

▲ 2 r/sportsanalytics+1 crossposts

Does anyone know what tools data are used in college basketball analytics?

I've been following my local college women's basketball team for a few years and last year ran some analytics just to understand what they do best and how they work best, since they lost a lot of games in similar fashion. I looked on their website and they don't have an analyst so I thought I could provide them some information on a volunteer basis as a hobby.

Does anyone here know what sort of tools college programs use? So far I hand built some lineup data and some playcalling data but I don't know what the next step should be as far as what would be useful.

Does anyone have any insight?

reddit.com

u/Tiny_Spread5712 — 10 hours ago

▲ 28 r/sportsanalytics

At this point this sub could be renamed as r sportsbetting. And i think its boring.

Literally 90% of content is just "my model found a 5% edge at this game". Most of these have literally zero added value to the community, to the world. And its always just the same claude coded elo model that everyone has already made.

I get it, these are fun. But now they just feel like a huge pointless flood. I would prefer more innovation, even if it would result in less posts. Even if the quality of the posts would decrease. Seeing people try to make something new is what i wish for. And maybe just some general talk about it.

Am i overreacting? What do you think?

reddit.com

u/Next-Abbreviations76 — 12 hours ago

▲ 4 r/sportsanalytics

Brazil vs Norway (WC) — my model has Brazil at 56% vs market's 52%. Full breakdown + why the BTTS line looks interesting

Model output for tonight, based on a 94-match sample of comparable fixtures:

Match result: Brazil 56% / Draw 27% / Norway 17%
Market implied (1.83 / 3.60 / 4.50, normalized): 52% / 27% / 21%
→ Model is ~4pts higher on Brazil than the market, and correspondingly lower on Norway.

Goals: BTTS yes 59% (market ~56%), Over 2.5 at 55% (market agrees), Brazil to score 86%, most likely band 2–3 goals (50%).

The read: market and model agree this is Brazil's game — the small disagreement is how much chance Norway really has. Model thinks the Norway price is short of value despite the big number.

Posting this before kickoff so it's verifiable either way. I'll reply with the result tonight — including if it ages badly.

Not betting advice, just model output vs market. Happy to answer questions about the methodology.

u/proofles — 13 hours ago

▲ 0 r/sportsanalytics

What's the biggest mistake that ruined one of your football predictions?

I've realized that the matches I got wrong taught me much more than the ones I got right.

For me, the biggest mistakes were things like:

• Trusting recent form too much.

• Ignoring injuries.

• Overrating head-to-head records.

• Letting emotions influence the prediction.

I'm curious...

What's the biggest lesson you've learned after getting a football prediction completely wrong?

reddit.com

u/HDvideoNature — 11 hours ago

▲ 6 r/sportsanalytics+3 crossposts

Your personalized sports feed - no ads, no gambling, no "content" etc--just sports.

I had a hard time remembering when/how to watch all the games I care about so I made an app that aggregates many different sports and allows me to create a custom feed of all the stuff I care about.

TV/Streaming info in the future (scroll down) and highlights in the past (scroll up). It's a simple idea but it works great for me. I'd love any feedback you might have. Thank you so much.

SportsBot.io

u/montgomeryLCK — 1 day ago

▲ 3 r/sportsanalytics

Looking for a reliable football data provider that won’t break the bank

I’ve been using API-Football for the past year to power my iPhone live score app. Overall, I’ve been pretty happy with it. The quality-to-price ratio has been good for a small indie project.

However, over the last couple of months I’ve noticed a significant drop in data quality. Missing or delayed events, inconsistent statistics, and overall reliability have become a real issue. On top of that, support doesn’t seem to be what it used to be.

So I’m starting to look at alternatives.

Can anyone recommend a football data provider that offers reliable live data without enterprise-level pricing?

My main focus is the top European leagues (Premier League, La Liga, Serie A, Bundesliga, Ligue 1) as well as the Nordic leagues (Allsvenskan, Eliteserien, Danish Superliga, Veikkausliiga, etc.).

I’d also love to hear about your real-world experiences regarding data quality, uptime, and support, not just feature lists.

reddit.com

u/blytung — 1 day ago

▲ 21 r/sportsanalytics+1 crossposts

I modelled how much the Azteca actually changes England–Mexico

I wanted to move beyond the general claim that the Azteca is “a difficult place to play” and estimate how much the setting should actually change the probability of England or Mexico progressing.

I fitted a simple independent Poisson model to the opening 90-minute market, then constructed a neutral-venue counterfactual by removing an assumed combined adjustment for:

altitude and acclimatisation
crowd pressure
familiarity with the pitch, ball and venue
travel and recovery
weather

My central scenario estimates:

Neutral venue
England 68%
Mexico 32%

At the Azteca
England 58%
Mexico 42%

So the combined venue effect is worth roughly ten percentage points to Mexico.

The interesting part is that current best prices imply closer to 45.5% for Mexico, suggesting that the market may now be pricing an even larger Azteca effect than the model does.

The assumptions, sensitivity scenarios and full workbook are all public: https://substack.com/home/post/p-204909017

I’d be interested in criticism of the neutral counterfactual and the +0.35 expected-goal-difference central assumption.

u/silverkinger — 2 days ago

▲ 5 r/sportsanalytics

Analyzing 94 Years of FIFA World Cup Data: 3 "Absolute Laws" of Tournament Champions

Hey everyone,
I’ve spent the last few weeks compiling and analyzing the historical data of every FIFA World Cup since 1930 to identify if traditional tournament tropes (like home advantage, squad value, or luck) actually hold statistical weight.
Instead of those common narratives, the data revealed 3 specific regression/pattern laws that have maintained a 100% repetition rate across all 22 tournaments with zero exceptions:
1. The Squad Age Demographic Window (25-29): If you map out the mean squad age of World Cup champions, they fit strictly into a 25.00 to 29.00 age bracket. The absolute historical mean is 26.91. Even the strict statistical outliers—the ultra-youthful 2010 Spain squad (25.00) and the veteran 2006 Italy squad (28.80)—stayed within this exact window. Squads outside this demographic distribution simply do not win.
2. The Domestic Manager Correlation: In 94 years of data, a foreign manager has a 0% success rate at winning the tournament. 100% of the winning coaches held citizenship of the nation they represented. Out of hundreds of foreign managers in history, only two ever managed to reach a final (1958 and 1978)—and both lost.
3. The Total Goals Scored Fallacy: There is a weak statistical correlation between being the highest-scoring team in a tournament and actually winning it. In fact, less than 50% of World Cups were won by the team with the most goals. A perfect data point is 2018: Belgium’s attack was highly efficient, scoring a tournament-high 16 goals but finishing 3rd, while France won the tournament with 14 goals due to superior defensive variance and efficiency in the 8-game knockout format.
I wanted to visualize this data properly, so I mapped out the datasets and created a clean 2D data-visualization and animated breakdown on my new channel, Fabled Football.
If you are into sports analytics, data modeling in football, or tournament statistics, I'd highly appreciate your feedback on the video and these specific datasets:

https://youtu.be/xjsFTFmOP9o?si=1YKKxWUnK-JAYTnA

Do you think these metrics are mathematically absolute due to the short-form nature of a 7-game tournament format, or will we see a deviation in 2026? Let's discuss the analytics.

u/Mec17_ — 2 days ago

▲ 6 r/sportsanalytics

Should We Trust FIFA’s Math?

We wrote an analysis of the Croatia-Portugal offside controversy, looking at the physics, the IMU inside the World Cup ball, and the general limitations of the Trionda.
We’d love feedback :)

medium.com

u/Proof_wasted — 2 days ago

▲ 5 r/sportsanalytics+3 crossposts

Germany's Football Decline Is the Price of Its Social Progress

The article: between 1954 and 1990, West Germany played with an insane, existential urgency because of the heavy burden of WWII history and the division of their country. That immense pressure to demonstrate pride and honor forged this unbreakable "football steel" - a desperate hunger to prove themselves to the world.

But after reunification in 1990, that deep historical pain started to fade. Germany became a comfortable, prosperous, post-historical, and much more inclusive, diverse society. Because the country is doing so well and history stopped hurting, they've lost that raw, "fire-in-the-belly" edge.

I think it's a good theory to be considered. Before unification 3/10 World Cup Championships, 8/10 final 4 appearances. After unification, especially the last 3 world cups, well, it's not the same.

I think there is a sociological element to sports and I thought I would offer this for consideration.

Addendum: I did not think a sports article which I felt was interesting was going to draw so much hatred and venom. If the mods don't think their readers can handle a new idea without showing hatred and spewing insults, please delete the article and ban me from this subreddit.

backpagefootball.com

u/gubernatus — 2 days ago

▲ 0 r/sportsanalytics

Is 3-0 for Argentina too optimistic, or does it feel about right?

I've pretty much settled on Egypt, Argentina, and Colombia to win. I think the other two matches could still go either way, but Argentina is the one I feel most confident about. The only thing I'm still debating isn't whether they'll win, it's by how many.

Argentina have looked better with every game. Messi missed one match and is still tied with Mbappé for the Golden Boot. I don't think they'll start playing conservatively if they go one goal up. A big win would give them even more confidence heading into the knockout stage, so I expect them to keep attacking if the game opens up. I wouldn't be surprised if it ends 3-0 or even 4-0.

But Cape Verde is what makes me hesitate. They showed against Spain and Uruguay that they're well organized defensively, and they aren't completely harmless going forward either. I keep wondering whether they can find one goal because if they do, the whole game could look very different.

On the other hand, it's hard for me to believe they'll create many clear chances against Argentina's defense and Emiliano Martínez. A low block can slow the game down, but can they really survive 90 minutes of pressure from Argentina?

That's why I'm still going back and forth between predicting 2-0, 3-0, or even 4-0.

I also looked at the AI predictions on SportEval, and most of the models were predicting either 2-0 or 3-0. That actually lines up pretty closely with what I was already thinking.

What score are you predicting? Does anyone actually think Cape Verde gets on the scoresheet, or do you see this ending with a clean sheet for Argentina?

u/Ame_719 — 2 days ago

▲ 22 r/sportsanalytics

How the World Cup title favorite rankings shuffled since the tournament, per betting markets

Methodology and code here https://gist.github.com/taarushv/52a66b36e36ec2e0af6eb050b1c564f9

Realtime: https://cupcharts.com

u/offgramercy — 2 days ago

▲ 7 r/sportsanalytics

Statsbomb full access

Hi everyone.

I’m currently working on my Master’s dissertation and need access to football event data. I’ve been using the open StatsBomb data via statsbombpy, but it’s limited to a small selection of competitions and teams, and much of it is several years old.

I tried contacting StatsBomb’s sales team by email to ask about academic access, but I haven’t received a response.

Does anyone know if there’s a way for students or researchers to access more recent and comprehensive event data, either for free or at an academic discount? Alternatively, are there any other reliable sources of event data that you would recommend for research?

Any suggestions would be greatly appreciated.

reddit.com

u/Basket_Smooth — 2 days ago

▲ 30 r/sportsanalytics+2 crossposts

I built a searchable Summer League stats database for draft fans

With Summer League starting up soon, I wanted to share something I built for other draft/Summer League junkies.

I run nbadraft.app, and I recently added a Summer League stats section because I’ve always felt like detailed Summer League coverage is a weird hole on the internet. Box scores exist in scattered places, but I’ve never found a good way to search across players, games, seasons, teams, and advanced stats like you can easily do for other basketball events.

Main Summer League page:
https://nbadraft.app/stats/summer-league

Explorer/filter page for player, game, and season searches:
https://nbadraft.app/stats/summer-league/explorer?subject=players

Example game page with team/player shot charts:
https://nbadraft.app/stats/summer-league/2022/games/1147

The goal is to make it easier to answer questions like: who was the most statistically prolific player in LVSL last year, how does a prospect’s performance today compare historically, and how does a player’s Summer League production stack up against other lottery picks at the same position?

For example, you can filter last year’s LVSL by position, minutes, usage, shooting efficiency, or compare a lottery guard’s Summer League output against past lottery guards.

I’ll be updating it daily once games start. Would genuinely appreciate feedback, feature requests, bug reports, or anything that would make it more useful for people here.

reddit.com

u/jonathanbechtel — 3 days ago

▲ 72 r/sportsanalytics+7 crossposts

Interesting MLB changes proposed. What is the ripple effect on college baseball and player development?

Source: https://www.mlbplayers.com/press-releases/mlbpa-makes-transaction-proposals-to-benefit-all-players-and-build-upon-industry-momentum

u/Flat-Eggplant-9890 — 3 days ago

▲ 9 r/sportsanalytics+1 crossposts

Does Hosting the World Cup Actually Help You Win?

I dug into 90+ years of World Cup history to see if hosting actually gives teams a competitive edge on the pitch.

The pattern is stronger than I expected!

Across World Cups from 1930–2022, 16 out of 19 host nations outperformed their usual tournament performance when playing at home. In several cases, the jump was dramatic, Uruguay (1930) and England (1966) both improved by the equivalent of multiple tournament stages and went on to win the whole thing.

There are a few exceptions (Spain 1982 underperformed, while South Africa 2010 and Qatar 2022 roughly matched their baseline), but overall the trend points in one direction: host advantage isn’t just noise, it shows up consistently in results.

That said, the sample is small and context matters. Many countries only host once, so a single tournament can heavily skew perception. It’s not proof of causation, but it is a surprisingly consistent historical signal.

With 2026 underway and multiple host nations involved, it raises an interesting question: are we about to see this pattern repeat again? They're all doing quite well so far!

Full breakdown + data here:
👉 https://danflemingdata.substack.com/p/does-hosting-the-world-cup-actually

https://preview.redd.it/774py020ftah1.png?width=5100&format=png&auto=webp&s=33d1bdbe20e3dd1cf312d57e9d6588c477fac822

reddit.com

u/Trick-Palpitation831 — 4 days ago

▲ 18 r/sportsanalytics+13 crossposts

100 Million World Cup Brackets

I've created a website that tracks 100 million FIFA World Cup 2026 knockout stage brackets live to see how they evolve throughout the tournament—and ultimately how many matches it takes before there isn't a single perfect bracket left.

Each of the 100 million brackets is generated using a Monte Carlo simulation. The model is driven primarily by my own team ratings, which are similar in concept to advanced metrics like KenPom for college basketball, while also incorporating factors such as betting markets, injuries, recent form, home-continent advantage, travel, and other matchup-specific adjustments. Every bracket is an independent simulation of the entire knockout stage.

I'll update the website match by match in chronological order, allowing everyone to experience exactly how the live tracker will work once the real World Cup starts.

You can follow along at brackit.us and watch the surviving perfect brackets disappear one match at a time.

u/dombaby18 — 3 days ago

▲ 11 r/sportsanalytics+5 crossposts

I engineered 102 leakage-free ML features from 49,000+ international football matches (1872–2026) and published it as a free dataset

Been working on a football prediction project and couldn't find a dataset that had

the actual context needed to model match outcomes — just raw results everywhere.

So I built one from scratch on top of the International Football Results dataset

by Mart Jürisoo (the well known one on Kaggle with 49,000+ matches going back to 1872).

What I added:

**Elo ratings** — built from scratch, updated after every single match across 150

years. Both teams' ratings, their difference, and the expected win probability

going into each match.

**Rolling form** — win rate, goals scored, goals conceded, goal difference, clean

sheet rate, both-teams-scored rate, scoring rate, and win streak. Computed at

three lookback windows: last 5, last 10, and last 20 matches. For both teams.

**Head-to-head history** — based on the last 10 meetings between those two specific

teams. Some teams have persistent edges over specific opponents that their general

form doesn't explain.

**Fatigue signals** — days since each team's last match and the difference between

the two.

**Penalty reliance** — fraction of each team's historical goals that came from

penalties, pulled from the goalscorer dataset.

**Shootout composure** — historical penalty shootout win rate for each team, from

the shootouts dataset.

**Tournament context** — World Cup, qualifier, friendly, neutral venue, competition

importance weight, confederation.

The thing I spent the most time on: every feature is computed in strict

chronological order using only data that existed before that match was played.

State updates happen after each row is recorded, never before. No lookahead,

no leakage anywhere in the 102 columns.

102 features total. 49,094 rows. result column (H/D/A) included as the label.

Drop date and result, plug into any classifier.

Dataset is fully documented with column descriptors for every feature.

Link: https://www.kaggle.com/datasets/kriishgulati/football-match-results-1872-2026-with-ml-features

Built on top of the original dataset by Mart Jürisoo — full credit and link

in the dataset description.

kaggle.com

u/Kriish_Gulati — 3 days ago

▲ 5 r/sportsanalytics+1 crossposts

Built a real-time odds API 11 bookies, ~1s WebSocket refresh, free tier

Hey all! I've been working on PulseScore (https://pulsescore.net), a real-time sports odds API aimed at people building trading bots, arb scanners, and dashboards. Wanted to share it here and get feedback from people who actually use this kind of data.

What it covers:

- Bookmakers: Bet365, Fanduel, Bwin, Unibet AU, Paddy Power, BetOnline, PS3838, William Hill, Ladbrokes, DraftKings, Betano DE

- Live in-play via WebSocket 1–2s refresh

- Pre-match odds across all books on your plan

- 14+ sports (soccer, tennis, basketball, ice hockey, AF, cricket, horse racing, etc.)

- Bet365 goes deepest — 50+ markets per event (1X2, AH, O/U, BTTS, correct score…)

Same JSON shape across every bookmaker — swap /bet365 for /fanduel, /bwin, /ps3838 and the rest. REST + WebSocket, X-Secret header auth.

Pricing:

- Free: 500 req/month

- Starter €20/mo: 30k req/month

- Pro €79/mo: unlimited req + 1 WebSocket (7-day trial)

- Max €149/mo: unlimited + 3 WebSockets multi-sport + /all, /count endpoints

Free tier is enough to actually test it, no card needed. Happy to answer anything about latency, market depth, or how it compares to OddsAPI / TheOddsAPI / RapidAPI alternatives. Roast away.

u/Hot-Muscle-7021 — 3 days ago