The power of models

In this module, we explore how models move beyond academic exercises to become powerful tools that shape real-world decisions—decisions that can literally change lives, determine insurance premiums, and influence policy.

Throughout this course, we've learned that models are services that transform inputs (x) into outputs (y) through optimized parameters. But the true power of models lies not in their mathematical elegance, but in their application to critical decision-making processes.

Decision making processes: When models determine your insurance premium

Consider this scenario: You're shopping for car insurance. The insurance company asks about your vehicle's make, model, and weight. Within seconds, their algorithm—a sophisticated predictive model—calculates your premium. But what data drives these life-changing decisions?

Loading car crash data...

The chart above reveals a sobering reality about vehicle weight and fatalities. The model shows two distinct patterns:

Blue line (Own car occupants): As vehicle weight increases, deaths to the car's own occupants generally decrease. Heavier cars provide better protection to their passengers through mass, crumple zones, and structural integrity.

Red line (Other people): As vehicle weight increases, deaths to other people—pedestrians, cyclists, occupants of lighter vehicles—dramatically increase. The physics is unforgiving: heavier vehicles transfer more energy during crashes.

The intersection point (purple dot) represents a critical threshold where the safety trade-off becomes apparent. This single data point could determine whether your insurance premium is $800 or $1,200 per year.

Real-world implications of this model:

Insurance pricing: Companies use models like this to assess risk. Drive a heavy SUV? Your liability coverage might cost more because the model predicts higher risk to others. Drive a lightweight car? Your comprehensive coverage might increase due to higher risk to yourself.

Urban policy: City planners use similar models to set speed limits, design bike lanes, and regulate vehicle access in dense areas. The red line's steep slope has influenced policies limiting large vehicles in European city centers.

Manufacturer decisions: Automakers balance these competing safety demands when designing vehicles. The model influences everything from weight distribution to marketing strategies.

Regulatory frameworks: Government agencies use these relationships to set safety standards, crash test requirements, and fuel economy rules that account for the externalized risks of heavier vehicles.

This demonstrates the profound power of models: a mathematical relationship derived from crash data becomes the foundation for decisions affecting millions of people's safety, finances, and daily lives.

⚠️The responsibility of modelers

When models inform decisions this consequential, the responsibilities multiply:

  • Bias detection: Are certain populations unfairly penalized?
  • Data quality: Is the underlying crash data representative?
  • Model limitations: What factors does this model ignore (driver behavior, road conditions, vehicle age)?
  • Ethical implications: Should individual premiums reflect collective risks?
📝Note

This is the link to the X post that served as inspiration for this example. This is the link to the reference article on the Economist that inspired the X post.

💡Spreadsheet

You can find the data and models at this link.

🧠 Association vs. Causation

Understanding the difference between association and causation is essential when interpreting data. Just because two things happen at the same time doesn't mean one causes the other!

Let us look at an example:

Ice Cream Sales vs. Shark Attacks

Correlation (R): 0.95
R-squared (R²): 0.91

(This example illustrates association, not causation.)


🍦🦈 This chart shows a strong relationship between monthly ice cream sales and shark attacks. When ice cream sales go up, so do shark attacks.

But does eating more ice cream cause more shark attacks? Of course not! They are both linked to a third factor: hot weather. More people buy ice cream and swim in the ocean when it is hot — increasing both ice cream sales and the chances of shark encounters.

This is a perfect example of association without causation — also known as a spurious correlation.

🔍 More Examples of Spurious Correlations

Here are other pairs that are correlated but not causally connected:

  • School grades and shoe size: Younger kids tend to have smaller feet and lower grades — not because foot size affects intelligence, but because of age.

  • Number of firefighters and damage caused by a fire: Bigger fires need more firefighters — but sending more firefighters does not cause more damage.

  • Training load and injury: Athletes who train more might get injured more, but does training cause injury, or are other factors involved (like recovery, technique, or underlying health)?

🧪 The Takeaway

Association ≠ Causation. Before concluding that one thing causes another, we must rule out:

  • Coincidence
  • Confounding variables (a hidden factor influencing both)
  • Reverse causation (maybe B causes A, not A causes B)

This image below encapsulates this concept quite nicely. This might well be the greatest image exemplifying the common misconception of correlation vs causation. Credits to this X post.


Correlation vs Causation

That is why in science and statistics, we use experiments, controlled studies, and critical thinking to figure out what is really going on: not just what appears to be true from the data.

📝note

These examples and the data used in the graph have been taken from Math For The Real World.

📚Biblio

I highly recommend reading the following contributions from Impellizzeri et al., 2020, Part I and Impellizzeri et al., 2020, Part II navigating the issues with the acute/chronic training load ratio concept Gabbett, 2020.


The universal language of models: One equation, infinite applications

One of the most remarkable discoveries in modeling is that completely different systems often follow identical mathematical rules. The first-order differential equation we used for VO₂ kinetics in Module 4 appears everywhere in science and engineering—a testament to the fundamental patterns underlying diverse phenomena.

Consider this elegant equation from Module 4:

τddty(t)+y(t)=x(t)\tau \frac{d}{dt} y(t) + y(t) = x(t)

This single mathematical relationship describes an astonishing variety of real-world systems. Let's explore four completely different applications of this same equation:

1. VO₂ Kinetics (Exercise Physiology)

τddtVO2(t)+VO2(t)=VO2SS(t)\tau \frac{d}{dt} VO_2(t) + VO_2(t) = VO_{2SS}(t)

Where oxygen uptake responds to changes in steady-state demand with time constant τ\tau.

2. Exponentially Weighted Moving Average (Data Analysis)

τddtxˉ(t)+xˉ(t)=x(t)\tau \frac{d}{dt} \bar{x}(t) + \bar{x}(t) = x(t)

Where a smoothed average xˉ(t)\bar{x}(t) tracks a noisy input signal x(t)x(t) with smoothing parameter τ\tau. This is the continuous-time version of the exponentially weighted moving average used everywhere from stock market analysis to GPS tracking.

3. Banister Fitness-Fatigue Model (Training Science)

τfddtF(t)+F(t)=w(t)\tau_f \frac{d}{dt} F(t) + F(t) = w(t)
τgddtG(t)+G(t)=w(t)\tau_g \frac{d}{dt} G(t) + G(t) = w(t)

Where fitness G(t)G(t) and fatigue F(t)F(t) respond to training load w(t)w(t) with different time constants (τg\tau_g for fitness decay, τf\tau_f for fatigue decay). Performance = G(t)F(t)G(t) - F(t).

4. RC Circuit (Electrical Engineering)

RCddtVC(t)+VC(t)=Vin(t)RC \frac{d}{dt} V_C(t) + V_C(t) = V_{in}(t)

Where capacitor voltage VC(t)V_C(t) responds to input voltage Vin(t)V_{in}(t) with time constant τ=RC\tau = RC (resistance × capacitance).

The universal pattern: In every case, we have:

  • A state variable that changes over time: VO2VO_2, moving average, fitness/fatigue, or voltage
  • An input that drives the change: power demand, data signal, training load, or applied voltage
  • A time constant that determines response speed: metabolic τ\tau, smoothing τ\tau, adaptation τ\tau, or electrical τ\tau

The mathematical structure is identical—only the physical interpretation changes. A sports scientist optimizing training periodization uses the same differential equation as an electrical engineer designing a filter circuit.

This universality reveals a profound truth: nature tends to organize itself around common mathematical principles. Once you understand the exponential response pattern in one domain, you've gained insight into countless others.

Why does this matter for sports scientists?

  • Techniques from engineering (like control theory) can improve training load management
  • Signal processing methods can enhance performance data analysis
  • Economic forecasting approaches can predict adaptation responses
  • Understanding these analogies accelerates learning and problem-solving across disciplines

The equation τddty+y=x\tau\frac{d}{dt}y + y = x is not just a mathematical curiosity—it's a fundamental language that describes how systems respond, adapt, and evolve across the entire spectrum of science and engineering.

📝Note

The mathematical unity across disciplines was systematically explored by Schönfeld 1953, who showed how mechanical, electrical, and fluid systems share identical governing equations. This work laid the foundation for systems theory and cross-disciplinary modeling approaches that continue to drive innovation today.


Checkout

Well done, you were able to complete the sixth module of this course.

👮🏻‍♂️Checkpoint

At the end of Module-6 you should be able to reply to these questions with confidence:

  1. What are the real-world implications when models inform critical decisions like insurance premiums, and what responsibilities do modelers have when their work affects people's lives and finances?
  2. What is the difference between association and causation, and why can spurious correlations like ice cream sales vs shark attacks lead to incorrect conclusions?
  3. How does the same first-order differential equation τ(d/dt)y + y = x apply across completely different domains like VO₂ kinetics, data smoothing, training adaptation, and electrical circuits?
  4. How can understanding these mathematical analogies between disciplines help sports scientists apply techniques from engineering, signal processing, and other fields to improve their research and practice?
Previous Module