Site icon James Young

Pre-Snap Tells and Offensive Predictability: Insights for Strategy And Performance In The NFL


Introduction

In the NFL, success on offense involves more than just raw talent. Defenses are constantly looking for cues—“pre-snap tells”—that tip them off to whether a play is likely to be a run or pass. In my recent work in the 2025 NFL Big Data Bowl (which uses player tracking data from 2022) l dig into how predictable these offensive cues are at both the player and team level, and how that predictability relates to key performance indicators like Pro Football Focus (PFF) grades and overall offensive efficiency.

In this blog post, I’ll share the high-level concepts, interesting findings, and tactical insights for coaches, analysts, and executives looking to leverage data in strategic decision-making. If you’d like deeper technical details or to see the code used, please check the links at the end.

There are much deeper-divers to be had here, but this is a start on the aspects I found most interesting. To see more of the code behind this work, check out my Kaggle Submission here.


The Core Question

Is there a measurable relationship between an offense’s pre-snap predictability and its on-field success?

From an analytics standpoint, it boils down to:

  1. Detecting Predictability: Can we accurately classify a play as a run or pass strictly by looking at player locations, orientations, formations, and contextual data (e.g., down/distance, prior tendencies, etc.)?
  2. Assessing Impact on Performance: Once we understand who and which teams are more predictable, does that correlate with negative (or positive) outcomes in efficiency metrics like offensive line run-blocking (RBLK), quarterback grades, or skill-position performance?

Quick Overview of the Approach

  1. Data Engineering
    • Pulled in the NFL’s Next Gen Stats tracking data for player x-y positions, speeds, orientations, and contextual fields such as down and distance.
    • Engineered features to capture pre-snap stance, alignment relative to the quarterback, and possible interplay between formation and the situation.
  2. Modeling Predictability
    • Trained a classification model (run vs. pass) using these engineered features, achieving solid performance (AUC of 0.77+).
    • Teams or players whose data made the model’s classification “too easy” were deemed more “predictable.”
  3. Performance Metrics
    • Merged PFF offensive grades (e.g., RBLK for run blocking, OFF for overall offense) to examine how changes in predictability connect to efficiency.
    • Performed correlation analyses, difference-in-differences (for players who switched teams), and positional breakdowns to see who is most (and least) impacted by predictability.

Key Findings & Visual Insights

1. Model Performance & Predictive Features

2. Team-Level Predictability vs. Run-Blocking (RBLK)

3. Positional Breakdown

4. Switchers & Diff-in-Diff

To further isolate whether predictability truly impacts a player’s performance (versus random variation or differing team strengths), I looked at players who switched teams between seasons (a “natural experiment”):


Why It Matters

  1. Coaching & Game Planning
    • Beyond Talent: Even highly skilled rosters can be hamstrung by telegraphed play-calling. Implementing formation diversity or subtle “false keys” can keep the defense on its heels.
    • Situational Adaptation: Adjusting guard alignments or quarterback cues could mitigate predictable patterns.
  2. Analytics & Roster Building
    • Right Fit: Offensive linemen and QBs sensitive to predictable schemes may flourish if inserted into more varied systems.
    • Position-Specific Investment: If your scheme is pass-happy, a highly predictable formation can hamper run-blocking. Balancing your roster to hide or offset these tendencies can pay dividends.
  3. Future of Sports Analytics
    • Machine Learning in Football: Continues to uncover insights where the difference between success and failure can be inches or split-seconds.
    • Player Valuation: NFL front offices increasingly weigh data-driven evaluations (like how a guard performs in a predictable vs. unpredictable system).

Guard-Specific Spotlight

One of the most striking details emerged from analyzing guards (G):


Conclusion & Next Steps

This project confirmed that keeping the defense guessing isn’t just a cliché: it’s backed by quantifiable evidence at both the team and individual levels. From a purely statistical viewpoint:

Looking Ahead


Let’s Connect

I hope you found these insights helpful! If you’re interested in discussing sports analytics, data engineering in football, or applying machine learning to real-time player data, feel free to reach out:

Thank you for reading, and I look forward to collaborating or hearing your feedback.

Exit mobile version