Loading...
Loading...
Sabermetrics is the empirical analysis of baseball through statistics. This guide explains the most important advanced metrics used by MLB front offices, analysts, and informed fans to evaluate player performance beyond traditional box score stats.
The term "sabermetrics" was coined by Bill James in the 1980s, derived from SABR (Society for American Baseball Research). It refers to the empirical analysis of baseball through objective, statistical evidence rather than subjective observation or traditional scouting alone.
While traditional stats like batting average, RBIs, and wins have been the foundation of baseball evaluation for over a century, sabermetrics revealed that these numbers often fail to capture a player's true value. Batting average ignores walks and extra-base hits. RBIs depend heavily on teammates getting on base. Pitcher wins are influenced more by run support than pitching quality.
Modern sabermetrics goes far beyond simple stat replacement. It encompasses win probability models, pitch-tracking data (Statcast), defensive metrics, baserunning analysis, and predictive modeling. Every MLB front office now employs teams of analysts who use these methods to make roster decisions, in-game strategy calls, and long-term player development plans.
The "Moneyball" revolution, popularized by Michael Lewis's 2003 book about the Oakland Athletics, brought sabermetrics into the mainstream. Billy Beane and Paul DePodesta demonstrated that undervalued statistical traits, particularly on-base percentage, could build a competitive team on a limited budget. Today, every team uses analytics, but the principles remain the same: measure what matters, ignore what does not, and find value where others overlook it.
Traditional baseball statistics were designed in the 19th century for newspaper box scores. They needed to be simple enough to calculate by hand and fit in narrow newspaper columns. While serviceable for their era, they carry significant limitations that can mislead evaluation.
Scale: ~.290 (avg), .340 (good), .370+ (excellent)
wOBA assigns different weights to each way a batter reaches base. Unlike traditional batting average, which treats all hits equally, wOBA recognizes that a home run is far more valuable than a single. It uses linear weights derived from run expectancy to measure a hitter's overall offensive contribution per plate appearance.
Traditional stats like batting average ignore walks and treat all hits the same. A player who hits .280 with 40 home runs is far more valuable than one who hits .310 with only singles. wOBA captures this difference by weighting each outcome proportionally to its run value.
wOBA = (0.69 x BB + 0.72 x HBP + 0.87 x 1B + 1.22 x 2B + 1.56 x 3B + 2.01 x HR) / (AB + BB - IBB + SF + HBP)In the 2024 season, Aaron Judge posted a .441 wOBA, ranking among the best in baseball. This reflects not just his batting average but his elite walk rate and power numbers. Meanwhile, a contact-oriented hitter with a higher batting average might have a lower wOBA because their hits are predominantly singles.
Scale: 100 = league average, 120+ (great), 150+ (elite)
wRC+ takes wOBA and adjusts it for park factors and league environment, then scales it so that 100 always equals league average. A wRC+ of 130 means the player created 30% more runs than the average hitter. This makes it one of the best single-number stats for comparing hitters across different eras and ballparks.
Raw stats are heavily influenced by where a player plays. A hitter in Coors Field (Colorado) naturally gets inflated numbers due to the thin air, while a hitter in a pitcher-friendly park like Oracle Park (San Francisco) might look worse than he actually is. wRC+ levels the playing field.
wRC+ = ((wRAA/PA + League Runs/PA) + (League Runs/PA - Park Factor x League Runs/PA)) / (League wRC/PA) x 100If Player A has a wRC+ of 140 at Coors Field and Player B has a wRC+ of 140 at Petco Park, they are equally valuable hitters despite likely having very different raw numbers. The park adjustment ensures a fair comparison.
Scale: 100 = league average, 120+ (great), 150+ (elite)
OPS+ adjusts a player's OPS (On-Base Percentage + Slugging Percentage) for the league and park they play in, normalizing to 100 as league average. While simpler than wRC+, it provides a quick and reasonably accurate measure of a batter's overall offensive ability.
Raw OPS varies significantly between eras and ballparks. An OPS of .800 meant something very different in the steroid era compared to the current low-offense environment. OPS+ allows meaningful comparisons across all contexts.
OPS+ = 100 x (OBP/lgOBP + SLG/lgSLG - 1), adjusted for park factorTed Williams' career OPS+ of 190 can be directly compared to Mike Trout's career OPS+ of 176 despite playing in completely different eras. Both numbers tell us exactly how much better than average each player was in their respective context.
Scale: .120 (below avg), .160 (avg), .200+ (great), .250+ (elite)
ISO measures a batter's raw power by subtracting batting average from slugging percentage. This isolates the extra bases a hitter generates beyond singles. A high ISO indicates a true power hitter, regardless of their overall batting average.
Slugging percentage includes singles, which can inflate the number for high-average, low-power hitters. ISO strips away singles to show pure extra-base hit ability. Two players with identical slugging percentages can have very different ISO values.
ISO = SLG - AVG = (2B + 2 x 3B + 3 x HR) / ABA player hitting .250 with .500 SLG has an ISO of .250 (elite power). Another hitting .320 with .450 SLG has an ISO of only .130 (below average power). Despite the second player having better traditional stats, the first player is the far superior power hitter.
Scale: League average ~.300, varies by player skill and speed
BABIP measures how often a ball put in play (not a home run, strikeout, or walk) falls for a hit. It helps distinguish between skill and luck in a player's batting average. While there is a skill component (line-drive hitters and fast runners tend to have higher BABIPs), extreme values often indicate unsustainable performance.
If a player has a .380 BABIP, their batting average is likely inflated by good fortune and may regress. Conversely, a .220 BABIP suggests bad luck that should improve. Analysts use BABIP to predict future performance changes and identify buy-low or sell-high candidates.
BABIP = (H - HR) / (AB - K - HR + SF)If a pitcher has a 2.50 ERA but a .220 BABIP, their ERA is likely being propped up by luck on balls in play. Over time, more of those batted balls will find holes, and the ERA will likely rise. BABIP helps analysts see through surface-level results.
K%: ~22% avg (lower is better for hitters). BB%: ~8% avg (higher is better)
Strikeout rate and walk rate express how often a batter strikes out or walks as a percentage of their plate appearances. These are 'three true outcomes' stats that are entirely within the control of the batter and pitcher, unaffected by defense or luck.
These rates are among the most stable and predictive stats in baseball. A hitter with a low K% and high BB% demonstrates excellent plate discipline and bat control. These numbers stabilize quickly (around 60 plate appearances for K%, 120 for BB%) making them useful for early-season evaluation.
K% = Strikeouts / Plate Appearances, BB% = Walks / Plate AppearancesJuan Soto consistently maintains a BB% above 15%, nearly double the league average. This elite plate discipline means he gets on base at an extraordinary rate even when he is not getting hits, making him one of the most valuable offensive players in baseball.
Scale: 3.20 (good), 3.00 (great), sub-2.50 (elite)
FIP estimates what a pitcher's ERA should be based solely on outcomes the pitcher controls: strikeouts, walks, hit-by-pitches, and home runs. It removes the influence of defense and luck on balls in play, providing a more accurate picture of a pitcher's true skill level.
ERA is heavily influenced by factors outside a pitcher's control. A pitcher with a great defense behind him will have a lower ERA than his skills warrant, while a pitcher with a poor defense will look worse. FIP strips away these external factors to reveal the pitcher's actual ability.
FIP = ((13 x HR) + (3 x (BB + HBP)) - (2 x K)) / IP + FIP ConstantIf a pitcher has a 4.50 ERA but a 3.20 FIP, the large gap suggests they have been unlucky with balls in play or have a below-average defense. Their future performance is more likely to align with the 3.20 FIP than the 4.50 ERA.
Scale: similar to FIP, but more stable year-to-year
xFIP is a variation of FIP that replaces a pitcher's actual home run total with an expected number based on their fly ball rate and the league-average HR/FB ratio. Since home run rates on fly balls fluctuate significantly from year to year, xFIP provides a more stable estimate of true talent.
Even FIP can be skewed by a pitcher getting lucky or unlucky with home runs. A pitcher who has an unusually low HR/FB rate one year will likely see it rise the next year. xFIP smooths out this volatility for a more predictive measure.
xFIP = ((13 x (Fly Balls x League HR/FB rate)) + (3 x (BB + HBP)) - (2 x K)) / IP + FIP ConstantA pitcher with a 2.80 FIP but a 3.40 xFIP is likely benefiting from an unsustainably low home run rate. The xFIP suggests that some of those fly balls that stayed in the park will eventually go over the fence.
Scale: 1.30 (avg), 1.15 (good), sub-1.00 (elite)
WHIP measures how many baserunners a pitcher allows per inning. While simpler than FIP, it provides a quick and intuitive measure of how effectively a pitcher keeps runners off base. It is one of the most commonly used pitching stats for a good reason: fewer baserunners mean fewer runs.
WHIP is easy to understand and correlates well with run prevention. A pitcher with a low WHIP is consistently putting up clean innings, which is the foundation of pitching success. It is more granular than ERA and less abstract than FIP.
WHIP = (Walks + Hits) / Innings PitchedHistorically, dominant pitchers like Pedro Martinez (career 1.054 WHIP) and Mariano Rivera (career 1.000 WHIP) were elite at keeping runners off base. Modern aces typically post WHIPs around 1.00-1.10.
Scale: 100 = league average, 130+ (great), 150+ (elite)
ERA+ adjusts a pitcher's ERA for the league average and their home ballpark, then inverts the scale so that higher numbers are better. An ERA+ of 130 means the pitcher's ERA is 30% better than league average after adjustments. This allows direct comparisons across eras and ballparks.
A 3.50 ERA in the dead-ball era meant something entirely different than a 3.50 ERA in the steroid era. ERA+ contextualizes performance so that Christy Mathewson can be compared to Greg Maddux to Jacob deGrom on equal footing.
ERA+ = 100 x (League ERA / (Park Factor x ERA))Pedro Martinez's 2000 season had an ERA+ of 291, widely considered one of the greatest pitching seasons ever. This means his ERA was nearly three times better than the league average after adjusting for Fenway Park's hitter-friendly environment.
K/9: 8.0 (avg), 10.0+ (elite). BB/9: 3.0 (avg), sub-2.0 (elite)
K/9 measures how many batters a pitcher strikes out per nine innings, while BB/9 measures walks allowed. Together, they paint a picture of a pitcher's stuff and command. High strikeout rates with low walk rates indicate dominant, efficient pitching.
Strikeouts are the most reliable way to get outs since they bypass the defense entirely. Walk rate reflects command and control. The K/BB ratio (K/9 divided by BB/9) is one of the best quick indicators of pitching quality.
K/9 = (Strikeouts x 9) / IP, BB/9 = (Walks x 9) / IPA pitcher with 12.0 K/9 and 1.8 BB/9 has a K/BB ratio of 6.67, indicating elite stuff and pinpoint command. Contrast this with a pitcher striking out 10.0 per nine but walking 4.5, whose K/BB of 2.22 suggests wildness that undermines his strikeout ability.
Scale: 0-1 (backup), 2 (starter), 4+ (All-Star), 6+ (MVP), 8+ (historic)
WAR is the single most comprehensive stat in baseball analytics. It estimates how many wins a player contributes to their team above what a replacement-level player (a freely available minor-leaguer or AAAA player) would contribute. It accounts for hitting, baserunning, fielding, and positional value for position players, and pitching performance for pitchers.
There are two major versions: fWAR (FanGraphs, uses FIP for pitchers) and bWAR (Baseball-Reference, uses RA/9). Differences are usually small but can matter for specific players.
Shohei Ohtani's unique two-way value is best captured by WAR. In 2021, he accumulated approximately 9.0 WAR, with contributions from both his pitching and hitting. No other stat can capture the full scope of his dual-threat ability like WAR does.
Cumulative stat, context-dependent. 3.0+ is excellent for a season.
WPA measures how much each plate appearance changes the team's probability of winning. A go-ahead home run in the 9th inning adds far more WPA than a solo shot in a blowout. It captures the 'clutch' value of a player's contributions by weighting each play by its impact on the game outcome.
A walk-off home run in a tied game might add 0.40 to 0.50 WPA in a single plate appearance, as it shifts the team's win probability from roughly 50% to 100%. In contrast, a solo homer making the score 10-1 might add only 0.01 WPA.
Cumulative stat, context-dependent but situationally aware
RE24 measures how each plate appearance changes the expected number of runs scored in an inning based on the 24 possible base-out states (8 base states x 3 out states). It is more nuanced than basic counting stats because it accounts for the game situation.
With runners on second and third with one out, the run expectancy is about 1.4 runs. If a batter hits a two-RBI double, the new state (runner on second, one out) has a run expectancy of about 0.7. The RE24 for that at-bat would be approximately 2.0 + 0.7 - 1.4 = 1.3.
Not all ballparks are created equal. Coors Field in Denver sits at 5,280 feet of elevation, where the thin air allows balls to travel farther and breaks on pitches to flatten. Oracle Park in San Francisco has dense marine air and deep outfield dimensions that suppress offense. These environmental differences can significantly inflate or deflate a player's raw statistics.
Park factors quantify these differences. A park factor of 105 for runs means that ballpark produces 5% more runs than average. A park factor of 95 means 5% fewer runs. When evaluating players, it is essential to account for where they play their home games.
Coors Field (COL)
Factor: ~115 (Hitter-friendly)
Highest elevation in MLB, extreme run inflation
Globe Life Field (TEX)
Factor: ~108 (Hitter-friendly)
Hot temperatures, carries well to left-center
Fenway Park (BOS)
Factor: ~106 (Hitter-friendly)
Short left field wall (Green Monster), boosts doubles
Oracle Park (SF)
Factor: ~92 (Pitcher-friendly)
Dense marine air, deep right-center, suppresses homers
Petco Park (SD)
Factor: ~94 (Pitcher-friendly)
Marine layer, spacious outfield, pitcher's park
loanDepot Park (MIA)
Factor: ~95 (Pitcher-friendly)
Retractable roof, humid air, deep gaps
This is why adjusted stats like wRC+, OPS+, and ERA+ are so valuable. They account for these park differences automatically, allowing you to compare a Rockies hitter to a Padres hitter on equal terms. Always prefer adjusted stats when comparing players from different teams.
One of the most important concepts in sabermetrics is sample size. Not all stats become reliable at the same speed. Some metrics stabilize quickly (meaning they reflect true talent early in the season) while others require a full season or more to become meaningful.
| Stat | Plate Appearances | Approx. Games |
|---|---|---|
| Strikeout Rate (K%) | 60 PA | ~15 games |
| Walk Rate (BB%) | 120 PA | ~30 games |
| HBP Rate | 300 PA | ~75 games |
| HR Rate | 170 PA | ~43 games |
| BABIP | 820 PA | ~200 games (over 1 season) |
| Batting Average | 910 PA | ~230 games (over 1 season) |
| OBP | 460 PA | ~115 games |
| SLG | 320 PA | ~80 games |
| ISO | 160 PA | ~40 games |
Key takeaway:In April, do not overreact to a player's batting average or BABIP. These stats need hundreds of plate appearances to stabilize. Instead, focus on K% and BB%, which stabilize much faster and are better early-season indicators of performance changes.
For pitchers, similar principles apply. FIP stabilizes faster than ERA because it is based on strikeouts, walks, and home runs, which are less subject to random variation than hits on balls in play.
You do not need to memorize every formula to benefit from sabermetrics. Here is a practical framework for using advanced stats as an informed baseball fan:
Remember: sabermetrics enhances your enjoyment of baseball. It does not replace watching the games. The best approach combines statistical analysis with the eye test, understanding that the numbers tell you what happened and provide probabilities for the future, but the games are played on the field, not on spreadsheets.
While win probability focuses on which team wins, game totals (over/under) and run production require understanding how sabermetrics translate into expected runs. This section bridges the gap between player statistics and scoring forecasts.
League-wide wOBA (Weighted On-Base Average) of roughly 0.315 corresponds to approximately 4.3 runs per game per team. A team with a wOBA of 0.330 should project to score roughly 4.5 runs per game; a team at 0.300 might score 4.1. The linear relationship holds reasonably well, though park factors (covered in Section 6) and opponent pitcher quality introduce variance. When analyzing a game's total, start with each team's wOBA and estimate run production, then adjust for the venue.
A starting pitcher with a low FIP (Fielding Independent Pitching) and high strikeout rate (K/9 ≥ 10) reliably suppresses run production. FIP under 3.00 indicates elite stuff that limits hard contact. When both starters carry strong FIP numbers, the game total tends to decline as batters struggle to generate hard contact and extra-base hits. Conversely, mediocre starters (FIP 4.00+) allow elevated scoring. The gap between starting pitchers often determines whether a total plays over or under.
Coors Field (park factor 1.30) inflates expected runs roughly 30%. A matchup projected for 8.5 combined runs at a neutral park might yield 11 runs at Coors. Conversely, Petco Park (0.94) suppresses run production. Always multiply your estimated total by the venue's park factor. If you estimate 4.3 runs per team (8.6 combined), apply the park factor: 8.6 × 1.30 = 11.2 at Coors, or 8.6 × 0.94 = 8.1 at Petco. This discipline prevents systematic errors.
In April, pitching statistics stabilize slowly. A starter posting a 2.50 ERA through two games may revert toward their true talent level as more innings accumulate. FIP stabilizes faster than ERA, but even FIP in small samples carries uncertainty. When evaluating early-season totals, prefer season-long career benchmarks for pitchers over raw April numbers. By May, season-to-date stats become more reliable. Be cautious with April pitching matchups until the sample size grows.
A quick alphabetical guide to the sabermetric and baseball terms used throughout this guide and StatScope's analysis.
| Abbreviation | Full Term | Definition |
|---|---|---|
| AB | At-Bats | Number of plate appearances that count as official at-bats (excludes walks, HBP) |
| AVG | Batting Average | Hits divided by at-bats; outdated metric that ignores walks |
| BABIP | Batting Average on Balls in Play | Hits on balls put in play (excluding HRs); indicates luck in batting average |
| BB | Walks (Base on Balls) | Times a batter reaches base without hitting the ball |
| BB% | Walk Rate | Walks as a percentage of plate appearances; stabilizes quickly |
| ERA | Earned Run Average | Runs allowed per nine innings; affected by defense and luck |
| ERA+ | Adjusted ERA | ERA adjusted for league average and park (100 = average) |
| FIP | Fielding Independent Pitching | ERA estimate based on K, BB, HBP, HR only; removes defense |
| HR | Home Runs | Hits that travel over the outfield fence |
| IP | Innings Pitched | Fractional format (e.g., 6.2 = 6⅔ innings); used to calculate rates |
| ISO | Isolated Power | Slugging % minus batting average; measures pure power |
| K | Strikeouts | Times a batter is struck out by a pitcher |
| K% | Strikeout Rate | Strikeouts as % of plate appearances; stabilizes at ~60 PA |
| OBP | On-Base Percentage | (H + BB + HBP) / (AB + BB + HBP + SF); better than AVG |
| OPS | On-Base Plus Slugging | OBP + SLG; quick offensive value metric |
| OPS+ | Adjusted OPS | OPS adjusted for league and park (100 = average) |
| PA | Plate Appearances | All times a batter steps up to the plate (includes walks, HBP) |
| RE24 | Run Expectancy Based on 24 Base-Out States | Context-dependent run impact metric; accounts for game situation |
| SLG | Slugging Percentage | Total bases / at-bats; measures power but inflated by singles |
| WAR | Wins Above Replacement | Total player value (hitting, defense, baserunning); best single-number metric |
| WHIP | Walks + Hits per Inning Pitched | (BB + H) / IP; measures baserunner efficiency |
| wOBA | Weighted On-Base Average | Weights each way of reaching base by run value; the best hitter metric |
| wRC+ | Weighted Runs Created Plus | wOBA adjusted for league and park (100 = average) |
| WPA | Win Probability Added | Change in win probability from each plate appearance; context-aware |
| xFIP | Expected Fielding Independent Pitching | FIP adjusted for expected HR rate; more stable than FIP |
Apply your sabermetrics knowledge with StatScope's live game analysis, player matchups, and win probability models.