When Spring Training Matters


Brandon MossOAK.374.571.380+.006 PlayerTeamProj.wOBASpringwOBARevisedwOBADiff. Andrew McCutchenPIT.402.666.407+.005 Abraham AlmonteSEA.335.267.330-.005 Jose TabataPIT.340.105.334-.006 Mike MoustakasKC.321.615.331+.010 A.J. PollockARI.337.503.344+.007 Corey HartSEA.367.209.358-.009 Kolten WongSTL.313.514.326+.013 Ryan SweeneyCHC.325.098.317-.008 Ruben TejadaNYM.315.142.309-.005 Austin JacksonDET.354.544.360+.006 (Revised wOBA is what we would expect for the players’ 2014 wOBA after we factor in their spring performances.)At the other end of the spectrum, the projected everyday players whose poor showings in spring training are most likely to cost them during the regular season are 2013 rookie sensation Yasiel Puig of the Dodgers, the Mariners’ Corey Hart, and the Cubs’ Ryan Sweeney and Junior Lake. Travis d’ArnaudNYM.318.206.312-.006 Going into the 2013 baseball season, you would have been forgiven for thinking Marlon Byrd’s days as a relevant player were behind him. He was 35 years old, an age at which most players’ skills have deteriorated significantly. He was coming off a miserable year, one in which he hit .210/.243/.245, was discarded by two teams (the Cubs and the Red Sox, who combined to go 130-194 on the season) and received a 50-game suspension after testing positive for the banned substance tamoxifen.1It remains unclear when exactly Byrd was using performance enhancing drugs, and how much of a residual effect they had on his horrendous 2012 season. Although he signed a minor-league deal with the Mets in February, few thought Byrd would even be serviceable in the upcoming season.Then Byrd went on a tear in spring training. He hit .357/.393/.571 in exhibition games, was subsequently named the New York Mets’ opening-day right fielder and went on to put up the best year of his career — in New York for five months of the season and later, after a trade, as a member of the Pittsburgh Pirates’ first playoff team since 1992.With the benefit of hindsight, it’s easy to connect the dots and declare that Byrd’s hot spring set the tone for his renaissance season. But is that what happened? Does an unusually strong March have any predictive power over a player’s performance once the games count?The answer is … well, sort of. To find out what that means for this year’s crop of spring standouts, I looked into all the data since 2006, the earliest season for which MLB.com lists spring-training statistics. Using wOBA, an advanced metric to measure a batter’s offensive performance,2I used the harmonic mean of plate appearances in each sample as the weights. I ran a weighted correlation3A quick primer on correlations: They measure the linear relationship between two variables, on a scale that runs from -1 (strong negative relationship) to 1 (strong positive relationship). The closer to 0, the less of a relationship there is. between performance in the spring and during the regular season. It revealed a weak relationship between the two variables, at best.4The correlation coefficient was 0.189, which is relatively feeble.You can see that weak relationship below. Each dot on the graph represents a player’s season plotted according to his spring training wOBA and his corresponding regular season wOBA.We also have access to information beyond a player’s spring-training statistics. In the case of a veteran player like Byrd, we know his track record from recent seasons and can use that data to inform expectations for the forthcoming season.But there’s a more sophisticated way to see if spring training matters come the regular season: Use a linear regression5Even quicker regression primer: this method seeks to model a linear relationship between two or more variables. The advantage of regression here is that it can estimate the impact of an increase in one variable (spring training wOBA) while holding the other (preseason projected wOBA) constant. to determine the predictive significance of spring training after controlling for expected performance. And as luck would have it, establishing a baseline of expected performance is where statistical forecasting systems6Such systems seek to set an expected level of performance for each player based on his age, previous statistics and sometimes even comparisons to similar players. can come in very handy.One of those systems was developed by sabermetrician Tom Tango, who releases a set of projections known as the Marcels (so named for the pet monkey from the show “Friends”) before each season. These projections are “so basic that a monkey could compute them,” but they perform no worse than far more sophisticated projection systems — a testament to the fundamental power of a weighted average of recent seasons and a simple aging adjustment. The sabermetricians (and brothers) Jeff and Darrell Zimmerman took the time to calculate historical Marcel projections for players going back to 1901, which we can use to build our regression.We then find that spring productivity is statistically significant when predicting actual performance in the upcoming season, even after controlling for a player’s Marcel projection. However, while significant, the effect is extremely small: To raise his expected regular-season wOBA by just a single point, a typical player would need to hit for a wOBA roughly 17 points higher than expected during the spring.In other words, spring numbers can and should affect our predictions for a player’s regular-season production, but only slightly, and only after a particularly strong or weak performance.Among players likely to get playing time (a minimum of 400 plate appearances, according to Fangraphs’ depth charts), we should keep an eye on the likes of the Tigers’ prospect Nick Castellanos, the Cardinals’ Kolten Wong, the Royals’ Mike Moustakas and the Mariners’ Brad Miller, all of whom are tearing up opposing pitching during the spring thus far. And the Pirates’ Andrew McCutchen, last year’s National League MVP, could be even better than expected this year given his spring. Skip SchumakerCIN.313.471.319+.006 Junior LakeCHC.351.166.344-.007 Brad MillerSEA.342.566.352+.010 Nick CastellanosDET.337.484.350+.013 PlayerTeamProj.wOBASpringwOBARevisedwOBADiff. Jose ReyesTOR.364.228.359-.005 Yasiel PuigLAD.397.147.387-.011 Yoenis CespedesOAK.352.145.345-.007 Dustin AckleySEA.316.496.323+.007 There’s no guarantee any single one of these guys will use the spring to propel himself to a great regular season — or, conversely, that a rough spring portends certain doom — but the data says these players are more likely to diverge from their projections now than they were just a month ago.

Leave a Reply

Your email address will not be published. Required fields are marked *