Tag Archives: Sports

Michigan: Destined for an early exit?

Posted on February 16, 2013 | 3 comments

Michigan is my favorite college basketball team, and for the first time in awhile, they are threatening to make a deep tournament run. However, they just lost three of four during a tough stretch against Indiana (L, away), Ohio State (W, home), Wisconsin (W, away), and Michigan State (W, away). I’m not writing them off — they only lost the away games — but some bad signs appeared in these games. Here’s the Game Stack for all four combined:

Michigan looks good on turnovers, but that comes at a cost — they get crushed on free throws and two point percentage. Having watched the games, I can connect the dots for you: the Wolverines don’t drive to the hoop much against good teams. They have some great shooters who can get reasonably open (Trey Burke, Tim Hardaway, Jr.) who are happy to “settle” for jumpers.

This keeps the ball out of danger in the lane (low turnovers), but it means that Michigan never gets to the line and shoots a lower percentage on twos as well. Michigan also rebounds a lower percentage of their own misses than the opponent, which could be related — a lot of “second chances” are just put-backs after a shot close to the hoop.

So, is Michigan sunk? We’ll see. I have some faith that Mitch McGary can improve and find some high percentage twos down low, but right now, Michigan is probably not efficient enough offensively and not good enough on the boards to compete with the best teams in the country. I would worry less about four games if the problem was just poor shooting in a small sample, but the problem seems to be about playing style against good defenses. I don’t think that’s going to change.

If you’re interested, here are the Game Stacks for all four games. The trends I discussed are pretty consistent across the games:

3 Comments

Posted in Basketball, College Sports, Commentary, Sports Stats

Tagged basketball, basketball alternative box score, basketball analysis, Big Ten Conference, box score, college basketball, game stacks, Indiana, Michigan, Michigan basketball, Michigan basketball analysis, Michigan free throws, Michigan Indiana, Michigan Michigan St, Michigan Ohio State, Michigan Wisconsin, Michigan Wolverines, Ohio State University, Sports, sports column chart, sports data graphs, sports data visualization, sports stacked bar chart, Tim Hardaway, Tim Hardaway Jr., Trey Burke, U of M, visual box score, visualization, Wisconsin, Wolverine

Basketball Stacks part 2: Rebounding

Posted on February 8, 2013 | 4 comments

Yesterday, I posted a new idea for visualizing box scores: Game Stacks. While the first version did a good job of showing shooting percentages and turnover rates, it didn’t do a good job on rebounds. As my pops pointed out, Indiana had a big rebounding advantage over Michigan by the basic numbers (36-22), so it seemed wrong to rely only on the height of the stacks to determine who rebounded better. The reality: Michigan got more chances not because they rebounded better, but because they had more misses — and you have to miss to get a second chance. The height of the stacks just showed that Michigan got more offensive rebounds, even though their rebounding rate was terrible.

So, round two. Here’s the Michigan-Indiana Game Stack redesigned to capture rebounding:

Without play by play data, I had to keep the rebounding simple — I figured out the offensive rebound rate for each team:

Off reb rate = your off rebs/(their def rebs + your off rebs).

Then, I multiplied this rate by the relevant number of shots to generate the “Missed (O Reb)” category for each type of shot (the dashed regions). Each dashed/empty combo now visualizes the offensive rebound rate for the relevant team.

Now the picture is clearer:

Michigan got a couple extra chances, but Continue reading →

4 Comments

Posted in Basketball, College Sports, Sports Stats

Tagged basketball, basketball graphic, Boston, Boston Celtics, box score, Celtics, Celtics offensive rebounding, Clippers, college hoops, defensive breakdowns, Dick Vitale, Free throw, Game Stack, game stacks, Golden State Warriors, graphical statistics, graphics sports, Hoosier, Houston Rockets, Indiana, Indiana basketball, indiana game, Lakers, lakers pistons, Los Angeles Lakers, Michigan, Michigan basketball, NBA, nba game, offensive rebound, Pistons, point attempts, Rebound (basketball), rebounding advantage, Rockets, Rockets 23 three pointers, Rockets three pointers, shooting percentages, shot attempts, Sports, sports statistics, Three-point field goal, turnover rates, visual shooting percentages, visual statistics, visualization, visualizing basketball games, Warriors, Wolverines

Visualization: Basketball Game Stacks

Posted on February 5, 2013 | 4 comments

Note: On my dad’s advice, I posted another version of the Game Stacks that depicts rebounding rates, rather than just total offensive rebounds. The discussion in this post is a little naive on that point — the new version yields a better analysis of rebounding.

I have a general hang up when looking at the box score for basketball (or listening to announcers list off statistics). I see some rebounding numbers, but I can’t tell who rebounded better without offensive and defensive breakdowns, plus the number of shots missed by each team. And I see shooting percentages and shot attempts, but it’s hard to put it all together into how a team got its points.

I realized that what I really want to see is not complicated. Here’s the list:

What each team did with their scoring chances:
- Two point attempts
- Three point attempts
- Free throw trips (2 attempts)
- Turnovers
Efficiency on each type of shot
Rebounding advantage in terms of extra scoring chances
And, of course, total score

All these stats exist, but there should be an easy way to see all of it at once and get a sense for how the game was won. Here’s my first try, the Game Stack:

The picture shows total “plays,” or chances to score, for each team, and total points, broken down by type. In a quick glance, you can see that Indiana was out-rebounded (Michigan got three more chances to score) and turned the ball over a ton. However, on just over 60 non-turnover plays, the Hoosiers Continue reading →

4 Comments

Posted in Basketball, College Sports, Innovative Ideas, Sports Stats

Tagged basketball, basketball graphic, box score, Celtics, Celtics offensive rebounding, Clippers, college hoops, defensive breakdowns, Dick Vitale, Free throw, Game Stack, graphical statistics, graphics sports, Hoosier, Indiana, Indiana basketball, Lakers, Michigan, Michigan basketball, NBA, nba game, Pistons, point attempts, Rebound (basketball), rebounding advantage, shooting percentages, shot attempts, Sports, sports statistics, Three-point field goal, visual shooting percentages, visual statistics, visualization, visualizing basketball games, Wolverines

Playoff Appetizer: True Wins Plus (Fumble Adjusted)

Posted on January 5, 2013 | 6 comments

We might be halfway through the first quarter of the first NFL playoff game of 2013, but I’m still finishing up with baseball and just getting warmed up on football. Football month on the blog officially kicks off today — there’s lots of interest stuff to come, from innovative rule ideas and play calling to new prediction methods and game analysis. Today, I’m trying an addition to the measure of NFL team quality that I debuted last year: True Wins. True Wins are calculated as follows:

True Win = Blowout Wins + Close Wins/2 + Close Losses/2 + Ties/2

You may recognize the intuition from pythagorean expectations — you get full credit for blowout wins (I define this as more than 7 points), but no extra credit for winning by huge margins, and you get half credit for all close games, since those probably come down to luck more than skill. Last year, I showed that True Wins predicts a little better than pythagoreans, and it’s a whole lot more direct. Both measures are much better than using wins alone, which unfairly penalize (reward) teams that lose (win) a lot of close games.

What Else is Luck-Driven? Fumble Recoveries?

With the playoffs coming right up, I decided to try an improvement that adjusts for possible luck in fumble recoveries as well. Here’s the logic (from Football Outsiders):

Stripping the ball is a skill. Holding onto the ball is a skill. Pouncing on the ball as it is bouncing all over the place is not a skill. There is no correlation whatsoever between the percentage of fumbles recovered by a team in one year and the percentage they recover in the next year. The odds of recovery are based solely on the type of play involved, not the teams or any of their players . . . Fumble recovery is a major reason why the general public overestimates or underestimates certain teams. Fumbles are huge, turning-point plays that dramatically impact wins and losses in the past, while fumble recovery percentage says absolutely nothing about a team’s chances of winning games in the future. With this in mind, Football Outsiders stats treat all fumbles as equal, penalizing them based on the likelihood of each type of fumble (run, pass, sack, etc.) being recovered by the defense.

The keys are:

Fumbles are huge turning points in games
Teams don’t maintain high or low recovery rates over time

To quantify #1, I determined the point value of a recovery. A simple regression of point differential in each game on total fumbles and fumbles Continue reading →

6 Comments

Posted in Football, Prediction, Sports Stats

Tagged Andrew Luck, Cincinnati Bengals, Football Outsiders, Football Outsiders fumbles, fumble, fumble recoveries, fumble recovery, Green Bay Packers, Houston Texans, Indianapolis Colts, luck, luck football, luckiest NFL teams 2012, lucky, lucky teams, Minnesota Vikings, National Football League, NFL, nfl playoff game, NFL playoffs, NFL prediction, playoff predictions, playoffs, randomness, randomness sports, Seattle Seahawks, Sports, Super Bowl, True Wins, True Wins Plus, Washington Redskins

Is that a shiny new free agent in your stocking, or an old lump of coal?

Posted on December 21, 2012 | 1 comment

NFL playoffs are right around the corner, but ’tis the season for a jolt of baseball excitement too, as teams sign new players. The contracts are getting bigger and bigger, supported by growing MLB revenues. Some of the major signings under the tree this year (more here):

Zack Greinke, 6 yrs, $147 million (Dodgers)
Josh Hamilton, 5 yrs, $125 million (Angels)
B.J. Upton, 5 yrs, $75 million (Rays)
Anibal Sanchez, 5 yrs, $80 million (Tigers)

But before you start thinking playoffs, remember that many big deals don’t work out. Who will be nice and who will be naughty this year?

The Old Lumps of Coal

From the list above, Greinke is 29 years old, Hamilton is 31, Upton is 28, and Sanchez is 28. Not many young players are available through free agency, but are these 4 to 6 year deals for 28 to 31 year olds a good idea? I tackled this question with my friend Jeff Phillips for ESPN the Magazine in early October.

Specifically, we wondered if long deals for 30 year olds made more sense during the steroid era, when players could recover, train, and maintain more easily. There are two sides of the coin: (1) how has older player performance changed, and (2) has older player compensation evolved appropriately. We focused on players in the top quarter of the salary distribution, since that’s where the big money is spent. To measure performance, we examined average Wins Above Replacement Player (WARP)* by age during and after the steroid era:

Uh oh. Although performance for all highly paid players has gone down, older “stars” have turned out to be coal indeed. Looking year by year highlights the post-PED age decline. Average WARP for older and younger stars was remarkably similar through the steroid era, but older player WARP Continue reading →

1 Comment

Posted in Baseball, Causal Analysis, Financial Analysis, Prediction, Trades/Free Agency

Tagged age and performance, Age and steroids, aging, aging baseball, Albert Pujols, Angel Pagan, Angels, Anibal Sanchez, average salary, B.J. Upton, baseball, contracts, David Ortiz, Detroit Tigers, ESPN The Magazine, Fielder contract too long, free agent projections, Jake Peavy, Jeff Phillips, Josh Hamilton, lumps of coal, Major League Baseball, Michael Bourn, Mike Napoli, Mitchell Report, MLB free agent market 2013, MLB free agents, MLB revenue, MLB revenue growth, Nick Swisher, older stars, Post-steroid era baseball, Prince Fielder, projected value, projections, Pujols contract, rising salaries MLB, salary distribution, Shane Victorino, Sports, Steroid era MLB, steroids and age, steroids and aging, Torii Hunter, WARP, worst contracts, worst contracts MLB, Zack Greinke

New York is Lefty Land

Posted on October 19, 2012 | 5 comments

I’m a Tigers fan, so I’m pretty excited about how things worked out the last week. Basically, everything went right for the Tigers and nothing went right for the Yankees.

The only glimmer of hope for the Yankees came in game one. Down 4-0, Ichiro Suzuki hit a line drive homer to right in the bottom of the ninth and Raul Ibanez followed with a pop fly two-run “shot” that might have been an out (or perhaps a double) in most parks. Hope turned to despair when Derek Jeter went down with an ankle injury in the 12th, ending his season, while the Tigers stormed back into the lead. Even worse for the Yankees, their near victory finally knocked Jose Valverde off his closer pedestal. The Tigers should have made that move months ago.

I want to go back to the homers though. It’s no coincidence that both homers went to right field off of left-handed bats. Here are the home/road home run splits for the Yankees lefties in 2012:

Continue reading →

5 Comments

Posted in Baseball, Causal Analysis, Commentary

Tagged age baseball, age decline baseball, age decline baseball Tyler Williams Jeff Phillips, Age Issue Tyler Williams Jeff Phillips, age steroids baseball, ALCS, American League Championship Series, Andy Dirks, baseball, bottom of the ninth, chris dickerson, Curtis Granderson, Derek Jeter, Detroit Tigers, dewayne wise, Eric Chavez, ESPN Home Run Tracker, ESPN the Mag, ESPN the Mag Age Issue, ESPN the Magazine Jeff Phillips Tyler Williams, Hit Tracker, home away home run splits, Ichiro Suzuki, Jeff Phillips, Jeter injury, Jim Caple, Jim Caple ESPN, Jose Valverde, lefties Yankees, Major League Baseball, Mark Teixeira, MLB, New York Yankees, Nick Swisher, quintin berry, Raul Ibanez, Robinson Cano, Sports, Tigers sweep, Tigers World Series, Tyler Williams ESPN, Yankee Stadium, Yankee Stadium unfair, Yankees can't hit, Yankees home field advantage, Yankees home run advantage, Yankees home run splits home away, Yankees home runs, Yankees left-handed bats, Yankees left-handed hitters, Yankees lefties overrated, yankees lineup, Yankees old, Yankees overrated, Yankees poor hitting, Yankees right field, Yankees short porch, Yankees struggle at the plate

Sabermetrics: Cabrera vs. Trout, Round 2

Posted on October 6, 2012 | 6 comments

Last week, I entered the fray on the Mike Trout versus Miguel Cabrera AL MVP debate. It’s similar to the 2010 AL Cy Young discussion — Felix Hernandez led the AL in strikeouts and ERA but managed just a 13-12 record because Seattle couldn’t score. The new era of baseball stats won out. Voters ignored wins, which have little to do with pitching quality, and Hernandez won the award.

Likewise, Trout lags Cabrera in highly publicized but somewhat meaningless stats (RBI, Triple Crown). Some saber-men would have you believe that Trout laps Cabrera in the only stats that matter (WAR over 10 compared to 7 for Cabrera), but that requires a level of trust that I don’t have. WAR — Wins Above Replacement — is complicated to the point of complete confusion. Cabrera contributed more in some categories (doubles, homers, total bases, batting average) but less in others (triples, baserunning, defense). Is WAR capturing these contributions accurately?

True Runs Revised (A WAR Replacement)

Rather than critique WAR (which would take days), I developed a new, simpler stat: True Runs. True Runs (named in honor of my True Wins football statistic) estimates a player’s contribution to his team based only on simple statistics. I got some good comments on the methodology, and what better time to revise it than now, while listening to MVP chants ring out at Comerica Park in Detroit.

Per DRDR’s comment, I included outs/reached on error in the revised methodology:

Using data since 1990, regress total runs scored by each team each season on total singles, doubles, triples, homers, walks, hit by pitches, usual outs/reached on error, strikeouts, double plays, stolen bases, and caught stealing in that season
Take the coefficients from this regression, multiply them by each individual’s stats, and add up the result

Intuitively, the regression finds the best way to add up all these stats to most closely approximate total runs scored across all teams in all years. The result: True Runs now captures the four basic things a hitter can do at the plate — walk, get a hit, make an out/reach on an error, strikeout — as well as steals. The regression coefficients approximate how many runs each of these actions is worth, on average.*

Here’s the top 10 for 2012 across both leagues Continue reading →

6 Comments

Posted in Baseball, Commentary, Common Sense, Sports Stats

Tagged advanced baseball statistics, American League MVP, Angels playoffs, baseball, baseball statistics, baseball stats regression, Cabrera, Cabrera defense, Cabrera overrated, Cabrera Trout comparison, Cabrera Trout defensive statistics, Cabrera underrated, Cabrera vs Trout, Cabrera WAR, Carl Yastrzemski, comerica, Comerica Park, Cy Young, Cy Young Award, Cy Young Award 2010, Cy Young Felix Hernandez, Detroit Tigers, double plays, felix hernandez, Felix Hernandez WAR, Los Angeles Angels, Miguel Cabrera, Miguel Cabrera Triple Crown, Mike Trout, Most Valuable Player, MVP advanced statistics, offense scores, oWAR, performance statistics, R squared, regression analysis, regression baseball, regression coefficient, regression sports, replacement player, Run batted in, Sabermetrics, sabermetrics Mike Trout, simple sabermetrics, simplified stats, Sports, strikeout, Tigers playoffs, Total Zone Fielding Runs, Total Zone Runs, traditional metrics, Triple Crown, Trout, Trout defense, Trout overrated, Trout underrated, Trout WAR, using regression in baseball stats, WAR, WAR complicated, WAR confusing, WAR definition, WAR explanation, WAR formula, WAR MVP, who should win the AL MVP, Who's better Cabrera or Trout, Wins above replacement

Adrian the Canadian explains what happened on the infamous Hail Mary

Posted on September 30, 2012 | 1 comment

I’ll let my legal expert, Adrian the Canadian take it away (and believe it or not, this has little to do with the incompetence of the replacement referees and everything to do with the NFL’s replay review procedures):

Every football fan, even the replacement refs, was relieved when the NFL and the real officials resolved their labor dispute. The fast resolution was driven, in large part, by the result of the Monday Night game between the Seahawks and the Packers. By now, even non-fans know what happened. If you’ve been living in a shoe box, here’s the video of the call that encompasses the replacements’ legacy. But simply blaming the replacement refs doesn’t quite get us to the clearly incorrect result. Yes, the refs blew it on the field, but they also had a chance to review the play using instant replay and still allowed the call on the field to stand. How could instant replay fail to correct such an obvious mistake?

The Play:

Down by five, the Seahawks had one chance to beat the Packers: a Russell Wilson Hail Mary pass. While the pass was in the air, Seahawks receiver Golden Tate first pushed off and over a Green Bay defender Continue reading →

1 Comment

Posted in Common Sense, Football, Rules Analysis

Tagged Adrian the Canadian, football, Golden Tate, Green Bay Packers, hail mary pass, incontrovertible evidence, legal football, Mike Pereira, Monday Night Football, monday night game, National Football League, NFL, NFL legal analysis, NFL replay review legal issues, NFL replay review rules, NFL replay review rules and analysis, packers robbed, Peter King, replacement referees, rules NFL, Russell Wilson, seahawks gift, seahawks hail mary, Seattle Seahawks, simultaneous catch reviewable, simultaneous catch reviewable end zone, Sports, standard of review, tie goes to the runner, was the Seahawks play reviewable

Cabrera Might Get the Triple Crown, but Does He Deserve the MVP?

Posted on September 25, 2012 | 22 comments

Edit: Please see my later post as well, which corrects an omission here.

Miguel Cabrera has a shot at the Triple Crown this year. No one has done it since Carl Yastrzemski. Is it really possible that he could win the Triple Crown and not win the MVP? Well, yes. Every advanced stats guy out there is trumpeting Mike Trout for MVP, with his “wins above replacement” (WAR) above 10 (next best in the majors is 6.8) and his 13 “total zone total fielding runs above average” (basically, this is the number of runs he has saved with his fielding, compared to an average fielder).

The discussion is eerily similar to the AL Cy Young conversation in 2010. Felix Hernandez won because he led the AL in innings pitched, ERA, and, most importantly, WAR, even though his win-loss record was a mediocre 13-12.

The 2010 Cy Young was a victory for sabermetricians. Pitchers can’t control how many runs their offense scores. All they can do is put up a low ERA and stick around for as many innings as possible. Strikeouts help too, since they reduce the risk of errors, and walks hurt, since fielders can’t do anything about a walk. There might be some cases where pitchers rise to the occasion in a close game to get a win, but for the most part, getting a “win” has little to do with pitcher skill after accounting for pitchers’ direct performance statistics.

2012 MVP: the Saber-Men After Party?

This time around, sabermetric thinking is stacked heavily against Cabrera (and the media is paying attention):

RBIs are meaningless. After accounting for total bases and on base percentage in some way, RBIs have little to do with individual skill
Cabrera LEADS THE AL IN DOUBLE PLAYS with 28, which is not captured by any traditional stat (granted, he has Austin Jackson’s high OBP in front of him, so he has lots of chances)
Trout steals lots of bases and never gets caught (46 for 50 this year), which also isn’t captured by traditional metrics
Cabrera is a poor fielder (10 runs worse than average at third base), Trout is a good fielder (mentioned above)

All these factors lead to Trout’s 10.4 to 6.7 WAR advantage over Cabrera. If voters take these numbers seriously, it seems that we’ll be looking at another win for the number crunchers.

But What is WAR Anyway?

Four extra wins is a lot and WAR is widely accepted as meaningful, but before I leap on the Trout-wagon, is WAR actually a good statistic? Here’s a snippet from Baseball Reference’s WAR explanation:

There is no one way to determine WAR. There are hundreds of steps to make this calculation, and dozens of places where reasonable people can disagree on the best way to implement a particular part of the framework.

Uh oh . . . hundreds of steps is never a good sign, Continue reading →

22 Comments

Posted in Baseball, Commentary, Common Sense, Sports Stats

Tagged advanced baseball statistics, American League MVP, Angels playoffs, baseball, baseball statistics, Cabrera, Cabrera defense, Cabrera overrated, Cabrera Trout comparison, Cabrera Trout defensive statistics, Cabrera underrated, Cabrera vs Trout, Cabrera WAR, Carl Yastrzemski, Cy Young, Cy Young Award, Cy Young Award 2010, Cy Young Felix Hernandez, Detroit Tigers, felix hernandez, Felix Hernandez WAR, Los Angeles Angels, Miguel Cabrera, Miguel Cabrera Triple Crown, Mike Trout, Most Valuable Player, MVP advanced statistics, offense scores, oWAR, performance statistics, replacement player, Run batted in, Sabermetrics, sabermetrics Mike Trout, simple sabermetrics, simplified stats, Sports, Tigers playoffs, Total Zone Fielding Runs, Total Zone Runs, traditional metrics, Triple Crown, Trout, Trout defense, Trout overrated, Trout underrated, Trout WAR, WAR, WAR complicated, WAR confusing, WAR definition, WAR explanation, WAR formula, WAR MVP, who should win the AL MVP, Who's better Cabrera or Trout, Wins above replacement

Ranking College Football Teams

Posted on September 10, 2012 | 3 comments

As the college football season gets under way, my buddy Jeff and I put together a brand new college football ranking for ESPN the Magazine (insider required, in print 9/17/2012). We started with ESPN’s pro franchise ultimate standings as a template, and tried to make things as quantitative as we could to make the ranking defensible. We’ve inspired some feedback already. The SEC does well of course but didn’t land the number one team — check it out if you get the chance!

3 Comments

Posted in College Sports, Financial Analysis, Football

Tagged American, College and University, college football, college football ranking, college football season, ESPN, ESPN college football rankings, ESPN The Magazine, ESPN ultimate standings, football, franchise, insider, Jeff Phillips, NCAA Division I-A, Sports, Tyler Williams