Basketball eigenspectra – applying principal component analysis to SportsVU data

In trying to think of things to do with sportsVU player-tracking data if I ever had the chance to work with it, one of the first things that came to mind was to create “eigenspectra”. So I was really¬†excited when I saw that the raw data was available from github and I finally had the … More Basketball eigenspectra – applying principal component analysis to SportsVU data

Graphicacy major league dataviz challenge – my entry and the winners

As I mentioned in my previous post, I submitted an entry to the Graphicacy major league data challenge, and I want to share my entry here. I will describe my entry in more detail below, but here is a screenshot. A fully interactive version is available here.¬†My entry was awarded an honorable mention, which puts … More Graphicacy major league dataviz challenge – my entry and the winners

Graphicacy Major League Data Challenge

A few days ago an announcement for a data visualization challenge came across my twitter feed. It is organized by Graphicacy, and asks participants to visualize the careers of the top 20 players in baseball history. More information is available here http://www.majorleaguedatachallenge.com If you’re reading this, it’s probably too late to enter – entries due … More Graphicacy Major League Data Challenge

an investigation of DRA – impact of catchers

In my previous post I posted some code to make DRA-value and retro-CSAA databases, starting from retrosheet + baseball reference WAR (for fielding) + lahman (for ID matching) databases, and provided a link to some data for 1997-2004. I’ve run the models for some additional years, and done some analysis of the results. The first … More an investigation of DRA – impact of catchers

building CSAA and DRA-value databases using retrosheet

Several weeks ago, the crew at baseball prospectus put out a new pitching metric called DRA, or Deserved Run Average. It is distinctive in building in controls for lots of different factors, for including pitcher catcher framing, and for applying the technique of mixed-effect statistical modeling. The authors have put forth a tremendous amount of … More building CSAA and DRA-value databases using retrosheet

the tombstone proposal for the NBA draft

fivethirtyeight recently did an interesting crowd-sourced exercise on looking for ways to fix the NBA draft and address the issue of tanking, http://fivethirtyeight.com/datalab/the-6493-best-ideas-to-prevent-tanking-in-the-nba/ The purpose of this post is to look at the proposal to award draft position by wins-after-elimination, or the so-called tombstone proposal. Specifically, the tombstone proposal says , weight the draft picks … More the tombstone proposal for the NBA draft