Rebounding and Height
To test out the height ordering measure I came up with, and to try some of the methods described in recent posts on the Sabermetric Research blog, I decided to run some correlations to look at the relationship between a player’s height and his rebounding performance.
For the 2006-07 season, I looked at all players who played at least 200 minutes (which came out to 397 players, counting stints with different teams separately). I chose 200 minutes as the cutoff because the correlations seemed to stabilize at that level (at lower cutoffs the correlations were lower because of fluky low minute guys, and at higher cutoffs the correlations were very similar to what they were at the 200 minute cutoff). The explanatory variables that I used were height (in inches) and height ordering (which is on a 1 to 5 scale, with 1 indicating that the player played all of his minutes as the shortest player on the court for his team). The response variables were defensive rebounding percentage and offensive rebounding percentage. DRB% is an opportunity rate measuring DRB/(DRB opportunities), or more specifically, DRB/(team DRB while the player was on the court + opponent ORB while the player was on the court). ORB% is similar but uses ORB opportunities. The actual formulas, which estimate the on-court part, are as follows:
DRB% = DRB/((5*MIN/tmMIN)*(tmDRB + oppORB)) ORB% = ORB/((5*MIN/tmMIN)*(tmORB + oppDRB))
Last season, among players who played at least 200 minutes, Kevin Garnett led the league in DRB% at 30.7%. Earl Boykins finished last at 5.1% for his stint in Denver. For ORB%, Justin Williams was first at 17.6%, while Keith McLeod was last at 0.3%.
Correlations and Regression Equations
Correlation coefficients (r):
DRB% ORB%
---- ----
Ht .68 .73
HtOrd .75 .76
Regression equations (Y = mX + b):
DRB% = 0.007*Ht + -0.49 or
DRB% = 0.7%*(Ht - 71)
ORB% = 0.011*Ht + -0.73 or
ORB% = 1.1%*(Ht - 66)
DRB% = 0.021*HtOrd + -0.007 or
DRB% = 2.1%*(HtOrd - 0.3)
ORB% = 0.032*HtOrd + 0.046 or
ORB% = 3.2%*(HtOrd + 1.45)
So for each additional inch of height, the average player gained 0.7% in his DRB% and 1.1% in his ORB%.
And for each additional unit of height ordering (i.e. moving up from 1st shortest to 2nd shortest, or 3rd to 4th, etc.), the average player gained 2.1% in his DRB% and 3.2% in his ORB%.
Analysis
Let’s go back to the correlation coefficients. The first thing to notice is that they are all pretty high. An r of around .7 means an r-squared around .5, which indicates that height accounts for around 50% of the variance in rebounding percentages. Whatever other factors contribute to rebounding, none will have the impact that height does.
The next thing I noticed was that for both offensive and defensive rebounding, height ordering correlated better than just plain height. This provides some evidence that the height ordering measure I came up with is picking up some important information that can’t be gained just by looking at height.
Interestingly, offensive rebounding seems to be determined by height more than defensive rebounding (.73 to .68), but this difference almost completely disappears when we look at height ordering (.76 to .75). I’d have to take a closer look to really figure out what’s going on there, but I have some initial speculation. My guess is that height ordering correlates better to DRB% than height mainly because of undersized big men who grab more defensive boards than the typical player their height simply because they are playing a position that puts them closer to the hoop on defense. In other words, a 6′8″ player who is forced to play most of his minutes as the second tallest player on the court for his team (i.e. at a height ordering of 4), will be guarding opposing PFs close to the hoop, and thus be in a better position for defensive rebounds than the typical 6′8″ player, who is guarding SFs on the perimeter (and who has a height ordering of 3).
That’s all I have for now, though I think there are a lot of interesting directions one could go in using this as a starting point. For instance, what are the implications of the fact that a huge factor in rebounding is an aspect of a player that is completely unchanging regardless of team context, coaching, player improvement, aging, etc. (though note that while height is fixed, height ordering is not)? And should we account for player height or position in evaluating rebounding, either on its own or as part of a more comprehensive metric?
Here are a few more interesting correlations that I’ll post without analysis:
Offensive Rebounding Percentage & Defensive Rebounding Percentage:
DRB%
----
ORB% .77
Block Percentage [BLK/((5*MIN/tmMIN)*opp2PA)] & Height:
BLK%
----
Ht .62
HtOrd .66
I’m pretty sure you’ve seen this already, but I kind of looked at this a couple of years ago:
http://sonicscentral.com/apbrmetrics/viewtopic.php?t=173&
Comment by edkupfer — December 1, 2007
Thanks Ed. I remembered you had looked at the different effects player offensive and defensive boards had on team numbers, but I forgot you also looked at height.
Your post later in the thread looking at how player rebounding changes when players change teams is something I’ve been looking at recently for a lot of stats and will be posting about soon.
Comment by Eli — December 2, 2007
Something else interesting to look at that I’ve been thinking about lately is how much a player’s role in the offense and from where they shoot impacts offensive rebounding. You look at KG and see that his defensive rebounding is about the best in the league but his offensive rebounding isn’t great at all. My guess isn’t because his rebounding skill is much different on the offensive end, but because he’s mainly a jump shooter (77% of his shots were jump shots according to 82games last year) and takes a lot of shots. If it ends up being important how a player plays on offense, it would be interesting to adjust for that when evaluating offensive rebounders and seeing if anyone jumps up the charts.
Comment by Ben — December 2, 2007
That would be good to look at and could be done using HotZones data.
A model of player offensive rebounding might look something like this:
ORB% = Height + Position + Offensive court location (jump shooter vs. post player) + Coaching strategy (crash the o-boards vs. get back on defense) + Rebounding skill + Luck + Unknown
With some work we might be able to assign percentages to each of those factors.
Comment by Eli — December 2, 2007
A couple thoughts:
* Why would position matter? Wouldn’t that just take into account height and offensive court location?
* I would think the hardest thing to evaluate in that equation would be coaching strategy.
* On some level, I’m not sure height should even be evaluated. Clearly it’s part of the equation but I would combine it with rebounding skill. Everything else in that equation can conceivably be controlled by the coach and context, but height can’t so why look at it? Even if we know that, say, Ben Wallace is an unbelievable player for his height, that doesn’t help anything in our evaluation of him. A better example would be Earl Boykins - he could be the best rebounder in the league, inch for inch, but that doesn’t matter. He’s still the worst rebounder in the league.
But I think it’s an interesting and worthwhile endeavor.
Comment by Ben — December 2, 2007
Position may or may not matter apart from height and offensive court location. But I wouldn’t throw it out at the start before looking at the data. It may provide something that’s missing in just looking at where a player shoots from.
For coaching strategy, I think there are some ways to isolate that, though it wouldn’t be easy. You could start by looking at the team’s offensive rebounding as a whole, their pace and fastbreak points allowed, etc. Or look at players who changed teams like Ed did in the discussion he linked to.
I agree that in some sense height is a part of rebounding skill, but I think it’s important to isolate it for a variety of reasons. One, we can accurately measure it. Two, we know it remains fixed over time in all contexts. The part of rebounding skill that’s separate from height on the other hand could fluctuate as a player works on his game, ages, gets hurt, etc. I think it’s important to see how important that part is relative to the fixed height part.
The idea of the model is to estimate the underlying elements that combine to produce the actual ORB% that players produce. Once you have that model then you can make better estimates of how the stat would change in different contexts.
Comment by Eli — December 2, 2007
Ken Pomeroy just posted a piece looking at height in the college game:
http://www.basketballprospectus.com/article.php?articleid=82
Comment by Eli — January 4, 2008