DoctorMyBrainHurts
Feb 06, 2010 Aug 06, 2010 21 137
a fan of
Carolina Hurricanes
St. Louis Blues
RSSUser Blog
Simulating Kovi
I put together a series of simulations to see what spread of shooting production we might expect to see from average players and how the variability in that production compares to what we might expect from Kovi. In looking at the data over the last few years, it appears roughly 500 players have had greater than 20 even strength shots each season, with the maximum shots around 200. I used these as the basis for my models. I'm using a coin-flipping model, with every one of the 500 players other than Kovi having a known shooting percentage of 8.0%
Save Percentage On Ice/Off Ice
I opened another can of worms.
In an earlier post, Scott Reynolds pointed out the discrepancy between on ice save percentage and off ice save percentage for Smid, Fistric, Weaver, and Giordano. So I computed it, and sure enough, Smid's on ice save percentage falls outside the 95% CI established by his off ice save percentage. Same for Weaver. Same for Fistric. Same for Giordano. Uh-oh!
Defensive Contribution and QualComp/QualTeam
I think it makes sense to incorporate QualComp and QualTeam into DGVA. It just doesn't make much difference.
Defensive Contribution
Having thoroughly messed up goaltender performance metrics, I lay awake a while last night pondering what to do next. I thought, "I'm a contrarian. Nobody looks at defensive contribution. Let's look at that."
Replacement Level Goaltenders
Replacement goaltenders are not the same as threshold goaltenders.
Comparing GGVT to GVA
For goaltenders, GGVT is the same as GVA.
Simulating GGVT
In a previous post, I looked at simulating GVA. Here I simulate goaltender performance and calculate the goaltender portion of GVT.
Calculating GVT
Turns out it's harder than you think.
Simulation of GVA and the estimation error of GVA
Sunny Mehta recently suggested that the observed range of goaltender save percentages over the course of a single season could essentially be duplicated by an average goalie and random factors. It dawned on me that, if this were true, that observed goaltender performance metrics ought to similarly be indistinguishable from metrics derived from an average goaltender plus random factors. To explore this, I ran some simulations using a pseudo-random number generator. For these simulations I used an ES save percentage of 0.917, PK save percentage of 0.868, and PP save percentage of 0.914
The range of GVA seen in a single season can be duplicated by an average goaltender and random factors. GVA is a fairly imprecise estimate, with a large standard error and a large confidence interval.
Power Play Save Percentage
I expanded my look at save percentages to power play data. Once again I calculated logits from the save percentages. I removed goalies with logits that were infinity or undefined. Power play save percentage looks a lot like the even strength data. Goalies differ, teams don't. Only here, some goalies are better than average.
Penalty Kill Save Percentage
I expanded my look at save percentages to penalty kill data. Once again I calculated logits from the save percentages. I removed goalies with logits that were infinity or undefined. Compared to even strength, the penalty kill data is a real muddle. The best model explains only about 13% of the variability seen. About 87% of the variability remains unexplained.
Beating Save Percentage to Death: The Theory Of Everything
The Big TOE. The big fella, the Lizard King. Godzilla.
Simulating Rink Bias
I ran some simulations to see if rink bias really matters. It does not seem to make much difference.
Beating save percentage to death: Teams
I used my Even Strength Save Data to look at the effect of Teams. There is not much there.
Beating save percentage to death: Logistic Regression.
I reran my Even Strength Save Data using logistic regression. The results again show a significant difference between goalies.
Beating save percentage to death: Simulation
Using a psuedo-random number generator, I simulated up to 5 seasons of goalie save data. Results after the jump.
Beating Save Percentage to Death Part 3: Goaltenders
Goalie differences
Do goalies differ? As a fan of the Quebec Nordiques, my answer is an emphatic "Yes!" I would bet fans of the pre-lockout Ottawa Senators would wholeheartedly agree.
Not everybody does. Sunny Mehta recently said "No one has conclusively shown a meaningful difference in skill between NHL goaltenders. I'm not saying it doesn't exist, I'm just saying no one's really proved it, and that all signs point to goaltending differences being far less important than everyone thinks."
Beating Save Percentage to Death Part 2: Year
I've looked at even strength save percentage for all goalies from 1997-2010. The data for 1997-1998 for special teams on the NHL.com web site has been fixed and I am now including it. In part 1 I looked at goalie age and even strength save percentage. In part 2, I look at the trend of even strength save percentage over time.
Unweighted
A first look at the relationship comes from unweighted data. Each goalie is one data point, whether he played 1 game or 80. Here is the scatter plot:
Model Summary
|
|
R |
R Square |
Adjusted R Square |
Std. Error of the Estimate |
|
|
.03 |
.00 |
.00 |
.04 |
ANOVA
|
|
|
Sum of Squares |
df |
Mean Square |
F |
Significance |
|---|---|---|---|---|---|---|
|
|
Regression |
.00 |
1 |
.00 |
1.09 |
.30 |
|
|
Residual |
1.27 |
1033 |
.00 |
|
|
|
|
Total |
1.27 |
1034 |
|
|
|
Coefficients
|
|
|
B |
Std. Error |
Beta |
t |
Significance |
|---|---|---|---|---|---|---|
|
|
(Constant) |
.31 |
.57 |
.00 |
.54 |
.68 |
|
|
Year |
.00 |
.00 |
.03 |
1.04 |
.30 |
|
|
|
|
|
|
|
|
Not a lot there.
Weighted
We can rerun the regression, using shots faced as a weighting factor. The scatter plot of the means does look promising:
And there is a significant relationship.
Model Summary
|
|
R |
R Square |
Adjusted R Square |
Std. Error of the Estimate |
|
|
.61 |
.38 |
.38 |
.00 |
ANOVA
|
|
|
Sum of Squares |
df |
Mean Square |
F |
Significance |
|---|---|---|---|---|---|---|
|
|
Regression |
0.0000 |
1.0000 |
0.0000 |
6.0317 |
0.0339 |
|
|
Residual |
0.0001 |
10.0000 |
0.0000 |
|
|
|
|
Total |
0.0001 |
11.0000 |
|
|
|
Coefficients
|
|
|
B |
Std. Error |
Beta |
t |
Significance |
|---|---|---|---|---|---|---|
|
|
(Constant) |
0.06538 |
0.35000 |
0.00000 |
0.19000 |
0.88000 |
|
|
Year |
0.00043 |
0.00020 |
0.61340 |
2.45600 |
0.03390 |
|
|
|
|
|
|
|
|
So not a huge effect from year to year, but over the 12 years the predicted average even strength save percentage does go from 0.914 to 0.919. As an aside, the 1997-1998 data put this over the hump in terms of statistical significance. Looking at the trend in overall save percentage, I suspect that, if we go back further, the trend continues. If anyone has the data from the years before 1997 and would like to make it available to me I would be deeply appreciative. If not, I can see if I can construct it myself.
Conclusion
There is a statistically significant relationship between year and even strength save percentage.
Beating Save Percentage to Death Part 1B: Young Goalies
The following goalies made their debut at 19 or 20, and had at least 3 years of data. The first four made their debut at 19, the remainder at 20.
Backstrom
Dipietro
Fleury
Ouellet
Auld
Emery
Esche
Fiset
Labarbara
Lehtonen
Luongo
Pavelec
Pelletier
Price
Ramo
Rask
Raycroft
Schwarz
One-Sample Statistics
|
|
N |
Mean |
Std. Deviation |
S.E. Mean |
|---|---|---|---|---|
|
slope |
18 |
0.005 |
0.016 |
.00 |
|
|
Test Value = 0.000000 |
|||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
95% Confidence Interval of the Difference |
|
|
|
t |
df |
Sig. (2-tailed) |
Mean Difference |
Lower |
Upper |
|
slope |
1.35 |
17 |
0.19 |
.01 |
-0.003 |
0.013 |
T-TEST
I f we include goalies who made their debut at 21, we add in:
Anderson
Aubin
Biron
Bryzgalov
Crawford
Denis
Finley
Giguere
Halak
Harding
Howard
Johnson
Leclaire
Leighton
Noronen
Quick
Toivonen
Ward
One-Sample Statistics
|
|
N |
Mean |
Std. Deviation |
S.E. Mean |
|---|---|---|---|---|
|
slope |
36 |
0.003 |
0.010 |
.00 |
|
|
Test Value = 0.000000 |
|||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
95% Confidence Interval of the Difference |
|
|
|
t |
df |
Sig. (2-tailed) |
Mean Difference |
Lower |
Upper |
|
slope |
1.18 |
35 |
.25 |
.00 |
-0.002 |
0.007 |
T-TEST
So neither group achieves statistical significance.
Beating Save Percentage to Death Part 1: Age
I've looked at even strength save percentage for all goalies from 1998-2010. The data for 1997-1998 for special teams on the NHL.com web site is flawed, so I have omitted it.
Unweighted
A first look at the relationship comes from unweighted data. Each goalie is one data point, whether he played 1 game or 80. Here is the scatter plot:

Model Summary
|
|
R |
R Square |
Adjusted R Square |
Std. Error of the Estimate |
|
|
.04 |
.00 |
.00 |
.04 |
|
|
|
Sum of Squares |
df |
Mean Square |
F |
Significance |
|---|---|---|---|---|---|---|
|
|
Regression |
.00 |
1 |
.00 |
1.23 |
.27 |
|
|
Residual |
1.22 |
960 |
.00 |
|
|
|
|
Total |
1.22 |
961 |
|
|
|
|
|
|
B |
Std. Error |
Beta |
t |
Significance |
|---|---|---|---|---|---|---|
|
|
(Constant) |
.90 |
.01 |
.00 |
129.87 |
.00 |
|
|
Age |
.00 |
.00 |
.04 |
1.11 |
.27 |
|
|
|
|
|
|
|
|
So, the slope of save percentage over age isn't just close to 0.0, it is 0.0.
Weighted
We can rerun the regression, using shots faced as a weighting factor. The scatter plot does not look any more promising:
Model Summary
|
|
R |
R Square |
Adjusted R Square |
Std. Error of the Estimate |
|
|
.11 |
.01 |
.01 |
.00 |
|
|
|
Sum of Squares |
df |
Mean Square |
F |
Significance |
|---|---|---|---|---|---|---|
|
|
Regression |
.00 |
1 |
.00 |
.26 |
.61 |
|
|
Residual |
.00 |
21 |
.00 |
|
|
|
|
Total |
.00 |
22 |
|
|
|
|
|
|
B |
Std. Error |
Beta |
t |
Significance |
|---|---|---|---|---|---|---|
|
|
(Constant) |
.91 |
.00 |
.00 |
261.15 |
.00 |
|
|
Age |
.00 |
.00 |
.11 |
.51 |
.61 |
|
|
|
|
|
|
|
|
And, indeed, it is not any better.
Serial Measures
One last way to look for a relationship would be to look at each goalie relative to himself as he ages. For this analysis, I have restricted the analysis to goalies with at least 3 seasons. For each goalie, I computed his personal slope. We can then look at the average of these slopes. The null hypothesis is that the average slope is 0.0.
DISTRIBUTION PARAMETER ESTIMATES
========================================================
Slope (N = 140) Mean = -0.001 Variance = 0.000 Std.Dev. = 0.011
0.950 Confidence Interval for mean : -0.003 to 0.001
Once again, the slope is not significantly different from 0.0.
Conclusion
Analyzing the data every way I can think up, there is no evidence whatsoever of any relationship between goalie age and even strength save percentage.
23 comments
|
1 recs |
Tweet
Showing 1 - 21 of 21


