2012-2013 PP rankings, and a Look at a PP Prediction

While a ton of e-ink was spilled in my last post, perhaps the most succinct thought on PP prediction came from good friend of the blog, holiday park.

“The basic problem is that relatively little time in the game is spent on special teams, and goals are only scored in a fraction of those times: the result is that teams don’t vary that much in either PP or PK efficiency. Insofar as any correlation-based method of analysis is about explaining variation in one variable in terms of variation in another, you’re kind of stuck if your dependent variable doesn’t vary all that much.”

To make sure both holiday park and I aren’t full of shit, let’s apply the correlation findings from the previous article to the 2012-2013 season, which wasn’t included in my previous analysis.

Methods

Let’s “pretend” we are starting from the half-way point in the 12-13 season, when all teams have played approximately 24 games. We take the stats that are the best predictors of PP success and give our best prediction of what the year-end GF/60 will be. We then wait a theoretical half season, compare our 1st half prediction with our 2nd half results, and see how we did.

Misses and shots were corrected for scorer bias as indicated in the previous study.
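The results below use FF/60 "regressed 33% to the mean". Here is a minimal sketch of one common way to do that, assuming the mean is the league average of the same stat and that regressing means pulling each team 33% of the way toward it. The function name and the sample numbers are made up for illustration and are not the data used in this post.

```python
def regress_to_mean(values, weight=0.33):
    """Pull each value `weight` of the way toward the group mean."""
    mean = sum(values) / len(values)
    return [v + weight * (mean - v) for v in values]


# Hypothetical raw first-half FF/60 values for three teams (not real data).
raw_ff60 = [80.0, 65.0, 50.0]
regressed = regress_to_mean(raw_ff60)

for before, after in zip(raw_ff60, regressed):
    print(f"{before:.2f} -> {after:.2f}")
```

Teams far from the average move the most and a team sitting on the average doesn't move at all, which is the whole point: extreme first-half numbers are treated as partly luck.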

Results

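Table 1 below shows FF/60 regressed 33% to the mean at the half-way point (game 24), FF/60 regressed 33% to the mean for the 2nd half of the season (game 48), and GF/60 and Pts for the 2nd half. The last two columns give the GF/60 predicted at the half-way point and the absolute error of that prediction against the 2nd half GF/60.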

Team	regressed FF/60 (game 24)	regressed FF/60 (game 48)	GF/60	Pts	Predicted GF/60	Error
ANA	74.07	71.05	5.26	27	5.93	0.67
BOS	69.89	71.68	4.43	23	5.59	1.17
BUF	66.94	60.41	4.65	28	5.36	0.71
CAR	66.25	64.60	4.58	13	5.30	0.72
CBJ	74.74	53.88	4.57	35	5.98	1.41
CGY	63.55	62.05	6.87	20	5.08	1.78
CHI	65.68	69.19	4.00	32	5.25	1.26
COL	62.24	66.52	4.95	15	4.98	0.02
DAL	59.75	63.55	4.42	22	4.78	0.36
DET	69.44	74.55	7.84	28	5.55	2.28
EDM	57.28	57.48	6.67	24	4.58	2.09
FLA	61.75	68.28	7.60	17	4.94	2.66
L.A	65.08	63.32	7.91	29	5.21	2.71
MIN	63.64	72.63	7.36	27	5.09	2.27
MTL	66.46	63.71	6.54	29	5.32	1.23
N.J	74.64	66.86	5.50	21	5.97	0.47
NSH	60.50	68.37	6.22	16	4.84	1.38
NYI	70.42	70.78	5.43	32	5.63	0.20
NYR	61.44	56.94	5.07	28	4.92	0.16
OTT	72.68	67.27	4.76	28	5.81	1.06
PHI	68.50	72.39	7.08	26	5.48	1.60
PHX	63.54	60.93	3.91	26	5.08	1.17
PIT	67.56	77.71	7.92	40	5.40	2.52
S.J	72.69	75.88	8.43	29	5.82	2.61
STL	72.55	67.51	4.08	32	5.80	1.72
T.B	61.39	59.45	5.79	19	4.91	0.88
TOR	62.85	55.55	7.53	27	5.03	2.50
VAN	63.85	68.38	5.36	31	5.11	0.25
WPG	59.14	64.45	4.62	26	4.73	0.11
WSH	69.94	70.72	10.86	36	5.59	5.27
Avg	66.28	66.20	6.01	26.20	5.30	1.44

FF/60: goals + shots + missed shots for, per 60 min of 5v4 ice time. GF/60: goals for per 60 min of 5v4 ice time. Pts: Standings points.
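For anyone wanting to check the last two columns of Table 1: Predicted GF/60 works out to roughly 0.08 times the game-24 regressed FF/60, and Error is the absolute gap between that prediction and the 2nd half GF/60. The 0.08 factor is inferred from the table's numbers rather than stated anywhere above, so treat it as an assumption. A rough sketch on four rows from the table:

```python
# Assumed conversion factor, inferred from Table 1
# (Predicted GF/60 divided by regressed FF/60 at game 24).
GOALS_PER_FENWICK = 0.08

# (team, regressed FF/60 at game 24, 2nd half GF/60), copied from Table 1.
rows = [
    ("ANA", 74.07, 5.26),
    ("CGY", 63.55, 6.87),
    ("COL", 62.24, 4.95),
    ("WSH", 69.94, 10.86),
]


def predict_gf60(regressed_ff60):
    """Predicted GF/60 under the assumed 0.08 goals per Fenwick event."""
    return regressed_ff60 * GOALS_PER_FENWICK


errors = {}
for team, ff60, actual_gf60 in rows:
    predicted = predict_gf60(ff60)
    errors[team] = abs(actual_gf60 - predicted)
    print(f"{team}: predicted {predicted:.2f}, actual {actual_gf60:.2f}, "
          f"error {errors[team]:.2f}")

# Mean absolute error over these four rows only, not the league-wide figure.
mean_abs_error = sum(errors.values()) / len(errors)
print(f"mean absolute error (4 teams): {mean_abs_error:.2f}")
```

The per-team errors agree with the table to within rounding. Note how WSH alone, with its huge 2nd half GF/60, accounts for most of the error in this small subset.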

We compare our first half regressed FF/60 with our dependent variables of interest using correlations, to show how predictive FF/60 was.

Fenwick For/60 correlations
r(self)	0.37
r(GF/60)	0.01
r(Pts)	0.43

r(x): correlation between first-half regressed FF/60 and variable x in the 2nd half, where x is the row heading (self is FF/60 itself).
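The r(GF/60) row can be checked straight from Table 1. This is a sketch that assumes the r values are plain Pearson correlations (the post doesn't say which kind), using the game-24 regressed FF/60 and 2nd half GF/60 columns in table order, ANA through WSH.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Regressed FF/60 at game 24, copied from Table 1 (ANA ... WSH).
ff60_first_half = [
    74.07, 69.89, 66.94, 66.25, 74.74, 63.55, 65.68, 62.24, 59.75, 69.44,
    57.28, 61.75, 65.08, 63.64, 66.46, 74.64, 60.50, 70.42, 61.44, 72.68,
    68.50, 63.54, 67.56, 72.69, 72.55, 61.39, 62.85, 63.85, 59.14, 69.94,
]

# 2nd half GF/60, copied from Table 1 in the same team order.
gf60_second_half = [
    5.26, 4.43, 4.65, 4.58, 4.57, 6.87, 4.00, 4.95, 4.42, 7.84,
    6.67, 7.60, 7.91, 7.36, 6.54, 5.50, 6.22, 5.43, 5.07, 4.76,
    7.08, 3.91, 7.92, 8.43, 4.08, 5.79, 7.53, 5.36, 4.62, 10.86,
]

r = pearson_r(ff60_first_half, gf60_second_half)
print(f"r(GF/60) = {r:.2f}")
```

If the Pearson assumption holds, this should land very close to zero, in line with the table above.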

Lastly, a table of 12-13 results for all games, 1-48. All data 5v4, non-empty net.

Team (12-13)	Pts	GF/60	Sh%	PP%	SF/60	FF/60	CF/60
S.J	57	6.41	11.1%	0.18	51.49	76.56	101.65
ANA	66	7.98	14.2%	0.22	48.16	74.52	107.99
PIT	72	8.12	15.0%	0.22	46.10	73.93	96.36
DET	56	6.22	12.9%	0.17	42.11	73.52	107.35
N.J	48	5.27	10.9%	0.15	43.10	71.85	102.56
BOS	62	4.82	10.2%	0.14	42.48	71.65	92.91
NYI	55	7.43	14.4%	0.20	44.29	71.59	91.73
WSH	57	9.95	20.5%	0.27	38.58	71.18	88.90
PHI	49	7.57	14.3%	0.21	45.42	71.04	93.66
STL	60	6.96	14.1%	0.20	42.49	70.87	105.76
OTT	56	5.39	10.1%	0.15	47.74	70.57	96.65
MIN	55	5.65	11.2%	0.17	44.94	67.05	92.04
CHI	77	4.60	10.9%	0.14	37.52	66.50	88.72
CBJ	55	4.52	10.2%	0.13	39.98	64.84	86.88
VAN	59	5.09	11.7%	0.15	38.27	64.34	90.96
CAR	42	4.56	9.5%	0.13	43.54	64.03	87.25
MTL	63	6.51	13.8%	0.18	40.77	63.40	78.20
FLA	36	6.88	14.1%	0.20	42.05	62.49	84.46
COL	39	5.04	11.2%	0.14	39.80	62.23	80.00
L.A	59	7.33	16.2%	0.19	37.89	62.04	103.49
NSH	41	5.72	12.6%	0.16	39.80	61.95	79.19
BUF	48	4.22	9.5%	0.12	40.06	61.25	85.14
CGY	42	6.42	15.0%	0.18	36.29	60.14	82.46
PHX	51	4.27	10.3%	0.12	37.10	59.29	84.63
WPG	51	4.69	10.7%	0.14	39.08	58.54	87.22
DAL	48	5.50	12.8%	0.16	37.39	58.25	83.74
T.B	40	5.36	13.4%	0.15	34.70	56.71	74.42
TOR	57	6.36	14.9%	0.17	36.43	55.11	77.15
NYR	56	4.83	11.6%	0.14	36.95	54.75	80.08
EDM	45	6.84	17.0%	0.19	33.50	51.87	67.95

Discussion

A little disheartening, but not entirely surprising. Our ability to predict PP success was terrible. Using the best predictor of PP success, we came out with a correlation between regressed FF/60 and GF/60 of basically 0 (0.01). Why? Because Sh% dominates GF/60 over such a small sample of 24 games. We previously showed that Sh% is almost entirely random, and therefore we have basically no capacity to predict GF/60 in 5v4 play.

To drive this home, and I fucking hate doing this, let’s look at the top 5 teams by shooting percentage over the first half of the year, and see how they did the 2nd half.

Top 5 Teams by Sh% 1-24	Sh% 1-24	Sh% 25-48
WSH	19.4%	21.6%
ANA	18.8%	9.6%
STL	18.5%	8.8%
NYI	17.3%	10.6%
PIT	17.0%	13.0%
Top 5 average	18.2%	12.7%
League Average	12.8%	13.0%

As a whole, they regressed more than we predicted. We think they will sustain about 5-7%, but over the 2nd half last year they shot 12.7% combined, just below the league average of 13.0%. This variance is absolutely expected given that I’ve only selected 5 teams, and we are only using 24 games.
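To put a number on how hard those five teams fell back, here is a quick sketch using only the figures in the table above. The "retained edge" measure (how much of the first-half gap over league average survived into the 2nd half) is my own framing for illustration, and the averages are simple means of the five team percentages.

```python
# Sh% for the top 5 first-half teams, copied from the table above.
first_half = {"WSH": 19.4, "ANA": 18.8, "STL": 18.5, "NYI": 17.3, "PIT": 17.0}
second_half = {"WSH": 21.6, "ANA": 9.6, "STL": 8.8, "NYI": 10.6, "PIT": 13.0}

# League average Sh% for each half, also from the table.
LEAGUE_FIRST_HALF = 12.8
LEAGUE_SECOND_HALF = 13.0

first_avg = sum(first_half.values()) / len(first_half)
second_avg = sum(second_half.values()) / len(second_half)

# Gap over league average in each half, in percentage points.
edge_first = first_avg - LEAGUE_FIRST_HALF
edge_second = second_avg - LEAGUE_SECOND_HALF

# Share of the first-half edge that carried over (negative means the group
# ended up below league average).
retained_edge = edge_second / edge_first

print(f"top 5 average: {first_avg:.1f}% -> {second_avg:.1f}%")
print(f"edge over league: {edge_first:+.1f} -> {edge_second:+.1f} points")
print(f"share of edge retained: {retained_edge:.0%}")
```

Four of the five teams dropped by several points, and only WSH held its number, which is about what you'd expect if first-half Sh% is mostly noise.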

For whatever reason, writing this article brings to mind all the posts leading up to the playoffs and just after the playoffs that compare PP/PK success between teams. From this and the previous article, we can conclusively say that using these stats to try to substantiate any argument is basically blatant lying. Unless we start applying stats that have reliability, we aren’t likely to gain much ground.

Conclusion

Given the above, the model performed badly at predicting GF/60 from the half-way point last season. At best, we can assume that FF/60 is probably driving PP success, but over a small sample (24 games) even FF/60 is only marginally reliable. The biggest issue is that GF/60 is heavily dominated by shooting percentage, which we showed last post to be almost entirely random.

Our analysis here showed a much lower correlation than we expected: 0.01 vs. 0.34. I still expect the 13-14 season to show at least a modest correlation (0.34) between regressed Fenwick For/60 and GF/60. Regardless, predicting PP success (and, by virtue of lower correlation coefficients, PK success) is difficult, if not impossible. Laugh at your friends who speak of teams as “strong” or “elite” citing a top PP%.

Talking Points