Does the Cy Young Matter Anymore?
2005-11-09 09:52
by Mike Carminati

Bartolo Colon is a very good pitcher who had a very good year this past season. I'll accept that.

He is not the best pitcher in the AL nor did he have the best pitching year in the AL this year. But he won the Cy Young award yesterday.

It seems that the sole reason to give out the award is to stir up controversy. Hardly anyone who is a serious student of the game would pick Colon number one, but in the baseball writers' world in which, apparently, wins are king, Colon rules.

I think they should rename it the Lamarr Hoyt award (but they should add a few more "r"s to his first name first). For those of you who are too young to remember, Hoyt won 24 games for the White Sox back in 1983, when they had a cartoon batter logo on the front of their uniforms and Harold Baines was the mayor of Chicago. Hoyt won the Cy Young despite a 3.66 ERA, 1.24 runs higher than league leader Rick "Don't Call me BJ" Honeycutt (who was traded midseason to the NL), and a 115 adjusted ERA (52 points worse than Honeycutt). Hoyt had just one more season with the Sox and was out of baseball altogether in three seasons.

Colon isn't as bad a choice as Hoyt was in 1983, but he ranks fourth in pitching Win Shares, fifth in BP's VORP (Value over Replacement Player), and 13th in adjusted ERA (i.e., ERA based on the park-adjusted league average or ERA+). He does, however, rank numero uno in wins, 21, with three more than the next AL pitcher. My assumption is that those voters who looked past win totals split their vote among better candidates like Johan Santana, Mariano Rivera, and Mark Buehrle.

So Colon won. There was the usual gnashing of teeth and wringing of hands, but it all seems to have gotten a bit too de rigueur.

Given that I assume that the voters will be attracted to the wrong candidates, I'm left with minor questions. For example, it's obvious that Colon was not the best candidate, but how bad was he? Was he one of the top five pitchers in the AL? And are the voters solely attracted to the win-total baubles? If so, why did Kevin Millwood receive a vote when he won just nine games to go with his 2.86 ERA?

If you want to defend the award being given to Colon, to quote Jack Nicholson in the woefully inappropriately named "As Good As It Gets", "If you're selling crazy, go sell it somewhere else. We're full up here."

Ok, first I took the top ten pitchers in VORP, P WS, and ERA+. Then I added wins, ERA, strikeout-to-walk ratio, Walks plus Hits over Innings Pitched (WHIP), and strikeouts per nine innings. I took the rankings for each of these and averaged them. Here are the results. First the Cy Young voting along with the sabermetrically leaning stats:

Pitcher                 1st  2nd  3rd  Pts   Rk  P WS  Rk  VORP  Rk  ERA+  Rk
Johan Santana, Min        3    8   12   51    3  23.1   2  73.0   1   153   3
Mark Buehrle, CHW         0    0    5    5    5  23.2   1  54.2   2   143   4
Roy Halladay, TOR         -    -    -    0    -  16.1  11  52.7   3   184   2
Bartolo Colon, LAA       17   11    0  118    1  19.2   4  51.1   5   120  13
Randy Johnson, NYY        -    -    -    0    -  16.4  10  44.1  11   117  14
Jon Garland, CHW          0    0    1    1    6  21.1   3  50.1   7   127   9
John Lackey, ANA          -    -    -    0    -  17.2   9  50.3   6   122  12
Mariano Rivera, NYY       8    7    7   68    2  17.3   8  32.3  26   323   1
Kevin Millwood, Cle       0    0    1    1    7  15.3  13  52.3   4   143   4
Jose Contreras, CWS       -    -    -    0    -  18.2   6  41.5  13   123  11
Cliff Lee, Cle            0    2    2    8    4  14.5  16  39.8  16   108  16
Freddy Garcia, CHA        -    -    -    0    -  18.6   5  45.6   9   115  15
Joe Blanton, OAK          -    -    -    0    -  14.4  18  44.3  10   127   9
Kenny Rogers, TEX         -    -    -    0    -  17.5   7  40.5  15   130   7
Jarrod Washburn, ANA      -    -    -    0    -  14.7  14  48.8   8   131   6

Next, the more conventional stats (though a number will still tick off Joe Morgan):

Pitcher                  W  Rk   ERA  Rk  WHIP  Rk  K/9IP  Rk  K/BB  Rk
Johan Santana, Min      16   5  2.87   4  0.97   3   9.25   1  5.29   2
Mark Buehrle, CHW       16   5  3.12   5  1.18   9   5.67  22  3.73   9
Roy Halladay, TOR       12  27  2.41   2  0.96   2   6.86  12  6.00   1
Bartolo Colon, LAA      21   1  3.48  10  1.16   5   6.35  18  3.65  10
Randy Johnson, NYY      17   4  3.79  20  1.13   4   8.42   5  4.49   6
Jon Garland, CHW        18   2  3.50  11  1.17   6   4.68  35  2.45  20
John Lackey, ANA        14  13  3.44   8  1.33  29   8.57   3  2.80  16
Mariano Rivera, NYY      7  58  1.38   1  0.87   1   9.19   2  4.44   7
Kevin Millwood, Cle      9  43  2.86   3  1.22  14   6.84  13  2.81  15
Jose Contreras, CWS     15   8  3.61  13  1.23  16   6.77  14  2.05  30
Cliff Lee, Cle          18   2  3.79  19  1.22  13   6.37  17  2.75  17
Freddy Garcia, CHA      14  13  3.87  23  1.25  18   5.76  21  2.43  21
Joe Blanton, OAK        12  27  3.53  12  1.22  12   5.19  28  1.73  42
Kenny Rogers, TEX       14  13  3.46   9  1.32  27   4.01  42  1.64  44
Jarrod Washburn, ANA     8  50  3.20   6  1.33  28   4.77  35  1.84  39

And the final rankings, based on the average of all rankings and then on all rankings but wins:

Pitcher                 Avg Rk  Avg Rk (w/o W)
Johan Santana, Min        2.63            2.29
Mark Buehrle, CHW         7.13            7.43
Roy Halladay, TOR         7.50            4.71
Bartolo Colon, LAA        8.25            9.29
Randy Johnson, NYY        9.25           10.00
Jon Garland, CHW         11.63           13.00
John Lackey, ANA         12.00           11.86
Mariano Rivera, NYY      13.00            6.57
Kevin Millwood, Cle      13.63            9.43
Jose Contreras, CWS      13.88           14.71
Cliff Lee, Cle           14.50           16.29
Freddy Garcia, CHA       15.63           16.00
Joe Blanton, OAK         19.75           18.71
Kenny Rogers, TEX        20.50           21.57
Jarrod Washburn, ANA     23.25           19.43
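
For anyone who wants to check the arithmetic, here is a minimal Python sketch of the averaging step, assuming a plain unweighted mean of the eight rank columns (P WS, VORP, ERA+, W, ERA, WHIP, K/9IP, K/BB). The ranks for five of the pitchers are copied from the two tables above; the names `RANKS`, `WINS_INDEX`, and `average_rank` are made up for this illustration.

```python
# Per-stat ranks copied from the two stat tables, in this column order:
# P WS, VORP, ERA+, W, ERA, WHIP, K/9IP, K/BB
RANKS = {
    "Johan Santana": [2, 1, 3, 5, 4, 3, 1, 2],
    "Mark Buehrle": [1, 2, 4, 5, 5, 9, 22, 9],
    "Roy Halladay": [11, 3, 2, 27, 2, 2, 12, 1],
    "Bartolo Colon": [4, 5, 13, 1, 10, 5, 18, 10],
    "Mariano Rivera": [8, 26, 1, 58, 1, 1, 2, 7],
}
WINS_INDEX = 3  # position of the W rank in each list (illustrative constant)


def average_rank(ranks, skip=None):
    """Unweighted mean of the ranks, optionally leaving out one column."""
    kept = [r for i, r in enumerate(ranks) if i != skip]
    return sum(kept) / len(kept)


for name, ranks in RANKS.items():
    with_wins = average_rank(ranks)
    without_wins = average_rank(ranks, skip=WINS_INDEX)
    # Three decimals are printed; the table above shows two.
    print(f"{name:16s} {with_wins:7.3f} {without_wins:7.3f}")
```

Every stat counts equally here, which is an assumption about the method rather than anything stated above; weighting the columns differently would change the order.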

Colon comes in fourth overall and fifth if we ignore wins. So it goes to show you that if your major competition is two guys who won 16 games, another who won 12 in an abbreviated season, and a closer who didn't break any records (besides, many voters will tell you closers don't belong in the Cy Young voting, just the MVP vote), that shiny 21-win brass ring is going to attract the idiotic writers. Sorry, "idiotic writers" is a bit redundant.

But wait a second, maybe I am being too mean to the idiots, I mean, writers. Just because the guy with the most wins won the award, I should not just assume that wins are the be-all and end-all. Now that we have all of these data, we can put it to the test.

How well does the actual vote correlate to win totals, or to any of the stats that we have for that matter? Let's see…

I ran the numbers, and though none has any significant correlation to the Cy Young result, some do much better than others. Here are the results. Remember that we want a negative coefficient (because the voting descends while the ranks ascend) tending toward -1.000.

Stat Rank         Correl to Pts
P WS                     -0.343
VORP                      0.086
ERA                      -0.246
K/9IP                    -0.304
K/BB                     -0.392
Avg Rk                   -0.395
Avg Rk (w/o W)           -0.432

The first thing you might notice is that VORP and Cy Young actually have the worst correlation. The CY vote actually runs slightly counter to VORP.
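
For the curious, the coefficients can be checked with a few lines of Python. This is a sketch under two assumptions: that the number is a plain Pearson correlation between vote points and each stat's rank, and that pitchers who drew no votes count as 0 points. The helper name `pearson` and the list names are made up here, and the data are the points and ranks for all 15 pitchers from the tables earlier in the piece.

```python
from math import sqrt

# Vote points for the 15 pitchers, in the tables' row order: Santana, Buehrle,
# Halladay, Colon, Johnson, Garland, Lackey, Rivera, Millwood, Contreras,
# Lee, Garcia, Blanton, Rogers, Washburn. No votes is treated as 0 points.
POINTS = [51, 5, 0, 118, 0, 1, 0, 68, 1, 0, 8, 0, 0, 0, 0]
VORP_RANK = [1, 2, 3, 5, 11, 7, 6, 26, 4, 13, 16, 9, 10, 15, 8]
PWS_RANK = [2, 1, 11, 4, 10, 3, 9, 8, 13, 6, 16, 5, 18, 7, 14]


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


print(f"P WS rank vs. points: {pearson(POINTS, PWS_RANK):+.3f}")
print(f"VORP rank vs. points: {pearson(POINTS, VORP_RANK):+.3f}")
```

With only 15 pitchers and most of them sitting at zero points, one or two outliers (Rivera's 26th-place VORP rank next to his 68 points, for instance) can move these coefficients a long way.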

But the next thing that popped out was how poorly wins did: second to last! And the average ranking without wins did better than the overall average ranking. Apparently, wins alone are not the entire basis of the writers' vote.

Of all the derived SABR stats, pitching Win Shares does best—congrats to Bill James, I guess.

Oddly, the stat that correlated best was WHIP, while the other strikeouts/walks stats did better than most. So, am I left to believe that writers base their vote on strikeouts and walks? I guess. At least it's better than wins. Maybe by 2050 the troglodytic voters will have evolved to the neanderthalic ERA. In the age of Elroy Jetson, I am sure they will be trafficking in Pitching Win Shares and VORP. Then again, I expected everyone to be zooming to work with those jet packs on their backs that we were promised were just around the corner when we were kids.

2005-11-09 10:08:53
1.   Todd S
Is it possible that some of the writers are intentionally lashing out at the more modern metrics? As in "I'll show those damn Moneyballers. I'm going to vote for whoever has the most wins, just to show 'em!" I hope that's not the case, of course, but that mentality seems to be present in the MSM right now. Even if it did factor in, it was probably a minority of the voters, but perhaps a big enough bloc to swing the vote?
2005-11-09 10:45:38
2.   nickb
Interestingly, the Bill James "Cy Young Predictor" ranked the participants in this fashion:

1. Rivera
2. Colon
3. Nathan
4. Buehrle
5. Garland
6. Santana
7. KRod
8. Baez
9. Unit
10. Lackey

2005-11-09 11:59:13
3.   YankeeInMichigan
Perhaps the writers are focusing on wins for the #1 spot and applying more open minds to the runner-up positions.
2005-11-09 21:45:25
4.   Vince Galloro
"For those of you who are too young to remember, Hoyt won 24 games for the White Sox back in 1983, when they had a cartoon batter logo on the front of their uniforms"

Actually, the Sox were wearing the uniforms with pullover jerseys and the SOX across the front in white on a blue background with red horizontal stripes above and below.

2005-11-10 19:39:20
5.   Brent is a Dodger Fan
4 Right you are! The cartoon logo was banished sometime in the early 80s or late 70s, can't recall exactly.

Mike: I want to quibble a bit with your method. Just a bit, I promise. You really shouldn't average the ranking across stats. It's sort of like averaging averages -- a statistical mistake.

The reason is this: Let's say one pitcher had an astonishing lead over the next-best pitcher in some stat, like Santana's dominance in VORP (35% more VORP than the next guy!).

Move on to another stat: Win Shares. Buehrle's 23.2 gets him the 1, Santana's 23.1 gets him the 2.

Wow. Less than 1% difference in P WS but 35% in VORP? And it counts the same?

Instead, you need to find a way to normalize all the stats and then add them, then rank the result.

The way I did it was as follows: For each stat, divide each player's stat by the best in category (or, when lower is better, divide the best by the player's stat). This results in a ratio where the best player has 1.0 and all the players with worse stats get some ratio, like .94, or .75. Now these numbers are all on the same scale! Sum up the set of ratios. Then rank them.

This method, using all the stats you used, results in Santana, Buehrle, Garland, Colon then Garcia. I'm not saying it is going to give a different end result, I just hate to see that statistical mistake made.
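
The ratio-to-best idea described above can be sketched in Python roughly as follows. This is an illustration only, not a reproduction of the ranking quoted in this comment: it uses just three stats (P WS, VORP, ERA) for four pitchers, with values taken from the article's tables, and the names `STATS`, `LOWER_IS_BETTER`, and `normalized_scores` are made up for the example.

```python
# A small subset of the article's numbers, chosen only for illustration.
STATS = {
    "Santana": {"P WS": 23.1, "VORP": 73.0, "ERA": 2.87},
    "Buehrle": {"P WS": 23.2, "VORP": 54.2, "ERA": 3.12},
    "Halladay": {"P WS": 16.1, "VORP": 52.7, "ERA": 2.41},
    "Colon": {"P WS": 19.2, "VORP": 51.1, "ERA": 3.48},
}
LOWER_IS_BETTER = {"ERA"}  # stats where a smaller value is the better one


def normalized_scores(stats):
    """Sum each player's ratio to the category leader across all stats."""
    names = list(stats)
    categories = list(next(iter(stats.values())))
    scores = dict.fromkeys(names, 0.0)
    for cat in categories:
        values = [stats[name][cat] for name in names]
        if cat in LOWER_IS_BETTER:
            best = min(values)
            for name in names:
                scores[name] += best / stats[name][cat]  # leader gets 1.0
        else:
            best = max(values)
            for name in names:
                scores[name] += stats[name][cat] / best  # leader gets 1.0
    return scores


scores = normalized_scores(STATS)
ranking = sorted(scores, key=scores.get, reverse=True)
for name in ranking:
    print(f"{name:10s} {scores[name]:.3f}")
```

On this subset, the near-tie in P WS costs Santana almost nothing while his VORP lead counts in full, which is the point of scaling before summing. The result still depends on which stats and which pitchers go into the pool.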

2005-11-11 05:49:07
6.   Mike Carminati

I realize that it's just averaging rankings. It's not perfect, but it's not like averaging averages, it's like averaging rankings, which is of course what it is. If you want to weight the stats, that's fine.

And I apologizing the honor of these fine unis:

2005-11-11 07:40:29
7.   PhillyJ
Would the times (albeit few) that a reliever won the award have any bearing on the wins/CYA correlation?

Is it significant enough that if you removed the relievers the correlation would be higher?

2005-11-11 10:30:38
8.   Mike Carminati
"And I apologizing the honor of these fine unis:"

Huh? That's, "I apologize for besmirching the honor of..." (or words to that effect).

2005-11-11 13:27:01
9.   Brent is a Dodger Fan
6 Mike: you say tomay-to, I say tomah-to...

I believe averaging rankings is in the same category of statistical error as averaging averages, but I think we can agree that you were attempting a quick approach rather than a purely scientific one. Or, at least one that is more likely to be used by sportswriters: look at some stats and see what it looks like to you.

More scientific approaches are already available: VORP and P WS are attempts at incorporating multiple stats, like K, BB, IP, H, and yielding one measure that is predictive of contribution towards winning games. Though I've never seen the research, these stats presumably have more scientific value on their own than attempting to average rankings, or weight performance across individual stats, like the method shown above.

2005-11-11 13:33:17
10.   Brent is a Dodger Fan
6 Oh, and it looks like the years when the cartoon Sox was on the sleeve were 1971-1975, like here:

I guess the Sox couldn't really decide what their colors are:
1960: Black and white
1971: Red and white
1976: Black and white again
1982: Red white and blue
1987: Black and white again

