Baseball Toaster was unplugged on February 4, 2009.
We got to start thinking beyond our guns. These days are closing fast.
"Lip" Pike Bishop in the "The Wild Bunch"
Here we are at the beginning of May with most teams having played about twenty-five games. So what have we learned? Anything besides Juan Rincon is a stupid, stupid man?
The Yankees seem to think that they've learned a great deal about their current team and decided to shake things up a bit in Billy Beane-like fashion.
Maybe the Yanks do have cause for some concern, but should the O's, D-Backs, Blue Jays, White Sox, and Nats feel that their early-season turnaround is anything more than Brian Roberts-esque, Tuffy Rhodesian, early-season mirage?
Does April mean anything to anyone besides May flowers?
I took a look at the team records for first twenty-five games of each season starting in 1901. I compared it to the team's eventual record to see what bearing it had. I also took the teams' expected winning percentage in the first 25 games and compared that to their overall record to see if maybe, just maybe, it presaged the teams' true capabilities better than just winning percentage.
What I found was the 25-game winning percentage correlates to the overall winning percentage somewhat but not all that well (coefficient of 0.625). However, it does ever so slightly better than expected winning percentage (0.617), which I did not expect.
Evidently, a large number of teams have changed their fates after 25 games and there was no indication, at least from their runs for and against, that they had it in them.
Take a look at the teams that improved the most after 25 games:
Tm | Yr | W PCT at 25G | Exp PCT (25G) | W PCT after 25G | Overall W PCT | Pos | Postseason? | Improvement | Exp Diff |
BSN | 1914 | .250 | .287 | .682 | .614 | 1 | WS Champ | .432 | .037 |
OAK | 2001 | .320 | .406 | .686 | .630 | 2 | Wildcard | .366 | .086 |
HOU | 1969 | .200 | .250 | .555 | .500 | 5 | .355 | .050 | |
STL | 1973 | .200 | .329 | .555 | .500 | 2 | .355 | .129 | |
CIN | 1986 | .240 | .321 | .584 | .531 | 2 | .344 | .081 | |
BOS | 1996 | .240 | .319 | .577 | .525 | 3 | .337 | .079 | |
FLA | 1995 | .200 | .384 | .525 | .469 | 4 | .325 | .184 | |
BRO | 1907 | .167 | .208 | .492 | .439 | 5 | .325 | .041 | |
BAL | 1988 | .080 | .193 | .382 | .335 | 7 | .302 | .113 | |
STL | 1921 | .320 | .380 | .617 | .569 | 3 | .297 | .060 | |
STL | 1947 | .333 | .434 | .623 | .578 | 2 | .290 | .101 | |
DET | 1987 | .360 | .500 | .650 | .605 | 1 | Division Champ | .290 | .140 |
KCR | 1992 | .200 | .347 | .489 | .444 | 5 | .289 | .147 | |
CIN | 1922 | .320 | .420 | .605 | .558 | 2 | .285 | .100 | |
CLE | 1902 | .280 | .387 | .559 | .507 | 5 | .279 | .107 | |
PHA | 1901 | .320 | .407 | .595 | .544 | 4 | .275 | .087 | |
KCR | 1981 | .280 | .288 | .551 | .485 | 4 | 2nd Half | .271 | .008 |
PIT | 1940 | .280 | .390 | .550 | .506 | 4 | .270 | .110 | |
STL | 1949 | .400 | .441 | .667 | .623 | 2 | .267 | .041 | |
CLE | 1969 | .160 | .188 | .426 | .385 | 6 | .266 | .028 |
It's no surprise that the Miracle Braves are number one with a 432-point improvement after 25 games. It is a surprise that just four made the playoffs though and one was via the 1981 strike while another was a wildcard. I guess there are a lot of teams improving from abysmal in the small sample size to just plain bad for the season.
Anyway, one thing that is interesting is that the difference between the expected winning percentage and the actual 25 games into the season ("Exp Diff"), is positive for the teams with the greatest improvements.
Maybe this can be used to predict a post-25G improvement. Let's see
Well, the results are that improvement after the first 25 games has even less to with a team underachieving their expected winning percentage through the first 25 games than our previous attempts at correlations through 25 games (coefficient of 0.470). So all of you using expected winning percentages in April to predict the division winnersyou know who you arecan now stop.
Just for fun, I ran the same comparisons for the last 25 games of the season. Do good teams finish stronger or start stronger? They always same a win in April is worth two in the bush, or something like that.
I found that a team's record in the final 25 games correlates better to their overall record than their record in first 25 games (coefficient of 0.707). However, the expected winning percentage (0.696) still gets beaten out by the actual winning percentage.
Apparently, the final 25 games are a much better predictor of a team's overall success. It's too bad that the season's already in the books at that point though. It is odd given that you would think a lot of teams would be clinching early on in September and may rest regulars. Meanwhile others would know their fate is settled and may play less enthusiastically. And don't forget all the youngsters who are getting a try. And yet this time of year is more meaningful than April, odd.
Here are the teams with the greatest improvements in the last 25 games:
Tm | Yr | PCT last 25G | Exp PCT | PCT before last 25G | Overall PCT | Pos | Postseason? | Biggest Improvement | Biggest Exp Diff |
CLE | 1912 | .800 | .680 | .430 | .490 | 5 | .370 | .120 | |
BSN | 1926 | .720 | .605 | .378 | .434 | 7 | .342 | .115 | |
CHW | 1987 | .760 | .676 | .423 | .475 | 5 | .337 | .084 | |
CIN | 1918 | .800 | .725 | .466 | .531 | 3 | .334 | .075 | |
MIN | 1980 | .760 | .631 | .426 | .478 | 3 | .334 | .129 | |
BAL | 1955 | .640 | .559 | .318 | .370 | 7 | .322 | .081 | |
NYG | 1916 | .833 | .801 | .516 | .566 | 4 | .318 | .033 | |
PHA | 1951 | .720 | .672 | .403 | .455 | 6 | .317 | .048 | |
NYY | 1995 | .800 | .722 | .496 | .549 | 2 | Wildcard | .304 | .078 |
SLB | 1907 | .708 | .606 | .406 | .454 | 6 | .302 | .102 | |
OAK | 2001 | .880 | .773 | .584 | .630 | 2 | Wildcard | .296 | .107 |
ANA | 1999 | .680 | .608 | .387 | .432 | 4 | .293 | .072 | |
STL | 1930 | .840 | .755 | .550 | .597 | 1 | NL Pennant | .290 | .085 |
PHI | 1983 | .800 | .676 | .511 | .556 | 1 | NL Pennant | .289 | .124 |
PHI | 1904 | .583 | .472 | .297 | .342 | 8 | .286 | .112 | |
STL | 2002 | .840 | .783 | .555 | .599 | 1 | Division Champ | .285 | .057 |
TOR | 2002 | .720 | .649 | .438 | .481 | 3 | .282 | .071 | |
CHW | 1910 | .680 | .729 | .398 | .444 | 6 | .282 | -.049 | |
CHC | 1935 | .880 | .823 | .605 | .649 | 1 | NL Pennant | .275 | .057 |
HOU | 1963 | .640 | .505 | .365 | .407 | 9 | .275 | .135 |
You'll notice that the 2001 A's make both lists, an odd feat. You may also note that six of these teams made the playoffs while only four from the previous list did. Not that that is conclusive, just interesting.
Once Elias let's me have a go at their data, I will weight it by strength of schedule.
Comment status: comments have been closed. Baseball Toaster is now out of business.