Baseball Toaster was unplugged on February 4, 2009.
Oftentimes, to win us to our harm,
The instruments of darkness tell us truths,
Win us with honest trifles, to betray’s
In deepest consequence.
—"Dave" Banquo in William "Author" Shakespeare's Macbeth
To win by strategy is no less the role of a general than to win by arms.
—Julius "Matos" Caesar
The high sentiments always win in the end, the leaders who offer blood, toil, tears and sweat always get more out of their followers than those who offer safety and a good time. When it comes to the pinch, human beings are heroic.
—George "Bamberger" Orwell, who obviously never read the ending to his book 1984.
Wesley: Excuse me? Is this something we should all be talking about?
Angel: No... it was mostly... theoretical.
Spike: Look, if cavemen and astronauts got into a fight, who would win?
Wesley: You've been arguing for forty minutes about this. (thinks) Do the astronauts have weapons?
Spike, Angel: No. (in unison)
—(Anaheim) Angel
Winning, that's the point of the game. Even a Phils fan like myself knows that.
So which are the teams that win? Well, obviously teams with more talent win more often. However, how do you measure that? Is it more due to their offensive prowess or pitching skills and how to quantify that?
A team's winning percentage in one-run ballgames is often cited by analysts (right after telling you a certain batter is 2-for-6 with a homer off a certain pitcher, as if that has any meaning). Are there certain traits, like winning close ballgames, that winning teams display? Can one predict a team's success from how they perform under certain conditions like close ballgames?
OK, enough with the question. After being directed to Baseball Reference's situations splits page by my friend Gregor Gross, and after kvelling an appropriate amount of time, I attempted to address these issues.
For offensive stats, I used the yearly team batting ratios: average, on-base, slugging, and OPS. For pitching, the following stats were recorded each year for each team: ERA, WHIP (walks plus hits per innings pitched), Strikeout-to-walk ratio (K:BB), Strikeouts-per-nine-innings (K/9IP). Then all stats were adjusted for by dividing by the league average. Finally, they were each compared against the team winning percentage and then the correlation coefficient, or how well each set of data corresponds to winning percentage, was determined for each decade and for all-time. I also adjusted the batting and pitching stats by the team's home ballpark factor and re-ran the correlation.
Then, I took the situational winning percentage for a set of criteria and tried to see how well each correlated to overall winning percentage. Those situations, which I kind of picked out of a hat, were the team's yearly record in home games, road games, one-run games, games won by no more than three runs ("Save" games), low-scoring games (five or less total runs), high-scoring games (10 or more total runs), and games with a large margin of victory (five or more runs). I also took the runs scored for and against in the above situations and calculated the expected winning percentage to determine if that correlated to overall winning percentage better (using the 1.83-exponent Pythagorean formula).
So without further ado, here are the results such as they are:
Decade | BA AL | OBP AL | SLUG AL | OPS AL |
Total | 0.674 | 0.724 | 0.689 | 0.735 |
1870s | 0.792 | 0.795 | 0.802 | 0.805 |
1880s | 0.740 | 0.783 | 0.710 | 0.755 |
1890s | 0.747 | 0.746 | 0.712 | 0.759 |
1900s | 0.684 | 0.750 | 0.689 | 0.754 |
1910s | 0.607 | 0.642 | 0.653 | 0.686 |
1920s | 0.669 | 0.717 | 0.695 | 0.739 |
1930s | 0.635 | 0.691 | 0.718 | 0.735 |
1940s | 0.698 | 0.738 | 0.709 | 0.759 |
1950s | 0.653 | 0.725 | 0.721 | 0.771 |
1960s | 0.605 | 0.608 | 0.590 | 0.627 |
1970s | 0.637 | 0.702 | 0.706 | 0.754 |
1980s | 0.509 | 0.615 | 0.595 | 0.655 |
1990s | 0.504 | 0.659 | 0.563 | 0.644 |
2000s | 0.548 | 0.697 | 0.576 | 0.642 |
Of course, as baseball matured batting average became less important to winning, but don’t tell Joe Morgan. Overall OPS correlates best to winning percentage, but in the 2000s OBP is beating OPS, which was a bit of a surprise.
Decade | BA AL | OBP AL | SLUG AL | OPS AL |
Total | 0.608 | 0.642 | 0.667 | 0.695 |
1870s | 0.725 | 0.726 | 0.757 | 0.752 |
1880s | 0.658 | 0.680 | 0.651 | 0.684 |
1890s | 0.552 | 0.524 | 0.566 | 0.578 |
1900s | 0.545 | 0.596 | 0.611 | 0.655 |
1910s | 0.558 | 0.583 | 0.637 | 0.654 |
1920s | 0.629 | 0.658 | 0.715 | 0.738 |
1930s | 0.597 | 0.652 | 0.718 | 0.725 |
1940s | 0.533 | 0.629 | 0.656 | 0.693 |
1950s | 0.626 | 0.746 | 0.726 | 0.781 |
1960s | 0.584 | 0.575 | 0.617 | 0.652 |
1970s | 0.627 | 0.667 | 0.747 | 0.785 |
1980s | 0.531 | 0.608 | 0.632 | 0.676 |
1990s | 0.534 | 0.600 | 0.623 | 0.669 |
2000s | 0.594 | 0.641 | 0.628 | 0.662 |
When you adjust for ballpark OPS does beat OBP. However, you'll notice that none of these correlations is very strong.
[Note: negative correlation just means that as one goes down the other rises, but still expresses a correlation. E.g., as ERAs go down winning percentages tend to go up. Makes sense.]
Decade | ERA AL | WHIP AL | K-BB AL | K-9IP AL |
Total | -0.781 | -0.761 | 0.458 | 0.321 |
1870s | -0.819 | -0.759 | 0.071 | 0.118 |
1880s | -0.829 | -0.839 | 0.498 | 0.399 |
1890s | -0.818 | -0.834 | 0.598 | 0.377 |
1900s | -0.811 | -0.779 | 0.593 | 0.357 |
1910s | -0.783 | -0.753 | 0.573 | 0.382 |
1920s | -0.759 | -0.740 | 0.661 | 0.464 |
1930s | -0.794 | -0.758 | 0.739 | 0.604 |
1940s | -0.813 | -0.759 | 0.649 | 0.492 |
1950s | -0.796 | -0.760 | 0.616 | 0.487 |
1960s | -0.727 | -0.705 | 0.525 | 0.359 |
1970s | -0.689 | -0.722 | 0.558 | -0.010 |
1980s | -0.657 | -0.674 | 0.484 | 0.195 |
1990s | -0.663 | -0.652 | 0.555 | 0.355 |
2000s | -0.764 | -0.769 | 0.606 | 0.443 |
ERA correlates best here overall, though WHIP has surpassed ERA in the 2000s. The strikeout ratios are gaining strength but still don't correlate very well.
Decade | ERA AL | WHIP AL | K-BB AL | K-9IP AL |
Total | -0.782 | -0.731 | 0.463 | 0.334 |
1870s | -0.811 | -0.730 | 0.071 | 0.119 |
1880s | -0.835 | -0.831 | 0.480 | 0.365 |
1890s | -0.816 | -0.826 | 0.586 | 0.344 |
1900s | -0.818 | -0.783 | 0.587 | 0.358 |
1910s | -0.769 | -0.702 | 0.588 | 0.426 |
1920s | -0.745 | -0.688 | 0.672 | 0.497 |
1930s | -0.786 | -0.704 | 0.754 | 0.633 |
1940s | -0.816 | -0.737 | 0.658 | 0.509 |
1950s | -0.794 | -0.664 | 0.651 | 0.553 |
1960s | -0.738 | -0.685 | 0.533 | 0.399 |
1970s | -0.707 | -0.701 | 0.574 | 0.030 |
1980s | -0.644 | -0.608 | 0.506 | 0.240 |
1990s | -0.674 | -0.613 | 0.563 | 0.377 |
2000s | -0.789 | -0.735 | 0.609 | 0.445 |
ERA does surpass WHIP after the park adjustment, like OPS and OBP in batting.
Decade | Home Win% | Home Exp Win% | Road Win% | Road Exp Win% |
Total | 0.897 | 0.869 | 0.897 | 0.866 |
1900s | 0.913 | 0.876 | 0.929 | 0.900 |
1910s | 0.918 | 0.885 | 0.915 | 0.896 |
1920s | 0.923 | 0.903 | 0.926 | 0.903 |
1930s | 0.936 | 0.914 | 0.932 | 0.900 |
1940s | 0.906 | 0.871 | 0.900 | 0.903 |
1950s | 0.902 | 0.891 | 0.908 | 0.872 |
1960s | 0.887 | 0.875 | 0.884 | 0.852 |
1970s | 0.892 | 0.865 | 0.856 | 0.812 |
1980s | 0.850 | 0.804 | 0.862 | 0.798 |
1990s | 0.851 | 0.811 | 0.856 | 0.822 |
2000s | 0.886 | 0.875 | 0.897 | 0.855 |
Home and road records correlate equally well to winning percentage though for the last 25 years, road records have ruled. Also the expected home record beats out the road one. However, both of these are pretty strong correlations, which makes sense since they comprise about a half a season each.
Decade | One-run Win% | One-run Exp Win% | Save Win% | Save Exp Win% |
Total | 0.622 | 0.622 | 0.888 | 0.866 |
1900s | 0.759 | 0.755 | 0.922 | 0.915 |
1910s | 0.744 | 0.728 | 0.925 | 0.901 |
1920s | 0.638 | 0.638 | 0.887 | 0.863 |
1930s | 0.566 | 0.561 | 0.900 | 0.887 |
1940s | 0.570 | 0.565 | 0.886 | 0.870 |
1950s | 0.588 | 0.592 | 0.864 | 0.867 |
1960s | 0.676 | 0.666 | 0.919 | 0.885 |
1970s | 0.567 | 0.547 | 0.865 | 0.841 |
1980s | 0.624 | 0.615 | 0.880 | 0.836 |
1990s | 0.566 | 0.573 | 0.828 | 0.795 |
2000s | 0.520 | 0.532 | 0.885 | 0.883 |
One-run record don't correlate well and are doing worse as time goes by. "Save" records, close games, do correlate fairly well, which is why, I guess, they came up with the concept of a save in the first place.
Decade | Low Scoring Win% | Low Scoring Exp Win% | High Scoring Win% | High Scoring Exp Win% |
Total | 0.692 | 0.694 | 0.869 | 0.846 |
1900s | 0.795 | 0.794 | 0.910 | 0.875 |
1910s | 0.830 | 0.795 | 0.833 | 0.810 |
1920s | 0.621 | 0.605 | 0.917 | 0.910 |
1930s | 0.729 | 0.737 | 0.923 | 0.908 |
1940s | 0.792 | 0.832 | 0.909 | 0.885 |
1950s | 0.695 | 0.723 | 0.885 | 0.879 |
1960s | 0.745 | 0.732 | 0.840 | 0.804 |
1970s | 0.660 | 0.657 | 0.847 | 0.815 |
1980s | 0.641 | 0.602 | 0.766 | 0.731 |
1990s | 0.576 | 0.572 | 0.837 | 0.814 |
2000s | 0.556 | 0.565 | 0.883 | 0.858 |
This one's odd: Low-scoring record don't correlate very well at all, but high-scoring correlates extremely well, better in fact than based on the expected winning percentage. Maybe the Rockies should take note.
Decade | Large Margin Win% | Large Margin Exp Win% |
Total | 0.862 | 0.851 |
1900s | 0.893 | 0.887 |
1910s | 0.862 | 0.847 |
1920s | 0.918 | 0.903 |
1930s | 0.906 | 0.900 |
1940s | 0.891 | 0.881 |
1950s | 0.865 | 0.872 |
1960s | 0.862 | 0.846 |
1970s | 0.840 | 0.827 |
1980s | 0.773 | 0.749 |
1990s | 0.804 | 0.801 |
2000s | 0.863 | 0.865 |
Again, it looks like higher scoring affairs are better predictors of a team's success or at least correlate better to their winning percentage. Perhaps this is due to a low-scoring game being more based on the pitchers' performance than the teams as a whole.
Comment status: comments have been closed. Baseball Toaster is now out of business.