<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-31545676</id><updated>2012-02-01T23:54:55.066-05:00</updated><category term='VORP'/><category term='NASCAR'/><category term='thaler'/><category term='King Kaufman'/><category term='forecasting'/><category term='Yankees'/><category term='double play'/><category term='Pirates'/><category term='Don Cherry'/><category term='.300 hitters'/><category term='strategy'/><category term='lemons'/><category term='Ryder'/><category term='McCown'/><category term='Blyleven'/><category term='sabermetrics'/><category term='analytics'/><category term='income inequality'/><category term='NBA'/><category term='self-promotion'/><category term='salary cap'/><category term='taxes'/><category term='Ken Dryden'/><category term='blink'/><category term='Forbes'/><category term='protection'/><category term='Balsillie'/><category term='weather'/><category term='baseball'/><category term='false positives'/><category term='pinball'/><category term='soccer'/><category term='jet lag'/><category term='super crunchers'/><category term='numeracy'/><category term='hot hand'/><category term='luck'/><category term='on the forecheck'/><category term='Hamermesh update'/><category term='Consumer Reports'/><category term='Hamermesh'/><category term='swimming'/><category term='free throws'/><category term='payroll'/><category term='SABR 38'/><category term='statistics'/><category term='cancers'/><category term='race'/><category term='JQAS'/><category term='tennis'/><category term='win probabilities'/><category term='Alan Schwarz'/><category term='pythagoras'/><category term='circadian advantage'/><category term='SABR'/><category term='umpires'/><category term='nate silver'/><category term='steroids'/><category term='pop music'/><category term='wine'/><category term='rebounds'/><category term='wealth inequality'/><category term='volleyball'/><category term='olympics'/><category term='salmon'/><category term='Wonderlic'/><category term='charity'/><category term='bet'/><category term='physics'/><category term='horse racing'/><category term='targeting'/><category term='goaltending'/><category term='speed'/><category term='new york times'/><category term='Cubs'/><category term='golf'/><category term='hall of fame'/><category term='markov'/><category term='Andrecheck'/><category term='rugby'/><category term='fighting'/><category term='curling'/><category term='streakiness'/><category term='scouting'/><category term='siblings'/><category term='KFC'/><category term='critiquing a study based on secondhand sources'/><category term='skating'/><category term='smoking'/><category term='point shaving'/><category term='Cy Young'/><category term='gender'/><category term='massey'/><category term='Clemens'/><category term='hockey'/><category term='worldofwarcraftometrics'/><category term='Pavitt'/><category term='symposium'/><category term='baserunning'/><category term='interest rates'/><category term='managers'/><category term='NHL'/><category term='Bloomberg'/><category term='australian rules football'/><category term='basketball'/><category term='billy beane'/><category term='Heisman'/><category term='JSE'/><category term='competitive balance'/><category term='wild pitches'/><category term='clutch'/><category term='RBIs'/><category term='game theory'/><category term='martians'/><category term='uncertainty of outcome'/><category term='regression'/><category term='psychology'/><category term='power play'/><category term='game time'/><category term='quidditch'/><category term='Warren Buffett'/><category term='goalies'/><category term='interactions'/><category term='bayes'/><category term='gag law'/><category term='regression to the mean'/><category term='rudeness'/><category term='BABIP'/><category term='simulation'/><category term='plus-minus'/><category term='faceoff'/><category term='coaches'/><category term='choking'/><category term='Bill James'/><category term='trades'/><category term='Bill Madden'/><category term='Don Coffin'/><category term='climate change'/><category term='sino-forest'/><category term='GiveWell'/><category term='linear weights'/><category term='Landsburg'/><category term='Stein&apos;s Paradox'/><category term='OSC'/><category term='color'/><category term='book review'/><category term='NFL'/><category term='fun'/><category term='Moneyball'/><category term='lineups'/><category term='sportswriting'/><category term='Wayne Winston'/><category term='babies'/><category term='fielding'/><category term='NCAA'/><category term='goons'/><category term='distribution of talent'/><category term='Scorecasting'/><category term='cricket'/><category term='picasso'/><category term='Charlie Pavitt'/><category term='aging'/><category term='Baseball Analyst'/><category term='academics'/><category term='cheating'/><category term='shootout'/><category term='crime'/><category term='peer review'/><category term='internet'/><category term='football'/><category term='boxing'/><category term='Donaghy'/><category term='count'/><category term='Jeter'/><category term='track and field'/><category term='replacement level'/><category term='The Economist'/><category term='overtime'/><category term='pitching'/><category term='BRJ'/><category term='DiMaggio'/><category term='referee bias'/><category term='Stumbling on Wins'/><category term='draft'/><category term='kickers'/><category term='fastballs'/><category term='Larry Summers'/><category term='options'/><category term='expansion'/><category term='penalties'/><category term='home field advantage'/><category term='Canseco'/><category term='economics'/><category term='The Wages of Wins'/><category term='gini'/><category term='aggression'/><category term='gambling'/><category term='runs created'/><category term='scientific method'/><category term='free speech'/><category term='Berri'/><category term='medicine'/><title type='text'>Sabermetric Research</title><subtitle type='html'>Links to and reviews of sabermetric studies and sports research</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><link rel='next' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default?start-index=101&amp;max-results=100'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>555</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-31545676.post-3436532928240326420</id><published>2012-01-30T10:34:00.008-05:00</published><updated>2012-01-30T10:45:26.298-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='hockey'/><category scheme='http://www.blogger.com/atom/ns#' term='referee bias'/><category scheme='http://www.blogger.com/atom/ns#' term='penalties'/><category scheme='http://www.blogger.com/atom/ns#' term='NHL'/><title type='text'>Do NHL teams get a boost after killing a two-man advantage?</title><content type='html'>&lt;span style="font-family:verdana;"&gt;In an OHL game I was watching the other day, one of the teams had a two-man advantage and didn't score.  The announcer was disappointed that the shorthanded team to get a boost from having killed off the penalties, as conventional wisdom says they should.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Is conventional wisdom right?  Now that I have access to a database of NHL games (thanks again to the &lt;a href="http://hsp.flyershistory.com/"&gt;Hockey Summary Project&lt;/a&gt;), I was able to check.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;This study is basically the same format as the study I did on fights a few weeks back.  I found all games from 1967-68 to 1984-85 where one team killed off a two-man advantage (of any length).  Then, I found a random control game, which matched the score differential and the relative quality of the home and road teams.  When I was done, I had two pools, each comprised of 1,703 games.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The teams that killed the penalties scored an average 0.26 more goals than their opponents from that point to the end of the game (actually, to the 17:00 mark of the third period).  On the other hand, the control team scored only 0.12 more goals then their opponents.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That's statistically significant, at almost exactly 2 SDs.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I'll put that in chart form to make it easier to read, along with the SD.  I use the term "killing teams" to mean the ones that actually killed off the two-man advantage.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Killing teams .... +0.26 goals (+/- 0.05)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Control teams .... +0.12 goals (+/- 0.05)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;-------------------------------------&lt;/span&gt;-----&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Difference ....... +0.14 goals (+/- 0.07)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;At six goals per win, you'd have expected the extra goals to have resulted in around 40 extra wins.  They actually resulted in 32 extra wins.  Actually, 36 extra wins, minus 8 fewer ties:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Killing teams .... 836-604-263&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Control teams .... 806-626-271&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;------------------------------------&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Difference ....... +36 wins, -8 ties&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, should we conclude that killing off a two-man advantage causes a psychological boost?  Well, not so fast.  Because, after you take two consecutive penalties, the referee is very likely to try to even things up by giving &lt;a href="http://sabermetricresearch.blogspot.com/2011/12/do-nhl-referees-call-make-up-penalties.html"&gt;future&lt;/a&gt; penalties to the other team.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The difference of +0.14 goals is almost exactly what you'd get from a single power play.  So, if the result of surviving a two-man advantage is that you get one extra "free" power play in the remainder of the game, that would explain the results exactly.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;As it turns out, it's not quite that high.  It's only half that high.  On average, the teams that survived being shorthanded two men got about half an extra power play in the remainder of the game:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Killing teams ... +.346 power plays rest of game&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Control teams ... -.130 power plays rest of game&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;------------------------------------------------&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Difference ...... +.476 power plays rest of game&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That leaves about 0.07 goals per game as the unexplained difference.  It's only 1 SD, which is no longer statistically significant. It's about the effect of half a power play.  Or, with an average save percentage of .900, it works out to 7/10 of an additional shot on goal.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;We can also handle the penalty issue another way.  We can insist that when we choose a control game for the real game, we make sure the control team was the lone who took the last penalty.  That way, we'd expect some of the referee "evening up" difference to disappear.  Perhaps not all of it, because a two-man advantage isn't the same as a one-man advantage -- but at least part of it.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The additional restriction reduced the sample size to 1,662 games; for the remaining 41 games, I couldn't find a suitable control.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;As it turns out, the goal difference stays about the same, even though the penalty difference is significantly reduced:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Killing teams ... +0.25 goals (+/- 0.05)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Control teams ... +0.08 goals (+/- 0.05)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;-------------------------------------&lt;/span&gt;---&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Difference ...... +0.17 goals (+/- 0.07)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Killing teams ... +.340 power plays rest of game&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Control teams ... +.032 power plays rest of game&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;------------------------------------------------&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Difference ...... +.308 power plays rest of game&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The difference of .308 power plays accounts for around .04 goals of the observed .17 difference.  That leaves .13, which is a little less than 2 SD from zero.  Not statistically significant, but close.  (Technically, it's even less than that, because the control games aren't completely independent.  Also, when I ran the study a second time, I got +0.10 goals instead of +0.08, which lowers the difference.  So think of the 1.9 SD as probably a bit too high.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Strangely, though, there wasn't as much difference in game results; only the equivalent of 13.5 wins:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Killing teams ... 815-591-256&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Control teams ... 807-610-245&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;------------------------------&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Difference: +8 wins, +11 ties&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Again at six goals per win, you'd expect 47 wins, not 13.5.  What happened?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Well, it turns out that the "killing" teams spent a lot of their goals winning blowouts.  For instance, in games won by six goals or more, they were 81-34.  The control group was only 73-51.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In those games, the difference was 12.5 wins.  That normally "costs" 75 goals, but, for these games, the difference was really around 150 goals.  So, that accounts for 75 of the 282 goal difference right there. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The "killing" group also "wasted" goals in the 3- and 4-goal games.  That was offset by the opposite effect in five-goal games, but not by much.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;If you recall, we found the same effect when we looked at fighting: teams that started a fight appeared to score &lt;a href="http://sabermetricresearch.blogspot.com/2012/01/do-hockey-fights-lift-teams-performance.html"&gt;more goals&lt;/a&gt;, but not necessarily win &lt;a href="http://sabermetricresearch.blogspot.com/2012/01/do-hockey-fights-lift-teams-performance_13.html"&gt;more games&lt;/a&gt;.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;What connects the two studies is ... penalties.  It could be that teams that get penalized a lot win a lot of blowouts.  Not necessarily because of cause-and-effect, but because it just so happened that, between 1967 and 1984, certain teams just happened to be high in both categories.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Or, it could be coincidence.  Or, it could be something else.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;For my bottom line, I'd say: after killing off a two-man advantage, teams did appear to benefit by about 1/7 of a goal.  Half of that can be traced to referees calling fewer penalties against them in the remainder of the game.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The other half is unknown.  It's not statistically significant, so you have to give serious consideration to the idea that it's just coincidence ... but the teams *did* appear to benefit, by around 0.07 goals.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Historically, the average size of the "boost" in a team's play after a two-man kill has been small: the equivalent of less than a single shot on goal over the remainder of the game.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-3436532928240326420?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/3436532928240326420/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=3436532928240326420' title='6 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/3436532928240326420'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/3436532928240326420'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2012/01/do-nhl-teams-get-boost-after-killing.html' title='Do NHL teams get a boost after killing a two-man advantage?'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>6</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-8090602528949050136</id><published>2012-01-20T13:02:00.008-05:00</published><updated>2012-01-20T16:03:34.707-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='charity'/><category scheme='http://www.blogger.com/atom/ns#' term='GiveWell'/><category scheme='http://www.blogger.com/atom/ns#' term='academics'/><category scheme='http://www.blogger.com/atom/ns#' term='bayes'/><title type='text'>GiveWell: Overcomplicating research studies can cost lives</title><content type='html'>&lt;span style="font-family:verdana;"&gt;"&lt;a href="http://www.givewell.org/"&gt;GiveWell&lt;/a&gt;" is an organization that evaluates charities.  Not just the usual things -- how well they're run, or how much money goes to administrative expenses -- but also how much good they do for the money they receive.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The idea is: if you have $100 to give to try to make the world a better place, shouldn't you give that $100 where it would give the most benefit?  Not just to whoever shows up at your door that day, or whatever organization makes you feel guiltiest, or whoever's suffering kids look the cutest ... but, seriously, to where you can do the most good.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That might not appeal to everyone.  If you donate to maximize your own good feelings, instead of the good your donation actually does, GiveWell's evaluations won't make much difference to you.  Some people hate to say "no", and so they prefer to give $5 to each of the twenty charities that ask for money.  Some people prefer to give to diseases that killed their loved ones, or diseases associated with heroes like Terry Fox.  Some people give to causes that signal their political views.  Most people prefer to give to help people in their own city or country, even when their dollars will save many more lives abroad.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;(I've done all these things, and I'm bit embarrassed about some of them.  But I'm not alone.  I mean, people give money to the Children's Wish Foundation to send a terminally ill kid to Disneyland ... which is nice, but, that same amount of money might actually save ten lives if they sent it to Africa where kids are actually dying of things that are easily preventable.  I'm not sure what's up with me, and my fellow humans, sometimes.  But I digress.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, in at least one sense, GiveWell is to donors what sabermetrics is to Joe Morgan.  It does analysis to reach conclusions that some might find uncomfortable.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;However, in another sense, what GiveWell does is *unlike* sabermetrics, in that it usually doesn't try to get down to the third decimal place.  It argues that it can evaluate charities &lt;a href="http://www.givewell.org/international/process/2011#Developingheuristicstoflagpromisingcharities"&gt;heuristically&lt;/a&gt;, that the differences are big enough that they can figure out which charities are the best, using the charities' own reports.  As I interpret what they're saying, GiveWell can very easily tell you whether a charity is a Danny Ainge or an Albert Pujols, and it can even tell you more subtle things, like whether a charity is a Joe Carter or an Albert Pujols.  But it doesn't try to figure out if a charity is a Ryan Braun or an Albert Pujols.  It will just tell you that both are recommended.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That is, GiveWell argues that its goals are better met by the transparency of its recommendations than by any detailed, opaque analyses.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Which is almost exactly what I argued in &lt;a href="http://sabermetricresearch.blogspot.com/2011/12/transparent-studies-are-better-even-if.html"&gt;one of my recent posts&lt;/a&gt; -- that, in research, simplicity and transparency are more important than rigor.  Simple  studies make it much easier to understand the results and catch the inevitable errors.  A gentleman from GiveWell, Elie Hassenfeld, read that post, and pointed me to a particular &lt;a href="http://blog.givewell.org/2011/09/29/errors-in-dcp2-cost-effectiveness-estimate-for-deworming/"&gt;example&lt;/a&gt; of a serious error that his organization uncovered.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;(Disclaimer: I don't really know much about GiveWell.  However, I've been impressed by what I've seen, and at least two of the blogs I read and respect (&lt;a href="http://marginalrevolution.com/?s=givewell"&gt;here's one&lt;/a&gt;) say very good things about them.  So my Bayesian evaluation of them is quite high.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;As I said, GiveWell doesn't believe they need detailed statistical cost/benefit studies to decide which charities to recommend.   However, charities themselves often use such analyses to decide where the money should be spent.  There's a whole bunch of organizations and academics devoted to figuring out how to save the most lives for the fewest dollars.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;With that objective, the Bill and Melinda Gates Foundation donated $3.5 million to fund a study, "Disease Control Priorities in Developing Countries".  They published a report ranking various interventions on cost-effectiveness.  The Gates Foundation didn't do that itself -- it was done jointly by The World Bank, the National Institutes of Health, the World Health Organization, and the Population Reference Bureau.  Those sound like heavyweights in the world health field.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The results found that -- unsurprisingly to me -- hygiene promotion was the cheapest way to reduce death and disease.  The second cheapest, though, was deworming.  Specifically, "soil-transmitted helminth" (STH) deworming treatments.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;After the report was released, the Gates Foundation provided another $4.4 million to promote the findings.  And the findings did indeed attract serious attention.  GiveWell writes,&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-family:verdana;"&gt;The DCP2’s cost-effectiveness estimates for deworming have been cited widely to advocate a greater focus on treating STH infections, including in:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- an article in The Lancet&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- a report by REACH, a consortium of large international NGOs and other organizations working to end child hunger, which labeled deworming one of 11 “promoted interventions”&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- the most-cited paper published in the journal International Health&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- an editorial by Peter Hotez, a co-founder of the Global Network for Neglected Tropical Diseases, which has received more than $40 million in funding from the Gates Foundation&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- work by charity evaluators, such as GiveWell, Giving What We Can, and the University of Pennsylvania’s Center for High Impact Philanthropy.&lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, as GiveWell later discovered, it turns out the STH estimate was wrong.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That doesn't sound too serious, but here's the thing: it's not just that the estimate was wrong.  It was wrong by a factor of almost ONE HUNDRED.  The study said that you could save one "disability-adjusted life year" by spending $3.41 on deworming treatments.  But, after correcting for the (acknowledged) errors in the study, the actual number was $326.43.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;All these well-respected organizations, with serious researchers and serious money, wound up promoting a conclusion that was about as wrong as it could have been.  Until the error was caught, then, effectively, 99% of the money devoted to STH treatment was wasted.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;How did GiveWell catch the error?  Subject matter expertise, mostly.  In reading the report, they noticed that the STH estimate was much, much lower than other estimates they had seen.  Instead of just assuming that this research was somehow better than the previous studies, they investigated.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That seems like just common sense, right?  If you see a study that says an iPod can be bought for $3, when you know it usually costs $300, you should look again, shouldn't you?  But that didn't happen until someone at GiveWell decided to figure out what was going on.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So they wrote to one researcher, who sent them to other researchers, who sent them complicated spreadsheets.  They tried to figure those out, but they couldn't, so they wrote back and forth with questions and explanations.  They were referred to still another researcher, who sent them a copy of yet another study that was the source of some of the data.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Eventually, they figured out where the issues were ... if you want a full explanation, it's in their &lt;a href="http://blog.givewell.org/2011/09/29/errors-in-dcp2-cost-effectiveness-estimate-for-deworming/"&gt;post&lt;/a&gt;.  It was a lot of detailed, technical effort to figure out what went wrong, and which parameters were in error.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;GiveWell's conclusions:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-family:verdana;"&gt;We believe that the errors we’ve found in the estimate would have been caught by a helminth expert independently examining the estimate. Therefore, the presence of these errors implies to us that there has been no such examination. If this is the case, it would argue against the reliability of the DCP2’s estimates in general.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;We’ve previously argued for a limited role for cost-effectiveness estimates; we now think that the appropriate role may be even more limited, at least for opaque estimates (e.g., estimates published without the details necessary for others to independently examine them) like the DCP2’s.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;More generally, we see this case as a general argument for expecting transparency, rather than taking recommendations on trust - no matter how pedigreed the people making the recommendations. Note that the DCP2 was published by the Disease Control Priorities Project, a joint enterprise of The World Bank, the National Institutes of Health, the World Health Organization, and the Population Reference Bureau, which was funded primarily by a $3.5 million grant from the Gates Foundation. The DCP2 chapter on helminth infections, which contains the $3.41/DALY estimate, has 18 authors, including many of the world’s foremost experts on soil-transmitted helminths.&lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Absolutely right.  You can't substitute credentials for subject matter expertise, and you can't substitute complexity for transparency.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;And, one thing I would add: when a study appears to discover that you can get benefits at 99% off the original, well-accepted price ... you have to be &lt;a href="http://sabermetricresearch.blogspot.com/2011/11/research-conclusions-have-to-be.html"&gt;suspicious&lt;/a&gt; about accepting that conclusion, even if you have no other reason to believe there was any mistake.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;P.S.  GiveWell expands on the theme &lt;a href="http://blog.givewell.org/2011/11/04/some-considerations-against-more-investment-in-cost-effectiveness-estimates/"&gt;here&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-8090602528949050136?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/8090602528949050136/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=8090602528949050136' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/8090602528949050136'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/8090602528949050136'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2012/01/givewell-overcomplicating-research.html' title='GiveWell: Overcomplicating research studies can cost lives'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-2212184079722536614</id><published>2012-01-15T00:15:00.006-05:00</published><updated>2012-01-15T11:14:44.564-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='penalties'/><category scheme='http://www.blogger.com/atom/ns#' term='NHL'/><title type='text'>Are more NHL penalties called in back-to-back games?</title><content type='html'>&lt;span style="font-family:verdana;"&gt;In a &lt;a href="http://sabermetricresearch.blogspot.com/2012/01/do-nhl-referees-call-make-up-penalties.html#c7494193056707989128"&gt;comment to one of the posts&lt;/a&gt; on "make-up" penalties, J.-P. Martel wrote,&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-family:verdana;"&gt;"... blow-outs can easily lead to situations that get out of hand, so referees may call penalties on the leading team so that the trailing team still thinks it has a chance to come back, rather than resort to fighting to "prepare" the next game between the two teams.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Actually, you may want to check penalties in the second half of the third period when the teams' next game is (or may be, depending on outcome) against each other (particularly in the playoffs), as opposed to when it's not."&lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So I did.  And, J.-P. is right, it looks like there's something there.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I found all cases from 1967-68 to 1984-85 where teams played back-to-back games (regular season only).  Then, I formed three groups:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- first game of back-to-back games&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- second game of back-to-back games&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- other games that year between those two teams&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;It turns out that, overall, there are more penalties than usual in the first game, and fewer penalties than usual in the second game:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;First game .... 12.36&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Second game ... 10.87&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Other games ... 11.77&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Broken down by periods:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;-------------- Gm 1 --- Gm 2 --- Other&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;--------------------------------------&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Period 1 ..... 4.78 ... 3.98 ... 4.37&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Period 2 ..... 4.25 ... 3.75 ... 4.12&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Period 3 ..... 3.32 ... 3.13 ... 3.26&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;--------------------------------------&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Total ....... 12.36 .. 10.87 .. 11.77&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So: there's 0.6 extra penalties in the first game, and 0.9 fewer penalties in the second game.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I thought the second game would be dirtier because the player are holding recent grudges from the previous game, but the numbers show the opposite.  The players seem to be more aggressive early, rather than late.  In fact, more than half the "first game" effect happens in the first period.  By contrast, a large "second game" effect seems to last two periods rather than one.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Most of the differences are statistically significant, which suggests that they're all real.  For those scoring at home, here are the standard errors:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;-------------- Gm 1 --- Gm 2 --- Other&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;--------------------------------------&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Period 1 ..... 0.16 ... 0.12 ... 0.06&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Period 2 ..... 0.13 ... 0.11 ... 0.06&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Period 3 ..... 0.15 ... 0.12 ... 0.07&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;--------------------------------------&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Total ........ 0.30 ... 0.24 ... 0.13&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Finally, coming back to J.-P.'s hypothesis about the second half of the third period of the first game, here are the numbers:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;First game .... 1.76&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Second game ... 1.63&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Other games ... 1.67&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, yes, there's a small effect where, when the teams are going to meet again next game, the referee calls more penalties than normal in the last ten minutes of the third period.  Whether that's because of the referee, or the players, we can't tell.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Taken alone, these differences aren't statistically significant.  But, considering they match the pattern, and the broader picture is statistically significant, we can be fairly confident that this is a real effect we're seeing.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That's actually why I saved J.-P.'s scenario for last, so I could first show that the effect is probably real and not just random.&lt;br /&gt;&lt;br /&gt;-----&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size:85%;"&gt;UPDATE, 1/15/2012:&lt;br /&gt;&lt;br /&gt;Technical note: the "other games" rows and columns are weighted by games, rather than matchups.  Suppose teams A and B had back-to-back games, and so did C and D.  But A and B met only 2 other times that year, while C and D met 4 other times.  That means that C/D will be overrepresented in the "other games" column.&lt;br /&gt;&lt;br /&gt;If I reweight that column so A/B and C/D get equal weight, the results change just a little bit.  These are the revised "other" columns:&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-family:courier new;font-size:85%;"&gt;Overall ...... 11.48 (was 11.77)&lt;/span&gt;&lt;span style="font-size:85%;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-family:courier new;font-size:85%;"&gt;1st period .... 4.24 (was 4.37)&lt;/span&gt;&lt;span style="font-size:85%;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-family:courier new;font-size:85%;"&gt;2nd period .... 4.07 (was 4.12)&lt;/span&gt;&lt;span style="font-size:85%;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-family:courier new;font-size:85%;"&gt;3rd period .... 3.15 (was 3.26)&lt;/span&gt;&lt;span style="font-size:85%;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-family:courier new;font-size:85%;"&gt;Last 10 min ... 1.59 (was 1.67)&lt;/span&gt;&lt;span style="font-size:85%;"&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-2212184079722536614?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/2212184079722536614/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=2212184079722536614' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/2212184079722536614'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/2212184079722536614'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2012/01/are-more-nhl-penalties-called-in-back.html' title='Are more NHL penalties called in back-to-back games?'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-2644880560627830199</id><published>2012-01-13T10:46:00.005-05:00</published><updated>2012-01-13T11:00:07.312-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='hockey'/><category scheme='http://www.blogger.com/atom/ns#' term='goons'/><category scheme='http://www.blogger.com/atom/ns#' term='fighting'/><category scheme='http://www.blogger.com/atom/ns#' term='NHL'/><title type='text'>Do hockey fights lift a team's performance?  Part II</title><content type='html'>&lt;span style="font-family:verdana;"&gt;The &lt;a href="http://sabermetricresearch.blogspot.com/2012/01/do-hockey-fights-lift-teams-performance.html"&gt;previous post&lt;/a&gt; was a study on NHL fights.  It found that, generally, a fight doesn't help the team that it's sometimes said to help (the team that's behind in the game, for instance), but in one particular case, MAYBE it did.  That was the case where:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;(a) one team was behind in the game&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;(b) that team fought more regularly than the other team, and&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;(c) the player fighting also fought more often than the other team's fighter.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In that situation, that team appeared to benefit by around 0.13 goals, as compared to a similar team that didn't fight.  That was about the same as one extra power play.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;However, the result was not statistically significant, being only 1 SD away from zero.  Still, I left it at least a little bit open whether the effect *might* be real.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Tom Tango is more &lt;a href="http://www.insidethebook.com/ee/index.php/site/article/fighting_leads_to_goals/"&gt;skeptical&lt;/a&gt; than that: &lt;/span&gt;&lt;br /&gt;&lt;blockquote&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;/span&gt;&lt;span style="font-family:verdana;"&gt;It’s not monkeys at a typewriter creating Shakespeare, but it’s close.&lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Well, I have some more evidence that supports that point of view.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I repeated the study 27 times, to get a larger sample of random control games.  (I didn't pick the number 27 beforehand; I just ran the thing over and over until I got sick of it.)  Here's the average of those 27 runs:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Actual teams .... -0.18 goals &lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Control teams ... -0.29 goals &lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;------------------------------&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Difference ...... +0.11 goals&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;To remind you what this means: the fighting team meeting the conditions was outscored by its opponent by 0.18 goals over the rest of the game.  On the other hand, the control teams, which were selected randomly from games which matched as closely as possible (except for the fight), got outscored by 0.29 goals.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, it looks like the team that fought gained 0.11 goals per game.  As I said, that result is not statistically significant.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But now, here's the new thing.  Even though the fighting team gained 0.11 goals, it actually lost more games.  Here are the records, in W-L-T format:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Actual teams .... 52-274-38 &lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Control teams ... 49-267-48 &lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;----------------------------&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Difference ...... -2 wins&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, even though the fighting teams did better on the scoreboard, they did worse in terms of winning games.  Actually, they won three extra games, but they lost seven more and tied 10 fewer.  That adds up to minus four points in the standings, which is why I write "-2 wins".  (I'm ignoring the "pity point" for an overtime loss.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;You wouldn't expect this to happen, that you score more goals but lose more games.  The better your goal differential, the better your outcomes should be.  I think I saw Gabriel Desjardins write, somewhere, that six goals equals one win.  The observed difference of +0.11 goals per game, over 364 games, equals around 40 goals, which is almost seven wins.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But instead of winning seven extra games, the fighting teams *lost* two extra games.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Why did this happen?  I think it's just luck, well within the bounds of random error.  I think the +0.11 goals per game is random chance, I think the -2 wins is random chance, and I think the discrepancy between the two results is also random chance.&lt;/span&gt;  &lt;span style="font-family:verdana;"&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In any case, if you don't like all this talk of significance levels and randomness, you can just summarize like this: overall, the teams that fought wound up very slightly better on the scoreboard, but very slightly worse in the standings.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-2644880560627830199?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/2644880560627830199/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=2644880560627830199' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/2644880560627830199'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/2644880560627830199'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2012/01/do-hockey-fights-lift-teams-performance_13.html' title='Do hockey fights lift a team&apos;s performance?  Part II'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-6694780211645224311</id><published>2012-01-10T11:59:00.011-05:00</published><updated>2012-01-31T18:04:27.047-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='hockey'/><category scheme='http://www.blogger.com/atom/ns#' term='goons'/><category scheme='http://www.blogger.com/atom/ns#' term='fighting'/><category scheme='http://www.blogger.com/atom/ns#' term='NHL'/><title type='text'>Do hockey fights lift a team's performance?</title><content type='html'>&lt;span style="font-family:verdana;"&gt;It's been said that when an NHL team needs a lift, a fight can jolt it out of its complacency and make it better.  And, just a few days ago, the &lt;a href="http://www.thehockeynews.com/articles/43976-Moneyball-look-at-NHL-stats-shows-starting-a-fight-sure-way-to-change-momentum.html"&gt;media cited a study&lt;/a&gt; by researcher Terry Appleby, of &lt;a href="http://powerscouthockey.com/"&gt;powerscouthockey.com&lt;/a&gt;, showing that momentum (in terms of shots on goal) usually increases for at least one team after a fight.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, if *either* team can benefit from a fight, what's the point?  You want to know if *your* team can benefit from a fight, at least more than the other team does.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The problem is: how can you know that?  A fight involves both teams, so if it helps one, it hurts another by the same amount.  If you look at both teams, you'll always find the total effect to be zero.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, the "fighting helps a team" theory has to say *which* team is helped.  The most logical interpretation would be that that the fight helps the team that instigated it.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;If you're going to study that, you need to know which team is the instigating team.  That's tough to figure out from historical data.  But, one shortcut would be to assume that the team that generally gets involved in more fights is the team that's more likely to have instigated.  The 1974-75 Philadelphia Flyers took 76 fighting penalties (actually, 76 offsetting majors, which I used as a proxy for fights).  That same season, the expansion Kansas City Scouts took only 19.  It seems fair to assume that if a fight broke out at a Flyers/Scouts game, it was the Flyers who were likely responsible.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;On that assumption, I decided to check.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Using data from the &lt;a href="http://hsp.flyershistory.com/"&gt;Hockey Summary Project&lt;/a&gt;, I looked at fights between 1967-68 and 1984-85, and checked to see how the more-likely-to-fight team did in the remainder of the game.  Then, I found a control game to match it with.  The result was two large groups of games, which could then be compared.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I'll give you an example of how the controls were found.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;On &lt;a href="http://www.flyershistory.com/cgi-bin/poboxscore.cgi?H19680340"&gt;Feburary 16, 1969&lt;/a&gt;, the Bruins played the Black Hawks at Chicago Stadium.  Just as the first period ended, with the score 2-0 Chicago, the Bruins' Don Awrey got into a fight against Stan Mikita of the Hawks.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I looked for a game to serve as the control for that Boston/Chicago game.  What I wanted was:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;1.  A game in the same season, the season before, or the season after&lt;/span&gt;;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;2.  ... where the home team had the same size lead at that same time of the game&lt;/span&gt;;&lt;span style="font-family:verdana;"&gt;&lt;br /&gt;3.  ... and where the two teams were of roughly similar relative quality.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;#1 and #2 were non-negotiable (except that all differences of 4 or more goals were considered the same).  But, for #3, the quality only had to be close, within two goals (which I'll explain in a minute).&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I started pulling random games until I found one that matched all three requirements.  In this particular case, the control wound up being the Bruins vs. Rangers game of &lt;a href="http://www.flyershistory.com/cgi-bin/poboxscore.cgi?H19680359"&gt;February 23, 1969&lt;/a&gt;.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That game qualifies under the rules because&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;1.  1968-69 is in the same season as the original;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;2.  That game had the home team also leading by two goals at 20:00 of the first period, and&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;3.  The two sets of teams are of similar relative quality.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Now, let me explain #3.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In &lt;a href="http://www.hockey-reference.com/leagues/NHL_1969.html"&gt;1968-69&lt;/a&gt;, the Bruins were +82 in goal differential (303 goals for, 221 against).  The Black Hawks were +34 (280-246).  So, for the original game, the home team was 48 goals worse than the visiting team.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Since the control game was the same year, the Bruins were still +82.  The Rangers were +35 (231-196).  So, in the control game, the home team was 47 goals worse than the visiting team.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Since "47 goals worse" is within two goals of "48 goals worse," that's close enough for the Bruins/Rangers game to serve as a control.  If it hadn't been within two goals -- which is most of the time -- that game wouldn't have qualified under #3, and I would have tried another random game.  (If there were absolutely no games that qualified under #3, I would have taken the one where the team quality was closest in goals.  If none of the random games had qualified under #1 and #2, I would have thrown the original game out of the study -- but that never happened.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;OK, so now we have our real game, and our control game.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Which team in our real game are we going to expect to have gotten the "lift" from the fight?  In 1968-69, the Bruins had 41 fights, but the Black Hawks had only 20.  So, the assumption is that the fight was more the work of the Bruins, and they should be the ones expected to benefit.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;How did the Bruins do in the rest of the game relative to the Black Hawks?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Well, the final score was 5-1 Hawks.  Since it was 2-0 at the time of the fight, that means the Black Hawks outscored the Bruins 3-0 in the remainder of the game.  In other words, a "minus 3" goal differential for the visiting Bruins.  (I excluded any goals in the last three minutes of the third period, to make sure empty-net goals didn't screw things up.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;What about the control game?  That game actually wound up 9-0 Rangers, which means 7-0 Rangers from the fight to the end of the game.  Since the "real" game was relative to the Bruins, the visiting team, we also want to express the control game from the standpoint of the visiting team.  So that's "minus 7". &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, our score so far is:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;"&gt;Actual games: -3.0 goal differential for the fighting team&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;"&gt;Control group: -7.0 goal differential for the control team&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So far, it looks like fighting helps, by four goals a fight!&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Of course, that's only one game.  I repeated this process for every fight from 1967-68 to 1984-85.  Actually, not *every* fight.  First, I included only fights where one team appeared to be significantly more aggressive than the other (specifically, where the two teams were 10 or more fighting penalties apart for the season).  Second, I included only first- or second-period fights, to increase the amount of time for the "lift" effect to make itself felt.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Even with those restrictions, there were 2,834 fights total.  The results: &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;"&gt;Fighting teams ... -0.04 goals&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;"&gt;Control group .... -0.02 goals&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The team with more fights was 0.04 goals worse than the other team over the remainder of the game.  It "should have" been 0.02 goals worse.  (Both numbers are negative probably because the teams that got in more fights were slightly worse teams overall than their opponents.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, there seems to be a small, negative effect: a team loses one additional goal for every 50 fights.  But, that difference isn't even close to statistically significant.  It's less than one SD from zero.  (The two individual SDs are about 0.04 each, so the SD of the difference is around 0.06.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Conclusion: it doesn't appear that fighting helps a team.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Maybe a difference of 10 fights a year isn't enough to separate the two teams?  I redid the study, but required the teams to be 20 fighting penalties apart.  That reduced the sample size to 1,581 each group.  The results were about the same (the +/- in parentheses is the standard error): &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;"&gt;Fighting teams .... 0.00 goals (+/- 0.05)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;"&gt;Control group .... -0.03 goals (+/- 0.06)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Looking at the entire database, I found that the average fight starts with a goal differential of 1.617.  The average goal differential in all other games, weighted by the times of fights, is 1.421 goals.  So, it seems like fights start when the game is a little more lopsided than usual.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, maybe it's the team that's *trailing* that starts the fight, in an effort to wake itself up.  Maybe we should look at trailing teams, not goonier teams.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I tried that. I threw away all situations where the score was tied when the fight happened, and looked at all the rest.  The results:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;"&gt;2,941 datapoints&lt;br /&gt;----------------&lt;br /&gt;Trailing teams ... -0.20 goals (+/- 0.04)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;"&gt;Control group .... -0.19 goals (+/- 0.04)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Again, no real difference.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Trying again, but looking only at fights where one team was trailing by at least three goals:&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;"&gt;&lt;br /&gt;591 datapoints&lt;br /&gt;--------------&lt;br /&gt;Trailing teams ... -0.25 goals (+/- 0.08)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;"&gt;Control group .... -0.29 goals (+/- 0.08)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Nothing there, either.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Is it possible that the benefit accrues only to GOOD teams trailing by three goals?  Those are the teams playing the worst relative to their abilities, so the "wake up" effect should be strongest.  Here are teams trailing by 3 goals that were at least +30 in goal differential for the season:&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;"&gt;&lt;br /&gt;122 datapoints&lt;br /&gt;--------------&lt;br /&gt;Trailing teams ... +0.14 goals (+/- 0.17)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;"&gt;Control group .... +0.17 goals (+/- 0.16)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Nope.  What if we look at good teams trailing by *any* number of goals?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;"&gt;841 datapoints&lt;br /&gt;--------------&lt;br /&gt;Trailing teams ... +0.40 goals (+/- 0.07)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;"&gt;Control group .... +0.26 goals (+/- 0.07)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Aha!  This time, there's a small "lift" effect, at about 1.4 SD.  But, why would there be an effect for teams trailing by 1 goal, but not for teams traling by 3 goals?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I got curious and ran the same study again, and this time the random control group came in at +0.33, bring the difference down to 1.0 SD. (Of course, it's not appropriate to dismiss the first result just because the second one came out less extreme.)&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;At this point, you might reasonably argue that the rules "team with more majors that year" and "team trailing in the game" are not precise enough in selecting teams that started the fight.  So, this time, I assumed the fight was started by the *player* with the most majors that season, rather than the *team* with the most majors that season.  So when the goon of a pacifist team starts a fight with a pacifist of a goon team, you go with the goon player on the pacifist team.  The results:&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;"&gt;&lt;br /&gt;4,185 datapoints&lt;br /&gt;----------------&lt;br /&gt;Goonier Player ... +0.01 goals (+/- 0.03)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;"&gt;Control Group .... -0.02 goals (+/- 0.03)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Again, less than 1 SD difference.  There's not much difference between this "goon player" breakdown and the previous "goon team" breakdown, probably because most of the goonier players also played on goonier teams.  But it was worth a try.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Finally, one last try.  For this run, I combined all three criteria.  To be included in the study:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;(a) one team had to have at least 20 more majors for the season than its opponent;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;(b) that same team's fighter had to have more majors that year than his opponent; and&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;(c) that same team had to be trailing in the game.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;This *has* to work, right?  I mean, that pushes all the right buttons: a truculent team, with a figher selected for that purpose, behind in the game and likely to be needing a lift.  If *those* teams don't benefit from the fight, then who would?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I expected the same non-result, but, this time, we get the biggest effect so far:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;"&gt;364 datapoints&lt;br /&gt;--------------&lt;br /&gt;Teams qualifying ... -0.18 goals (+/- 0.10)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;"&gt;Control group ...... -0.38 goals (+/- 0.11)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;There's a difference of .20 goals -- almost a fifth of a goal per fight!  Taken at face value, that means that when a team like that starts a fight, it benefits by even more than a power play (which has a 15 to 20 percent success rate).&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That difference is still only about 1.4 SD from zero.  Still, I hate to just dismiss it.  I've always thought that if you get a result that's significant in the real-world (hockey) sense, but it's not statistically signficant, that's a problem with your study -- it's just that you haven't used enough data to be able to prove anything.  We should still be open to the possibility that the effect might be real.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I ran it a few more times, to check if maybe the control group was just a random outlier.  The extra results:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Control group: -0.26 goals&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Control group: -0.21 goals&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Control group: -0.27 goals&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Control group: -0.38 goals (again) &lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Control group: -0.38 goals (again)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Control group: -0.29 goals &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, the original run was a little extreme, but not much. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;There are, however, some mitigating factors.  First, the control group numbers aren't all independent, since there's a limited number of control games to choose randomly from.  Second, we obviously can't do extra runs to reduce random chance in the *real* games, but it's still possible those teams scored more goals for random reasons having nothing to do with any lift they got from the fight.  Third, the SDs of both groups are a bit understated: I calculated them based on the assumption that games are independent, but they're not -- a real game appears in the study multiple times, once for each fight, and a control game could get randomly selected more than once, too.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;If you average the seven control groups in the seven repetitions of the study, you get -0.31 goals.  That's 0.13 goals worse than the actual games.  Taking into account the fact that we ran the control group five times, the 0.13 difference is now around 1 SD.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Oh, and this is as good a time as any to emphasize that I could also have screwed up somewhere ... I've already had to rerun everything once when I found a misplaced parenthesis in my code.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, I guess, our overall conclusion from this study isn't completely certain.  We wind up with a summary like:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;1.  The effect doesn't seem to exist for run-of-the-mill fights.&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;2.  When a goon fighter on a goon team fights when his team is down, it seems to benefit that team by 1/8 of a goal, or a bit less than a normal power play.&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;3.  But, that effect isn't statistically significant, so we have some doubts that it's real&lt;/span&gt;.&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;4.  And, with only 364 such datapoints qualifying out of around 5,000, only a small percentage of fights match the criterion for that kind of boost.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;If you had to reduce that to one line, it might be:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;At best, there might be a small effect in certain specific circumstances ... but much, much less than sportscasters make it out to be.&lt;/span&gt;&lt;div&gt;&lt;span style="font-family:verdana;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style="font-family:verdana;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style="font-family:verdana;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span style="font-family:verdana;"&gt;UPDATE: Part 2 is &lt;a href="http://sabermetricresearch.blogspot.com/2012/01/do-hockey-fights-lift-teams-performance_13.html"&gt;here&lt;/a&gt;.&lt;/span&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-6694780211645224311?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/6694780211645224311/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=6694780211645224311' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/6694780211645224311'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/6694780211645224311'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2012/01/do-hockey-fights-lift-teams-performance.html' title='Do hockey fights lift a team&apos;s performance?'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-2128645526290686989</id><published>2012-01-09T17:11:00.004-05:00</published><updated>2012-01-09T17:36:28.939-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='basketball'/><category scheme='http://www.blogger.com/atom/ns#' term='hockey'/><category scheme='http://www.blogger.com/atom/ns#' term='referee bias'/><category scheme='http://www.blogger.com/atom/ns#' term='NBA'/><category scheme='http://www.blogger.com/atom/ns#' term='penalties'/><category scheme='http://www.blogger.com/atom/ns#' term='NHL'/><title type='text'>Do NHL referees call "make up" penalties? Part IV</title><content type='html'>&lt;span style="font-family:verdana;"&gt;A couple of links to other similar studies on penalty-calling:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;1.  &lt;/span&gt;&lt;a style="font-family: verdana;" href="http://www.blogger.com/comment.g?blogID=31545676&amp;amp;postID=1241175255298373551"&gt;Commenter&lt;/a&gt;&lt;span style="font-family:verdana;"&gt; Jack linked to &lt;/span&gt;&lt;a style="font-family: verdana;" href="http://kenpom.com/blog/index.php/weblog/the_untrained_eye_washington_state_vs._utah/"&gt;this article&lt;/a&gt;&lt;span style="font-family:verdana;"&gt; with some basketball foul-calling data.  Turns out the more consecutive fouls against one team, the more likely the next will go to the other team.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;2. Another reader pointed me to a 2009 hockey study (web version &lt;/span&gt;&lt;a style="font-family: verdana;" href="http://www.palgrave-journals.com/ori/journal/v22/n4/full/ori200912a.html"&gt;here&lt;/a&gt;&lt;span style="font-family:verdana;"&gt;, PDF &lt;/span&gt;&lt;a style="font-family: verdana;" href="http://www.palgrave-journals.com/ori/journal/v22/n4/pdf/ori200912a.pdf"&gt;here&lt;/a&gt;&lt;span style="font-family:verdana;"&gt;) by Jack Brimberg and William J. Hurley.  They looked at the first three penalties of every game, and found results similar to what I found.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-2128645526290686989?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/2128645526290686989/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=2128645526290686989' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/2128645526290686989'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/2128645526290686989'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2012/01/do-nhl-referees-call-make-up-penalties_09.html' title='Do NHL referees call &quot;make up&quot; penalties? Part IV'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-711770839139162233</id><published>2012-01-07T12:58:00.007-05:00</published><updated>2012-01-07T13:23:56.496-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='hockey'/><category scheme='http://www.blogger.com/atom/ns#' term='Ken Dryden'/><title type='text'>Ken Dryden tracers from "The Game"</title><content type='html'>&lt;span style="font-family:verdana;"&gt;In my &lt;a href="http://sabermetricresearch.blogspot.com/2011/12/ken-drydens-game.html"&gt;review&lt;/a&gt; of Ken Dryden's book "The Game," I listed seven of the details that I tried to trace.  I now have an eighth one, and then a retrace of the second one.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;These two updates were originally posted to the &lt;a href="http://www.sihrhockey.org/main.cfm"&gt;SIHR&lt;/a&gt; mailing &lt;a href="http://sports.groups.yahoo.com/group/SIHR_eList/"&gt;list&lt;/a&gt;.  The original seven are &lt;/span&gt;&lt;a style="font-family: verdana;" href="http://sabermetricresearch.blogspot.com/2011/12/ken-drydens-game.html#tracers"&gt;here&lt;/a&gt;&lt;span style="font-family:verdana;"&gt;.  Thanks again to the &lt;a href="http://hsp.flyershistory.com/"&gt;Hockey Summary Project&lt;/a&gt; for the data making these tracers possible.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;8.  Here's Dryden, from page 121 of my edition:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-family:verdana;"&gt;"A few months ago, we played the Colorado Rockies at the Forum.  Early in the game, I missed an easy shot from the blueline, and a little unnerved, for the next fifty minutes I juggled long shots, and allowed big rebounds for three additional goals.  After each Rockies goal, the team would put on a brief spurt and score quickly, and so with only minutes remaining, the game was tied.  Then the Rockies scored again, this time a long, sharp-angled shot that squirted through my legs.  The game had seemed finally lost.  But in the last three minutes, Lapointe scored, then Lafleur, and we won 6-5.  Alone in the dressing room later .... I just sat there, unable to understand why I felt the way I did.  Only slowly did it come to me: I had been irrelevant; I couldn't even lose the game."&lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In Dryden's career, I found 12 Canadiens games against Colorado.   Montreal won 11 of them and tied one.  But none of them was by a score of 6-5.   &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Montreal did not have any 6-5 wins at all in 1978-79 (when the book is set), or in the previous two seasons.  In Dryden's entire career, I found only two 6-5 Montreal wins where he was in net.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;One was February 12, 1972, against the Kings.  The narrative doesn't match.  In that game, the Habs led 6-3, and then the Kings scored two late goals.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The other was November 16, 1972.  In that game, Dryden was replaced by Michel Plasse after the first period, so that doesn't match either.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, I extended the search to look for all games where Dryden gave up 5+ goals, but the Canadiens won anyway.  There were five such games:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;February 12, 1972, 6-5 against LA (described above)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;November 22, 1974, 7-6 against Kansas City&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;February 18, 1976, 7-5 against Toronto&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;November 21, 1976, 9-5 against Toronto&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;December 23, 1977, 7-5 against New York Islanders.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;None of the games match exactly, but the Kansas City game is the best candidate:&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-size:85%;" &gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-weight: bold;font-family:verdana;font-size:85%;"  &gt;--  It was against the team that eventually became the Rockies;&lt;/span&gt;&lt;span style="font-weight: bold;font-size:85%;" &gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-weight: bold;font-family:verdana;font-size:85%;"  &gt;--  The opposition scored late (14:49 of the third), and the Habs won it later (17:53);&lt;/span&gt;&lt;span style="font-weight: bold;font-size:85%;" &gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-weight: bold;font-family:verdana;font-size:85%;"  &gt;--  It was a one-goal game;&lt;/span&gt;&lt;span style="font-weight: bold;font-size:85%;" &gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-weight: bold;font-family:verdana;font-size:85%;"  &gt;--  Dryden probably didn't play great (6 goals on 21 shots, seven per period);&lt;/span&gt;&lt;span style="font-weight: bold;font-size:85%;" &gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-weight: bold;font-family:verdana;font-size:85%;"  &gt;--  The last five goals alternated by team.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But other things don't match:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:verdana;font-size:85%;"  &gt;-- The Scouts tied it late, not took the lead late;&lt;/span&gt;&lt;span style="font-weight: bold;font-size:85%;" &gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-weight: bold;font-family:verdana;font-size:85%;"  &gt;-- The Habs scored one goal to win, not two;&lt;/span&gt;&lt;span style="font-weight: bold;font-size:85%;" &gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-weight: bold;font-family:verdana;font-size:85%;"  &gt;-- The goal was scored by Doug Risebrough, not Lapointe or Lafleur -- in fact, neither Lapointe nor Lafleur scored at all that game;&lt;/span&gt;&lt;span style="font-weight: bold;font-size:85%;" &gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-weight: bold;font-family:verdana;font-size:85%;"  &gt;-- The early goals didn't alternate  (The goal sequence was kMMMMkkkMkMkM);&lt;/span&gt;&lt;span style="font-weight: bold;font-size:85%;" &gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-weight: bold;font-family:verdana;font-size:85%;"  &gt;-- The game was in Kansas City, not Montreal;&lt;/span&gt;&lt;span style="font-weight: bold;font-size:85%;" &gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-weight: bold;font-family:verdana;font-size:85%;"  &gt;--  The game happened several years previously, rather than months.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I checked the Globe and Mail recap for that game ... it was just a couple of paragraphs long, and didn't mention Dryden at all, or how the Kansas City goals went in.  I don't have online access to any Montreal newspapers to get a more detailed game story.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;br /&gt;&lt;br /&gt;2.  Number &lt;a href="http://sabermetricresearch.blogspot.com/2011/12/ken-drydens-game.html#tracer2"&gt;2&lt;/a&gt; in my blog post listed a game against Toronto. Dryden writes that the Leafs tie the game early, get confident that they can keep up with the Canadiens, and begin to take over the play. But Mark Napier and Pat Hughes score two quick goals for the Habs. The Canadiens score two more, and then the Leafs get two late. The next day, the players wonder why coach Scotty Bowman didn't give them hell for allowing those two late goals.&lt;br /&gt;&lt;br /&gt;It all adds up to 6-4. There was no 6-4 win in Toronto in 1978-79.&lt;br /&gt;&lt;br /&gt;So, I looked for other games that might match.&lt;br /&gt;&lt;br /&gt;There were only two games during Dryden's career where Napier and Hughes both scored.&lt;br /&gt;&lt;br /&gt;One was November 15, 1978, a 6-1 win over Colorado. That doesn't match.&lt;br /&gt;&lt;br /&gt;But the game of January, 17, 1979 is probably it. It matches, though not exactly:&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size:85%;"&gt;&lt;span style="font-weight: bold;"&gt;-- It's in 1978-79, the season Dryden was writing about.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;-- It was against Los Angeles, not Toronto, and home, not road.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;-- Although the Kings scored to make it 2-1 at 8:43 of the second period, they never tied it up.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;-- After the Kings' goal to make it 2-1, Napier and Hughes scored to make it 4-1.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;-- After that, the Habs scored three more goals (not two): Houle, then Napier and Hughes again.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;-- After that, the Kings got their two late goals.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;-- The final was 7-3, not 6-4.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;But, I think, pretty close anyway.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-711770839139162233?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/711770839139162233/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=711770839139162233' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/711770839139162233'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/711770839139162233'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2012/01/ken-dryden-tracers-from-game.html' title='Ken Dryden tracers from &quot;The Game&quot;'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-1241175255298373551</id><published>2012-01-06T12:38:00.004-05:00</published><updated>2012-01-06T13:01:26.265-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='hockey'/><category scheme='http://www.blogger.com/atom/ns#' term='referee bias'/><category scheme='http://www.blogger.com/atom/ns#' term='penalties'/><category scheme='http://www.blogger.com/atom/ns#' term='NHL'/><title type='text'>Do NHL referees call "make up" penalties? Part III</title><content type='html'>&lt;span style="font-family:verdana;"&gt;The last &lt;a href="http://sabermetricresearch.blogspot.com/2011/12/do-nhl-referees-call-make-up-penalties.html"&gt;two&lt;/a&gt; &lt;a href="http://sabermetricresearch.blogspot.com/2012/01/do-nhl-referees-call-make-up-penalties.html"&gt;posts&lt;/a&gt; talked about how NHL referees are more likely to "even-up" their calls, issuing the next penalty to the opposing team 60% of the time.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;This post, I'll show a regression I ran to quantify the effect a bit better.  If you're not interested, just skip the technical parts (smaller font).  If you're *really* not interested, you can probably just skip this entire post, since the results are pretty much the same as shown in the previous posts.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;---&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size:85%;"&gt;&lt;span style="font-family:verdana;"&gt;Technical notes 1:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Even though we're interested in whether the referee called the next penalty to the "other" team, I set up the regression to predict whether the referee called the next penalty to the *home* team.  That just makes everything easier to interpret, but, as I'll describe, it still lets us estimate the "even-up" effect.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In the study, I ignored all misconduct penalties, all first penalties of the game, and all penalties where the other team had a player called at the same time.  (I treated those penalties as if they didn't exist, so they didn't interrupt "consecutiveness" of the two surrounding penalties.)&lt;br /&gt;&lt;br /&gt;Non-dummy variables I used: Time gone in game.  Time since last penalty.&lt;br /&gt;&lt;br /&gt;Dummy variables I used: PP goal on last penalty.  SH goal on last penalty.  Home team lead, from -3 to +3, except 0 (that is, six dummy variables), where anything more or less than 3 goals was coded as 3.  All eight of the previous variables interacted with "whether the last penalty was to the home team."  And, of course, the dummy for "whether the last penalty was to the home team" itself.&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-family:verdana;"&gt;The regression shows the home team percentage diminishes during the game, by about 1 percentage point per period.  In all the numbers in this post, I just used the beginning of the game.  If you want the middle of the game, subtract about 1.5 percent from each "home team" percentage (or add 1.5 percent to each "visiting team" percentage) if you want to adjust to the middle of the second period.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Also, the regression says you have to subtract about 1 percentage point for every 20 minutes since the last penalty. I didn't bother for this post.  If you assume penalties are usually around 5 minutes apart, feel free to subtract 0.25 percentage points from each of the "home team" percentages.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Those two time adjustments won't affect the "even-up" numbers, just the raw percentages of home team penalties.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;OK, first, let me show you the percentage of penalties taken by the home team, by game score.  Clearly, teams are more likely to take penalties when they're ahead in the game.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;After the visiting team took the last penalty, the home team took:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;46.3% when down by 3+&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;49.2% when down by 2&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;53.1% when down by 1&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;61.2% when tied&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;64.3% when up by 1&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;67.9% when up by 2&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;66.8% when up by 3+&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;And after the home team took the last penalty, the home team took:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;34.5% when down by 3+&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;32.8% when down by 2&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;32.3% when down by 1&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;35.0% when tied&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;42.6% when up by 1&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;47.6% when up by 2&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;52.9% when up by 3+&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Obviously, you can do this for visiting teams just by subtracting all the percentages from 100.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Now, we can calculate the "even-up" effect as the difference between the lines of the two tables.  When the score was tied, the home team took:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;61.2 percent after visiting team penalty&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;35.0 percent after home team penalty&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;-------------------------------------------------&lt;br /&gt;26.2 percent difference&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;You can convert to visiting team just by subtracting the first two numbers from 100%.  The difference has to come out the same.  I'll do that anyway.  When the score was tied, the visiting team took:&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;38.8 percent after visiting team penalty&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;65.0 percent after home team penalty&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;-----------------------------------------------------&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;26.2 percent difference&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;It turns out the "even-up" difference is highest for tie games.  Here's the full breakdown: &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;11.8 percent difference down by 3+&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;16.4 percent difference down by 2&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;20.7 percent difference down by 1&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;26.2 percent difference tied&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;21.7 percent difference up by 1&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;20.3 percent difference up by 2&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;13.9 percent difference up by 3+&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Tango suggested there might be an extra "compassion" effect when the team scores a PP or SH goal.  He seems to have been right.  The effect is small, relative to the overall effect, but still enough to affect the games:&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-family:courier new;"&gt;-- If the home team scored a PPG on the previous penalty, &lt;span style="font-weight: bold;"&gt;add 1.2&lt;/span&gt; percentage points to the above differences.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;-- if the visiting team scored a PPG on the previous penalty, &lt;span style="font-weight: bold;"&gt;subtract 3.2&lt;/span&gt; percentage points to the above differences.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;-- if the home team scored a SHG on the previous penalty, &lt;span style="font-weight: bold;"&gt;add 2.8 &lt;/span&gt;percentage points from the above differences.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;span style="font-family:courier new;"&gt;-- if the visiting team scored a SHG on the previous penalty, &lt;/span&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;subtract 0.7&lt;/span&gt; percentage points from the above differences.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The PPG numbers are statistically significant.  The SHG numbers aren't, but they go in the right direction and are about the right magnitude, so I think it's reasonable to consider them as decent estimates. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, as I promised in the second paragraph: the results of the regression seem to match what we found in the previous posts.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size:85%;"&gt;&lt;span style="font-family:verdana;"&gt;Technical notes 2:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;For full disclosure, here are the coefficients for all the variables in the regression.  I'll present them in "here's how to calculate the percentage of home-team penalties" format.  (If you prefer a table, the full computer output is &lt;a href="http://philbirnbaum.com/penreg.pdf"&gt;here&lt;/a&gt; (pdf).  You'll be able to tell what the variables represent by matching the coefficients to what's below.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Start with 0.6119 (constant).&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Add -0.2619 if the home team took the last penalty.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Add -8.15E-06 for each second that's passed in the game.  (About -.01 per period.)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Add -7.39E-06 for each second that's passed since the last penalty (not significant, p=.104, but magnitude is reasonable and has the right sign).&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Add -0.1483 if the home team is down by 3 or more goals.&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Add +0.1436 if the home team is down by 3+ and also took the last penalty.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Add -0.1196 if the home team is down by exactly 2 goals.&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Add +0.0983 if the home team is down 2 and also took the last penalty.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Add -0.0807 if the home team is down by 1 goal.&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Add +0.0545 if the home team is down 1 and also took the last penalty.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Add +0.0309 if the home team is up by 1 goal.&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Add +0.0450 if the home team is up 1 and also took the last penalty.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Add +0.0668 if the home team is up by 2 goals.&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Add +0.0591 if the home team is up 2 and also took the last penalty.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Add +0.0559 if the home team is up by 3 or more goals.&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Add +0.1228 if the home team is up 3+ and also took the last penalty.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Add +0.0115 if a PP goal was scored on the last penalty&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Add -0.0438 if a PP goal was scored on the last penalty and that penalty was to the home team.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Add -0.0072 if a SH goal was scored on the last penalty&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Add +0.0350 if a SH goal was scored on the last penalty and that penalty was to the home team.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;-----&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-1241175255298373551?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/1241175255298373551/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=1241175255298373551' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/1241175255298373551'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/1241175255298373551'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2012/01/do-nhl-referees-call-make-up-penalties_06.html' title='Do NHL referees call &quot;make up&quot; penalties? Part III'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-6896387712716993693</id><published>2012-01-03T10:15:00.010-05:00</published><updated>2012-01-06T13:02:27.802-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='hockey'/><category scheme='http://www.blogger.com/atom/ns#' term='referee bias'/><category scheme='http://www.blogger.com/atom/ns#' term='penalties'/><category scheme='http://www.blogger.com/atom/ns#' term='NHL'/><title type='text'>Do NHL referees call "make up" penalties?  Part II</title><content type='html'>&lt;span style="font-family:verdana;"&gt;Last post, I found that &lt;a href="http://sabermetricresearch.blogspot.com/2011/12/do-nhl-referees-call-make-up-penalties.html"&gt;referees are likely to "even up" their penalty calls&lt;/a&gt;: they're around 50% more likely to give the other team the next power play than to give the same team two power plays in a row.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I wasn't not convinced this is because of referee bias, or what Tango calls the "compassionate referee."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;a href="http://www.insidethebook.com/ee/index.php/site/comments/compassionate_referee/#comments"&gt;Tango suggested&lt;/a&gt; this experiment: check to see if a power play goal was scored on the first penalty.  If the referee is indeed "compassionate" towards the other team, he should be more compassionate if the penalty actually cost them a goal, less so if there was no goal, and even less so if the penalized team *benefited* from the penalty by scoring shorthanded.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So I checked.  I looked at all cases where there was a power play goal (PPG) on a first penalty, and then no more scoring until the next penalty was called.  Indeed, that does appear to make the ref more compassionate.&lt;br /&gt;&lt;br /&gt;After a penalty resulting a PPG, the next penalty was of the "even-up" variety 65.9% of the time.  That's higher than the overall rate of 59.7%.  Repeating that in a better font:&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;65.9% after a PPG&lt;br /&gt;&lt;/span&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;59.7% overall rate&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;And, the same effect appears for shorthanded goals (SHG):&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;52.5% after an SHG&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;59.7% overall rate&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;It's a large effect, and exactly in the direction Tango predicted.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But wait!  It might not be referee bias at all.  Because, it turns out that teams with a lead take significantly more penalties than teams who are behind.  For instance, when a penalty is called while you have a two goal lead, there's a 55.2% chance the penalty goes against you (and so a 44.8% chance the penalty goes against the other team).  Full chart:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;55.2% of penalties to team leading by 1 &lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;58.2% of penalties to team leading by 2&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;59.0% of penalties to team leading by 3&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;59.4% of penalties to team leading by 4&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;59.7% of penalties to team leading by 5&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, the score effect could explain what we're seeing.  After a power play goal, the team has a bigger lead (or smaller deficit) than before.  That would make it likely to take more penalties in future, even if the referee wasn't compassionate at all.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;(Of course, the score effect might itself be due to referee "compassion," but that's a whole other argument.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Specifically: a power play goal makes the team 6 percentage points more likely to take the next penalty.  But scoring ANY tiebreaking goal in the first period makes a team 5 percentage points likely to take the next penalty.  So how can we be sure there's a separate power-play effect, or how big it is?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;What might also complicate things is there's a "time of game" effect:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;42,721 PPs came in the first period.&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;38.060 PPs in the second period.&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;26,705 PPs came in the third period.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;There are fewer penalties in the third period than in the first.  Is that a separate period effect?  It might be.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Here's the score effect chart, again, but this time only for first-period penalties.  The effect is more extreme than for the entire game:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;55.6% of penalties to team leading by 1 &lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;60.1% of penalties to team leading by 2&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;61.0% of penalties to team leading by 3&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;66.5% of penalties to team leading by 4&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;58.7% of penalties to team leading by 5 (only 46 datapoints)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;It almost looks like we need a regression to sort all this out.  But, wait!  One more try before we turn to the dark side.  Let's engineer a comparison where score and period won't screw things up.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I took every situation where:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;1.  It was the first period.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;2.  The game was tied at the time of the first penalty, and exactly one additional goal was scored before the second penalty.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;3.  The one extra goal was scored by the team that had the power play on the first penalty.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Then, I divided those situations into two groups. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The "Highest compassion" group is where the team scored the goal *on the power play*, presumably making the referee feel extra bad that he caused the goal.  The "Typical compassion," is where the team scored the goal *after* the power play, and the referee's call wasn't the cause. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;What percentage of the second penalties went to the other team?  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;Highest compassion: 71.6% (2163 datapoints)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;Average compassion: 69.7% (1051 datapoints).&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;There's a small effect there, in the expected direction, of 1.9 percentage points.  (That's less than 1 SD, so not statistically signficant.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Here's the same result, but the other way, where it's the originally-penalized team that scored before the next penalty.  When that goal was scored shorthanded, we can call that "Lowest compassion".  When it wasn't, it's again "Average compassion."  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Again, what percentage of the time did the second penalty even things out?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;Lowest compassion: 62.5% (253 datapoints)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;Average compassion: 58.2% (1006 datapoints).&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;This time the effect goes the "wrong" way, but there's too little data to draw any conclusions.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Doing the same thing for the second period instead of the first, we find a larger difference, but still not statistically significant (1.4 SD):&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;Highest compassion: 70.4% (568 datapoints)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;Average compassion: 65.7% (271 datapoints).&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;And the shorthanded case, which really has too small a sample to take seriously:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;Lowest compassion: 51.8 (83 datapoints)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:courier new;" &gt;Average compassion: 48.9% (268 datapoints).&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, in summary: yes, there appears to be weak evidence for a small "compassion effect."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In the previous post, I considered three hypotheses:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;1.  Referee bias&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;2.  Penalized teams play more carefully after the penalty&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;3.  Power play teams play more aggressively after the penalty&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Here's a fourth one, a variation of one &lt;a href="http://sabermetricresearch.blogspot.com/2011/12/do-nhl-referees-call-make-up-penalties.html#c5026720384234199267"&gt;suggested&lt;/a&gt; by commenter Wexler in the previous post:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-family:verdana;"&gt;4.  Referees like to let the players play, and dislike calling penalties.  But, sometimes they have to assert themselves to make sure the game doesn't get out of hand.  Sometimes they're a bit too late, and they have to call a penalty on something that wasn't a penalty two minutes ago.  This sends a message to the players, "OK, enough."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That might be necessary, but is obviously unfair to the penalized team.  And, so, the referees know they have to call a "make up" penalty on those particular calls.  Both teams understand what's happening, and won't object to either that call or the subsequent call.&lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I don't know if #4 is plausible or not ... but one of my co-workers is a soccer referee, and it's consistent with what he says about having to keep the game under control before it's too late.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;As usual, I await comments from readers who know more about this stuff than I do.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;------&lt;br /&gt;&lt;br /&gt;UPDATE: Part 3 is &lt;a href="http://sabermetricresearch.blogspot.com/2012/01/do-nhl-referees-call-make-up-penalties_06.html"&gt;here&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-6896387712716993693?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/6896387712716993693/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=6896387712716993693' title='8 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/6896387712716993693'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/6896387712716993693'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2012/01/do-nhl-referees-call-make-up-penalties.html' title='Do NHL referees call &quot;make up&quot; penalties?  Part II'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>8</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-8006033108875115266</id><published>2011-12-31T13:06:00.009-05:00</published><updated>2012-01-06T13:02:03.519-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='hockey'/><category scheme='http://www.blogger.com/atom/ns#' term='referee bias'/><category scheme='http://www.blogger.com/atom/ns#' term='penalties'/><category scheme='http://www.blogger.com/atom/ns#' term='NHL'/><title type='text'>Do NHL referees call "make up" penalties?</title><content type='html'>&lt;span style="font-family:verdana;"&gt;Among NHL fans, there's a perception that referees like to call "make up" penalties.  If a ref has just called a minor penalty on one team, it's very likely that the next penalty will go to the other team.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I was skeptical, until I downloaded a bunch of data from The Hockey Summary Project ... they're like Retrosheet for hockey.  (Their website is &lt;a href="http://hsp.flyershistory.com/"&gt;here&lt;/a&gt;, and if you want data downloads, you can join their group by going &lt;a href="http://sports.groups.yahoo.com/group/hockey_summary_project/"&gt;here&lt;/a&gt;.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I looked at all penalties from 1953-54 to 1984-85 (for which the HSP data is almost complete).  I eliminated all cases where there both teams got penalties at the same time.  Then, I checked what was left, to see if the team that got the current penalty was less likely to get the next one.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Absolutely, very much so.  There's a 60% chance the next penalty will go to the other team -- 59.7%, to be more exact. (But, since I'm not sure that database is complete, and I forgot to remove misconducts, and I didn't consider situations where both teams got a penalty but one team got an extra one, I'm happier to drop the decimal and just go with 60%.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The effect is reasonably consistent over time, although it was a little stronger back in the six-team era.  Here's a too-long chart.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1953-54: 62.7  888/1416&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1954-55: 61.3  857/1397&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1955-56: 60.6  912/1506&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1956-57: 61.2  833/1360&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1957-58: 58.8  793/1348&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1958-59: 61.4  801/1305&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1959-60: 62.3  723/1160&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1960-61: 61.7  740/1199&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1961-62: 62.0  821/1324&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1962-63: 62.0  797/1286&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1963-64: 61.8  826/1337&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1964-65: 61.7  841/1362&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1965-66: 58.6  820/1399&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1966-67: 58.6  710/1212&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1967-68: 60.0  1515/2527&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1968-69: 59.5  1666/2800&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1969-70: 60.5  1793/2962&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1970-71: 60.4  1944/3220&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1971-72: 57.8  1918/3317&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1972-73: 60.6  2200/3633&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1973-74: 58.8  2135/3628&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1974-75: 56.7  2873/5069&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1975-76: 56.3  2890/5130&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1976-77: 57.2  2371/4144&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1977-78: 59.1  2316/3916&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1978-79: 59.5  2337/3925&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1979-80: 59.3  3021/5091&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1980-81: 60.1  3780/6293&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1981-82: 60.7  3613/5957&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1982-83: 60.3  3479/5774&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1983-84: 60.2  3788/6296&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1984-85: 60.6  3542/5847&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;-------------------------&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;   Overall: 59.7 39597/58543&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Even though the effect is real, we can't say for sure that it's referee bias.  It could just be that, after a penalty, the penalized team plays more cautiously, trying to avoid a second penalty.  Or, it could be that the just had the power play decides to play more aggressively.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size:85%;"&gt;&lt;span style="font-family:verdana;"&gt;(As an aside: why did penalties drop so much between 1975-76 and 1976-77?  At first I thought it might be bad data, but then I checked power-play opportunities on &lt;a href="http://www.hockey-reference.com/leagues/NHL_1976.html"&gt;Hockey&lt;/a&gt; &lt;a href="http://www.hockey-reference.com/leagues/NHL_1977.html"&gt;Reference&lt;/a&gt;, and it checked out.)&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Here's what I think is some relevant evidence.  I broke down the stats by referee (minimum 300 datapoints).  The database only has the referee named for about a quarter of the total games (mostly older ones), but I figure it's probably good enough to at least look at.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The first column is the main number, the percentage of penalties called against the team who drew the last one.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;Pctg  Z-sc Size Ref  &lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;---- ---- ---- ----------------------------&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 59.3 00.0             0509          Andy Van Hellemond&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 60.4          +0.4           0846          Art Skov&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 59.3         -1.2           1128         Ashley&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 60.0 00.0             0460          Bill Friday&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 57.0         -1.1           0537          Bob Myers&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 58.9         -0.3           0878          Bruce Hood&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 57.1         -0.9           0580          Bryan Lewis&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 59.3         -1.6           1453         Buffey&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 64.6          +1.6           0933          Chadwick&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 59.4         -0.1           0567          Dave Newell&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 60.4         -0.3           0356          Farelli&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 60.1         -0.3           0511          Friday&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 59.9         -0.1           0696          John Ashley&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 61.3          +0.8           0359          Lloyd Gilmour&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 61.3         -0.2           0789          Macarthur&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 56.7         -1.4           0319          Mehlenbecher&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 63.0          +0.5           0327          Olinski&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 60.4         -0.8           1906         Powers&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 55.2         -2.2           0698          Ron Wicks&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 63.7          +1.7           1095         Skov&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 57.7         -3.1           2197         Storey&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 63.1          +2.7           4717         Udvari&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 59.9          +0.4           0709          Wally Harris&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The least "biased" referee is 55%, and the most "biased" is 64%.  If you think it's only referee bias that keeps the numbers from being 50%, you'd have to think that EVERY referee is biased almost exactly the same way. It's hard for me to accept that none of the referees noticed the bias and saw fit to try to eliminate it.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The second column of the table is the Z-score, the number of standard deviations the referee is from expected (which is normalized to the seasons he officiated).  Normally, you concentrate on those with at least plus or minus 2 SD.  That gives you Red Storey and Ron Wicks (less biased than most) and Frank Udvari (more biased than most).&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The standard deviation of the Z-scores was 1.29.  If every referee were the same, and differences were only random, it would be 1.00.  This suggests that there are real differences between referees.  Specifically, the SD of referee tendencies (or "talent", you might say) is 0.8 (since 1 squared plus 0.8 squared equals 1.29 squared).  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In English, you can perhaps interpret that as saying that the differences in the table are about half real and half random, with a little more random than real (since 1.00 is a little higher than 0.8).  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The observed range is 55 to 64.  Regressing to the mean, the actual range of referee tendencies is probably 57 to 62, or something like that.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So if you think it's referee bias, you have to explain why all the referees seem to be biased within such a tight range, especially, when, presumably, they are all working hard to be as unbiased as possible.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Here's another interesting breakdown, by time since the previous penalty:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;0:01 to 1:00:  69.1 (7000)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 1:01 to 2:00:  64.7 (9444)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 2:01 to 3:00:  68.5 (11778)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 3:01 to 4:00:  64.2 (10574)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 4:01 to 5:00:  61.2 (8831)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 5:01 to 6:00:  59.7 (7470)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 6:01 to 7:00:  58.9 (6328)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 7:01 to 8:00:  58.3 (5333)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 8:01 to 9:00:  56.6 (4399)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 9:01 to 10:00:  55.5 (3719)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 10:01 to 11:00:  56.7 (3000)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 11:01 to 12:00:  55.5 (2591)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 12:01 to 13:00:  53.8 (2193)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 13:01 to 14:00:  55.2 (1837)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 14:01 to 15:00:  53.3 (1565)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 15:01 to 16:00:  53.1 (1376)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 16:01 to 17:00:  53.1 (1135)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 17:01 to 18:00:  52.4 (1019)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 18:01 to 19:00:  51.9 (807)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 19:01 to 20:00:  53.6 (757)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt; 20:01 to 99:99:  51.8 (3883)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The longer the interval since the previous penalty, the less likely the next penalty will go to the other team.  That's consistent with many theories.  The "referees are biased" theory would say that referees "forget" to even things up as the game goes on.  The "other team wants revenge and plays aggressively" theory would say that if they don't get revenge early, they don't need it as much later.  And the "penalized team takes fewer chances" theory would say that as time goes on, the players "forget" that they have to be more careful.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, the data doesn't help us choose, but it's interesting nonetheless.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;By the way, the 1:01 to 2:00 group is an exception to the pattern, but that's probably due to power plays, since the first penalty is probably still in effect.  Actually, I'd have expected that part to go the other way, with the first two minutes being *more* than 50 percent, on the logic that the shorthanded team playing in the defensive zone is more likely to be forced to take a penalty.  But, that doesn't happen.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;And here's an interesting breakdown of the first half of the first group:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;81.9% within 5 seconds &lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;78.1% between 6 and 10 seconds&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;76.0% between 11 and 15 seconds&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;73.8% between 16 and 20 seconds&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;69.3% between 21 and 25 seconds&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;67.0% between 26 and 30 seconds.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;/span&gt;-----&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Finally, one more question: after one team gets, say, four straight penalties, what happens then?  Is there an even stronger bias for the other team to take the next penalty?  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Yup.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;57.1 after exactly 1 in a row (64858 datapoints)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;64.0 after exactly 2 in a row (23850)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;66.6 after exactly 3 in a row (7042)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;66.0 after exactly 4 in a row (1781)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;63.8 after exactly 5 in a row (442)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;60.6 after exactly 6 in a row (127)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;67.5 after 7 or more in a row (40)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So: what's going on?  Any ideas?&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;UPDATE: Part 2 is &lt;a href="http://sabermetricresearch.blogspot.com/2012/01/do-nhl-referees-call-make-up-penalties.html"&gt;here&lt;/a&gt;.  Part 3 is &lt;a href="http://sabermetricresearch.blogspot.com/2012/01/do-nhl-referees-call-make-up-penalties_06.html"&gt;here&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-8006033108875115266?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/8006033108875115266/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=8006033108875115266' title='8 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/8006033108875115266'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/8006033108875115266'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/12/do-nhl-referees-call-make-up-penalties.html' title='Do NHL referees call &quot;make up&quot; penalties?'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>8</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-3070854957026554224</id><published>2011-12-28T16:12:00.003-05:00</published><updated>2011-12-28T16:18:04.169-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='goalies'/><title type='text'>The best goalies should play for the worst teams</title><content type='html'>&lt;span style="font-family:verdana;"&gt;Last week, I &lt;a href="http://sabermetricresearch.blogspot.com/2011/12/are-there-good-team-goalies-and-bad.html"&gt;described&lt;/a&gt; a way to look for "bad team" goalies as described by Ken Dryden.  I don't think that method is going to work ... the sample sizes are too small.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, while doing the math (which I'll spare you), it occurred to me that there IS a class of goalies that could be considered "bad team" goalies, in the sense that they're more valuable to a good team than a bad team.  That class of goalies is simply ... the best goalies.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The worse the team defense, the more shots the other team gets.  So the great goalie will wind up saving a lot more goals for a bad defense than a good defense.  It's like how a policeman is more productive in a bad neighborhood.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I guess this isn't a new realization ... people have said that Ken Dryden was wasted, a bit, in the Montreal goal for so many years ... there was very little for him to do.  He would have had more value to a team with a worse defense -- at least in terms of goals saved.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;/span&gt;&lt;span style="font-family:verdana;"&gt;So, if teams are rational, you should see the best goalies playing behind the worst defenses. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Or maybe not.  I'm surprised at how tiny the effect is.  Eyeballing last year's NHL stats, it looks like bad teams gave up maybe 125 more shots than average.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The best goalie in the league might be 2 &lt;a href="http://www.insidethebook.com/ee/index.php/site/article/how_much_talent_is_there_with_nhl_goalies/"&gt;SD&lt;/a&gt; above average, or .008.  Multiply 125 by .008, and you get ... exactly one goal.  So, even a great goalie is worth only one more goal to a bad team than to an average team.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That assumes that all shots are equal.  Suppose bad teams give up harder shots, and good teams give up easier shots.  Maybe that doubles the effect.  In that case, the advantage becomes two goals.  So moving from the best team to the worst is worth four goals -- from two goals worse than normal, to two goals better than normal.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Hmmm ... maybe not as tiny as I thought.  To get four goals of goalie improvement is the equivalent of 3/4 of a standard deviation in goalie talent.  For a team, saving four goals should get you, what, maybe a couple of points in the standings?  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I'm sure this is old hat to hockey sabermetricians, but this is the first time it seriously occurred to me that the same player can be more valuable, in terms of influencing the score, with a bad team than a good team.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;You've also got the punter in football ... the worse the offense, the more fourth downs, so the more important punting is overall.  And, maybe, the safety: he's the last line of defense, so he gets more chances when his teammates fail to make the tackle before him.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In baseball, good fielders are more valuable on bad teams, since bad pitchers allow more balls in play.  Also, a bad team will have a lot more men on base than normal, which means more double-play opportunities.  Also, a strikeout pitcher is more valuable on a team that doesn't field well.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;You might also argue for the NHL enforcer, if his job is to start fights when his team is behind, and you also accept the premise that the goonery actually helps the team come back.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Which is the strongest example, the one where the player adds the most value moving to the bad team?  I'd guess the NHL goalie, but, really, I have no idea.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-3070854957026554224?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/3070854957026554224/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=3070854957026554224' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/3070854957026554224'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/3070854957026554224'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/12/best-goalies-should-play-for-worst.html' title='The best goalies should play for the worst teams'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-2240591562829506950</id><published>2011-12-27T11:09:00.013-05:00</published><updated>2011-12-27T11:44:14.499-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='new york times'/><category scheme='http://www.blogger.com/atom/ns#' term='income inequality'/><title type='text'>Will taxing the rich improve democracy?</title><content type='html'>&lt;span style="font-family:verdana;"&gt;Many people believe that income inequality in our society is too high.  I generally don't agree (some of my reasons are &lt;a href="http://sabermetricresearch.blogspot.com/2009/03/gini-coefficient.html"&gt;here&lt;/a&gt;), but I'm open to arguments.  However ... not the particular argument that Ian Ayres and Aaron Edlin made &lt;a href="http://www.nytimes.com/2011/12/19/opinion/dont-tax-the-rich-tax-inequality-itself.html?_r=1&amp;amp;ref=opinion"&gt;in the New York Times&lt;/a&gt; last week (and &lt;a href="http://www.freakonomics.com/2011/12/20/there-will-be-rich-always/"&gt;later&lt;/a&gt; &lt;a href="http://www.freakonomics.com/2011/12/21/an-inequality-tax-trigger-the-brandeis-ratio-explained/"&gt;added&lt;/a&gt; &lt;a href="http://www.freakonomics.com/2011/12/26/of-lags-and-caps-possible-implementations-of-a-brandeis-tax/"&gt;to&lt;/a&gt; at the Freakonomics site), that income inequality undermines democracy:&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-family:verdana;"&gt;The progressive reformer and eminent jurist Louis D. Brandeis once said, “We may have democracy, or we may have wealth concentrated in the hands of a few, but we cannot have both."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Brandeis understood that at some point the concentration of economic power could undermine the democratic requisite of dispersed political power. This concern looms large in today’s America, where billionaires are allowed to spend unlimited amounts of money on their own campaigns or expressly advocating the election of others.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;What we call the Brandeis Ratio — the ratio of the average income of the nation’s richest 1 percent to the median household income — [now stands at 36].  We believe that we have reached the Brandeis tipping point. It would be bad for our democracy if 1-percenters started making 40 or 50 times as much as the median American.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Enough is enough.  Congress should reform our tax law to put the brakes on further inequality.  Specifically, we propose an automatic extra tax ["Brandeis Tax"] on the income of the top 1 percent of earners -- a tax that would limit the after-tax incomes of the club to 36 times the median household income.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;/blockquote&gt;&lt;span style="font-family:verdana;"&gt;So, their idea is: some people have so much money more than the rest of us that they can disproportionately affect the outcome of elections.  Therefore, we should tax them to make the income distribution more equal.  That way, the rich will have less money, which means they'll spend less influencing politicians, and democracy will be stronger.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;There are so many reasons to disagree with this that I won't list them all.  But, the most obvious one: are the authors really saying, with a straight face, that a tax on the rich will make them less politically active?  Would that work with any other group?  "Hey, black people are starting to march on Washington.  Let's put a special surtax on black people.  That'll quiet them down!"  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;And that's not even the biggest problem.  The biggest problem is that Ayres and Edlin don't own a mirror.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Think of the most important ways that the USA and Canada are different, in terms of government policy, than they were a couple of generations ago.  If you were to ask me, the biggest changes are things like: The elimination of racial segregation.  Women's Rights.  Gay Rights.  The Canadian Constitution and Charter of Rights.  A more peaceful world.  A better welfare system.  Somewhat lower crime.  Socialized medicine (in Canada).  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Add your own.  Then, ask yourself, how many of them have much to do with rich people giving money to politicians?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Take, for instance, racial segregation.  How did that change?  Did some rich black guy come along, slip a politician a couple of million dollars, and then, wham, suddenly everyone gets to eat at the same lunch counter?  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Of course not.  Racial progress happened by a change in the attitudes of Americans, not by the actions of politicians (who acted late, only in response to public demand.) What happened is that Americans fueled the process.  They read the newspaper, and debated, and protested, and wondered, and marched, and argued, and pondered, and chatted at the water cooler, and gave speeches, and rode buses, and watched TV.&lt;br /&gt;&lt;br /&gt;And, slowly, day by day, week by week, people's views changed.&lt;br /&gt;&lt;br /&gt;It had nothing to do with where wealth was concentrated, and nothing to do with the richest 1% of Americans donating money to the right politicians.  The "power" WAS dispersed; it was dispersed among millions and millions of citizen voters.  The wealthiest black person in the country couldn't possibly have used political contributions to speed up the process much before its time.  &lt;/span&gt;&lt;span style="font-family:verdana;"&gt;&lt;br /&gt;&lt;br /&gt;Of course, some people had more influence than others, like Martin Luther King.  And, the media: newscasters, &lt;/span&gt;&lt;span style="font-family:verdana;"&gt;and columnists, and reporters; TV, and radio, and newspapers.  Someone writing for the New York Times, for instance, would get read by millions of people.  One New York Times is the equivalent of thousands of water coolers.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That means that if you want to argue that inequality is undermining democracy, you shouldn't be thinking about money.  You should be thinking about public discourse. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Let's suppose that over the last year, the Times has had five or six opinion pieces per day.  That means that, at most, around 2,000 Americans got to have their voices heard in the op-ed pages of the New York Times.  Many writers, of course, appeared more than once.  For the sake of argument, let's say there were 1,500 different writers.  The US population is 300 million, which gives us this shocking measure of inequality of influence:&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-weight: bold;font-family:verdana;" &gt;The top 0.0005% of US writers wrote 100% of the New York Times op-eds.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Compare that to money: by my estimation, the top 0.0005% of US households earned less than 2% of the overall income.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;By this measure, that means the concentration of op-eds is FIFTY TIMES AS HIGH as the concentration of income.  Fifty times.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But that's not really fair, since the New York Times isn't the only place you can broadcast your influence.  Let's suppose the top 100 venues -- newspapers, TV, or blogs -- have 100,000 different writers, and comprise 75% of the total political influence in the US.  Compare that to the top 100,000 households in income:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:courier new;"&gt;0.03% of the population has 8% of the income.&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;span style="font-family:courier new;"&gt;0.03% of the population has 75% of the influence.&lt;/span&gt;  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, if democracy is compromised by the unequal distribution of income, how can you say it's not compromised by an unequal distribution of public access to opinions and influence -- especially when the latter inequality is EIGHT TIMES AS HIGH?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The fact is that Ian Ayres, with &lt;a href="http://www.nytimes.com/2005/09/16/opinion/16ayres.html"&gt;at&lt;/a&gt; &lt;a href="http://www.nytimes.com/2003/09/30/opinion/dialing-for-dollars.html?n=Top%2fReference%2fTimes%20Topics%2fSubjects%2fT%2fTelephones%20and%20Telecommunications"&gt;least&lt;/a&gt; &lt;a href="http://www.nytimes.com/2003/04/15/opinion/charity-begins-at-schedule-a.html?n=Top%2fReference%2fTimes%20Topics%2fSubjects%2fT%2fTaxes"&gt;three&lt;/a&gt; other op-ed pieces in the New York Times in the past ten years -- 50,000 times as many as the average American -- has much, much more influence on public policy than your typical rich guy.  And &lt;a href="http://topics.nytimes.com/top/opinion/editorialsandoped/oped/columnists/paulkrugman/index.html"&gt;Paul Krugman&lt;/a&gt;, the Times economics columnist, writes around 100 columns a year -- &lt;span style="font-style: italic;"&gt;fifteen million times as many as the average American&lt;/span&gt;.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;"Enough is enough," indeed.&lt;br /&gt;&lt;br /&gt;I propose &lt;/span&gt;&lt;span style="font-family:verdana;"&gt;a "Krugman Tax."  Every year, we'll compute the "Krugman Ratio," the number of words the top 100,000 writers published, divided by the number of words the average American published.  If it's more than, say, 40, those writers will be taxed just enough words to bring the ratio back down to 40 in future.  That will ensure that everyone, not just the ultra-published like Ian Ayres and Paul Krugman, can influence American political decisions.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;We'll do it for democracy.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-2240591562829506950?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/2240591562829506950/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=2240591562829506950' title='13 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/2240591562829506950'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/2240591562829506950'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/12/will-taxing-rich-improve-democracy.html' title='Will taxing the rich improve democracy?'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>13</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-7557845802584452636</id><published>2011-12-21T15:36:00.006-05:00</published><updated>2011-12-21T15:50:57.755-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Ken Dryden'/><category scheme='http://www.blogger.com/atom/ns#' term='clutch'/><category scheme='http://www.blogger.com/atom/ns#' term='goaltending'/><title type='text'>Are there "good team" goalies and "bad team" goalies?</title><content type='html'>&lt;span style="font-family:verdana;"&gt;In "&lt;a href="http://www.amazon.com/Game-Ken-Dryden/dp/0470835842/ref=sr_1_1?ie=UTF8&amp;amp;qid=1324306372&amp;amp;sr=8-1"&gt;The Game&lt;/a&gt;," Ken Dryden argues that some players are not psychologically suited to playing on good teams:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-family:verdana;"&gt;Because the demands of a goalie are mostly mental, it means that for a goalie the biggest enemy is himself.  The fear of failing, the fear of being embarrassed ... The successful goalie understands these neuroses, accepts them, and puts them under control.  The unsuccessful goalie is distracted by them, his mind in knots, his body quickly following.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;It is why [Rogie] Vachon was superb in Los Angeles and as a high-priced free-agent messiah, poor in Detroit.  It is why Dan Bouchard ... lurches annoyingly in and out of mediocrity.  It is why there are good "good team" goalies and good "bad team" goalies -- Gary Smith, Doug Favell, Denis Herron.  The latter are spectacular, capable of making near-impossible saves that few others can make.  They are essential for bad teams, winning them games they shouldn't win, but they are goalies who need a second chance, who need the cushion of an occasional bad goal, knowing that they can seem to earn it back later with several inspired saves.  On a good team, a goalie has few near-impossible saves to make, but the rest he must make, and playing in close and critical games as he does, he gets no second chance.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;A good "bad team" goalie, numbed by the volume of goals he cannot prevent, can focus on brilliant saves and brilliant games, the only things that make a difference to a poor team.  A good "good team" goalie cannot.  Allowing few enough goals that he feels every one, he is driven instead by something else -- the penetrating hatred of letting in a goal.&lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Dryden seems to be saying at least three things here:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;1.  Some goalies, like Rogie Vachon, can't handle pressure.&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;2.  Some goalies are better on bad teams than on good teams.&lt;br /&gt;3.  Those two groups are the same goalies.&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I'm very skeptical about #1, especially with regards to Rogie Vachon.  Yes, &lt;a href="http://www.hockey-reference.com/players/v/vachoro01.html"&gt;Vachon&lt;/a&gt; had a serious decline after leaving the Kings -- with Detroit, he was worse by more than a goal a game (3.90 to 2.86).  But, was it really Vachon's neuroses?  After all, he was 33 years old that year.  Dryden may know Vachon pretty well -- they were together on the Canadiens for a few months in &lt;a href="http://www.hockey-reference.com/teams/MTL/1972.html"&gt;1971&lt;/a&gt; -- but is that enough for him to conclude that Vachon's problem is that he choked under pressure?  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I'll skip over #3, also, and concentrate on #2, the part about "good team" goalies and "bad team" goalies.  What Dryden seems to be saying, as an empirical hypothesis, is something like this:  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-style: italic;font-family:verdana;" &gt;There are some goalies who make brilliant saves that few others can, but also give up more weak goals.  Those goalies are more valuable to bad teams, because bad teams give up more scoring chances where brilliant saves are required.  It wouldn't make sense for a good team to pick up a goalie like that, because they'd get only the weak goals, but not the brilliant saves.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That's actually a pretty interesting theory!  And it seems plausible.  After all, what a team should care about is how many goals a guy allows, not how he looks doing it.  A goalie with a 2.50 GAA is more valuable than a goalie with a 2.75 GAA, even if the first guy lets in more bad goals than the second guy.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, is there any evidence for it?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Not in the book.  Dryden gives us only those three examples of "bad team" goalies.  Unfortunately, they played on bad teams for most of their careers.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Still, we have a few datapoints.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- &lt;a href="http://www.hockey-reference.com/players/s/smithga01.html"&gt;Gary Smith&lt;/a&gt; left Oakland (bad) to play two seasons for the Black Hawks (good) as Tony Esposito's backup.  The first year, he was very good; the second year, he was mediocre.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- &lt;a href="http://www.hockey-reference.com/players/h/herrode01.html"&gt;Denis Herron&lt;/a&gt; moved from (bad) Pittsburgh to (good) Montreal (where he replaced Ken Dryden).  Like Smith, he was great the first year, but not so great the second year.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- &lt;a href="http://www.hockey-reference.com/players/f/faveldo01.html"&gt;Doug Favell&lt;/a&gt; was nothing special in his last season with Toronto (an average team).  Then, he went to a below-average Colorado Rockies team, where it seems like he was pretty good.  So, maybe that's a plus.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, overall ... no real evidence either way, really.  And Dryden doesn't give any examples of "good team" goalies, so there's nothing to check there.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But ... maybe here's something we can do. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Goalies play some of their games against good teams, and some against bad teams.  If Dryden is correct, that Smith, Herron and Favell give up more bad goals but also make more spectacular saves, they should do better than expected against good teams, and worse than expected against bad teams.  That's because they'll give up roughly the same amount of bad goals each way, but they'll make more brilliant saves against the good teams.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Does that make sense?  Maybe we can find a way to check that.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Here's how that might work.  In &lt;a href="http://www.hockey-reference.com/leagues/NHL_1978.html"&gt;1977-78&lt;/a&gt;, the &lt;a href="http://www.hockey-reference.com/teams/PIT/1978.html"&gt;Penguins&lt;/a&gt;, with Denis Herron as their regular goalie, gave up 321 goals, or 4.01 per game.  That season, the best five teams (alphabetically) were the Bruins, Canadiens, Flyers, Islanders, and Sabres.  The worst were the Barons, Blues, Canucks, Capitals, and North Stars.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;From the &lt;a href="http://www.databasehockey.com/teams/teamscores.htm?tm=PIT&amp;amp;yr=1977"&gt;game log&lt;/a&gt;, I manually calculated that against the five best teams, the Penguins gave up 5.25 goals per game.  Against the five worst teams, they gave up 3.14.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;For that to be evidence that the Penguins have "bad team" goalies, you'd have to show that 5.25 goals against good teams is actually better than expected for a team that gives up 4.01, and that 3.14 against bad teams is actually worse than expected for a team that gives up 4.01.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;How would you do that?  Well, one thing you could do is find a matching team, one that also gave up 4.01 goals per game (or close to it), but had a random goalie.  If that team gave up 6.00 goals against the good teams, but 2.50 against the bad teams, that would be confirmatory evidence.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The match wouldn't be perfect, because it might have to be from another season, and the "best" and "worst" groups might not be comparable.  Still, even with just those three goalies, you'd have about 33 seasons to compare (if you require a minimum 30 game season).  If you found the two most comparable instead of one, that would be 66 comparisons.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That's better than nothing.  But there'd still be a lot of noise.  To make it workable, you'd have to limit your sample to games those goalies started (in 1977-78, Herron himself gave up only 3.57 goals per game, compared to 3.96 for the team after subtracting empty net goals).  There'd be even less noise if you used save percentage instead of goals against.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The process would be a bit like searching for clutch hitting, only with a lot less data.  And it would be a lot of work ... but, there's an organization, the &lt;a href="http://hsp.flyershistory.com/"&gt;Hockey Summary Project&lt;/a&gt;, that collects NHL game summaries -- like a hockey Retrosheet -- and I've asked &lt;a href="http://sports.groups.yahoo.com/group/hockey_summary_project/"&gt;them&lt;/a&gt; for access to their database.  I'm hoping they have shots on goal.  (Also, those summaries might help us trying to trace the games that Dryden talked about in his book, the ones that didn't match the game logs.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Before I go any further, does this make sense as a way to test Dryden's hypothesis?  Can you think of any others that might be easier?&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-7557845802584452636?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/7557845802584452636/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=7557845802584452636' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/7557845802584452636'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/7557845802584452636'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/12/are-there-good-team-goalies-and-bad.html' title='Are there &quot;good team&quot; goalies and &quot;bad team&quot; goalies?'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-2977983642047799014</id><published>2011-12-19T09:49:00.014-05:00</published><updated>2012-01-07T13:11:28.075-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='hockey'/><category scheme='http://www.blogger.com/atom/ns#' term='Don Cherry'/><category scheme='http://www.blogger.com/atom/ns#' term='Ken Dryden'/><title type='text'>Ken Dryden's "The Game"</title><content type='html'>&lt;span style="font-family:verdana;"&gt;Ken Dryden's book, "&lt;a href="http://www.amazon.com/Game-Ken-Dryden/dp/0470835842/ref=sr_1_1?ie=UTF8&amp;amp;qid=1324306372&amp;amp;sr=8-1"&gt;The Game&lt;/a&gt;," is considered one of the greatest ever in sports.  Many times, I've heard it called "the best hockey book ever written," and that's the quote (unattributed) on the cover of my 1999 edition.  The Canadian literary crowd loves it.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, last week, I thought I'd read it.  I was disappointed.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;"The Game" takes place towards the end of the 1978-79 NHL season, Dryden's last before retirement.  It was actually written later, based on his notes at the time, and published in 1983.  It takes the form of a ten-day diary, although most of each day is taken up by Dryden's reminiscences and analyses, rather than the actual events of the particular day.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I'd argue that it's only tangentially a book about hockey.  It's really a book about Ken Dryden.  What he does, how he feels, what goes on in his dressing room, what he thinks of his teammates, and so forth.  If you actually were hoping to learn something about hockey and how it works, there won't be much for you here ... except that you'll hear some stories about the personalities of teammates and coaches.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;It doesn't talk much at all about strategy, or playmaking, or statistics, or how to win games.  It deals a little more with personalities, and stories.  But, mostly, it deals with Ken Dryden's feelings.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That may not sound too interesting ... except that Ken Dryden is very, very articulate.  He can take the most common observation, and write paragraphs of poetry about it.  Here he is talking about travelling to the Forum:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-family:verdana;"&gt;"I drive down side streets narrowed by drifts and snow-shrouded cars.  Traffic is light today, and the few cars on the road move easily, unconcerned by the conditions.  After the awkward caution of a winter's first snowfall, for Montreal drivers, like riding a bike, it all comes back, and slippery streets are driven as if bare and dry.  I park several blocks away from the Forum and walk.  The wind, gusting up Atwater Street, is bitter and cold, and, hunching over, I try to cover up, but can't.  I start to jog, then run, faster as the wind bites harder.  At de Maisonneuve, the light turns red but I continue across."&lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Well, OK, great, that's certainly a vivid picture of a windy Montreal winter's day.  But the tone is a bit dramatic.  It's an impressive feat of writing, and there's nothing wrong with being Shakesperean at times ... but Dryden does it *everywhere*, from the first page to the last.  It started grating on me.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;What would mitigate the tone, a bit, is if Dryden would talk more concretely about hockey.  Sure, there are a few pages on Scotty Bowman's coaching style, and on Larry Robinson's history, and little profiles of some of the other players pop up occasionally.  Finally, close to the end of the book, there's a bit of meat -- about 20 pages on the history of hockey, and how the game has suffered, and how rule changes can fix it.  It's really good stuff, especially for 30 years ago, when that kind of analysis was rare.   Moreover, with something more concrete to explain, Dryden allows some air to come out of the lofty prose, and it reads a lot better.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Still, even accounting for those 20 pages, the book is mostly Ken Dryden, sociologist and psychologist, observing his hockey team.  It becomes a little weird because Dryden writes like he's not even there, like he's a psychiatrist floating above the dressing room taking notes.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I guess that's his thing.  Ken Dryden is a lawyer; he famously took a year off from the NHL in 1973-74 to finish his law degree.  He's intelligent and articulate, and it's almost like he figures there's no everyday scene that can't be made better if you try to analyze it dramatically:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-family:verdana;"&gt;"... Half-naked players move hurriedly about, laughing, shouting for tape (black or white, thick or thin), cotton, skate laces, gum, ammonia "sniffers," Q-Tips, toilet paper, and for trainers to get them faster than they can.  It is the kid of unremitting noise that no one hears and everyone feels.  But there is another level dialogue we can all hear.  It is loud, invigorating, paced to the mood of the room, the product of wound-up bodies with wound-up minds.  It's one line, a laugh, and get out of the way of the next guy -- "jock humor."  It is like a "roast," the kind of intimate, indiscriminate carving that friends do to keep egos under control.  Set in motion, it rebounds by word association, thought association, by "off the wall" anything association, just verbal reflex, whatever comes off your tongue, the more outrageous the better.  Elections, murders, girl friends, body shapes, body parts, in the great Tonight Show / Saturday Night Live tradition, verbal slapstick dressed as worldly comment ..."&lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Nothing wrong with all that, but ... for a lot of the book, that's all there is.  And, for all the analysis and introspection, you won't even feel like you know much abour Dryden, which is weird, because he writes a lot about himself: how he feels, and when he's more confident in net, and how he reacts to winning and losing, and his thoughts on retirement.  But it's all detached, like he's trying to save you the trouble of understanding him yourself by doing the analysis for you.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;There are a few occasions when he tries to say something about the game, something that a sports columnist would say, and it's almost a relief -- finally, something you can get your brain around!  At one point, for instance, he talks about how he doesn't think the Canadiens are going to win this year (they eventually did, but that season was their last of four consecutive Stanley Cups).  Why?  Because, he says, he notices the team is complacent, spoiled by its own success, looking for the "big play" to win the game.  Players "shoot from long range, safe from the punishment that goes with rebounds, deflections, screens, and goal-mouth tip-ins."  Dryden says he himself cares less about winning, "content that goals appear as "good goals.""&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I'm not sure I necessarily buy it 100%, but at least there's something concrete there ... you can try to figure out if it's true or false, or at least how you could study the issue.  You think, hey, the guy played in the NHL for a few years, finally he's telling us what he learned!&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, that's as meaty as it gets before Dryden starts getting poetic again:&lt;/span&gt;&lt;br /&gt;&lt;blockquote&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;"I have felt it before in other years, but never so often and never with the same feeling, that if we lose, it will be because of us, no one else.  It is not fun to feel a team break down, to find weakness where I always found strength; to discover the discipline and desire can go soft and complacent; to discover that we are not so different as we once thought; to realize that winning is the central card in a house of cards, and that without it, or with less of it, motivations that seemed pure and clear go cloudy, and personal qualities once noble and abundant turn on end; to realize that I am a part of that breakdown."  &lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I can see why some people like this stuff, but ... well, I don't.  It's just cotton candy, an exquisitely-worded paragraph that melts down to nothing when you try to figure out what it means.  It's articulateness disembodied from communication, as if, when you say something beautifully and poetically enough, it doesn't matter that there's no content.  Dryden produces this huge fog of articulateness that overwhelms you with feeling and the sense that something important has just been said, but ... there's not much there when you actually look.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I can't resist one more example.  Here's Dryden analyzing Guy Lafleur:&lt;/span&gt;&lt;br /&gt;&lt;blockquote&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;"For there is a life there, and in destiny and romance there is no room for life.  Painted as they are with broad brush strokes, vivid and lush, they find shape and pattern only with distance.  The person who lives them is too close.  He feels sweat as well as triumph.  He understands what others see, but feels none of it himself."&lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Huh?  Some people eat this stuff up, but I just don't get it.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Call me cynical, but I think that these things I think are weaknesses are actually why people like this book.  It's about hockey, but not about hockey.  It talks about "big issues" that people like to pretend they care about.  There are long digressions into Quebec separatism and culture, and personal growth and emotion.  It's articulate, it's educated, and it's erudite.  It detaches the reader from the world of uneducated jocks, allowing them to identify and affiliate with someone they look up to.  For some readers, it lets them signal to the world that *this* is the hockey they like, that they watch every Saturday night and talk about at the water cooler, the hockey that's deep and sociological and high-class. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;"The Book" is a lot like a politician giving a speech, saying things so beautiful and eloquent and moving that you don't even notice that he's not saying anything concrete about what he'll do once elected.  You wind up voting for the guy, not because of how he'll be able to run the country, but because you feel like you're voting for him as a person, good-looking and well-dressed and articulate.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, no offense to Dryden ... it could be just me, that his book just isn't what I'm looking for, that the emotional stuff doesn't do anything for me.  But ... whether it's me or not me, this is still not a book about hockey.  It's a book about Ken Dryden, articulate hockey player.  If you didn't know much about hockey before the book, you still won't know much about it after.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Of course, if you're a Montreal Canadiens fan, you'll love it ... the little stories about the team you love will be irresistible.  In his chapter on Toronto, there are some of Dryden's little psychological observations on some of the Leafs of my childhood.  I'm not sure if I really believe them, but I still ate them up, and I wished there were more. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;------&lt;/span&gt;&lt;a name="tracers"&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;It's kind of an aside, but the reminiscences in the book don't seem to be right.  When I started writing this post, I figured I'd try to let you know which ten days Dryden's diary covered.  It should have been easy: a ten-day stretch with the first game in Buffalo, and the last game against the Islanders.  But the historical record doesn't jibe with Dryden's narrative.  It's not just occasionally that the book is off, but almost all the way through.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;This may be a little long ... if you don't like this boring "tracers" stuff, you can skip to the next section.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;1.  At the beginning of the book, Dryden writes "last night was the sixty-second game of my eighth season with the Montreal Canadiens."  It was a game in Buffalo, where the Habs won and Dryden played well.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Here's a &lt;/span&gt;&lt;a style="font-family: verdana;" href="http://www.databasehockey.com/teams/teamscores.htm?tm=Mon&amp;amp;yr=1978"&gt;game log for the 1978-79 Canadiens&lt;/a&gt;&lt;span style="font-family:verdana;"&gt;, one I'll refer to many times in this post.  From the log, it seems the 62nd game of the 1978-79 season was a home game against the Leafs, on March 1.  Dryden might be referring to the February 18, game, which was a 5-2 win in Buffalo.  Not that big a deal.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That game, Dryden writes, came "after a tie in Chicago and a Saturday loss at home to Minnesota."  That doesn't work out.  The Canadiens played two games in Chicago that year: a 4-1 loss on October 28, and a 5-3 victory on December 20.  Neither of their two home games against the Black Hawks was a tie, and neither closely preceded a game in Buffalo.  They did tie Chicago the year before, on Feburary 9, 1978, but that was a home game.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;They did play Chicago at home on November 25, losing 8-3, making up for it with a 8-1 rout in Buffalo two games later.  That's as close as I could find.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;As for Minnesota, the Habs played them four times that year, winning three.  The loss was a 4-3 game, but not at home, and not on a Saturday.  It was on Wednesday, March 14, 1979 -- almost a month *after* the Sabres game that Dryden possibly describes.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;a name="tracer2"&gt;&lt;/a&gt;2.  That Buffalo game was on a Sunday, as it was described as "yesterday" in the "Monday" chapter.  The next game occurs on "Wednesday," at Maple Leaf Gardens in Toronto.  That could have been the aforementioned 62nd game that year, on March 1, except that it doesn't match.  The Habs won it 2-1, but Dryden's description is 6-4.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;It can't be that he just misremembers the score, because he describes the game in detail.  After tying the game, Dryden writes, the Leafs get confident that they can keep up with the powerhouse Canadiens, and the Leafs begin to take over the play.  But Mark Napier and Pat Hughes score two quick goals for the Habs.  The Canadiens score two more, and then the Leafs get two late.  The next day, the players wonder why coach Scotty Bowman didn't give them hell for allowing those two late goals.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That adds up to 6-4.  There was no 6-4 win in Toronto in 1978-79.  Also, I couldn't find a 6-4 win in Toronto in either of the two prior seasons.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;There was a 6-3 win on February 3.  I looked that game up in the February 5 Globe and Mail.  It doesn't match Dryden's description: it went 1-0 for the Leafs, then 3-1 Habs, then 3-2, Then 4-2, then 6-2, then 6-3.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;3.  The next day, Dryden writes, the Habs fly to Boston, where they win 3-2 after coming back in the third period from a 2-1 deficit.  But, according to the 1978-79 game logs, Montreal played only two regular season games in Boston, and tied both of them, 1-1 and 3-3.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;There *is* a game that fits in the *following* season, February 10, 1980.  It matches the 3-2 score, and the Habs coming back from 2-1 in the third period.  But, of course, it can't be that game, because Dryden was then retired (Denis Herron was in net).  Also, Dryden describes Larry Robinson tying the game early in the period on a power-play goal, and Mario Tremblay potting the winner.  But the third period goals were actually scored by Mark Napier and Pierre Larouche -- the first goal coming at 10:03 -- and there were no power plays after the first period.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;4.  The next game is in Montreal, against Detroit.  Scotty Bowman says, "we got 'em back in Detroit next week."  The home-and-home timeline pins it down to the game of April 4 (the return game in Detroit would have been April 8).  According to the game log, Montreal won the April 4 game by a score of 4-1.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But that can't be it.  During the game, Dryden notices the Leafs vs. Flyers on the out of town scoreboard. Those two teams didn't play each other on April 4.  But they did on March 3, when Detroit was also visiting Montreal.  So now we have two candidates.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Dryden doesn't explicitly tell us the final score, but he tells us Montreal won.  The book says it was 1-0 after the first period, 2-1 after the second, and, with five minutes left in the game, Guy Lapointe scored on a shot off Mario Tremblay's right knee to make it 3-1.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Montreal lost on March 3, so it couldn't be that one.  On April 4, Montreal won 4-1, which looks promising -- perhaps an empty-net goal that Dryden didn't report.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, nope, the rest of the details don't match.  The April 4 game was 1-1 after one period, and 4-1 after the second.  Jacques Lemaire had a hat trick, and Steve Shutt the other goal.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;5.  The next game, Dryden has Montreal 7, Philadelphia 3.  Flyers goalie Bernie Parent was out with an eye injury that game, the book says.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Parent did indeed suffer a career-ending eye injury in 1978-79, which &lt;a href="http://www.hockey-reference.com/players/p/parenbe01.html"&gt;occurred on February 17&lt;/a&gt;.  But, Montreal's last game against Philadelphia was January 29.  It was indeed a 7-3 score -- but Bernie Parent was the starting goalie.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;6.  Next is a road game against the Islanders.  The book has been foreshadowing that Islanders game from the beginning -- the Islanders were challenging Montreal for number 1 in the standings, and appeared to be the Canadiens' main rivals for the Cup that year.  There are several mentions of that important Islanders game coming up, including one in the first few pages, which puts it nine or ten days after the Sabres game.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;As far as score, all Dryden says is that the Canadiens lost.  So it could be either of Montreal's two games on Long Island: February 27, by a 7-3 score, or October 17, by 3-1.  The February 27 game is nine games after the original Buffalo game, so that must be it.  Still, none of the games in between are the ones Dryden describes in the book.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;7.  Epilogue: "In the season's final game, we needed a tie against Detroit for first place, and we lost.  The Islanders, waiting to be crowned, lost to the Rangers in the playoffs.  And we won again."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Finally, this time it works out.  Detroit beat Montreal 1-0 in the last game, and the playoffs match Dryden's recollection.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;...&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, six out of seven cases don't check out.  What happened?  Perhaps instead of using an actual ten days out of the season, Dryden pieced together a composite.  That might actually make sense.  The book has lots to say about Toronto, where Dryden grew up.  It has lots to say about Boston, where he went to school and had a major playoff victory in 1972.  And it has lots to say about Philadelphia, which Dryden uses as his springboard for what's wrong with hockey.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;He'd have liked to have a period that included all three cities.  Maybe he just constructed one out of previous games, periods, and goals that he saw.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;...&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;8.  Finally, not a score thing, but: on page 71, Dryden describes Maple Leaf Gardens:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-family:verdana;"&gt;"The enormous Sportimer is gone; an even larger, more versatile scoreboard-clock, the kind you might find in any large arena, is in its place."&lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But: in 1979, Gardens sported the same, iconic clock that it had since 1966.  It wasn't changed until 1982 -- after Dryden's retirement, but before his book was published.  Or, maybe he's referring to the original Sportimer, the one before 1966, which he would have seen as a child.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Either way, Dryden's career went from 1971 to 1979, so he would have seen the exact same clock his entire career.  So what's this all about?  Perhaps he saw the new clock in 1982, before the book was published, and thought he also saw it back in 1979.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Reading the book made me think of how different Dryden is from Don Cherry.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Cherry, in one five-minute episode of Coach's Corner, will say more of substance than Dryden says in ten pages.  But Cherry uses uneducated language, dresses funny, is passionate about the things he believes, and occasionally slips into political views that tend to be less accepted by "learned" people.  So Dryden gets credited with "the best hockey book ever written," and Cherry gets scorn.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Coincidentally, Cherry has books of hockey stories too (as dictated to sportswriter Al Strachan); I just finished reading &lt;a href="http://www.amazon.com/Don-Cherrys-Hockey-Stories-Part/dp/0385670052/ref=sr_1_1?s=books&amp;amp;ie=UTF8&amp;amp;qid=1324306504&amp;amp;sr=1-1"&gt;his second one&lt;/a&gt;.  The styles, as you can imagine, are different.  Dryden shrouds each locker room conversation in a cloud of profundity and mood; Cherry tells it in his straight-ahead style.  But Cherry has something to say, and a point to make, and the stories are actually interesting.  Dryden's got a couple of good ones -- like Steve Shutt urinating into a cup, adding Coke for color, and waiting for one of his teammates to come by and drink it -- but most of them are, well, not that engrossing.  Like this one:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-family:verdana;"&gt;Amid the business of getting ready for practice, there is talk of beer.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;"Calisse, you see the paper?" [Rejean] Houle moans.  "Beer's goin' up sixty-five cents a case.  Sixty-five cents!"&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;His words bring a grumble of memory.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;"Shit, yeah," says a mocking voice, "the only thing should go up is what they pay fifteen-goal scorers, eh, Reggie?"&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;There is laughter this time.  Across the room, Guy Lapointe stares at the ceiling, lost in thought.  Suddenly he blurts, "That's it, that's it.  No more drinkin'."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;There is loud laughter.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;"Hey, Pointu," Steve Shutt says, "ya just gotta learn to beat the system -- drink on the road."&lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That one, it seems safe to say, wouldn't have made it into Cherry's book.  And, as far as actual hockey content goes, Cherry has Dryden beat by a mile.  I opened Cherry's book to a random page, 74, where there's a thing about fighting.  It's too long to quote entirely, but here's what Cherry tells me in five paragraphs:&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt; &lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- There is no rush in the world like when you fight.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- When players get older, they get a conscience and start to hesitate, and they have a tough time.  That's why you see few older players fighting.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- Fighters' hands suffer serious damage.  "You wouldn't believe the hands on Joey Kocur ... it looks like he's had a ping-pong ball implanted under each knuckle."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- The advent of helmets and visors have made hand injuries much more common, so fighters will remove their helmets as a show of respect.  "I love it when I see one guy who's have trouble getting the strap loose on his helmet, and the other guy gives him time while he gets it off. ... That's honour!  I love those guys."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;No exaggeration: I learned more in that half-page than probably in 50 pages of Ken Dryden.  Now, I'm not saying Don Cherry is always right, or even mostly right, or that you have to agree with him, or that he's particularly eloquent.  But, he does try to tell you something about hockey.  He may be wrong about some of the things he believes, like Joe Morgan or Harold Reynolds, or any other commentator in any other sport.  But, geez, at least he says things about hockey!&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;As much as people love the Ken Drydens of the world, it's the Don Cherrys who actually can teach you things.  I mean, they're not always right: for every non-Dryden who believes something correctly and passionately, there'll be many other non-Drydens who believe something different, incorrectly but just as passionately.  So I'm not saying that when you find a Don Cherry, you should immediately become a mindless follower.  I'm just saying that if you actually want to know how things are, you should start with the Cherrys and then go with your brain.  Whatever the subject, hockey or otherwise, you'll never learn anything unless you stick with the people who are seriously trying to tell you something -- not just the people who sound the best.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;No offense to Ken Dryden.  I'm not saying that he doesn't know anything about hockey, just that he chose not to tell us too much about it.  He actually wrote a pretty decent book, just one that's not as much to my taste as it could have been.  And Dryden is under no obligation to write the kind of book that I like to read.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Still, "The Game" is nowhere near the best hockey book ever written.  It's just the most poetic.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-2977983642047799014?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/2977983642047799014/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=2977983642047799014' title='6 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/2977983642047799014'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/2977983642047799014'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/12/ken-drydens-game.html' title='Ken Dryden&apos;s &quot;The Game&quot;'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>6</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-1771557642828080674</id><published>2011-12-09T14:21:00.009-05:00</published><updated>2011-12-10T12:11:04.455-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Moneyball'/><category scheme='http://www.blogger.com/atom/ns#' term='economics'/><category scheme='http://www.blogger.com/atom/ns#' term='regression'/><category scheme='http://www.blogger.com/atom/ns#' term='baseball'/><category scheme='http://www.blogger.com/atom/ns#' term='payroll'/><title type='text'>A "Grantland" article on Moneyball effects</title><content type='html'>&lt;span style="font-family:Verdana,sans-serif;"&gt;Here's a &lt;a href="http://www.grantland.com/story/_/id/7328539/the-economics-moneyball"&gt;baseball salary article at Grantland&lt;/a&gt;, by economists Tyler Cowen and Kevin Grier.  It’s a strange one ... the impression I get is that is that the authors are just going on the basics of the "Moneyball" story, but don’t really follow baseball discussions very much.  And so some of their arguments are obviously behind the curve.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;For instance, they talk about how closers used to be paid inefficiently, but aren't any more, except by free-spending teams like New York:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;blockquote class="tr_bq"&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;"This year, the Yankees' Mariano Rivera was ranked fifth in total saves with 44. At a salary of $14.9 million, that works out to be a hefty $338,600 per save. The four closers ranked ahead of him averaged 46.5 saves and a salary of $2.9 million, or $63,771 per save — quite the bargain."&lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;The problem here is obvious to almost any serious baseball fan: closers aren’t normally evaluated by the number of saves, which is mostly a function of the opportunities the team provides.  Rather, and like any other member of the roster, the closer is paid according to how many wins he can contribute to the team's record, as compared to a replacement player.  For Rivera to be worth $15 million, he has to contribute about three extra wins (at a going rate of $4.5 million per win).  Which means, basically, he has to blow three fewer saves, given his opportunities.  Or, rather, he has to be *expected* to blow three fewer saves; there's still a lot of randomness there.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;But Cowen and Grier don't mention randomness at all.  And their only reference to blown saves is in one sentence that mentions the Twins' Joe Nathan and Matt Capps, who blew 12 saves out of 41 opportunities.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;Another thing, too, is that the article doesn't mention one big difference between Rivera and the others: Rivera is a free agent, while young players like Neftali Feliz can be paid whatever the team wants.  The Yankees might prefer Feliz to Rivera, but that’s not a choice they have open to them.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;It's not a new "Moneyball" discovery that "slaves" make less money than established free-agent stars ... but the article seems to imply that teams don’t realize that the $400,000 stopper can be just as valuable, for the money, as the $15,000,000 stopper.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;To me, it looks like the problem is that if you don’t know baseball that well, you tend to overrate the “Moneyball” possibilities, because that’s the story that you’ve heard the most.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;The authors then go on to say:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;blockquote class="tr_bq"&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;"The best-known Moneyball theory was that on-base percentage was an undervalued asset and sluggers were overvalued. At the time, protagonist Billy Beane was correct. Jahn Hakes and Skip Sauer showed this in a very good economics paper. From 1999 to 2003, on-base percentage was a significant predictor of wins, but not a very significant predictor of individual player salaries. That means players who draw a lot of walks were really cheap on the market, just as the movie narrates."&lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;The authors imply that “walks were really cheap on the market,” means that the A’s had a huge hole to exploit.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;But ... even if walks were indeed “really cheap,” it would still be a small hole.  Walks are a significant part of a player’s value, but still in the sense of a small edge, not a huge one.  Suppose teams valued walks at only half their actual value.  If you can pick up a player with 60 walks, for the price of 30, you’ll gain about 10 runs, or one win.  Not a big deal.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;Of course, if you can do that nine times, that’s nine free wins.  But the A’s didn’t.  In 2002, they walked &lt;a href="http://www.baseball-reference.com/leagues/AL/2002.shtml"&gt;609&lt;/a&gt; times, third in the league.  But that was only 157 more walks than Baltimore, second-worst in the league.  If 157 was the number of walks they got at half-price, that’s still only two or three wins.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;You could choose, instead, to compare the A’s to the 2002 Tigers, who walked only 363 times.  That would be completely unrealistic, in my view, to assume the A’s would have been as bad as one of the worst recent teams ever.  But if you do, you *still* only gain four wins.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;The authors also put too much faith in the Hakes/Sauer paper.  As &lt;a href="http://sabermetricresearch.blogspot.com/2006/10/how-fast-did-market-learn-from.html"&gt;I wrote&lt;/a&gt; &lt;a href="http://sabermetricresearch.blogspot.com/2007/10/updated-moneyball-effect-study.html"&gt;a few years ago&lt;/a&gt;, it seems to me that the paper has a few problems, and I don’t think it shows what it purports to show.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;The study found a huge increase in the correlation between salary and OBP between 2003 (when the "Moneyball" book was released) and 2004.  The numbers for 2004 almost exactly matched the actual value of a walk, so the authors concluded that the market became efficient in the off-season, and teams wised up after reading the book..  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;But that conclusion doesn’t make sense.  Since only a small percentage of players got new contracts between 2003 and 2004, for the overall average to move so much, the market would have had to overcompensate for walks by double, or triple their real value!  That doesn’t sound like a reasonable possibility, and it’s certainly not consistent with GMs now learning to be efficient. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;Finally, on the subject of correlation:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;"Here's something funny about the Moneyball strategy: It is bringing us a world where payroll matters more and more. Spotting undervalued players boosts their salaries and makes money more important for the general manager; little did Billy Beane know that in the long run he would be strengthening the hand of the large home-market teams, such as the Yankees. From 1986 to 1993, payroll explained 2.2 percent of the variation in team winning percentage, and that meant spending more money yielded little return in terms of quality on the field. In the 2004 to 2006 seasons, after the Moneyball revolution was under way, payroll explained 27.1 percent of the variation in team winning percentage, which means a stronger reason to spend more.&lt;/span&gt;"&lt;/blockquote&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;I've written about this &lt;a href="http://sabermetricresearch.blogspot.com/2009/05/regression-equation-versus-r-squared.html"&gt;before&lt;/a&gt;, and Tango’s written about it several times: a higher r-squared does NOT necessarily mean money is more important in buying wins.  Rather, the r-squared is a combination of:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;1. the extent to which money can actually buy wins;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;2. the extent to which teams differ in spending, in real-life.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;When the authors say, "spending more money yielded little return,"  they seem to be assuming it’s all the first thing, when it might be all the second thing.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;As an example, take &lt;a href="http://en.wikipedia.org/wiki/Duel"&gt;dueling&lt;/a&gt;, where two people go out at dawn, draw weapons, and one of them kills the other.  Back when it was legal, dueling would explain a lot of the variation in death rates of people who didn’t like each other.  Now that it’s illegal, it explains zero.   &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;However, the fact that the r-squared dropped doesn’t mean that dueling is any less dangerous than it used to be (point 1) -- it just means that people no longer vary in how often they get killed in duels (point 2).&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;The same thing could be happening here.  I did a Google search and found an article (&lt;a href="http://www.accessecon.com/pubs/eb/2003/volume1/EB-03A10003A.pdf"&gt;.pdf&lt;/a&gt;) that gives some team payroll data for the period the article covers.  From Table 1, the article shows that from 1985 to 1990, fourth quartile teams (the 25% of teams with the highest payrolls) outspend the first quartile teams by only about 2 to 1.  From 1998 to 2002, the ratio jumped to 3 to 1.  The paper only covers to 2002, but a glance at &lt;a href="http://www.stevetheump.com/Payrolls.htm#98_payroll"&gt;later numbers&lt;/a&gt; seems to show around 2.5 to 1 (but up to 3.1 to 1 for the 2011 season).&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;This is evidence that at least *some* of the difference is probably caused by teams being willing to spend more. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;I may be unfair to the authors here ... that might be partly what they’re saying.  If I read them right, they’re saying that, armed with "Moneyball" concepts, teams are realizing they can buy wins cheaper by evaluating players more accurately (1) -- and, that teams are therefore more likely to vary in how much they pay when they know it’s money well spent (2).  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;But ... well, I think these effects are pretty small.  As I argued, walks are a small part of the overall equation, even if they were undervalued by half (which itself is probably an overestimate).  It’s not like, in 1990, teams were paying Jose Oquendo as much as Wade Boggs.  To be sure, teams weren’t perfect in evaluating players -- but they were still reasonably good.  Any improvement since then has to be relatively small, at the margins.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;So, the idea that teams would say, "hey, we can now evaluate players slightly more accurately, so let’s go on a spending spree" doesn’t seem all that plausible.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;What actually *did* happen to tighten the relationship between payroll and wins?  As usual, you guys probably know better than I do.  I’ll give you my guess anyway, which is that it’s a combination of a bunch of things:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;1.  It became more "socially acceptable" for teams to pay big money to free agents.  Remember, 1985 to 1990 includes the collusion year, and there was probably a significant amount of pressure to keep spending down.  That pressure was probably more significant in discouraging headline-grabbing salaries, rather than routine signings, so maybe a player who was twice as valuable wouldn’t be able to sign for twice as much.  That would help keep the correlation between salary and success low.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;2.  When baseball revenues exploded, they grew more in some cities than others.  That meant that marginal wins would be extremely valuable to the Yankees, but not so much to the Pirates.  That increased the variation in team spending, which pushed up the r-squared.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;3.  Teams got smarter, in line with Cowen and Grier’s theory.  But I think that was a small part of what happened.  Also, I’d guess that a lot of improvement in that regard would have happened well before Moneyball, as Bill James’ discoveries got around a bit.  Conventional wisdom denies that baseball executives put any faith in what Bill James had to say, but ... I dunno, good ideas tend to get noticed, even if people say they don’t believe in them.  Also, Bill James’ ideas showed up early in arbitration hearings, which affected the teams’ bottom lines pretty much immediately.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;4.  Randomness.  In a team payroll to wins regression, Cowen and Grier give an r-squared of .022 for 1986 to 1993.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;(By the way, I assume Cowen and Grier's regression adjusted for payroll inflation ... salaries more than doubled between 1986 and 1993.  If they didn't adjust, that might explain the low correlation.)&lt;/span&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt; &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;I wonder if that .022 might just be an outlier.  Here are equivalent numbers from Berri/Schmidt/Brook in "The Wages of Wins," page 40: &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;Wages of Wins:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;1988 to 1994: r-squared = .062, r = .25&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;1995 to 1999: r-squared = .325, r = .57&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;2000 to 2005: r-squared = .176, r = .42&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;Cowen/Grier: &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;1986 to 1993: r-squared = .022, r = .15&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;The numbers sure do move around a lot!  It probably doesn’t take much to knock the correlation down: you need a few teams to get lucky in exceeding their talent, and a few teams to get lucky and get some good slaves and arbs.  Maybe I’ll try a simulation and see how common a .022 might actually be.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:Verdana,sans-serif;"&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-1771557642828080674?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/1771557642828080674/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=1771557642828080674' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/1771557642828080674'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/1771557642828080674'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/12/heres-baseball-salary-article-at.html' title='A &quot;Grantland&quot; article on Moneyball effects'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-4103303789816415634</id><published>2011-12-06T14:29:00.003-05:00</published><updated>2011-12-06T14:33:49.380-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='statistics'/><category scheme='http://www.blogger.com/atom/ns#' term='academics'/><category scheme='http://www.blogger.com/atom/ns#' term='Bill James'/><category scheme='http://www.blogger.com/atom/ns#' term='regression'/><title type='text'>Transparent studies are better, even if they're less rigorous</title><content type='html'>&lt;span style="font-family: verdana;"&gt;In 1985, Orioles manager Joe Altobelli claimed that power pitchers do better than finesse pitchers in cold weather.  In the 1986 Baseball Abstract, Bill James wanted to check whether that was true.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;So, he went through baseball history, and found sets of pitchers with exactly the same season W-L record, but where one was a power pitcher and one was a finesse pitcher.  For example, Tom Seaver, a power pitcher, went 22-9 in 1975.  He got paired with Lary Caldwell, a finesse pitcher, who went 22-9 in 1978.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Bill found 30 such pairs of pitchers.  So, he had two groups of 30 pitchers, each with identical 539-345 (.610) records overall.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;He compared the two groups in April, the cold-weather month.  As he put it, "Altobelli was dead wrong.  He couldn't have been more wrong."  It turned out that the power pitchers were only 49-51 (.490) in April, while the finesse pitchers were 63-37 (.630).  That's exactly opposite to what Altobelli had thought.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Nice study, right?  I love it ... it's one of my favorites.  But it wasn't very "sophisticated" in terms of methodology.  For instance, It didn't use regression.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Should it?  Well, I suppose you could make that argument.  With regression, the study can be more precise.  Bill James used only season W-L record in his dataset, but in your regression, you could add a lot more things: ERA, strikeouts, walks.  You could include dummy variables for season, park, handedness, and lots of other things.  And, of course, you wouldn't be limited to only 30 pairs of pitchers.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;And you'd get a precise estimate for the effect, with confidence intervals and p-values.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;But ... in my opinion, that would make it a WORSE study, not better.  Why?  One big reason: Bill's study is completely transparent, and understandable.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Take the four paragraphs above, and show them to a friend who doesn't know much about statistics.  That person, if he's a baseball fan, will immediately understand how the study worked, what it showed, and what it means.   &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;On the other hand, if you try to explain the results of a regression, he won't get it.  Sure, you could explain the conclusions, and what the coefficient of walks and strikeouts mean, and so on.  And he might believe you.  But he won't really get it.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;With the easy method, anyone can understand what the evidence means.  With regression, they have to settle for understanding someone else's explanation of what the evidence means.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Reading Bill's study is like being an eyewitness to a crime.  Reading the results of the regression is like hearing an expert witness testify what happened.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Now, you may object: why is that so important?  After all, if it takes a sophisticated method to uncover the truth, well, that's what we have to do.  Sure, it's nice if the guy on the street can be an eyewitness and understand the evidence, but that's not always possible.  If we limited our research studies to methods that were intuitive to the layman, we'd never learn anything!  Physics is difficult, but gives us cars and airplanes and electronics and nuclear energy.  If it takes a little bit of effort and education to be able to do it, then that's the price we pay!&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;To which I have two responses.  The first one is, that, actually, I agree.  I'm not saying that we should limit our research to *only* studies that laymen understand better.  I'm just saying that it's preferable *when it's possible*.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;The second response, though, is: it's not just laymen I'm talking about.  It's also sophisticated statisticians and sabermetricians.  You, and me, and Tango, and Bill James, and JC Bradbury, and David Berri, and Brian Burke, and all those guys.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Because, the truth is, a regression study is not transparent to ANYBODY.  I mean, I've read a lot of regressions in the last few years, and I can tell you, it takes a *lot* of work to figure out what's going on.  There are a lot of details, and, even when the regression is simple, there's a lot of work do to in the interpretation.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;For instance, a few paragraphs ago, I gave a little explanation of how a regression might work for Bill's study.  And, suppose, making some numbers up, I get a coefficent of -.0007 per strikeout, and -.0003 per walk.  What does that mean?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Well, you have to think about it.  It means, for instance, that if Tom Seaver had 50 extra strikeouts and 20 extra walks than pitcher X, his April winning percentage will be .041 worse, all things being equal, than X.  But ... it takes a while, and you have to do it in your head.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;But wait!  Not every pitcher in the study had the same number of innings pitched.  So Tom Seaver's 50 extra strikeouts, do I have to convert that to a fixed number of innings?  What did the study use?  Now I have to go back and check.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;And how do I interpret the results?  Am I sure they really apply to all pitchers?  I mean, suppose there's a pitcher like Seaver, who strikes out a lot of guys, but has better control, so he walks 30 fewer guys.  Should I really assume that he'll do better in April than Seaver, since he's "less" of a power pitcher in that dimension?  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Also, wait a sec.  The regression included season W-L record, but that *includes* the April W-L record that it was trying to predict.  That will throw off the results, won't it?  Maybe the regression should have used only May to October.  Or maybe it did?  Now I have to go check that.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;And what if it didn't, and there were a bunch of pitchers that pitched only one game after April?  Will that throw off the results?  If the confidence intervals suspect, are the coefficients suspect too?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;I could go on ... there are a thousand issues that affect the interpretation of the regression's results.  And it's impossible for any one person, even the best statistician in the world, to hold all thousand in his head at once.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;You could now come back with another argument: "OK, the regression is harder to interpret, but we can just do that interpretation.  Indeed, that's the duty of the researcher.  When you write up a study, your duty isn't just to do a regression and report the results.  It's to figure out what the results mean, and check that everything's been done right.  Also, that's what the peer reviewers are for, and the journal editors."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;To which the most obvious reply is: it's simply not true that peer reviewing makes sure everything is correct.  There are loads and loads of problems with actual interpretation of regressions, some of which I've written about here.  There was the paper that forgot to hold everything constant when interpreting a coefficient.  There was the paper that decided that X picks ahead was worth zero, but drafting 2X picks ahead was significant.  There was the paper that decided that a low "coefficient of determination" meant that the effect was insignificant, regardless of what the coefficient showed.  And so on.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;And those are the easy ones.  I mean, sure, I'm no sophisticated peer reviewer with a Ph.D. who looks through a hundred of these a month, but I do have a decent basic idea of how regressions work and how baseball works.  But, for some of these regressions, it took me a long time, measured in hours, to figure out what they actually were doing and (in many cases) why the results didn't really mean what the author thought they meant.  It's not that you need the right kind of expertise ... it's that every case is different, and there's no formula.  To figure out what a result actually means, you have to look at everything: where the data comes from, how it interacts, what is really being measured, what the coefficients mean, and, especially, if the model is realistic and if other models give different results.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;As I've said before, regression is easy.  Interpreting the regression is hard -- legitimately hard.  And, unlike other hard problems, you don't know when or whether you've found the right answer.  You can spend days looking at it, and you might still be missing something.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Which, of course, means that the simple method is more likely to be correct than the complicated method: if we understand a study, we're much more likely to spot its flaws.  If Bill James did something dumb in how he compiled his data, most of us will catch it.  If Joe Regressor does something wrong in his complicated regression, it's likely that nobody will see it (unless it's in a hard science, in which case a plane will crash or a patient will die or something).&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;And, of course, there's still the advantage that the simple study is easier to understand.  That's an advantage even if the regression study is absolutely 100% correct.  If you can see the answer in four paragraphs of arithmetic, that's better than if it takes ten pages of regression notation.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Which advantage is more important?  Actually, it looks like the first one is more important ... but, you know, the second one can make a strong case. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;That Bill James study that I mentioned earlier ... if you have a copy of the 1986 Abstract handy, you should go read it.  It's on page 134.  It's only two pages long, and easy reading, like most of Bill's prose.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;If you've read it, I'd say that, right now, you KNOW the answer to Bill's question.  I don't mean you know 100% that he's right, and what the answer is ... I'm saying that you know, almost 100%, the evidence and the logic.  You may or may not think the results are conclusive -- for instance, I'd like to see a larger sample size -- but, either way, it's your own decision, not Bill's.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;That is: even though Bill did the study, you instantly absorb the answer.  You don't have to trust Bill about it.  Well, you have to trust that he aggregated the data properly, and didn't cheat in what pitchers he chose.  But you don't have to trust Bill's judgment, or Bill's knowledge of baseball, or Bill's interpretation.  His interpretation will become yours: you'll see what he did, and understand it well enough that if someone challenges it, you can defend it.  Bill's study is so transparent, that, after you read it, you understand the answer as well as Bill did, probably about as well as anyone can.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;That's not true for a regression study.  Often, for many readers (myself included), the explanation is impenetrable.  The references for the methods refer you to textbooks that are hard to find and technical, and, usually, there's no discussion of what the results mean, other than just a surface reading of the coefficients.  If you want to truly understand what's going on, you have to read the study, and read critically.  Then you have to read it again, filling in the missing pieces.  Then you have to look at the tables, and back to the text, then to the model.  And then you still probably have to read it again.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;And all this is assuming you already know something about how regression works.  If you don't, you'll just have no idea.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;So the difference is: if the study is simple, you know what it means.  If the study is complicated, you don't know anything.  You have to trust the person that wrote the study.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;It's night and day: knowing versus not knowing.  With the Bill James study, you &lt;span style="font-weight: bold;"&gt;know &lt;/span&gt;it's true.  With a regression study, you just &lt;span style="font-weight: bold;"&gt;believe &lt;/span&gt;it's true.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;To go a bit off topic ...&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;A lot of the factual things we think we know, and will passionately defend, we don't really know, except from what other people tell us.  I bet everyone reading this has a strong opinion on creation vs. evolution, or whether global warming is real or a hoax, or whether 9/11 was or was not partly an inside job by the US government.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Take evolution.  Let's suppose you believe in evolution, and you think creationism is not true, just wishful thinking by creationists.  I think most of my friends fall into this category, and I've read surveys that say most Americans generally believe this.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;But if that's you, can you really say that you KNOW it?  You don't, probably.  I bet you that for most of you (myself included), if someone asked you for examples of actual evidence for evolution, we'd have nothing.  Seriously, zero.  I couldn't even give a half-hearted attempt at a single sentence of why and how we know that evolution actually happened.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Sure, I believe evolution happened, but not because of any actual evidence.  I believe it for secondhand reasons.  I believe it happened because I know that scientists, serious researchers, have looked at the evidence, and that it's strong evidence.  What I *do* think I know, from dealing (at arm's length) with teachers and authors and scientists and journalists, is that it's very, very unlikely that the worldwide supply of scientists, working independently and jockeying for discoveries and status and publications, and being so steeped in scientific method, and competing in an open marketplace of ideas, could have deluded themselves into misinterpreting so much evidence over so many decades.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;So, you know, I can't say I know evolution is true.  But now I DO know there is strong evidence that power pitchers do not outperform in April.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;That's the beauty of the Bill James-type study, the one that lays it all out for you without using fancy techniques.  It gives you a completely different feeling, the feeling that you actually know something, instead of just believing it from hearsay.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;And that, to me, is an important part of what science is all about.&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-4103303789816415634?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/4103303789816415634/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=4103303789816415634' title='9 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/4103303789816415634'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/4103303789816415634'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/12/transparent-studies-are-better-even-if.html' title='Transparent studies are better, even if they&apos;re less rigorous'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>9</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-2120990259261362527</id><published>2011-11-30T10:46:00.011-05:00</published><updated>2011-11-30T11:07:52.086-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='statistics'/><category scheme='http://www.blogger.com/atom/ns#' term='academics'/><category scheme='http://www.blogger.com/atom/ns#' term='bayes'/><title type='text'>Why it's hard to estimate small effects</title><content type='html'>&lt;span style="font-family:verdana;"&gt;Here's a great 2009 paper (&lt;a href="http://www.stat.columbia.edu/~gelman/research/published/power4r.pdf"&gt;.pdf&lt;/a&gt;) by Andrew Gelman and David Weakliem (whom I'll call "G/W"), on the difficulty of finding small effects in a research study.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I'm translating to baseball to start.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Let's suppose you have someone who claims to be a clutch hitter.  He's a .300 hitter, but, with runners on base, he claims to be a bit better.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, you say, show us!  You watch his 2012 season, and see how well he hits in the clutch.  You decide in advance that if it's statistically significantly different from .300, that will be evidence he's a clutch hitter.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Will that work?  No, it won't.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Over 100 AB, the standard deviation of batting average is about 46 points.  To find statistical significance, you want 2 SD.  That means to convince you, the player would have to hit .392 in the clutch.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The problem is, he's not a .392 hitter!  He, himself, is only claiming to be a little bit better than .300.  So, in your study, the only evidence you're willing to allow, is evidence that you *know* can't be taken at face value.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Let's say the batter actually does meet your requirement.  In fact, let's suppose he exceeds it, and hits .420.  What can you conclude?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Well, suppose you didn't know in advance that you were looking for small effect.  Suppose you were just doing a "normal" paper.  You'd say, "look, he beat his expectation by 2.6 SD, which is statistically significant.  Therefore, we conclude he's a clutch hitter."  And then you write a "conclusions" section with all the implications of having a .420 clutch hitter in your lineup.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, in this case, that would be wrong, because you KNOW he's not a .420 clutch hitter, even though that's what he hit and you found statistical significance.  He's .310 at best, maybe .320, if you stretch it.  You KNOW that the .420 was mostly due to luck.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Still ... even if you can't conclude that the guy is truly a .420 clutch hitter, you SHOULD be able to at least conclude that he's better than .300 right?  Because you did get that statistical significance.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Well ... not really, I don't think.  Because, the same evidence that purports to show he's not a .300 hitter ALSO shows he's not a .320 hitter!  That is, .420 is also more than 2 standard deviations from .320, which is the best he possibly could be.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;What you CAN do, perhaps, is compare the two discrepancies.  .420 is 2.6 SDs from .300, but only 2.2 SDs from .320.  That does appear to make .320 more likely than .300.  In fact, the probability of a .320 hitter going 42-for-100 is almost five times as high as the probability of the .300 hitter going 42-for-100.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, first, that's only 5 in 6.  Second, that ignores the fact that there are a lot more .300 hitters than .320 hitters, which you have to take into account.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, all things considered, you should know in advance that you won't be able to  conclude much from this study.  The sample size is too small.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That's Gelman and Weakliem's point: if you're looking for a very small effect, and you don't have much data, you're ALWAYS going to have this problem.  If you're looking for the difference between .300 and .320, that's a difference of 20 points.  If the standard error of your experiment is a lot more than 20 points ... how are you ever going to prove anything?  Your instrument is just too blunt.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In our example, the standard error is 46 points.  To find statistical significance, you'd have to observe an effect of at least 92 points!  And so, if you're pretty sure clutch hitting talent is less than 92 points, why do the experiment at all?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But what if you don't know if clutch hitting talent is less than 92 points?  Well, fine.  But you're still never going to find an effect less than 92 points.  And so, your experiment is biased, in a way: it's set up to only find effects of 92 points or more.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That means that if the effect is small, no matter how many scientists you have independently searching for it, they'll never find it.  Moreover, they will frequently find a LARGE effect. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;No matter what happens, the experiment will either be wrong too high, or wrong too low.  It is impossible for it to be accurate for a small effect.  The only way to find a small effect is to increase the sample size.  But even then, that doesn't eliminate the problem: it just reduces it.  No matter what your experiment, and how big your sample size, if the effect your looking for is smaller than 2 SDs, you'll never find it.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That's G/W's criticism.  It's a good one.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;G/W's example, of course, is not about clutch hitting.  It's about a previously-published paper, which found that good-looking people are more likely to produce female offspring than male offspring.  That study found an 8 percentage point difference between the nicest-looking parents and the worst-looking parents -- 52 percent girls vs. 44 percent girls.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;And what G/W are saying is, that 8 point difference is HUGE.  How do they know?  Well, it's huge as compared to a wide range of other results in the field.  Based on the history of studies on birth sex bias, two or three points is about the limit.  Eight points, on the other hand, is unheard of.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Therefore, they argue, this study suffers from the "can't find the real effect" problem.  The standard error of the study was over 4 points.  How can you find an effect of less than 3 points, if your standard error is 4 points?  Any reasonable confidence interval will cover so much of the plausible territory, that you can't really conclude anything at all.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Gelman and Weakliem don't say so explicitly, but this is a Bayesian argument.  In order to make it, you have to argue that the plausible effect is small, compared to the standard error.  How do you know the plausible effect is small?  Because of your subject matter expertise.  In Bayesian terms, you know, from your prior, that the effect is most likely in the 0-3 range, so any study that can only find an 8-point difference must be biased.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Every study has its own limits of how the standard error compares to the expected "small" effect.  You need to know what "small" is.  If a clutch hitting study was only accurate to within .0000001 points of batting average ... well, that would be just fine, because we know, from prior experience, that a clutch effect of .0000002 is relatively plausible.  On the other hand, if it's only accurate to within .046, that's too big -- because a clutch effect of .092 is much too large to be plausible.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;It's our prior that tells us that.  As &lt;a href="http://sabermetricresearch.blogspot.com/2011/11/research-study-is-just-peer-reviewed.html"&gt;I've&lt;/a&gt; &lt;a href="http://sabermetricresearch.blogspot.com/2011/11/research-conclusions-have-to-be.html"&gt;argued&lt;/a&gt;, interpreting the conclusions of your study is an informal Bayesian process.  G/W's paper is one example of how that kind of argument works.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Hat tip: &lt;a href="http://marginalrevolution.com/marginalrevolution/2011/11/small-samples-mean-statistically-significant-results-should-usually-be-ignored.html"&gt;Alex Tabarrok at Marginal Revolution&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-2120990259261362527?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/2120990259261362527/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=2120990259261362527' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/2120990259261362527'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/2120990259261362527'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/11/why-its-hard-to-estimate-small-effects.html' title='Why it&apos;s hard to estimate small effects'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-2165700904894073630</id><published>2011-11-28T18:48:00.013-05:00</published><updated>2011-11-28T19:21:17.172-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='statistics'/><category scheme='http://www.blogger.com/atom/ns#' term='academics'/><category scheme='http://www.blogger.com/atom/ns#' term='bayes'/><title type='text'>Why p-value isn't enough, reiterated</title><content type='html'>&lt;span style="font-weight: bold;font-family:verdana;"&gt;Question 1:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;People are routinely tested for disease X, which 1 in 1000 people have overall. It is known that if the person has the disease, the test is correct 99% of the time. If the person does not have the disease, the test is also correct 99% of the time. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;A patient goes to his doctor for the test. It comes out positive.&lt;br /&gt;&lt;br /&gt;What is the probability that the patient has the disease?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:verdana;"&gt;Question 2:&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-family:verdana;"&gt;Researchers routinely run studies to test unexpected hypotheses (such as: can outside prayer help cure disease?), of which 1 in 1000 tend to be true overall. It is known that if a hypothesis is true, a study correctly finds statistical significance 99% of the time. If the hypothesis is false, the study correctly finds NO statistical significance 99% of the time.&lt;br /&gt;&lt;br /&gt;A researcher tests one such unexpected hypothesis.  He finds statistical significance.&lt;br /&gt;&lt;br /&gt;What is the probability that the hypothesis is true?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Hat Tip: Inspired by Jeremy's last paragraph of comment #25, &lt;a href="http://www.insidethebook.com/ee/index.php/site/comments/research_conclusions_have_to_be_bayesian/#25"&gt;here&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;--------&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size:85%;"&gt;P.S.  Answer to question 1 (very slightly modified question, but the same answer) at my previous post, &lt;a href="http://sabermetricresearch.blogspot.com/2011/11/research-conclusions-have-to-be.html"&gt;here&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-2165700904894073630?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/2165700904894073630/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=2165700904894073630' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/2165700904894073630'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/2165700904894073630'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/11/why-p-value-isnt-enough-reiterated.html' title='Why p-value isn&apos;t enough, reiterated'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-779155163614889873</id><published>2011-11-23T10:10:00.009-05:00</published><updated>2011-11-23T12:05:56.448-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='statistics'/><category scheme='http://www.blogger.com/atom/ns#' term='academics'/><category scheme='http://www.blogger.com/atom/ns#' term='bayes'/><title type='text'>Research conclusions *have* to be bayesian</title><content type='html'>&lt;span style="font-family:verdana;"&gt;The &lt;a href="http://sabermetricresearch.blogspot.com/2011/11/research-study-is-just-peer-reviewed.html"&gt;last&lt;/a&gt; &lt;a href="http://sabermetricresearch.blogspot.com/2011/11/statisticians-can-prove-almost-anything.html"&gt;couple&lt;/a&gt; of posts here have been about interpreting the results of statistical studies.  I argued that the statistical method itself might be just fine, but the *interpretation* of what it means, the conclusions you draw about real life, require an argument.  That is, you can get the regression right, but the conclusions wrong, because the conclusions call for argument and judgment.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Or, as some commenters have substituted, "intuition" and "subjectivity".  Those are negative things, in academic circles.  Objectivity is the ideal, and the idea that the reliability of a work of scholarship depends on a subjective evaluation of the author's judgment doesn't seem to be something that people like.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, I think it absolutely has to follow.  If you find a connection between A and B, how do you know if it's A that causes B, or B that causes A, or if it's all just random?  That's something no statistical analysis can tell you.  By definition, it calls for judgment, doesn't it?  At least a little bit.  Recall the &lt;a href="http://sabermetricresearch.blogspot.com/2011/11/statisticians-can-prove-almost-anything.html"&gt;recent&lt;/a&gt; (contrived) study that showed that listening to kids' music is linked to being physically older.  Nobody would conclude that the music MAKES you older, right?  But that's not a result of the statistical analysis -- it's a judgment based on outside knowledge.  An easy, obvious judgment, but a judgment nonetheless.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;It occurred to me that this judgment, that takes you from regression results to conclusions, is really an informal Bayesian inference.  I don't think this is a particularly novel insight, but it helps to make the issue clearer.  My argument is this: first, even if you do a completely normal, ("frequentist") experiment, the step from the results to the conclusions HAS to be Bayesian.    And, more importantly, because Bayesian techniques sometimes require judgment, and are therefore not completely objective, the convention has been to avoid such judgment in academic papers.  Therefore, these studies have locked themselves in to a situation in which they have to suspend judgment, and use strict rules, which sometimes lead to wrong -- or seemingly absurd -- answers.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;OK, let me start by explaining Bayesianism, as I understand it, first intuitively, then in a baseball context.  As always, real statisticians should correct me where I got it wrong.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;----------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Generally, Bayesian is a process by which you refine your probability estimate.  You start out with whatever evidence you have which leads you to a "prior" estimate for how things are.  Then, you get more evidence.  You add that to the pile, and refine your estimate by combining the evidence.  That gives you a new, "posterior" estimate for how things are.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;You're a juror at a trial.  At the beginning of the trial, you have no idea whether the guy is guilty or not.  You might think it's 50/50 -- not necessarily explicitly, but just intuitively.  Then, a witness comes up that says he saw the crime happen, and he's "pretty sure" this is the guy.  Combining that with the 50/50, you might now think it's 80/20.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Then, the defense calls the guy's boss, who said he was at work when the crime happened.  Hmmm, you say, that sounds like he couldn't have done it.  But there's still the eyewitness.  Maybe, then, it's now 40/60.&lt;br /&gt;&lt;br /&gt;And so on, as the other evidence unfolds.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That's how Bayesian works.  You start out with your "prior" estimate, based on all the evidence to date: 50/50.  Then, you see some new evidence: there's an eyewitness, but the boss provides an alibi.  You combine that new evidence with the prior, and you adjust your estimate accordingly.  So your new best estimate, your "posterior," is now 40/60.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;---------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That's an intuitive example, but there is a formal mathematical way this works.  There's one famous example, which goes like this:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-family:verdana;"&gt;People are routinely tested for disease X, which 1 in 1000 people have overall. It is known that if the person has the disease, the test is correct 100% of the time.  If the person does not have the disease, the test is correct 99% of the time.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;A patient goes to his doctor for the test.  It comes out positive.  What is the probability that the patient has the disease?&lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;If you've never seen this problem before, you might think the chance is pretty high.  After all, the test is correct at least 99% of the time!  But that's not right, because you're ignoring all the "prior" evidence, which is that only 1 in 1000 people have the disease to begin with.  Therefore, there's still a strong chance that the test is a false positive, despite the 99 percent accuracy.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The answer turns out to be about 1 in 11.  The (non-rigorous) explanation goes like this:  1000 people see the doctor.  One has the disease and tests positive.  Of the other 999 who don't have the disease, about 10 test positive.  So the positive tests comprise 10 people who don't have the disease, and 1 person who does.  So the chance of having the disease is 1 in 11.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Phrasing the answer in terms of Bayesian analysis: The "prior" estimate, before the evidence of the test, is 0.001 (1 in 1000).  The new evidence, though, is very significant, which means it changes things a fair bit.  So, when we combine the new evidence with the prior, we get a "posterior" of 0.091 (1 in 11).  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;If that still seems counterintuitive to you, think of it this way: if the test is 99% positive, that's 1 in 100 that it's wrong.  That's low odds, which makes you think the test is probably right!  But ... the original chance of having the disease is only 1 in 1000.  Those are even worse odds.  The prior of 1/1000 competes with the new evidence of 1/100.  Because the new number (test being wrong) is more likely than the old number (no disease), the odds are skewed to the test being wrong: odds of 10:1 that the test is wrong, compared to the patient having the disease.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Another way to put it: the less likely the disease was to start with, the more evidence you need to overcome those low odds.  1/100 isn't enough to completely overcome 1/1000.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;(Perhaps you can see where this will be going, which is: if a research study's hypothesis is extremely unlikely in the first place, even a .01 significance level shouldn't be enough to overcome your skepticism.  But I'm getting ahead of myself here.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;---------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Let's do an oversimplified baseball example.  At the beginning of the 2011 baseball season, you (unrealistically) know there's a 50% chance that Albert Pujols' batting average talent will be .300 for the season, and a 50% chance that his batting average talent will be .330.  Then, in April, he goes 26 for 106 (.245).  What is your revised estimate of the chance that he's actually a .300 hitter?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;You start with your "prior" -- a 50% chance he's a .300 hitter.  Then, you add the new evidence: 26 for 106.  Doing some calculation, you get your "posterior."  I won't do the math here, but if I've got it right, the answer is that now the chance is 80% that Pujols is actually a .300 hitter and not a .330 hitter. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That should be in line with your intuition.  Before, you thought there was a good chance he was a .330 hitter.  After, you think there's still a chance, but less of a chance.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;We started thinking Pujols was still awesome.  Then he hit .245 in April.  We thought, "Geez, he probably isn't really a .245 hitter, because we have many years of prior evidence that he's great!  But, on the other hand, maybe there's something wrong, because he just hit .245.  Or maybe it's just luck, but still ... he's probably not as good as I thought."  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That's how Bayesian thinking works.  We start with an estimate based on previous evidence, and we update that estimate based on the new evidence we add to the pile.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Now, for the good part, where we talk about academic regression studies. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;You want to figure out whether using product X causes cancer.  You do a study, and you find statistical significance at p=0.02, and the coefficient says that using product X is linked with a 1% increased chance of cancer.  You are excited about your new discovery.  What do you put in the "conclusions" section of your paper?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Well, maybe you say "this study has found evidence consistent with X causing cancer."  But that isn't helpful, is it?  I mean, you also found evidence that's consistent with X *not* causing cancer -- because, after all, it could have just been random luck.  (A significance level of .02 would happen by chance 1 out of 50 times.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Can you say, "this is strong evidence that X causes cancer?"  Well, if you do, it's subjective.  "Strong" is an imprecise, subjective word.  And what makes the evidence "strong"?  You better have a good argument about why it's strong and not weak, or moderate.  The .02 isn't enough.  As we saw in the disease example, a positive test -- which is equivalent to a significance level of .01, since a positive test happens only 1 in 100 times -- was absolutely NOT strong evidence of having the disease.  (It meant only a 1 in 11 chance.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Similarly, you can't say "weak" evidence, because how do you know?  You can't say anything, can you?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;It turns out that ANY conclusion about what this study means in real life has to be Bayesian, based not just on the result, but on your PRIOR information about the link between cancer and X.  There is no conclusion you can draw otherwise.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Why? Well, it's because the study has it backwards.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;What we want to know is, "assuming the data came up the way it did, what is the chance that X causes cancer?"  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But the study only tells us the converse: "assuming X does not cause cancer, what is the chance that the data would come up the way it did?"&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The p=0.02 is the answer to the second question only.  It is NOT the answer to the first question, which is what we really want to know.  There is a step of logic required to go from the second question to the first question.  In fact, &lt;a href="http://en.wikipedia.org/wiki/Bayes%27_theorem"&gt;Bayes' Theorem&lt;/a&gt; gives us the equation for finding the answer to the first question given the second.  That equation requires us to know the prior.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;What the study is asking is, "given that we got p=0.02 in this experiment, what's the chance that X causes cancer?"  Bayes' Theorem tells us the question is unanswerable.  All we can answer is, "given that we got p=0.02 in this experiment, what is the chance that X causes cancer, &lt;b&gt;given our prior estimate before this experiment&lt;/b&gt;?"&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That is: you CANNOT make a conclusion about the likelihood of "X causes cancer" after the experiment, unless you had a reliable estimate of the likelihood of "X causes cancer" BEFORE the experiment.  (In &lt;a href="http://en.wikipedia.org/wiki/Bayes%27_theorem#Simple_form"&gt;mathematical terms&lt;/a&gt;, to calculate P(A|B) from P(B|A), you need to know p(B) and p(A).)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Does this sound wrong?  Do you think you can get a good intuitive estimate just from this experiment alone?  Do you feel like the .02 we got is enough to be convincing?  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Well, then, let me ask you this: what's your answer?  What do you think the chance is that X causes cancer?  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;If you don't agree with me that there's no answer, then figure out what you think the answer is.  &lt;/span&gt;&lt;span style="font-family:verdana;"&gt;You may assume the experiment is perfectly designed, the  sample size is adequate, and so on.  &lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-family:verdana;"&gt;If you don't have a number -- you  probably don't -- think of a description, at least.  Like, "X probably  causes cancer."  Or, "I doubt that X causes cancer."  Or, "by the  precautionary principle, I think everyone should avoid X."  Or, "I don't  know, but I'd sure keep my kids away from X until there's evidence that it's safe!"  &lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;br /&gt;Go ahead.  I'll leave some white space for you.  Get a good intuitive idea of what your answer is.&lt;/span&gt;&lt;span style="font-family:verdana;"&gt;  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;(&lt;a href="http://www.youtube.com/watch?v=vXGhvoekY44"&gt;Link to Jeopardy music while you think&lt;/a&gt; (optional)) &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;OK.  Now, I'm going to tell you: product X is a bible.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Does that change your mind?  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;It should.  Your conclusion about the dangers of X should absolutely depend on what X is -- more specifically, what you knew about X before.  That is, your PRIOR.  Your prior, I hope, had a probability of close to 0% that a Bible can cause cancer.  That's not just a wild-ass intuition.  There are very good, rational, objective reasons to believe it.  Indeed, there is no evidence that the information content of a book can cause cancer, and there is no evidence or logic that would lead you to believe that bibles are more carcinogenic than, say, copies of the 1983 Bill James Baseball Abstract.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Call this "intuition" or "subjectivity" if you want.  But if you decide not to use your own subjective judgment, what are you going to do?  Are you going to argue that bibles cause cancer just to avoid having to take a stand?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I suppose you can stop at saying, "this study shows a statistically significant relationship between bible use and cancer."  That's objectively true, but not very useful.  Because the whole point of the study is: do bibles cause cancer?  What good is the study if you can't apply the evidence to the question?  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;You could do the Bayesian approach thing more formally.  That's what researchers usually mean when they talk about "Bayesian methods" -- they mean formal statistical algorithms.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;To do a Bayesian analysis, you need a prior.  You could just arbitrarily take something you think is reasonable.  "Well, we don't believe there's much of a chance bibles cause cancer, so we're going to assume a prior 99.9999% probability that there's no effect, and we'll split up the last remaining .0001 in a range between -2% and +2%."  Now, you do the study, and recalculate your posterior distribution, to see if you now have enough evidence to conclude there's a danger.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;If you did that, you'd find that your posterior distribution -- your conclusion -- was that the probability of no effect went down, but only from 99.9999% to 99.995%, or something.  That would make your conclusion easy: "the evidence should increase our worry that bibles cause cancer, but only from 1 in a million to 1 in 20,000."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, that Bayesian technique is not really welcome in academic studies.  Why?  Because that prior distribution is SUBJECTIVE.  The author can choose any distribution he wants, really.  I chose 99.9999%, but why not 99.99999% (which is probably more realistic)?  The rule is that academic papers are required to be objective.  If you allow the author to choose any prior he wants, based on his own intuition or judgment, then, first, the paper is no longer objective, and second, there is the fear that the author could get any conclusion he wanted just by choosing the appropriate prior.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So papers don't want to assume a prior.  So instead of arguing about the chance the effect is real, the paper just assumes it's real, and takes it at face value.  If X appears to increase cancer by 1%, and it's statistically significant, then the conclusion will assume that X actually *does* increase cancer by 1%.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That sounds like it's not Bayesian.  But, in a sense, it is.  It's exactly the result you'd get from a Bayesian analysis with a prior that assumes every result is equally likely.  Yes, it's objective, because you're always using the same prior.  But it's the *wrong* prior.  You're using a fixed assumption, instead of the best assumption you can, just because the best assumption is a matter of discretion.  You're saying, "Look, I don't want to make any subjective assumptions, because then I'm not an objective scientist.  So I'm going to assume that bibles are just as likely to cause 1% more cancers as they are to cause 0% more cancers." &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That's obviously silly in the bible case, and, when it's that obvious, it looks "objective" enough that the study can acknowledge it.  But most of the time, it's not obvious.  In those cases, the studies will just take their results at face value, *as if theirs is the only evidence*.  That way, they don't have to decide if their result is plausible or not, in terms of real-life considerations.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Suppose you have two baseball studies.  One says that certain batters can hit .375 when the pitcher throws lots of curve balls.  Another says that batters gain 100 points on their batting average after the manager yells at them in the dugout.  Both studies find exactly the same size effect, with exactly the same significance level of, say, .04.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Of the two conclusions, which one is more likely to be true?  The curve ball study, of course.  We know that some batters hit curve balls better than others, and we know some batters hit well over .300 in talent.  It's fairly plausible that someone might actually have .375 talent against curve balls.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But the "manager yells at them" study?  No way.  We have a strong reason to believe it's very, very unlikely that batters would improve by that much just because they were yelled at.  We have clutch hitting studies, that barely find an effect even when the game is on the line.  We have lots of other studies that, even when they do find an effect, like platooning, find it to be much, much less than 100 points.  Our prior for the "manager yelling is worth 100 points" hypothesis is so low that a .04 will barely move it.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Still ... I guarantee you that if these two studies were published, the two "conclusions" sections would not give the reader any indication of the relative real-life likelihood of the conclusions being correct, except by reference to the .04.  In their desire to be objective, the two studies would not only fail to give shadings of their hypotheses' overall plausibility, but they'd probably just treat both conclusions as if they were almost certainly true.  That's the general academic standard: if you have statistical significance, you're entitled to just go ahead and assume the null hypothesis is false.  To do anything else would be "subjective."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But while that eliminates subjectivity, it also eliminates truth, doesn't it?  What you're doing, when you use a significance level instead of an argument, is that you're choosing what's most objective, instead of what's most likely to be right.  You're saying, "I refuse to make a judgment, and so I'm going to go by rote and not consider that I might be wrong."  That's something that sounds silly in all other aspects of life.  Doesn't it also sound silly here?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, am I arguing that academics need to start doing explicit Bayesian analysis, with formal mathematical priors?  No, absolutely not.  I disagree with that approach for the same reasons other critics do: it's too subjective, and too subject to manipulation.  As opponents argue, how do you know you have the right prior?  And how can you trust the conclusions if you don't?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, that's why I actually prefer the informal, "casual Bayesian" approach, where you use common sense and make an informal argument.  You take everything you already know about the subject -- which is your prior -- and discuss it informally.  Then, you add the new evidence from your study.  Then, finally, you conclude about your evaluation of the real-life implications of what you found.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;You say, "Well, the study found that reading the bible is associated with a 1% increase in cancer.  But, that just sounds so implausible, based on our existing [prior] knowledge of how cancer works, that it would be silly to believe it."  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Or, you say, "Yes, the study found that batters hit 100 points better after being yelled at by their manager.  But, if that were true, it would be very surprising, given the hundreds of other [prior] studies that never found any kind of psychological effect even 1/20 that big.  So, take it with a grain of salt, and wait for more studies."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Or, you say, "We found that using this new artificial sweetener is linked to one extra case of cancer per 1,000,000 users.  That's not much different from what was found in [prior] studies with chemicals in the same family.  So, we think there's a good chance the effect is real, and advise caution until other evidence makes the answer clearer."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That's what I meant, two posts ago, where I said "you have to make an argument."  If you want to go from "I found a statistically significant 4% connection between cancer and X,"  to "There is a good chance X causes cancer," you can't do that, logically or mathematically, without a prior.  The p value is NEVER enough information. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The argument is where you informally think about your prior, even if you don't use that word explicitly.  The argument is where you say that it's implausible that bibles cause cancer, but more plausible that artificial sweeteners cause cancer.  It's where you say that it's implausible that songs make you older, but not that the effect is just random.  It's where you say that there's so much existing evidence that triples are a good thing, that the fact that this one correlation is negative is not enough to change your mind about that, and there must be some other explanation.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;You always, always, have to make that argument.  If you disagree, fine. But don't blame me.  Blame Bayes' Theorem.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-779155163614889873?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/779155163614889873/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=779155163614889873' title='17 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/779155163614889873'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/779155163614889873'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/11/research-conclusions-have-to-be.html' title='Research conclusions *have* to be bayesian'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>17</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-1107037309418926255</id><published>2011-11-21T10:43:00.003-05:00</published><updated>2011-11-21T10:47:46.003-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='statistics'/><category scheme='http://www.blogger.com/atom/ns#' term='academics'/><title type='text'>"Statisticians can prove almost anything"</title><content type='html'>&lt;span style="font-family:verdana;"&gt;Sometimes, when you look for statistical significance, you'll find it even if the effect isn't real -- in other words, a false positive.  With a 5% significance level, you'll find that one out of 20 times.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;However, experimenters don't do just one analysis one time.  They'll try a bunch of different variables, and a bunch of different datasets.  If they try enough things, they have a much better than 5% chance of coming up with a positive.  How much better?  Well, there's no real way to tell, since the tests aren't independent (adding one dependent variable to a regression isn't really a whole new regression).  But, intuitively: if, by coincidence, your first experiment winds up at (say) p=0.15, it seems like it should be possible to get it down to 0.05 if you try a few things.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That's exactly what Joseph P. Simmons, Leif D. Nelson, and Uri Simonsohn did in a new &lt;a href="http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1850704"&gt;academic paper&lt;/a&gt; (&lt;a href="http://news.nationalpost.com/2011/11/20/statisticians-can-prove-almost-anything-a-new-study-finds/"&gt;reported on&lt;/a&gt; in today's National Post).  They wanted to prove the hypothesis that listening to children's music makes you older.  (Not makes you *feel* older, but actually makes your date of birth earlier.)  Obviously, that hypothesis is false. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Still, the authors managed to find statistical significance.  It turned out that subjects who were randomly selected to listen to "When I'm Sixty Four" had an average (adjusted) age of 20.1 years, but those who listened to the children's song "Kalimba" had an adjusted age of 21.5 years.  That was significant at p=.04.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;How?  Well, they gave the subjects three songs to listen to, but only put two in the regression.  They asked the subjects 12 questions, but used only one in the regression.  And, they kept testing subjects 10 at a time until they got significance, then stopped.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In other words, they tried a large number of permutations, but only reported the one that led to statistical significance.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;One thing I found interesting was that one variable -- father's age -- made the biggest difference, dropping the p-value from .33 to .04.  That makes sense, because father's age is very much related to subject's age.  If you father is 40, you're unlikely to be 35.  You could actually make a case that father's age *helps* the logic, not hurts it, even though it was arbitrarily selected because it gave the desired result.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In this case, all the permutations meant that statistical significance was extremely likely.  Suppose that, before any regressions, the two groups had about the same age.  Then, you start adjusting for things, one at a time.  What you're looking for is a significant difference in that one respect.  The chances of that are 5%.  But, the things the researchers adjusted for are independent: how much they would enjoy eating at a diner, their political orientation, which of four Canadian quarterbacks believed they won an award ... and so on.  With ten independent thingies, the chance at least one would be significant is about 0.4.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Add to that the possibility of continuing the experiment until significance was found, and the possibility of combining factors, and you're well over 0.5.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Plus, if the researchers hadn't found significance, they would have kept adjusting the experiment until they did!&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The authors make recommendations for how to avoid this problem.  They say that researchers should be forced to decide in advance, when to stop collecting data.  And they should be forced to list all variables and all conditions, allowing the referees and the readers to see all the "failed" options.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;These are all good things.  Another thing that I might add is: you have to repeat the *exact same study* with a second dataset.  If the result was the result of manipulation, you'll have only a 5% chance of having it stand up to an exact replication.  This might create more false negatives, but I think it'd be worth it. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;One point I'd add is that this study reinforces my point, last post, that the interpretation of the study is just as important as the regression.  For one thing, looking at all the "failed" iterations of the study is necessary to decide how to describe the conclusions.  But, mostly, this study shows an extreme example of how you have to use insight to figure out what's going on.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Even if this study wasn't manipulated, the conclusion "listening to children's music makes you older" would be ludicrous.  But, the regression doesn't tell you that.  Only an intelligent analysis of the problem tells you that.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In this case, it's obvious, and you don't need much insight.  In other cases, it's more subtle.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Finally, let me take exception to the headline of the National Post article: "Statisticians can prove almost anything, a new study finds."  Boo!&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;First of all, the Post makes the same mistake I argued against last post: the statistics don't prove anything: the statistics *plus the argument* make the case.  Saying "statistics prove a hypothesis" is like saying "subtraction proves socialism works" or "the hammer built the birdhouse."  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Second, a psychologist who uses statistics should not be described as a statistician, any more than an insurance salesman should be described as an actuary.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Third, any statistician would tell you, in seconds, that if you allow yourself to try multiple attempts, the .05 goes out the window.  It's the sciences that have chosen to ignore that fact.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The true moral of the story, I'd argue, is that the traditional academic standard is wrong -- the standard that once you find statistical significance, you're entitled to conclude your effect is real.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;P.S.  If Uri Simonsohn's name looks familiar, it might be because he was one of the authors of the "&lt;a href="http://sabermetricresearch.blogspot.com/search?q=.463+gunning+.300"&gt;batters hit .463 when gunning for .300&lt;/a&gt;" study.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-1107037309418926255?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/1107037309418926255/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=1107037309418926255' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/1107037309418926255'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/1107037309418926255'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/11/statisticians-can-prove-almost-anything.html' title='&quot;Statisticians can prove almost anything&quot;'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-5045827626038734586</id><published>2011-11-18T09:25:00.014-05:00</published><updated>2011-11-18T11:29:29.278-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='scientific method'/><category scheme='http://www.blogger.com/atom/ns#' term='academics'/><category scheme='http://www.blogger.com/atom/ns#' term='regression'/><title type='text'>A research study is just a peer-reviewed argument</title><content type='html'>&lt;span style="font-family:verdana;"&gt;To make your case in court, you need two things: first, some evidence; and, second, an argument about what the evidence shows.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The same thing is true in sabermetrics, or any other science.  You have your data, and your analysis; that's the evidence.  Then you have an argument about what it means.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, most of the time, the "argument" part gets short shrift.  Pick up a typical academic paper, and you'll see that most of the pages are devoted to explaining a regression, and listing the results and the coefficients and the corrections and the tests.  Then, the author will just make an unstated assumption about what that means in real life, as if the regression has proven the case all by itself.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That's not right.  The regression is important, but it's just the gathering of the evidence.  You still have to look at that evidence, and explain what you think it means.  You have to make an argument.  The regression, by itself, is not an argument.  The *interpretation* of the regression is the argument.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;For instance: suppose you do a simple regression on exercise and lifespan, and you get the result that every extra mile jogged is associated with an increased lifespan of, say, 10 minutes.  What does that mean in practical terms?  Probably, the researcher will say that if you want Americans' lifespan to increase by a day, we should consider getting each of them to jog 144 more miles than they would otherwise.  That would seem reasonable to most of us.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Suppose, now, another study looks at pro sports, and finds that every year spent as a starting MLB shortstop is associated with an extra $2 million in lifetime earnings.  Will the researcher now say that if we want everyone to earn an extra $2 million, we should expand MLB so that everyone in the USA can be a starting shortstop?  That would be silly.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Still another researcher does a regression to use triples to predict runs scored.  That one finds a &lt;a href="http://sabermetricresearch.blogspot.com/2009/10/dont-use-regression-to-calculate-linear.html"&gt;negative relationship&lt;/a&gt;.  Should the study conclude that teams stop trying to hit triples, that it's just hurting them?  Again, that would be the wrong conclusion.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;All three of these regressions have exactly the same structure.  The math is the same, the computer software is the same, the testing for heteroskedasticity is the same ... everything about the regressions themselves is the same.  The difference is in the *interpretation* of what the regressions mean.  The same interpretation, the same argument, makes sense in the first case, but is obviously ludicrous in the other two cases.  And even the third case is very different from the second case.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The regression is just data, just evidence.  It's the *interpretation* that's crucial, the argument about what that evidence means.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Why, then, do so many academic papers spend pages and pages on the details of the regression, but only a line or two justifying their conclusions?  I don't know for sure, but I'd guess it's because regression looks mathematical and scholarly and intellectual and high-status, while arguments sound subjective and imprecise and unscientific and low-status.&lt;br /&gt;&lt;br /&gt;Nonetheless, I think the academic world has it backwards.  Regressions are easy -- shove some numbers into a computer and see what comes out.  Interpretations -- especially correct interpretations -- are the hard part.&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;If you think my examples are silly because they're too obvious, here's a real-life example that's more subtle: the relationship between salary and wins in baseball, a topic that's been discussed quite a bit over the last few years.  If you do a &lt;a href="http://sabermetricresearch.blogspot.com/2010/09/does-mlb-payroll-matter-less-than-it.html"&gt;regression&lt;/a&gt; on 2009 data, you'll get that&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- the correlation coefficient is .48&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- the r-squared = .23&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- the value of the coefficient is .16 of a win per $1 million spent&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- the coefficient is statistically significant (as compared to the null hypothesis of zero).&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That's all evidence.  But, evidence of what?  So far, it's just numbers.  What do they actually *mean*, in terms of actual knowledge about baseball?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;To get from the raw numbers to a conclusion, you have to interpret what the regression says.  You have to make an argument.  You have to use logic and reason.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So you look the coefficient of .16.  From that, you can say, in 2009, every extra $6 million spent resulted, on average, in one extra win.  I'm happy calling that a "fact" -- it's exactly what the data shows.  But, almost anywhere you go from there now becomes interpretation.  What does that *mean*, that every extra $6 million resulted in an extra win?  What are the practical implications?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;For instance, suppose you're a GM and want to gain an extra win next year.  How much extra money do you have to spend on free agents?  If you want to convince me that you know the answer, you have to take the evidence provided by the regression, and *make an argument* for why you're right.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;A naive interpretation might be to just use that $6 million figure, and say, that's it!  Spend an extra $6 million, and get an extra win.  It seems obvious from the regression, but it would be wrong.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Why is it wrong?  It's wrong because there are other causes of winning than spending money on free agents.  There's also spending money on "slaves," and spending money on "arbs".  Those are much cheaper than free agents.  Effectively, some teams get wins almost for free, by having good young players.  The teams that don't have that have to spend double, as it were: they have to buy a free agent just to catch up to the team with the cheap guys, and then they have to buy another one to surpass him.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;For instance, team A has 80 wins for "free".  Team B has 70 wins for "free" and buys another 20 on the free-agent market.  The regression doesn't know free from not free.  It sees that team B has 10 more wins, but spent an extra $20X dollars, where X is the actual cost of a free agent per win.  Therefore, it spits out that it took 2X dollars to buy each extra win, even though it only took X.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That is: the coefficient of dollars per win from the regression is twice what it actually costs to buy one.  The coefficient doesn't measure what a naive researcher might think it does.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;My numbers are artificial, but I chose numbers that actually come fairly close to real life.  Various sabermetric studies have shown that a free agent win actually costs $4.5 million.  But regressions for 2008, 2009, and 2010 respectively show figures of $8.9, $6.2, and $12.6 million, respectively -- about twice as much.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Again, the issue is interpretation.  If you're just showing the regression results, and saying, "here, figure out what this means," then, fine.  But if your paper has a section called "discussion," or "conclusions," that means you're interpreting the results.  And that's the part where it's easy to go wrong, and where you have to be careful.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Which brings me, finally, to the point that I'm trying to make: we should stop treating academic studies as objective scientific findings, and start treating them as arguments.  Sure, we can remember that academic papers are written by experts, and peer reviewed, and that much of the time, there's no political slant behind them.  If we want, we can consider them as generally well-reasoned arguments by experts of presumably above-average judgment.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But they're still arguments.&lt;br /&gt;&lt;br /&gt;So when an interesting study is published, and the media report on it, they should treat it as an argument.  And we should hold it to the same standards of skepticism to which we hold other arguments. &lt;/span&gt;&lt;span style="font-family:verdana;"&gt;A research paper is like an extended op-ed.  The fact that there's math, and a review process, doesn't make them any less argument-like.  The New York Times wouldn't present Paul Krugman's column as fact just because he used regressions and peer review, would they?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I googled the phrase "a new study shows."  I got 55 million results.  "A new study claims" gives only 4 million.  "A new study argues" gives only 300,000.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, really, It should be the other way around.  New studies normally don't "show" anything but the regression results.  Their conclusions are always "claimed" or "argued".  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The word "show" should be used only when the writer wants to indicate that the claim is true, or that it has been widely accepted in the field. At the time his original Baseball Abstract came out, you'd have to say Bill James was "arguing" that the Pythagorean Projection is a good estimator of team wins.  But now that we know it's right, we say he "showed" it.&lt;br /&gt;&lt;br /&gt;"Show" implies that you accept the conclusion.  "Argue" or "claim" implies that you're not making a judgment.&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The interesting thing is that the media seem to understand this.  Sure, 90 percent of the time, they say "show".  But when they don't, it's for a reason.  The "claims" and "argues" are saved for controversial or frivolous cases, ones that the reporter doesn't want to imply are true.  For instance, "New study &lt;a href="http://ca.news.yahoo.com/blogs/dailybrew/study-claims-gun-control-laws-no-effect-canadian-201239248.html"&gt;claims&lt;/a&gt; gun-control laws have no effect on Canadian murder rate."  And, "a new study &lt;a href="http://www.economist.com/node/18713538"&gt;argues&lt;/a&gt; that poker is a game of skill, not chance."  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;It's as if the reporters want to pretend scientific papers are always right, unless they conclude something that the reporter or editor doesn't agree with.  But it's not the reporter's job to be implying the correctness of a conclusion, unless the reporter has analyzed the paper, and is writing the article as an opinion piece.&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Ninety-nine percent of the time, a research paper does not "show" anything -- it only argues it.  Because, correct conclusions don't just pop out of a regression.  They only show up when you support that regression with a good, solid argument.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-5045827626038734586?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/5045827626038734586/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=5045827626038734586' title='14 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/5045827626038734586'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/5045827626038734586'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/11/research-study-is-just-peer-reviewed.html' title='A research study is just a peer-reviewed argument'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>14</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-1708598180317279029</id><published>2011-11-10T14:53:00.004-05:00</published><updated>2011-11-10T15:05:59.443-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='economics'/><category scheme='http://www.blogger.com/atom/ns#' term='baseball'/><title type='text'>The main economic benefit of baseball: we love it</title><content type='html'>&lt;span style="font-family:verdana;"&gt;From "The Sports Economist," &lt;a href="http://thesportseconomist.com/2011/10/13/what-is-the-ultimate-economic-impact-of-sports/"&gt;here's&lt;/a&gt; commenter "bobby" with a very good comment:&lt;/span&gt;&lt;br /&gt;&lt;blockquote&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I find it mildly distressing that almost all of the discussion about economic impacts of sporting events is about rectangles with rarely if ever a discussion of triangles. I was always trained that welfare was measured by consumer and producer surplus, not expenditures, but then what do I know?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;...&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I guess the idea that people are happier with a baseball game than a movie doesn’t mean much anymore, and its downright silly to suggest that a baseball game makes a place better off because people could have gone to a movie instead.&lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;What he's saying is that if you're trying to measure the benefit of something, you measure it by "consumer surplus."  That's the economic term for the difference between what you have to pay for something, and the maximum price you'd be willing to pay for it.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;A lot of things have a huge consumer surplus.  Take, for instance, a headache pill.  An ibuprofen tablet costs only a few pennies -- a dime, tops.  How much would you be willing to pay to make your headache go away?  It's at least a dollar -- probably more, but at least a dollar.  That means that every time you buy an Advil for ten cents, you're making a "profit" of at least 90 cents.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;An easier way of looking at it is this: if the product you're using didn't exist, how much worse off would you be?  That's consumer surplus.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;There's consumer surplus in almost everything you pay for.  That's because, if you didn't buy it, you'd have to buy something else you liked less.  If there were no Tim Hortons, I'd have to buy Starbucks coffee, which I don't like as much.  If my favorite restaurant closed, I'd have to go to my second favorite, which is still good, but not as good as my favorite.  And so on.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Now, for sports: how much consumer surplus do you get from sports?  How much worse off would you be if there were no baseball, or hockey?  For most of you reading this, your answer is probably -- a lot worse.  You'd have more money, because you wouldn't be spending on trading cards and tickets and Bill James Baseball Abstracts, but, that barely matters, compared to how much less interesting your life would be without baseball.  The same is probably true for your favorite team, if you have one.  My life would be a lot worse without the Toronto Maple Leafs, even if all the other teams were still around.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So Bobby's argument is quite correct.  The most important economic consideration, when it comes to pro sports is how much better off people are because of it.  Why, then, in almost every discussion of sports and economics, is this not a consideration?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;One reason, as Victor Matheson says &lt;a href="http://thesportseconomist.com/2011/10/21/a-response-to-bobby-and-phil/"&gt;here&lt;/a&gt;, is that economists are often reacting to questions from non-economists, or politicians, who are more concerned about GDP and job creation than about the intrinsic value of sports as entertainment.  When a candidate for office talks about building a new stadium to attract a team, the economic arguments are about the monetary values of the transactions it will create, rather than how happy the fans will be.  And so, that's what the economists have to respond to.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Most of the time, though, the answer is that a new sports team creates only negligible zero jobs on net, and little increase in GDP.  Because, after all, if the money didn't get spent on sports, it would get spent on something else.  If you live in Montreal and don't have the Expos any more, you'll go to a movie, or go out to dinner instead.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That's true for almost anything.  If medicines were made illegal, GDP wouldn't change much in the long term -- you'll take the dime you would have spent on an Advil, and spend it on something else instead.  The main reason a headache pill is a good thing is not that it adds 10 cents to total output, but that its benefit is way, way higher than its cost.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;What I'd like to see, in economic analysis of sports, is some kind of estimate of how much it improves people's lives.  It's a lot.  Matheson says in his post that there have been some attempts to quantify consumer surplus, but his example is only among people who pay for tickets.  But what about everything else?  Most of us benefit from baseball far, far more than just our ticket purchases.  We watch games on TV, write blogs about them, analyze them, talk about them at the water cooler.  Sports are a big part of the fabric of most of our lives, and having a team in our city to root for is a huge unmeasured happy benefit.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I have argued before that if it makes sense for government to subsidize things like public broadcasting (CBC, BBC, NPR, etc.), it should also make sense for it to subsidize a hockey team, on a cost/benefit basis.  But I can't show you any evidence (other than back-of-the-envelope) that that's the case, unless and until the economists start listening to bobby, and get to work on showing the size of the benefit.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-1708598180317279029?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/1708598180317279029/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=1708598180317279029' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/1708598180317279029'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/1708598180317279029'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/11/main-economic-benefit-of-baseball-we.html' title='The main economic benefit of baseball: we love it'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-8010860885394876631</id><published>2011-11-02T15:35:00.005-04:00</published><updated>2011-11-02T15:53:34.386-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Baseball Analyst'/><title type='text'>"Baseball Analyst" archives now available</title><content type='html'>&lt;span style="font-family:verdana;"&gt;In 1982, Bill James created the "Baseball Analyst," a bimonthly amateur sabermetrics journal that relied on contributions from readers.  It ran 40 issues, dying in early 1989.&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;br /&gt;Last weekend, with Bill's permission, I scanned all my issues and sent them to Jacob Pomrenke of SABR.  Stephen Roney contributed some pages I was missing.  Jacob reformatted everything.  Rob Neyer, who was responsible for the Analyst's last few issues, wrote an introduction.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Finally, Jacob put it all &lt;a href="http://sabr.org/latest/baseball-analyst-archives-now-available"&gt;online&lt;/a&gt; at the SABR website.  All 40 issues are now publicly available for download.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Great stuff ... thanks to Jacob, Stephen, Rob, and especially Bill!&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-8010860885394876631?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/8010860885394876631/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=8010860885394876631' title='7 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/8010860885394876631'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/8010860885394876631'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/11/baseball-analyst-archives-now-available.html' title='&quot;Baseball Analyst&quot; archives now available'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>7</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-1632679008902068075</id><published>2011-10-31T14:56:00.003-04:00</published><updated>2011-10-31T15:00:34.857-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='NFL'/><category scheme='http://www.blogger.com/atom/ns#' term='football'/><category scheme='http://www.blogger.com/atom/ns#' term='distribution of talent'/><category scheme='http://www.blogger.com/atom/ns#' term='simulation'/><title type='text'>A rudimentary NFL season simulation</title><content type='html'>&lt;span style="font-family:verdana;"&gt;Following a &lt;a href="http://www.insidethebook.com/ee/index.php/site/comments/poz_on_playoff_structures_of_various_leagues/"&gt;post&lt;/a&gt; by Tango a couple of weeks ago on the playoff systems of the various sports, I thought I'd try writing a simulation.  This is an update of that work in progress.  Actually, I've only started on the NFL, and I haven't even done playoffs yet, just the regular season.  But I thought I'd at least share what I've got so far.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In the simulation, each of the 32 teams was assigned a "true talent," from a normal distribution with mean .500 and standard deviation .143.  No team was allowed to have talent higher than .900 or lower than .100; if they did, they were moved to .900 or .100.  Then, all 32 teams were moved the same amount (arithmetically) in the same direction to get the overall talent to average exactly .500.  (I think this method actually reduces the expected SD below .143, but I didn't bother fixing that.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The 16-game schedule is random, instead of unbalanced (with the restriction that a team can't face any another team more than twice).  There are no tie games.  There is no home field advantage (although that would be easy to add in).  The chance of winning each game is determined by the log5 method.  There are no ties in games.  Ties in the standings (division or wild card) are broken randomly.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;As I said, I stopped there for now; haven't done playoffs yet.  That's the next step, along with home field advantage.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Anyway, here are some results.  Each result is out of 100,000 seasons.  Every result came from a different run of the simulation.  Results varied a fair bit per run, but I think everything is reasonably typical.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I checked for all teams out of 3,200,000 (32 teams, 100,000 seasons) that finished more than 8 games above or below their talent.  That's hard to do, obviously.  Also, the worse or better you are, the harder it is.  It's (relatively) easier for an 8-8 team to go 16-0 than for a 3-13 team to go 11-5.  Amplifying that is the fact that there are a lot more 8-8 teams than 3-13 teams.  However, offsetting that, a little bit, is the fact that the 3-13 team can also go 12-4 or 13-3 or better. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In any case ... there were 43 cases where a team differed from its talent by 8 games or more.  Of those, 26 were teams that outperformed, and 17 were teams that underperformed.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The biggest differential was in season 98,534, where the Broncos a team that had talent of 4.56 wins (out of 16), but went 14-2, for a differential of 9.44 games.  That was the only team with a differential of 9 or more.  Part of the reason it did so well was that it faced inferior opponents.  You'd expect any given team's opponents to average 8.00 games of talent.  But in that season, the Broncos' opponents' talent was only 7.45 games.  Not a huge difference, but still.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Actually, when it comes to extreme events, a small difference in opponents makes a big difference in probability.  Of the 43 teams in the sample, 38 of them had records that went in the direction "aided" by the opposition (in the sense that the underperforming teams played better-than-expected opponents, and vice versa).  That's 38-5 in favor.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The worst team in the sample was the season 63,924 Jets, a 3.03 team that went 12-4 (playing 7.11-win opponents).  The best team in the sample was a Bucs team that was expected to win 12.04 games, but instead went 4-12 (playing 8.58-win opponents).&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I also took a look at teams that went 0-16.&lt;br /&gt;&lt;br /&gt;Those results probably aren't as realistic, because they're heavily dependent on the shape of the tail of the talent distribution ... and we really don't know what that is.  Recall that we chose a normal distribution that gets truncated at .100 (1.6 wins).  Both those choices -- normal, and truncated -- are arbitrary and probably not close enough to real life.  (Also, teams could drop below .100 in talent from the adjustment that sets all league-years to .500.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In addition, the other shortcuts in the simulation probably skew the results too.  The mainstream results are probably right, but the extremes are extremely sensitive to some of the assumptions. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;With those caveats: there were 5,663 of those 0-16 teams out of 3.2 million, and their average talent was .181, which is just under 3-13.  I suspect the talent of actual flesh-and-blood 0-16 teams is higher than that, but I really don't know.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;16-0 should be exactly symmetrical, so I won't show that separately.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I checked for four-way ties where every team has the same record.  That happened 878 times out of 800,000, or about once every century.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;There were five seasons out of 100,000 where two divisions had a four-way tie.  Actually, that might be a little high ... the test runs had only 1 or 2 such seasons.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;---------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Anyway, before I start on the playoffs, and repeating this for other sports leagues, I'm looking for feedback on what I've got so far.  Any suggestions?  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;And, if you want me to run the sim and check for something in particular, let me know in the comments.  It's real easy to add a couple of lines of code to check for something specific.  &lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-1632679008902068075?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/1632679008902068075/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=1632679008902068075' title='16 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/1632679008902068075'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/1632679008902068075'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/10/rudimentary-nfl-season-simulation.html' title='A rudimentary NFL season simulation'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>16</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-6565493386591939384</id><published>2011-10-26T18:39:00.010-04:00</published><updated>2011-10-29T19:03:43.174-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='rudeness'/><category scheme='http://www.blogger.com/atom/ns#' term='internet'/><title type='text'>Being proven wrong is like winning the lottery</title><content type='html'>&lt;span style="font-family:verdana;"&gt;Following my &lt;a href="http://sabermetricresearch.blogspot.com/2011/10/in-defense-of-online-rudeness.html"&gt;last post&lt;/a&gt; on being a dick in online message boards, I was going to prepare a list of things that can really piss off people who are trying to have a discussion with you, things that you shouldn't do. Like, for instance, changing the subject when someone brings up key points, or being inappropriately picky in ways that don't affect the main point.  That kind of thing.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;As I was preparing that list, it occurred to me: these are things that nobody would ever do who really, truly cared about whether they were right or not.  They're things that people do when they're trapped by logic, but they're reluctant to admit their position might be wrong.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Why don't people want to admit they were wrong, even when it seems obvious to everyone else?  Ego.  Nobody likes to be proven wrong.  But, if we can get over that pride thing, we should WANT to be proven wrong!  It makes things better for us.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Because, it's bad to be wrong.  Therefore, if you're wrong, the best thing is to *stop being wrong*.  And the way to stop being wrong is to change your mind.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Also, there are lots of side benefits to changing from a wrong view to a correct view.  For one thing, you're suddenly more right!  And, for another, now being right lets you see things in a whole new way.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;If you think that 1+1 equals 3, the world isn't going to make a whole lot of sense to you.  You'll think your paycheck doesn't add up, and everyone's giving you the wrong change.  You'll think the IRS is ripping you off, and mathematics is just arbitrary, since everyone just seems to be totaling things a different way.  You'll be suspicious of everyone, and life is going to be pretty difficult.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, when you see that 1+1=2, suddenly, everything comes into focus!  You'll see the world works logically, after all.  You now predict what people will do, and see that everything really does add up.  Your life is better -- a lot better.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, you should be happy and excited at the idea that we might be wrong.  The guy you're getting mad at could be the one that changes your life, if it turns out that he's right.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Another way to look at it: suppose you're a successful major-league pitcher, making a few million dollars a year.  You go to the doctor, and he says, hey, there's been something wrong with your elbow since birth.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Should you be insulted and angry that the doctor is insulting your awesome pitching arm?  No, you should be very excited -- that's great news!  If the doctor is right, and he the problem, maybe you'll be throwing 100 mph instead of 95!&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Don't like that analogy?  OK, here's another try.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Let's suppose you open a restaurant, and you're very successful, and people like your food.  You're very proud of being a great chef.  Then, someone tells you, correctly, that one of your appetizers, one that you think is one of your best, is actually pretty awful.  Your customers hate it.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Your first reaction might be to get defensive.  But, again, you should be thrilled!  Now you can fix that dish.  Your food, your restaurant, your profit, and your reputation will all be better than before.  It's almost the best thing that can happen to you. &lt;s&gt;Being&lt;/s&gt; Finding out you're wrong is like winning the lottery!&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, in online debates, we should all argue as if God will give us a million dollars if we're proven wrong.  That way, when someone disagrees with us, we won't just dismiss him out of hand or twist our logic to try to save our own position.  We'll stop and think, "hey, is it possible that this guy has my million dollars?"  We'll be less likely to let our ego take over and turn us into dicks.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Okay, maybe that's too much money.  Maybe for a shot at a million dollars, we'd listen to idiots way, way too long.  Maybe the right amount is, I dunno, a hundred dollars, or something.  But you get the idea.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;And I should add that I'm not saying that I, personally, know how to suppress my own ego, or even that I succeed in doing it when I try.  I'm just saying that I know I *should*.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I guess my overall point is that any online discussion, even between people who violently disagree with each other, should be a co-operative venture.  One of you is wrong, and you're working together to find out who.  And, we should keep in mind that most of the benefit goes to the person who was actually wrong in the first place.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;When someone you respect, or someone who seems to be expert and knowledgeable, starts disagreeing with you, it's like you've stumbled upon a fistful of lottery tickets.  Argue your position, yes, but don't get defensive, and keep an open mind.  Sure, it might be that other guy who's wrong.  But if you're really, really lucky, it'll be you.&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-6565493386591939384?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/6565493386591939384/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=6565493386591939384' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/6565493386591939384'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/6565493386591939384'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/10/being-proven-wrong-is-like-winning.html' title='Being proven wrong is like winning the lottery'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-7697269252838453979</id><published>2011-10-22T12:08:00.005-04:00</published><updated>2011-10-22T16:48:39.439-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Don Cherry'/><category scheme='http://www.blogger.com/atom/ns#' term='rudeness'/><category scheme='http://www.blogger.com/atom/ns#' term='internet'/><title type='text'>In defense of online rudeness</title><content type='html'>&lt;span style="font-family:verdana;"&gt;A couple of weeks ago, the moderator of a certain website I frequent posted a message, reminding us commenters to respect each other.  The warning wasn't random, of course; it was prompted by a discussion that got a little less civilized than normal for that site.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;This kind of thing plays out all the time on thousands of different sites.  But what bugs me about these "please be civil and respect each other" warnings is that they only target one general type of rudeness.  I did a Google search for "message board rules etiquette," to get some examples of posted rules.  A lot of them include something like &lt;a href="http://www.lotsofkids.com/forums/Help/Netiquette.php"&gt;this&lt;/a&gt;:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-family:verdana;"&gt;"Don't issue personal attacks, use profanity, or post threatening, abusive, harassing, or otherwise offensive language or images. Keep your messages appropriate and courteous at all times. Please disagree with other opinions respectfully."&lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Now, most of us follow this advice almost all the time.  But, sometimes, we don't.  When don't we?  When we get really frustrated with someone.  &lt;/span&gt;&lt;span style="font-family:verdana;"&gt;Why do we get frustrated? &lt;br /&gt;&lt;br /&gt;Well, they might be repeatedly misrepresenting something we wrote.  They might be ignoring our questions.  They might be following the argument back and forth for hours, until they realize they're "losing", and change the subject.  They might be obviously disingenuous, denying something they wrote in another post just days before.  They might agree with what you say when it suits their argument, but change their mind as soon as it goes against them.  They might be trolling for fun.  They might be committing any one of a thousand logical fallacies, and refusing to be corrected.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But ... the rules don't prohibit that, do they?  The rules say you can get kicked off the forum for calling someone an idiot.  But you can't get kicked off for repeatedly (and perhaps deliberately) butchering your logic.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Here's a hypothetical situation I made up:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-family:verdana;"&gt;A: US citizens spend too much on foreign aid.  We need to help our own instead.  There are thousands out of work.  I don't know why you don't see that.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;B: I see that, but money spent outside the US can help a lot more people who are desperate.  We can save hundreds of lives for almost nothing.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;A: I don't believe you.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;B: (Goes and searches the internet.  Writes several paragraphs of illustration of various public health costs, and how cheap it is to save lives in Africa with cheap drugs or vitamins or something.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;A: Well, maybe, but the multinationals make too much profit when we do that.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;B: (Goes and searches the internet)  Here are some companies selling drugs at cost, or offering them for free, if we just pay to distribute them!&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;A: Yeah, whatever.  And, regardless, we still spend too much.  Charity is nice, but in moderation.  We should spend only about half of 1% of our income on foreign aid.  That's $1 out of every $200.   That's my maximum.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;B: Hang on, let me search the internet ... well, as it turns out, we spend only 0.1% of our income on foreign aid!  So, we both agree that we could spend a little more, right?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;A: I said we should spend half of that!  See, we spend too much!&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;B: No, half of that would be 0.05%.  You said 0.5%.  Here, let me quote your post: "$1 out of every $200".&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;A: You're misquoting me.  Besides, you can throw numbers at me all you want.  They're just numbers.  The fact is that we spend too much on foreign aid.  We need to help our own instead.  I don't know why you don't see that.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;B: You're a dick.&lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;What happens next?  B gets in trouble for calling A a dick.  But A gets off scot-free for BEING a dick.  And that's a lot worse.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;As far as I'm concerned, it should be A who gets kicked off the site, not B.  But that'll never happen.  See, B's offense was objective, and easily caught.  He used the word "dick".  Everyone can understand that, and you can easily make a rule out of it.  "Why did B get kicked off?"  "He called A a dick."  "Oh, OK."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;On the other hand, A's offense was subjective.  It requires a judgment call from the moderator.  And, there's no smoking gun.  "Why did A get kicked off?"  B would say, "He was a dick."  But A's supporters would say, "It's because he didn't agree with B."  "It's because the moderator didn't like his politics."  "It's because nobody respects B's right to stand up for American workers."  "It's because that website doesn't respect dissenting views."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;It's hard to describe what A did wrong.  The evidence is hard to describe.  There's no smoking dick.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;And so, the As of the world get away with it, and we just have to put up with them.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I don't think that that B said anything morally objectionable.  A indeed WAS a dick.  Sure, it didn't *have* to be said ... but, it was true.  At least in the sense that "you're a dick" can ever be said to be true.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;And I would argue that B, after investing so much time and effort in moving the argument forward, earned the right to say it. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Sure, you don't want discussions degenerating into name-calling.  That's no fun for anyone.  But, in the appropriate context, an occasional, controlled outburst is OK.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Strained analogy: think of a discussion as a pot luck, and insults like ketchup.  If you show up at a pot luck, you don't just bring a bottle of ketchup.  That's rude, and tacky.  But, if you bring hamburgers for everyone, and you *also* bring a bottle of ketchup ... that's perfectly OK.  In fact, the hamburgers you're providing are actually enhanced by the ketchup you brought.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The rule is, if you want the right to serve ketchup at the pot luck, you have the obligation to serve the meat to go with it.  And, if B has just spent the better part of an hour researching foreign aid, and hundreds of characters typing rebuttals to a poster he thought was arguing in good faith ... that's a LOT of meat.  You've got to say that B has earned the right to pound the ketchup bottle a little bit.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;---------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;This is one of the problems we have with public discussion in general.  Whether it's politicians, columnists, academics, or talking heads on TV, the unwritten code is the same.  You can butcher logic all you want, and nobody will call you on it.  But resort to name-calling, or other "uneducated" forms of language, and you get in trouble.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The public doesn't have the time or patience to judge what you've said.  But it does understand *how* you said it!&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;You probably know what happened to Don Cherry a couple of weeks ago.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Over the summer, in separate incidents, three former NHL players took their own lives.  They had all been "enforcers," players kept on the team for their willingness to start fights with opposing players.  Speculation ensued that there was somehow a link between being an enforcer and having mental health issues.  Three other former enforcers, Stu Grimson, Chris Nilan, and Jim Thomson, apparently made comments that expressed support for that hypothesis.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Don Cherry went on TV and accused Grimson, Nilan and Thomson of hypocrisy.   Those guys, themselves, once chose to make a very good living with their fists, Cherry said, knowing that they'd be out of professional sports if they didn't.  Now, they're trying to deny the same choice to today's players, since, with their careers long over, banning fighting would not longer cost them anything.  And this, Cherry argued, was on the basis of a flimsy hypothesis with no solid evidence behind it.  Hypocrites!&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Seems like a legitimate argument, right?  I mean, you can certainly disagree with it, but it's not that unreasonable a point to make, in the context of a controversial issue that's already attracted a lot of discussion.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But Cherry got himself in trouble.  Why?  Because he didn't use educated language.  He didn't say it formally, the way I described it.  He used less fancy words.  One word, in particular: "pukes."  Cherry called Grimson, Nilan, and Thomson "a bunch of pukes."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Without the word "pukes," it's just another Don Cherry TV segment.  But with the word "pukes," suddenly there's outrage.  There were news stories, outraged columnists, and even newspaper editorials, all of them prominently featuring the word "pukes."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In the midst of all this, Grimson &lt;a href="http://www.montrealgazette.com/sports/Cherry+must+apologize+could+face+lawsuit+sources/5529147/story.html"&gt;threatened to sue&lt;/a&gt; Cherry if he didn't apologize.  That was more than an idle threat: Grimson is a lawyer, and his statement threatening "further recourse" was issued by his own law firm.  Cherry apologized a few days later.  Grimson issued a &lt;a href="http://www.tsn.ca/nhl/story/?id=378263"&gt;new statement&lt;/a&gt; that said, OK, he wouldn't sue, but maybe the CBC should fire Cherry anyway.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Cherry got a raw deal: not just because of the content, which wasn't really any more controversial than his usual, but because of the attention he got by breaking the taboo against name-calling.  "If you call somebody a name," goes the unwritten, unspoken rule, "it signals that you're uneducated and boorish, so, accordingly, we will oppose your argument exceedingly vigorously."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The media and public would do better to take some &lt;a href="http://www.insidethebook.com/ee/index.php/site/article/wow_i_am_quoted_as_the_lead_in_for_a_blog/"&gt;now-famous&lt;/a&gt; advice from MGL: "If you guys can’t separate tone from substance, that is your problem not mine. Stop being such whiners about tone.”&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size:85%;"&gt;&lt;span style="font-family:verdana;"&gt;P.S.  Kind of off topic, but while I'm here ... &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I don't agree with Cherry's logic that Grimson is a "puke" for speculating on the link between fighting and mental health.  I think that's a perfectly reasonable thing for a former goon to wonder about.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, I do have a problem with Stu Grimson's conduct afterwards.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I mean ... What kind of guy spends his career beating people up, then complains about a "lack of decency" because someone has the temerity to criticize his views about it?  What kind of guy threatens to sue someone just because he's been called a childish name?  What kind of guy speaks out on a position of public importance, and then when someone disagrees, tries to get him fired?  What kind of guy would leverage his advantage -- knowing he's a lawyer and can cause all kinds of problems for Don Cherry at no cost to himself -- to extort an apology with credible threats of a lawsuit?  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Maybe Don Cherry was right about Grimson, after all.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-7697269252838453979?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/7697269252838453979/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=7697269252838453979' title='13 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/7697269252838453979'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/7697269252838453979'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/10/in-defense-of-online-rudeness.html' title='In defense of online rudeness'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>13</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-3272546973572107099</id><published>2011-10-19T10:43:00.005-04:00</published><updated>2011-10-21T14:00:52.674-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Moneyball'/><category scheme='http://www.blogger.com/atom/ns#' term='baseball'/><category scheme='http://www.blogger.com/atom/ns#' term='sabermetrics'/><title type='text'>How much does "Moneyball" help a team?</title><content type='html'>&lt;span style="font-family:verdana;"&gt;How much is sabermetrics worth to a team?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That's probably a hard question to answer.  Every team uses statistics to some extent.  Even before sabermetrics, teams were looking at player statistics to decide who to play and who not to play.  They may not have had any fancy formulas, but they had a pretty good idea of how to weight the relative contributions of players.  Nobody ever released a 30-HR guy because he was only hitting .240, and nobody ever released a .330 hitter because he had no power.  Intuitive evaluations weren't perfect, of course, but they were pretty reasonable most of the time.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Where sabermetrics helps, I think, is not in evaluating actual performance, but in helping figure out *future* performance.  How to extrapolate minor-league performance in to major league performance ... how to take luck out of a player's batting or pitching line ... figuring how different kinds of players age ... that sort of thing.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Suppose you took a team management right out of the early 1970s, and gave them a team today without letting them learn anything discovered after 1977.  How much would that team underperform compared to the rest of MLB?  I don't have an answer to the question, but I'd be interested in hearing yours.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Anyway, here's a narrower question.  How much can a more sabermetric approach *today* benefit a team, compared to, say, the typical team's sabermetric approach?  For instance, how much did Billy Beane really mean to the A's?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;A couple of weeks ago, Tango did a &lt;a href="http://www.insidethebook.com/ee/index.php/site/comments/payroll_v_wins_in_the_moneyball_era_2002_2011/"&gt;study&lt;/a&gt; to figure out which teams did better or worse than expected, given their payroll.  The A's were the team that outperformed the most over the last decade -- about 7 games per season, it looks like.  That's a lot, but there's probably a whole bunch of luck there, since we're cherry-picking them as the best of the lot.  Also, it's possible that much of their outperformance came in the early years, when, as many critics of "Moneyball" hype have pointed out, they had three underpriced ace starters.&lt;br /&gt;&lt;br /&gt;So, we'd have to regress that 7 games to the mean a fair bit.  If you made me make an arbitrary guess, I'd be willing to bet that less than half of that seven game advantage came from sabermetrics.  (But, I have no real basis for that guess without studying it.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Anyway, with the Cubs signing Theo Epstein, we now have a market estimate for what sabermetrics might be worth today.  Epstein's new &lt;a href="http://thebiglead.com/index.php/2011/10/12/theo-epstein-to-the-cubs-for-20-million-over-5-years-how-quick-can-he-turn-the-franchise-around/"&gt;agreement&lt;/a&gt; is for about $4 million per season.  He still had one year to go on his contract with the Red Sox, for which they will receive some sort of &lt;a href="http://www.insidethebook.com/ee/index.php/site/comments/proper_compensation_for_epstein/"&gt;compensation&lt;/a&gt; from the Cubs.  Let's say that compensation will be worth $1 million.  So Epstein's value is around $5 million.  I don't know how much an &lt;s&gt;average&lt;/s&gt; replacement level GM makes, by comparison.  To be conservative, let's say it's $500,000, although it's probably more than that.  That means that Epstein's excess value is $4.5 million, exactly what it costs in free agent players to gain one extra win.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;It looks like that's what Epstein is worth: one win per season.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Is that a lot?  Frankly, I don't know.  It's a competitive market for players these days, with lots of money on the line, and there's lots of random luck in who makes it and who doesn't.  In that light, it could be that one win per season is an exceptional, genius-level performance.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;If that's the case, doesn't it mean that the "Moneyball" approach is overrated?  I mean, one win a year.  At that rate, it would take decades, even centuries, to have good statistical evidence that the sabermetric approach works.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Of course, you have to remember that that's compared to other teams ... and, nowadays, those other teams are doing a fair amount of statistical work themselves.  Maybe it's three or four games over a team that won't look at anything new at all, that never heard of Voros McCracken and winds up overpaying pitchers with lucky BABIPs.  And, maybe Epstein took less pay than he was worth in order to become a Cub.  Maybe it's a win and a quarter, or a win and a half.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Still ... to me, one game doesn't seem that unreasonable.  The point might not be that an you can win pennants just by embracing sabermetrics.  The point might be that, with every team in a sabermetric arms race against every other team, you certainly can *lose* pennants if you persist in living in the 70s.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, again ... one game.  Doesn't that mean that if a team does well, and someone credits "Moneyball," they're probably just blowing smoke?&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;UPDATES:&lt;br /&gt;&lt;br /&gt;1.  In the comments, Bill Waite suggests that sabermetrically-savvy managers might have a significant impact, too.  He says that just rejigging the lineup is worth almost half a game a season, and says that the difference between best and worst could be as much as eight games. &lt;br /&gt;&lt;br /&gt;Food for thought.  It would be interesting to consider how to try to look for this in the historical record (if indeed that is possible), since we know that some managers are indeed more numbers-oriented than others.&lt;br /&gt;&lt;br /&gt;2.  Matt Swartz e-mailed me about a study where he found a positive correlation between sabermetric management and team performance.  It's &lt;a href="http://www.baseballprospectus.com/article.php?articleid=11890"&gt;here&lt;/a&gt;. &lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-3272546973572107099?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/3272546973572107099/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=3272546973572107099' title='13 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/3272546973572107099'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/3272546973572107099'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/10/how-much-does-moneyball-help-team.html' title='How much does &quot;Moneyball&quot; help a team?'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>13</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-606982854492275755</id><published>2011-10-15T15:04:00.004-04:00</published><updated>2011-10-15T15:16:24.165-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='economics'/><category scheme='http://www.blogger.com/atom/ns#' term='Warren Buffett'/><category scheme='http://www.blogger.com/atom/ns#' term='taxes'/><title type='text'>Capital Gains and Warren Buffett: Part II</title><content type='html'>&lt;span style="font-family: verdana;"&gt;This is a continuation of the &lt;a href="http://sabermetricresearch.blogspot.com/2011/10/capital-gains-and-warren-buffett-part-i.html"&gt;previous post&lt;/a&gt; about why capital gains taxes should be lower than regular taxes, and perhaps even zero.&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;-------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana; font-weight: bold;"&gt;3.  More Double Taxation&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;One year, the New York Yankees draft Mickey, a player with star potential.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;The team signs him to a five-year contract at $2 million a year.  They figure that's a bargain.  Mickey is good enough that he's expected to produce $15 million a year in revenue, so the Yankees figure, reasonably, that they're making $65 million on the deal.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;The day after the draft, along comes the IRS.  "Hey," they say to the Yankees.  "You just signed a deal that's worth $65 million in profit.  We want the corporate tax on that, $22 million.  Pay up!"&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;The Yankees say, "Wait a second!  Mickey hasn't played a single game for us yet, so we haven't made any profit!  How can you be asking for the tax already?"&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;The IRS says, "We don't want to wait.  We want the money now.  But, don't worry, we know we're collecting the money in advance, so we're discounting the amount owed by the current interest rate."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;The Yankees protest.  "That's still not fair," they say.  "You shouldn't tax us until we actually make the money.  After all, Mickey just signed a contract for $10 million.  You're not taxing *him* on that $10 million right away ... you're going to tax him only when he gets his paychecks.  You're not going to his house and demanding $4 million from him right this second, are you?"&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;The government agent replies, "I don't know why you have a problem with this.  I already told you we're adjusting for the time value of money.  And we know you *have* enough cash to pay us now, because you're the Yankees.  So why not?"  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;"What if Mickey doesn't perform and we don't make the expected profit?" the Yankees ask.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;"No problem," replies the IRS.  "We'll give you a partial refund.  But, of course, if Mickey starts playing like Albert Pujols, and you clean up, we'll ask for more."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;"Hmmm," the Yankees owner says.  "I guess it doesn't really matter to the long-term bottom line.  But it still seems like it's not right."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Is it unfair?  Yeah, I think it is ... if the Yankees haven't made any profit yet, they shouldn't pay any tax on that profit.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Still, the present value of the tax paid is the same, whether the Yankees pay it in a lump sum, or whether they pay it on a year-by-year basis.  So if paying in advance is unfair, it's unfair only from a timing standpoint.  Either way, the Yankees pay the right amount of tax.  It's just that one way, the IRS is in an ungodly hurry, that they have no business being in.  But it's the same amount of money.  You might even imagine a situation where the Yankees would *choose* to pay in advance.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;So this is just slightly unfair.  What would make it *really* unfair is if the IRS made the Yankees pay BOTH WAYS.  First, if they made them pay $22 million on their projected $65 million profit, and then they also made them pay regular corporate tax on the annual $13 million profit.  That would be pure double taxation, right?  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;It has to be one or the other.  Either they pay the tax on the projected profit in advance, or they pay the tax as the profit comes in.  If you make the Yankees pay both, you really are making them pay exactly twice as much tax.  Instead of 35% tax, they'd be paying 70%.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Now, if you don't stop to think about it at all, it might not seem unfair that way.  When they sign the contract, the accountants say, "Congratulations, Mr. Cashman, that deal you just made with Mickey is worth $65 million."  And, at the end of the next five years, the accountants say, "Congratulations, Mr. Cashman, you made $13 million more this year because of Mickey."  It may be tempting to add all those up to get $130 million in profit.  But you can't do that.  The $65 million is THE SAME MONEY as the five $13 millions.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Tax the profit in advance, or tax it when it comes in.  Choose only one.  If you do both, it's grossly unfair.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;With me so far?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Now I'm going to argue that when you have a capital gain on a stock, this is almost exactly what happens: the profit gets taxed twice: once in advance, and once when it comes in.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;You buy a stock in a drug company, Acme, for $1 a share.  Acme is just barely breaking even.  After you buy the stock, Acme discovers a new drug that prevents heart attacks.  It's going to be a blockbuster.  In fact, it's so good that it's going to produce annual earnings and dividends of $2 a share, indefinitely.  But, first, the FDA has to approve the drug.  It's so good, with no side effects, that FDA approval is assured, but it will still take two years.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Wall Street immediately calculates that $2 a share, indefinitely, starting in two years, makes the stock worth $31.  Shareholders celebrate.  You sell your share of Acme for $31, incurring a $30 capital gain.  Should you pay tax on that capital gain?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Look what's happened.  Acme hasn't actually made any profit yet.  The $30 increase in the stock price represents *future earnings*, just like the $65 million contract with Mickey is future earnings.  So if you tax the $30 increase -- which is what capital gains tax does -- and then you later tax the corporate profits -- which of course you will do -- that's exactly analogous to what happens if you tax the Yankees twice.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;To make things fair, the IRS should pick one: either tax the capital gain right now, or tax the corporate profits later.  One or the other.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Otherwise, the IRS is getting a lot more than the corporate tax on the profits.  Because, the fact that I sold my share does not change the future tax bill of the corporation, or the new shareholder, at all.  Whether I sell the share or not, the corporation will still pay the normal taxes on the profits.  But, by me selling the share, I trigger an *extra* tax, a capital gains tax, on the value of the future profits.  That's two taxes on the same money.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;The only thing that makes this example different from the Yankees example is that the two taxes are paid by two separate people, instead of one.  That kind of hides the unfairness.  The original buyer pays the full, future corporate tax, which seems OK because he just made a big gain on his stock.  And the new buyer pays the full, future corporate tax, which seems fair because he's the owner at the time the profit was made.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;But, still, they're both paying tax on the *same money*.   [See footnote 1.]&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;As I said, it should be one or the other.  Since it's hard to discount the corporate tax by capital gains tax that's already been paid, the easiest solution is to just eliminate the capital gains tax.  That way, the profit is only be taxed once, when the corporation earns it.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;That may seem unfair, because, after all, when you earned your $30 capital gain on your $1 investment, that's a lot of money.  Why shouldn't you pay tax on it?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;One answer is: you already did, kind of.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;When you sold the share at $31, that was the market price.  That market price is based on after-tax profits.  We said the corporation pays a $2 dividend.  That means it will make $3 in profits and pay $1 in corporate tax, leaving a $2 dividend.  The buyer of your share gets the $2 dividend, and pays 15% tax on it, leaving $1.70 in his pocket.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;That $1.70 in his pocket, year after year, is why the new buyer was willing to pay $31 for your share.  That means he's looking for an after-tax return of 5.48% on his money, which is what he's getting: $1.70 divided by $31.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Now, suppose there were no corporate tax or dividend tax.  Then, instead of $1.70 a year, the new owner of the share would receive the full $3 profit.  For the same yield of 5.48% after tax, he'd now be willing to pay $54.74.  So, if there were no taxes, the shares would have been worth $54.74 instead of $31, and you would have made a capital gain of $53.74 instead of $30.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;So the corporate tax actually did cost you money!  It cost you almost exactly the same rate as the overall taxes on the profits -- in this case, about 45% -- because you received only $31 instead of $54.74.  That is, in order to take over *your* obligation to pay all that future tax to the government, the new buyer demanded a discount of $23.74 to compensate him for taking over your obligation.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;It may look like you're not paying any tax on your capital gain, but, you are, in a hidden kind of way.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Look at it this way.  Suppose that while you owned Acme, the government decided to double the corporate tax on drug companies, forever into the future.  What would happen?  Well, because of the new tax, Acme would only be able to pay $1 in dividends, instead of $2.  That means the new owner would receive only 85 cents a year.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;But the new owner still wants his 5.48% after-tax yield.  So, he's not going to pay $31 for the stock any more.  He's only going to be willing to pay $15.50.  So that's what you'll have to sell it to him for.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;See?  The extra corporate tax comes right out of your pocket, even if you're not the one writing the check.  [See footnote 2.]&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;----------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Here's another way to look at it: when you sell the company, you sell it for $54.74, its full value if there were no taxes, but you promise the buyer to pay all the taxes into the future.  Instead of paying every year, though, you hand the buyer a cheque for $23.74, and say, "here, invest this and it'll pay your tax bill every year."  The buyer says, "OK, that's a deal."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Instead of you actually physically paying the $23.74, you just take it off the price of the share, which is why you wind up with $31.  It doesn't look like you paid any tax, but you actually paid all of the corporation's tax into the future!&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;And, again, that's WITHOUT a separate capital gains tax.  With a full capital gains tax, you'd be paying double.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;---------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Here's a simpler case.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;You grow apples.  There's a 50% income tax on the apples you grow.  There's also a 50% capital gains tax.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;You take two apple seeds, plant them, nurture them, fertilize them, and grow them into two healthy trees.  The apples start growing.  You harvest them and send half the apples as your income tax.  The government, therefore, gets one tree's worth of apples a year -- half the harvest of the two trees.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Now, you sell the two trees to your neighbor.  Since you grew them from nothing, your capital gain is the entire two trees.  So, at a 50% capital gains tax, you owe the government one tree.  But you're out of trees, so you have to buy one tree back from your neighbor, which you now give to the government.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;What's happened?  The government used to get one tree's worth of apples a year.  Now it gets 1.5 trees' worth!  First, the tree that you just paid it in capital gains tax, which now belongs 100% to the government.  Second, half the apples from the other tree the neighbor now owns.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;So, effectively, the government is now taxing the apples not at 50%, but at 75%. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;But it's hidden between the two of you.  You pay the first 50%, and your neighbor pays 50% of what's left.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;It may seem counterintuitive that Warren Buffett should get away with almost no tax on his capital gains.  You have to look hard to see that even if the capital gains tax rate is zero, Warren Buffett, and the rest of us, are still, in a hidden way, paying the normal rate.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;It may not be obvious, but I think it's true.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;--------&lt;br /&gt;&lt;span style="font-size:85%;"&gt;&lt;span style="font-family: verdana;"&gt;Footnote 1: This argument applies when the profit goes on indefinitely.  If not, the double taxation isn't quite as bad.  Suppose the patent on the drug runs out in 20 years.  At that point, the stock will tank and the new buyer will have a capital loss, exactly offsetting the capital gain the old buyer had 20 years before.&lt;br /&gt;&lt;br /&gt;However, the government will have earned interest on that capital gains tax for 20 years, before refunding it without interest.  After 20 years, because of forgone interest, the refund might be worth, say, only 1/4 the original amount.  So it's not double taxation, but only "1.75-ble" taxation.&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size:85%;"&gt;&lt;span style="font-family: verdana;"&gt;Footnote 2:&lt;/span&gt;&lt;span style="font-family: verdana;"&gt;  Actually, it's not quite that simple.  We assumed the new buyer's demand for 5.48% after-tax was fixed.  Really, it's in comparison to other investments of equal risk.  And those other investments also have tax burdens associated with them.  The reason the new buyer demands 5.48% is because that's what he could get, after tax, from a bond of similar risk.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;But if there were no corporate taxes, the rate of return on bonds would also be higher.  Therefore, the new buyer might be demanding, and receving, a higher rate.  That shows that the overall investment tax rate affects the buyer, too.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;So it's not really as clear-cut as saying the original owner, the one who gets the capital gain, pays all the tax.  It's a complicated thing to figure out what proportion of the corporate tax is actually "paid" by the original owner, and what proportion is "paid" by the second owner.  All we know is that the proportions add up to 100%, and the total tax paid is the corporate tax plus the dividend tax.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;I might be wrong here ... economists, please correct me.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-606982854492275755?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/606982854492275755/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=606982854492275755' title='21 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/606982854492275755'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/606982854492275755'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/10/capital-gains-and-warren-buffett-part.html' title='Capital Gains and Warren Buffett: Part II'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>21</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-2972520380840494891</id><published>2011-10-11T13:52:00.004-04:00</published><updated>2011-10-11T14:02:29.066-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='economics'/><category scheme='http://www.blogger.com/atom/ns#' term='Warren Buffett'/><category scheme='http://www.blogger.com/atom/ns#' term='taxes'/><title type='text'>Capital Gains and Warren Buffett: Part I</title><content type='html'>&lt;span style="font-family:verdana;"&gt;Recently, Warren Buffett &lt;a href="http://www.nytimes.com/2011/08/15/opinion/stop-coddling-the-super-rich.html"&gt;noted&lt;/a&gt; that the rate of tax he pays on investment income (around 17%) is much less than the rate his employees pay on their earned income (around 36%).  In a &lt;a href="http://sabermetricresearch.blogspot.com/2011/10/why-warren-buffett-is-wrong-about-tax.html"&gt;previous post&lt;/a&gt;, I argued that the comparison is meaningless, at least in the case of dividends.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Some commenters, here and &lt;a href="http://www.insidethebook.com/ee/index.php/site/comments/phil_v_warren_buffett_phil_on_a_tko/"&gt;at Tango's blog&lt;/a&gt;, criticized me for not dealing with capital gains, which they say is where most of Buffett's income arises.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Capital gains come from many different sources, which bring up different issues of fairness.  So, I'll have several things to say instead of just one.  My conclusion will be that there's an argument to be made for taxing capital gains at a very, very low rate, perhaps even zero.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I will argue that position is true even if you believe that tax rates on the rich are too low.  I believe that even if you think the rich should be taxed, at, say, 60%, or 70%, or 80%, you should STILL favor a system where their "regular" income is taxed at a higher rate and capital gains are taxed at a lower rate.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Here goes.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:verdana;" &gt;1.  Double taxation inside a corporation&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;If you buy a stock, and then sell it at a higher price, your profit is a capital gain.  But, in many cases, the corporation has already been taxed on the profit that forms some or all of the gain.  To tax it again at the full rate is unfair double taxation.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Suppose you buy a share of a company at $100.  This year, they earn $10 in profits.  They pay $3 corporate tax on the profit, and keep the other $7.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, a year later, and all things being equal, the company is worth $107 a share.  You sell your share for a $7 capital gain.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, the piece of the company you owned actually earned $10 in profits, not $7.  At a 30% tax rate, you already paid $3 in corporate tax on the profit.  To tax you another 30% on the remaining $7 would be unfair.  That would mean you'd only keep $4.90, and your effective tax rate would be 51%.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The concept of "horizontal equity" says that people who have the same income should pay the same amount of tax, regardless of where the income came from.  If the top tax rate on employment income is, say, 40%, then the effective tax on income earned through a corporation, when you combine all the taxes, should also be 40%.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;As I described in more detail in the previous post, a personal capital gains tax rate of around 15% gives the result we're looking for: you keep 85% of 70% of corporate earnings, which works out to a tax rate of 40.5%.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;This is exactly the same argument as in the &lt;a href="http://sabermetricresearch.blogspot.com/2011/10/why-warren-buffett-is-wrong-about-tax.html"&gt;other post&lt;/a&gt;, just for capital gains instead of dividends.  I realize that if you didn't like that argument, you probably won't like this one either.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Want a real-life example?  Suppose you owned one share of Chevron.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Over the 16 years from 1995 to 2010 (&lt;a href="http://www3.valueline.com/dow30/f1936.pdf"&gt;.pdf&lt;/a&gt;), Chevron made a total profit for you of about $123.  It had around a 40% corporate tax rate (I'm not sure why so high -- other companies seem to be around 30%).  That means it paid around $50 in taxes, leaving $73 in after-tax earnings.  It paid around $27 in dividends over that stretch, leaving $45 inside the company.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In that time, the stock went from around $25 to around $100, a $75 capital gain.  More than half of that capital gain -- $45 -- is from the profit on which the corporation has already paid tax for you.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;To fully tax you again on that $45 profit is not particularly fair.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;What about the rest of the capital gain, the remaining $30 out of the $75?  That probably shouldn't be taxed much either, since probably a lot of that is just inflation.  Which brings us to number two.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;font-family:verdana;" &gt;2.  Inflation&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The idea behind income tax is that when you become wealthier, you give some of it to the government to provide public services.  But when the nominal value of your assets goes up only because of inflation, you're not wealthier, are you?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In 1980, you bought a house for $100,000.   Today, you sell it for $300,000.  Are you really $200,000 wealthier?  Of course not.  That's just inflation increasing the price of your house.  In non-monetary terms, you might have paid 200,000 loaves of bread for it in 1980 (at 50 cents a loaf).  In 2011, you sell it again for 200,000 loaves of bread (at $1.50 a loaf).  Really, you've broken even.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;This is pretty obvious, and I think almost everyone understands this already.  That's why, in both Canada and the US, they offer tax relief for capital gains on houses you live in.  In Canada, you pay absolutely zero capital gains tax when you sell your primary residence.  In the USA, I once read, your first $400,000 in gains is tax-free if you buy another house with it.  (Is that still true?)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;If the government didn't do that, there would be riots in the streets.  You wouldn't be able to move!  If you're living in a $500,000 house with $200,000 worth of taxes due when you sell it, you'd have to downsize substantially.  That would obviously be unfair.  In most cases, the profit you made on the house is artificial, just an artifact of inflation.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The same is true for, say, stocks and mutual funds.  Suppose you bought a share twenty years ago for $10.  It never paid dividends.  Today, you sell it at $20, but because of inflation, the $20 buys only what $10 bought then.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;You really haven't made a profit.  Yes, you got more dollar bills now then you paid in the past, but in terms of actual wealth -- the number of loaves of bread it would buy -- you just barely got your investment back.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That's part of the reason why the capital gains tax rate is lower than the regular tax rate: to compensate for the fact that a significant portion of a capital gain isn't really an increase in wealth.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Perhaps the best policy would be that when you sell an asset, you adjust for inflation when figuring your gain, and then you pay tax on that adjusted gain.  The problem with that is that it's complicated and involves lots of arithmetic.  I'd support implementing it anyway.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Now, you might be saying, that's fine if your capital gain just keeps pace with inflation.  But Warren Buffett is famous for his investing prowess, where he makes capital gains that far outstrip inflation!  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;To which my response is: OK, but first, can we agree that he shouldn't have to pay tax on the inflation portion of his gain?  (And if we agree on that, then at least we agree on at least one reason that Buffett's capital gains rate should be less than his employment income rate, right?)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, yes, that still leaves the non-inflation portion of Buffett's gain, which is probably still substantial.  That'll be in Part II.&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-2972520380840494891?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/2972520380840494891/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=2972520380840494891' title='38 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/2972520380840494891'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/2972520380840494891'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/10/capital-gains-and-warren-buffett-part-i.html' title='Capital Gains and Warren Buffett: Part I'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>38</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-7599390373201070715</id><published>2011-10-07T13:30:00.002-04:00</published><updated>2011-10-07T13:35:07.817-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='economics'/><category scheme='http://www.blogger.com/atom/ns#' term='baseball'/><category scheme='http://www.blogger.com/atom/ns#' term='payroll'/><title type='text'>Would MLB salaries drop if all players were free agents?  Part II</title><content type='html'>&lt;span style="font-family: verdana;"&gt;In my &lt;a href="http://sabermetricresearch.blogspot.com/2011/10/would-mlb-salaries-drop-if-all-players.html"&gt;previous post&lt;/a&gt;, I said I had another argument for why free agent salaries would drop if all players were free agents.  Here it is.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Right now, some players are free agents.  Their value seems to be about $4.5 million per win.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Now, suppose that, instead of *more* players being free agents, *fewer* players become free agents.  In the extreme case, suppose that only one player is a free agent, with a value of 1 WAR.  What happens?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;It seems like his price will be bid up.  But why?  It can't be not scarcity in and of itself.  Because, even in the real world, when there are lots of free agents, there is still be a time when there's only one left, and HIS price still seems to be $4.5 million per win.  There's something else going on.  Here's what I think it is.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;In in a world with few free agents, wins must be distributed without regard to where they can make the most money.  They go to whichever teams made the best draft choices or trades.  That's inefficient, financially, for the league as a whole.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;The Yankees value wins highly, and would like to buy more, but they can't.  The Pirates don't value them much, and would like to sell some to the Yankees.  But MLB rules forbid that.  So the Yankees are stuck with many fewer wins than they want, and the Pirates are stuck with many more.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;So what happens when this one and only free agent goes on the market?  Clearly, the Yankees and Red Sox are desperate for wins, with which they can make a lot of money.  They'll easily outbid the Royals and Pirates.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;But why will the price wind up higher than $4.5 million?  Because $4.5 million is the price that results when teams have the ability to fill a lot more of their needs.  The Yankees may have been blessed with only 80 wins from their farm system, but they've been able to sign free agents to bring themselves up to 95.  And $4.5 million is the value of that 95th win.  The wins before that, they valued much higher (or they wouldn't have bought them).&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;But, in this case, the Yankees are truly stuck at 80.  That 81st win they're thinking of buying must be worth more to them than the 95th win (which is worth $4.5 million).  And so, they'll be willing pay a lot more for it.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;The same logic applies to the Red Sox, and the Phillies, and other teams, and so the price of the single free agent gets bid up well beyond $4.5 million.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;That argument shows that fewer free agents means higher costs.  That means that more free agents means lower costs, which is what we were trying to prove.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Another thing we can conclude is that the more free agents there are, the less competitive balance.  Why?  Because the fewer the number of free agents, the more wins are distributed haphazardly among teams.  Since teams aren't allowed to sell those wins, small-market teams wind up with wins they otherwise wouldn't have bought.  That means more competitive balance.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;If all players were free agents, it's possible that some teams would not find it profitable to buy ANY wins, and would stay with replacement-value, minimum-salary players.  Obviously, that means competitive balance suffers.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;At the risk of my usual overkill, here's another way to look at it: &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Suppose that there is a fixed supply of BMWs.  In a free market, only rich people would own them, because they're of little use to poor people (who can't drive them much because they can't afford much gas or maintenance).  There might be 100,000 rich people in the city who might fancy a nice BMW, but only 10,000 actual cars.  So the BMWs go to the people who bid the most for them, and maybe the auction price is $50,000 each.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Now, suppose MLB calls half of the cars "draft choices", airdrops them randomly on households, and prohibits selling them for what they're worth.  The poor people are happy to have them, since they're free, but they don't get much benefit from them.  On the other hand, the rich families who got them are thrilled: some of them were about to go out and spend $50,000 on one, and now here's one for nothing!&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;But now, that leaves only 5,000 cars left for auction.  And there might still be 99,000 rich people who are interested.  Obviously, with fewer cars available on the market, but not many fewer buyers, the price goes up, maybe to $100,000.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Also, "competitive balance" increases.  It used to be that the rich owned 100% of the BMWs.  Now the rich only own maybe 60% of the BMWs.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;And, none of this would happen if the poor people were allowed to sell their cars to the rich people.  In that case, the cars would still go for $50,000, same as before, and "competitive balance" wouldn't change.  If Bud Selig changed the rules so that the poor could sell to the rich, both the poor and rich would benefit: the poor people would have more money, and the rich people would get their cars cheaper.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Who wouldn't benefit?  The BMW fans rooting for poor people.  These fans don't care how much money their poor friend has: all they live for is the day that their poor friend drives a BMW to the World Series!  Before, when their poor friend had to keep his car, they had some hope.  Now that their poor friend will almost always sell his airdropped car, they have little to no hope.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;-------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Another thing we can conclude is that it must be true, right now, that there do exist poor teams who have BMWs they'd like to sell but can't.  In &lt;a href="http://sabermetricresearch.blogspot.com/2010/04/why-teams-pay-equal-prices-for-free.html"&gt;previous&lt;/a&gt; &lt;a href="http://sabermetricresearch.blogspot.com/2010/04/marginal-value-of-win-in-baseball.html"&gt;posts&lt;/a&gt;, &lt;a href="http://sabermetricresearch.blogspot.com/2010/04/why-teams-pay-equal-prices-for-free_28.html"&gt;I assumed&lt;/a&gt; this didn't happen, to keep things nicely theoretical.  I assumed that even the Royals can earn a little bit more by buying a win or two, at the going rate of $4.5 million.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;But now we have evidence that's not true, that there are some teams who want to sell wins but can't.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Why do I say we have evidence?  Because the &lt;a href="http://sabermetricresearch.blogspot.com/2011/10/would-mlb-salaries-drop-if-all-players.html"&gt;previous post&lt;/a&gt; showed that, today, if all players were free agents, the price would come down.  This proves that there must be at least some BMWs being held by poor households.  Otherwise, it wouldn't matter if you airdropped all the BMWs, or none of them: either way, they'd go to the same rich people at the same price.  The fact that free-agent restrictions are increasing prices proves that MLB could make more money redistributing wins to the rich teams.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;But, that makes an additional assumption: that the fans only care about wins, and not about the fairness of a sport that organizes itself so the Yankees will always be great, and the Royals will always be bad.  As I once &lt;a href="http://sabermetricresearch.blogspot.com/2010/02/does-it-matter-that-yankees-keep-buying.html"&gt;wrote&lt;/a&gt;, I think that even though wins seem to drive revenues today, the fans might get sick of it in the future, and MLB might be better off sacrificing some short-term revenues in favor of keeping the fans interested in the long-term.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;-------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;In summary, I think these arguments lead to a few real conclusions about the current state of MLB:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;1.  More free agents would mean lower salaries for those free agents;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;2.  MLB could make more money by allowing small-market teams to sell players to big-market teams;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;3.  A marginal win is worth an equal $4.5 million not to all teams, but only to the middle-class and rich teams;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;4.  The "arbs" and "slaves" do, in fact, contribute to competitive balance.&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-7599390373201070715?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/7599390373201070715/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=7599390373201070715' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/7599390373201070715'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/7599390373201070715'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/10/would-mlb-salaries-drop-if-all-players_07.html' title='Would MLB salaries drop if all players were free agents?  Part II'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-1371367782070965233</id><published>2011-10-06T18:36:00.004-04:00</published><updated>2011-10-06T18:44:38.420-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='baseball'/><category scheme='http://www.blogger.com/atom/ns#' term='payroll'/><title type='text'>Would MLB salaries drop if all players were free agents?</title><content type='html'>&lt;span style="font-family:verdana;"&gt;If every player became a free agent, would player salaries drop?  Here's an argument that suggests that, yes, they *must* drop.  I'm not sure if it's right, but I can't find a flaw ... maybe you guys can.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Assumptions: all teams are rational, and all teams know the expected value of the future performance of every player.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Here goes.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Suppose that right now, at equilibrium, a marginal win in the major leagues is worth about $X million.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That means that the last win every team buys (in free agency) must give them $X million in extra revenues.  If it gave less, they wouldn't have signed the guy.  If it resulted in more than a $X million increase, the team could probably increase profits by signing even more free agents.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;OK, now the hypothetical: what if EVERY player were a free agent?  That is, what if there were no "arbs" or "slaves", so that every player cost the same price per WAR?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I don't know the exact breakdown of free agents vs. arbs vs. slaves.  But, for the sake of argument, let's suppose that, right now, free agents contribute 1/2 the overall WAR, arbs contribute 1/4, and slaves contribute the other 1/4.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Also, let's assume that arbs make half of what free agents do, and that slaves make $0.  Then, the average WAR costs major league teams exactly $0.625X -- that is, 5/8 of what a free agent WAR costs.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The fraction 5/8 works out to 62.5%.  That's not too far from the fraction of revenues that go to player salaries.  So, if every player were a free agent at $4.5 million per WAR, the 62.5% would go to 100%, and it would turn out that ALL of MLB's revenues would go to payroll.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That can't happen, can it?  If payroll is higher than revenue for the league, it must be the case that payroll is higher than revenue for at least one team.  For that team, it would be better to cut payroll.  The only way it wouldn't be better to cut payroll would be if that team would lose even MORE money with EVERY possible payroll cut.&lt;br /&gt;&lt;br /&gt;But ... minimum payroll is only about $13 million.  It's hard to imagine that even a small-market team wouldn't get $13 million in revenue, even with a replacement level team expected to go 60-102.  (And, in any case, the team could fold, and avoid losing money that way.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, no team would ever spend more on payroll than it got in revenues.  Therefore, payroll can't be more than 100% of revenues.  Therefore, at least one of the assumptions in our model must be false.  The most likely candidate is the assumption that wins would still cost $4.5 million.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So it must be the case that the cost per free-agent WAR would drop, from $4.5 million to something lower, to allow the league to still turn a profit.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-------- &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Does that argument work?  I think it does.  But I have another one if that one doesn't.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-1371367782070965233?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/1371367782070965233/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=1371367782070965233' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/1371367782070965233'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/1371367782070965233'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/10/would-mlb-salaries-drop-if-all-players.html' title='Would MLB salaries drop if all players were free agents?'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-2019152756400234270</id><published>2011-10-03T12:38:00.005-04:00</published><updated>2011-10-03T12:51:32.830-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='economics'/><category scheme='http://www.blogger.com/atom/ns#' term='Warren Buffett'/><category scheme='http://www.blogger.com/atom/ns#' term='taxes'/><title type='text'>Why Warren Buffett is wrong about tax rates</title><content type='html'>&lt;span style="font-family:verdana;"&gt;Note: non-sports post.  Lots of numbers, though!&lt;br /&gt;&lt;br /&gt;-----&lt;br /&gt;&lt;br /&gt;As you've probably heard by now, &lt;a href="http://www.nytimes.com/2011/08/15/opinion/stop-coddling-the-super-rich.html"&gt;Warren Buffett thinks the rich should pay more taxes&lt;/a&gt;.  Much of his argument is based on the fact that wealthy people pay a lower percentage of their income to the government.  Buffett writes,&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;blockquote&gt;"Last year my federal tax bill — the income tax I paid, as well as payroll taxes paid by me and on my behalf — was $6,938,744. That sounds like a lot of money. But what I paid was only 17.4 percent of my taxable income — and that’s actually a lower percentage than was paid by any of the other 20 people in our office. Their tax burdens ranged from 33 percent to 41 percent and averaged 36 percent."&lt;/blockquote&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, like a lot of numbers that get thrown around, this "percent of taxable income" is misleading.  Yes, the number 17.4 is lower than the number 36.  But if you look more closely, Buffett is actually paying at a comparable rate to that of his employees.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The issue that confuses things is that Buffett earns most of his income through corporate dividends (some of it also comes from capital gains, but I'll ignore those for now).  Dividends are paid out of after-tax profits of corporations.  That means, effectively, that the corporation has already paid most of Buffett's tax.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Suppose Buffett's employee earns $50,000, and pays $18,000 in taxes, which is 36 percent.  As for Buffett himself, suppose he owns shares of McDonald's.  Let's say he owns 7,609 of those shares, which correspond to the same $50,000 in McDonald's pre-tax profits.  (My figures are approximate, all rounded from &lt;a href="http://www3.valueline.com/dow30/f5707.pdf"&gt;this&lt;/a&gt; Value Line summary.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, what happens?  Well, McDonald's gets taxed at about a 30% rate.  So that leaves only $35,000 in after-tax profits.  Then, the company pays Buffett a dividend.  It doesn't pay the entire $35,000, because it keeps some to reinvest.  It pays Buffett only about half of that, maybe $17,000.  Then, Buffett pays an additional 15% tax on that $17,000.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So: on his $50,000 in profits, Buffett pays $15,000 through the corporation, and then an extra $2,550 in dividend tax.  That's a total of $17,550 out of $50,000, which is ... about 35%, approximately the same as his employee.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Actually, that's not quite right.  It actually understates Buffett's tax rate.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;As we saw, McDonald's makes $50,000, pays $15,000 in taxes, sends $17,000 to Warren Buffett, and reinvests the remaining $18,000.  But our calculation assumed that the $18,000 part is still Buffett's, but is fully tax paid.  It's not.  Eventually, Buffett will claim that $18,000 personally, either through another dividend, or through a capital gain when he sells his stock.  At that point, he'll pay another 15%.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, really, the bottom line is: Of every dollar Buffett earns through McDonald's, he gets to keep 85% of 70% of it.  That's 59.5%.  So his effective tax rate is 40.5%.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;You can argue that 40.5% is too high, or you can argue that 40.5% is too low.  But you CANNOT argue that Buffett's tax rate is only 17%.  That simply isn't true in any real sense.  It's true only in a misleading technical sense, in that McDonald's pays some of Buffett's tax for him.  The fact that the corporation makes the payment doesn't mean that it doesn't actually come out of Buffett's pocket.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;---------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;This is actually a standard explanation of why taxes on dividends are lower than taxes on "work" income.  In fact, it's actually the stated rationale.  Canada has a complicated method of calculating taxes on dividends, a method that actually takes into account how much corporate tax was already paid.  The explicit idea is that the overall tax rate should be the same, no matter if you earned the income through a direct investment, through an investment in a corporation, or through employment.  It doesn't always work out perfectly, according to my accountant (who tells me it's a little higher through a corporation), but it's close.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;It's well-enough known to economists and accountants and people who work in corporate finance that, like other bloggers who have written about this, I'm surprised Warren Buffett didn't know it.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Now, it could be that he knows it, but doesn't believe that the corporate taxes should "count".  A lot of people somehow believe that corporations should count as separate "people," and so it's fair for both McDonald's and Buffett to pay taxes separately, where the total adds up to more than if Buffett made the money directly.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But that really doesn't make sense.  The fact is that if Buffett owns McDonald's, and McDonald's earns $50,000 from his investment, that $50,000 *belongs to Buffett*, even if, right now, it's classified as corporate earnings.  Corporations, and their earnings, are the property of their owner, and Buffett is that owner.  The fact that the first $15,000 of taxes appears on the corporate tax return instead of Buffett's absoutely does not change the fact that Buffett is paying that tax.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;If you still don't agree with that, if you think that Buffett really isn't paying that 30%, then you might like these two options I'm about to show you.  Those could reduce personal taxes substantially, while still providing the same amount of money to the government!&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Here's number 1.  As of tomorrow, ever person in the country has to start up a corporation, of which he/she is the sole shareholder and CEO.  Also, all employers now have to pay any salary to the corporation.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, what happens is this: your corporation now pays 30% of its income -- your salary -- in corporate tax.  It then pays you the rest as a dividend.  Like Warren Buffett, you now pay only 15% in taxes.  Indeed, you might pay zero in taxes -- according to &lt;a href="http://en.wikipedia.org/wiki/Dividend_tax"&gt;this Wikipedia page&lt;/a&gt;, low-income Americans pay 0% tax on dividends!&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That means that Warren Buffett's employees will now definitely have a lower tax rate than he does -- or at worst, the same rate -- because they are taxed exactly the same way!&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Would you support that new law?  You probably wouldn't.  You'd see that that was just a sneaky way of taxing people the same rate as always, but making it look like they're not paying much tax.  That's exactly what I'm arguing in the case of McDonald's.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Here's number 2.  Right now, people take home a lot less than their salary, because of payroll deductions for income tax.  Someone making $50,000 might actually take home only $35,000.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So here's what we do.  We eliminate payroll deductions and personal income tax completely.  Instead, we implement a corporate payroll tax, which works out to exactly the same as the income tax used to.  So, now, your employer still gives you $35,000 to take home, but pays an extra corporate tax of $15,000.  You pay zero percent tax on the $35,000.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Perfect, right?  Now the income tax on "work" is zero percent.  Warren Buffett is still paying 17.4% on his investments. Situation resolved!&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I bet you think that's ridiculous.  You probably should.  Whether you pay the tax, or your employer pays the tax, it's the same thing: you do $50,000 of work, and the government gets 30% of it.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Well, it's the same thing for McDonald's profits.  Whether McDonald's pays the tax, or Warren Buffett does, or they both do, the fact remains: McDonald's makes $50,000 of profit, and the government gets 40.5% of it.  How much of that 40.5% comes from a cheque from Buffett, and how much comes from a cheque from McDonald's, doesn't matter.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;OK, here's one more suggestion that's less ridiculous.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Change the law a bit, so that when McDonald's pays a dividend to an American taxpayer, they don't pay tax on that part of their profit.  That sounds kind of fair, right?  If McDonald's doesn't get to keep it, they don't pay tax on it.  Just like they can deduct interest that they pay to a bondholder, they can deduct interest that they pay to a shareholder. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Then, when the shareholder receives the dividends, tax them at the normal rate, as if they were "work" income.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Sounds reasonable, right?  Well, it works out to almost same amount of tax collected.  Actually, if Buffett's personal tax rate is around 33%, you can leave out the "almost" -- it's exactly identical.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;McDonald's makes $50K.   They send Warren Buffett $21,428.57.  Buffett pays about 33% of that in taxes, or $6,978, leaving him $14,450.  After paying Buffett, McDonald's has $28,571.43 left.  They pay the government 30% of that, or $8,571.43.  That leaves them $20,000 to reinvest.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That's EXACTLY what's already happening, in today's system where Buffett *appears* to be only paying 15%:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- McDonald's makes $50,000&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- Buffett keeps $14,450 after taxes &lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- McDonald's keeps $20,000 after taxes&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- The government gets $15,550.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;One way you look at it, it looks like Warren Buffett pays only 15% in taxes.  Another way, it looks like he pays 40.5% in taxes.  A third way, it looks like he pays 33% in taxes.  But the results, all three ways, are exactly the same!  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;No matter how you figure it, the bottom line is the same.  Buffett gets 28.9% of the profit, McDonald's keeps 40% of the profit, and the government gets 31.1% of the profit.  The difference is how you do the accounting.  If you're a government that wants to make it look like the rich have it too good, you levy the entire 31.1% on the corporation, so that it looks like Buffett pays zero.  If you're a government that wants to make it look like corporations aren't taxed enough, you levy the entire 31.1% on Buffett, so that it looks like McDonald's pays zero.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In a vacuum, Warren Buffett's supposed 17% tax rate doesn't mean anything.  It's an artifact of how you do the accounting.  To really see what's happening, you have to look at the end result.  And that end result, in this example, is that the government winds up with 31.1% of the money that would otherwise have been Buffett's.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;You may think that's too low.  You might think that's too high.  That's fine.  But either way, the "17%" figure is not relevant, even if Warren Buffett thinks it is.&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-2019152756400234270?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/2019152756400234270/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=2019152756400234270' title='9 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/2019152756400234270'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/2019152756400234270'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/10/why-warren-buffett-is-wrong-about-tax.html' title='Why Warren Buffett is wrong about tax rates'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>9</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-1895743236615985680</id><published>2011-09-27T11:17:00.006-04:00</published><updated>2011-09-28T00:24:15.519-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='gambling'/><category scheme='http://www.blogger.com/atom/ns#' term='forecasting'/><category scheme='http://www.blogger.com/atom/ns#' term='luck'/><title type='text'>How good are sports pundits' predictions?</title><content type='html'>&lt;span style="font-family:verdana;"&gt;According to &lt;a href="http://www.freakonomics.com/2011/09/22/picking-the-nfl-playoffs-how-the-experts-fumble-the-snap/"&gt;this Freakonomics post&lt;/a&gt;, the experts who make NFL predictions aren't very good at it.  Freakonomist Hayes Davenport checked the past three years' worth of predictions from USA Today, Sports Illustrated, and ESPN.  He found that the prognosticators correctly picked only 36% of the NFL division winners.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That doesn't seem that great.  Picking randomly would get you 25%.  And, as the post points out,&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-family:verdana;"&gt;"if the pickers were allowed to rule out one team from every division and then choose at random, they’d pick winners 33% of the time. So if you consider that most NFL divisions include at least one team with no hope of finishing first (this year’s Bengals, Chiefs, Dolphins, Panthers, Broncos, Vikings, and Manning-less Colts, for example), the pickers only need a minimum of NFL knowledge before essentially guessing in the dark."&lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Well, it sounds right, but you have to look deeper.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Winning depends on two things: talent, and luck.  Since luck is, by definition, random, when you predict a winner, the only thing you can do is pick the team with the most talent.  And, despite the 36% figure, there's no evidence that the pundits misjudged the talent.  Because, just by luck, sometimes the best team won't win, and that's unpredictable.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Two days ago, the Buffalo Bills &lt;a href="http://www.guardian.co.uk/sport/2011/sep/26/nfl-round-up-bills-patriots"&gt;upset&lt;/a&gt; the New England Patriots, despite being 7:2 underdogs (I'm &lt;a href="http://www.predictem.com/nfl/odds-conversion.php"&gt;estimating&lt;/a&gt; 7:2 based on the 9-point spread).  What percentage of experts would you expect to have got that right?  Your answer should be zero percent.  Nobody with any knowledge of football should have thought the Bills had a better than 50 percent chance of winning.  On the off-chance that you DO find someone who picked the Bills to win, he's probably a crappy predictor -- maybe he just flips a coin all the time.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In the case where the underdogs wind up winning a game, or finishing first in their division, the truth is the opposite of what Freakonomics implies.  In that case, the higher percentage of correct predictions, the WORSE the pundits.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So what does that 36% figure actually tell you?  By itself, absolutely nothing.  You have no idea, looking at that bare number, how good the pundits are.  It depends.  If it was *all* bad teams that won, a number as high as 36% means the experts are wrong a lot, but 36% of their bad picks happened to turn out OK.  If it was all good teams that won, a number as low as 36% means the experts are wrong a lot -- they must have picked 64% bad teams.  And, if it was exactly 36% of the best teams that won, but those aren't the cases where the experts were right, then, again, the experts are wrong a lot.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, if it was exactly 36% of the best teams that won, and those are exactly the cases where the experts were right ... then the experts are perfect predictors.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So you can't just look at a number.  36% may be bad, but it might be awesome.  It depends what actually happened.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;However: while this logic applies to picking outright winners of games or divisions, it doesn't apply to picking against the spread.  Why not?  Because, against the spread, the presumption is that the odds are close to 50/50.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In the Patriots/Bills game, the odds were roughly 77/22.  Some experts might have pegged the Bills as having a 25% chance of winning, while some may have estimated only 20%.  Still, both pundits would have obviously still bet on the Patriots.  The fact that New England ended up losing isn't really relevant.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But against the +9 spread, you might have a reasonable difference of opinion.  One expert might figure the true spread should be +8.5, and another might figure&lt;/span&gt;&lt;span style="font-family:verdana;"&gt; +9.5.  So the first guy takes the Bills, and the second takes the Patriots.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;On bets that are approximately 50/50, reasonable experts can disagree.  On bets that are 77/22, they cannot.  So, when it's 50/50, a higher percentage could, in fact, mean better predictions.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Still, there's lots of luck there too.  If a pundit predicts all 256 games in a season, the (binomial) standard deviation of his success rate will be a little over 3 percentage points.  So one predictor in 20 will be over 56%, or under 44%, just by luck.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That means it's still hard to figure out who's a "better" expert and who's not.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The post goes on to criticize the pickers for risk aversion.  Why?  Because, it seems, the experts tended to pick the same teams that won last year.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Um ... why is that risk aversion?  It stands to reason that the teams that won before are more likely to still be pretty good, so they're probably reasonable picks.  But, the author says,&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-family:verdana;"&gt;"Over the last fifteen seasons, the NFL has averaged exactly six new teams in the playoffs every year, meaning that half of the playoff picture is completely different from the year before. ... Given that information, a savvy picker relying on statistical precedent would choose six new teams when predicting the playoffs." &lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That doesn't follow at all!  Just because I know six favorites will lose doesn't mean I should pick six underdogs!  That would be very, very silly.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;It's like predicting whether John Doe will win the lottery this week.  The odds say no, and that's the way I should bet.  And it's the same for Jane Smith, or Bob Jones.  If there are a million people in the lottery, I should pick them all to lose.  I'll be right 999,999 times, and wrong once.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, according to Freakonomics, I should arbitrarily pick one person to win!  But that's silly ... if I do that, I'll almost certainly be right only 999,998 times!&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;It's not exactly the same, but this logic reminds me of the Jeff Bagwell prediction controversy.  In the fall of 1990, Bill James produced forecasts of players' batting lines for 1991, and it turned out that Bagwell wound up with the highest prediction for batting average.  It was said that Bill James predicted Jeff Bagwell to win the batting title.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, obviously, he did not.  At best, and with certain assumptions, you might be able to say that James had Bagwell with the *best chance* of winning the batting title.  But that's different from predicting outright that he'd win it. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Back to the lottery example ... if I notice that John Doe bought two lottery tickets, but the other 999,999 people only bought one, I would be correct in saying that Doe has the best chance to win.  That doesn't mean I'm *predicting* him to win.  He still only has a 1 in 500,000 chance.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Finally, in their introduction to their post, Levitt and Dubner say, &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-family:verdana;"&gt;" ... humans love to predict the future, but are generally terrible at it."&lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I disagree, especially in sports.  Yes, very few people can beat the point spread year after year.  But that doesn't show that the experts don't know what they're doing.  It shows that they DO!  Because, after all, it's humans that set the Vegas line, the one that's so hard to beat!&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I'd argue 180 degrees the opposite.  In sports, humans, working together, have become SO GOOD at predicting the future, that nobody can add enough ingenuity to regularly beat the consensus prediction!&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-1895743236615985680?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/1895743236615985680/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=1895743236615985680' title='10 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/1895743236615985680'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/1895743236615985680'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/09/how-good-are-sports-pundits-predictions.html' title='How good are sports pundits&apos; predictions?'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>10</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-2536437986329910109</id><published>2011-09-22T12:54:00.004-04:00</published><updated>2011-09-22T18:11:07.776-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='statistics'/><category scheme='http://www.blogger.com/atom/ns#' term='Cy Young'/><category scheme='http://www.blogger.com/atom/ns#' term='pitching'/><category scheme='http://www.blogger.com/atom/ns#' term='bayes'/><category scheme='http://www.blogger.com/atom/ns#' term='baseball'/><category scheme='http://www.blogger.com/atom/ns#' term='BABIP'/><title type='text'>The Bayesian Cy Young</title><content type='html'>&lt;span style="font-family:verdana;"&gt;At Fangraphs, Dave Cameron and Eric Seidman have a nice &lt;a href="http://www.fangraphs.com/blogs/index.php/a-discussion-about-evaluating-pitchers/"&gt;discussion&lt;/a&gt; (hat tip: &lt;a href="http://www.insidethebook.com/ee/index.php/site/comments/talking_pitching/"&gt;Tango&lt;/a&gt;) on who's the better Cy Young candidate: Clayton Kershaw, or Roy Halladay?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Part of the discussion hinges on BABIP: batting average on balls in play.  As Voros McCracken &lt;a href="http://www.baseballprospectus.com/glossary/index.php?search=babip"&gt;discovered&lt;/a&gt; years ago, pitchers generally don't differ much in what happens when a non-home-run ball is hit off them.  Most of the overall differences between pitchers, then, are due to the fielders behind them, but mostly due to luck.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So far in 2011, Clayton Kershaw has a BABIP of .272, which Eric decribes as "absurdly low."  Still, Eric thinks it might actually be skill rather than luck, since since .272 it's not that much different than Kershaw allowed in previous years.  Dave argues that Kershaw's three seasons is still a fairly small sample size, and points out that most of his BABIP advantage comes from his record at home (he's about average on the road).&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Anyway, my point isn't to weigh in to which one is right -- they do a fine job hashing things out in their discussion.  What I want to talk about is something they both seem to agree on: that it's important whether the BABIP is luck or skill.  If it's luck, that reduces Kershaw's Cy Young credentials.  If it's skill, he's a better candidate.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Seems reasonable, and I don't necessarily disagree.  But let's see where that logic leads.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Because, there are other kinds of luck, or factors that pitchers can't control.  For instance, there's park (which is usually already adjusted for in WAR, the statistic Eric and Dave cite most in this debate).  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;There's also quality of opposition batting.  It's probably not too hard, if you have good data, to figure out how much either of the pitchers gained by being able to pitch to inferior hitters.  You could also check if one of them had the platoon advantage more often.  And, if one of them pitched more at home than the other one did.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;We'd probably all agree, right, that you'd want to adjust for those kinds of things if we had the information?  To be clear, I'm not criticizing Dave or Eric for not spending hours figuring this stuff out.  I'm just saying that if you have the data, it's relevant in comparing the pitchers.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;There are other things too, that eventually we'll be able to figure out, that we can't right now because (as far as I know) the research hasn't been done.  Suppose Kershaw throws a pitch at a certain speed, with a certain break, on a certain count.  And, someday, we'll know that kind of pitch is swung on and missed 30% of the time, called a ball 5% of the time, called a strike 10% of the time, fouled off 10% of the time, and hit in play 45% of the time with an OPS of .850.  Maybe, overall, that pitch is worth (say) +0.05 runs (in favor of the pitcher).&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Once we have that kind of information, we can check for "batter swing luck".  If it turns out that batters just randomly happened to go +0.03 on that pitch from Kershaw this season, instead of +0.05, we should credit him the extra 0.02, right?  He delivered a certain performance, and the batters just happened to get a bit lucky on it, as if his BABIP was too high.  (This measure would probably substitute for BABIP: it includes balls in play, but also home runs, swings-and-misses, and walk potential.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So we'd adjust Kershaw and Halladay for how lucky the batters were on those swings.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That's not unrealistic, and it'll probably eventually happen, to some degree of accuracy.  Here's one that probably won't, at least not for a few decades, but it works as a thought experiment.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Imagine we hook a probe to every batter's brain, so on every pitch we can tell if he's guessing fastball or curve, and if he's guessing inside or outside.  After a couple of years of analyzing this data, we figure that when he guesses right, it's worth +0.1 runs (for the batter), when he guesses half-right, it's worth 0, and when he guesses wrong, it's -0.1.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That again, is something out of the control of the pitcher (especially if both batter and pitcher are randomizing using game theory).  So you'd want to control for it, right?  If Halladay is having a good year just because batters were unlucky enough to guess right only 23% of the time instead of 25%, you have to adjust, just like you'd adjust for a lucky BABIP.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;This will change the definition of "batter swing luck," but not replace it.  First, the batter may have been lucky enough to guess right, which is worth something.  Then, he might have been lucky enough to get better than expected wood on the ball even controlling for the fact that he guessed right.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So you've got lots of sources of luck:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- park&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- day/night&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- distribution of batters&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- platoon luck&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- BABIP luck&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- batter swing luck&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-- batter guess luck&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;You'd want to adjust for all of these.  Right now, as I understand WAR, we're adjusting for park and BABIP.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;What about the others?  Well, we can't really adjust for those.  We *want* to, but we can't.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, we make do with just park and BABIP.  Still, no matter how many decimal places we go to with the debate on Kershaw/Halladay, we're still only going to have our best guess.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;At least we can argue that if all the other things are random, we should still be unbiased.  Right?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Well, not really.  From a Bayesian standpoint, we have a pretty good idea who had more luck.  It's much more likely to be Kershaw.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Why?  Because Halladay's performance is much more consistent with his career than Kershaw's.  Kershaw's a good pitcher, but wasn't expected to be *that* good.  Halladay, on the other hand, is having a typical Halladay season.  Well, a bit better than typical, but not much.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I'd be willing to bet a lot of money that if you found 50 pitchers who had a better-than-career season, by at least (say) 1.5 WAR, you would find that those 50 pitchers had above-average BABIP luck.  It stands to reason.  I won't make a full statistical argument, but here's a quick oversimplification of one.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;A pitcher can have his talent go up or down from year to year.  He can have his luck go up or down from year to year.  That's four combinations.  Only three of them are possibly consistent with a big improvement in WAR: talent up/luck up; talent up/luck down; talent down/luck up.  Two of those have his luck going up.  So, two times out of three, the pitcher was lucky.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The argument applies to *all* sources of luck.  Even after taking BABIP into account, if a pitcher's adjusted performance is still above his career average, he's still more likely to have had good luck than bad, in other ways (batter swings, say).  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I don't have an easy way to quantify this, but still I'd give you better-than-even odds that, stripping out all the above, Halladay is performing better than Kershaw -- even after adjusting for park and BABIP.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;If you have two players with similar, outstanding performances, the player with the better expectation of talent is probably the one who's actually having the better year.  To believe that Kershaw was really likely to have had a better year than Halladay, you really need him to have put up *much* better numbers.  Either that, or you need a way to actually work out all the luck, and prove that the residual still favors Kershaw.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I should emphasize that I am NOT talking about talent here.  I think most people would agree that Halladay is still more talented than Kershaw, but would nonetheless argue Kershaw might still be having the better season.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, what I'm saying is, no, I bet Kershaw is NOT having a better season, even if his numbers look better.  I'm saying that it's likely that Kershaw *is actually not pitching better*.  If we had the data, it's more likely than not that we'd see that batters are just having bad luck -- not only are they (perhaps) hitting the ball directly to fielders, as BABIP suggests, but they're probably swinging and missing at hittable pitches.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;---------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Another way to look at it: if two pitchers have mostly the same results, but one has better stuff, what does that mean?  It means that the pitcher with the better stuff must have been unluckier than the pitcher with the worse stuff.  In other words, the batters facing the better stuff must have been luckier.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;We don't know for sure, of course, that Halladay had better stuff than Kershaw.  But history suggests that's more likely.  And so, the odds are on the side of Kershaw having been luckier than Halladay.  How much so?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I don't know.  One mitigating factor is that Kershaw is young, so you'd expect more of his improvement to be real.  But, still, a small improvement is more likely than a large improvement, so the odds are still on the side of postive luck over negative luck.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;---------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Does that take some of the fun out of the Cy Young?  I think it certainly does make it a little bit less entertaining, at least until we have better data.  That's because, as long as we remain ignorant of a significant amount of luck, it requires a much bigger hurdle to award the honor to anyone other than Halladay.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;This is a bit counterintuitive, but it's true.  Suppose a good but not great pitcher -- Matt Cain, say -- has almost exactly the same stat line as Roy Halladay, including BABIP, but is actually better in some categories.  Perhaps he a couple of extra strikeouts, and a couple fewer walks.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;From the usual arguments, there would be absolutely no debate that Cain's season is better, right?  He's better than Halladay in some categories, and the same as Halladay in all the others.&lt;br /&gt;&lt;br /&gt;But ... if you're trying to bet on which player actually pitched better after removing all the luck, you'd still have to go with Halladay.&lt;br /&gt;&lt;br /&gt;-----&lt;br /&gt;&lt;br /&gt;UPDATE: on his blog, Tango writes,&lt;br /&gt;&lt;span style="font-size:100%;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-size:100%;"&gt;&lt;span class="Apple-style-span" style="color: rgb(0, 0, 0); font-family: verdana; font-size: 13px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: 2; text-indent: 0px; text-transform: none; white-space: normal; widows: 2; word-spacing: 0px; background-color: rgb(255, 255, 255);"&gt;&lt;/span&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-size:100%;"&gt;&lt;span class="Apple-style-span" style="color: rgb(0, 0, 0); font-family: verdana; font-size: 13px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: 2; text-indent: 0px; text-transform: none; white-space: normal; widows: 2; word-spacing: 0px; background-color: rgb(255, 255, 255);"&gt;Aside to Phil: Marcel had Kershaw with a 3.07 ERA for 2011, and Halladay at 3.04.  So, while you make great points in your article, you didn’t have the right examples!  Sabathia and Verlander would have been better examples.&lt;/span&gt;&lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Oops!  I'll just leave it the way it is for now, but point taken.&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-2536437986329910109?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/2536437986329910109/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=2536437986329910109' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/2536437986329910109'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/2536437986329910109'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/09/bayesian-cy-young.html' title='The Bayesian Cy Young'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-3653096617568326255</id><published>2011-09-18T14:29:00.011-04:00</published><updated>2011-09-18T14:53:43.333-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='sino-forest'/><category scheme='http://www.blogger.com/atom/ns#' term='options'/><category scheme='http://www.blogger.com/atom/ns#' term='OSC'/><title type='text'>Stock market integrity and the OSC's bizarre Catch-22</title><content type='html'>&lt;span style="font-family:verdana;"&gt;Warning: non-sports, non-numbers post.  Has to do with securities regulation and put options and bureaucratic illogic.  Still, should be comprehensible to all.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;-------&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The Ontario Securities Commission (&lt;a href="http://www.osc.gov.on.ca/en/home.htm"&gt;OSC&lt;/a&gt;) is a government body that regulates capital markets (i.e., stocks, bonds, options, etc.).  It declares, among others, a responsibility to "&lt;a href="http://www.osc.gov.on.ca/en/About_about_index.htm"&gt;foster fair and efficient capital markets and confidence in the markets&lt;/a&gt;."  Recently, it made a decision that seems so obviously unfair and wrong that it has the opposite effect -- I am now materially *less* confident of the integrity of the market than I was before.  It's not just the precedent this decision sets, but my fear that the OSC just doesn't get it.  What unfair decisions will they make next, and is my retirement portfolio in jeopardy?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Of course, I might be wrong in my logic.  Please correct me if I am.  If you're more up-to-date than I am in how securities regulations work, let me know and I'll post corrections.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I'm going to start with an analogy that illustrates the issue.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I buy a house and a piece of land, for $400,000.  I insure the house.  There is a regulation on the books, quite reasonable, that it is not legal to sell land that is known to be contaminated, or to sell a house that is known to be uninhabitable.  But the house and land are fine, and the sale goes through.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Later, an arsonist burns down half the house and contaminates the land.  The state comes in and begins an investigation.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I contact the insurance company.  They agree I'm covered for $400,000.  They prepare to cut me a cheque.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, before anything else can happen, the regulator steps in.  "You can't do that," they say.  "When you settle an insurance claim, it means the house and the land transfer to the insurer.  But the house is uninhabitable, and the land is contaminated.  So, you can't transfer the house.  Therefore, the settlement is illegal."&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In any case, the insurer doesn't have to pay.  He walks away happy, even keeping my premium, and I'm stuck with the loss.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;It's a kind of Catch-22.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;---------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Not fair, right?  And *obviously* unfair.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;What's happened is that the regulator is blindly sticking to a regulation that's not always right.  Sure, it might be a good idea to prohibit the sale of contaminated land *as a general rule*.  But there are exceptions.  This is an exception.  In fact, it's an exception where it's exactly the opposite -- where it's absolutely WRONG to prohibit the sale. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;It's like, "don't jump out of the fifth floor window, you'll die."  Sure.  But if the building is on fire, and the smoke is choking you, and the firemen are holding a net below and yelling at you to jump ... then the rule reverses.  "Don't NOT jump out the fifth floor window, you'll die." &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;---------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So here's the real story, which follows the analogy quite closely.  Some background first.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;a href="http://www.sinoforest.com/"&gt;Sino-Forest&lt;/a&gt; is a Canadian forestry company that does all its business in China.  Its &lt;a href="http://www.google.com/finance?q=TSE%3ATRE"&gt;stock&lt;/a&gt; went from $1 to $24 over the last ten years or so, as it grew and bought forests in China for harvesting.  In June, a small company called "&lt;a href="http://www.muddywatersresearch.com/"&gt;Muddy Waters&lt;/a&gt;," run by a man named Carson Block, &lt;a href="http://www.muddywatersresearch.com/research/tre/initiating-coverage-treto/"&gt;put out a report&lt;/a&gt; alleging, with evidence, that Sino-Forest was a fraud -- it didn't really own the forests it claimed it did.  The stock dropped immediately from $24ish, and fluctuated between $5 and $8 for the next two months.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The company claimed innocence and &lt;a href="http://www.sinoforest.com/Uploads%5CSino%20-%20Press%20Release%20re%20response%20to%20Muddy%20Waters%20-%20Jun%203%202011%20final.pdf"&gt;hired independent auditors&lt;/a&gt;, but no information was forthcoming and the company cited documentation delays.  The stock dropped further.  At &lt;a href="http://www.sinoforest.com/Uploads%5CRls%20-%20SFC%20re_OSC%2008Jun2011%20Final.pdf"&gt;one point&lt;/a&gt;, the OSC said it was investigating.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Finally, in August, the &lt;a href="http://www.theglobeandmail.com/globe-investor/sinoforest/osc-alleges-fraud-at-sino-forest/article2143054/"&gt;OSC claimed&lt;/a&gt; there was evidence of fraud.  They did not give details, and speculation is that they got their information from the auditors, and from Muddy Waters.  The OSC immediately prohibited further trading in the stock, and ordered the CEO to resign.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;A few hours later, the OSC was told it didn't have the right to order any resignations.  Belatedly realizing it had overstepped its authority, the OSC &lt;a href="http://www.thestar.com/business/article/1045289--osc-rescinds-stunning-demand-for-the-resignation-of-sino-forest-executives"&gt;retracted&lt;/a&gt; that part of its order.  Nonetheless, the CEO voluntarily &lt;a href="http://www.reuters.com/article/2011/08/28/us-china-accounting-sinoresignation-idUSTRE77R22A20110828"&gt;stepped down&lt;/a&gt; a few days later.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In addition to shares of stock, there were also "put options" trading on Sino-Forest.  A &lt;a href="http://www.google.ca/search?q=put+option&amp;amp;ie=utf-8&amp;amp;oe=utf-8&amp;amp;aq=t&amp;amp;rls=org.mozilla:en-US:official&amp;amp;client=firefox-a"&gt;put option&lt;/a&gt; is a contract between two parties.  One party pays the other some money -- say, $1 per share -- and, in return, receives the right (but not the obligation) to force the other party to buy his shares by a certain date, at a certain price.  Say, $20 by August 19.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;The idea is that you can use a put option as insurance.  If you own 100 shares, with a value of $2,000, and you're scared the price will drop, you can buy 100 put options for $100.  Then, even if the stock drops, you know you can still get $2,000 for them on August 19.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;It's exactly like insurance on a house.  You pay $100 for the insurance, and if anything bad happens to your house between now and August 19, the insurance company will take the house away and give you a $2000 settlement.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, at this point you can guess what happened next.   The OSC prohibited the contracts from being exercised.  The OSC said, "I don't care if you bought the insurance.  Settlement means that you would have to sell the shares to someone else, and we've prohibited you from doing that."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;And, of course, August 19 has come and gone.  (There are contracts with other expiry dates too -- over different months -- but August 19 was one of them.)   The contracts have expired and are now worthless.  The OSC blindly followed the rule "it's bad for markets if shares of a fraudulent company are bought."  That's not always true.  In this case, it's WORSE for the market if the shares are NOT bought.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Normally, people don't want to buy a company when it might be a fraud.  In this special case, people DO want to buy a company ONLY when it's a fraud.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;This is so obviously wrong that anyone should understand that it's unfair.  But, especially, the OSC, which is the regulator, and supposed to be an expert in markets, and how they work, and investor confidence ... how did they make a mistake like that?  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Not only is it unfair, but ... if this precedent holds, the entire market for put options falls apart.  How do they not get that?  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Unless it's me that doesn't get it.  Which is certainly possible.  If you're a Bayesian, or even if you have normal common sense, you're probably asking yourself: who's more likely to be grossly wrong: Phil, the amateur investor, or the expert regulators at the OSC?  If you're Bayesian, you should probably figure that must be me who's wrong, especially when I tell you that I was unable to find anyone in the financial press complaining about any of this.  To my knowledge, I'm the first and only one.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But ... I just can't see how this could be right.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;---------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Well, this past week, they held a hearing to revisit the decision.  I thought they'd say, "oops, sorry, we screwed up," and fix it.  But confronted with all the arguments (as I presume they were), they STILL didn't get it.  They only "fixed" part of it.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;a href="http://www.bloomberg.com/news/2011-09-15/canadian-regulator-agrees-to-modify-sino-forest-cease-trade-ban.html"&gt;What they did&lt;/a&gt; was to say, if you already own the shares, then, OK, you can sell them to the other party for the $20 to complete the transaction and collect on your insurance.  But if you happen to have the insurance contract, but you don't have the shares, because you were meaning to buy them later, then you're still SOL.  You can't go out and buy the shares from someone else, so that you can collect on your investment.  Instead, you have to let your options expire worthless.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Their logic appears to go something like this: "The put option is a contract to sell shares.  So we'll make an exception and let you sell shares if you have a previous contact.  But we won't let you BUY the shares to sell, because you only have a contract to sell, not to buy.  Besides, if you bought a contract to sell, but didn't own any to sell, you're just speculating, so we don't feel much sympathy for you."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;But, that's ridiculous.  It's a common investment strategy to buy put options on stocks you don't own, if you expect them to drop.  Sometimes it's straight speculation that there's fraud, but sometimes it's part of a more complex hedging strategy.  Maybe you own a business in China, and you want to insure against a bad Chinese economy, and the easiest way is to buy puts on Sino-Forest.  If China goes downhill, and Sino-Forest with it, you buy the worthless Sino-Forest shares, and sell them according to your contract, which gives you the insurance money you need.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size:85%;"&gt;&lt;span style="font-family:verdana;"&gt;(* In any case, since when is speculation something that anyone should be trying to avoid?  Speculation is a good thing, as economists will assure you.  And, securities regulators, being experts in how capital markets work, know that.  Speculators keep the market liquid and efficient, moving prices closer to their true value.  I personally would be hesitant to invest without speculators.  Right now, I can be pretty sure that I'm paying a fair price for any stock I buy -- if the price were too high, speculators would have stepped in before and sold short to push the price down.  Without speculators, I'd be more likely to be getting ripped off.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;( But I digress.  Oh, and while I'm digressing, a disclosure: I own shares of Sino-Forest, but have never had any Sino-Forest option positions.)&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;What the OSC has done with its fix is actually worse than what it did originally.  It said, "we'll let you enforce your contract if we approve of your investment strategy, but we will screw you around if we don't."  That's something the OSC has no business doing, favoring some parties but not others based on the capricious illogic of its bureaucrats.  It's also the worst thing you can do for market confidence -- signalling to the world that the rules are unpredictable based on how the regulator feels about you.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;For my part, I have bought put options before, on companies I thought were grossly overvalued.  I'll be damned if I'm going to do that again, at least in Canada.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-3653096617568326255?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/3653096617568326255/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=3653096617568326255' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/3653096617568326255'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/3653096617568326255'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/09/stock-market-integrity-and-oscs-bizarre.html' title='Stock market integrity and the OSC&apos;s bizarre Catch-22'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-156101675988809223</id><published>2011-09-16T10:54:00.005-04:00</published><updated>2011-09-16T11:10:46.954-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='hockey'/><category scheme='http://www.blogger.com/atom/ns#' term='McCown'/><category scheme='http://www.blogger.com/atom/ns#' term='NHL'/><category scheme='http://www.blogger.com/atom/ns#' term='luck'/><title type='text'>Bob McCown on "puck luck"</title><content type='html'>&lt;span style="font-family:verdana;"&gt;I just got a copy of "&lt;a href="http://www.amazon.ca/McCowns-Law-Greatest-Hockey-Arguments/dp/0385666764/ref=sr_1_2?ie=UTF8&amp;amp;qid=1316184905&amp;amp;sr=8-2"&gt;McCown's Law -- The 100 Greatest Hockey Arguments&lt;/a&gt;."  I'm only on number 1, but already Bob McCown nails it:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-family:verdana;"&gt;" ... hockey is enveloped by a culture that demands that everything be rationalized or explained ...&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;... it's hilarious the way fans react when their team loses a close game.  You'd swear the players couldn't do anything right.  And yet, when the same team wins a  game by a one-goal margin, it's showered in platitudes.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So here's an experiment I'd love to perform sometime.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Let's take the tape of a five-year-old NHL game -- any game -- in which the score ended 3-1.  Now, let's edit out the goals and leave all the rest, so that about 59 of the 60 minutes are there to watch.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Now show it to an audience of hockey fans and see if they can guess who won.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I bet they couldn't, because aside from the moments in which the goals are scored, an awful lot of hockey games are nothing but back-and-forth flow, the trading of chances and puck luck.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;To have some fun, let's try the same experiment with a bunch of reporters.  Then, let's show them the stories they wrote about that exact game.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Most nights in hockey, both teams skate hard, check hard, and go to the net ...  And one of them has a puck hit the post and bounce into the net.  And the other hits a post and watches it bounce wide.  On more nights than you'd believe, the difference is as simple as that ...&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In fact, I would say that puck luck, as it is often called, decides roughly half of the close games in the National Hockey League."&lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Absolutely right.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I'm looking forward to the rest of  McCown's book.  I'll probably find more things to post about later, if  the quality of the first chapter is any indication.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;-----&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Well, one picky point on McCown's essay: I don't know what "decides roughly half of the close games" means.  If a team wins 2-1, what does it mean that luck decided it, or not?  That's a bit vague.  I know what McCown means to say, and I agree with it on a gut level, but ... I'm not comfortable with phrasing it that way, because I like to have a precise definition.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So let's arbitrarily make one up.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Suppose you did something like what McCown suggested -- you edited a tape of the game to remove the results all the shots and "dangerous" scoring chances (that might or might not have resulted in shots).  Then you somehow computed the win probability based only on the situations that appear on tape.  Maybe you give a breakaway an expectation of 0.3 goals.  And for a point blank slot possession, you assign 0.5 goals.  And a slapshot from the point, 0.1 goals.  And so on.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;You compute an expected score based on that.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Then you look at the real score.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;1.  If the "wrong" team won, it must have done so by "puck luck".  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;2.  If the "right" team won, but its expectation was to win by less than one goal, then you define that as a win by "puck luck".&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;That might actually be possible to partly figure out.  The NHL website gives all the &lt;a href="http://www.nhl.com/scores/htmlreports/20102011/PL030417.HTM"&gt;shots&lt;/a&gt;, by distance and type, and &lt;a href="http://hockeyanalytics.com/shot-quality/"&gt;Alan Ryder has done lots of research&lt;/a&gt; on how to get scoring probabilities for shots.  However, the NHL doesn't list other kinds of scoring chances aren't listed, so you'd have to stick to *shot* "puck luck".&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;In any case, even if you had scoring chances, there would still be luck unaccounted for, in the development of the play.  A breakaway might have itself been caused by a defender missing an easy puck.  A good chance was caused by three low-odds passes that happened to click.  And so on.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;So, let's try again.  How about, a game is decided by "puck luck" if:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;You edit the game per McCown's suggestion, and show it to reporters.  You make them bet their own money on who won, against each other at odds that they negotiate.  If the overall odds wind up between 60:40 and 50:50, or the overall underdog won the game, then that's a game decided by "puck luck".&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I'm not suggesting you actually do this, but that you do a thought experiment and estimate what would happen.  There are obviously some games where one team absolutely dominates (and wins).  The reporters would obviously get the right answer here ... they'd need 90:10 odds or something to back the underdog.  But there are obviously games that would look like toss-ups.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Any other suggestions for how to define that in a way where we could actually talk about how to get an answer?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;--------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;As an aside, I think this kind of "replay" technique has all kinds of sabermetric applications.  To evaluate referee performance, take a tape of the foul, do some digital processing to obscure the players and teams involved, and get referees to judge it.  To scout a pitcher, you can avoid being biased by the result of the pitch (a good pitch can still be hit for a home run) by digitally removing the result (and perhaps extrapolating/animating the last few inches, if the pitch was actually contacted).  And so on.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;I think I proposed this thought experiment once, along the same lines.  Suppose you had a time machine.  You go 40 years into the future, and you go to MLB.com, and you download video of every inning of every game.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;You take 10 players across the spectrum of hitting talent: the equivalent of Albert Pujols, the equivalent of John McDonald, the equivalent of Ichiro Suzuki, and so on.  You carefully select 200 AB from each of them, so that those 200 AB show the same batting line for each player, and put them on tape.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Then, you bring those tapes back to the present day, and show them to all the scouts.  Would the scouts be able to tell the good players from the bad players?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-156101675988809223?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/156101675988809223/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=156101675988809223' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/156101675988809223'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/156101675988809223'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/09/bob-mccown-on-puck-luck.html' title='Bob McCown on &quot;puck luck&quot;'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-6114867393757228007</id><published>2011-09-13T18:07:00.004-04:00</published><updated>2011-09-13T18:13:12.035-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='statistics'/><title type='text'>Logical thinking</title><content type='html'>&lt;span style="font-family:verdana;"&gt;Great example of statistical logic, from &lt;a href="http://www.johndcook.com/blog/2008/01/21/selection-bias-and-bombers/"&gt;John D. Cook&lt;/a&gt; via &lt;a href="http://marginalrevolution.com/marginalrevolution/2011/09/the-importance-of-selection-effects.html"&gt;Alex Tabarrok&lt;/a&gt;:&lt;/span&gt;&lt;br /&gt;&lt;blockquote&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;"During WWII, statistician Abraham Wald was asked to help the British decide where to add armor to their bombers.  After analyzing the records, he recommended adding more armor to the places where there was no damage!"&lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;span style="font-family:verdana;"&gt;Explanation at the above links.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: rgb(17, 17, 17); font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: 22px; orphans: 2; text-indent: 0px; text-transform: none; white-space: normal; widows: 2; word-spacing: 0px; background-color: rgb(255, 255, 255);font-family:Arial,'Helvetica Neue',Helvetica,sans-serif;font-size:14px;"  &gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-6114867393757228007?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/6114867393757228007/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=6114867393757228007' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/6114867393757228007'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/6114867393757228007'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/09/logical-thinking.html' title='Logical thinking'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-4519332269746842764</id><published>2011-09-08T23:11:00.005-04:00</published><updated>2011-09-08T23:16:18.235-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='income inequality'/><category scheme='http://www.blogger.com/atom/ns#' term='wealth inequality'/><title type='text'>On inequality of wealth</title><content type='html'>&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;Note: Non-sports post.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;----&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;In the United States, the top 10% of the population earns 30% of the income.  And the top 10% of the population owns 70% of the wealth.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;Statistics like these seem to be popping up all over the place lately ... someone I know posted one on Facebook a few days ago, and there was a newspaper article or two in the last month.  I'm not sure what happened to bring all this up.  (If anyone has links from the last week or two, let me know.  I can't find them at the moment.)&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;A couple of years ago, taking about the Gini Coefficient, I made a &lt;a href="http://sabermetricresearch.blogspot.com/2009/03/gini-coefficient.html"&gt;bunch of arguments&lt;/a&gt; about why the distribution of income doesn't matter much.  I think it matters a bit, but not much.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;Here, I'm going to concentrate on the distribution of *wealth*.  For wealth, I'm going to argue that, given a particular distribution of income, the distribution of wealth is almost completely meaningless as a moral issue, or an issue of people's well being.  That is: criticize, if you want, the fact that the top 10% get 30% of the income.  But given that income distribution, *it doesn't matter* how much of the wealth the top 10% own: whether it's 10%, 30%, 70%, or 99%.  &lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;-----&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;The difference between income and wealth is that income is a rate, how much you earn in a particular year.  Wealth is the total amount that you possess at a specific time.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;How does anyone gain wealth?  Other than inheritance (which we'll disregard here), you have to save or invest some of your income.  You can earn ten million dollars one year, but if you blow it all on cocaine and hookers, your wealth will be zero.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;So your wealth is a result of three things: (1) your income, (2) the amount you save, and (3) rate of return on the amount you save.  As I said, if you hold (1) as fixed, wealth is affected by only (2) and (3).&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;Suppose you have two people, John and Mary.  They have exactly the same education, and they graduate into exactly the same job, paying $50,000 a year.  John spends all his money every year.  Mary saves an annual $6,000 in a retirement fund, earning 5%, and spends the rest.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;What happens?  After 40 years, John has $0 in wealth.  Mary has $725,000.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;Is it fair to complain about that?  I don't think so.  Sure, Mary is now (fairly) wealthy while John has to live on just Social Security.  But, in the past, John lived much better than Mary, to the tune of $240,000 -- $6,000 a year for 40 years.  Some people's tendency would be to take some of Mary's money and give it to John.  But that wouldn't be fair.  It would actually be quite an injustice.  Mary deliberately lived significantly worse than John for 40 years, just so she could have a better retirement.  Giving that money to John would *compound* the inequality, wouldn't it?  It would take from the (formerly) poor lifestyle and give to the (formerly) rich lifestyle.  It would compensate for the future where Mary spends more than John, but not compensate for the past, when John spent more than Mary.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;Really, even though Mary has more money than John, over their lifetimes, they're equal.  Thirty-five years ago, John spent $4,000 on a new state-of-the-art TV.  He knew, when he bought the TV, that $4,000 then would be the equivalent of $22,000 at retirement.  He bought the TV anyway.  Nothing wrong with that.  He chose, freely, to live $4,000 richer than Mary back then, in exchange for living $22,000 poorer than Mary later.  Mary also knew the terms of the trade, and made the other choice.  &lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;But, over their lifetime, they are exactly equal.  $4,000 can buy a lot of things: a vacation, a TV, a boat, a motorcycle, or a retirement fund of $22,000.  If John had bought a TV, and Mary had bought a boat, could anyone argue that Mary is richer than John because she has a boat?  Of course not -- because, by the same token, John has a TV of equal value.  &lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;The same thing applies here: if John has a TV that costs $4,000, and Mary has a $22,000 retirement fund, which also costs $4,000 ... they must be equally rich, right?&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;-----&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;When you talk about the distribution of wealth, what you're really talking about, for the most part, is the distribution of a desire to save.  And there is no "proper" distribution for that, any more than there's a "proper" distribution of religious beliefs.  People are diverse, and they have different tendencies.  Some people like to spend, and some people are compulsive savers.  Humans choose differently from each other.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;Suppose the 1% of the population that owns the most rare baseball cards happens to own 70% of the rare baseball cards.  It just means that the other 99% don't care as much about baseball cards.  If they have the same income, they just own more other things instead.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;Now, you can still argue that the reason it's a problem that the top 10% has 70% of the wealth is that the bottom 90% doesn't earn enough money to be able to save.  But that argument is better made by arguing about the *income* distribution.  Because, otherwise, you're combining two issues: having money, and choosing to save it.  If you were to complain that the top 1% own 70% of the baseball cards *because they have a higher income*, you'd be mostly wrong.  Yes, the top 1% of baseball card owners probably DO have a higher income.  But that's not the main reason they own 70% of the baseball cards.  The *main* reason they own 70% of the baseball cards is because they really, really like baseball cards.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;-------&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;Here's a model, for a numerical example.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;Start by assuming a population of 10,000 people.  They all have exactly the same education, and they all graduate at age 25 into a job that pays $40,000 a year.  They work until they're 65, at which point we measure their wealth.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;But they're not all the same, because they have different personalities, and characteristics, and desires.  Specifically:&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;1.  They vary in how much money they like to spend.  The mean of the population is to spend 90% of their salary and save 10%, but with a standard deviation of 15 points.  Nobody saves more than 50% of their salary, or spends more than 115%.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;2.  They vary in how many children they want, and when.  20% of them want no kids.  20% of them want one kid early in life (age 27), and 20% want one kid later in life (age 35).  20% of them want two kids early, and 20% want two kids late.  Kids cost $5,000 a year in expenses to age 18, and then $20,000 annually for the next four years, all of which comes out of saving.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;3.  They vary in how good they are at investing their money.  Some play it safe, and some are more aggressive. Some study investing, and some don't.  The average annual return is 4%, with an SD of 1.5 percentage points.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;4.  They vary in how much effort they put into their job, which affects their annual salary increases.  The mean increase is 2% a year, with a standard deviation of 0.5%.  Nobody ever gets fired or earns less than $40,000.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;5.  Nobody goes into debt more than $50,000.  Once they reach $50,000, they cut their spending to keep the debt at $50K.  All debt is paid off in the year before retirement.  Debt earns interest of 10%.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;Under these conditions, I ran a random simulation of the 10,000 people.  &lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;So, at age 65, what percentage of total wealth will the top 10% own?  Take a guess before reading on.  I'll write it cryptically so you don't see it by accident when you're thinking.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;Ready? &lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;The top 10% of these graduates own (7 * 9 - 22)% of the total wealth.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;Got that?  It's not as big as the real-life answer of 70%, but it's pretty big nonetheless.  And it's *completely* due to the decisions of the individuals themselves.  There is no inequality, no racism, no bad schools, no corruption, no government favors, no explotation by greedy employers.  It's just natural variation in how human beings choose to live their lives.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;----------&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;Some of the other results:&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;The top 1% had 7% of the wealth.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;The top 10% had 41% of the wealth.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;The top 50% had 99% of the wealth.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;As you would expect, the wealthiest people were the ones who saved the most and got the highest rates of return.  The wealthiest, person number 7,490, wound up with just over $4,000,000 in wealth.  She saved 41% of her salary and earned 8.3% per annum.  In case you think 41% is a lot ... it's not, really.  There are a lot of misers in the world.  At retirement, this person earned $55,551, which means she was living on around $33,000 per year.  That's not unreasonable for an outlier, just over 2 SD from the mean.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;Overall, it turned out that number and timing of children didn't matter much.  Neither did salary (although the salaries were all pretty close).  Some of the richest people earned below-average salary increases.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;So what mattered is how much they saved, and how well they invested it.  Of course, my model is way oversimplified, but that does correspond to my perception of how wealth happens in real life, where my sample of friends earns around the same as I do.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;---------&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;One thing I should note is how the "top 1%" figure of 7% is way, way off the real life figure of 38%.  Why is that?  Well, the main reason is probably that the model didn't consider the possibility of enterpreneurs who can occasionally create a multi-billion-dollar company out of nothing.  If Bill Gates and Warren Buffett were in the model, the figure would jump substantially from 7%.  &lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;What's more surprising, I think, is that the top 10% number was so high, at 41%.  I expected it to be much lower, considering that there's so much less variation here than in the real model:&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;1.  Here, everyone had roughly the same income, between $40,000 and $60,000.  Real life, on the other hand, includes sports stars, CEOs, and other people with high productivity.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;2.  Here, everyone was 65.  Older people are obviously wealthier, since they've had much more time to earn and save.  If you take these 10,000 retired people, and combine them with 10,000 babies, then the distribution is much more unequal, since you've added a bunch of zeroes.  Then, the top 10% jump to from 41% of the wealth to 64%.  &lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;The age thing is a big issue.  Even if everyone were exactly equal in every way, following the same career path and the same wealth accumulation path, the distribution would be unequal if you take a snapshot in time.  You'd be combining 65 year olds who are rich because they've been saving, to 25 year olds who WILL be just as rich, but aren't yet.  (That, by the way, is why it's best to look at lifetime income, or at least age-adjusted income, instead of snapshot income or snapshot wealth.)&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;Oh, and by the way ... I built a certain amount of progressive taxation into the model.  I assumed all salary above $40K is taxed at 30%.  I also assumed that the savings rate is based on after-tax salary.  And finally, I assumed that if you save more than 30% of your after-tax salary, any excess is taxed *again* at 30%.  (This was easier than trying to compute tax on investment income.)  The numbers above are *after* all this progressive taxation.&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;----------&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;So, my argument boils down to something like this (directed at a random skeptic):&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;You say that the top 10% owns 70% of the wealth, and that's too much.  Why is that too much?  It can't be just inequality, because here I have a model where everyone is equal, and the top 10% still owns more than 41%.  Why do you think 70% is wrong, and what should the number be, and what are your assumptions?  &lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;And suppose I cornered you, and asked you to tell me exactly what your policy prescriptions are -- how much to tax the rich, what to do with the money, how to tax investment income, what loopholes to close, how to get the poor to save more, and so on.  Then I would ask you, "after all that, how much of the wealth would the top 1% own?"  &lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;I'd bet you couldn't answer that.  And if you don't know what the distribution of wealth would be in your ideal world, how can you possibly argue that it's the wrong number now?&lt;/span&gt;&lt;/div&gt;&lt;div style="font-family: verdana;"&gt;&lt;span class="Apple-style-span"&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-4519332269746842764?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/4519332269746842764/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=4519332269746842764' title='9 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/4519332269746842764'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/4519332269746842764'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/09/on-inequality-of-wealth.html' title='On inequality of wealth'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>9</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-447415647450746128</id><published>2011-09-03T14:23:00.004-04:00</published><updated>2011-09-03T23:39:08.437-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='academics'/><title type='text'>Academic editor resigns after publishing flawed study</title><content type='html'>&lt;span style="font-family: verdana;"&gt;The editor of an academic journal has &lt;a href="http://www.guardian.co.uk/environment/2011/sep/02/journal-editor-resigns-climate-sceptic-paper"&gt;resigned&lt;/a&gt; after publishing a study that turned out to be flawed.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;There's more to it than that, of course ... it's mostly an issue of political correctness, rather than a scholarly one.  The study in question was by a politically incorrect author, with a politically incorrect conclusion.  The paper, it turns out, was skeptical of climate change, and there are accusations that the peer reviewers who gave it their blessing were also known skeptics.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Still, it's interesting to see how the reaction pretends that's not an issue.  The resigning editor, Wolfgang Wagner, wrote:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-family: verdana;"&gt;"[The paper was] "fundamentally flawed and therefore wrongly accepted by the journal ... As the case presents itself now, the [peer review] editorial team unintentionally selected three reviewers who probably share some climate sceptic notions of the authors …the problem I see with the paper by [authors] Spencer and Braswell is not that it declared a minority view (which was later unfortunately much exaggerated by the public media) but that it essentially ignored the scientific arguments of its opponents."&lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;So, let me get the implications straight.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;1.  It is a very serious matter if a flawed paper is accepted by a journal.&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;2.  If the peer reviewers agree with the author on a related scientific theory prior to the paper being published, you should find other peer reviewers.&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;3.  If a paper ignores the scientific arguments of its opponents, it should not be published.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Can these people possibly be serious?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;1.  Flawed papers are accepted by journals ALL THE TIME.  The sabermetric community has revealed the flaws in many, many academic studies, and no editor has resigned.  Indeed, on several occasions, we have revealed problems with studies when they're still in the "working paper" stage, and they get published anyway.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;If an editor had to resign every time a flawed paper got published, no editor would last longer than three months in the job.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;2.  This only seems to become a principle when the scientific theory in question is politically incorrect.  If a journal considers a study *confirming* climate change, do they really go out of their way to find climate skeptics to peer review it?  If a psychology journal publishes a study documenting the negative effects of racial bias, do they insist that one of the peer reviewers be a KKK member?  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;3.  Actually, I'm OK with this one.  But I can't resist snarping a little bit.  Has any editor ever been forced to resign because of the publication of an academic paper on sabermetrics that doesn't know who Bill James is, that barely cites any existing sabermetric research, and that could have been refuted by a sabermetrician in fifteen seconds?&lt;br /&gt;&lt;br /&gt;Okay, I'm done snarping.  Moving on now.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;-------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Have you ever noticed what a big deal it is in academia whenever a study is acknowledged to be flawed?  The hands wring, the "mea culpa"s flow, and everyone talks about what could have gone wrong with the process that this was allowed to happen.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;It looks good at first, that academia is so concerned about getting it right that they take it so seriously when something is wrong.  But, really, it's a veneer, isn't it?  They're just trying to signal how serious and ethical they are to people who don't know any better.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;More often than not, it doesn't work that way in real life.  We've all seen and talked about studies that are obviously flawed, and we've seen examples of academics who deflect the arguments to irrelevant side issues, ignore them completely, or attack our credentials instead of the actual criticism at hand.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Sometimes the community will be a bit more subtle than that ... they won't disown the study explicitly; instead, they'll publish a rebuttal letter, or a study opposing the original.  They'll try to position it as a healthy scientific debate between scientists.&lt;br /&gt;&lt;br /&gt;But, no matter how flawed the paper, they don't normally demand that the editor resign.  And there also seems to be an implicit understanding that you don't pillory the original author, even if the original study was obviously meritless.  You just make sure you never cite it favorably, and everyone ignores it and gets on with their lives.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;But here, they won't do that.  The climate change research community seems to hold the offending author in very low regard.  Normally, they'd just ignore the offending author, knowing their peers would extend to them the same courtesy.  But it seems like they're just fed up with this Spencer guy.  And, perhaps that's for good reason.  &lt;/span&gt;&lt;span style="font-family: verdana;"&gt;As one professor said,&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-family: verdana;"&gt;"Spencer [one  of the co-authors] is well known in the scientific community for  publishing high-profile papers that initially dispute global warming and  only later are found to be faulty."  &lt;/span&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Still, this is not a case of academia standing up to defend its strict standards of truth.  This seems to be a case of academia having decided that this particular academic is persona non grata on this particular subject, and that they're not going to let him, or his editor, get away with things other academics can.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;And you know, this turn of events might have been totally the right thing to do.  I don't know this Spencer guy.  For all I know, he might be an awful scientist, blinded by his political beliefs, trying to publish bad papers anywhere he can get away with, to cast doubt on the climate change hypothesis.  In that case, we might agree that because of his repeated disingenuousness, and the poor quality of his work in the past, his latest study should have been subject to extra scrutiny -- and his editor's failure do that represents a resignable offense.  (Again, I don't know Spencer's work at all, so this is entirely hypothetical.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;But if that's the case, say so!  Don't say "the editor was fired because the paper was flawed."  It makes you academics sound ridiculous, like you consider yourselves more infallible than the pope.  It sends the idea that everything that makes it into a journal is invariably 100 percent correct -- because, if it weren't, would the editor still be working here?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;If you say, "the editor was &lt;s&gt;fired&lt;/s&gt; expected to resign because the paper was flawed," you sound arrogant and silly, not to mention dishonest.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;The critics should just tell the truth.  They should say, "Look, there's this one guy who's acting like a dork.  He's putting together these crappy studies, which have no scientific merit, and he won't do what scientists are supposed to do and look at the data objectively.  He's so politically committed to his hypothesis that he doesn't care about making his studies hold together, and he gets everything wrong.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;"Now, we're scientists, so we are very open to the idea that our current theories might be wrong, and there are skeptical scientists whose research is valid and whom we respect.  But not this guy.  His work has been so bad, for so long, that it's incumbent on any editor to double and triple-check his work to make sure he's not doing it again.  In that light, when his editor failed to do that, it's akin to negligence.  So it's only appopriate that he resign."&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;That would make sense.  But, I guess, to the general public, it doesn't look as good.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/31545676-447415647450746128?l=sabermetricresearch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://sabermetricresearch.blogspot.com/feeds/447415647450746128/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=31545676&amp;postID=447415647450746128' title='8 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/447415647450746128'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/31545676/posts/default/447415647450746128'/><link rel='alternate' type='text/html' href='http://sabermetricresearch.blogspot.com/2011/09/academic-editor-resigns-after.html' title='Academic editor resigns after publishing flawed study'/><author><name>Phil Birnbaum</name><uri>http://www.blogger.com/profile/03800617749001032996</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>8</thr:total></entry><entry><id>tag:blogger.com,1999:blog-31545676.post-5651817603366227441</id><published>2011-08-29T11:18:00.003-04:00</published><updated>2011-08-29T11:23:38.044-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='economics'/><category scheme='http://www.blogger.com/atom/ns#' term='picasso'/><title type='text'>More on "psychic value"</title><content type='html'>&lt;span style="font-family: verdana;"&gt;In &lt;a href="http://www.grantland.com/story/_/id/6874079/psychic-benefits-nba-lockout"&gt;Malcolm Gladwell's piece&lt;/a&gt; on the "psychic value" or "Picasso value" of owning a sports team (which I talked about &lt;a href="http://sabermetricresearch.blogspot.com/2011/08/gladwell-on-picasso-theory.html"&gt;here&lt;/a&gt;), there was a reference to an academic study (&lt;a href="http://www.accessecon.com/pubs/EB/2007/Volume26/EB-07Z10027A.pdf"&gt;.pdf&lt;/a&gt;) that tried to find the psychic value of owning a painting.  That study found that psychic value to be around 28% of the value of the painting.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Well, that makes no sense.  Paintings don't return a stream of income (unless you charge admission to see them, which isn't the case here).  The only benefit to owning the painting is the intrinsic, subjective value you get from owning it.  So the "psychic value" of owning the painting can't be 28% of its value.  It must be 100%.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;The confusion, I think, comes from the fact that, sometimes, you can sell a painting at a profit.  That makes it seem like there are two benefits to the painting -- the psychic benefit of ownership, and the potential capital gain at the end.  But, really, there's one benefit: the psychic one.  Sure, the *value* of that psychic benefit will likely rise over the years, and, when it does, you can sell that benefit to another buyer at a higher price.  But you're still selling only joy.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;The "profit" is actually something you can expect, and it's built into the price of the painting.  Suppose owning a Picasso is worth $100K a year in "psychic value" to the person who likes it best.  And that value rises every year by the rate of inflation -- say, 5%.  And suppose interest rates are 10%.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;The buyer then expects:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;$100,000 worth of psychic value the first year&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;$105,000 worth of psychic value the second year&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;$110,250 the third year&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;$115,763 the fourth year&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;... and so on.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;How much is he willing to pay for the painting?  Well, at a discount rate of 10%, it works out to $2 million.  By buying the painting for $2 million, the buyer forgoes $200,000 in interest that he would get otherwise.  In exchange, he gets $100,000 in psychic value the first year, and the painting appreciates by $100,000.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;But if you were to look at the fact that the psychic value equals the appreciation, and conclude that only 50% of the value of the painting was psychic value, you'd would be incorrect.  Psychic value accounts for 100% of the value of the painting.  The appreciation comes from an increase, over time, in the rate of return in psychic value.&lt;br /&gt;&lt;br /&gt;What you &lt;/span&gt;&lt;span style="font-family: verdana;"&gt;CAN say is that, of the first year's forgone interest on the value of the painting, 50% of that represents the psychic value consumed that year, while 50% represents appreciation of the remainder of the psychic value.  But that's not that brilliant an insight.  It's true for everything you buy: the "psychic value" must be at least the forgone interest minus the appreciation (or plus the depreciation, which is negative appreciation).  If you buy a TV for $1000 at 10% interest, and it loses 20% of its value every year, the first year's "psychic value" must be at least $300, or you wouldn't buy it.&lt;/span&gt;&lt;span style="font-family: verdana;"&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;------&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;Another thing that's confusing is that sometimes paintings appreciate a lot more than inflation, which makes them look like a good investment.  But that's got to be random.  If it was known in advance that the painting would appreciate more than stocks, the price would go up immediately as buyers bid up the price.  Those stories you hear about buyers paying $500 and selling for $1,000,000 ... well, those are outliers, like winning lottery tickets.  In a reasonably efficient market, the sum of the psychic value, and the appreciation, must be close to the return you can get from other (similarly risky) investments.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;But life is random, and it's possible that values increased much more than expected in the past.  The art world may have thought that psychic value would increase only with inflation, but, as more and more billionaires were created, the psychic value rose even faster.  That would certainly have caused prices to rise faster than expected, and would make paintings look like a good "investment".  But the market would adjust to the new expectations.  Indeed, as it did, prices would rise even faster!  They'd rise once for the fact that psychic values are now higher, and they'd rise again for the fact that psychic values are accelerating over time.  &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: verdana;"&gt;In retrospect, that may have made paintings look like they were a better than average investment (which I guess they would have been).  But that's not because paintings have two benefits -- psychic, and non-psychic.  It's because they have one benefit, psychic, and the value of that benefit increased sharply.  If you buy a painting as an "investment," you are speculating in the value of its psychic benefits.  And you are betting against the market.  Unless you have much, much better speculative skills than anyone else, you're probably going to break even in the long run, before taking into account auction fees, and such.  And "breaking even" includes psychic benefits.  If you don't like art, the expectation for your overall experience is strongly negative, compared to other investments.&lt;/span&gt;&lt;br /&gt;&lt;
