This is a top five takeaway summary of The Signal and the Noise, a book about the art and science of prediction, written by Nate Silver.

Takeaway number one: the signal and the noise. Ever since the creation of the printing press in 1439, the amount of information that can be stored and reused later has exploded, and this has only been enhanced further by the spread of the Internet. Information created and shared seems to grow at an exponential pace. The problem is just that the amount of useful information is not increasing at the same rate.
Say, for instance, that you're an investor and you would like to predict how the economy will do during the next five years, to tell if you should put your money in the stock market or not. Well, you have more than 45,000 economic indicators to choose from that are produced by the US government alone. But which ones are relevant and which ones are not? Which ones are signal and which ones are noise? It's important to be able to tell the difference if we want to make our predictions reliable. According to Nate Silver, the signal is the truth;
the noise is what distracts us from the truth. The goal of a prediction model should be to capture as much of the signal as possible while keeping the noise to a minimum. Let's pretend, for instance, that the unemployment rate is the signal: it is what can help us predict how the economy will do for the next five years. Then all the other 44,999 economic indicators are noise. They are just distracting us from realizing that it's only the unemployment rate that we should be focusing on. The problem is that we humans have hyperactive pattern recognition. It
may have helped us thousands of years ago to distinguish whether the rattle from that bush is just a bird or if it's a lion, but it's harming us when making forecasts. We tend to identify patterns among the noise. More on this in Daniel Kahneman's book, by the way, Thinking, Fast and Slow, and also more on this in takeaway number three. The productivity paradox is a central part of this book: we face danger whenever information growth outpaces our understanding of how to process it.

Takeaway number two: we have a prediction problem. Just before
the outbreak of the financial crisis, the rating agencies had given their top rating, triple-A, to thousands of mortgage-backed securities. Some of these were said to have only a 0.12 percent risk of defaulting. How many of them do you think defaulted in reality? A whopping 28%. That's more than 200 times what S&P had predicted. In 1990, the IPCC, the Intergovernmental Panel on Climate Change, forecasted that the rise in temperature would most likely be at a rate of 3 degrees Celsius per century, but definitely no less than 2 degrees per century going forward. As of
2011, shortly before Nate Silver released The Signal and the Noise, the increase had been at a rate of just 1.5 degrees Celsius per century. Here's how economists forecast GDP during the 18 years between 1993 and 2010. In the chart, the bars represent the range where the economists said that GDP growth had a 90% chance of ending up. As you can see, the economists were correct 12 times out of 18, or only about 67% of the time. That is quite far from their stated 90%. All right, so apparently, at least in some fields, we are terrible at making predictions.
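The calibration gap in the GDP example can be checked with a few lines of arithmetic. The sketch below is an illustration and not something taken from the book or the video: it uses the 12-of-18 hit count quoted above and, under the simplifying assumption that each year's forecast is an independent trial, asks how likely 12 or fewer hits would be if the 90% intervals were honest. The function names `coverage` and `prob_at_most` are hypothetical.

```python
from math import comb


def coverage(hits: int, total: int) -> float:
    """Share of forecasts whose stated interval contained the outcome."""
    return hits / total


def prob_at_most(hits: int, total: int, p: float) -> float:
    """Binomial probability of `hits` or fewer successes in `total`
    independent trials that each succeed with probability `p`."""
    return sum(
        comb(total, k) * p**k * (1 - p) ** (total - k)
        for k in range(hits + 1)
    )


# Figures quoted in the transcript: 12 correct intervals out of 18 years.
observed = coverage(12, 18)

# ASSUMPTION: years are treated as independent trials, which real
# economic forecasts are not; this is only a rough plausibility check.
tail = prob_at_most(12, 18, 0.90)

print(f"observed coverage: {observed:.1%}")
print(f"chance of 12 or fewer hits with honest 90% intervals: {tail:.2%}")
```

Under that independence assumption the tail probability comes out below 1%, which suggests the stated 90% intervals were too narrow rather than merely unlucky.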
And yes, you could blame black swan events for these failed forecasts, like S&P did, but in reality it's much more likely that the fault is in the forecaster's model than in the world itself. We should show some sympathy, though, because there are situations in which predictions are extra difficult to make. The event is out-of-sample: to think that housing prices could have such a major effect on the economy seemed quite unlikely looking at historical data. The problem was just that in 2006, historical data didn't help much in making a prediction, as the conditions were
vastly different than ever before. Never had the economy been so highly leveraged, and never had so many side bets been made on housing prices. Dynamic systems: when the behavior of a system at one point in time influences its behavior in the future, we have a dynamic system. This means that even if we are just slightly off in assessing the current state of the system, we will end up very wrong when predicting its future, as mistakes multiply over time. This is the reason why we can forecast the weather accurately a few days ahead but
not more than that. A lack of theory: simply put, we just don't know enough about many of these systems. The economy, just like the climate, is a complex beast, and although we have some heuristics, or rules of thumb, that we can conform to, it's not always enough to make accurate predictions.

Takeaway number three: correlation does not equal causation. A mafia boss gives one of his minions three different locks with codes: a red, a blue, and a yellow one. He asks for a method to pick such locks. After a few days, the underling is back, happily
stating that if it's a red lock, just enter 1645; if it's blue, 3493; and if it's yellow, 0232. The underling would have mistaken correlation for causation and would have completely failed his task. There's no reasonable explanation for why the color red would have a person enter the exact digits 1645. It just happened to be so in this particular case. Let's take another example. Is it true that consumption of ice cream causes shark attacks? Well, from this chart you may come to that conclusion, as it's quite clear that shark attacks increase whenever
the ice cream consumption does. But obviously there's no causal relationship here. Ice cream consumption does not cause shark attacks. It's just that both swimming in the ocean, which makes us more likely to get attacked by a shark, and eating ice cream are more enjoyable in the summer. While these examples may seem a bit silly, mistaking correlation for causation is a very common forecasting problem. As stated earlier, there are 45,000 economic indicators produced by the US government each year. If you are trying to predict the stock market, for example, and you look at historical data
to do this, you are almost guaranteed to find that some of these variables seem to have strong predictive power. Yet they may not. You may just have been fooled by randomness. Whenever you make a prediction, be sure that there's a logical explanation for the mathematical relationship. Do not trust data unconditionally.

Takeaway number four: how can we become better at predicting? Here are three examples of how we can become better forecasters. Think probabilistically: reality is not a yes/no situation, even though some people seem to think that. For example, if I ask you how the stock market
will perform next year, what is the best answer? It's not, for example, 5%, and it's not minus 5%, minus 1%, or 10% either. The best answer is a spectrum of outcomes with probabilities attached to them. Good predictors are great at seeing reality as such and have developed a skill for assigning probabilities to outcomes. For more on how to apply this in the stock market, please see my summary of The Dhandho Investor. Change the forecast with new evidence: say that you are dealt two aces in poker, the best starting hand possible. You decide to bet and
you are called by one of your opponents. The first three cards on the table are the following: nine of spades, ten of spades, and jack of spades. He checks, you bet, and he calls. The fourth card is the eight of spades. Your opponent decides to bet. Should you call this bet or not? Your initial forecast was that you were going to win this hand easily, but clearly you must factor in the new evidence. You are beaten by an awful lot of straights and flushes and should definitely fold this hand. You should make the best forecast today
regardless of what you said yesterday. More on this in the final takeaway. Look for consensus: good predictors are good at weighing multiple sources of information. They don't get lost in narratives or stories, and they are able to weigh both quantitative and qualitative information together before making their decisions. For example, they understand that Netflix can be a great stock to buy because it has an enormous potential for scalability, but simultaneously they realize that it's quite expensive to buy a stock, no matter how great it may be, for 130 times its last year's earnings.

Takeaway number five:
Bayes' theorem. You come home after a weekend with the boys in Prague, only to find a pair of pants on your bed that you are certain are not yours. Has your girlfriend been cheating on you while you've been gone? You've been holding on to that Tesla stock for quite a while now, and this very evening they reported an update that the Model 3 seems more difficult to mass-produce than first anticipated. Will Tesla go bankrupt? Both of these examples involve updating a previous forecast when new evidence has presented itself. Most humans are terrible at
this. Luckily, there is a solution: Bayes' theorem. Bayes' theorem is a mathematical formula that can help you calculate the probability of something occurring given that something else has happened. The mathematical equation looks like this. Let's apply it to our first example. You must make some assumptions regarding three different events. Firstly, what would you estimate the probability of her cheating on you to have been before you found this new piece of evidence? Let's say that it's 4%; apparently that is some kind of national average. Secondly, if she really is cheating on you, how
likely is it that the guy would forget his pants? Well, not too likely, probably. Let's say 20%. Thirdly, what's the probability that these pants are here if she's not cheating on you? Perhaps she has a brother who visits sometimes. With these assumptions, we can calculate that the probability that she's been cheating, given that you found the pants, is 14%. Not so high after all, so please don't confront your girlfriend screaming and crying uncontrollably just yet, but recognize that it's much higher than our initially expected 4%. If we find another pair of pants a year later,
we will have to revise this and update our probability to 39 percent. Successful predictors in any field recognize this: when new information presents itself, we must update our initial hypothesis. Perhaps we didn't think that Tesla would go bankrupt the first time they presented a liquidity issue, but if new problems constantly arise, we must update our estimates. People may see this as a sign of weakness: "All right, look at that guy, he's constantly changing his mind." But it's actually the most rational thing to do. Only when we are constantly refining our estimates can we come closer
to the signal and further away from the noise.

Here's a summary in less than 53 seconds. The signal is the truth; the noise is what distracts us from the truth. More data means more noise in relation to the signal. Some situations are particularly difficult to forecast, especially when an event is out-of-sample, when we are forecasting dynamic systems, and when there's a lack of theory in the domain. Correlation does not mean causation: make sure that there's a logical explanation for the mathematical relationship before using it in your predictions. We can become better at predicting by thinking probabilistically,
updating our forecasts when we are presented with new evidence, and by looking for consensus. Use Bayes' theorem, or at least think in a Bayesian way, when new information presents itself. Cheers, guys!
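
The pants example from takeaway number five can be reproduced with a short sketch of Bayes' theorem, P(A|B) = P(B|A) * P(A) / P(B). The 4% prior and the 20% chance of forgotten pants are the figures given in the transcript. The probability of finding the pants when she is not cheating is never stated, so the 5% used here is an assumption, chosen because it reproduces the quoted 14%. The function name `bayes_update` is hypothetical.

```python
def bayes_update(prior: float, p_evidence_if_true: float,
                 p_evidence_if_false: float) -> float:
    """Return P(hypothesis | evidence) using Bayes' theorem.

    P(evidence) is expanded over the two cases: hypothesis true or false.
    """
    numerator = prior * p_evidence_if_true
    denominator = numerator + (1 - prior) * p_evidence_if_false
    return numerator / denominator


PRIOR = 0.04                # from the transcript: baseline chance of cheating
P_PANTS_IF_CHEATING = 0.20  # from the transcript
P_PANTS_IF_FAITHFUL = 0.05  # ASSUMPTION: not stated in the transcript

# First pair of pants: update the 4% prior.
first = bayes_update(PRIOR, P_PANTS_IF_CHEATING, P_PANTS_IF_FAITHFUL)

# Second pair a year later: yesterday's posterior becomes today's prior.
# Rounding to 14% first mirrors the figure quoted in the transcript.
second = bayes_update(round(first, 2), P_PANTS_IF_CHEATING, P_PANTS_IF_FAITHFUL)

print(f"after first pair:  {first:.0%}")
print(f"after second pair: {second:.0%}")
```

Starting the second update from the rounded 14% gives roughly 39%, matching the figure quoted above; carrying the unrounded posterior forward gives 40% instead. Either way, the point is the same: each new piece of evidence moves the estimate, and the previous answer is the starting point for the next one.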