{"id":26213,"date":"2021-02-13T09:39:21","date_gmt":"2021-02-13T13:39:21","guid":{"rendered":"https:\/\/www.shortform.com\/blog\/?p=26213"},"modified":"2021-02-24T19:58:36","modified_gmt":"2021-02-24T23:58:36","slug":"forecast-accuracy","status":"publish","type":"post","link":"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/","title":{"rendered":"The Challenge of Measuring Forecast Accuracy"},"content":{"rendered":"\n<p>How do you measure forecast accuracy? What are some challenges in evaluating whether a forecast is correct and to what degree?<\/p>\n\n\n\n<p>Given all the ways our brains can work against us, forecasting accurately is incredibly difficult. But evaluating an existing forecast&#8217;s accuracy in the first place presents difficulties of its own.<\/p>\n\n\n\n<p>Read about the difficulties of measuring forecast accuracy.<\/p>\n\n\n\n<!--more-->\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Measuring Forecast Accuracy<\/strong><\/h2>\n\n\n\n<p>Predictions about everything from global politics to the weather are not hard to come by. You find them on news channels, in bestselling books, and among friends and family. But most of these predictions have one thing in common: After the event, no one thinks to formally measure how accurate they were. <strong>This lack of measurement means that you have no sense of how accurate any particular source usually is.<\/strong> Without that baseline, how do you know who to listen to the next time you need to <a href=\"https:\/\/www.shortform.com\/blog\/be-decisive\/\">make a decision<\/a>?<\/p>\n\n\n\n<p>Given how important accurate predictions are, it\u2019s surprising that we have no standard way of measuring forecast accuracy. Instead, forecasters in popular media deliver their predictions with so much confidence that we take them at their word, and by the time the events they predict happen (or don\u2019t), the news cycle has moved on. <strong>The loudest voice is often the most convincing one, regardless of how accurate they are.<\/strong>&nbsp;<\/p>\n\n\n\n<p>(Meteorologists are an exception to this rule. They use data to continually update weather forecasts, and they compare the actual weather to their predictions after the fact to measure forecast accuracy and get insight into what they may have missed.)<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Evaluating Forecast Accuracy<\/h3>\n\n\n\n<p>Given all the ways our brains can work against us, forecasting accurately is incredibly difficult. But determining whether a forecast is accurate in the first place presents difficulties of its own.<strong> A forecast judged by different standards than the forecaster intended will be deemed a failure, even if it\u2019s not<\/strong>. This is the case for one of the most famous forecasting flops of all time: the 2007 claim by Steve Ballmer, then-CEO of Microsoft, that there was \u201cno chance\u201d that Apple\u2019s iPhone would get \u201cany significant market share.&#8221;&nbsp;<\/p>\n\n\n\n<p>In hindsight, this prediction looks spectacularly wrong, and it often tops lists of \u201cWorst Tech Predictions Of All Time.&#8221; But judging Ballmer\u2019s prediction is more difficult than it seems. What did he mean by \u201csignificant\u201d? And was he referring to the US market or the global market? The <a href=\"https:\/\/www.shortform.com\/blog\/smartphone-market\/\">smartphone market<\/a> or the mobile phone market as a whole?&nbsp;<\/p>\n\n\n\n<p>These questions matter, because the answers lead us to very different conclusions. Judged against the US smartphone market (where the iPhone commands 42% of the market share), Ballmer is laughably wrong. But in the global mobile phone market (not just smartphones), that number falls to 6%\u2014far from significant.&nbsp;<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Some Forecasts Are Too Vague to Judge<\/strong><\/h3>\n\n\n\n<p>Although Ballmer\u2019s infamous iPhone forecast <em>seems <\/em>clear at first, it\u2019s actually ambiguous. The nature of language means that certain words can be interpreted differently by different people, and forecasts tend to be full of these words (like \u201csignificant,\u201d \u201clikely,\u201d and \u201cslight\u201d). Think about a forecast that claims a particular result \u201cmay happen.&#8221; Like the doomsday prediction, there\u2019s technically no way to discredit this forecast\u2014if something \u201cmay\u201d happen, it\u2019s also implied that it may not. Either way, the forecaster is correct. But the forecast itself is useless for making decisions.&nbsp;<\/p>\n\n\n\n<p>\u201cLikely\u201d is another word that often pops up in forecasts and presents similar problems. If a forecaster claims an event is \u201clikely\u201d to happen and then it doesn\u2019t\u2014was the forecaster wrong? Our gut reaction is to say yes, but that\u2019s incorrect. Think of it this way: If you reach into a bag that you know contains twenty red balls and one blue ball, you could correctly claim that it\u2019s \u201cmost likely\u201d you\u2019ll draw a red ball. If you happen to draw the lone blue ball, your claim is still correct\u2014you just happened to get an unlikely result.&nbsp;<\/p>\n\n\n\n<p>Lack of timelines is another common problem in popular forecasts. If someone says \u201cthe world will end tomorrow,&#8221; that has a clear end date\u2014tomorrow, if the world has not ended, we can safely say they were wrong. But if someone says \u201cthe world will end,&#8221; any arguments to the contrary can be met with \u201cjust wait and see.&#8221; <strong>The lack of a time frame means that no matter how much time passes, they can\u2019t be proven wrong.&nbsp;<\/strong><\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Probabilities Are Useful Estimates, Not Facts<\/strong><\/h3>\n\n\n\n<p>If the example of the red and blue balls brought back memories of math textbooks, there\u2019s a reason for that. <strong>Probability is one of the biggest obstacles to judging forecast accuracy. <\/strong>Calculating the probability of pulling a blue ball out of a bag is fairly easy\u2014even if you don\u2019t know any probability formulas, you can just keep blindly pulling a ball out of the bag, recording its color, then putting it back and repeating the process. After enough trials, it would be easy to say which color ball you\u2019re most likely to draw and about how much more likely you are to draw that color than the other.&nbsp;<\/p>\n\n\n\n<p>However, attaching an accurate number to the probability of a real-world event is almost impossible. To do so, we\u2019d need to be able to rerun history over and over again, accounting for all the different possible outcomes of a given scenario. This means that for most events that forecasters are concerned with, it is impossible to know for sure that there is a specific probability of the event happening. <strong>Therefore, any probability attached to an event in a forecast is only the forecaster\u2019s best guess, not an objective fact. <\/strong>This can be misleading, but it doesn\u2019t mean that estimated probabilities are useless.&nbsp;<\/p>\n\n\n\n<p>In fact, using numerical probability estimates in forecasts is critical. In the 1950s, <a href=\"https:\/\/www.shortform.com\/blog\/the-cia-operations-technology-langley\/\">the CIA<\/a> forecasting team discovered this after delivering a report forecasting the likelihood of the Soviet Union invading Yugoslavia. The report concluded that an attack was a \u201cserious possibility.&#8221; When a State Department official later asked the director of the forecasting team what they meant by \u201cserious possibility\u201d in terms of odds, he estimated the odds at 65 to 35, much higher than how the State Department had interpreted it.&nbsp;<\/p>\n\n\n\n<p>This miscommunication was understandably alarming. The director of the forecasting team, Sherman Kent, took the problem back to his team and asked them each to put a number on \u201cserious possibility.&#8221; Though they had all collectively approved of the phrasing in the official report, <strong>every single team member assigned a different numerical value to those words. <\/strong>Kent was horrified: Not only were the forecasters not on the same page, but their forecasts were being used to inform foreign policy. If their reports were misunderstood, there could be global consequences.&nbsp;<\/p>\n\n\n\n<p>That claim may sound dramatic, but it\u2019s exactly what happened in 1961 when President Kennedy commissioned the Joint Chiefs of Staff to report on his plan to invade Cuba. The final report predicted a \u201cfair chance\u201d of success, and the government went ahead with what became the Bay of Pigs disaster. After the fact, it was clarified that \u201cfair chance\u201d meant three to one odds <em>against<\/em> success, but President Kennedy interpreted the phrase more positively and acted accordingly.&nbsp;<\/p>\n\n\n\n<p>In the aftermath of the failed Bay of Pigs invasion, Sherman Kent proposed a universal standard for official forecasts that would eliminate ambiguity by assigning numerical probabilities to particular words. He created the chart below:<\/p>\n\n\n\n<figure class=\"wp-block-table\"><table><tbody><tr><td><strong>Certainty<\/strong><\/td><td><strong>Word<\/strong><\/td><\/tr><tr><td>100%<\/td><td>Certain<\/td><\/tr><tr><td>87-99%<\/td><td>Almost certain<\/td><\/tr><tr><td>63-86%&nbsp;<\/td><td>Probable<\/td><\/tr><tr><td>40-63%&nbsp;<\/td><td>Chances about even<\/td><\/tr><tr><td>20-39%&nbsp;<\/td><td>Probably not<\/td><\/tr><tr><td>1-19%&nbsp;<\/td><td>Almost certainly not<\/td><\/tr><tr><td>0%<\/td><td>Impossible<\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<p>The table above would make it difficult to misinterpret a forecast but was rejected outright by the intelligence community, who felt that expressing probabilities numerically was crude and misleading. They feared readers would fall into the common trap of interpreting numbers to mean something <em>is <\/em>X percentage likely to happen, not that the forecaster <em>believes<\/em> that to be the likelihood.&nbsp;<\/p>\n\n\n\n<p>That distinction matters, since it affects not just what we do with a prediction but how we judge the person who made it. What probability percentages mean and what people think they mean are entirely different. For example, if a meteorologist correctly predicts a 70% chance of rain, it means that if we were able to replay that day hundreds of times, it would rain in 70% of those replays.<\/p>\n\n\n\n<p>But that\u2019s not how we typically read weather forecasts. Instead, we fall for what the authors call <strong>the \u201cwrong-side-of-maybe fallacy,&#8221; where we interpret any prediction higher than 50% to mean something <\/strong><strong><em>will<\/em><\/strong><strong> happen and anything lower than 50% to mean it won\u2019t. <\/strong>So if the meteorologist predicts a 70% chance of rain on a day where it does not rain, we think she was wrong, and consequently, that she must not be very good at her job.&nbsp;<\/p>\n\n\n\n<p>In spite of those risks, meteorology has embraced the clarity of numbers, and most of us are now accustomed to seeing weather forecasts in terms of percentages. But avoiding baseless negative judgment is a major reason forecasters in other fields prefer vague language like \u201cserious possibility.&#8221;&nbsp;<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Using Numbers to Evaluate Forecast<\/strong> Accuracy<\/h3>\n\n\n\n<p>How can we measure the overall accuracy of a particular forecaster? For singular events, there really isn\u2019t an accurate way\u2014even with modern technology, replaying history to see every possible outcome is a power still reserved for fictional heroes like Dr. Strange. Instead, we rely on aggregates.&nbsp;<\/p>\n\n\n\n<p>Let\u2019s imagine that the meteorologist in the above example predicts the weather every day for several years, racking up hundreds of total predictions. While we still can\u2019t say how accurate her forecast is for any specific day, we <em>can <\/em>figure out how accurate she is in general through a process called calibration.&nbsp;<\/p>\n\n\n\n<p>Let\u2019s say the meteorologist predicted a 70% chance of rain in 100 of her daily forecasts. If it actually did rain on 70 of those days, her forecasting is perfectly calibrated. In other words, a given event happens 70% of the time that she says there is a 70% chance of that event happening.&nbsp;<\/p>\n\n\n\n<p>Visually, you can represent calibration with a line graph, with \u201cforecasted percentage\u201d on the X-axis and \u201cpercentage correct\u201d on the Y-axis. <strong>Perfect calibration is an exact diagonal line<\/strong>, like on the graph below:<\/p>\n\n\n\n<p>Obviously, most forecasters aren\u2019t spot-on every time. The same graph setup can help us judge an individual forecaster\u2019s accuracy by plotting each of her predictions on the graph, calculating the curved trend line for all those points, and comparing that line to the perfect diagonal line. If the forecaster is under-confident (and chronically underestimates probabilities), her curve will be far over the line; If she\u2019s overconfident, the curve will be far under the line.&nbsp;<img decoding=\"async\" src=\"https:\/\/lh6.googleusercontent.com\/K0FJso-TJPJxxLC59wGiE0RPCLVQYv1gBD4WvBh6MAyOA_w23GNEfxvlO5rezmAfiGBRQNEF_ovYDkDaJ9o3_2onuI8T4zU0FypPRAXn9NJ6cpbep3eiH77fXC6ZJR1w9nXa-j0b\" width=\"763.8113207547169\" height=\"490\"><\/p>\n\n\n\n<figure class=\"wp-block-image\"><img decoding=\"async\" src=\"https:\/\/lh3.googleusercontent.com\/zAnIJjtKBRZ1ST5sbDl3ttwoEr47CMQ33OiS74NBrFJZkiNcvygSwMBqp46Tixg3eyb8hCATpYbX33FJr9AYDAk8ckxXW1_Sx8nAYXXkVdIKiMvLaW5IwQlndJtpAQCosGB8gOHh\" alt=\"\"\/><\/figure>\n\n\n\n<figure class=\"wp-block-image\"><img decoding=\"async\" src=\"https:\/\/lh3.googleusercontent.com\/stDVKcDfiTDtgGw_UkTi-mMlI8nTH3OmA7AzSBtnWjkG0BdqhEur6Nc1l3tEDQIg4ZqKLGIMcw9uyZNwHSM3uPicZgrwLJaH3fa0vXfyhinosT_jnA29LMQanVZQd5o1AM3FTTQW\" alt=\"\"\/><\/figure>\n\n\n\n<p>Calibration is helpful, but it\u2019s not the only important measure for evaluating forecasts. A forecaster who always predicts probabilities near the level of chance (50%) will be fairly well-calibrated, but the information isn\u2019t helpful\u2014it\u2019s the mathematical equivalent of a shrug. Stronger forecasters are accurate outside the range of chance\u2014they\u2019re willing to assign much higher or lower odds to a particular event, despite the increased risk of being wrong. We can measure this using <strong>resolution<\/strong>. Forecasters with higher resolution are more impressive than cautious forecasters who are equally well-calibrated. You can see this on the graphs below.<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img decoding=\"async\" src=\"https:\/\/lh3.googleusercontent.com\/qr39fE2d-232eLttLclDRNc4I3CofPSGi1zKpdeBnWOm8oCWWm-HXG2CYIGX6EWK3EtOsMpvhvjBjGMTW9fSnX8lxpNsVUS6Glh_zkHHAdbrF7EQ5BNdvkLsKIADIjOl8dFOgREj\" alt=\"\"\/><\/figure>\n\n\n\n<figure class=\"wp-block-image\"><img decoding=\"async\" src=\"https:\/\/lh4.googleusercontent.com\/TD2o1UEwyy5XDANQa5XSpNZQ-RHHZ0vxjgDKh6z7v4jTL_4kd5Ssyq-JqJa_9PEjeaEi657nCBnD2YFUoK1yZ_5MYNDudPoRZYsWguThBaZ_Xz4wfiWm67H6jknUx5TRV8hlmwxZ\" alt=\"\"\/><\/figure>\n\n\n\n<h4 class=\"wp-block-heading\"><strong>Interpreting Brier Scores<\/strong><\/h4>\n\n\n\n<p>Combining measures of calibration and resolution gives us a concrete way to evaluate forecaster accuracy. These measures are combined into a single number, called a <strong>Brier score<\/strong>. Brier scores express the difference between a forecast and what really happened. Scores range between 0 and 2, where zero is an absolutely perfect forecast and two is a forecast that is wrong in every possible way. Random guessing, over time, produces a score of .5.&nbsp;<\/p>\n\n\n\n<p>A forecaster\u2019s Brier score is only meaningful in the context of the types of forecasts they make. For example, if a forecaster predicts the weather in Phoenix, Arizona to be \u201chot and sunny\u201d every day for the month of June, their Brier score is likely to be almost zero, since Phoenix summers are notoriously hot and sunny. This is an impressive score but says very little about the forecaster\u2019s skill because it took very little thoughtful consideration.&nbsp;<\/p>\n\n\n\n<p>Brier scores also give us a way to compare one forecaster to another\u2014we can say that a forecaster with an overall Brier score of .2 is a more accurate forecaster than someone with a score of .4. But context is important here, too, because <strong>Brier scores don\u2019t account for the difficulty of each prediction.<\/strong> Comparing weather forecasters using their Brier scores is helpful, but it\u2019s not fair to compare the Phoenix forecaster to a forecaster in a less stable climate like Missouri. Even if the Missouri forecaster\u2019s score is slightly higher (and thus less accurate), earning that score in unpredictable circumstances is still much more impressive than a better score in Phoenix.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Skill vs. Luck<\/strong><\/h3>\n\n\n\n<p>Brier scores measure forecast accuracy against what really happened. They\u2019re a great way to measure the performance of individual forecasters, but there\u2019s a caveat\u2014they don\u2019t rule out the possibility that someone with a stellar Brier score is just an incredibly lucky guesser<em>. <\/em>To do that, we need a way to compare forecasters\u2019 performance to <em>each other<\/em> over time\u2014If someone outperforms other forecasters year after year, we can confidently say their success comes down to skill; If they score above average one year and below average the next, it\u2019s possible that initial success was just beginner\u2019s luck.&nbsp;<\/p>\n\n\n\n<p>Tracking each forecaster\u2019s performance compared to the group<strong> <\/strong>reveals<strong> how much of the superforecasters\u2019 success comes down to luck and how much is real skill. <\/strong>To understand the <a href=\"https:\/\/www.shortform.com\/blog\/the-role-of-luck-in-success\/\">role of luck<\/a> (or chance) in forecasting, we need to understand randomness. Skills and traits that are normally distributed in a population can be plotted on a classic <a href=\"https:\/\/www.shortform.com\/blog\/the-bell-curve\/\">bell curve<\/a>.&nbsp;<\/p>\n\n\n\n<ul class=\"wp-block-list\"><li>To simplify this, imagine a coin-tossing game. If 100 people were asked to predict the outcome of 100 coin tosses, the results would be normally distributed. The vast majority of guessers would be clustered in the middle of the curve, around 50%. A small group would have terrible luck and fall on the far left of the curve. Another small group would have fantastic luck and fall on the far right.&nbsp;<\/li><li>Remember, there is no skill involved in guessing heads or tails, so those on the far right extreme of the graph are not \u201cbetter guessers.&#8221; This sounds obvious when it\u2019s spelled out, but <strong>randomness is not an intuitive concept<\/strong>, and studies have shown that we\u2019re all too quick to interpret success as being a result of skill, not luck. In one study, even Yale students fell into this trap\u2014those who had a string of correct guesses early in the coin toss game predicted they would do better than chance if the experiment were repeated. In reality, they were no more likely to beat chance than they were the first time.&nbsp;<\/li><\/ul>\n\n\n\n<h4 class=\"wp-block-heading\"><strong>Regression to the Mean<\/strong><\/h4>\n\n\n\n<p>To fully understand randomness and the role of luck, we need to understand regression to the mean.<strong> With enough trials of a task, outliers will shift toward the mean.<\/strong> In the coin toss example, we would most likely see quick regression to the mean if the experiment were repeated multiple times. Over time, each person\u2019s data would average out to roughly 50% correct guesses. In each repetition, there would be guessers who did extremely well or extremely poorly\u2014but without any skill involved, the people on either extreme would be <em>different<\/em> people every time.<\/p>\n\n\n\n<p>Regression to the mean is an invaluable tool for interpreting the <a href=\"https:\/\/www.shortform.com\/blog\/iarpa\/\">IARPA<\/a> tournament results. To understand this, imagine two hypothetical forecasters, Person A and Person B. In year one, Person A was a standout with 99% accuracy (and a Brier score near 0). Person B did terribly, at 1% accuracy for forecasts (giving them a Brier score close to 2.0). Their scores are plotted on the graph below.<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img decoding=\"async\" src=\"https:\/\/lh6.googleusercontent.com\/Xj4CYUPP5ewouo5aj7fqzVeJpuqkpRleoLQQr77EhrHhP6DyG7LuO2Mp5LQ-_bxQ94sDeOgabSu53I9S2GI-lVA6K1YpMEwh1fdremXuhygyq4b1SvIBXVuIoNrewMIFm-Yhaeos\" alt=\"\"\/><\/figure>\n\n\n\n<p>Now let\u2019s look at their scores in year two. If their year one performance was pure skill, we\u2019d expect no regression at all. If their scores were half luck, half skill, we\u2019d expect each person to regress halfway to the mean, so Person A would be at roughly 25% and Person B would be around 75%. If there is no skill involved at all (like the coin toss game), both people would likely regress back to the mean (50%) in year two. These outcomes are shown in the graphs below.&nbsp;<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img decoding=\"async\" src=\"https:\/\/lh6.googleusercontent.com\/a5nNZ_gP5Hs2XVKSpv5zbnlFDwt_eOQVh23IWYDuZEtPHqIN0CMMW_2wtn4dVmhm8ebE6ICS6_exqs2JDzXRtwVMdmBLJ7lYECvV2G96qS0S2Y4qnMj_mgmJYBeUwKjg-pTCEwNM\" alt=\"\"\/><\/figure>\n\n\n\n<p>(Remember, these scores are just for year two. If each of these forecasters kept at it for decades, we would expect to see <em>some<\/em> regression\u2014poor Person B would hopefully improve a bit with practice, and Person A\u2019s scores would likely slip a bit over time. The key here is the rate at which this happens: <strong>Skill-based scores regress slowly, but luck-based scores regress quickly<\/strong>.)<\/p>\n","protected":false},"excerpt":{"rendered":"<p>How do you measure forecast accuracy? What are some challenges in evaluating whether a forecast is correct and to what degree? Given all the ways our brains can work against us, forecasting accurately is incredibly difficult. But evaluating an existing forecast&#8217;s accuracy in the first place presents difficulties of its own. Read about the difficulties of measuring forecast accuracy.<\/p>\n","protected":false},"author":7,"featured_media":8342,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[25],"tags":[203],"class_list":["post-26213","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-statistics","tag-superforecasting","","tg-column-two"],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v24.3 (Yoast SEO v24.3) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>The Challenge of Measuring Forecast Accuracy - Shortform Books<\/title>\n<meta name=\"description\" content=\"Given all the ways our brains work against us, forecasting accurately is hard. But evaluating forecast accuracy is a challenge of its own.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"The Challenge of Measuring Forecast Accuracy\" \/>\n<meta property=\"og:description\" content=\"Given all the ways our brains work against us, forecasting accurately is hard. But evaluating forecast accuracy is a challenge of its own.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/\" \/>\n<meta property=\"og:site_name\" content=\"Shortform Books\" \/>\n<meta property=\"article:published_time\" content=\"2021-02-13T13:39:21+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2021-02-24T23:58:36+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/s3.amazonaws.com\/wordpress.shortform.com\/blog\/wp-content\/uploads\/2020\/04\/hopeful-breath-air-scaled.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"2560\" \/>\n\t<meta property=\"og:image:height\" content=\"1920\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Darya Sinusoid\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Darya Sinusoid\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"14 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/\"},\"author\":{\"name\":\"Darya Sinusoid\",\"@id\":\"https:\/\/www.shortform.com\/blog\/#\/schema\/person\/0421cce75bc249b11e2517b3a91f9c46\"},\"headline\":\"The Challenge of Measuring Forecast Accuracy\",\"datePublished\":\"2021-02-13T13:39:21+00:00\",\"dateModified\":\"2021-02-24T23:58:36+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/\"},\"wordCount\":2856,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/www.shortform.com\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.shortform.com\/blog\/wp-content\/uploads\/2020\/04\/hopeful-breath-air-scaled.jpg\",\"keywords\":[\"Superforecasting\"],\"articleSection\":[\"Statistics\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/\",\"url\":\"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/\",\"name\":\"The Challenge of Measuring Forecast Accuracy - Shortform Books\",\"isPartOf\":{\"@id\":\"https:\/\/www.shortform.com\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.shortform.com\/blog\/wp-content\/uploads\/2020\/04\/hopeful-breath-air-scaled.jpg\",\"datePublished\":\"2021-02-13T13:39:21+00:00\",\"dateModified\":\"2021-02-24T23:58:36+00:00\",\"description\":\"Given all the ways our brains work against us, forecasting accurately is hard. But evaluating forecast accuracy is a challenge of its own.\",\"breadcrumb\":{\"@id\":\"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/#primaryimage\",\"url\":\"https:\/\/www.shortform.com\/blog\/wp-content\/uploads\/2020\/04\/hopeful-breath-air-scaled.jpg\",\"contentUrl\":\"https:\/\/www.shortform.com\/blog\/wp-content\/uploads\/2020\/04\/hopeful-breath-air-scaled.jpg\",\"width\":2560,\"height\":1920},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.shortform.com\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"The Challenge of Measuring Forecast Accuracy\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.shortform.com\/blog\/#website\",\"url\":\"https:\/\/www.shortform.com\/blog\/\",\"name\":\"Shortform Books\",\"description\":\"The World&#039;s Best Book Summaries\",\"publisher\":{\"@id\":\"https:\/\/www.shortform.com\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.shortform.com\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.shortform.com\/blog\/#organization\",\"name\":\"Shortform Books\",\"url\":\"https:\/\/www.shortform.com\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.shortform.com\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.shortform.com\/blog\/wp-content\/uploads\/2019\/06\/logo-equilateral-with-text-no-bg.png\",\"contentUrl\":\"https:\/\/www.shortform.com\/blog\/wp-content\/uploads\/2019\/06\/logo-equilateral-with-text-no-bg.png\",\"width\":500,\"height\":74,\"caption\":\"Shortform Books\"},\"image\":{\"@id\":\"https:\/\/www.shortform.com\/blog\/#\/schema\/logo\/image\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.shortform.com\/blog\/#\/schema\/person\/0421cce75bc249b11e2517b3a91f9c46\",\"name\":\"Darya Sinusoid\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.shortform.com\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/www.shortform.com\/blog\/wp-content\/uploads\/2021\/07\/Untitled-design-1.png\",\"contentUrl\":\"https:\/\/www.shortform.com\/blog\/wp-content\/uploads\/2021\/07\/Untitled-design-1.png\",\"caption\":\"Darya Sinusoid\"},\"description\":\"Darya\u2019s love for reading started with fantasy novels (The LOTR trilogy is still her all-time-favorite). Growing up, however, she found herself transitioning to non-fiction, psychological, and self-help books. She has a degree in Psychology and a deep passion for the subject. She likes reading research-informed books that distill the workings of the human brain\/mind\/consciousness and thinking of ways to apply the insights to her own life. Some of her favorites include Thinking, Fast and Slow, How We Decide, and The Wisdom of the Enneagram.\",\"url\":\"https:\/\/www.shortform.com\/blog\/author\/darya\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"The Challenge of Measuring Forecast Accuracy - Shortform Books","description":"Given all the ways our brains work against us, forecasting accurately is hard. But evaluating forecast accuracy is a challenge of its own.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/","og_locale":"en_US","og_type":"article","og_title":"The Challenge of Measuring Forecast Accuracy","og_description":"Given all the ways our brains work against us, forecasting accurately is hard. But evaluating forecast accuracy is a challenge of its own.","og_url":"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/","og_site_name":"Shortform Books","article_published_time":"2021-02-13T13:39:21+00:00","article_modified_time":"2021-02-24T23:58:36+00:00","og_image":[{"width":2560,"height":1920,"url":"https:\/\/s3.amazonaws.com\/wordpress.shortform.com\/blog\/wp-content\/uploads\/2020\/04\/hopeful-breath-air-scaled.jpg","type":"image\/jpeg"}],"author":"Darya Sinusoid","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Darya Sinusoid","Est. reading time":"14 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/#article","isPartOf":{"@id":"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/"},"author":{"name":"Darya Sinusoid","@id":"https:\/\/www.shortform.com\/blog\/#\/schema\/person\/0421cce75bc249b11e2517b3a91f9c46"},"headline":"The Challenge of Measuring Forecast Accuracy","datePublished":"2021-02-13T13:39:21+00:00","dateModified":"2021-02-24T23:58:36+00:00","mainEntityOfPage":{"@id":"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/"},"wordCount":2856,"commentCount":0,"publisher":{"@id":"https:\/\/www.shortform.com\/blog\/#organization"},"image":{"@id":"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/#primaryimage"},"thumbnailUrl":"https:\/\/www.shortform.com\/blog\/wp-content\/uploads\/2020\/04\/hopeful-breath-air-scaled.jpg","keywords":["Superforecasting"],"articleSection":["Statistics"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.shortform.com\/blog\/forecast-accuracy\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/","url":"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/","name":"The Challenge of Measuring Forecast Accuracy - Shortform Books","isPartOf":{"@id":"https:\/\/www.shortform.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/#primaryimage"},"image":{"@id":"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/#primaryimage"},"thumbnailUrl":"https:\/\/www.shortform.com\/blog\/wp-content\/uploads\/2020\/04\/hopeful-breath-air-scaled.jpg","datePublished":"2021-02-13T13:39:21+00:00","dateModified":"2021-02-24T23:58:36+00:00","description":"Given all the ways our brains work against us, forecasting accurately is hard. But evaluating forecast accuracy is a challenge of its own.","breadcrumb":{"@id":"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.shortform.com\/blog\/forecast-accuracy\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/#primaryimage","url":"https:\/\/www.shortform.com\/blog\/wp-content\/uploads\/2020\/04\/hopeful-breath-air-scaled.jpg","contentUrl":"https:\/\/www.shortform.com\/blog\/wp-content\/uploads\/2020\/04\/hopeful-breath-air-scaled.jpg","width":2560,"height":1920},{"@type":"BreadcrumbList","@id":"https:\/\/www.shortform.com\/blog\/forecast-accuracy\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.shortform.com\/blog\/"},{"@type":"ListItem","position":2,"name":"The Challenge of Measuring Forecast Accuracy"}]},{"@type":"WebSite","@id":"https:\/\/www.shortform.com\/blog\/#website","url":"https:\/\/www.shortform.com\/blog\/","name":"Shortform Books","description":"The World&#039;s Best Book Summaries","publisher":{"@id":"https:\/\/www.shortform.com\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.shortform.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.shortform.com\/blog\/#organization","name":"Shortform Books","url":"https:\/\/www.shortform.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.shortform.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.shortform.com\/blog\/wp-content\/uploads\/2019\/06\/logo-equilateral-with-text-no-bg.png","contentUrl":"https:\/\/www.shortform.com\/blog\/wp-content\/uploads\/2019\/06\/logo-equilateral-with-text-no-bg.png","width":500,"height":74,"caption":"Shortform Books"},"image":{"@id":"https:\/\/www.shortform.com\/blog\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/www.shortform.com\/blog\/#\/schema\/person\/0421cce75bc249b11e2517b3a91f9c46","name":"Darya Sinusoid","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.shortform.com\/blog\/#\/schema\/person\/image\/","url":"https:\/\/www.shortform.com\/blog\/wp-content\/uploads\/2021\/07\/Untitled-design-1.png","contentUrl":"https:\/\/www.shortform.com\/blog\/wp-content\/uploads\/2021\/07\/Untitled-design-1.png","caption":"Darya Sinusoid"},"description":"Darya\u2019s love for reading started with fantasy novels (The LOTR trilogy is still her all-time-favorite). Growing up, however, she found herself transitioning to non-fiction, psychological, and self-help books. She has a degree in Psychology and a deep passion for the subject. She likes reading research-informed books that distill the workings of the human brain\/mind\/consciousness and thinking of ways to apply the insights to her own life. Some of her favorites include Thinking, Fast and Slow, How We Decide, and The Wisdom of the Enneagram.","url":"https:\/\/www.shortform.com\/blog\/author\/darya\/"}]}},"jetpack_sharing_enabled":true,"jetpack_featured_media_url":"https:\/\/www.shortform.com\/blog\/wp-content\/uploads\/2020\/04\/hopeful-breath-air-scaled.jpg","_links":{"self":[{"href":"https:\/\/www.shortform.com\/blog\/wp-json\/wp\/v2\/posts\/26213","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.shortform.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.shortform.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.shortform.com\/blog\/wp-json\/wp\/v2\/users\/7"}],"replies":[{"embeddable":true,"href":"https:\/\/www.shortform.com\/blog\/wp-json\/wp\/v2\/comments?post=26213"}],"version-history":[{"count":5,"href":"https:\/\/www.shortform.com\/blog\/wp-json\/wp\/v2\/posts\/26213\/revisions"}],"predecessor-version":[{"id":27023,"href":"https:\/\/www.shortform.com\/blog\/wp-json\/wp\/v2\/posts\/26213\/revisions\/27023"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.shortform.com\/blog\/wp-json\/wp\/v2\/media\/8342"}],"wp:attachment":[{"href":"https:\/\/www.shortform.com\/blog\/wp-json\/wp\/v2\/media?parent=26213"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.shortform.com\/blog\/wp-json\/wp\/v2\/categories?post=26213"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.shortform.com\/blog\/wp-json\/wp\/v2\/tags?post=26213"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}