Reference class forecasting

Reference class forecasting or comparison class forecasting is a method of predicting the future by looking at similar past situations and their outcomes. The theories behind reference class forecasting were developed by Daniel Kahneman and Amos Tversky. The theoretical work helped Kahneman win the Nobel Prize in Economics.

Reference class forecasting is so named as it predicts the outcome of a planned action based on actual outcomes in a reference class of similar actions to that being forecast.

Discussion of which reference class to use when forecasting a given situation is known as the reference class problem.

Overview

Kahneman and Tversky^[1]^[2] found that human judgment is generally optimistic due to overconfidence and insufficient consideration of distributional information about outcomes.

People tend to underestimate the costs, completion times, and risks of planned actions, whereas they tend to overestimate the benefits of those same actions. Such error is caused by actors taking an "inside view", where focus is on the constituents of the specific planned action instead of on the actual outcomes of similar ventures that have already been completed.

Kahneman and Tversky concluded that disregard of distributional information, i.e. risk, is perhaps the major source of error in forecasting. On that basis they recommended that forecasters "should therefore make every effort to frame the forecasting problem so as to facilitate utilizing all the distributional information that is available".^[2]^: 416 Using distributional information from previous ventures similar to the one being forecast is called taking an "outside view". Reference class forecasting is a method for taking an outside view on planned actions.

Reference class forecasting for a specific project involves the following three steps:

Identify a reference class of past, similar projects.
Establish a probability distribution for the selected reference class for the parameter that is being forecast.
Compare the specific project with the reference class distribution, in order to establish the most likely outcome for the specific project.

Reference class tennis

The reference class problem, also known as reference class tennis, is the discussion of which reference class to use when forecasting a given situation.

Suppose someone were trying to predict how long it would take to write a psychology textbook. Reference class tennis would involve debating whether we should take the average of all books (closest to an outside view), just all textbooks, or just all psychology textbooks (closest to an inside view).^[3]^[4]

Practical use in policy and planning

Whereas Kahneman and Tversky developed the theories of reference class forecasting, Flyvbjerg and COWI (2004) developed the method for its practical use in policy and planning, which was published as an official Guidance Document in June 2004 by the UK Department for Transport.^[5]

The first instance of reference class forecasting in practice is described in Flyvbjerg (2006).^[6] This forecast was part of a review of the Edinburgh Tram Line 2 business case, which was carried out in October 2004 by Ove Arup and Partners Scotland. At the time, the project was forecast to cost a total of £320 million, of which £64 million – or 25% – was allocated for contingency. Using the newly implemented reference class forecasting guidelines, Ove Arup and Partners Scotland calculated the 80th percentile value (i.e., 80% likelihood of staying within budget) for total capital costs to be £400 million, which equaled 57% contingency. Similarly, they calculated the 50th percentile value (i.e., 50% likelihood of staying within budget) to be £357 million, which equaled 40% contingency. The review further acknowledged that the reference class forecasts were likely to be too low because the guidelines recommended that the uplifts should be applied at the time of decision to build, which the project had not yet reached, and that the risks therefore would be substantially higher at this early business case stage. On this basis, the review concluded that the forecasted costs could have been underestimated. The Edinburgh Tram Line 2 opened three years late in May 2014 with a final outturn cost of £776 million, which equals £628 million in 2004-prices.^[7]

Since the Edinburgh forecast, reference class forecasting has been applied to numerous other projects in the UK, including the £15 (US$29) billion Crossrail project in London. After 2004, The Netherlands, Denmark, and Switzerland have also implemented various types of reference class forecasting.

Before this, in 2001 (updated in 2011), AACE International (the Association for the Advancement of Cost Engineering) included Estimate Validation as a distinct step in the recommended practice of Cost Estimating (Estimate Validation is equivalent to Reference class forecasting in that it calls for separate empirical-based evaluations to benchmark the base estimate):

The estimate should be benchmarked or validated against or compared to historical experience and/or past estimates of the enterprise and of competitive enterprises to check its appropriateness, competitiveness, and to identify improvement opportunities...Validation examines the estimate from a different perspective and using different metrics than are used in estimate preparation.^[8]

In the process industries (e.g., oil and gas, chemicals, mining, energy, etc. which tend to dominate AACE's membership), benchmarking (i.e., "outside view") of project cost estimates against the historical costs of completed projects of similar types, including probabilistic information, has a long history.^[9] A method combining reference class forecasting and competitive crowdsourcing, Human Forest, has also been used in the life sciences, to estimate the likelihood that vaccines and treatments will successfully progress through clinical trial phases.^[10]^[11]

In the book Anthropic Bias, philosopher Nick Bostrom described ways in which statistical reasoning about reference classes can be applied to answering scientific questions related to our existence in the universe, such as with the anthropic principle, the fine-tuned universe hypothesis and its possible explanations, the doomsday argument, the size of the universe and possibility of a multiverse, and thought experiments such as the sleeping beauty problem. Bostrom investigates how to reason when one suspects that evidence is biased by "observation selection effects", in other words, when the evidence presented has been pre-filtered by the condition that there was some appropriately positioned observer to "receive" the evidence.^[12]^[13]

Bostrom argues against the self-indication assumption (SIA), a term he uses to characterize some existing views, and introduces the self-sampling assumption (SSA): that you should think of yourself as if you were a random observer from a suitable reference class. He later refines SSA into using observer-moments instead of observers to address certain paradoxes in anthropic reasoning, formalized as the strong self-sampling assumption (SSSA): Each observer-moment should reason as if it were randomly selected from the class of all observer-moments in its reference class.^[14] These different assumptions are affected differently based on the choice of reference class. An application of the principle underlying SSSA (though this application is nowhere expressly articulated by Bostrom), is: If the minute in which you read this article is randomly selected from every minute in every human's lifespan, then (with 95% confidence) this event has occurred after the first 5% of human observer-moments. If the mean lifespan in the future is twice the historic mean lifespan, this implies 95% confidence that N < 10n (the average future human will account for twice the observer-moments of the average historic human). Therefore, the 95th percentile extinction-time estimate in this version is 4560 years.

References

^ Kahneman, Daniel; Tversky, Amos (1979). "Prospect Theory: An Analysis of Decision under Risk" (PDF). Econometrica. 47 (2): 263–291. CiteSeerX 10.1.1.407.1910. doi:10.2307/1914185. JSTOR 1914185.
^ ^a ^b Kahneman, Daniel; Tversky, Amos (1977). "Intuitive prediction: Biases and corrective procedures" (PDF). Archived (PDF) from the original on September 8, 2013. {{cite journal}}: Cite journal requires |journal= (help) Decision Research Technical Report PTR-1042-77-6. In Kahneman, Daniel; Tversky, Amos (1982). "Intuitive prediction: Biases and corrective procedures". In Kahneman, Daniel; Slovic, Paul; Tversky, Amos (eds.). Judgment Under Uncertainty: Heuristics and Biases. pp. 414–421. doi:10.1017/CBO9780511809477.031. ISBN 9780511809477.
^ "Week 10: Reference Class Forecasting". Conceptually. Retrieved 20 April 2017.
^ "Outside view". LessWrong Wiki. Retrieved 20 April 2017.
^ "Procedures for Dealing with Optimism Bias in Transport Planning" (PDF). The British Department for Transport. June 2004. Retrieved 23 July 2021.
^ Flyvbjerg, Bent (2006). "From Nobel Prize to Project Management: Getting Risks Right". Project Management Journal. 37 (3): 5–15. arXiv:1302.3642. Bibcode:2013arXiv1302.3642F. doi:10.1177/875697280603700302. S2CID 13203075. SSRN 2238013.
^ "Report for the Edinburgh Tram Inquiry" (PDF). February 2018. Retrieved 23 July 2021.
^ AACE International, "Total Cost Management Framework, Section 7.3, Cost Estimating and Budgeting", 2011. p. 147
^ Merrow, E. and Yarossi, M., "Assessing Project Cost and Schedule Risk", 1990 AACE Transactions. pp H.6.1-7
^ Atanasov, Pavel D.; Joseph, Regina; Feijoo, Felipe; Marshall, Max; Conway, Amanda; Siddiqui, Sauleh (2022-07-27). "Human Forest vs. Random Forest in Time-Sensitive COVID-19 Clinical Trial Prediction". Rochester, NY. doi:10.2139/ssrn.3981732. S2CID 245174381. SSRN 3981732. {{cite journal}}: Cite journal requires |journal= (help)
^ "NSF Award Search: Award # 1919333 - Human Forests versus Random Forest Models in Prediction". www.nsf.gov. Retrieved 2022-09-20.
^ "Anthropic Bias | anthropic-principle.com". www.anthropic-principle.com. Retrieved 2015-11-03.
^ Bostrom, Nick (2002). Anthropic bias: observation selection effects in science and philosophy. Studies in philosophy. New York: Routledge. ISBN 978-0-415-88394-8.
^ Bostrom, Nick (2005). "Self-Location and Observation Selection Theory". anthropic-principle.com. Retrieved 2023-07-02.

Bibliography

Lovallo, D; Kahneman, D (2003). "Delusions of success. How optimism undermines executives' decisions". Harvard Business Review. 81 (7): 56–63. PMID 12858711.
Flyvbjerg, Bent (2008). "Public Planning of Mega-Projects: Overestimation of Demand and Underestimation of Costs". In Priemus, Hugo; Flyvbjerg, Bent; van Wee, Bert (eds.). Decision-making on Mega-Projects. doi:10.4337/9781848440173.00014. ISBN 9781848440173.
Flyvbjerg, Bent (2011). "Over Budget, Over Time, Over and Over Again: Managing Major Projects". In Morris, Peter W. G; Pinto, Jeffrey K; Söderlund, Jonas (eds.). The Oxford Handbook of Project Management. doi:10.1093/oxfordhb/9780199563142.003.0014. ISBN 9780199563142.

[Kahneman_and_Tversky_1979-1] Kahneman, Daniel; Tversky, Amos (1979). "Prospect Theory: An Analysis of Decision under Risk" (PDF). Econometrica. 47 (2): 263–291. CiteSeerX 10.1.1.407.1910. doi:10.2307/1914185. JSTOR 1914185.

[Kahneman_and_Tversky_1977-2] Kahneman, Daniel; Tversky, Amos (1977). "Intuitive prediction: Biases and corrective procedures" (PDF). Archived (PDF) from the original on September 8, 2013. {{cite journal}}: Cite journal requires |journal= (help) Decision Research Technical Report PTR-1042-77-6. In Kahneman, Daniel; Tversky, Amos (1982). "Intuitive prediction: Biases and corrective procedures". In Kahneman, Daniel; Slovic, Paul; Tversky, Amos (eds.). Judgment Under Uncertainty: Heuristics and Biases. pp. 414–421. doi:10.1017/CBO9780511809477.031. ISBN 9780511809477.

[Conceptually-3] "Week 10: Reference Class Forecasting". Conceptually. Retrieved 20 April 2017.

[lw-4] "Outside view". LessWrong Wiki. Retrieved 20 April 2017.

[5] "Procedures for Dealing with Optimism Bias in Transport Planning" (PDF). The British Department for Transport. June 2004. Retrieved 23 July 2021.

[Flyvbjerg2006-6] Flyvbjerg, Bent (2006). "From Nobel Prize to Project Management: Getting Risks Right". Project Management Journal. 37 (3): 5–15. arXiv:1302.3642. Bibcode:2013arXiv1302.3642F. doi:10.1177/875697280603700302. S2CID 13203075. SSRN 2238013.

[7] "Report for the Edinburgh Tram Inquiry" (PDF). February 2018. Retrieved 23 July 2021.

[8] AACE International, "Total Cost Management Framework, Section 7.3, Cost Estimating and Budgeting", 2011. p. 147

[9] Merrow, E. and Yarossi, M., "Assessing Project Cost and Schedule Risk", 1990 AACE Transactions. pp H.6.1-7

[10] Atanasov, Pavel D.; Joseph, Regina; Feijoo, Felipe; Marshall, Max; Conway, Amanda; Siddiqui, Sauleh (2022-07-27). "Human Forest vs. Random Forest in Time-Sensitive COVID-19 Clinical Trial Prediction". Rochester, NY. doi:10.2139/ssrn.3981732. S2CID 245174381. SSRN 3981732. {{cite journal}}: Cite journal requires |journal= (help)

[11] "NSF Award Search: Award # 1919333 - Human Forests versus Random Forest Models in Prediction". www.nsf.gov. Retrieved 2022-09-20.

[12] "Anthropic Bias | anthropic-principle.com". www.anthropic-principle.com. Retrieved 2015-11-03.

[:0-13] Bostrom, Nick (2002). Anthropic bias: observation selection effects in science and philosophy. Studies in philosophy. New York: Routledge. ISBN 978-0-415-88394-8.

[14] Bostrom, Nick (2005). "Self-Location and Observation Selection Theory". anthropic-principle.com. Retrieved 2023-07-02.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

Overview

Reference class tennis

Practical use in policy and planning

See also

References

Bibliography