Quote
"All models are wrong; some models are useful."
G
George E. P. BoxGeorge E. P. Box
George E. P. Box
George Edward Pelham Box was a British statistician, who worked in the areas of quality control, time-series analysis, design of experiments, and Bayesian inference. He has been called "one of the great statistical minds of the 20th century". His quote "All models are wrong but some are useful" has been widely discussed.
"All models are wrong; some models are useful."
"Statistical criteria should (1) be sensitive to change in the specific factors tested, (2) be insensitive to changes, of a magnitude likely to occur in practice, in extraneous factors."
"Since all models are wrong the scientist cannot obtain a "correct" one by excessive elaboration. On the contrary following William of Occam he should seek an economical description of natural phenomena. Just as the ability to devise simple but evocative models is the signature of the great scientist so overelaboration and overparameterization is often the mark of mediocrity."
"We have a large reservoir of engineers (and scientists) with a vast background of engineering know how. They need to learn statistical methods that can tap into the knowledge. Statistics used as a catalyst to engineering creation will, I believe, always result in the fastest and most economical progress…"
"One important idea is that science is a means whereby learning is achieved, not by mere theoretical speculation on the one hand, nor by the undirected accumulation of practical facts on the other, but rather by a motivated iteration between theory and practice."
"The researcher hoping to break new ground in the theory of experimental design should involve himself in the design of actual experiments. The investigator who hopes to revolutionize decision theory should observe and take part in the making of important decisions."
"For the theory-practice iteration to work, the scientist must be, as it were, mentally ambidextrous; fascinated equally on the one hand by possible meanings, theories, and tentative models to be induced from data and the practical reality of the real world, and on the other with the factual implications deducible from tentative theories, models and hypotheses."
"A man in daily muddy contact with field experiments could not be expected to have much faith in any direct assumption of independently distributed normal errors."
"The penalty for scientific irrelevance is, of course, that the statisticians work is ignored by the scientific community."
"An innovative discussion of building empirical models and the fitting of surfaces to data. Introduces the general philosophy of response surface methodology, and details least squares for response surface work, factorial designs at two levels, fitting second-order models, adequacy of estimation and the use of transformation, occurrence and elucidation of ridge systems, and more. Some results are presented for the first time. Includes real-life exercises, nearly all with solutions."
"A mechanistic model has the following advantages: 1. It contributes to our scientific understanding of the phenomenon under study. 2. It usually provides a better basis for extrapolation (at least to conditions worthy of further experimental investigation if not through the entire range of all input variables). 3. It tends to be parsimonious (i.e, frugal) in the use of parameters and to provide better estimates of the response"
"Remember that all models are wrong; the practical question is how wrong do they have to be to not be useful."