Abstract
Numerical inaccuracies caused by floating-point arithmetic, although often unimportant, can change the conclusions of an analysis. Computational accuracy is of increasing concern because the number of software packages has exploded as computers have evolved, and statistical software is increasingly written and used by non-statisticians who may not be aware of potential computational problems. To address this problem, the NIST Statistical Engineering Division (SED) developed the Statistical Reference Datasets (StRD) web site (http://www.itl.nist.gov/div898/strd/index.html), which provides datasets with certified values for assessing the numerical accuracy of software. Four areas of statistical computation were originally addressed: univariate statistics, linear regression, nonlinear regression, and analysis of variance. Recently, Markov chain Monte Carlo (MCMC) has become popular, opening a new area of intensive statistical computation. Despite its importance, the numerical accuracy of MCMC software is largely unknown. Using specific datasets, we demonstrate in this paper some of the anomalies in MCMC computations.