Ecological Archives
E087-051-A1
Otso Ovaskainen and Anna-Liisa Laine. 2006. Inferring evolutionary signals from ecological data in a plantpathogen metapopulation. Ecology 87:880891.
Appendix A. Technical details on parameter estimation.
The parameters of the model for the spread of the disease were estimated using a Bayesian approach through a Metropolis-Hastings-Gibbs algorithm in which each parameter was updated in turn, conditional on the values of the other parameters. The estimation scheme was programmed with Mathematica 5.0. and its validity was checked against simulated data. The convergence of the Markov chain was assessed by running five chains (100000 iterations each, out of which 10000 was treated as burn-in period) with overdispersed starting points (Gelman et al. 2004). The prior distributions and the methods by which the Markov chain was updated are described below separately for each parameter.
- The parameters d2 and
are positive, and we assumed a prior which is lognormal, the underlying normal distribution having mean log 0.1 and standard deviation 2. We used the Metropolis-Hastings algorithm with a jumping distribution which was lognormal with the current value as mean and standard deviation adjusted to give roughly a 30% accept ratio.
- The parameters h and g are restricted to the range [0,1], and we assumed a flat prior distribution on this range. As the beta distribution is conjugate to the binomial distribution, we expressed the prior as Beta(1,1), in which case the full-conditional posterior distribution for h is Beta(H + 1, P H + 1), where H =
i Hi and P =
i Pi are the total numbers of hosts and plants. We used the Gibbs sampler to update the value of h and the Metropolis-Hastings algorithm with logit-normal proposal distribution to update g.
- Missing data Ii,t. We assumed a prior distribution that is discrete uniform in [Ii,t1, Ii,t2], where t1 < t < t2 are the closest time steps for which data is available. If there were several missing time steps between t1 and t2, the priors for each Ii,t were taken to be independent for each t. The likelihood of the missing data is zero if Ii,t > Ii,t+1, which restricts the set of feasible prior distributions. We used the Metropolis-Hastings algorithm with a jumping distribution which jumps with probability f to a value drawn from the discrete uniform distribution in [Ii,t-1, Ii,t+1] (and is thus independent of the previous value of Ii,t) and stays at Ii,t with probability 1 f. The value of f was fixed to f = 0.05 to give a roughly 30% accept ratio. The joint distribution for missing data Ii,t was updated simultaneously for all i, but for a single t at a time.
- Number of hosts Hi. As the prior distribution we used the discrete uniform distribution in [Ii,tmax, Pi] (note that the likelihood of the data is zero if Pi < Ii,tmax). The jumping distribution was based on the discrete uniform distribution in [Ii,tmax, Pi]. The number of hosts was updated separately for each quadrat i.
Figure A1 illustrates the performance of the algorithm against simulated data assuming biologically reasonable parameter values. While the fraction of hosts becomes very accurately estimated in this data set (4% relative error in the 95% highest posterior density interval; HPDI), the estimation of the other parameters proved to be slightly more difficult (10%20% relative error in the 95% HPDIs).
|
| |
FIG. A1. The performance of the estimation scheme against simulated data. The dots represent the assumed parameter values ( = 0.06, h = 0.7, g = 0.5, d = 0.277) and the lines the estimated posterior distributions. The number of plants and the initial number of infected plants was set to correspond to the data from population 1 in the year 2001. |
LITERATURE CITED
Gelman, A., J. B. Carlin, H. S. Stern, and A. D. B. Rubin. 2004. Bayesian Data Analysis. Second edition. Chapman and Hall/CRC, Boca Raton, Florida, USA.
[Back to E087-051]