Ecological Archives E087-009-A3

Thomas W. Yee. 2006. Constrained additive ordination. Ecology 87:203–213.

Appendix C. The reduced-rank multinomial logit model and CQO.

 

The reduced-rank multinomial logit model

Although the article has proposed a new fast algorithm, there is yet another method for obtaining an (approximate) estimate of the constrained coefficients C for equal-tolerances Poisson data. It is shown in this Appendix that, for large Poisson data sets, fitting a reduced-rank multinomial logit model (RR-MLM) is approximately the same as fitting an equal-tolerances QRR-VGLM. For further details about the RR-MLM see Yee and Hastie (2003).

The advantage of fitting a RR-MLM is its speed (Tables A1 and A2 in Appendix A) when applied to large data sets. Another advantage is that it appears to be less prone to convergence to suboptimal solutions.

 

An RR-MLM example

We now illustrate the efficacy of a RR-MLM as an approximation to the equal-tolerances Poisson by comparing both on the hunting spiders data (see ter Braak (1986), Yee (2004)). We show it gives a good result despite the fact that 45.8% of Y are zero counts which convey little information on C. Furthermore, many of the species have low abundances.

The hunting spiders data set was collected in a Dutch dune area, and consists of abundances (numbers trapped over a 60 week period) of 12 species of hunting spiders and six environmental variables (water, bare sand, twigs, cover moss, cover herbs, and light reflection). There were $n=28$ sites and the environmental variables were standardized to zero mean and unit variance in this article.

The following code fits a rank-1 RR-MLM:


hsrrmlm1 = rrvglm(cbind(Alopacce, Alopcune, Alopfabr, Arctlute, Arctperi,
                        Auloalbi, Pardlugu, Pardmont, Pardnigr, Pardpull,
                        Trocterr, Zoraspin) ~
           WaterCon + BareSand + FallTwig + CoveMoss + CoveHerb + ReflLux,
           family = multinomial, data = hspider, Rank = 1)
The equal-tolerances Poisson model is stored in an object called etp1, and can be obtained as p1 in Yee (2004) but with the option EqualTolerance = TRUE. The two models have a different normalization for C, but we can make them comparable by applying a centering and scaling using scale():
>  round(t(scale(ccoef(hsrrmlm1))), digits=3)
   WaterCon BareSand FallTwig CoveMoss CoveHerb ReflLux
      0.017    0.894   -1.668    0.555   -0.638   0.839
> round(t(scale(ccoef(etp1))), digits=3)
   WaterCon BareSand FallTwig CoveMoss CoveHerb ReflLux
      -0.57    0.886   -1.464     0.51   -0.483    1.12
Apart from the coefficient for water content, all the others agree quite well. Thus the RR-MLM gives effectively the same result as the CQO model. The constrained coefficients of etp1 agree with CCA in their signs, and can be interpreted as a moisture gradient.

 

The RR-MLM and Poisson CQO model

In this section it is shown that fitting a reduced-rank multinomial logit model (RR-MLM) to a matrix of counts Y, assumed to be generated from $Y_{is} \sim$ Poisson $(\lambda_{is})$, gives the same result asymptotically as fitting an equal-tolerances Poisson CQO. The matrix of explanatory variables is X.

Heuristic Proof          Let $Y$ be the number of counts of $S$ species at site $i$ (we omit the subscript $i$ for simplicity). It is well known that

\begin{displaymath}
\mbox{\boldmath $Y$}\left\vert   \sum_{s=1}^S Y_{s}=n \right. \sim 
\mathrm{Multinomial}(n, p_{1},\ldots,p_{S})
\end{displaymath}

where $n = \sum\limits_{s=1}^S y_{s}$ is the total number of species at site $i$, and $p_{s} = \lambda_{s} / \sum\limits_{s=1}^S \lambda_{s}$. A Poisson CQO model implies $\log \lambda_{s} = \eta_s^*$ where $\eta_s^*$ are the linear predictors $\eta_s$ defined by Eq. 5.

A MLM can be written

\begin{eqnarray*}
\log \left( \frac{p_j}{p_S} \right) = 
\eta_j,     j=1,\ldots,S-1,
\end{eqnarray*}



where $p_j$ is the probability a count is of species $j$, or equivalently, the proportion of species at the site $i$ being of species $j$. Here, species $S$ is the baseline category. Then, from the $j$th row of Eq. 5,

\begin{eqnarray*}
\eta_j
&=&
\log \left( \frac{\lambda_{j}}{\lambda_{S}} \righ...
...^T (\mbox{\rm\bf D}_j - \mbox{\rm\bf D}_S) \mbox{\boldmath$\nu$}
\end{eqnarray*}



for some vectors $b$ and $a$. If an equal-tolerances assumption is made then the quadratic form vanishes, and the linear predictors $\eta_j$ are linear in $\nu$, i.e., a RR-MLM. Q.E.D.


The above proof shows another way of estimating the constrained coefficients C of an equal-tolerances Poisson CQO. A list of all of them to date are:

(i)
the Yee (2004) slow algorithm,

(ii)
the fast algorithm,

(iii)
using the initial values method (an approximate solution only but requires no iteration),

(iv)
fitting a RR-MLM.

The above result can be made more rigorous by making use of the well known Poisson-multinomial logit relationship (see Baker (1994)). See also related work: Section 3.9 of ter Braak and Smilauer (1998) and Chapter 1 of ter Braak (1996).

Simulations show the RR-MLM works well if the species abundances are high, there is a lot of Y data (i.e., large $n$ and/or $S$), and all species have an equal-tolerance. One characteristic about the RR-MLM is that sites with no species at all must be deleted ($n_i=0$ not allowed).

 

LITERATURE CITED
Baker, S. G. 1994. The multinomial-Poisson transformation. The Statistician 43:495–504.

ter Braak, C. J. F. 1986. Canonical correspondence analysis: A new eigenvector technique for multivariate direct gradient analysis. Ecology 67:1167–1179.

ter Braak, C. J. F. 1996. Unimodal Models to Relate Species to Environment. DLO-Agricultural Mathematics Group, Wageningen, The Netherlands.

ter Braak, C. J. F., and P. Smilauer. 1998. CANOCO Reference Manual and User's Guide to CANOCO for Windows: Software for Canonical Community Ordination (version 4). Microcomputer Power, Ithaca, New York, USA.

Yee, T. W. 2004. A new technique for maximum-likelihood canonical Gaussian ordination. Ecological Monographs 74:685–701.

Yee, T. W., and T. J. Hastie. 2003. Reduced-rank vector generalized linear models. Statistical Modelling 3:15–41.



[Back to E087-009]