ELECTRONIC SUPPLEMENTARY MATERIAL

Nonlinear selection model

The Lande-Arnold approximation (Lande and Arnold 1983; see also Phillips and Arnold 1989) of selection on two quantitative traits is:

(1) w = α + βdxd + βvxv + ½ γdxd2 + ½ γvxv2 + γdvxdxv

where β and γ are the linear and non-linear selection coefficients, respectively, α is the intercept and w is fitness. The variable ‘x’, denotes the value of a trait in an individual and subscripts ‘d’, and ‘v’ are days to first flower and vegetative size at first flower, respectively. We consider only positive values of xd and xv since it is impossible to flower before day zero, or to have negative size. In Lythrum salicaria fitness effects for these two traits are additive, such that γdv = 0 in Eq. 1 (O’Neil 1997).

We approximated the fitness function for flowering time as a decelerating curve with a maximum fitness at xd ≤ 0 (i.e. no delay to flowering) and vegetative size as an accelerating function with a minimum at xv ≤ 0. Therefore, γd < 0 and βd ≤ 0, γv > 0 and βv ≥ 0. These two traits trade-off if there is a positive genetic correlation, in which case xd and xv can each be rescaled by their intercepts and variances to a common principal component (PC1); for simplicity we substitute PC1 into Eq. 1 and redefine α, β and γ for this new scale of measurement (i.e. standardized selection coefficients), simplifying Eq. 1 to:

(2) w = (βd + βv) PC1 + ½ (γd + γv) PC12

Assuming there is an optimum (θpc) for Eq. 2, fitness is maximized when:

(3) PC1 = - (βd + βv) / (γd + γv)

Southern populations of L. salicaria experience a longer growing season, so the marginal cost of flowering later will be smaller in the south, resulting in weaker stabilizing selection on flowering time, or γd,s < γd,n, where the subscripts ‘s’ and ‘n’ denote hypothetical southern and northern environments, respectively. Given that γd,s < γd,n, it is obvious that θpc,n < θpc,s, if we hold constant the selection coefficients for vegetative size (i.e. βv and γv), and the linear selection coefficient for flowering time (βd). Therefore, weaker stabilizing selection in the southern environment alone is sufficient to shift the optimum (θpc) towards smaller plants that flower earlier in northern populations.

In the absence of a mutation threshold preventing early flowering, the expected mean of a population is given by θpc (Eq. 3). This remains a good approximation in the presence of a threshold, provided θpc is larger than the threshold value (figure 1). The expected variance of PC1 is proportional to the curvature of the fitness function, effective population size, the distribution of mutational effects, and the rate of new mutations (Kimura 1965, Lande 1975, Turelli 1984). We assume that genetic variance for PC1 follows a Gaussian distribution and that the effects of population size and mutation do not correlate strongly with latitude. Variance of PC1 for northern vs. southern sites will be proportional to the x-intercepts of Eq. 2 (where PC1 > 0), which are inversely proportional to the strength of stabilizing selection. However, a threshold prevents small plants from flowering early, resulting in a positive skew that is inversely proportional to the difference between the optimum and the threshold size. That is, a population with an optimum much larger than the threshold will have a skew closer to zero. This occurs because the probability of observing the phenotypic value PC1 is zero for all values of PC1 below the threshold (figure 1).

Our predicted latitudinal trends in mean, variance and skew (hereafter ‘qualitative predictions’) are robust to a wide range of parameters and require only that:

(4) - (βd,n + βv,n) / (γd,n + γv,n) < - (βd,s + βv,s) / (γd,s + γv,s)

If βv and γv do not differ predictably with latitude then it is clear from Eq. 4 that our qualitative predictions hold wherever βd,n / γd,n > βd,s / γd,s. Recall that both γd and βd are less than zero; thus, in addition to stabilizing selection, stronger directional selection for early flowering will result in similar fitness functions and qualitative predictions (i.e. latitudinal clines). We have assumed no latitudinal change in selection on vegetative size. However, if there is a latitudinal trend it is likely towards a decrease in βv and γv as latitude increases because plants from southern populations experience a longer growing season, and therefore have more opportunity for vegetative growth and competitive interactions, increasing the marginal benefit of being large when they initiate flowering. A correlated decrease in βv and/or γv with increasing latitude would strengthen our predicted latitudinal gradients. Our results are also robust to the assumption of the form of selection on vegetative size. An optimum for Eq. 2 will only occur when γv < - γd. Since γd is negative in our model, Eq. 2 will have an optimum for all γv ≤ 0. This suggests that our assumption of an accelerating fitness function for vegetative size is conservative, since an optimum will also exist for any linear or decelerating fitness function. Therefore, from our interpretation of Eq. 4 it appears that our qualitative predictions are robust to a wide range of model assumptions.

Genetic estimates of variance and skew

We used BLUPs of population means and estimated the coefficient of skew from BLUP estimates of family means using the model:

PCijk = Bl + Pi+ Fj(Pi) + εijk

where Bl is the fixed effect of experimental block, population (P) i and seed family (F) j (nested within population i) are random factors, and ε is the error term of individual k. Variances among populations and among families were estimated directly by ReML; coefficient of skew was calculated from the best linear unbiased predictor (BLUP) of family means.

We also estimated variances among families for each population separately by ReML using the statistical model:

PCjk = Bl + Fj + εjk

and a separate analysis for each population. Latitudinal trends using genetic estimates support our phenotypic analysis (figure 3), with similar clines in the mean, variance and skew (see figure S1).

Scaling effects

Positive correlations between the mean and central moments (i.e. variance, skew and kurtosis) of morphological and life-history traits is primarily a problem for comparing statistics based on central moments (e.g. heritability) among different traits, or different species, where the underlying genetic architecture may differ significantly. The issue of scaling is less clear for comparisons of days to flower and vegetative size among populations of the same species, where differences are more likely due to variation in allele frequencies than genetic architecture. Scaling can occur for traits like biomass or flower number that are log-normal or Poisson-distributed if data are assumed to be Gaussian. However, there was no significant correlation between the mean and variance (n = 20, r = -0.065, p = 0.785), or between the mean and skew (n = 20, r = +0.047, p = 0.843) of PC2 allowing us to reject a general scaling relation in our data. Furthermore, a Box-Cox (Box & Cox 1964) transformation of traits (λ: days = -0.75; vegetative size = 0.25; inflorescence length = 0.5) stabilized the variance and skew of PC1 and still supported the results in table 1 and figure 2, with significant latitudinal gradients in the mean (n = 20, r = -0.764, p < 0.001), variance (n = 20, r = -0.484, p < 0.05) and skew (n = 20, r = +0.782, p < 0.001) of PC1. Therefore, statistical artefacts are unlikely to explain latitudinal gradients in the mean, variance and skew of PC1.

Controlled pollinations

All of our field experiments used open-pollinated seed families and therefore maternal effects could have contributed to the latitudinal clines in days to flower and vegetative size, as well as the genetic correlation between them. For example, seeds formed under cooler conditions, or under rapid fruit maturation at higher latitudes, could conceivably be of lower quality, resulting in earlier maturation at smaller size. Maternal effects should be most evident in seed size and at early life stages (Roach & Wulff 1987). However, seed size did not correlate with latitude (n = 22, r = +0.150, p = 0.500; Montague et al. 2008), and in the glasshouse latitude did not correlate with time to germination (n = 20, r = +0.275, p = 0.241) or seedling height at transplant (n = 20, r = -0.349, p = 0.132). The concordance between glasshouse and field studies (table 2) also suggests that plasticity in growth and flowering time across environments is minor relative to genetic differences among populations.

We directly tested for maternal effects and trans-generational plasticity using a diallel F1 crossing design. We also used these crosses to test the consistency of the genetic correlation estimated from open-pollinated seed families. We created ten F1 families of L. salicaria by randomly choosing one of five plants from a northern population and crossing it with one of five plants from a southern population. Each of these 10 parental plants represents an open-pollinated family used in the glasshouse and field experiments. Each plant was mated twice, once as a maternal plant and once as a pollen donor. We grew three offspring from each of the ten crosses. Our F1 crossing design is similar to a diallel in that each F1 family is crossed with itself and every other family. Because L. salicaria is tristylous successful crosses only occur between different floral morphs, limiting the number of total crosses to 88 of a possible 100 inter- and intra-family crosses.

We germinated 12 F2 seeds from each cross in 2cm x 2cm plug trays in early spring of 2007 and transplanted four offspring from each cross into a common garden at the Koffler Scientific Reserve at Jokers Hill in early June. We measured days to flower and vegetative size at flower in the first growing season using the same protocol as our other common garden studies (see methods). Our statistical method follows Fry (2004) and used restricted maximum likelihood (ReML) in SAS 9.1 (SAS Institute Inc., Cary, NC). For simplicity we assumed a diploid model. Since L. salicaria is an autotetraploid, this will underestimate the significance of digenic interactions (i.e. dominance variance) and additive variance, but should not underestimate maternal, paternal, or cyto-nuclear (i.e. maternal*paternal interactions) effects.

Results from our mixed model do not support maternal influences on days to flower (d.f. = 1, χ2 = 0.0, p = 0.963) or vegetative size at first flower (d.f. = 1, χ2 = 2.7, p = 0.102). ReML estimates of paternal effects and cytonuclear effects were near zero (χ2 < 0.01). Thus, our results do not support a strong effect of trans-generational plasticity and are more consistent with a genetic basis for latitudinal clines in mean, variance and skew of PC1 (and see further discussion in Montague et al 2008). In addition to examining the potential for trans-generational plasaticity, we tested for a genetic correlation between days to flower and vegetative size among F2 families. The Pearson product-moment correlation between these traits was highly significant across family means, where a family included reciprocal full-sibs (i.e. full-sibs with reversed ovule and pollen parents) (N = 34, R = 0.638, P < 0.001). Therefore, the genetic correlation between days to flower and vegetative size are the result of additive or dominant genes, with no detectable maternal effects.

Major genes and migration

Two other mechanisms could potentially explain the increase in skew and decrease in variance for PC1 in northern populations relative to southern populations. First, a change in allele frequencies for a few alleles of large effect would be qualitatively similar to a one locus, two-allele model, which can produce a correlated decline in the mean and variance with an increase in skew (Lynch and Walsh 1998). Second, a small proportion of long-distant migrants from southern populations could increase the skew of northern populations. However, the effects of migration on skew and variance should scale with each other, resulting in no change in the coefficient of skew, and the effect of major genes should be evident in frequency histograms of the phenotypic distributions within populations. The latitudinal clines observed for the coefficient of skew do not match the observed phenotype frequency histograms, which show continuous variation among PC1 phenotypes (figure S2). Variance among individuals from the five most northern population (VN = 1.56) was lower than that of southern populations (VS = 2.30), while the coefficient of skew was higher in the north (CSN = 0.64) compared to the south (CSS = 0.15). Removing rare individuals (i.e. bars with < 5 individuals in figure S2) had little effect on the relative variance (VN = 1.29, VS = 2.12) and skew (CSN = 0.35, CSS = 0.05). Our results suggest that a few alleles of large effect, or rare long-distance migration events, do not explain latitudinal changes in variance or skew of PC1 in our data.

References

Box, G. E. P. & Cox, D. R. 1964 An analysis of transformations. J. R. Stat. Soc. Series B 26, 211–243.

Fry, J. D. 2004 Estimation of genetic variances and covariances by restricted maximum likelihood using PROC MIXED. Genetic analysis of complex traits using SAS (A. M. Saxon, ed.), pp 11–34. Cary: SAS Institute.

Kimura, M. 1965 A stochastic model concerning the maintenance of genetic variability in quantitative characters. Proc. Natl. Acad. Sci. U.S.A. 54, 731–736.

Lande, R. 1975 The maintenance of genetic variability by mutation in a polygenic character with linked loci. Genet. Res. 26, 221–235.