Supplementary materials for:

“Evolutionary reshaping of fungal mating pathway scaffold proteins”.

Pierre Côte‡, Traian Sulea, Daniel Dignard, Cunle Wu and Malcolm Whiteway‡*.

Genetics Group, Biotechnology Research Institute,

National Research Council of Canada,

6100 Royalmount Ave., Montreal, Quebec, Canada H4P 2R2

‡ Department of Biology, McGill University, Montreal, Quebec, Canada H3A 1B1

* Corresponding author:

Dr. Malcolm Whiteway

Genetics Group,

Biotechnology Research Institute,

National Research Council of Canada,

6100 Royalmount Ave., Montreal, Quebec,

Canada H4P 2R2

Tel: 001-514-496-6146

Fax: 001-514-496-6213

Email:

Supplementary materials

Experimental procedures

Comparative sequence analysis, domain annotation, and structural homology modeling

Homologous protein sequences were retrieved by either BLASTP or TBLASTN search (1) in the Fungal genome database ( and by protein domain architecture search in the SMART database ( 2). Comparative sequence analysis of the assembled datasets (29 Far1-like, 19 Ste5-like and 31 Ste11-like sequences) was carried out with the MAFFT 6 program (3) for deriving multiple sequence alignments with the L-INS-i iterative refinement algorithm and to generate phylogenetic trees, and visualized using Jalview 2.4.0B2 (4). Complete set of all sequences, as well as their accession number, is available at

Structural domain detection for representative Far1-like and Ste5-like proteins was carried out at the Structure Prediction Meta Server ( which assembles state-of-the-art fold recognition methods, and provides a consensus sequence-to-structure scoring using the 3D-Jury meta-predictor (5). In order to confirm the available domain annotations (e.g., from InterPro and member databases, as well as previous reports of predicted domains in some family members (6, 7, 8), full-length sequences were queried. Iteratively, reliably detected domains were excised out and the flanking sequences were subjected to new rounds of fold detection. Finally, the excised sequences of detected domains were re-submitted to the Structure Prediction Meta Server to obtain the final template ranking, reliability indicators and query-to-template sequence alignments. Multiple-queries-multiple-templates sequence alignments were assembled using the (i) MAFFT multiple sequence alignment of the queries, (ii) 3D-Coffee structure-based sequence alignment of the templates (9), (iii) 3D-Jury consensus fold recognition based alignment between queries and templates, (iv) manual minor local improvements in the overall secondary structure alignment, and (v) 3D-structural alignment using the domain alternate fit in Swiss-PdbViewer v4.0.1 (10). Secondary structure predictions were based on three methods: PROFsec (11), PSI-PRED (12) and Jnet (13), from which a consensus was derived for each sequence by majority voting.

Potential nuclear localization signals/plasma membrane-binding (NLS/PM) basic motifs and kinase docking sites were inferred from the multiple sequence alignment at positions corresponding to the respective motifs confirmed in S. cerevisiae (14, 15, 16, 17). No attempt was made here to locate such motifs elsewhere in the homologous sequences.

Homology modeling of the C. albicans Ste11 kinase domain was done with the MODELLER 9v1 program (18). Suitable template structures were located by BLASTP searches ( against sequences in the PDB database ( using the C. albicans Ste11 kinase domain and activation loop sequences as queries. The structure of activated human PAK5 phosphorylated at Ser602 and bound to an ATP-competitive inhibitor (PDB code 2F57) was used as template (19). This template structure provides a good overall alignment of the kinase domain and also has suitable homology for constructing the activation loop comprising the acidic insert Asp705-Asp710 probed in this study. The model of C. albicans Ste11 kinase domain phosphorylated at Ser719 was refined by energy minimization using the AMBER force-field (20). An unphosphorylated model was also obtained in which the conformation of the Ser702-Thr714 sequence in the activation loop was refined in MODELLER for 1000 loop generation cycles.

Mapping of protein-protein interactions among Cst5 and other components of the mating pheromone pathway

To map protein-protein interactions among domains of the pheromone signaling pathway components, we used an alternative yeast two-hybrid system (Y2H) we developed recently for the detection of protein-protein interactions in the cytoplasm (detailed information of this Y2H system will be published elsewhere). Briefly, this yeast two-hybrid system is based on the interaction of Ste11p (MEKK) and Ste50p that is required for the HOG pathway activation and osmoadaptation, which is critical for the survival of yeast cells under hyperosmotic stress in the absence the two-component osmosensing branch (Figure 3A). The interaction of Ste11p and Ste50p through their respective SAM domains that is required to activate the HOG pathway can be replaced by interaction of other protein interacting modules (21). This property offers a unique potential to analyze bait-prey interactions by substituting them for the respective SAM domains and using the activation of the HOG pathway as a reporter (Figure 3B and 3C).

Plasmid pYL40 contains the fragment of STE11 encoding Ste11p lacking its SAM domain (aa 110-717) at the SalI siteof pGREG506 (22), and a HIS3 stuffer marker inserted at the SmaI site in front the Ste11SAM. Similarly, pYL45 contains the fragment of STE50 encoding Ste50p without its SAM domain (aa 115-346) at the SalI siteof pGREG503 (22), and a URA3 stuffer marker inserted at the SmaI site in front the Ste50SAM.

All candidate ORFs or their fragments were PCR amplified and cloned into the vector plasmids pYL40 and pYL45 at the SmaI site through in vivo recombination (IVR) in yeast strains YCW1476 and YCW1477. All the primers used for the PCR amplification reactions contain gene specific sequences and common sequences used for IVR in a layout as follows: 5’-ATTCTAGAGCGGCCGCACTAGTGGATCCCCCGGG-gene specific sequence (starting with ATG)-3’ for the forward orientation, and 5’-TCGATAAGCTTGATATCGAATTCCTGCAGCCCGGG-gene specific sequence (delete the stop codon)-3’ for the reverse orientation. To query the bait-prey interaction, IVR positive clones (stuffer marker negative with right inserts) of the baits in one of the two yeast strains were crossed to the IVR positive clones of the preys in the other yeast strain, mating products were selected and their ability to activate the HOG pathway measured by the ability to grow on hyperosmolarity media (Figure 3D&E).

References

1. Altschul,S.F. and Lipman,D.J. (1990). Protein database searches for multiple alignments. Proc. Natl. Acad. Sci. U. S. A 87, 5509-5513.

2. Letunic,I., Copley,R.R., Pils,B., Pinkert,S., Schultz,J. et al. (2006). SMART 5: domains in the context of genomes and networks. Nucleic Acids Res. 34, D257-D260.

3. Katoh,K., Kuma,K., Toh,H., and Miyata,T. (2005). MAFFT version 5: improvement in accuracy of multiple sequence alignment. Nucleic Acids Res. 33, 511-518.

4. Waterhouse,A.M., Procter,J.B., Martin,D.M., Clamp,M., and Barton,G.J. (2009). Jalview Version 2--a multiple sequence alignment editor and analysis workbench. Bioinformatics. 25, 1189-1191.

5. Ginalski,K. and Rychlewski,L. (2003). Detection of reliable and unexpected protein fold predictions using 3D-Jury. Nucleic Acids Res. 31, 3291-3292.

6. Garrenton,L.S., Young,S.L., and Thorner,J. (2006). Function of the MAPK scaffold protein, Ste5, requires a cryptic PH domain. Genes Dev. 20, 1946-1958.

7. Inouye,C., Dhillon,N., Durfee,T., Zambryski,P.C., and Thorner,J. (1997). Mutational analysis of STE5 in the yeast Saccharomyces cerevisiae: application of a differential interaction trap assay for examining protein-protein interactions. Genetics 147, 479-492.

8. Wiget,P., Shimada,Y., Butty,A.C., Bi,E., and Peter,M. (2004). Site-specific regulation of the GEF Cdc24p by the scaffold protein Far1p during yeast mating. EMBO J. 23, 1063-1074.

9. Armougom,F., Moretti,S., Poirot,O., Audic,S., Dumas,P. et al. (2006). Expresso: automatic incorporation of structural information in multiple sequence alignments using 3D-Coffee. Nucleic Acids Res. 34, W604-W608.

10. Guex,N. and Peitsch,M.C. (1997). SWISS-MODEL and the Swiss-PdbViewer: an environment for comparative protein modeling. Electrophoresis 18, 2714-2723.

11. Rost,B. (2001). Review: protein secondary structure prediction continues to rise. J. Struct. Biol. 134, 204-218.

12. Jones,D.T. (1999). Protein secondary structure prediction based on position-specific scoring matrices. J. Mol. Biol. 292, 195-202.

13. Cuff,J.A. and Barton,G.J. (2000). Application of multiple sequence alignment profiles to improve protein secondary structure prediction. Proteins 40, 502-511.

14. Bhattacharyya,R.P., Remenyi,A., Good,M.C., Bashor,C.J., Falick,A.M. et al. (2006). The Ste5 Scaffold Allosterically Modulates Signaling Output of the Yeast Mating Pathway. Science 311, 822-826.

15. Blondel,M., Alepuz,P.M., Huang,L.S., Shaham,S., Ammerer,G. et al. (1999). Nuclear export of Far1p in response to pheromones requires the export receptor Msn5p/Ste21p. Genes Dev. 13, 2284-2300.

16. Mahanty,S.K., Wang,Y., Farley,F.W., and Elion,E.A. (1999). Nuclear shuttling of yeast scaffold Ste5 is required for its recruitment to the plasma membrane and activation of the mating MAPK cascade. Cell 98, 501-512.

17. Remenyi,A., Good,M.C., Bhattacharyya,R.P., and Lim,W.A. (2005). The role of docking interactions in mediating signaling input, output, and discrimination in the yeast MAPK network. Mol. Cell 20, 951-962.

18. Marti-Renom,M.A., Stuart,A.C., Fiser,A., Sanchez,R., Melo,F. et al. (2000). Comparative protein structure modeling of genes and genomes. Annu. Rev. Biophys. Biomol. Struct. 29, 291-325.

19. Eswaran,J., Lee,W.H., Debreczeni,J.E., Filippakopoulos,P., Turnbull,A. et al. (2007). Crystal Structures of the p21-activated kinases PAK4, PAK5, and PAK6 reveal catalytic domain plasticity of active group II PAKs. Structure. 15, 201-213.

20. Cornell,W.D., Cieplak,P., Bayly,C.I., Gould,I.R., Merz,K.M. et al. (1995). A Second Generation Force Field for the Simulation of Proteins, Nucleic Acids, and Organic Molecules. Journal of the American Chemical Society 117, 5179-5197.

21. Wu,C., Jansen,G., Zhang,J., Thomas,D.Y., and Whiteway,M. (2006). Adaptor protein Ste50p links the Ste11p MEKK to the HOG pathway through plasma membrane association. Genes Dev. 20, 734-746.

22. Jansen,G., Wu,C., Schade,B., Thomas,D.Y., and Whiteway,M. (2005). Drag&Drop cloning in yeast. Gene 344, 43-51.