Supplemental Data I

PROTEIN CONSERVED DOMAIN STRUCTURES AND ALIGNMENTS WITH KNOWN PROTEINS

TaRSZ38

AtRSZ33 ACCESSION CAC03605

AUTHORS Lopato,S., Forstner,C., Kalyna,M., Hilscher,J., Langhammer,U., Indrapichate,K., Lorkovic,Z.J. and Barta,A.

TITLE Network of interactions of a novel plant-specific Arg/Ser-rich protein, atRSZ33, with atSC35-like splicing factors

JOURNAL J. Biol. Chem. 277 (42), 39989-39998 (2002)

PUBMED 12176998

1 50

AtRSZ33 (1) MPRYDDRYGNTRLYVGRLSSRTRTRDLERLFSRYGRVRDVDMKRDYAFVE

TaRSZ38 (1) MPRYDDRYGNARLYVGRLSSRTRSRDLEYLFSKYGRIREVELKRDYAFIE

51 100

AtRSZ33 (51) FGDPRDADDARHYLDGRDFDGSRITVEFSRGAPRGS-----RDFDSRGPP

TaRSZ38 (51) YSDPRDADEARYNLDGRDVDGSRIIVEFAKGVPRGSGGSREREYVGRGPP

101 150

AtRSZ33 (96) PGAGRCFNCGVDGHWARDCTAGDWKNKCYRCGERGHIERNCKNQPKKLRR

TaRSZ38 (101) PGTGRCFNCGIDGHWARDCKAGDWKNKCYRCGERGHIERNCQNSPRSLRR

151 200

AtRSZ33 (146) SGSYSRSPVRSRSPRRRRSPSRSLSRSGSYSRSRS------PVRRRERSV

TaRSZ38 (151) ERSYSRSPS----PRRGRARSRSYSRSRSYSRSRSRSYSESPRGRRTERD

201 250

AtRSZ33 (190) EERSR------SPKR------MDDSLSPR-----ARDRSPVLDDEGS-

TaRSZ38 (197) ERRSRSISYSRSPRRSLSPGGKEMDRSPTPDRSRSPRRSISPVAKDNGDS

251 300

AtRSZ33 (220) PKIIDGSPPPSPKLQKEVGSDRDGGSPQDNG-----RNSVVSPVVGAGGD

TaRSZ38 (247) PRGRETSRSPSDGYRSPVANGRSPRSPVNNGSPSPTRDNRASPSLRGNNG

301 338

AtRSZ33 (265) S------SKEDRSPVDDDYEPNRTSPRG-SESP-

TaRSZ38 (297) SPSPKGNGNGGSPSPRGNGDDDGRRGSGSPRGRSVSP-

Consensus positions: 61.5%; Identity positions: 54.4%
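The identity figure can be illustrated with a short Python sketch. It counts the alignment columns in which both rows carry the same residue and divides by the number of columns compared. The function name `percent_identity` and the choice of denominator (all columns, including gapped ones) are assumptions made for illustration. The alignment software used for this supplement may count positions differently, so the sketch is not expected to reproduce the reported percentages exactly.

```python
def percent_identity(row_a: str, row_b: str, gap: str = "-") -> tuple[int, int, float]:
    """Return (identical columns, total columns, percent identity) for two aligned rows.

    Assumption: a column counts as identical only when both rows hold the
    same residue and that residue is not a gap character.
    """
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same length")
    identical = sum(1 for a, b in zip(row_a, row_b) if a == b and a != gap)
    columns = len(row_a)
    percent = 100.0 * identical / columns if columns else 0.0
    return identical, columns, percent


# Alignment columns 1-50 of AtRSZ33 and TaRSZ38, copied from the first block above.
at_rsz33 = "MPRYDDRYGNTRLYVGRLSSRTRTRDLERLFSRYGRVRDVDMKRDYAFVE"
ta_rsz38 = "MPRYDDRYGNARLYVGRLSSRTRSRDLEYLFSKYGRIREVELKRDYAFIE"

identical, columns, percent = percent_identity(at_rsz33, ta_rsz38)
print(f"{identical}/{columns} identical columns ({percent:.1f}%)")
```

For these first 50 columns the sketch reports 41 of 50 identical columns. The whole-alignment figure is lower because the C-terminal Arg/Ser-rich region is far less conserved.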

TaDRH1 (partial)

AtDRH1 ACCESSION BAA28347

AUTHORS Okanami,M., Meshi,T. and Iwabuchi,M.

TITLE Characterization of a DEAD box ATPase/RNA helicase protein of Arabidopsis thaliana

JOURNAL Nucleic Acids Res. 26 (11), 2638-2643 (1998)

NtDB10 ACCESSION P46942

AUTHORS Itadani,H., Sugita,M. and Sugiura,M.

TITLE Structure and expression of a cDNA encoding an RNA helicase-like protein in tobacco

JOURNAL Plant Mol. Biol. 24 (1), 249-252 (1994)

Hsp72 ACCESSION AAC50787

AUTHORS Lamm,G.M., Nicol,S.M., Fuller-Pace,F.V. and Lamond,A.I.

TITLE p72: a human nuclear DEAD box protein highly related to p68

JOURNAL Nucleic Acids Res. 24 (19), 3739-3747 (1996)

1 50

AtDRH1 (1) -MAATAAASVVRYAPEDHTLPKPWKGLIDDRTGYLYFWNPETNVTQYEKP

Hsp72 (1) ------MRGGGFGDRDRDRDRGGFGARGGG-G------

NtDB10 (1) MAVVTASSAGPSYAPEDPTLPKPWKGLVDGTTGFIYFWNPETNDTQYERP

TaDRH1_partial (1) ------

51 100

AtDRH1 (50) TPSLPPKFSPAVSVSSSVQVQQTDAYAPPKDDDKYSRGSERVSRFSEGGR

Hsp72 (26) ------LPPK--KFG------

NtDB10 (51) VPSSHAVSAPAH-KSS---VFVSSSVE--KPSQGQ------RYDADGG

TaDRH1_partial (1) ------

101 150

AtDRH1 (100) SGPPYSNGAANGVGDSAYGAASTRVPLPSSAPASELSPEAYSRRHEITVS

Hsp72 (33) NPGERLRKKKWDLSELPKFEKNFYVEHPEVARLTPYEVDELRRKKEITVR

NtDB10 (87) HNRGSNNKIARSSSDRFHDGTSVHEGYGSLGVGSDISQESYCRRNEISVT

TaDRH1_partial (1) ------

151 200

AtDRH1 (150) GGQVP-PPLMSFEATGFPPELLREVLSAGFSAPTPIQAQSWPIAMQGRDI

Hsp72 (83) GGDVCPKPVFAFHHANFPQYVMDVLMDQHFTEPTPIQCQGFPLALSGRDM

NtDB10 (137) GGDVP-APLTSFEATGFPSEIVREMHQAGFSAPTPIQAQSWPIALQGRDI

TaDRH1_partial (1) ------

201 250

AtDRH1 (199) VAIAKTGSGKTLGYLIPGFLHLQRIRNDSR-MGPTILVLSPTRELATQIQ

Hsp72 (133) VGIAQTGSGKTLAYLLPAIVHINHQPYLERGDGPICLVLAPTRELAQQVQ

NtDB10 (186) VAIAKTGSGKTLGYLMPAFIHLQQRRKNPQ-LGPTILVLSPTRELATQIQ

TaDRH1_partial (1) ------

251 300

AtDRH1 (248) EEAVKFGRSSRISCTCLYGGAPKGPQLRDLERGADIVVATPGRLNDILEM

Hsp72 (183) QVADDYGKCSRLKSTCIYGGAPKGPQIRDLERGVEICIATPGRLIDFLES

NtDB10 (235) AEAVKFGKSSRISCTCLYGGAPKGPQLRELSRGVDIVVATPGRLNDILEM

TaDRH1_partial (1) ------GAPKGPQLRDLERGADIVVATPGRLNDILEM

301 350

AtDRH1 (298) RRISLRQISYLVLDEADRMLDMGFEPQIRKIVKEIPTKRQTLMYTATWPK

Hsp72 (233) GKTNLRRCTYLVLDEADRMLDMGFEPQIRKIVDQIRPDRQTLMWSATWPK

NtDB10 (285) RRVSLGQVSYLVLDEADRMLDMGFEPQIRKIVKEVPVQRQTLMYTATWPK

TaDRH1_partial (32) GKVSLRQVAYLVLDEADRMLDMGFEPQIRKIVKQVQPKRQTLMFTATWPR

351 400

AtDRH1 (348) GVRKIAADLLVNPAQVNIGNVDELVANKSITQHIEVVAPMEKQRRLEQIL

Hsp72 (283) EVRQLAEDFLRDYTQINVGN-LELSANHNILQIVDVCMESEKDHKLIQLM

NtDB10 (335) GVRKIAADLLVNSVQVNIGNVDELVANKSITQHIEVVLPMEKQRRVEQIL

TaDRH1_partial (82) EVRKIASDLLTNPVQVNIGNTDELVANKSITQHVEVTTSFEKGRRLDQIL

401 450

AtDRH1 (398) RSQEPG--SKVIIFCSTKRMCDQLTRNLTRQ-FGAAAIHGDKSQPERDNV

Hsp72 (332) EEIMAEKENKTIIFVETKRRCDDLTRRMRRDGWPAMCIHGDKSQPERDWV

NtDB10 (385) RSKEPG--SKIIIFCSTKKMCDQLSRNLTRN-FGAAAIHGDKSQGERDYV

TaDRH1_partial (132) RQQEPG--SKVIIFCSTKRMCDQLSRNLSRQ-YGASAIHGDKSQAERDSV

451 500

AtDRH1 (445) LNQFRSGRTPVLVATDVAARGLDVKDIRAVVNYDFPNGVEDYVHRIGRTG

Hsp72 (382) LNEFRSGKAPILIATDVASRGLDVEDVKFVINYDYPNSSEDYVHRIGRTA

NtDB10 (432) LSQFRAGRSPVLVATDVAARGLDIKDIRVVINYDFPTGIEDYVHRIGRTG

TaDRH1_partial (179) LSEFRTGRCPILVATDVAARGLDVKDIRVVVNYDFPTGVEDYVHRIGRTG

501 550

AtDRH1 (495) RAGATGQAFTFFGDQDSKHASDLIKILEGANQRVPPQIREMATR--GGGG

Hsp72 (432) RSTNKGTAYTFFTPGNLKQARELIKVLEEANQAINPKLMQLVDHRGGGGG

NtDB10 (482) RAGASGLAYTFFSDQDSKHALDLVKVLEGANQCVPTELRDMASR--GGGM

TaDRH1_partial (229) RAGATGIAYTFFCDQDSKYASDLVKILEGANQNVSPELRAMVGR--GGYG

551 600

AtDRH1 (543) MNKFSRWGPPSGGRG------R

Hsp72 (482) GGGRSRYRTTSSANNPNLMYQDECDRRLRGVKDGGRRDSASYRDRSETDR

NtDB10 (530) GRARNHWGSGPGGRGG------R

TaDRH1_partial (277) GRGPRRWASSNDSYGG------Q

601 650

AtDRH1 (559) GG------DSGYGGR--GSFASRDSRSSNGWGRERERSRSPERFNR

Hsp72 (532) AGYANGSGYGSPNSAFGAQAGQYTYGQGTYGAAAYGTSSYTAQEYGAGTY

NtDB10 (547) GGPY------NSSYVGRNGGHGYDRGSRDSDRYGHGTYNADAPRKRSR

TaDRH1_partial (294) GAYG------SQTRDGPSFQSSFNNSSSGNQYGGAPSFHTSSSNNQTS

651 700

AtDRH1 (597) APPP-SSTGSPPRSFHETMMMKHR------

Hsp72 (582) GASSTTSTGRSSQSSSQQFSGIGRSGQQPQPLMSQQFAQPPGATNMIGYM

NtDB10 (589) SRSPNIGSGWSGKKSRFTD------

TaDRH1_partial (336) GAASLPASGGSGEGLSFHDRFYSGNSRGGDRARSRSPPKAVGVSNW----

701 719

AtDRH1 (620) ------

Hsp72 (632) GQTAYQYPPPPPPPPPSRK

NtDB10 (608) ------

TaDRH1_partial (382) ------

Consensus positions: 40.6%; Identity positions: 36.5% (for TaDRH1_partial and AtDRH1)

TaLuc7

HsLuc7b ACCESSION NP_958815

AUTHORS Tufarelli,C., Frischauf,A.M., Hardison,R., Flint,J. and Higgs,D.R.

TITLE Characterization of a widely expressed gene (LUC7-LIKE; LUC7L) defining the centromeric boundary of the human alpha-globin domain

JOURNAL Genomics 71 (3), 307-314 (2001)

ScLuc7p ACCESSION NP_010196

AUTHORS Jacq,C., Alt-Morbe,J., Andre,B., Arnold,W., Bahr,A., Ballesta,J.P., Bargues,M., Baron,L., Becker,A., Biteau,N., Blocker,H., Blugeon,C., Boskovic,J., Brandt,P., Bruckner,M., Buitrago,M.J., Coster,F., Delaveau,T., del Rey,F., Dujon,B., Eide,L.G., Garcia-Cantalejo,J.M., Goffeau,A., Gomez-Peris,A., Zaccaria,P. et al.

TITLE The nucleotide sequence of Saccharomyces cerevisiae chromosome IV

JOURNAL Nature 387 (6632 Suppl), 75-78 (1997)

PUBMED 9169867

REFERENCE 2 (residues 1 to 261)

AUTHORS Goffeau,A., Barrell,B.G., Bussey,H., Davis,R.W., Dujon,B., Feldmann,H., Galibert,F., Hoheisel,J.D., Jacq,C., Johnston,M., Louis,E.J., Mewes,H.W., Murakami,Y., Philippsen,P., Tettelin,H. and Oliver,S.G.

TITLE Life with 6000 genes

JOURNAL Science 274 (5287), 546-547 (1996)

1 50

HsLuc7b (1) ----MSAQAQMRALLDQLMGTARDG-----DETRQRVKFTDDRVCKSHLL

ScLuc7p (1) MSTMSTPAAEQRKLVEQLMGRDFSFRHNRYSHQKRDLGLHDPKICKSYLV

TaLuc7 (1) ------MDAMRKQLDVLMGANRNG-----DVREVNRKYFDRDVCRLFLA

51 100

HsLuc7b (42) DCCPHDILAGTRMDLGECTKIHDLALRADYEIASKER--DLFFELDAMDH

ScLuc7p (51) GECPYDLFQGTKQSLGKCPQMHLTKHKIQYEREVKQGKTFPEFEREYLAI

TaLuc7 (39) GLCPHDLFQLTKMDLGPCPKVHSLQLRKDYEEVKAKG--TENFDRELEDM

101 150

HsLuc7b (90) LESFIAECDRRTELAKKRLAETQEEISAEVSA----KAEKVHELNEEIGK

ScLuc7p (101) LSRFVNECNGQISVALQNLKHTAEERMKIQQVT-----EELDVLDVRIGL

TaLuc7 (87) IDRLIVECERKIQRALKRLADEDAKAAIAISVSEVTQSEEVAQLSKEIKE

151 200

HsLuc7b (136) LLAKAEQLGAEGNVDESQKILMEVEKVRAK------

ScLuc7p (146) MGQEIDSLIRADEVSMGMLQSVKLQELISKR------

TaLuc7 (137) KMKEADIFDFEGKTDDKIKTMELVEELRSKRADLQATLLLDAFNKDRAAI

201 250

HsLuc7b (166) ------KKEAE

ScLuc7p (177) ------KEVAK

TaLuc7 (187) PQPIPPPQMATLPAPPPPDARTQELINEKLSKAEALGEQGMVEEAQKALE

251 300

HsLuc7b (171) EEYRNSMPASSFQQ------QKLRVCEVCSAYLGLHDND

ScLuc7p (182) RVRNITENVGQSAQ------QKLQVCEVCGAYLSRLDTD

TaLuc7 (237) EAEALKKLAAARQEPVPDPSKYSVADVRITDQKLRLCDICGAFLSVYDND

301 350

HsLuc7b (204) RRLADHFGGKLHLGFIQIREKLDQLRKTVAEKQEKRNQDRLRRREERERE

ScLuc7p (215) RRLADHFLGKIHLGYVKMREDYDRLMK------N------

TaLuc7 (287) RRLADHFGGKLHLGYMLIREKLKELQ------EERN----KKRTEKPED

351 400

HsLuc7b (254) ERLSRRSGSRTRDRRRSRSRDRRRRRSRSTSRERRKLSRSRSRDRHRRHR

ScLuc7p (243) -----NR---TTNASKTATTLPGRRFV------

TaLuc7 (326) DRRSREN-SRDRNGRASRDRDAERKDRVEPRDSRRDHDRDRDR-RHDRDR

401 450

HsLuc7b (304) SRSRSHSRGHRRASRDRSAKYKFSRERASREESWESGRSERGPPDWRLES

ScLuc7p (262) ------

TaLuc7 (374) RHDRDRDRDHDRSSRGREHDRDRRRERS------RS------

451 468

HsLuc7b (354) SNGKMASRRSEEKEAGEI

ScLuc7p (262) ------

TaLuc7 (404) ---RDRSRRHERY-----

Consensus positions: 38.7%; Identity positions: 30.0% (for TaLuc7 and ScLuc7p)

TaPrp38

ScPrp38p ACCESSION NP_011589

DEFINITION Unique component of the U4/U6.U5 tri-snRNP particle, dispensable for spliceosome assembly, but required for conformational changes which lead to catalytic activation of the spliceosome; Prp38p [Saccharomyces cerevisiae].

AUTHORS Tettelin,H., Agostoni Carbone,M.L., Albermann,K., Albers,M., Arroyo,J., Backes,U., Barreiros,T., Bertani,I., Bjourson,A.J., Bruckner,M., Bruschi,C.V., Carignani,G., Castagnoli,L., Cerdan,E., Clemente,M.L., Coblenz,A., Coglievina,M., Coissac,E., Defoor,E., Del Bino,S., Delius,H., Delneri,D., de Wergifosse,P., Dujon,B., Kleine,K. et al.

TITLE The nucleotide sequence of Saccharomyces cerevisiae chromosome VII

JOURNAL Nature 387 (6632 Suppl), 81-84 (1997)

PUBMED 9169869

AUTHORS Goffeau,A., Barrell,B.G., Bussey,H., Davis,R.W., Dujon,B., Feldmann,H., Galibert,F., Hoheisel,J.D., Jacq,C., Johnston,M., Louis,E.J., Mewes,H.W., Murakami,Y., Philippsen,P., Tettelin,H. and Oliver,S.G.

TITLE Life with 6000 genes

JOURNAL Science 274 (5287), 546-547 (1996)

PUBMED 8849441

HsPrp38 ACCESSION NP_116253

AUTHORS Beausoleil,S.A., Jedrychowski,M., Schwartz,D., Elias,J.E., Villen,J., Li,J., Cohn,M.A., Cantley,L.C. and Gygi,S.P.

TITLE Large-scale characterization of HeLa cell nuclear phosphoproteins

JOURNAL Proc. Natl. Acad. Sci. U.S.A. 101 (33), 12130-12135 (2004)

PUBMED 15302935

1 50

HsPrp38 (1) -----MANRTVKDAHSIHGTNPQYLVEKIIRTRIYESKYWKEECFG--LT

ScPrp38p (1) MAVNEFQVESNISPKQLNNQSVSLVIPRLTRDKIHNSMYYKVNLSNESLR

TaPrp38 (1) -----MANRTDPRARSIHGTNPQNLVEKIVRAKIYQSNYWKEQCFG--LT

51 100

HsPrp38 (44) AELVVDKAMELRFVGGVYGGN------IKPTPFLCLTLKMLQIQP---

ScPrp38p (51) GNTMVELLKVMIGAFGTIKGQNGHLHMMVLGGIEFKCILMKLIEIRPNFQ

TaPrp38 (44) AETLVDKAMELDYTGGTHGGN------RRPTPFLCLALKMLQIQP---

101 150

HsPrp38 (83) --EKDIIVEFIKNEDFKYVRMLGALYMRLTGTAIDCYKYLEPLYNDYRKI

ScPrp38p (101) QLNFLLNVKNENGFDSKYIIALLLVYARLQYYYLNGNNKNDDDENDLIKL

TaPrp38 (83) --DKEIVVEFIKDEDYKYVRVLGAFYLRLTGTVADVYQYLEPLYNDYRKI

151 200

HsPrp38 (131) K------SQNRNGEFELMHVDEFIDELLHSER

ScPrp38p (151) FKVQLYKYSQHYFKLKSFPLQVDCFAHSYNEELCIIHIDELVDWLATQDH

TaPrp38 (131) R------QKLSDGKFTLTHVDEFIDELLTKDY

201 250

HsPrp38 (157) VCDIILPRLQKRYVLEEAEQLEPRVSALEEDMDDVESSEEE------

ScPrp38p (201) IWGIPLGKCQWNKIYNSDEESSSSESESNGDSEDDNDTSSES------

TaPrp38 (157) SCGTALPRIQKRWILEASGTLEPRRSALEDDFEEEEEDKEDGQPMDVDEP

251 300

HsPrp38 (198) --EEEDEKLERVPSPDHRR------RSYRDLDKPR-RSPTLRYRR

ScPrp38p (243) ------

TaPrp38 (207) NTHEKDHLRGRSPTKERDRERERDRDRKHERHHRDRDHDRDRDHDRDYGR

301 350

HsPrp38 (234) SRSRSPRRRSRSPKRRSPSPRRERHRSKSPRRHRSRSRDRR------

ScPrp38p (243) ------

TaPrp38 (257) GRERDRDRDRGRERDRERDRERDRHRIRDDDYHRDRDRDGRERERRDRDR

351 400

HsPrp38 (275) --HRSRSKSP------GHHRSHRHRSHSKSP------ERSK

ScPrp38p (243) ------

TaPrp38 (307) GRHRSRSGSRSRDRRERDREVGELRKRRGRGSASPPRGRAEDGPREEPKK

401 438

HsPrp38 (302) KSHKKSRRGNE------

ScPrp38p (243) ------

TaPrp38 (357) RKEKKEKKGSGNGPDPNDPEIIEMNKLRASIGLGPLK-

Consensus positions: 52.2%; Identity positions: 44.0% (for TaPrp38 and HsPrp38)

TaRSZ22

AtRSZ22 ACCESSION T05112

AUTHORS Golovkin,M. and Reddy,A.S.

TITLE The plant U1 small nuclear ribonucleoprotein particle 70K protein interacts with two novel serine/arginine-rich proteins

JOURNAL Plant Cell 10 (10), 1637-1648 (1998)

PUBMED 9761791

AtRSZp22 ACCESSION CAA05352

AUTHORS Lopato,S., Gattoni,R., Fabini,G., Stevenin,J. and Barta,A.

TITLE A novel family of plant splicing factors with a Zn knuckle motif: examination of RNA binding and splicing activities

JOURNAL Plant Mol. Biol. 39 (4), 761-773 (1999)

PUBMED 10350090

AtRSZp21 ACCESSION CAA05351

AUTHORS Lopato,S., Gattoni,R., Fabini,G., Stevenin,J. and Barta,A.

TITLE A novel family of plant splicing factors with a Zn knuckle motif: examination of RNA binding and splicing activities

JOURNAL Plant Mol. Biol. 39 (4), 761-773 (1999)

1 50

AtRSZ22 (1) MSRVYVGNLDPRVTERELEDEFRAFGVVRSVWVARRPPGYAFLDFEDPRD

AtRSZp21 (1) MTRVYVGNLDPRVTERELEDEFKAFGVLRNVWVARRPPGYAFLEFDDERD

AtRSZp22 (1) MSRVYVGNLDPRVTERELEDEFRAFGVVRSVWVARRPPGYAFLDFEDPRD

TaRSC22a (1) MARLYVGNLDARVTAGELEDEFRVFGILRSVWVARKPPGFAFIDFDDKRD

TaRSZ22 (1) MARVYVGSLDPAVTARELEDEFRVFGVLRSVWVARKPPGFAFVDFDDRRD

51 100

AtRSZ22 (51) ARDAIRALDGKNGWRVEQSHNRGERGGGGRGGDRGGGGGGRGGRGGSDLK

AtRSZp21 (51) ALDAISALDRKNGWRVELSHKDKG-GRGGGGGRRGG------IEDSK

AtRSZp22 (51) ARDAIRALDGKNGWRVAQSHNRGERGGGGRGGDRGGGGAGRGGRGGSDLK

TaRSC22a (51) AEDALRDLDGKNGWRVELSRNDRG-DRGGRGGGGGGGGGGRG-RGGSDMK

TaRSZ22 (51) AQDAIKDLDGKNGWRVELSRNASS-GRGGRDRSGG------SEMK

101 150

AtRSZ22 (101) CYECGETGHFARECRNRGG-----TGRRRSKSRSRTPP---RYRRSPSYG

AtRSZp21 (91) CYECGELGHFARECRRGRG-----SVRRRSPSPRRRRS------PDYGYA

AtRSZp22 (101) CYECGETGHFARECRNRGG-----TGRRRSKSRSRTPP---RYRRSPSYG

TaRSC22a (99) CYECGESGHFARECRLRIGAGGLGSGRRRSRSRSRSRS--PRYRRSPSYG

TaRSZ22 (89) CYECGESGHFARECRLRIGSGGLGSGRRRSRSRSRSRSRSPRYRRSPSYS

151 200

AtRSZ22 (143) RRSYSPRARSPPPPRRRSPSPPPARGRSYSRSPPPYRAREEVPYANGNGL

AtRSZp21 (130) RRSISPSGRRS-PPRRR--SVTPPR--RYSRSPPYRGSRRDSPRRRDSPY

AtRSZp22 (143) RRSYSPRARSPPPPRRRSPSPPPARGRSYSRSPPPYRAREEVPYANGNGL

TaRSC22a (147) RRSYSPRDR---SPRRRSASPAPARGRSYSKSP--VRARDDSPDAKG---

TaRSZ22 (139) RRSYS--R----SPRRR--SVSPARGRSVSRSP--VRGT------

201

AtRSZ22 (193) KERRRSRS--

AtRSZp21 (175) GRRSPYANGV

AtRSZp22 (193) KERRRSRS--

TaRSC22a (189) --YRRSRS--

TaRSZ22 (168) ------

Consensus positions: 76.7%; Identity positions: 68.0% (for TaRSZ22a and AtRSZ22)
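The number in parentheses at the start of each alignment row is the position of the first residue shown for that sequence, so it advances by the count of residues in the previous block rather than by the block width. A minimal Python sketch of that bookkeeping follows. The function name `next_block_start` is hypothetical, and the sketch assumes that `-` is the only gap character.

```python
def next_block_start(current_start: int, row: str, gap: str = "-") -> int:
    """Return the residue number that begins the next block of the same sequence.

    Only residues advance the counter; gap characters are skipped, so the
    result does not depend on how many gap characters the row contains.
    """
    residues = sum(1 for ch in row if ch != gap)
    return current_start + residues


# TaRSZ22 row from alignment columns 51-100, which starts at residue 51.
ta_rsz22_block = "AQDAIKDLDGKNGWRVELSRNASS-GRGGRDRSGG------SEMK"
print(next_block_start(51, ta_rsz22_block))
```

The row holds 38 residues, so the sketch prints 89, which matches the `(89)` label on the TaRSZ22 row in columns 101-150.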

TaRSZ38BP1

No conserved domains

No similar proteins from dicotyledonous plants

No sequence orthologues in rice

However, several wheat ESTs were identified:

gi|9420538|gb|BE422695.1| WHE0058_H05_P10ZS Wheat endosperm cDNA library Triticum aestivum cDNA clone WHE0058_H05_P10, mRNA sequence.

Length=592

Score = 179 bits (453), Expect = 2e-44

Identities = 86/93 (92%), Positives = 87/93 (93%), Gaps = 2/93 (2%)

Frame = -1

Query 387 MVGCCCCDPRVASPSQPMKYPAGNQQLTSISREANRICQTLVCQSGWTVTFNPRWCQRTL 446

MVGCCCCD RVA+PSQPMK PAGNQQLTSISREANRICQTLVCQSGWTVTFNPRWCQRTL

Sbjct 556 MVGCCCCDTRVANPSQPMKNPAGNQQLTSISREANRICQTLVCQSGWTVTFNPRWCQRTL 377

Query 447 TSVFDVGLGS--LPINRNDLLSLRFGCWSVLVL 477

TSVFDVGLGS L INRNDLLSLRF CWSVLVL

Sbjct 376 TSVFDVGLGSCYLSINRNDLLSLRFFCWSVLVL 278

Score = 28.9 bits (63), Expect = 2e-44

Identities = 11/11 (100%), Positives = 11/11 (100%), Gaps = 0/11 (0%)

Frame = -2

Query 375 VRVDGYLPDYP 385

VRVDGYLPDYP

Sbjct 591 VRVDGYLPDYP 559

gi|49520332|emb|AL814631.1| AL814631 h:116 Triticum aestivum cDNA clone G04_h116_plate_13, mRNA sequence.

Length=612

Score = 147 bits (372), Expect = 1e-34

Identities = 81/129 (62%), Positives = 89/129 (68%), Gaps = 19/129 (14%)

Frame = +3

Query 213 SGNFGTSLSPPSHPTTEYGASRRH------GTTEY------VLPEHAANVSR----G 253

SG+ G ++P +Y RR EY VLPE A+V+ G

Sbjct 225 SGSRGPYMNPRGSGPADYEMERRSVPLHHDVPNVEEYTGRPLNMVLPEGIASVNTYSILG 404

Query 254 PYMNPRGSGPADYEMERRSVPLHHDVPNVEEYTGQPLNMVLAEGIAPVNTYSLRGESPGA 313

PY+NPRGSGPADYEMERRSVPLHHDVPNVEEYTGQ LNMVLAEGIAPV+TYSLRGESPGA

Sbjct 405 PYINPRGSGPADYEMERRSVPLHHDVPNVEEYTGQALNMVLAEGIAPVDTYSLRGESPGA 584

Query 314 YGPGTDAGM 322

Y PGTDAGM

Sbjct 585 YRPGTDAGM 611

Score = 145 bits (366), Expect = 5e-43

Identities = 74/100 (74%), Positives = 80/100 (80%), Gaps = 3/100 (3%)

Frame = +3

Query 220 LSPPSH-PTTEYGASRRHGTTEYVLPEHAANVSRGPYMNPRGSGPADYEMERRSVPLHHD 278

+S P H TEYGASRRHG+ Y EHAA+ SRGPYMNPRGSGPADYEMERRSVPLHHD

Sbjct 135 VSVPLHIRPTEYGASRRHGSPPYQRSEHAASGSRGPYMNPRGSGPADYEMERRSVPLHHD 314

Query 279 VPNVEEYTGQPLNMVLAEGIAPVNTYSLRGE--SPGAYGP 316

VPNVEEYTG+PLNMVL EGIA VNTYS+ G +P GP

Sbjct 315 VPNVEEYTGRPLNMVLPEGIASVNTYSILGPYINPRGSGP 434

Score = 57.4 bits (137), Expect = 5e-43

Identities = 26/26 (100%), Positives = 26/26 (100%), Gaps = 0/26 (0%)

Frame = +2

Query 202 RRLSPQQSALPSGNFGTSLSPPSHPT 227

RRLSPQQSALPSGNFGTSLSPPSHPT

Sbjct 83 RRLSPQQSALPSGNFGTSLSPPSHPT 160
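The statistics lines in the hits above can be cross-checked with simple arithmetic. The Python sketch below is illustrative only, and both function names are hypothetical. It assumes the percentages are truncated to whole numbers, which is consistent with the figures listed here (87/93 is shown as 93% and 81/129 as 62%). It also assumes the subject row of the segment has no gaps, so a translated nucleotide subject should span three nucleotides per alignment column.

```python
def truncated_percent(count: int, columns: int) -> int:
    """Whole-number percentage obtained by integer division (truncation)."""
    return (100 * count) // columns


def subject_nucleotides(start: int, end: int) -> int:
    """Nucleotides covered by a subject range; works for either strand."""
    return abs(end - start) + 1


# First segment of the BE422695.1 hit: 93 alignment columns, Sbjct 556 down to 278.
columns = 93
print(truncated_percent(86, columns))   # identities
print(truncated_percent(87, columns))   # positives
print(subject_nucleotides(556, 278) == 3 * columns)
```

The sketch prints 92, 93 and True, in agreement with the values reported for that segment. The same check can be applied to the other segments by substituting their counts and coordinates.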

TaSRp30

ZmSRP31 ACCESSION AAU29336

AUTHORS Gao,H., Gordon-Kamm,W.J. and Lyznik,L.A.

TITLE ASF/SF2-like maize pre-mRNA splicing factors affect splice site utilization and their transcripts are alternatively spliced

JOURNAL Gene 339, 25-37 (2004)

PUBMED 15363843

AtSRp30 ACCESSION CAB42557

AUTHORS Lopato,S., Kalyna,M., Dorner,S., Kobayashi,R., Krainer,A.R. and Barta,A.

TITLE atSRp30, one of two SF2/ASF-like proteins from Arabidopsis thaliana, regulates splicing of specific plant genes

JOURNAL Genes Dev. 13 (8), 987-1001 (1999)

PUBMED 10215626

AtSR1 ACCESSION AAD52609

AUTHORS Lazar,G. and Goodman,H.M.

TITLE The Arabidopsis splicing factor SR1 is regulated by alternative splicing

JOURNAL Plant Mol. Biol. 42 (4), 571-581 (2000)

PUBMED 10809003

1 50

AtSR1 (1) MSSRSSRTVYVGNLPGDIREREVEDLFSKYGPVVQIDLKVPPRPPGYAFV

AtSRp30 (1) MSSRWNRTIYVGNLPGDIRKCEVEDLFYKYGPIVDIDLKIPPRPPGYAFV

TaSRp30 (1) MSRRNGRTIYVGNLPEDIREREIEDLFCKYGPIVDIDLKIPPRPPVYAFV

TaSRp30a (1) MSRRWSRTIYVGNLPGDIREREVEDLFYKYGRIVEIDLKVPPRPPGFAFV

ZmSRP31 (1) MTRRNGCTIYVGNLPGDIREREVDDLFYKYGRIVEIDLKIPPRPPGFAFV

51 100

AtSR1 (51) EFDDARDAEDAIHGRDGYDFDGHRLRVELAHGGRRSS-DDTRGSFNGGGR

AtSRp30 (51) EFEDPRDADDAIYGRDGYDFDGCRLRVEIAHGGRRFSPSVDRYSSSYSAS

TaSRp30 (51) EFEDPRDADDAIYGRDGYDFDGCKLRVELAHGGKGPS-FDRPNSYTSSGR

TaSRp30a (51) EFEDPRDAEDAIHGRDGYNFDGNRLRVELAHGGRANS-SSLPNSYGGGGR

ZmSRP31 (51) EFEDARDAEDAIYGRDGYNFDGHRLRVELAHGGRGTSSFDRSSSYSSAGQ

101 150

AtSR1 (100) GGGRGRGDGRGDGGSRGPSRRSEFRVLVTGLPSSASWQDLKDHMRKGGDV

AtSRp30 (101) RAP------SRRSDYRVLVTGLPPSASWQDLKDHMRKAGDV

TaSRp30 (100) RG------ALRRSDYRVIVTGLPSSASWQDLKDHMRRAGDV

TaSRp30a (100) RGG------VSRHTEYRVLVTGLPSSASWQDLKDHMRKAGDV

ZmSRP31 (101) RG------ASKRSDYRVMVTGLPSSASWQDLKDHMRRAGDV

151 200

AtSR1 (150) CFSQVYRDARGTTGVVDYTCYEDMKYALKKLDDTEFRNAFSNGYVRVREY

AtSRp30 (136) CFSEVFPDRKGMSGVVDYSNYDDMKYAIRKLDATEFRNAFSSAYIRVREY

TaSRp30 (135) CFSDVYPGAGAITGIVEFPNYEDMKHAIRKLDDSEFRNAFSRTYIRVREY

TaSRp30a (136) CFSEVYREGGGTIGIVDYTNYDDMKYAIRKLDDTEFKNAFSRAPIRVKEY

ZmSRP31 (136) CFTDVYREAGATIGIADYTNYEDMKHAIRKLDDSEFRNAFSRTYVRVREY

201 250

AtSR1 (200) DSRKDSRSPSRGRSYSKSRSRSRGRSVSRSRS------RSRSRSRSPKA

AtSRp30 (186) ESRSVSRSP-DDSKSYRSRSRSRGPSCSYS------SKSRS-VS

TaSRp30 (185) NARGS------RSYSRSRSRSCSYSRSRS------HSYSRSRSPRS

TaSRp30a (186) AGKSS------RSYSRSRSRSRSGSYSRSPSPKKKPSRRSASRSRSRSV

ZmSRP31 (186) DARRS------RSRSRGRNRSKSRSRSHS------HSYSRSRS-CS

251 300

AtSR1 (243) KSSRRSPAKSTSRSPGPRSKSRSPSPRRSRSRSRSPLPSVQKEGSKSPSK

AtSRp30 (222) PARSISPRS------RPLSRSRSLYSSVSRSQSRSKSRSRSRSNSPVS

TaSRp30 (219) SSRSLSPAAP---A---RDKSASRSPIRSRSLPRSQSPVKSE------

TaSRp30a (229) SSHSRSPSKERSPS---RSPAKSRSPVAASPVVNGEAASPKRDPSKSPSR

ZmSRP31 (219) YSKSRSPRS------RSASESKSPVKASCTTETRKLWPCWTISIHVGY

301 315

AtSR1 (293) PSPAKSPIHTRSPSR

AtSRp30 (264) PVISG------

TaSRp30 (255) ------

TaSRp30a (276) SRSPDAKSE------

ZmSRP31 (261) IFVC------

Consensus positions: 69.5%; Identity positions: 60.0% (for TaSRp30 and AtSRp30)

TaTra2

MpTra2-like ACCESSION JC7299

AUTHORS Nishiyama,R., Yamato,K.T., Miura,K., Sakaida,M., Okada,S., Kono,K., Takahama,M., Sone,T., Takenaka,M., Fukuzawa,H. and Ohyama,K.

TITLE Comparison of expressed sequence tags from male and female sexual organs of Marchantia polymorpha

JOURNAL DNA Res. 7 (3), 165-174 (2000)

PUBMED 10907846

NtTra2-like ACCESSION CAA70700

AUTHORS Petitot,A.S., Blein,J.P., Pugin,A. and Suty,L.

TITLE Cloning of two plant cDNAs encoding a beta-type proteasome subunit and a transformer-2-like SR-related protein: early induction of the corresponding genes in tobacco cells treated with cryptogein

JOURNAL Plant Mol. Biol. 35 (3), 261-269 (1997)

PUBMED 9349250

HsTra2 ACCESSION AAD19277

AUTHORS Beil,B., Screaton,G. and Stamm,S.

TITLE Molecular cloning of htra2-beta-1 and htra2-beta-2, two human homologs of tra-2 generated by alternative splicing

JOURNAL DNA Cell Biol. 16 (6), 679-690 (1997)

PUBMED 9212162

DmTra2 ACCESSION P19018

AUTHORS Heinrichs,V., Ryner,L.C. and Baker,B.S.

TITLE Regulation of sex-specific selection of fruitless 5' splice sites by transformer and transformer-2

JOURNAL Mol. Cell. Biol. 18 (1), 450-458 (1998)

PUBMED 9418892

1 50

DmTra2 (1) --MDREPLSSGRLHCSARYKHKRSASSSSAGTTSSGHKDRRSDYDYCGSR

HsTra2 (1) MSDSGEQNYGERESRSAS-RSGSAHGSGKSARHTPARSRSKEDSRRSRSK

MpTra2-like (1) ------SCFRVRECASLFPTYLVSQLSSWSVASIMADSPRHRSDRYSRSP

NtTra2-like (1) ------GFLVSWEMADSPRK---RYSRSP

TaTra2 (1) ---MSHSNEVRYTARSITPPADRDSGSKSPPPRRRAASKSPPPRRRAASK

51 100

DmTra2 (49) RHQ------RSSSRRRSRSRSSSESPPPEPRHRSGRS

HsTra2 (50) SRS------RSESRSRSRRSSRRHYTRSRSRSRSHRR

MpTra2-like (45) SPR------ERTERSRSRSPS---RRSPPPRERVE-R

NtTra2-like (21) SPW------EKNSRSKSPPES---YSRPRGRSRS--R

TaTra2 (48) SPPLPPPPPFPPMGVRTISRSPPPSSRRRSVSRSPPPKRRGRSRSRSRSR

101 150

DmTra2 (80) SRDRERMHK------SREHPQASRCIGVFGLNTNTS

HsTra2 (81) SRSRSYSRDYRRRHSHSHSPMSTRRRHVGNRANPDPNCCLGVFGLSLYTT

MpTra2-like (72) SRSRSPSR------RFRHDALNPGN--NLYVTGLSTRVN

NtTra2-like (47) SRSRSRSRS------RGRGEVSNPGN--TLYVTGLSTRVT

TaTra2 (98) NRSRSRSRD------KDDVRNPGN--NLYVTGLSTRTQ

151 200

DmTra2 (110) QHKVRELFNKYGPIERIQMVIDAQTQRSRGFCFIYFEKLSDARAAKDSCS

HsTra2 (131) ERDLREVFSKYGPIADVSIVYDQQSRRSRGFAFVYFENVDDAKEAKERAN

MpTra2-like (103) EKDLEEHFSREGKVLECRLVLDPRTRESRGFGFVTMEHLEGAERCIKYLN

NtTra2-like (79) ERDLEEHFSKEGKVKSVFLVVEPRSRISRGFAFITMDSLEDANRCIKHLN

TaTra2 (128) ETDLEKFFSKEGKVKDCRVVIDPRTKESRDFAFVTMENVEDARRCIKYLH

201 250

DmTra2 (160) GIEVDGRRIRVDFSITQRAHTPTPGVYLGRQPRGKAPRSFSPRRGRRVYH

HsTra2 (181) GMELDGRRIRVDFSITKRPHTPTPGIYMGRPTYGSSRRRDYYDRGYDRGY

MpTra2-like (153) RSTLEGRMITVEKAKRKRARTPTPGEYLGVRAMQSNRR---GGRGGGFRR

NtTra2-like (129) QSVLEGRYITVEKSRRKRARTPTPGHYLGLKNARGEGR---GDRGRYRDR

TaTra2 (178) RTVLEGRLISVAKAKRTRERTPTPGEYCGPRGGRSRVE---PRRSRSPRR

251 300

DmTra2 (210) DRSASPYDNYRDRYDYRN-----DRYDRNLRR-SPSRNRYTR------

HsTra2 (231) DDRDYYSRSYRGGGGGGGGWRAAQDRDQIYRRRSPSPYYSRG---G----

MpTra2-like (200) DARNSRHSPEYAPYSGG------RDRSPRY-SPYRGHQR------

NtTra2-like (176) EDYGYRRSPRHSPYRSR------RDYSPRR-SPYGEGQEGSVLGRILL

TaTra2 (225) SSRSSRDRS-RSPAARRD------RDSRDRKRD------

301 328

DmTra2 (246) ------NRSYSRSRSPQLRRTSSRY

HsTra2 (274) ------YRSRSRSRSYSPRRY------

MpTra2-like (232) ------DRSSSPQYAPYRR------

NtTra2-like (217) MQEAMLVVQDRSLSSPMNS------

TaTra2 (251) ------

Consensus positions: 26.9%; Identity positions: 21.7% (for TaTra2 and DmTra2)

TaTRN-SR

HsTRN-SR ACCESSION Q9Y5L0

AUTHORS Lai,M.C., Lin,R.I. and Tarn,W.Y.

TITLE Transportin-SR2 mediates nuclear import of phosphorylated SR proteins

JOURNAL Proc. Natl. Acad. Sci. U.S.A. 98 (18), 10154-10159 (2001)

PUBMED 11517331

ScMtr10p ACCESSION NP_014803

AUTHORS Dujon,B., Albermann,K., Aldea,M., Alexandraki,D., Ansorge,W., Arino,J., Benes,V., Bohn,C., Bolotin-Fukuhara,M., Bordonne,R., Boyer,J., Camasses,A., Casamayor,A., Casas,C., Cheret,G., Cziepluch,C., Daignan-Fornier,B., Dang,D.V., de Haan,M., Delius,H., Durand,P., Fairhead,C., Feldmann,H., Gaillon,L., Kleine,K. et al.

TITLE The nucleotide sequence of Saccharomyces cerevisiae chromosome XV

JOURNAL Nature 387 (6632 Suppl), 98-102 (1997)

PUBMED 9169874

1 50

HsTRN-SR (1) MEGAKPTLQLVYQAVQALYHDPDPSGKERASFWLGELQRSVHAWEISDQL

TaTRN-SR (1) --MEAQATATVKEALAALYHHPDDAIRAAADRWLQKFQHTLDAWQVADSL

ScMtr10p (1) --MDNLQVSDIETALQCISSTASQDDKNKALQFLEQFQRSTVAWSICNEI

51 100

HsTRN-SR (51) LQIR----QDVESCYFAAQTMKMKIQTSFYELPTDSHASLRDSLLTHIQN

TaTRN-SR (49) LHDES---SNLETLMFCSQTLRSKVQRDFEELPSEAFRPLQDSLYGLLKK

ScMtr10p (49) LSKEDPTNALLELNIFAAQTLRNKVTYDLSQL-ENNLPQFKDSLLTLLLS

101 150

HsTRN-SR (97) LKDLSPVIVTQLALAIADLALQMPSWKG----CVQTLVEKYSNDVTSLPF

TaTRN-SR (96) FNKGPPKVRTQICIAIAALAVHVPVEDWGGGGIVDWLGDEMKSQQEFIPS

ScMtr10p (98) HNQ--KLIITQLNVALARLAIQFLEWQN------PIFEIISLLNSSPSI

151 200

HsTRN-SR (143) LLEILTVLPEEVHSRSLRIGANRR--TEIIEDLAFYSSTVVSLLMTCVEK

TaTRN-SR (146) FLELLIILPQETSSYRIAARPERR--NQFENDLCSSANVALSLLTACLGF

ScMtr10p (139) LLNFLRILPEETLDIASTSLTEVEFNSRIHELIDPIAEDVLKFLVSCIDL

201 250

HsTRN-SR (191) ------AGTDEKMLMKVFRCLGSWFNLGVLDSNFMANNKLLALLFEVLQQ

TaTRN-SR (194) ------DELKEQVLEGFASWLR--FCHGITAATLASHPLVHTALSSLNTD

ScMtr10p (189) LQNTDGNSSSSISLEQILRCLNSWSYEFPVEQLLTVQPLINLVFETISNG

251 300

HsTRN-SR (235) DKTSSNLHEAASDCVCSALYAIENVETNLPLAMQLFQGVLTLET------

TaTRN-SR (236) QFLEAAVNVTSELIHFTVSRDSCGITEQFPLIQILIPHVMGLK------

ScMtr10p (239) NESDMEAFDSAIDCLCVILRESRDTTNEQLISALFHQLMLLQEKLLPTLF

301 350

HsTRN-SR (279) AYHMAVAREDLDKVLNYCRIFTELCETFLEKIVCTPGQGLGDLRTLELLL

TaTRN-SR (279) -EQLKDSSKDEEDVKAIARLFADMGDSYADLIATGSG---DAMQIVNALL

ScMtr10p (289) TDHPLNDEYDDDLLEGMTRLFVEAGEAWSVVISKNPD--FFKPMVLVLLM

351 400

HsTRN-SR (329) ICAGHPQYEVVEISFNFWYRLGEHLYKTNDEVIHG---IFKAYIQR----

TaTRN-SR (325) EVTSHSEFDISSMTFNFWHHLKRNLTVRDSYTSCGSEVSIEAERNRRMQL

ScMtr10p (337) LTCKNEDLDVVSYTFPFWFNFKQSLVLPRYQESRKAYSDIFVKLIN----

401 450

HsTRN-SR (372) ------LLHALARHCQLEPDHEGVPEETDDFGEFRMRVSDLVKDLIFL

TaTRN-SR (375) FRPPFEVLVSLVSSRVEYPEDYHTFSEEDRRDFRYARYAVSDVLLDATDV

ScMtr10p (383) ------GIITHLQYPSGQFSSKEEEDKFKDFRYHMGDVLKDCTAV

451 500

HsTRN-SR (414) IGSMECFAQLYSTLK------EG-NPPWEVTEAVLFIMAAIAKSVDPKKP

TaTRN-SR (425) LGGDSTLKILFMKLIQACGSGAEQNQNWQPLEAALFCIQAIAKSLSIEE-

ScMtr10p (422) VGTSEALSQPLIRIKSA----IENNNSWQIMEAPLFSLRTMAKEISLTE-

501 550

HsTRN-SR (457) FSNAACHHSLLFGQNITSEISNCEYLPPVLRENNPTLVEVLEGVVRLPET

TaTRN-SR (474) ------KEILPQVMPLLPRFPHQEQL

ScMtr10p (467) ------NTILPEIIKIICNLPEQAKI

551 600

HsTRN-SR (507) VHTAVRYTSIELVGEMSEVVDRNPQFLDPVLGYLMKGLCEKP------L

TaTRN-SR (494) LQTVC--STIGAFSKWIDAAPAELPILPPLVDILNKGMSTSED------T

ScMtr10p (487) RYAST---LVLGRYTEWTAKH--PELLEVQLQYIFNGFQLHEGSSDMQSI

601 650

HsTRN-SR (550) ASAAAKAIHNICSVCRDHMAQHFNGLLEIARSLDS----FLLSPEAAVGL

TaTRN-SR (536) AAAASVAFKYICEDCRGKFSGSLDGLFQIYHVAISGVGGYKVSSEDSLHL

ScMtr10p (532) ITASSHALMFFCSDCSKLLVGYIDQLINFFLNVQS-----SIDIESQFEL

651 700

HsTRN-SR (596) LKGTALVLARLPLDKITECLSELCSVQVMALKKLLSQ---EPSNGISSDP

TaTRN-SR (586) VEALSVVITTLPQDHARRALELICMPIINSLQEIIQQGESAPQQVPARHL

ScMtr10p (577) CQGLSAVINNQPEAKVSVIFQKLVDDNLRQIEALIPQWK-ANPTLLAPQI

701 750

HsTRN-SR (643) TVFLDRLAVIFRHTNPIVE--NGQTHPCQKVIQEIWPVLSETLNKH--RA

TaTRN-SR (636) TVHIDRLSTIFSNVK------LPEVVAEAVNRYWSTLKIIFDHR--AW

ScMtr10p (626) ADKIDLLYALFEELKPRYNYPQQGSEPLLPRIEFIWKALRTLLVDAGAMT

751 800

HsTRN-SR (689) DNRIVERCCRCLRFAVRCVGKGSAALLQPLVTQMVNVYHVHQHSCFLYLG

TaTRN-SR (676) DTRTMESLCRSCKFAVRTCGRSMGITIGAMLLEIQTLYQQHNQSCFLYLS

ScMtr10p (676) DSIIVERVAKLLRRIFERFHVFCEPILPSVAEFLIQGYLTTGFGSYLWCS

801 850

HsTRN-SR (739) SILVDEYGMEEG------CRQGLLDMLQALCIPTFQLLEQQNGLQNH

TaTRN-SR (726) SEVIKIFGSDPS------CASYLTCLIQTLFNHTIQLLRTIQDFTAR

ScMtr10p (726) GSLIVIFGDDESFPISPSLKDAVWKFALSQCETFILNFNKFDKLQLNDYH

851 900

HsTRN-SR (780) PDTVDDLFRLATRFIQRSPVTLLRSQVVIPILQWAIASTT-LDHRDANCS

TaTRN-SR (767) PDIADDCFLLASRCIRYCPDLFVPTEIFPRLVDCAMAGVT-IQHREACKS

ScMtr10p (776) EAIIDFFSLISDLIMFYPGAFLNSTELLGPVLNVALECVNKLDNYDAYIC

901 950

HsTRN-SR (829) VMRFLRDLIHTGVAND------HEEDFELRKELIGQVMNQLGQQLVSQL

TaTRN-SR (816) ILCFLSDTFDLAKS------PEGEKYRDLINTIVLQRGATLARIM

ScMtr10p (826) ILRCLDDIISWGFKTPPISTVSIEIVPDEWRKQVINEVVIAHGNQLILVL

951 1000

HsTRN-SR (872) LHTCCFCLPPYTLPDVAEVLWEIMQVDRPTFCRWLENSLKGLPKETTVGA

TaTRN-SR (855) IASLTGALPSGRLEEASYVLLSLNRAFGGNMLNWTRDCIALIPPQALTDS