Amino acid sequence alignments of enzymes involved in the tryptophan biosynthesis pathway

Anthranilate synthase alpha subunit amino acid sequence alignment

69 351

Anav2 LDTPVSAWYKVYFLLESVEGIGRYSLLGDPLWILEAGQTPFTALPVKGGLFGFWGYELIRWIEHSQDERIPDGLWMQVDHLLIFDQVKRKIWAIADLAYQ

Nost3 LDTPVSAWYKVYFLLESVEGIGRYSLLGDPLWVLEAGQTPFTALPVKGGLFGFWGYELIRWIEHPQDERIPDGLWMQVDHLLIFDQVKRKIWAIADLAYQ

Syncys LETPVSAWYKVYFLLESVEGIGRYSFLGDPMWVLEAGQVPLDILPVNGGLFGVWGYELIRWMEYEPQPEPPDGIWMQVDHLLIFDQVKRKIWAIADLAYR

Glovi LETPVSAWYRVEFLLESVEGLGRYSFLGEPLWTLAAGQVPFAVLPVHGGLFGYWGYELIRWIEYS-DPDLPDGFWMQVDSLLIFDQIKRKIWVIADLAYR

Syncoc LETPVSAWYRVYFLLESVEGVARYSFLGDPLWVLEIGQRPFAILPVHGGLFGYWGYELIRWIEHPLQPGPPDAVLMQVDSILLFDQVKRKIWVVADTAYA

Silpo LDTPVSLMLKLDFMLESVTGRGRYSIIGKPDLIWRCGLNPLDNLIALAGLFGYLGYDMVRLVEVNPDPLLPDAVMMRPSVVAVLDGVKGEVTVVSWVAYA

Lokve LDTPVSLMLKLNFMLESVTGRGRYSIIGNPDLIWDCGVNPLDALINLAGLFGYLGYDMIRLVEVNPDPLLPDAIMLRPSVVAVLDGVKGDVILVAYVAYA

Oceal LETPVSAYLKLNFLLESVEGRGRYSAIGDPDLIWRCGIAPLQSLLTLAGLFGYLGYDTIRQVEAGPDPLLPEGQLLRPRVMVVFDALRQEILVAARPQVD

Eryli TETPVGAALKLGFLLESVEGRGRYSLLGDPDLVFRAAINSLDALIDVACLVGYFAYETIGQVEAPESELLPDMVFTRPTLLLVFDSLTDNLFIIAWPAIE

Psefl FDTPLSIYLKLNYLLESVQGWGRYSIIGPCRTVLRVGVTPLAFVVPTGGLVGYFGYDCVRYVEPNPDPIVPDILLMVSDAVVVFDNLAGKVHAIVDPAFE

Azovi FDTPLSIYLKLNYLLESVQGWGRYSIIGPARTVLRIGVSPLAFVVPDGGLVGYFGYDSVRYVEPNPDPLTPDILLMVSDAVVVFDNLAGKMHLIVDPAFE

Xanor LDTPLSVYLKLYYLFESVEGFGRYSIIGPARRVYSFGVSPLAEVVPQGGLVGWFGFECIQYIEDKADELTPDILLMLSEELAVFDNLKGRLYLIVDPAYV

Dechar LDTPLSIYLKLYYLLESVQGFGRYSIIGAAQTRIVVGVLPLEFIAPPGGLVGCFGYDTVRYVENKPDEITPDIGLLLSEEIAVVDNLSGKLTLIVEPAYQ

Niteu LDTPLSIYLKLYYLLESILGFGRYSVIGPAEIRLEASVIVLGFIVAPGGLAGYFSYDTIRYIEARPDTITPDILLLLSEELVVMDNLSGKLYLIIDPAYQ

Metfl LDTPLSLYLKLYYLLESVQGFGRYSIIGPARVRIEVGVLPLDFIIPPGGLAGYFGYDTIRYIEAKQDALVPDVLLMVSEEIAVVDNLSGKLYFIVDPAYT

Mothe TETPISLYLKFDFLLESVEGLGRYSLIGDPLLTFTASLSPFKALLPPGGLVGYLGYDMVRELEGPGNDLIPDTHLTLHRCYLVYDHILRTVRITCRGGYE

Nicta HLTPVLAYRCLPFLFESVEPVGRYSVVGQPSMEIVAEILPMTIPPRLGGWVGYFSYDTVRYVEAPEDDRLADIQLGLYEDVIVFDHVEKKAHVIHQLAYL

Camac HLTPVLAYRCLPFLYESVEPVGRYSVVGQPAMEIVAEIVPMTVPPQLGGWVGFFSYDTVRYVEAPQDDRLADIHLGLYDDVLVFDHVEKKVYVIHRLAYM

Catro HLTPVLAYRCLPFLFESVEPVGRYSVIGQPTMEIVAEVMPMVVPPQRGGWVGYFSYDTVRYVEAPVDDRLPDIHLGLYDDVIVFDHVEKKAYVIHRLAYK

Rutgr HLTPVLAYRCLPFLFESVEPIGRYSVVGQPAIEIVAEILPMDVPPQLGGWVGYFSYDTVRYVEAPTDDRLPDVHLGLYDDVIVFDHVEKKAFVIHRLAYN

Arab HLTPILAYRCLPFLFESVEPIGRYSVVGQPTIEIVAGVMPMMVPPQGGGWVGYFSYDTVRYVEAPEDDRLPDVNLGLYDDVIVFDHVEKKAYVIHRINFR

Oryz HLTPVLAYRCLPFLFESVEQVGRYSMVGHPVMEVVAEIMPMQIPPQQGGWVGFFSYDTVRYVEAPQDDRLPDVHLGLYDDVLVFDNVEKKVYVIHNLAFQ

Zeam HLTPVLAYRCLPFLFESVEQVGRYSMVGHPVMEIVADIMPMQIPPQQGGWVGFFSYDTVRYVEAPQDDRLPDVHLGLYDDVLVFDNVEKKVYVIHNVAYQ

Chlre HLTPVSAYRCLPFLLESVVNQGRYSFLGSPALEVVAQLLPMQLPPAAGGWVGYAGYDTEASDP------AYE

Cme DLTPVMAYRRLPFLFESVVGIGRFSYVGSPCMQIIAQIIPWVFMVAEGGWVGFGGYDTVRYSEAPRDDRLPDLHFALYQEVVIFDNVTKTVYIVVPLARA

Cloth METPISLFKRFCFLLESVEGWARYSIIGNPFLVVESYIIPVEIIGANGGAVGYFGYDLIRHYEVPEDDMLPECHFMFTDEVLVYDHLKQKIHIIVHVAYI

Alkme METPITLFKKLNYLLESVERKGRYSYVGDPFMTIKGHEIPLEIVTPFGGAVGFIGYDTIRNYEVNEEDIIPEIHLLLTKEVIVYDHLKHKIIILVVLSYE

Theret ELTPINIFYSLNFLLESANGWGRYSFIGDPYLSILSYKLILDEINSLGGAIGYASYDLIRLYEKNPDEIIPDVYFMFYKSFICYDHLKHRIYVVYYPEYE

Bacha AVTPIHIVQQLAFILESKDDWARFSFIGNPFVELIENSIKSCLQQPLGGAVGYMAYDAIETIEANARESAPLYHFLFCETLIVYDHTEKKLAVIYTERYK

Sachce DLTPHVAYLKLEFLLESAKTLDRYSFIGSPRKTIKTPETTFKVAPKLGGAIGYISYDCVRYFEPLKDVLLPEAYLMLCDTIIAFDNVFQRFQIIHNTGYQ

Cangl DLTPHVAYLKLEFLLESAKTLDRYSFIGSPRKIIKTPETQFKLAPKLGGAIGYISYDCIRYFEDLKDVLLPEAYLMMCDTIIAFDHVFQRFQIMHNTGYS

Yarli ALTPHTAYLKLPFLFESALGLSRYSFIGNPRKMIKTADVQYRQFPTLGGAIGYISYDCIKYFEDLKDVLVPESALMLYDTIVAFDHVYQRLQVITKVAYE

Asfum LLTPTLAYLKILFLYESAATIGRYSFVGEPYKVLKTPECQFRVAPPLGGAIGYVGYDCVRYFEPLRDVLIPESLFMMFKTIVAFDHFFQVIKVVTPIEYR

Schipo MLTPSVAYLKLYFILESVTQVSRYSFIGSPYRILMAGKTEVKTAPSFGGAVGYVSYDCIKYFEPLEDTLLPEAMFFMTDDLVAFDHAYQTVKIISCIAYE

Neucr LLTPSAIYLKLYFLLESATGVGRYSFIGNPRKVLETPVGQDKVLPKLGGAVGYISYDCIKYFEELKDNLIPEALFMLYDTMVAFDHFSSAFTVVTRLAYN

Ustma LLTPVTAYLKLRFLFESVTGIGRYSFIGDPLKIIRTPEGPYKYIPTFGGAIGYISYDCVQYFEELKDVVIPESVFMVADSLIVFDHVFSTVRCVSHLLYA

Pram LDTPVSVLLKLHFLLESVAPIARYSFIGGPLKVVATEQGQFRTVVPMGGAVGYCGFDAIRHFEAQKDVLVPESIYMFFDTVVIFDHVFHSLKIVSRLAYK

Psoj LDTPVSVLLKLHFLLESVAPIARYSFIGGPLKVVATPEGQFRTVVPMGGAVGYCGFDAIRHFEAQKDVLVPESIYMFFDTVVIFDHVFHSLKIVSRLAYK

Natph AVEPLEAYAALHFLLESAEKHARYSFVGDPAAVVTVATVTVATLDVPGGLVGFLAYDAVYDLWER-PDSFPDAQFMLTTKTLVFDDAAGTVSLVCLVLYD

Metac ACSPLELYGALYYLLESVEKSPLFEAICKMEEVCGPTNEVFDALGIEGGAIGYTAYDAIYDSWEKGFESIPDLQYLLVSKSFVLDHLTEEVYIVLFVVYE

Pyrab RVEPIKLYSVINFIFTIIGKEAKLTYISSPEFTVEIGLDKVSNESLKGGFMGYIAYDAVHNYIEE------PSVFGYYPWVFIYNHGKGELRFYYLRIVE

Theco PVDPLKLYSALMFMLRSAEKKARFTYISEPEFVVEVGIDRVSDEALKGGFVGYVSYDSVHSIIEE------PSVFGYYPWTFIYDHSTGALSFFYLRLVE

Nost2 LLTSSYEYPGRPLQLTTRENAFTISSLNRGQVLLPTFAQQRSKQEILLGLYGAFGYDLVFQFEKIARPAQRDLVLYLPDELIVVDYYLQKAYRHQFALPR

Anava LLTSSYEYPGRPLQLTTRENAFTISSLNRGQVLLPTFAQQRSKQEILLGLYGAFGYDLVFQFEKIARPAQRDLVLYLPDELIVVDYYLQKAYRHQFALPR

Nost1 LLTSSYEYPGRPVELSTSGNTFTLTALNRGYVLLPVFKSERSKQEILLGLYGAFGYDLVFQFECLERPQQRDLVLYLPDELIVVDYYQQQAFRLEFILPR

Legpn VFASSFEYPGRPLVMICQSNTICFEALNRGEILLSFYTTERSHQLLLLGLYGAFGFDLIYQFECKPRNEQRDMVLYLPDEIYIVNHRKEEAFVRRFQLSR

Synwol LFSSSYDYPGRPLELRAQGRDFCLEALKSGLKLLFGYGRERSRQVIRLGLYGAFGYDLVYQFEKRPRPEQCDLLLYLPDELTVVDHRMARAYQLNFRIPQ

Bruab VFSSNYEYPGRPVVITSRARTMRIEALNRGVILLRPLALERSRMAIVLGLYGAFGYDLAFQFDKLKRPDQRDLVLFIPDEIFVADHYAARAWVDRFRLDR

Brume VFSSNYEYPGRPVVITSRARTMRIEALNRGVILLRPLALERSRMAIVLGLYGAFGYDLAFQFDKLKRPDQRDLVLFIPDEIFVADHYAARAWVDRFRLDR

Brusu VFSSNYEYPGRPVVITSRARTMRIEALNRGVILLRPLALERSRMAIVLGLYGAFGYDLAFQFDKLKRPDQRDLVLFIPDEIFVADHYAARAWVDRFRLDR

Rhile VFSSNYEYPGRPLGISSFGRDVWIEAYNRGEVILGFTTVERSKMAVTLGFYGAFGYDIAFQFDKLTRPSQRDMVLYLPDEILVVDNYAAKAWVDRFEKAG

Rhiet VFSSNYEYPGRPLGISSFGREVWIEAYNRGEVLLGFTTVERSKMAVNLGLYGAFGYDIAFQFDKLTRPSQRDMVLYLPDEILVVDNYAAKAWIDRFAKAG

Sinme VFSSNYEYPGRPLAISSFGRSLWIEAYNRGEVLLALASVERSKMAVTLGLYGAFGYDLAFQFDKLSRPDQRDMVLFLPDEILVVDHYAAKAWIDRFAKAA

Simed VFSSNYEYPGRPLAISSFGRSLWIEAYNRGEVLLALASVERSKMAVTLGLYGAFGYDLAFQFDKLTRPEQRDMVLFLPDEILVVDHYAAKAWIDRFAKAA

Agrob VFSSNYEYPGRPLGISCFGRKMWIEAYNRGEVLLDFTATERSKIAIVIGLFGAFGYDLAFQFDSLARPEQRDMVLFLPDEILVVDHYSAKAWIDRFEKSS

Meslo VFSSNYEYPGRPLVISARGRAMRIEALNRGEALLPVGGLERSRVAITLGLYGAFGYDLSFQFDKLERKPQRDLVLFLPDEILVVDHYSAKAWTDRYSLPR

Auran VLFSNYEYPGRPLVISSDRRDMRIEALNRGRVLLGFDGNERSRMTIVLGLYGAFGYDLAYQFEVLERDARRDLVLYLPDELLEVDHYSASAFRTRFALEG

Fulpe VLFSNYEYPGRPLVLTSAGRRMVIEALNKGEILLSMAGHERSRIAIVCGLYGAFGYDLAFQFDSLERPAQRDLVLYMPDEVLEVDHYSASAVLSTFAVKG

Oceba LFSSNYEYPGRPVMIEARGRAMRIAALNRGRVLLAMRELERSRAAIVLGLYGAFGYDLAFQFEHLERPAQRDLVLFLADDILVADHYSASAVRYRFDLQG

Azobr LLSSGVEAPGRPLAATARGRTLRIDALNRGRVLLPAAGLERSRQAVLLGLYGAFAYDLAFQFERLERPDQRDLVLYLPDRLVALDPVAGLARLVEFALER

Nitwi VLSSGTTVPGRPLKIETTGFNFTIEAANRGEVLIAFGEPQRTRRALVLGLYGAFAYDLVFQIEKRAREPQRDIVLFIPDRLLAYDRAAGHGVVLSFAQPR

Nitham VLSSGTTVPGRPLKIETTGLNFAIEATNRGEVLIAFGEPQRTRRALVLGLYGAFAYDLVFQMEKRAREAQRDIVLFVPDRLLAYDRATGRGVTLNFALAT

Braja VLSSGTTVPGRPLKLETTGVNFKLEALNRGQVLIAFAEPQRTRRDLVLGLFGAFAYDLVFQIEKRARESQRDIVLYVPDRLLAYDRATGRGVVLSFTLPR

Rhopa MLSSGTTVPGRPLRLVSTGDNFALTALNRGEVLLAFAEPQRTRRALVLGLFGAFAYDLVFQFEKRAREAQRDIVLYVPDRLLAYDRATGRGVHLAFSLSH

Rhoru VIASTYEYPGRPLVLEGKGREAHLLALNRGRALLPAAGAERSRQALAFGLVGAFGYDLGLAFEARPRSAHRDLVLYLPDRLLIDDPEAGGLAERLITLAR

Phatr LLTSSYEFPGRPLEISGRGQACTITALNRGRVLMPAETLERSRQALVLGLYGAFGYDLTFQFEAQERDSQRDLLLYLPDTMLVVDQDKRDAWRVCFNIPR

Thaps VLTSGYEFPGRPLEISGNGNKCKITALNRGQVLLQPLNLERSRQALVLGLYGSFGYDLTFQFEAQERDPQRDLLLYLADELVVVDQSRHSAWTVSFSLPH

Thefu VLSSGMEYPGRPLEVVARGRTIGATALNRGRVLLPAAAHERSRRALVLGLYGAFGYDLAFQFEVLPRDPDRDLVLHLPDEIIVHDRKREICQRYSFTLPR

Solus YLSSGYEYPGRPLEIIAYDRRFELRPLNRGREILRLHKLERSKQALILGLVGAFGYDLLFQFEKLPRHGHKDLHLFLCDDIWYMDRKKEQVERFQFQLPR

QACFCASVQKAKEYIKAGDIFQVVISQRLSTQYPFSLYRSLRQINPSPYMYFNFWQIIGSSPEVMVKAEVATVRPIAGTRPRGKTTKEDAELAADLLQDP

QACFCASVQKAKEYIKAGDIFQVVISQRLSTQYPFSLYRSLRQINPSPYMYFNFWQIIGSSPEVMVKAEIATVRPIAGTRPRGKTTKEDAELAADLLQDP

NACFLEEVAIAKDYITAGDIFQVVLSQRLSTIYPFKLYRSLRLINPSPYMYYNFWQIIGSSPEVMVKADMATVRPIAGTRPRGKTHPEDEQLAEELLNDP

LAGYCGAVERAREYIRAGDIFQVVLAQRFSTPFSFDLYRMLRAINPSPYMYYQFFQLIGSSPEVMVRLDLATVRPIAGTRRRGQSEAEDRWLEKDLLADP

QACYCQAVERAKDYIRAGDIFQVVLSQRFTVSLPFRLYRSLRLINPSPYMFLQFLCLIGSSPEVMVKLSIATVRPIAGTRPRGTTPLEDRQLEQELLADP

QAAYKAAVEKAKDYIRAGDIFQVVPAQRWTQEFPFALYRSLRRTNPSPFMYFNFFQVVGASPEILVRVFEVTIRPIAGTRPRGATPEEDRANEADLLADK

QAAYLQAVEKAKDYIRAGDIFQVVPSQRWTQEFPFALYRSLRRTNPSPFMFFNFFQVIGASPEILVRVFEVTIRPIAGTRPRGATPAEDDANEADLLADQ

AARYRQNVETAKDYIRAGDIFQVVPSQRFSAPYPLSLYRSLRRTNPSPFLFFNLFAIVGSSPEILVRLRTVTVRPIAGTRPRGNSEEADQAHEADLRADP

RAGYSSMVAKAKEYITAGDIFQVVLAQRFTCPFPLALYRALRRVNPSPFLFLDLFAVVGSSPEILVRLREVTIRPIAGTRPRGATPEADREAEASLLADA

AGQYERAVDTIKEYILAGDCMQVVPSQRMSIDFPIDLYRALRCFNPTPYMFFNFFHVVGSSPEVLVRVELITVRPIAGTRPRGASEEADLALEQDLLSDA

RGRFERTVARIKDYILAGDCMQVVISQRMSIPFPIDLYRALRCFNPTPYMFFDFFHVVGSSPEVLVRVELVTVRPIAGTRPRGASEEADLALERDLLSDA

RANYHAVVRKAQEYVRAGDIFQVVPSQRLRVPFPVDVYRALRALNPSPYMFLDVTQVVGSSPEILARLRVVTVRPIAGTRPRGATPELDKALEEELLADP

KARFKQAVLKAKAYITEGDIMQVVLSQRMTKPFPLALYRTLRSLNPSPYMYFDFFHVVGASPEILVRLERVTVRPIAGTRKRGASPEEDAALAVELLADE

TSKFIAAVEKAKHYILEGDIMQVVLSQRTSKPYPLALYRALRSLNPSPYMNYHLFHIVGASPEILVRLETVTVRPIAGTRPRGQDTQADLALAADLLADP

QGIFKEAVAKSKQYIFDGDIMQVVLSQRMAKPFPLSLYRALRSLNPSPYMYYDMHHVVGASPEILVRLETVTSRPIAGTRPRGKTREEDIALAEELLADP

EAVFTGMVTKAKEYIAAGDIFQVVLSQRLSLPFALVVYRHLRALNPSPYMYLNFVQLVGASPEMLVRVETIDYRPIAGTRRRGRTAAEDRALAAELLASE

DGKYKNAVLQAKEHIAAGDIFQIVLSQRFERRTPFEVYRALRIVNPSPYMYIQACILVASSPEILTRVKRIVNRPLAGTSRRGKTPDEDVMLEMQMLKDE

DGMYKNAVLQAKDHILAGDIFQIVLSQRFERRTPFEVYRALRVVNPSPYMYLQACILVASSPEILTRVNKIVNRPLAGTTRRGRTPHEDEMLKKQLLNDK

DGMYEKAVLQAKEHILAGDIFQLVLSQRFERRTPFEVYRALRIVNPSPYMYLQACILVASSPEILTRVKTITNRPLAGTTRRGKTPKEDYMLEQQLLNDE

DGMYKEAVLEAKEHILAGDIFQIVLSQRFERRTPFEIYRSLRIVNPSPYMYLQACILVASSPEILTRVKKITNRPLAGTIRRGKTRKEDLVFEKELLNDE

EGMYKEAVVEAKEHILAGDIFQIVLSQRFERRTPFEIYRALRIVNPSPYMYLQVCILVASSPEILLRSKKITNRPLAGTVRRGKTPKEDLMLEKELLSDE

DGKYKNAVMQAKEHIMAGDIFQIVLSQRFERRTPFEVYRALRIVNPSPYMYVQACVLVASSPEILTRVRKIINRPLAGTVRRGKTEKEDEMQEQQLLSDE

DGRYKNAVLQAKEHIMAGDIFQIVLSQRFERRTPFEVYRALRIVNPSPYMYVQACVLVASSPEILTRVSKIINRPLAGTVRRGKTEKEDQMQEQQLLSDE

AGQFMDAVGATKEHIQAGDIFQLVLSQRFERRTPFEIYRALRVVNPSPYMYMQACHIAD------LLADQ

HGWFFHALERISEYIYLGDTFQTVFSQRFERDTPFQVYRALRIVNPSPYMYMRACILISSSPEILCKTRKVWNRPLAGTRPRGSSPEEDARLEQELLADE

SAVFCRNVLKAKQYIRDGDIFQVVLSQRLCVETPFNIYRALRVINPSPYMYLKFYRIIGSSPEMLVRVEIVETCPIAGTRKRGRTKEEDEALEKELLSDE

AGVFMGKVLRAKEYIQNGDIFQVVLSQRLQVEVPFQIYRNLRSISPSPYMYINFYQVVGASPELLVKVKKVETCPIAGTRPRGKTSQEDERLAKELLEDE

EVLFCKIVEKAKEYIEKGDIFQVVLSQRLKAAVPFEIYRRLRSKNPSPYLYIDFFQLLGSSPESLVSVFKVTTNPIAGTRRRGKDEEEDLRLKEELLKDE

CAVFLRDVERIKEYIRAGDVFQAVLSQRFERELALDVYRVLRMINPSPYLYIKFVEVVGSSPERLVQVQHVEIHPIAGTRKRGATREEDEALAKELLADE

AAAYENHVSTLKKHIKKGDIIQGVPSQRVARPTPFNIYRHLRTVNPSPYLYIDCFQIIGASPELLCKSDRVITHPIAGTVKRGATTEEDDALADQLRGSL

HAVYENHVGNLIEHIKKGDIIQAVPSQRVARPTPFNIYRHLRTVNPSPYLYIDCFQIIGASPELLCKSDKVITHPIAGTIKRGATTEEDDELGNQLRNSL

DAKYEQHVTTLKERIKKGDIIQAVPSQRVARPTPFNVYRHLRTVNPSPYLYIDYFQLVGASPELLVKIERIVSHPIAGTVKRGATPAEDNQLAQELLNSE

NGQYERHVTRLKEHISKGDIFQTVPSQRLSRPTPFNLFRHLRTVNPSPYLYIDCFQLVGASPELLVKEERIITHPIAGTVKRGKSPEEDDALAAELRGSL

AAVYKAFVSNLKEHIFNGDIFQAVPSQRIARRTPFNLYRHLRTVNPSPYMYIHCFDIIGASPELLVKSERIINHPIAGTVPRGKTKEEDEAYAKDLLASV

AACYETFVTELKKNIVKGDIIQAVPSQRFSRSTPFNIYRTLRTLNPSPYLFLSCFHIVGASPECLMKTDRIVNHAIAGTIKRGLNSEEDDELAAVLLAST

KANYEAFVTSLRKNIVKGDIIQAVPSQRLKRRTAFNCYRHLRQVNPSPYMYVDCCQLVGASPETLCKIEKVAVHAIAGTVKRGANDVEDAELASQLASSV

EATYMGFVEKLKEHIVDGDIFQAVPSQRLAVPKPLDLYRQMRAINPSPYMYLEMFQIVGASPEMLVKVDIVETHPIAGTRHRGANDEEDAALVAELLADE

EATYMGFVDKLKEHIVDGDIFQAVPSQRLAVPKPLDLYRQMRAINPSPYMFLDMFQIVGASPEMLVKVDVVETHPIAGTRHRGANDEEDEALVKDLLADE

DLTYEDAVETAKEHVLDGDIYQGVISRTRELDGTLGLYRALRDINPSPYMYLLDLAVVGASPETLVSVREVMSNPIAGTCPRGESPVEDRRLAGEMLADD

EALFEESVLQAKEHIFAGDIFQVVLSRKCEFKMPFELYIQLRAINPSPYMYIFELAIVGASPETLLTVHTVIINPIAGTCPRGKSEAEDETLASHMLNDE

RAKFMEMVEKGKEYIFAGDVFQVVLSREYVLRSPVELYRKLISINPSPYTFILEKILVGASPETMGSVETVRINPIAGTAPRGKTPEEDEEIRRRLLSDE

RARFVEIVRAGKEYIYSGDVFQVVLSREYRVRTALEIYKRLVELNPSPYTFILEKTVVGASPETMGSVETFKINPIAGTAPRGRTGEEDRELEKALLSDE

TGQYANLVEQALDYFRRGDLFEVVPSQNFFTEQPSQLFQTLRQINPSPYGLLNLEYLIGASPEMFVRVDRVETCPISGTIRRGEDALGDAVQIRQLLNSH

TGQYANLVEQALDYFRRGDLFEVVPSQNFFTEQPSQLFQTLRQINPSPYGLLNLEYLIGASPEMFVRVDRVETCPISGTIRRGEDALGDAVQIRQLLNSH

TGEYAKLVEFALDYFRRGDLFEVVPSQNFFTEAPSQLFETLKQINPSPYGIFNLEYIIGASPEMFVRVERVETCPISGTITRGHDAIDDAVQIRQLLNSH

EGKYAEVVKKAKEKFSCGDLFEVVPSQTFYTAEPSILFRQMREINPSPYGFVNLEYLVGASPEMYVRVQRVETCPISGTIKRGADAIEDAHNIQTLLDSE

YSQYTALVEKAKQSFQRGDLFEVVPSYSLLEPIPSEAFKNLKRINPSPYGIINLEQLVGASPEMYVRVERVETCPISGTIKRGKDAIEDALQIRKLLNSG

ATPYARLVERAKESFKRGDLFEVVPGQTFYEHTPSEIFRRLKSINPSPYSFINLEYLVGASPEMFVRVNRIETCPISGTIKRGEDAISDSEQILKLLNSK

ATPYARLVERAKESFKRGDLFEVVPGQTFYEHTPSEIFRRLKSINPSPYSFINLEYLVGASPEMFVRVNRIETCPISGTIKRGEDAISDSEQILKLLNSK

ATPYARLVERAKESFKRGDLFEVVPGQTFYEHTPSEIFRRLKSINPSPYSFINLEYLVGASPEMFVRVNRIETCPISGTIKRGEDAISDSEQILKLLNSK

DIAYAELVTKAKESFRKGDLFEVVPGQKFMEDSPSDISKRLKAINPSPYSFINLEYLVGASPEMFVRVSRIETCPISGTIKRGDDPIADSEQILKLLNSK

DIAYADLVVKAKESFRKGDLFEVVPGQKFMEESPSDISKRLKAINPSPYSFINLEYLVGASPEMFVRVSRIETCPISGTIKRGDDPIADSEQILKLLNSK

DIAYAELVVKAKESFRRGDLFEVVPGQKFYEESPSEISNRLKAINPSPYSFINLEYLVGASPEMFVRVSRIETCPISGTIKRGDDPIADSEQILKLLNSK

DIAYAELVVKAKESFRRGDLFEVVPGQKFYEDSPSEISNRLKAINPSPYSFINLEYLVGASPEMFVRVSRIETCPISGTIKRGDDPIADSEQILKLLNSK

DITYSELVVKAKESFRRGDLFEVVPGQKFMEESPSAISRRLKAINPSPYSFINLEYLVGASPEMFVRVSRIETCPISGTIKRGDDPIADSEQILKLLNSK

DAIYANLVRRAMDSFKRGDLFEVVPGQMFYEETPSDISRKLKSINPSPYSFINLEYLIGASPEMFVRVNRVETCPISGTIKRGDDAISDSEQILKLLNSK

GGPYADLVRSAKEKFMRGDLFEVVPGQVFYEEHPSEISRRLKAVNPSPYSFVNLEFLIGASPEMFVRVNRVETCPISGTIKRGDDAIADSEQILKLLNSK

GGSYADLVRSAKERFRRGDLFEVVPGQVFYEEHPSEISRRLKAVNPSPYSFVNLEFLVGASPEMFVRVARVETCPISGTVPRGQDAIGDAEQVLKLLNSK

GGTYAELVREAKKSFRRGDLFEVVPGQVFYEKDPSAIFKRLKKINPAPFGIMNLEYLVGASPEMFVRVTRVETCPISGTIRRGADAIEDSEQILKLLQSK

AGRYQRVVETAKAAFRRGDLFEVVPGQTFAEADPSAVFRRLRAANPAPYEFVNLEFLVAASPEMYVRVARVETCPISGTVARGADALGDSSQILRLLTSA

DTPYQATVETARAAFARGDLFEAVPGQLFAEERPAEVFQRLCRINPSPYGLVNLEFLVSASPEMFVRSDRIETCPISGTIARGADAIGDAEQIRELLNSE

DTGYQATVETARAAFARGDLFEAVPGQLFAEERPAEVFQRLCRINPSPYGLVNLEFLVSASPEMFVRSDRIETCPISGTIARGADAIGDAEQIRELLNSE

ETAYQATVETARAAFARGDLFEAVPGQLFAEDRPAEVFQRLCVINPSPYGLMNLEFLVSASPEMFVRSDRVETCPISGTIARGTDAIGDAEQIRQLLNSE

DTPYQATVEVARAAFARGDLFEAVPGQLFAEERPAEVFQRLCRINPSPYGLMNLEFLVAASPEMFVRSDRIETCPISGTIARGVDAIGDAEQIRQLLNSE

ETAYGAIVRGLKEAFAAGDLFEAVPSRALRRAEPSRLYRRLRAANPAPYLLANLEHLIGASPEMFVRVGRVETCPISGTIARGPDALGDAEAIRTLLNST

TGTFANSVLKAKEEFKVGNLFEAVLSQTFRETVPSTLFRRLRARNPAPYGLINLEYLVGASPEMFVRCERVETCPISGTVARGADALEDAQRVKSLMMNA

EGNFKESVAKAKHEFKMGNLFEAVLSQTFRKEKPSKLFRRLRAKNPSPYGFFNLEYLVGASPEMFVRCERVETCPISGTVARGVDALEDAARVKSLIMNL

HTEYARIVAEAKERFRRGDLFEVVPSHRLYAASPARFYERLRERNPAPYEFLNLEYLVGASPEMFVRVTRVETCPISGTIKRGADAVGDAENIKELLSSA

EGEYMANVEKVREGMRRGDYYEVVLRQTFRTSGASELFRRVQTASPSPYELLQFEQLVGASPEMFVRVERVETCPISGTAQRTGDPLRDADNIRELLVST

KEIAEHVMLVDLGRNDLGRVCASGTVKVDELMVVERYSHVMHIVSNVVGKLAANKNAWDLLKACFPAGTVSGAPKIRAMEIINELEPSRRGVYSGVYGYY

KEIAEHVMLVDLGRNDLGRVCASGTVKVDELMVVERYSHVMHIVSNVVGKLAANKNAWDLLKACFPAGTVSGAPKIRAMEIINELEPSRRGVYSGVYGYY

KEIAEHVMLVDLGRNDLGRVCVQGSVKVNELMVIERYSHVMHIVSNVVGELASDKTAWDLLKACFPAGTVSGAPKIRAMEIINELEPERRGPYSGVYGYY

KEVAEHVMLVDLGRNDLGRVCKPGSVRVDELLTVERYSHVMHIVSNVTGELASGRGAWDLLRATFPAGTVSGAPKIRAMEIIHALEPFRRGPYAGAYGYY

KERAEHVMLVDLARNDLGRVCQLGSVQVDDLMQIERYSHVMHIVSNVVGRLDPQYSAWDLLRATFPAGTVTGAPKIRAMQIIHELEGCRRGPYAGAYGYY

KELAEHLMLLDLGRNDAGRVSKIGTVRPTEKFIIERYSHVMHIVSNVVGELDPDKDALDAFFAGMPAGTVSGAPKVRAMEIIDELEPEKRGIYGGGVGYF

KELAEHLMLLDLGRNDTGKVSKIGSVRPTEQFIVERYSHVMHIVSNVVGELADDQDALSAFFAGMPAGTVSGAPKVRAMQIIDELEPEKRGVYGGGCGYF

KERAEHLMLLDLGRNDVGRVAQAGTVRVTEREIIERYSHVMHIVSNVEGQLADGEDAISALMAGFPAGTVSGAPKVRAMEIIDELEPHRRGIYAGAVGYF

KERAEHLMLLDLGRNDVGRVAAKGTVEVTDSFTVERYSHVMHIVSNVVGQLDPAKDALDALFAGFPAGTVSGAPKIRACEIIAELEPETRGPYAGGVGYF

KEIAEHLMLIDLGRNDTGRVSEIGSVKLTEKMVIERYSNVMHIVSNVTGQLKAGLTAMDALRAILPAGTLSGAPKIRAMEIIDELEPVKRGIYGGAVGYF

KELAEHLMLIDLGRNDVGRVADTGSVKLTEKMVIERYSNVMHIVSNVTGHLRQGLTAMDALRAILPAGTLSGAPKVRAMEIIDELEPVKRGVYGGAVGYL

KERAEHVMLIDLGRNDVGRVAEPGTVKVGEQFVIERYSHVMHIVSEVTGTLKAGLNYSDVLRATFPAGTVSGAPKIRALEIIRELEPVKRNVYSGAVGYI

KERAEHTQLLDLGRNDCGRVARVGSVKLTENMIVERYSHVMHIVSNVEGKLQPGLDALDVLRATFPAGTVSGAPKVRAMEIIDELEPVKRGIYAGSVGYL

KERAEHIMLMDLGRNDIGRVAQTGSVKVTENMQIEYYSHVMHIVSNVEGKLKSGLNAIDVLRATFPAGTVSGAPKVRAMEIIDELEISKRGIYAGAVGYL

KEIAEHVQLMDLGRNDVGRVAQVGSVAVTEKMVIERYSHVMHIVSNVEGRLKSGLHAIDVLKATFPAGTLSGAPKVRAMEIIDELEPSKRGIYGGAVGYL

KERAEHLMLLDLGRNDVGRIAVPGSLQVTRQMVVEYYSHVMHLVSSITARLAPGRSALDALLACFPAGTVTGAPKVRAMEIITELEPVNRGPYAGAVGYL

KQRAEHIMLVDLGRNDVGKVSKPGSVNVEKLMSVERYSHVMHISSTVSGELLDHLTCWDALRAALPVGTVSGAPKVKAMELIDQLEVARRGPYSGGFGGI

KQCAEHIMLVDLGRNDVGKVSKSGSVNVERLMNVERYSHVMHISSTVTGELLDHLTCWDALRAALPVGTVSGAPKVKAMELIDQLEATRRGPYSGGFGGI

KQCAEHIMLVDLGRNDVGKVSKPGSEKVEKLMNIEPYSHVMHISSTVTGELLDNLTSWDVLRAALPVGTVSGAPKVKAMELIDELEVTRRGPYSGGFGGI

KQCAEHIMLVDLGRNDVGKVSEPGSVKVEKLMNIEHYSHVMHISSTVTGELLDHLTSWDALRAALPVGTVSGAPKVKAMEIIDKLEVTRRGPYGGGFGGI

KQCAEHIMLVDLGRNDVGKVSKPGSVEVKKLKDIEWFSHVMHISSTVVGELLDHLTSWDALRAVLPVGTVSGAPKVKAMELIDELEVTRRGPYSGGFGGI

KQCAEHIMLVDLGRNDVGKVSKPGSVKVEKLMNIERYSHVMHISSTVSGELDDHLQSWDALRAALPVGTVSGAPKVKAMELIDELEVTRRGPYSGGLGGI

KQCAEHIMLVDLGRNDVGKVSKPGSVKVEKLMNIERYSHVMHISSTVSGQLDDHLQSWDALRAALPVGTVSGAPKVKAMELIDKLEVTRRGPYSGGLGGI

KEIAEHVMLVDLGRNDVGKVAVSGSVVVQKLMEVERYSHVMHISSTVTGELLPQLDSWDALRAALPAGTVSGAPKVRAMQIIDELEVNKRGPYGGGVGHV

KDRAEHVMLVDLGRNDVGRIAELGSVQVEKLFEIERYSHVMHISSTVTGKLRAELSCWDALRATLPAGTISGAPKIRSMQIIDELEPTKRGPYGGGIGYV

KEIAEHVMLVDLGRNDIGRVSKFGTVAVKNLMHIERYSHVMHVVTNVQGEIREDKTPFDALMSILPAGTLSGAPKVRAMEIIDELETVKRGPYGGAIGYL

KERAEHLMLVDLARNDIGKIANFGTVELKEYMEVYYYSHVMHIVSIVTGSLQEKKDMYDALISCLPAGTLSGAPKIRAMEIIDELENKKRGIYGGAVGYF

KERAEHVMLVDLGRNDIGKVSEFGSVKIERFMEVDFYSHVMHIVSTVSGKLKRGLTAFDALIACLPAGTVSGTPKIRAMEIIDELENVRRSFYAGAVGYF

KERAEHYMLVDLARNDIGRIAEYGTVKTPTLLEIGKFSHVMHIISKVTGELKQTLHPLDALRYGFPAGTVSGAPKIRAMEILNELEPTKRGIYAGAIAYL

KDRAEHVMLVDLARNDINRICDPLTTSVDKLLTIQKFSHVQHLVSQVSGVLRPEKTRFDAFRSIFPAGTVSGAPKVRAMELIAELEGERRGVYAGAVGHW

KDRAEHVMLVDLARNDINRVCDPKTTNVDKLLTIQKFSHVQHLVSQVSGTLRPDKTRFDAFRSIFPAGTVSGAPKVKAMELISELEGERRGVYAGAVGNW

KDRAEHVMLVDLARNDVNRVCDPRSTSVDRLLGVETFSHVQHLVSQVSGVLRPDQTRFDAFRSIFPAGTVSGAPKVKAMELVGELEKEKRGVYAGAVGSF

KDRAEHVMLVDLARNDVNRVCDPTTTQVDRLMVVEKFSHVQHLVSQVSGILRPDKTRFDAFRSIFPAGTVSGAPKVRAMQLIAELEGEKRGVYAGAVGYF

KDRAEHVMLVDLARNDVSRVCDLDTTSVDKLMTIEKFSHVQHLVSQVSGVLRPDKTRFDAFRSIFPAGTVSGSPKVRAIQLVYGLEKEKRGIYAGAVGRW

KDRAEHVMLVDLARNDVNRVCHPSTVKVDRLMRIDRFSHVQHITSEVSGLLRPECTRWDALRSIFPAGTVSGAPKIRAMELIYDLEKEKRGIYAGAAGWF

KDQAEHVMLVDLARNDINRVCDPATTQVESFMNVEKFSHVMHLTSRITGQLRAGKSRFDALRSIFPAGTVSGAPKIRAIELVSELEQEKRGVYAGAVGRI

KERAEHIMLVDLGRNDVGRVAKPGSVRVEHLMQIEKYSHVMHIVSVVKGDLREDRTVYDAYRAMFPAGTLSGAPKVRAMELICSLETERRGVYSGSVGYF

KERAEHIMLVDLGRNDVGRVAKPGSVRVERLMQIEKYSHVMHIVSVVKGDLREDRTVYDAYRAMFPAGTLSGAPKVRAMELICSLETERRGVYSGSVGYF

KERAEHTMLVDLARNDVRRVSEAGSVRVEEFMNVLKYSHVQHIESTVTGRLREDCDAFDATRASFPAGTLSGAPKIRAMEIIDDLERTPRGVYGGGVGYY

KERAEHVMLVDLGRNDVRMVSESGSVKVSGFMKVLKYSHVQHIESTVSGTLRPECDQFDAFRAVFPAGTLSGAPKIRAMEIISEREAVPRGIYGGGVGYY

KERAEHVMLVDLARNDVRKVSKPGSVKLVRFFDVIKYSHVQHIESEVIGELADDKDMFDAIEASFPAGTLTGAPKIRAMEIIDELEKSRRRVYGGAIGYF

KERAEHVMLVDLARNDVRRVSKPGSVRLTRFFDVLKYSHVQHIESEVVGELDEGKNAFDAMEAAFPAGTLTGAPKIRAMEIIDELERSRRKVYGGAVGYF

KDEAELTMCTDVDRNDKSRICEPGSVRVIGRRQIELYSHLIHTVDHVEGILRPEFDALDAFLSHTWAVTVTGAPKRAAMQFIEQHERSARRWYGGAVGYL

KDEAELTMCTDVDRNDKSRICEPGSVRVIGRRQIELYSHLIHTVDHVEGILRPEFDALDAFLSHTWAVTVTGAPKRAAMQFIEQHERSARRWYGGAVGYL

KDEAELTMCTDVDRNDKSRICEPGSVKVIGRRQIELYSHLIHTVDHVEGILRPEFDALDAFLSHTWAVTVTGAPKRAAIQFIEKNERSVRRWYGGAVGYL

KEESELTMCTDVDRNDKSRICEAGSVKVIGRRQIEMYSRLIHTVDHVEGVLRDGFDAVDAFLTHMWVVTVTGAPKIWAMNFIEQHEKSPRKWYAGAVGWF

KDEAELTMCTDVDRNDKSRICEPGSVKVIGRRQIELYSHLIHTVDHVEGILRPDFDALDAFLTHMWAVTVTGAPKRAAIKWLEENEESPRGWYGGAVGYL

KDESELTMCSDVDRNDKSRVCEPGSVRVIGRRQIEMYSRLIHTVDHIEGRLRDGMDAFDGFLSHAWAVTVTGAPKLWAMRFLEENERSPRAWYGGAIGMM

KDESELTMCSDVDRNDKSRVCEPGSVRVIGRRQIEMYSRLIHTVDHIEGRLRDGMDAFDGFLSHAWAVTVTGAPKLWAMRFLEENERSPRAWYGGAIGMM

KDESELTMCSDVDRNDKSRVCEPGSVRVIGRRQIEMYSRLIHTVDHIEGRLRDGMDAFDGFLSHAWAVTVTGAPKLWAMRFLEENERSPRAWYGGAIGMM

KDESELTMCSDVDRNDKSRVCEPGSVKVIGRRQIEMYSRLIHTVDHIEGRLRDDMDAFDGFLSHAWAVTVTGAPKLWAMRFIESHEKSPRAWYGGAIGMV

KDESELTMCSDVDRNDKSRVCEPGSVKVIGRRQIEMYSRLIHTVDHIEGRLRDDMDAFDGFLSHAWAVTVTGAPKLWAMRFIESHEKSPRAWYGGAIGMV

KDESELTMCSDVDRNDKSRVCVPGSVKVIGRRQIEMYSRLIHTVDHIEGRLRDDMDAFDGFLSHAWAVTVTGAPKLWAMRFIESHEKSPRAWYGGAIGMV

KDESELTMCSDVDRNDKSRVCVPGSVKVIGRRQIEMYSRLIHTVDHIEGRLRDDMDAFDGFLSHAWAVTVTGAPKLWAMRFIESHEKSPRAWYGGAIGMV

KDESELTMCSDVDRNDKSRVCEPGSVKVIGRRQIEMYSRLIHTVDHIEGRLRDDMDAFDGFLSHAWAVTVTGAPKLWAMRFIEGHEKSPRAWYGGAIGMV

KDESELTMCSDVDRNDKSRVCEPGSVRVIGRRQIEMYSRLIHTVDHIEGRLREGMDAFDAFLSHAWAVTVTGAPKLWAMRFIEQNEKSPRAWYGGAIGMV

KDESELTMCSDVDRNDKSRVCDPGSVRVIGRRQIEMYSRLIHTVDHIEGRLRDGMDAFDAFLSHAWAVTVTGAPKLWAMRFIENNEKSPRAWYGGAVGMV

KDESELTMCSDVDRNDKSRVCDPGSVKVIGRRQIEMYSRLIHTVDHITGVLRDGMDAFDAFLSHAWAVTVTGAPKLWAMRFIEKHEKSPRAWYGGAVGMV

KDESELTMCSDVDRNDKSRVCDPGSVRVIGRRQIEMYSRLIHTVDHIEGRLREGMDAFDAFLSHAWAVTVTGAPKLWAMRFIERHEKSTRFWYGGAVGAM

KDAAELTMCTDVDRNDKARVCEPGSVRVIGRRMIELYSRLIHTVDHVEGRLRPGLDALDAFLTHTWAVNGTGAPKRWAMQFLEDTEQSPRRWYGGAFGRL

KDEFELNMCTDVDRNDKARICMPGTIKVLARRQIETYSKLFHTVDHVEGILRSGFDALDGFLTHAWAVTVTGAPKKWAIQFVEDNERSTRRWYAGAFGVV

KDEFELNMCTDVDRNDKARICMPGTIKVLARRQIETYSKLFHTVDHVEGILRPGFDALDAFLTHAWAVTVTGAPKKWAIQFVEDNERSSRRWYAGAFGVV

KDEFELNMCTDVDRNDKARVCVPGTIKVLARRQIETYSKLFHTVDHVEGMLRPGFDALDAFLTHAWAVTVTGAPKLWAMQFVEDHERSPRRWYAGAIGAV

KDEFELNMCTDVDRNDKARVCVPGTIKVLARRQIETYSKLFHTVDHVEGMLRPGFDALDAFLTHAWAVTVTGAPKLWAMQFVEDHERSSRRWYAGAIGCV

KDEAELTMCTDVDRNDKARVCVAGSVTVIGRRQIELYSRLIHTVDHVEGRLRPELDALDAFLSHCWAVTVTGAPKRAAMAAVEAVERAPRAWYGGAIGRL

KEESELTMCTDVDRNDKSRICEPGSVQVIGRRQIEMYSRLIHTVDHVEGYLRPEFDALDAFLCHTWAVTVTGAPKTWAIQFVEDNERSPRCWYGGAVGMV

KEESELTMCTDVDRNDKSRICEPGSVKVIGRRQIEMYSRLIHTVDHVEGYLRPEFDALDAFLCHTWAVTVTGAPKTWAIRFVEENERSPRCWYGGAVGLV

KEESELTMCTDVDRNDKSRVCVPGSVRVIGRRQIEMYSRLIHTVDHIEGILRPELDAIDAFLTHMWAVTVTGAPKTWAMRFIEQHESSPRRWYGGAVGVI

KEESELTMCTDVDRNDKSRVCEPGTVKVIGRRLIESYAGVFHTVDHVEGILQEGFDALDAFLSHMWAVTVIGAPKKAAAQTVEALERNARGWYGGAVGMI

DFEGQLNSAIAIRTMVVRVTVQAGAGLVADSDPEKEYEETLNKARGLLLAI

DFEGQLNSAIAIRTMVVRVTVQAGAGLVADSEPEKEYEETLNKARGLLLAI

DFEGQLNTAIAIRTMVVQVSVQTGAGIVADSDPQKEYEETLNKARGLLEAI

SFDGQLNTAITIRTLVVHANIQAGAGLVADSVPETEYEETLNKARGMLETI

DFSGQLNTAITIRTLLVHVSLQAGAGIVADSDPEREYQECLNKARGMLMAV

SAGGDMDMCIALRTAIVKLYIQAGGGVVYDSDPEAEFMETVHKSNAIRRAA

SAGGDMDMCIALRTAVLQLYIQAGGGVVYDSDPEAEYQETVHKSNAIRKAA

GANGDMDMAIALRTAIVKMHVQAGAGVVLDSDPESEHQETVNKARALFRAA

APDGSFDSCIVLRTAVLKMHVQAGAGIVADSDPAYEQRECEAKAGALIAAA

AWNGNMDTAIAIRTAVIKLHVQAGGGIVADSVPTLEWEETLNKRRAMFRAV

AWNGNMDTAIAIRTAVIKLHVQAGAGIVADSVPALEWEETLNKRRAMFRAV

GWHGDADTAIAIRTAVIQLYVQAGGGVVYDSDPDLEWQETMNKGRALFRAV

GFNGDMDVAIAIRTAVLKLYVQAGAGIVADSDPNSEWTETLNKARAVLRAA

EFNGDMDLAIAIRTGLIKLHVQAGAGIVADSVPQSEWTETCNKARAVLRAA

GFNGDMDLAIAIRTGVIKLYSQAGAGIVADSIPENEWIETQNKARAVLRAA

GLHGNLDTCIAIRTIVFAAFIQAGAGIVADSDPEAEYEETLNKARALLQVL

SFSGDMDIALALRTMVFLAHLQSGAGIVADSNPDEEQIECENKVAGLCRAI

SFSGDLDIALALRTIVFPAYLQAGAGIVADSDPDDEQRECENKAAGLARAI

SFSGDMDVALALRTIVFSAHLQAGAGIVADSDPADEQRECENKSAALARAI

SFTGDLDIALALRTMVFQAHLQAGAGIVADSDPADEQRECENKAAALARAI

SFNGDMDIALALRTMVFPAHIQAGAGIVADSNPDDEHRECENKAAALARAI

SFDGDMLIALALRTIVFSAHLQAGAGIVADSSPDDEQRECENKAAALARAI

SFDGDMQIALSLRTIVFSAHLQAGAGIVADSSPDDEQRECENKAAALARAI

SFTGAMDMALGLRTMIIPVHIQAGAGIVADSKPEAEYEETVNKAAALGRAV

SFDGEMDVALALRTMVIPVHIQAGAGIVLDSNPESEYLETINKAAALGRAI

SFNGNLDSCITIRTIILKAYVQAGAGIVADSVPEREYEECYNKAMALLKAI

GFDGNMDMCIAIRTLLIKAYLQAGAGIVADSNPEAEYKETLRKLDALVETI

SYNGNMDMCIAIRTILFKAYVQAGAGIVYDSIPEMEYCETLNKAMALKEVL

GFDGNIDSCIAIRTMIVKAYIQAGAGIVADSVPENEYEETRNKAKALLKAV

SYDGTMDNCIALRTMVYKAYLQAGGGIVYDSDEYDEYVETMNKMMANHSTI

SYDGTMDTCIALRTMVYKAYLQAGGGIVFDSDKYDEYIETMNKMMANHNTI

GYNGAMDTCIALRTMLLKAYLQAGGGIVFDSDEYDEYVETINKMMANNRCI

GFDSAMDTCIALRTMLLKAYLQAGGGIVFDSDPYDEYVETLNKLGANIQCI

GYEDNMDTCIAIRTMVYKVYLQAGGGIVFDSDEQDEYVETLNKLRSNVTAI

AYDVQMDTCIAIRTMLVKAYLQAGGGIVFDSEKTEEWMETMNKLAANLRCI

DFAHEMDVCIAIRTMTFKAYLQAGGGIVYDSIEEDEYIETINKLRANIRCI

SFSGFLDTAIAIRTMVVKVYSQAGGGIVYDSDPQAEYMETVNKLGSAIKTL

SFSGFLDTAIAIRTMVVKVYSQAGGGIVYDSDPQAEYMETVNKLGSAVKTL

SWSGDADFAIVIRTATVEITVQAGAGIVADSDPESEYEETEQKMDGVLEAI

SWNGDADFAIVIRTLLIQASVQAGAGIVADSDPAYEFRETDRKMAAMLTAI

SITGYADFAIAIRMAEIEAHVRAGAGIVADSIPEKEFYETENKMKAVLKAF

SLTGDADMAIAIRMAEIEASVRAGAGIVADSVPEKEFFETENKMRAVLKAL

GFNGNLNTGLTLRTIRLQAEVRVGATVLYDSIPSAEEEETITKATALFETI

GFNGNLNTGLTLRTIRLQAEVRVGATVLYDSVPSAEEEETITKATALFETI

NFNGNLNTGLILRTIRLQAEVRVGATLLYDSIPQAEEQETITKAAAAFETI

GFDGNLNTGLVLRTVRIEAEIRVGATLLYDSIPEAEEEETRLKASAFLDIL

SFNGDLNTGLTLRSIRIKAEIRVGATLLMDSIPEEEEAETLVKAAAMLKAI

HFNGDMNTGLTLRTIRIKAEIRAGATLLFDSNPDEEEAETELKASAMIAAV

HFNGDMNTGLTLRTIRIKAEIRAGATLLFDSNPDEEEAETELKASAMIAAV

HFNGDMNTGLTLRTIRIKAEIRAGATLLFDSNPDEEEAETELKASAMIAAV

GFNGDMNTGLTLRTVRIKAEVRAGATLLNDSIPDEEEAETELKASAMLSAI

GFNGDMNTGLTLRTVRIKAEVRAGATLLNDSIPEEEEAETELKASAMLSAI

GFNGDMNTGLTLRTIRIKAEVRAGATLLYDSNPEEEEAETELKASAMIAAI

GFNGDMNTGLTLRTIRIKAEVRAGATLLYDSNPEEEEAETELKASAMIAAI

GFNGDMNTGLTLRTIRIKAEVRAGATLLNDSNPQEEEAETELKASAMISAI

NFNGDMNTGLTLRTIRIKAEVRAGATLLFDSIPEEEEAETELKASAMLSAI

HFNGDMNTGLTLRTIRLKAEVRAGATLLYDSNPEDEEAETELKASALIGAI

HFNGDLNTGLTLRTIRLKAEVRAGATLLFDSDPDAEEAETELKASALLGAI

GFDGDMNTGLTLRTIRIKAEVRAGATLLYDSDPDEEEAETELKASAMRAAI

GFDGGMDTGLTLRTIRMAAYVRAGATLLSDSDPDAEDAECRLKAAAFRDAI

GFDGSINTGLTIRTIRMKAEVRVGATCLFDSKPELEDRECQTKAAALFQAL

GFDGSINTGLTIRTIRMKAEVRVGATCLFDSKPELEDRECQTKAAALFQAL

NFDGSINTGLTIRTIRMKAEVRVGATCLFDSDPAAEDRECQVKAAALFQAL

NFDGSINTGLTIRTIRMKAEVRVGATLLFDSDPVAEEKECQTKAAALFQAL

GFDGTLDTGLVLRTIRLRAEVRVGATLLHRSDPEEEEAETLLKASALLALL

GFDGGLNTGLTLRTVRVKAEVRAGATLLFDSEPEAEEKETELKASAMIDAI

GFDGSMNTGLTLRTVRVKAEVRAGATLLYDSDAEAEELETELKASAMLDAI

NFDGSMNTGLTLRTAHIRATVRAGATLLYDSDPEAEERETFLKARALLETL

SLGGDINTGILIRTTYLRASYPVGATLLFDSVPVMEERETRLKATGFFRTL

Anthranilate phosphoribosyltransferase amino acid sequence alignment

49 301

Thaps PYIETLIAGRLTSDETYDAFSLILSTIASLLTLLRARRENPQEIAGMVRAMNDACVKFELLDIVGTGGDGADTINISTASVVLAAACGCTVAKAGNRSVS

Phatr PYIEILIQGPLTADETEAAFSEILQAVGSLLTLLRARGETPSEIAGMVRAMNKACVDLGLLDIVGTGGDGADTINISTASVVLAAACGCIVAKAGNRSVS

Oryz KVLETLIGGHFSEEEAEATLRLLLEEIAAFLVLLRAKGETYEEIVGLAKAMIGCCVDGLAVDIVGTGGDGADTVNISTGSTILAAAAGAKVAKQGSRASS

Arab QLIETLIDRDLSETEAESSLEFLLNAISAFLVLLRAKGETYEEIVGLARAMMKHAVEGLAVDIVGTGGDGANTVNISTGSSILAAACGAKVAKQGNRSSS

Chlre EVIEKLIVRDLTEKQAEEALGTLLDFAAAFLVLLRAKGETPAEIAGLAKAMLDKAVKTSVVDIVGTGGDGIGSVNISTGASILAAAAGAKVAKHGNRSVS

Pram TFAVQQIASAVENPVAVGVLLALLAEVAAFAKHMRSEAVS------VTSGTLDIVGTGGDGANTVNLSTAAAVLAASCGALVAKHGNRSVS

Psoj ------MRSEAVS------VSSGTLDIVGTGGDGANTVNLSTAAAVLAASCGALVAKHGNRSVS

Cme RLVERLIAAELSFDEAADAMHCMLEEVAAFLVLLRRNGAETGQLAGMAAALLERAVSTGTLDIVGTGGDGSNTVNISTAAAIVAAACGARVAKHGNRSAS

Desre TEAIQKVVANLSEAEAMETMQEVMEAIASLLTALHLKGETVPEITGFARTMRTKVVQTKLVDTCGTGGDGANTFNISTACAFVLAGAGLPVAKHGNRSVS

Moorth KAQISKVVAHLSEAEAAEAMDIIMAAIAAFLTALRLKGEMVDEITGFARSMRRRALTTSFVDTCGTGGDGRQTFNISTTAAFVVAGAGVAVAKHGNRSVS

Geosu KKAIAKVVEDLTEAEMIEVMDQIMSAIAAFITALRMKGETVEEITGAARVMRDRAIRVGILDVVGTGGDGTNTFNISTTVSFVVASCGVKVAKHGNRAVS

Cloth KKAISKLVENLSESEIIEALDCIMEAIGSFITALRIKGETIEEITGCAKVMRAKAICPNYIDTCGTGGDGTNTFNISTATAFVAAAGGVYVAKHGNRSVS

Alkam QQAIDKVIRDLAETEMMAVMQGIMEVIGGFLTALRMKGETVEEITASAKVMRSKAVEVNSIDTCGTGGDQANTFNISTAVAFVAAAAGVTVVKHGNRSVS

Metha KEYIKKLEEDLSSEEAEAAIGEILSAIGAFLLALRAKGEKPQEIAGFVRGMKQAGIKPVVIDTCGTGGDGLNTINVSTAAAIVTAAAGVPVAKHGNRAAT

MethM KEYIKKLEEDLSSEEAEAALEEVLSAIETFLLALKAKGEKPQEIVGFVRGMKKAGIKPNIVDTCGTGGDGLNTINVSTAAAIVTAAAGVPVAKHGNRAAT

MethS MNYLARLIENLTIEEAESLLGAFFDAIASALTALRMKGETAEELAGMAKRMRESAIRPRLVDTCGTGGDSTNTINVSTAAAIVAAACGVPVAKHGNYAVS

Natph KDYIRRVSDDLTQAEAREAASLVFEAIGALLSALRAKGETEPEIAGFAEGMRAAAIDPDLVDTCGTGGDGHDTINVSTTSSFVVAGAGVPVAKHGNYSVS

Halmar KEYIERVTDDLTQAEARAVATTVFEAIGALLTALRAKGETEAEIAGFAEGMRDAAIRPDLVDTCGTGGDDYNTINVSTTSAIVAAGAGVPIAKHGNYSVS

Lacca KQAIEKVVNNLTFEESEAVLDEIMNATASLLTALTAKNPTIDEIAGAAASMRSHAFPETVLEIVGTGGDHANTFNISTTSAIVVAATGTQVAKHGNRAAS

Rhopa KAIIAKVATTLTRDEAASAFDGMMSAMGALLMGLRVRGETVDEITGAVSTMRAKMVDAPAVDIVGTGGDGSGSVNVSTCASFIVAGCGVPVAKHGNRALS

Braja KSIIGKVATSLSRDEAASAFDAMMSAMGGLLMALRVRGETVDEITGAVAAMRSKMVTAPAVDIVGTGGDGSGSVNVSTCASFIVSGAGVPVAKHGNRALS

Nitwi KVLIGKVATTLTREEAAAAFNSMMSAMGGLLMALRVRGETVDEITGAVSAMRSMMVKAPAVDVVGTGGDGSGSVNVSTCASFIVAGAGVTVAKHGNRALS

Brusu KPYIAKAASPLSLGDAKAAFDIMMSAIGGFLMALRVRGETVPEIAGAVASMRSRMVIAPAMDIVGTGGDQSGSYNVSSCTAFVVAGAGVPVAKHGNRALS

Agrous KPLIAKVANSLNREDARTAFDILMSAIGGFLMALRVRGETVDEIVGAVSSMRARMVSAPAIDIVGTGGDGIGTYNISTLASIITAGTGLPVAKHGNRALS

Lokve KPLIYAASEPLSRAQAEEAFGYLFAAIGGLLMALRARGEAVSEYAAAAAVMRAHCVTAPAMDIVGTGGDGKHTLNISTATAFVVAGAGVPVAKHGNRNLS

Rhods KPLIGTAATPLSREEAEFAFECLFEAMGGLLMALRTRGETVDEYAAAASVMRAKCVRAPAIDIVGTGGDGKGTLNISTATAFVVAGAGVPVAKHGNRNLS

Burmu QEALQRTIEEIFHDEMLHLMRLIMRMAAAIITGLRVKKETIGEIAAAATVMREFAVEVQFVDIVGTGGDGSHTFNISTASMFVTAAAGAKVAKHGNRGVS

Ralso QDALTRCIEEIFHDEMLHLMRQIMRMASALIMGLRVKKETIGEIAAAATVMREFAVEVPFVDIVGTGGDGANTFNISTASMFVAAAAGARVAKHGGRGVS

Neigo QQAIERLISELFYDEMTDLMRQMMSVIAAILTGLRIKVETVSEITAAAAVMCEFAVPLELVDIVGTGGDGAKTFNISTTSMFVAAAAGAKVAKHGGRSVS

Psaer KGALNRIVNDLTTEEMQAVMRQIMTCIGAFLMGMRMKSETIDEIVGAVAVMRELAVQLPVVDVVGTGGDGANIFNVSSAASFVVAAAGGKVAKHGNRAVS

Azovi KQALARVAEDLNTAEMQGVMRQIMTCIGAFLMGMRMKSESIDEIVGAALVMRELAVTIGLVDTCGTGGDGMNIFNVSTAAAFVVAAAGGRVAKHGNRAVS

Ecoli QPILEKLYQTLSQQESHQLFSAVVRLLAAALVSMKIRGEHPNEIAGAATALLENAFPRPFADIVGTGGDGSNSINISTASAFVAAACGLKVAKHGNRSVS

Shiboy QPILEKLYQTLSQQESHQLFSAVVRLLAAALVSMKIRGEHPNEIAGAATALLENAFPRPFADIVGTGGDGSNSINISTASAFVAAACGLKVAKHGNRSVS

Vibvul EAIINKLYQSLTEQESQQLFDTIIRLMASALTALKIKGETPDEIAGAAKALLANAFPRPFADIVGTGGDGHNTINISTTAAFVAAACGLKVAKHGNRSVS

Prmar SQILEMLLENLPEVEATALMEAWLALTGAFLAALRAKGVTGNELSGMAQVLRGACPCPLMVDTCGTGGDGADTFNISTAVAFTAAACGANVAKHGNRSAS

Synech PRLLDRLLEQLLPSDAAVLMEAWLALTGAFLAALRARGAQGGELAAMAGVLRQACPCARLVDTCGTGGDGADTFNISTAVAFTAAACGAVVAKHGNRSAS

Anava YLLLQQLIDSLSRSQAAELMQGWLSVSGAILTALNFKGVSADELTGMAEVLQSQSGSGEIIDTCGTGGDGSSTFNISTAVAFVAAAYGVPVAKHGNRSAS

Triery PSLLQQLLDSLSSSQASNLMQGWLQISGAILAALQAKGVSAQELAGMAKVLQSLSTKEYIIDTCGTGGDGASTFNISTAVAFVLAAAGVPVAKHGNRSAS

Crowa QSLLQQLLDSLSQTQAGQLMQGWLDISGAILVTLQGKGVSGDELAGMARVLQQQSETAIVIDTCGTGGDGASTFNISTAVAFVAAAAGVKVAKHGNRSAS

Nost SAILQQLLKSLTVAQATDLMQGWLTISGAILAAIQAKGVSSEELVGMARVLQSQSSPPHLIDTCGTGGDGASTFNISTAVAFVAAAAGVKVAKHGNRSAS

Syncys PPFLQQLLDSLTRQQAVQLMEGWLDISGAILAAIQAKGLDPEELTGMAQVLQEQSNQGRLVDTCGTGGDGSSTFNISTAVAFVVAAAGVKVAKHGNRSAS

Eugl KKWIASLQGPVSAEDQRAAIHELLAVKAALLALLPPASLGEDTLQLFVDTLLEHGVAVPVADIVGSGGDGQNTWNVPTPAAIVAAGAGIRMAKHGNRSAS

Sachce LSYTKKLLAQLSSTDLHDALLVILSKVSSFLTALRVTKLDHKAIAEAAKAVLRHSLVDLILDIVGTGGDGQNTFNVSTSAAIVASGIQLKICKHGGKAST

Cangl VQLTKTLLETLTPKQLYRAMVIILESIASFLSCLKASRLDHRAIAEAAKAVLGFSVVELVLDIVGTGGDGQNTFNVSTSAAIVAAGIPLKVCKHGGKAST

CanAl TPYLKKLVVTLEPKDLSEALELIFSSTAAFLSCLRLRGLDQEAIAAAVTTVLHFATIPPYIDIVGTGGDGQNTFNVSTSSAIVAAGMGLPVCKHGGKAST

Yarli NTQLKAILDTFTPEDLAEVLALVAQPIITFLALLHAKGLDMNALAAAANTLRHAALPDTYVDIVGTGGDGQNTFNVSTSAAIVAAGMGIKVGKHGGKAST

Schipo RLAIHDLDKAIPLENYEAALRAILTSTASFLASLHLTKAEEVPLMQTVQILKSYSIANIFVDIVGTGGDGHNTFNVSTASAIVAAGAGLWVCKHGNKAST

Ustma RPLLKALAVSGTSTITHKQLEQILEDIGSALTCLKFCRLDIQAFALAARIFLNCCVHVPTLDLVGTGGDGKDTFNVSTTASMVAAGVRVRVCKHGAKASS

Asfum SPLLQKLAYPVDPAEIASAFALIFESTAALLTLLHSTGKDRDAIALCSLRMREAAIEKSLCDIVGTGGDSHSTFNISTTASIIAS--PLMMAKHGNRAQT

SCCGSADVLEALGVQVLNPTQVVECVEQCDIAFLFAPVNHPAMKAVAPVRKQLGVRTCFNILGPMTNAAGQHAVIGVFHEELLELMAETLKEVGVDHVIH

SACGSADVLEALGVKVLTPEQVVKCVDAVRMAFMFAPVNHPSMKYVAPIRKKLGVRTCFNILGPMTNAAGQHAVIGVFHPELLSLMAGALKEVGVDHVIH

SACGSADVLEAFGVNILGPEGIKRCVNEVGVGFMMSANYHPAMKIVKPVRKKLKIKTVFNILGPLLNPARPYAVIGVYHENIVTKMAKAAQKFGMKRVVH

SACGSADVLEALGVVLLGPEGIKRCVEEGGIGFMMSPMYHPAMKIVGPVRKKLKIKTVFNILGPMLNPARSYAVVGVYHKDLVVKMAKALQRFGMKRVVH

SLCGSADVLEALGIAILGPAGVNHCLDQAGIAFMYAPRYHPGMKAVRPVRSALKVRTALNMLGPLLNPAESYGLVGVYDTSISELMAGSLLRMGVQKVVH

SKSGSADVLEELGVPMLKPEHVASCLEEAQIAFMYAPHFHPAMRYVGPVRKAIGIRSMFNILGPLLNPAGKRVVIGVYTPTLLDVFGEVLLALGVEHVVH

SRSGSADVLEELGVPMLKPEHVGPCLEEAQIAFMFAPHFHPAMRHVAPVRKAIGIRSVFNILGPLLNPAGKRVVIGVYTPKLLDVFGEVLLALGVEHVVH

SKCGSADVLEALGVAILGPEQVARCIAETGISFLMAPRFHPQLATVAPLRRSLRVRTVFNNLGPLLNPARRYQVIGVAAPELMEPMAEAVARLGTERIVH

SKCGSADVLEQLGVFVLTPEEAGLCLDQVGIAFLFAPLLHGAMKYAAAPRKEIGIRTVFNILGPLTNPAFENQVLGVYSSDLAPVLAQVLANLGTKRVIH

SRCGSADMLEALGIKVLPPDAVARCLDEVGMAFLFAPVFHGAMKYAAGPRREIGIRTAFNLLGPLTNPAGPCQLVGVYDPDLTETVAAVLGRLGSRRVVH

SACGSADVLESLGVNLVTPETVEQAIAKIGIGFLFAPALHGAMKHAIGPRKEIGIRTIFNILGPLTNPAGDCQVLGVYREELVEPLARVLHKLGCRRVVH

SKSGSADVLEALGVNILDPVQVKECIEKVGIGFIYAPVFHKSMKHAAGPRKELGIRTIFNILGPLTNPSNKGQVLGVFNPNLTELMANVLLNLGIERVIH

SQCGSADVLEKLGVNILTPKQVETCVEQVNMGFMFAPKFHQAMKYAAAARRELGVRTIFNILGPLTNPAKKGQVLGVFDESLTEVMAQVLKELGVERVVH

SMSGSSDVLEALGIKVLTPGQVRKTIEKIGIGFMFAPVFHPAMKRVAGVRKKLGVRTVFNILGPLTNPAGKGQVVGVFDKKLCEPIAYALAELGTEHVVH

SMTGSSDVLEALGIKVLSPEYVRKTIEKIGIGFMFAPVFHPAMKRVAGVRKKLGVRTVFNILGPLTNPAGKGQVVGVFDKNLCEPIAYALAELGTEHVVH

SRCGSANVLEALGVNICPPERVESIIESVGIGFMLAPLFHPAMKRVAHIRKEMGIRTVFNVLGPLTNPAGEAQVVGVYSPALCEKIANVLNLLGTKRVVH

SSSGSADVLEEIGVTIAEPPAVETCIEETGMGFMLAPVFHPAMKAVIGPRKELGMRTIFNVLGPLTNPADDAQVVGVYDEALVPTLADALSRMSVDRVVH

SSSGSADVLEEVGVDIAEPPDVEETIERDGIGFMLAPVFHPAMKAVIGPRQELGMRTVFNILGPLTNPADDAQVLGVYDPDLVPVMAEALARLDVERVVH

SKSGAADVLEALGLDIETPAVSYESLQENNLAFLFAQEYHKSMKYVATVRKQLGFRTIFNILGPLANPAHTHQLLGVYDETLLEPLANVLKKLGVTNVVH

SKSGAADVLNALGVRIITPEHVGRCVTEAGIGFMFAPTHHPAMKNVGPTRVELATRTIFNLLGPLSNPAGKRQMIGVFSRQWVQPLAQVLKNLGSESVVH

SRSGAADVLASLGVRILRPEQVGRCVRECGIGFMFAPAHHPAMKNVGPTRVELATRTIFNLLGPLSNPAGKRQMVGVFSRQWVQPLAQVLKNLGSESVVH

SRSGAADCLAALGIRILTPEQVGRCINEAGIGFMFAPAHHPAMKNVGPTRVELATRTIFNLLGPLSNPAGRRQMVGVFSRQWVQPLAQVLKNHGSESVVH

SRSGAADALAALGINIADADTIGRSISEAGLGFMFAPMHHSAMRHVSPSRVELGTRTIFNLLGPLSNPASKRQLVGVFAPQWLEPLAHVLKELGSETVVY

SKSGTADALSALGVRLIGPDLIARCIAEAGLGFMFAQMHHSAMRHVGPSRVELGTRTIFNLLGPLSNPAGKRQLLGVFSPRWLVPLAEVLRDLGSESVVH

SKSGTADVQSALGINVVTPDVVERAIAQAGIGFMMAPLHHPAMRHVGPVRLELGCKTIFNILGPLTNPAGKRQLTGAFAIDLIFPMAETLQQLGTEKLVH

SKSGAADALTEMGLNVIGPEQVEACLMEAGIGFMMAPMHHPAMRHVGPVRAELGTRTIFNILGPLTNPAGKRQLTGAFSPDLIRPMAEVLSALGSEKLVH

SKSGSADVLEALGVNILQPDQVAASIAETGMGFMFAPNHHPAMKNIAAVRRELGVRTIFNILGPLTNPAGPNQLMGVFHPDLVGIQVRVMQRLGAQHVVY

SKSGSADVLEALGVNILTPEQVAESVATVGIGFMFAPNHHPAMKSIAPIRKELGVRTIFNILGPLTNPAGPNILMGVFHPDLVGIQVRVMQRLGAKHVVY

SSSGAADVMEQMGANLLTPEQIAQSIRQTGIGFMFAPNHHSAMRHVAPVRRSLGFRSIFNILGPLTNPAGPNQLLGVFHTDLCGILSRVLQQLGSKHVVC

GKSGSADLLEAAGIYLLTSEQVARCIDTVGVGFMFAQVHHKAMKYAAGPRRELGLRTLFNMLGPLTNPAGRHQVVGVFTQELCKPLAEVLKRLGSEHVVH

GKSGSADLLEAAGVYLLSPEQVARCVESVGVGFMFAPAHHGAMKHAIGPRRELGLRTLFNMLGPMTNPAGKRQVLGVFSQALCRPLAEVMARLGSVHVVH

SKSGSSDLLAAFGINLMNADKSRQALDELGVCFLFAPKYHTGFRHAMPVRQQLKTRTLFNVLGPLINPAHPLALIGVYSPELVLPIAETLRVLGYQRVVH

SKSGSSDLLAAFGINLMNADKSRQALDELGVCFLFAPKYHTGFRHAMPVRQQLKTRTLFNVLGPLINPAHPLALIGVYSPELVLPIAETLRVLGYQRVVH

SKSGSSDLLDSFGINLMSAEDTRSAVDNIGVAFLFAPQYHSGVKHAMPVRQTMKTRTIFNILGPLINPARNIELMGVYSQELVRPIAETMLKMGMKRVVH

GKVGSADVLEGLGLQLAPLVSVVEALVEVGVTFLFAPAWHPALVNLAPLRRSLGVRTVFNLLGPLVNPLQNAQVLGVAKAELLNPMAEALQRLGLQRVVH

GRVGSADVLEGLGLRLAEARQVVEALPAVGVTFLFAPAWHPALVNLAPLRRSLGVRTVFNLLGPLVNPLLDGQVLGVARPDLLDPMAEALLQLGQRRVVH

SLTGSADVLEALGVNLASSEKVQAALQEVGITFLFAPGWHPALKAVATLRRTLRIRTVFNLLGPLVNPLRTGQVVGLFTPKLLTTVAQALDNLGKQKVLH

GKVGSADVLEALGIRLAPTEKVISALSEVGITFLFAPGWHPAMKCVVPLRRTLKVRTVFNLLGPLVNPLRQAQIIGVYNSTLVKTVAQALGILGVEYALH

SKVGSADVLEYLGVKLATPEKVAEALEEVGITFLFAPGWHPAMKHVAPLRKTLKVRTIFNLLGPLVNPLRTGQIIGVYHPRFIRPMAEALHQLGIAQVLH

SKTGSADVLEALGINLANADKVQAAVSEVGITFLFAPGWHPALKTVATLRKTLKVRTIFNLLGPLVNPLRTGQIIGVNDPLLIEEIALALSHLGCRKALH

SKVGSADVLEALGLNLAGADQVAAAVSAVGITFLFAPGWHPALKSVAPIRKTLKVRTVFNLLGPLVNPLRTGQVIGVYSPDFLSVMAIALKNLGTARVLH

SNSGSADLLEQFGAYLVSPTVVPELVDRTNFSFLYAPAFHPALRNVAKVRQDLGTKTVFNFLGPLLNPAAQYNVVGVSDRAMAEVMARCLSRRPGVRVVH

SNSGAGDLIGTLGDMFVNSSTVPKLWPDNTFMFLLAPFFHHGMGHVSKIRKFLGIPTVFNVLGPLLHPVSNKRILGVYSKELAPEYAKAAALVYGSEIVW

SNSGSGDLINTLGQSSVTAETVPALW-ENKFMFLLAPYFHYGFGKTSNLRKLMHIPTIFNILGPLLHPVADKRVLGVYSKDLAPEYAKAASIVYNSEVVW

SSSGSGDLLKSLGDLSVNEVTTPEIVKKSKFCFLFAPSFHPGMGLVAHIRSQLGVPTIFNILGPLINPIPRARVLGVYSEKLGESYAQAASILAKNEVVF

SMSGAGDLLTCLGDMSVHQSTVPDIVAKGPFCFLFAPMFHPVMNRVAPVRKSMGIPTLFNVLGPLVNPIPKARIIGVYSEALGQIFAEAITHINVQGVVW

SASGSADLLMSFGDLLVTPKNIVSITEQCKFSFLFAPMCHPTLKNVAPIRKQLGLPTIFNLVGPLLNPIPYARIIGVSKLSLGEVVAKTLLKLGGRSVVC

STSGSADLLMSLGPLLLPASQLPNVLRKSKFSFLFAQLFHPALAPLGPIRRSLGFPTIFNVLGPLINPAKERCILGVHSYYLGRIFAEALRKRGERAIVC

SFSGSADVLNAISKISVTAENLAQVYEATSYAFLFAPNFHPGMMYANPVRRGLGLRTIFNLMGPLANPVDEARVVGVAYQSLGPVFAEALRSSGTKAVVC

GVLDEISPMGPATIVEVS-TRKYQFDPLSVNIPRCVVDLKGDPEQNAKEFENVLEGNAKKDAIVLNAGVGCYVYGMTIEEGCLARKTLVEGEVLKKWKAVS

GLLDEISPLGPSTILEIKNEREFEFDPLSIGIARCELDLKGGPEENAQKFRDVLLGDAKRDSIVLNAGVGCYVFGLAIEDGCLARATLESGSLLESWVKVS

SKLDEISPLGPGYILDVTPIEKMLFDPLDFGIPRCTLDLKGDPAFNAKVLQDVLAGGSIADALVLNAAASLLVSGKVLHDGVLAQETQRSGNTLESWIKIS

SCLDEMSPLGGGLVYDVTPIEEFSFDPLDFGIPRCTLDLRGGPDYNADVLRRVLSGGAIADSLILNAAAALLVSNRVLAEGVVAREVQSSGKTLDSWINIS

SMLDELTPMGPADVVEVTALKRYSLDPKEVGIPRCEVDLKGDAQLNAAILRDVFAGGAVADALCLNAGYALAACRVAPAEGVMAQEVQRRGATLQRWAEAS

CALDELNSIGPAEVVEVRLLRHYQLRPEEVGVPAVTLQLKGDATENAAILREAFTGGPVGNTIAYNAGAGLYVYGLAIKSGYMAKKQLASGATLDKWAQVA

CALDELNSMGPADVVEVRQFRRYELRPEDVGVPAVTLQLKGDATENATILRKVFSGCPVGNTIAYNAGAGLYVYGLAIKSGYMAKKQLTSGATLDNWAQVA

CALDEMAPIGATEVIEVRKPCRFRVEPETLGMPLCTVDLVDDCAANAATITSILQGGPISNAIIMNAGAGLVVYGLALQDACMAANALMSGETLEKWKSLS

GGLDEISLAGEALVYEVKDVKEMIIDPMDYGLDRAPIALAGDAKRNARMIKNILSGGPQRDTIIINAALGLIAGGLVLAMGILAEQIIDEGKKLNLLVEFS

GDLDEVTITGPSKITCLDKIRTYTFTPEDVGLPRANLDLAGTATDNAAIARAVLSGGPARDVVLINAAFALLAAGAALQQALLAESSIDSGAKLQAMVAWV

GDMDEITLTRETRIAEVTRVSVRTITPEEFGFASCPAELRGDAAGNARIVRGILEGGPRRDVVLLNAAFGLVAAGKAPAEGVIAAEAIDSGAKLEELITLT

GDMDEITTTTSTKVSEVRNVITYELFPENYGIALASPDLTGNAEENAQIIRRIFNGGPKRDIVVLNSAAALYVGKVVIEEGILAEEVIDSGQKLDEFVEFT

GDLDEITTTTKTKVSELKNISNYVIDPRQFDIPLTDKDLAGDAEKNASIILNIVKGGGKRNMVLINAGAAIYVGNAALQEGIRAAEVIDMGDKLNQLIKLS

GDMDEISNTGETFVAELKDVSTYTITPEAMGMLRAKPDIKGTPKENARDLLCIFKGGPKRDLVILNAAAALYVSGIVIRQAIIAEDAIDSGVKFNQFRNFT

GDMDEITNTGETYVAELKNVSTYTLTPESLGMLRASPDTKGSPKENARDLLCIFKGGPKRDLIILNAAAALYVSGIVIRQAIIAEDAIDSGVKFNQFRTFT

GSLDEISNTGSTFVSELCDVRNYVVDPRDLGYPLADLEIAGTPDENAERLVRILKGSRARELVAMNAGAAVYVSGIALREGCIAEGAISSGSALETLKTLV

GALDEIGVHGESTVAEVDGVEQYTVTPSDLGVDSHDLAVAGSPTENAADMRGILEGGAKRDIILANAGAAIYIAGEALAAGVAARESIDTGAAAAQLASLR

GDLDEIAIHGETVVAEVTDIAEYTITPEDMGLETRDIAISGSPEENAADLRGIVTGGAKRDIILANAGAAIYVAGVAHEAGVQARQAIESGAAADKLDDLI

GDLDEMTTADETAVVELQDLTKYTVTPEQFGLKRRQRDLVGTPEANADITRRILAGGPQRDIVLLNAGAALHVAHPAIQAGILAAKTIDDGEELNRLLAFS

GDLDEITLSGPTAVAELKNIRTFEVTPDEAGLPRAHAALRGDAEANAVALRSVLEGSPYRDVALLNAAAALVVAGKALKEGVLGAKSIDSGGRLRRLIAVT

GDLDEITLTGPTFVSSLHNIRNFEVTPEEAGLPRCEPALKGDADANAIALQSVLNGSAYRDVALMNAAAALVVAGRALKEGVLGAKSLDSGARLKHLIAVS

GDLDEITLAGPTFVAALEDIRTFEVTPEDAGLERADGALKGDAEANAASLRAVLEGGAFRDVALLNAAAALIVAGKALKEGVLGAKSLDGGNKLKQLIAVS

GDLDEMTTAGTTQVAALENIRTFEITPEEVGLRRCSPELKGEAAENAKALLGVLEGSAYRDIVLLNSGAALVVAGKALKDGIQAVQSIDSGAVLQKVIAVS

GDMDEVTTTGVTHVAALEDIRTFDLTPKDFGVEPALMDLKGDGIANAAALREVLSGNAYRDISLCNAAAALVIAGKALSQAMIASDALDSGAALDRLVAVS

GDTDEITICGPTSVAVLDGITSRQIHPEDAGLPEHAFDIIGSPQDNAQALRALLDGGAYRDAVLFNAAAALVVADKALPDGVIARDSIDSGAALDALVRVT

GDTDELAISAASKVAALEGIREFELHPEEAGLPVHPFEIVGTPAENAQAFRALLDGGAYRDAVLLNAAAALVVADRALREGVIATDSILSGAKVALLARLT

GDMDEVSLGAATLVGELRDVHEYEIHPEDFGLQMSNRTLKENAEESRAMLLGALDNGVAREIVTLNAGTALYAANVAIADGILAREAIASGAKVDELVRFT

GDMDEVSLGAATLVGELKDVSEYEIHPEDYGLQMSNRGLKADAEESKAMLIGALENGTPREIVTLNAGAALYAANLAIGDGMLAREAIASGAKLDELVRVT

GGLDEITLTGKTRVAELKDISEYDIRPEDFGIETRNLEIKANTQESLLKMNEVLDGGAARDIVLLNTAAALYAGNIALSDGIAAREGIDSGAKKEEFVGFT

SDLDEFSLAAATHIAELKDVREYEVRPEDFGIKSQTLGLEDSPQASLELIRDALGRQKAAELIVMNAGPALYAADLALHEGILAHDALHTGEKMDELVAFT

ADLDEISLAAPTHIAELRNISEYLVQPEDFGIKSQSLGLDEGPQESLALIRDALGRQKAAEMIVLNAGAALYAADQALKEGVLAHDALHTGDKLEELASIT

SGMDEVSLHAPTIVAELHDIKSYQLTAEDFGLTPYHQQLAGTPEENRDILTRLLQGAAHEAAVAANVAMLMRLHGHEQANAQTVLEVLRSGDRVTALAARG

SGMDEVSLHAPTIVAELHDIKSYQLTAEDFGLTPYHQQLAGTPEENRDILTRLLQGAAHEAAVAANVAMLMRLHGHEQANAQTVLEVLRSGDRVTALAARG

GGLDEVAIHGDTLVAEIKDIHEYTLTPADFGVNTHPLAIKGDPEENKAIITHLLTGEAQLSAVAVNVALLMRLFGHEKANTQQAIDVMNSGQLVEKLAQHA

GGLDEASLEGANAMRLLENLRQASIDSAELGLTRAPLALQGDLATNQAILSAVLQGAPQRDVVALNTALVLWAAGLQDDLQATAKTCLQEGQRLEGLRMAL

GGLDEASLAGPNAVRIVEDIRAEILAPADFGLREAPLALKGDLELNQAILRELLQGEAQRDVVAFNTALVLWVAGVEMDLRSRAITALAEGERLEQLRQAL

GELDEAGLGDLTDLAVLSDLQLTTINPQEVGVTPAPIALRGDVQENAEILKAVLQGQAQQDAVALNAALALQVAGAVLDHAKVAKEILQTGAKLEQLVHFL

GELDEAGLGDITDIAILSHVKATSINPQYLGLNYAPITLQGDVEQNAEILKNVLQGSQQTDVVALNSSLALQVAGVVEAHQEKAKDILQSGLKLEQLVQFL

GELDEAGLGDVTDVAFLKEVNLGEINPQSLGLTSTPLALKGNVEENASILTDVLQGSAQQDAVAVNAGLALQVGDVVCEHQKKAKDILKSGDKLTSLVEFL

GELDEAGLADVTDLAILQDVSCLALNPQELGLNHAPTVLRGDVAENAEILKAILQGQAQQDVVALNTALALQVGEAITDIVEIAREVLQSGTKLEQLAEFL

GELDEAGLGAPTDIASFNQVTPQVLDPQNFGLAPAPLALKGDLAENVTILSQVLQGQAQIDAVALNASLALQVGDAVGDHGQLAKDILSQGDKLQQLVAFL

SDMDKISPVRNTHSWRVEGPVYQLLRPEDFGLVSAKDGAGGSPAHNAAALLRVLQG---PDPLL------

GVLDEVSPIGKTTVWHIDPLKTFQLEPSMFGLEEELSCASYGPKENARILKEEVLSNPIYDYILMNTAVLYCLSQGHWKEGIKAEESIHSGRSLEHFIDSV

GVLDEVSPIGKTTVWHVTTVDIFELEPAMFGLQEPLDCKSYGPERNAEILRDDILSHPVYDYILLNTAVLYCLSQGHWKQGVVANESIQSGKALEHFIKDV

GVLDEISPIGYTKTWTIDKIERNRISPKDFGLPEDLSVKSGTPQQNAEILSHILNQHPLVDYILMNSAAVAVVSGIAWVDGVLAKESIVSGKALEDFQNSS

GELDEISPAGRTKVWRVTPIEELYLTPEDFGLSRPLSVSSGTPTENATVLKQLLSNHPISDYVIMNAAALAVIDGHAWKHGVLARESIKSGNALETFVEAS

GELDEISPAGPTHTWLVRDITHEVYTPESFHLQSPLSVASGTPSANAILLEELLSNHPILDYVLMNTAALLHVAGMALREGVIAQQSISSGRELSNFSTIS

GELDEISPAGKTDVWELKDIEEFTIEPEDFGLKKPLEVGSHSADENAAIVLKMFSTQAIKDYTLLQTAALLYVGSYALEDAALARESIESGRAMETFRDES

GELDEISCAGPTNCWKLTGIETFQLHPSDFGLPAPLSVYGKMPKENAAKIMSILRNDPILTFVLINVAALLVISGICWKEGVRARWAVESGKCLEQFIEVT

Phosphoribosyl anthranilate isomerase amino acid sequence alignment

47 177

Thaps KLIKICGLTQPDDALVACRAGANLLGVIFAESKRRVSVEQAKAIVDAPSVVGVFQNPLEFVKEMVEVCGLDLVQLHGSEGMEAANAKNALRVVDIESGEG

Phatr QVVKVCGITSSEDALVACQAGANLIGVIFAPSARKVTPEQAKAVVQAPLVVGVFQNDSSFVREMVDSCGLDLVQLHGQEGFAAAKVESAIRVVDIVSGR-

Pram PLAKVCGVTKVEYALAALRNGANMIGIIMAESPRYVQVEQAKAIAQAPLVVGVFANTAAEMNAAAEEIGLDLVQLHGDEGYEICNDIKTIRALHLPSDGV

Psoj PLAKVCGVTTVEYALAALRNGANMIGIIMAESPRYVQAEQAKAIAKAPLVVGVFANTAAEMNAAAEDIGLDLVQLHGDEGFEICKDIKTIRALHLPSDGV

Ppar PLAKVCGITTVEYALAALRNGANMIGIIMAESPRYVEKEEAKAIAKAPLVVGVFVNTATEMNAAAEEIGLDLVQLHGDEGFEICKDIKTIRALHLPCDGV

Gibze LYVKICGTRSAEAARRAAESGADFVGICLVPAKRCISHETALAISEAPQLVGIFQNPLSEVLEKQKQYNLDLVQLHGDEPIEWANLIPVVRCFKPS----

Neucr LLVKICGTRSAEAAAEAIKAGADLVGMIMVPTKRCVDHETALSISQAPLLVGVFMNPLEEVLEKQHLYDLDIVQLHGDEPLEWANLIPVVRKFKPG----

Asory PLVKICGTRSEDGARAAIEAGADLVGIIQVQRKRTVSDDVALRISQVALLVGVFQNPLSYILEQQQKLELDVVQLHGSEPLEWAKLIPVIRKFGLD----

Schipo PLVKVCGTRSLLAAKTIVESGGDLIGLIFVESKRKVDLSVAKEISHFPLLVGVFQNPLEYIRSIIAEVNLDIVQLHGQEPFEWIHMLDVIKVFPLN----

Ustma SLVKICGLSTVEAAVTAAEAGADMLGMILAPTKRTVSLEQAAEIIKAPLLVGVFRDPLAEVASTAERLGLDVVQLHGSEGTEWAKFLPVIRVFSVKLG--

Agabi PLVKICGVKTKDQALAIADAGADLLGLMFAKSKRRIDRQVAKEIAAAPLLVGIFQNPLEEILETVADVQLDIVQLHGNEPVEWATQIPVIRAFHLGRI--

Sachce PLVKVCGLQSTEAAECALDSDADLLGIICVPRKRTIDPVIARKISSLKYLVGVFRNPKEDVLALVNDYGIDIVQLHGESWQEYQEFLGVIKRLVFP----

Canal ---KICGIKTVEAASVAIDNGANLLGCILVPRARTIDLEVAKQIARMPFLVGVFRNPKEEVFRIAREVGLDFIQLHGE---DKLEFLGLIPRYVVP----

Stret TKVKICGLSTPEAVATAVKAGADYIGFVFAKSKRQVSLEQAHELAKGTKIVGVFVSSLKELEEAISQVPLDIVQIHG--TFDEDLIPKVIRAIQIS----

Chlpha VKIKICGITRLSDALDACFAGADALGFNFSSSPRAIAPERAKEIIEKVESTGIFVDQSPEEINALCYCRLQIAQLHSQYSPEQAR-SIVIKVFRPEVEEV

Pellu TRIKICGITRMEDARAAALFGADALGFNFSPSPRCVTKDAARTMVCAIEGVGVFVEQDPREIIEICYCGLSVAQLHGRYSEEETKLVLVLKVFRPEPAAV

Cloth TCVKICGLRRKEDIDYVNLYKPQFAGFVFAESRRKVSKETARMLVKAIKSVGIFVNEKKETVAEIVYTGLDCVQLHGETPEYVEKLKEIWKAVRVK---S

Mothe VRVKICGIRDWEEARMVLDAGVDTLGFVFARSPRAIKPEAAREIITKTTTVGVFVNEPRYSLMEIAFCRLDVLQLHGEPPEYCHGLSQ-IKAIRVR---S

Bacce MKVKICGITDMETAKRACEYGADALGFVFAESKRKITPGLAKEIIQEVLKIGVFVNESVEMIQKIAECGLTHVQLHGEDNYQIRRLNI-IKSLGVTMKNA

Desha PRIKICGIRTGEEARWAVEAGADALGFIFVPSKRYIQPETAREIILNISKVGVFAQASPEHVGRIVECSLDTIQLHGEDPRLYRHLSV-IKAFSFPPGNS

theko EFVKICGVKTMDELRLVERY-ADATGVVVNSSKRKVPLKTAAELIEMLVSTMKTFPEWAN----AVKTGAEYIQVHSMHPKAVNRLKDVMKAFMVPSDDP

Pyrfur MFVKICGIKSLEELEIVEKH-ADATGVVVNSSKRRIPLEKAREIIENLVSTMVGFSEWAM----AIRTGAQYIQVHSALPQTIDTLKKVMKAFRVPSKNP

Metmaz TRIKICGMCSPEDMEMAALYGADAVGFITEVIESPRKLDSDTAASLILDSVMVIMPENSSRALELIKVRPDIVQIHSLPSVELEVIREIIKTLSVPASRV

Brad LLVKICGLTTPETLGAALDAGAEMVGFVFFPSPRHVGLTAARELGQQALKVALTVDDDATFENIVETLRPDLLQLHGESVARIRDLKQVMKAIAVATTAD

Nitwi LIVKICGLSTPSTLDVALQAGADMVGFVFFPSPRHLELARAQELGAQAAKVALTADDDETLCGIIEALRPDLLQLHGETVPRIREIKRVMKAIGVEIAAD

Rhopa LDVKICGLSTRATCDAALAAGADMVGLVFFSSPRHVDLGTAADLARAASVVALTVDDDAQLAAIVETVRPDLLQLHGESPARVAEIKRVMKALPIATRDD

Roseb IRVKICGLKTPQDVSAAAAAGAAYVGFVFFPSPRNVSIEQAVNLALELCKVALTVNDDALLDALTDAVPLDMLQLHGESAERVAQVKAVMKAIGVAGAED

Eryt IQIKICGLSTPETIAASADAGASHIGFNFYPSPRSVAPELASELAAILLKVGVFVNDDTLLNEAVRHGALDAIQLHGETPARVQEVKAVWKVLSVETDED

Natph TRVKVCGITNRSDLDTAVDAGVDAVGLIVDVTPREISPQQAAELAAAVTAVLVTMAVPDAATELVEAVRPDAVQVHGSTPDALASLGAVIKAT------D

Proma TAVKICGLTNIDQAKSIAALGVEAIGVIGVASPRFVAEQQRRDLFAQLQRVWVVADDDTDLSEALQEGAPSAIQLHGETPEHCANLRIWWKALRIRSHED

Synec PAVKICGLTDTEQALAIAAMGADAIGVIGVATPRYLEDSPRRGLFSQLQRVWVVADSNAMLDASLQEGTPTVIQLHGEPPAQCQALRQVWKALRLRSQDD

Nospu MRVKICGITQPQQSIAIASLGATALGFICVPSPRYVTTSQIRAAVAEIDTIGVFANSIVEISQIVVDSGLTGVQLHGELPDFCYQLRQIIKALRIRSLEH

Anava MRVKICGITQPQQSVAIASLGATALGFICVPSPRYVTAAQIWAAVAPIDKIGVFANSIAEIKQTVIDCGLTGVQLHGETPEFCDQLRRILKALRVRSLEH

Crowa MRVKICGITQVQQGQAIAELGATALGFICVESPRYINSQQIQEIIRVIDLIGVFANSLTKIKGVLQVAQLTAIQLHGETPEFCAQVKQIIKAFRIKTVGS

Glowa MRVKICGFTDPGQAQAAARLGVHALGFVCVPTPRYVDAARLREIAAATFKVGVFVDPVEAMRAAVEAGELQGVQLHGESPETCAHLARRIKALRVREPAD

Xylfa TRIKFCGMTRVGDVRLASELGVDAVGLIFASSSRLLTVSAACAIRRTVNVVALFQNSADEIHTVVRTVRPTLLQFHGEEDAFCRTFNVYLKAIPMAKRIC

Legpn IRVKMCGMTRSEDIQYAIDLGVDAIGLIFYPSARNVSLEKARIIVNNVDIVAVLVNEQSFVQLIINEIPVQLLQFHGESSEFCRQFNKFIKAIHPKTAIQ

Ralme TRIKICGLTREEDVRAAVDAGADAIGLVFYASPRHVDVAHAAALAELVSVVGLFVNDAEEVAHVAERVPLTLLQFHGETPQQCTEIARFMRAARVRPGLD

Decha TRIKICGLTREEDVDAAVAAGADAIGFVFYPSPRYVSPQRAAELVKRVDVVGLFVNAPEVVRIACEALPINVLQFHGEDAAYCSQFARYLRAARVRPGLD

Psent VRSKICGITRLEDALAAVEAGADAIGFVFYASPRAVDVRQARAIIAEVTTVGLFVNSRCELNEILEVVPLDLLQFHGETPADCEGYHRWIKALRVRPDDD

Nitoc TRVKFCGITRREDAIQAIRLGADAIGLVFYPSPRAVSPQQAYQIVREVTVVGLFVNASCYLQQILDKVPIDILQFHGESPEECGYYGRYIKAIRMAEGVD

Niesm IRTKICGITTPEDALYAAHAGADALGLVFYPSPRAVDIIKAQKITAAVSVVALFVNSAQNIRRILAEVPIHIIQFHGEDDAFCRQFHRYIKAIRVQTASD

Medtr PLVKMCGITSAKDAAMAAEAGANFIGMIVWPSKRSVSLSVAKEISKVAEPVGVFVDDAETILRASDASNLEFVQLHGGSRAAFPSLIQ--RVIYVLDGSL

Arab PLVKMCGITSARDAAMAVEAGADFIGMIIWPSKRSISLSVAKDISKVAKPVGVFVEDENTILRAADSSDLELVQLHGGSRAAFSRLVR--RVIYVLDGKL

Oryz PVVKMCGITSAKDAETALEAGAKLIGMILWPSKRSVALAEAKEISRVAESVGVFVDDEETILRVSDSCDLNLVQLHGESRSLLHVLSK--RIIYVLDGKL

chlre PLVKICGITNAEDAQHAVQSGADLLGMIMWQAKRAVSADTARAIAAVAKAVGVFVDDAATISARCRDAQIPIAQLHGGARAALPDLDP--EVVYVLDGTP

CME KSLKVCGVTSAEDATWILETIQRRLSVQVWISRRCVSVREARDIAAAARTVAVFVDDLSRIRQVCTDASINVAQLHGLDWQEHVLRAVYARVLRPDEPSK

KSADIAAAIIAILLDGGGGTGQAFDWTPVIIAGGLTPENIGEAVLNVKPWGVDVAGGVEAAGKDTEKVEKFVGGAKR

AAENAVETIIAILLDEGGGTSRSFDWTPVIIAGGLSPENVKDAVAGTRPFGVDVSSGTEAPGKDHQKVRDFVQNAKQ

DAEAILQQVNYILLDQQGGTGVAFDWKPCLMAGGLTPENVVKALSVGHPVGVDVSSGVEVGSKDLDKVTAFLRAVKD

DAEAVLQQVNYILLDQQGGTGVTFDWKPCLMAGGLTPENVVKALSVGHPVGVDVSSGVEVGSKDLDKVAAFLKAVKD

DAEAVLQQVNYILLDQQGGTGVAFDWKPCLMAGGLTPENVVKALSVGHPVGVDVSSGVEVGSKDLDKVAAFLKAVKD

-----VGIGTVPLLDGSG-SGKLLNVARVFLAGGLNPDNVAESVKALGPIGVDVSSGVEEGKQSLDKITAFIKAAKE

-----VGLAAVPLLDGAG-SGTLLDLETVLLAGGLEPSNVVETVKSLGPIGVDVSSGVEEGKQSLEKIREFVKAAKS

-----TGIATLPLLDGAGGSGELLDQSRVILAGGLDPTNVADTIKKLGSVGVDVSSGVESGVQDPSKIHAFVQAVRG

-----SEISIVPLIDYVGGESGGLG-KSYVLAGGLTPKNVQDAISVSRPAVVDVSSGVETGKQDLEKIKAFINAVKE

-KDLLREVSHIAALDGDGGTGVSFDWSPVLLAGGLTAENVAQAIKVAQPFAVDTSSGVETGNKDLAKIKAFVLAART

-RG-VENITHFILFDLSGGSGKIVDWAPIILAGGLTPENVAEAVKQVRPWAVDVSGGVETDGKDLEKVRLFIARVKL

-ILLS-AASFIPLFDEAGGTGELLDWNHFMLAGGLTPENVGDALRLNGVIGVDVSGGVETGV-DSNKIANFVKNAKK

-LLEEQSLSSLPLLDEVGGEGKLLDWT-ILAGGLTPENLP----TFDNILGYDVSGGVETGV-DSLKIIKFIQKGHA

-----DSQV-YLLFD-IAGSGQTFDWQDYFIAGGLAVDNVAEAKETFHPYALDVSSGVETGY-DLKKIKAFIERVKA

FAFAQNSG-NAFLFDMAGGTGETIEASYAILAGGLNATNVGEAIRRIQPYGVDTASGVESRPKDSREIRSFVKAVHH

LRFREATG-SAFLFDMEGGTGEQMEGSYGVLAGGLNPGNVREAVQSLRPYALDTASGVEESPKSHEKMQAFVQAVRA

LEIISEF-DAFLLDSYGGAGAVFDWQLRIILAGGLNPENVKTAVAKVKPYGVDVSSGVETD-KDAEKIRDFIMKVRE

LASLEAYR-QGFLLDKAGGTGTTFNWEPVILAGGLTPENVGAAIQLVHPYAVDVSSGVEVD-KNPARIAAFLEAVRK

QEYETDY----ILFDFHGGNGKTFSWEKTILAGGLNALNIEEAIRTVRPYMVDVSSGVETE-KDVEKIKQFIIKTKE

SEAAPDFSLHGILLDRTGGTGIPLPWHPLILAGGLNPDNILEAIRLTRPYGVDVSSGVERN-KDREKIQHFISQARK

AEDAERLLEDKILLD----SGRRHDYRPIVLAGGLTPENVGEAIRWVKPAGVDVSSGVERN-KDRVLIEAFMAVVRN

EEDANRLLSDMVLLD----SGKLHDLRPVIVAGGLNAENVEEVIKVVKPYGVDVSSGVEKY-KDPKLVEEFVRRAKN

QSPVKRLLDDSILLDKTGGTGYVHDWDPLILAGGLKPENVQEAIRIVSPYAVDAASGVEIL-KDAVKIRSFIEEVRC

LVPLAGYADDRILFDRPGGLGATFDWHPFMVSGGLSADNVAEAVRITRAGGVDVSSGVERAPKDCDMIRNFIRAARA

LADLPRYAADRLLFDRPGGLGVPFDWRPFMLSGGLAAGNVDDAVRITRAGGVDVSSGVESAPKDAGMVRDFIRAARA

LAALPDYAADRILFDRPGGLGVAFDWTPFMVSGGLTLDNVADALRITRAGGVDISSGVESAPKDPELIRAFIRAARA

LPQIDLYSQDQLLIDLPGGNGLAFDWRPWMLAGGLTPDNVAEAVRMTGARQVDVSSGVETAPKDAALIAQFNAAARS

IAAADAYSSDLILFDLPGGMGMGFDWSPWGLAGGLNSENVADAIRATGAPLVDTSSGVESAPKDVDRIAAFCKAALD

PGTAATYDGDAVLVDGAGGTGTVHDWDPVVLAGGLTPENVADAVEAAAPFAVDVASGVEAESKDADAVSAFVDAAGG

LSLAHTYAGDALLLDQLGGTGHRLPLNPWWLAGGVSAEWIPELLSQVNPWGLDASSRLEISPKDLKLVEALVEAVR-

LHAVKGYVQDGLLLDQLGGTGHRLPLDPWWLAGGISAEWIPELLDRVTPDGLDASSRLEVRPKDLEKVNALLSAVR-

LGTAADYTKDTLLLDQLGGTGKTLDWKPWFLAGGLTADNIVEALSQVSPSGVDLSSGVERAPKDLDKVSKLFEKLG-

LEQAIIYTQNTLLLDQLGGTGQTLDWQPWLLAGGLTPDNILEALSQLNPDGIDLSSGVERKPKDLDKVALLFEKLG-

LENIPQYVEDTLLLDQLGGTGKTLNWDPWFLAGGLNPNNILQALDNLAPDGIDLSSGVERSPKEIGKVAQLFTKLNQ

LEKIALYVDEAVLLDQAGGTGRTLDWKPWMLSGGLRADNLAQALDILTPDAVDLSSGVENVPKDLSKITQILHIAQG

TRTLYLKYPAGFIFDLKGGTGQTFDWSPFLLAGGITPENVFDAIAATVPWGVDVSSGIELQPKDGDKMRQFVEEVRR

IQSAVDEFFSAILLDGRGGTGLTFDWNPYILAGGLNESNILEAITMCHPYAVDVCSGIEASPKDHLKMSRFIKAIWG

LVEFANQYRSGLLLDYGGG-GHVFDWTRIVLSGGLNAQNVAGAIERVRPYAVDVSSGVEASKKDHARIAAFVRAVRQ

LVEFAGSFPRGLLLDYGGG-GHVFDWTYLVLSGGLTADNVGDAVRRVRPVAVDISSGVEASKKDHSKIAAFVAAVRK

LQAACKHYASGILLDVPGGTGESFDWSPIILAGGLSADNVAEAIRQVRPYAVDVSGGVEQRKKDAAKIEAFMRAVRQ

LPSLARSYESALLLDVPGGTGRAFDWRAVILAGGLTPENIAQAVRQVRPYAVDVSGGVERIKKDAAKMAAFMRGVDS

IRNAADRFPQALLFDEYGGTGHRFDWTPWVLAGGLTPENVDEAIRITGAEAVDVSGGVEASKKDPAKVAAFIATANR

LNTIPDE--DWVLVD--GGSGEAFDWAGWLLAGGVNPENVGEALSSLKPGGVDVSSGICADGKDQSRIASFMDAVHS

LNEIPEE--DWILVD--GGSGHGFNWAGWLLAGGINPTNVSEALSILQPDGIDVSSGICGDGKDKSKISSFITAVRS

INALPDE--DWFLVD--GGSGKGFNWQGWLLAGGLHADNVCDAFYALKPNGVDVSSGICADGKDPTRISSFMRNVKS

LTPPPSELLDWVLVD--GGSGQALDWRGWLLAGGLNPDNVATAAGLAQPSAVDVSSGVCGDGKDHGKVSSFISSAKA

RAITTADGRCYTLID---GDGKPFDWRLWMIAGGLHAGNVVEAVRVTNA-GVDVASGVEVTDKDPARLEAFLDALCS

Indole-3-glycerol phosphate synthase amino acid sequence alignment

52 228

Thaps2 FVRKAKEADVRKYCEESVLLKQINDAPTTTVGEVQGLISIIAEYKRKLEGSGFLSEILPPEILSPVFREFGATAVAVLADERTGGCTYDDIVEMVPLPVI

Phatr2 LVKKAKEQELRKYIEEHVKYNLLKENLDQVVGPLQESITVIAEYKRKFNPTGLIHEMHVPELLSPSFREYGASAIAVMADPRMGGCDYDDIRHFVPLPVI

Thaps1 KITAATILDVEAYTSPTILIETAAEFHLPLQHQIQQTMALAAEFKRASPSKGDIAPHLNAGEQASVYYKAGASVISVLTEGRWFKGSLADLREAR-----

Phatr1 TITATRLLDYQSLLEPSLLAQEAKSFALNLQTVIQSQMALAAEFKRASPSKGDIATHLNAGEQAVKYTKAGANIISVLTESHWFKGSLDDMTQARRPAIL

Pram EIAAQRRLDVEAAKQVTPLVKKIESTELRVLDRLNAPVALAAEFKRASPSKGDIATGLNLREQVKSYADAGASMISVLTEPKWFKGSLEDMMAARRPAIL

Psoj EIAAQRRLDVEAAKQVSALTKKIESAELPVLDRLNAPIALAAEFKRASPSKGDIATELNLQEQVKAYADAGASMISVLTEPKWFKGSLEDMQAARRPAIL

Ppar EIAAQRRLDVAAAKQVSTLAKKIEHTELPVLERLNAP------EQVQAYANAGASMISVLTEPKWFKGSLDDMMEARRPAIL

Gibze RIYANRKAAVDAQKQPSQEDLQVAYDLIPLVSRLRESVALMAEIKRGSPSKGIFALDISAPAQARKYALAGASVISVLTEPEWFKGSIEDLRAVRRPAIL

Neucr KIYAHRKAAVDAQKQPSLSDLQAAYNLISLVDRLRNSVALCAEIKRASPSKGVFALDIDAPSQARKYALAGASVISVLTEPEWFKGSIDDLRAVRRPAVL

Asory KIYDHRRAAVAIQKTPSQADLQAAYDLVSFPARLRQSLSLMAEIKRASPSKGMIAENACAPAQAREYAKAGASVISVLTEPEWFKGSIDDLRAVRRPAIL

Sachce RIYARRKIDVNEQSKPGFQDLQSNYDLQDFYTVLSSSAVVLAEVKRASPSKGPICLKAVAAEQALKYAEAGASAISVLTEPHWFHGSLQDLVNVRRPCVL

Schipo KIHAQRLIDIAESKRPGLGDLQTYLNLINFYERLKQSPALMAEVKRASPSKGDIKLDANAAIQALTYAQVGASVISVLTEPKWFKGSLNDLFVARRPAIL

Ustma RIHVQRLKDIAATKAPGFRDLDIALSLINFPERLQRHPGVMAEMKRASPSKGDIDPTAHAGAQALAYARGGASVISVLTEPKWFKGTMHDLSLARRPAIL

Agabio KIYKQRLLDVDQAQAPGSQDLQTLHSLIDFTVRIRQGPAMFAEIKRASPSKGPIALNINPAAQALKYALAGAHTISVLTEPKWFLGSLHDMLHARRPAIL

Chlaf NIVAYKKQEVERLKEVCTPDHYLSQILEHFATALKGPLSIIGEIKRQSPTRGKIGCIDNPADLALKYCCGGAAAISVLTDTRGFGGSFLDMQQVSHVSVL

Chlac NIIAYKKQEVERLKEVSSKNHYLSQILEQFATALKEPLSIIGEIKRQSPTRGKIRSIDSPADLALKYCCGGASAISVLTDTQGFGGSFLDMQQVNHVSVL

Coxbur AILKNKKQEIAHLKAFSSDAFSLS---KSFKRIISSTTTIIAEIKRRSPSKGHLAEIADPVALAKQYVQGGAAGVSVLTDKLAFDGSIRDLQQVS-VAVL

Cme3 EILRRKQQEVASLKAVAQAEHPLQRRLRQFSSAIRKPISVIAEIKRRSPSSGLIAEIPDVRQLSDLYYNGGAAAISVLTD-AAFDGTLDDLQTVVPCPVL

Bacce KIVEQKKKEVAELYEYTP------KRMTHSLVEAFTVIAEVKRASPSKGDINLHVDVRKQVKTYEECGAGAVSVLTDGQFFKGSFYDLQTAR-IPLL

Arab1 EIVWHKDKEVAQMKEKPLLKKALDNVPKDFIGALRSAPGLIAEVKKASPSRGILREDFNPVEIAQAYEKGGAACLSVLTDDKYFKGSYENLQAIMKCPLL

Arab2 EITWYKDVEVSRMKENPLLKKAVEDAPRDFVGALRMAPGLIAEVKKASPSRGILKENFDPVEIAQAYEKGGAACLSVLTDQKYFQGGFENLEAIRKCPLL

Oryza QIIWDKEVEVSQRKAKPLVIESSQHAPRDFVGALTAAPALIAEVKKASPSRGVLREDFNPVEIAQSYEKNGAACLSILTDEKHFQGSFENLETVRKCPLL

Nostpu EIVLHKRQEVAQMQQLPLLQQQLITAPRNFLAALQQNPSLIAEVKKASPSRGIIRADFDPVAIAQAYERGGAACLSVLTDRKFFHGSFDNLRNVRALPLL

Proma KILWEKDREVKVSREVPLLKAQINNLPKDFLGALRQSPAVIAEIKKASPSKGVIRENFDPIEIALAYKLGGATCLSVLTDKSFFQGGFEVLVQVRDLPLL

Synel EIVWYKEREVNAWRELPLLQNQVRGLTRDFLAALRQAPAVIAEVKKASPSKGVLREDFDPVAIAQAYAANGAACISVLTDEKFFQGGFENLQRVRDVPLL

Theel KIVWHKEQEVEQLRELPLLQRQVLEAPRSFLAAVQNPPALIAEVKKASPSKGIIRPNFDPVAIAQAYVAAGATCLSVLTDSEFFQGSFEYLAQIRAVPLL

Ralme KILAVKADEVAAARKRDLLRAEAESLRRGFERALRDKAGVIAEVKKASPSKGVLRENFVPEAIAESYAAHGAACLSVLTDVNFFQGHAEYLKRARPLPAL

Nitmu EIVATKRQEIAALKAGSLLRRQAEALPRDFIGALRKKPAVIAEIKKASPSKGVLREQFDPAAIAVSYGQHGAACLSVLTDQQYFGGSAEYLKQARDLPVL

Xylfa KIIAWKVEEIAERLLVSQLVARCADLPRGFAGALQATPAVIAEIKKASPSKGVLREDFRPAEIAISYELGGASCLSVLTDVHFFKGHDDYLSQARTLPVL

Nitoc KILQRKAEEISERAQLSILSQKVEESPRGFFKALAKKSAVIAEIKKASPSKGLLCESFHPEAIAQSYEKGGAACLSVLTDQDFFRGSEADLQKARNLPVL

Mesloti KIEAYKRDEIAAAKAVPLIKARAKDADRGFLAALEAKFALIAEIKKASPSKGLIRADFDPPALAQAYEKGGAACLSVLTDAPSFQGAPEFLTKARALPAL

Brume KIEAYKREEIAAAKALALLKARTRDQSRGFLKALEAKFALIAEIKKASPSKGLIRPDFDPPALAKAYEEGGAACLSVLTDTPSFQGAPEFLTAARSLPAL

Agrous KIETYKLEEIAAAKAVSLLKAMAADQSRGFYKALRAKFGLIAEIKKASPSKGLIRPDFDPPALAAAYEAGGAACLSVLTDTPSFQGAPEFLTAARALPAL

Eryli EICAAKLDEVATRKALSDLADRIAAQSRGFEAALRAKFALIAEIKKASPSKGLIRADFHPADHARAYEAGGAACLSVLTDEDYFQGHADYLREARTLPAL

Mothe EILDYKRREVAAAREHPLLEKAAAAMPRDFGAALRRRLKVIAELKQASPSRGLIREDFDPEGQARSYIAAGAAAISVLTDQRFFRGRPEYLILVRTLPLL

Chloro RILETKAREVAELKKKPEYREACGDLPRDFRSAITSRINLIAEVKKASPSRGVLVEDFRPLDIAARYAELGASAFSVLTDSHYFQGSPDYLKAITSIPVL

Pelodi KILKEKRREVAAIKARPLYLELRSSLARGFQEALRGELRLIAEVKKASPSRGVIVHDFDPVRIALHYEEIGASALSVLTDRQFFQGSPDYLRAVSRLPVL

Metth DIIRSKKLEVKELMKTPLLRDD----IVSFPAAVGG-VSLICEYKRASPSMGRISE-RGLEEMMEVY-QDLADAVSIVTDGKYFKGSLDLLSGAT-KPLL

Metst RIIENKKVTLTQTKKKPLLKEEAIDIVFRFQEKLKN-TQLIAEYKPASPSKGNIST-LKAEDVIPIYDKNNVDMMSILTEETYFKSNLKNFNIANNKPLL

Clothe EIVAQKKIQLKKDMSITIWKQKIK--RLDFYGALKNNISIIAEVKKASPSKGIIKEDFDPLKIAKEYVESDIQAISVLTERNFFKGDEDYLVKIRPLPIL

Sulso RYLKGWLKDVVQLSLPSFSRQRP----ISLNERILEFTAIIAEYKRKSPSGLDVERDPIEYSKFMERYAVG---LSILTEEKYFNGSYETLRKIASIPIL

Thaps4 GYMWEKETQVDRLREISLLMSQCKAAMRDWISPVKQAFVIIPECKRMEPTSGSLRKRYDIPKLVKQLLSAGAPAISVNSDGVLFGGSMEDITIARSPPVL

Phatr4 GFMWDKETEVDRFREVPLLVSQCRLSQRGFVVPIQQMFVVIPECKRMEPTIGSLRRRYDLSKLARDFTFDGAVAISVNCDAVLFGGSLGDVTAARVPPIL

Thaps3 IERKRLAAALRRVYDPYNNPNAKQKTRERMLENG-VGSFIVDIKRK------DAGMVAEAMVRLGADVVFVNVDYHSYGGDLSELKSAVEAAVV

Phatr3 VSPMKLAKALRRVYEPSNNPDHVPLSDQTLQQVG--MGSFIVDIKRKSRPGEVFCNYDDAGMVAEAMVRLGADAVFVNTDYQAYGGDMTELKSAVSAAVV

Plber RIIENKKYEVTKLLENADSPLQIRLKYNKLSESLKIILSMVADIKRKCSTNKAFLNLTNPGEASLMLHKIGFDVLIVNIDSLSAQGTLNDLSDVIRPAVV

Plyoe RIIENKKYEVTKLLENADSPLQIRLKYNKLSESLKRILSMVADIKRKCSTNKNFLNLTNPGEASLMLHKIGFDVLIVNIDSLSTQGTLNDLSDVIRPAVV

Plafa KIIENKKYEITKLLENCDNPLQIRMKYNKLSESLKRSLSLIADMKRKCSIEKEFLNLSNPGNVSLLLHEIGFDVLIVNIDELSTQGNINDLKDIIRPAIV

Tann KMITDKYSEVDQLIEHKDDKLQLRLNYLKLSDMLSRNLCVIADMKRRTHNPRCVLSYTDAGEVALNMASVGFDVIFVNVDQKNYGGHINDLNKVFRPAIV

Theipa ------MLRRNLCVIADMKRRTHNPKCVLSYTDAGEVAMNMASVGFDVIFVNVDQINYGGHINDLNKVFRPAIV

Cme1 EPLAAQQEELVDWLPRKFVRDELSNREEAPSRDIRAKLSVVVDLKRRTSEPARLNMYAAAEILSDLRAEIPLDGIMVSVDSELYDQDCASLSDFAAPPII

Cme2 ELVWAKEREIEVRRQMPLMKSRLEKSD-DLNSVFWDS-VLFFQYQRC------PEDTESAPILANVAEEKNARAVVVNAESRRFYGSYEDIRRVHRCPVI

SSDLIVDEIQIARAADAGAKAVTVTHGVVVSQFIKNARVLGLETIVNVGTAEEAQSAVDVGISVTGVDGADNKFAVIEPEGRT----LAKDNKALEEVEE

NNDLIVDELQVARSAAYGCAALVLNLPTLTKVLLKATKAVDLEAIVAVSSKEEAQTAIDSGLSILHLGTVEDMVAAVDPEGQQ----LANNDQQLQEIED

AKDFITSRYQIAQAAASGADTVLLIVATTLKDLILYSRSLNMEPLVEVHALVELDVALEAGAKVLGVNNRNLHTFQLDLANTDKIAGTLCALSGMSSAHD

RKDFVINEYMIAEAAAKGADTVLLIVAVLLTRLIGFSRSLGMEPLVEVHADTELDVAIKAGAKVIGVNNRNLHTFQMDLGTSERVARTVCALSGMSTAMD

RKDFIIDVYQLLEARAYGADLAALIMN---VDVEFATHNLGMCALVEVNSVEELDIALAARSRLIGVNNRDLRSFKVDMSTTARVADTLFALSGIRSHAD

RKDFIIDVYQLLEARAYGADCVLLIVALLLNELIEVTHKLGMCALVEVNSVKELDIALAARARLIGVNNRDLRTFKVDLNTTARVADTLFALSGIRSHAD

RKDFIIDVYQLLEARAYGADCVLLIVTLLLIELIDATHNLGMCALVEVNSVQELDIALAAKARLIGVNNRDLRTFKVDMNTTARVADALFALSGIRSHTD

RKEFIFDEYQILEARLAGADTVLLIVKMLLHRLYNYSLQLGMEPLVEVQNAEEMTTAVKLGAKVIGVNNRNLESFEVDLNTTGRLRSIICALSGINTHDD

RKEFIFDEYQILEARLAGADTVLLIVKMLLERLYKYSLSLGMEPLVEVQNTEEMATAIKLGAKVIGVNNRNLESFEVDLGTTGRLRSFLCALSGINTHQD

RKEFVFDEYQILEARLAGADTILLIVKMLLTRLYHYSRSLGMEPLVEVNTPEEMKIAVQLGAEVIGVNNRDLTSFEVDLGTTSRLMDIVCALSGISGPKD

RKEFIFSKYQILEARLAGADTVLLIVKMLLKELYSYSKDLNMEPLVEVNSKEELQRALEIGAKVVGVNNRDLHSFNVDLNTTSNLVELLIALSGITTRDD

RKDFIIDPYEIMEARLNGADSVLLIVAMLLESLYKFSKSLGMEPLVEVNCAEEMKTAIELGAKVIGVNNRNLHSFEVDLSTTSKLAEILAALSGISSPAD

RKDFIVDTYQIAEARLAGADTVLLIVAMLLKELYDYSLSLGMEPLVEVNNPEEMTRAIRLGAKVVGVNNRNLHDFNVDMETTSRLADILCALSGISGRSD

RKDFILSRYQVLESRIWGADSILLIASMLLRDLYSYALELGMEPLVEVNNAQEMELALSLPAKVIGVNNRNLHDFKVDMNTTSRLSDVLCALSGIASRND

RKDFILDPLQLAEAIFFGANAVLLIVSVVLKFLIQEAHRLGLEVLTEIHDFSELELALEAEASIIGINHRNLKTFEIDLNLSESLKPITVAESGIHHPIQ

RKDFILHPLQLAEAIFFGAHAVLLIVSVVLKFLIQEAQRLGLEVLTEIHDLSELELALEAEAPIIGVNHRNLQTFEIDLNLSEILKPITVAESGIHHPTQ

RKDFILDPLQIEEAAVAGADAILLIVAILTKILLQKAHECGLEALVEVHNRQELDQAIEIGAEIIGVNNRNLTTFSVDPNNALKLKPISVAESGIHTVSD

RKDFIIDEIQIAEAAERGAAAVLLIVAAILRKLLEATHRHGLEALVEIHDEEELEIAKAAGAEIIGTNARDLRTFNVDLNRCFELIGIPVAESGIHDVMD

CKDFIIDKIQIDRAYEAGADIILLIVAALLKELYSYVREKGLEAIVEVHDEEELEIAIELNPHVIGINNRNLKTFEVDLGQTEKLGKLWISESGIHSKED

LKEFIVEAWQIYYGRSKGADAVLLIASVLIKYMIKICKILGMATLVEVHDEREMDRVLAIEVELIGINNRNLETFEVDLGITKKLL?LVVGESGLFTPED

CKEFVVDPWQIYYARTKGADAVLLIAAVLITFLLKICKKLSLAALVEVHDEREMGRVLGIEIELVGINNRSLETFEVDISNTKKLLAIVVGESGLFTPDD

CKEFVIDIWQIYYARSKGADAILLIAAVLIKYMLRICKNLGMTALIEVHDERELDRVLKIDVQLIGINNRSLETFKVDTSNTKTLLELVVGESGLFTPDD

CKEFIIDPYQIYLARTAGADAVLLIAAILLQNFLQVIHDLGMNALVEVHTLAELDRVLKLDLHLVGINNRNLEDFTVDLGITQQLLATVVSESGLYTPAD

CKEFIIQPYQIYQARVAGADAVLLIAAILLLYLRKVAISLGLTILVEVHDSNELKRVLDLEFPLVGINNRDLKTFNTDLRTTKEVVKLLVSESGLFNSAD

CKDFVIYPYQIYKARLLGADAVLLIAAILLRYFLKIAHSLGLNALVEVHSLPELERVLALDLRLVGINNRNLKTFVTDLAVTEHLAALLVSESGLFTGAD

CKDFILYPYQMFLARVRGADAVLLIAAILLRYFLRIAHGLGLTALVEVHTAEEMERVLALEVQLVGINNRNLMDFSVDLATTEKLLALVVSESGIHQRAD

RKDFMVDMYQVYEARTWGADCILLIVSALMAELEACAHELGMDVLVEVHGGEELDSALRLKTPLLGVNNRNLRTFEVSLDNTLDLLPLVVTESGILGPDD

RKDFMLDEYQVAEARLMGADCILIIVAAFMRKLESLAHALGMAVLVEVHDGAELELALELASPLIGINNRNLHTFETRLDTTLHLLDMAITESGVLTPDD

RKDFTIDPYQVYEARVLGADCILLIVAALLVDLSGLALQLGMDVLVEVHDIDELERAIQISAPLIGINNRNLSTFNVSLETTLTMKGLLVSESGILTSAD

RKDFIIDPYQVYEARVIGADCILLIVAALMLELAQLAEHLNMDVLIEVHDSEELERALVLDTPLIGINNRNLKSFETRLETTFELLSLVVTESGIHSPAD

RKDFLFDPYQVYEARAWGADAILIIMASVASALEATAFELGMDALIEVHDEAETQRALKLSSRLIGINNRNLRTFETSLETSERLATLLVSESGIFTHDD

RKDFLFDPYQVYEARSWGADCILIIMASVAKELEDTAFALGMDALIEVHDEAEMERALKLSSRLLGVNNRNLRSFEVNLAVSERLAKLLVGESGIFTHED

RKDFMFDTYQVHEARAWGADCILLIMASLAKRLEDEAFALGMDVLIEVHDAEETERALKLTSPLLGINNRNLRTFEVGLETSEKLAGLLVGESGIFTFED

RKDFMVDPWQCAEARSMGADAILIIVAALMHEIEAAALEHRMDALVEVHDEEEMSRAHALQSRLVGVNNRDLKTFTTDLATTERLAPLLVGESGIDNHAD

RKDFIIDPYQIYEARALGADAILLIVAALLVEYLDLARRLGLAALVEVHTAPELEVALRAGAGIIGINNRDLHTFKVDLQTTGRLRSVVVSESGIRSRAD

RKEFIIDESQIYETRLMGADAALLIVAALLRDYLQLFAELGLHALVEVHDRRELDIAIEQGSTIVGVNNRDLRDFTVDLMTSVNLKRLSVAESGLKRRDD

RKDFIVDESQIFESRLMGADAILLIVAALLRDYLQTASETGLDVLVEVHDRRELDTAAEEGATMIGVNNRNLKDFSVDPATSSDLRPIAVSESGLKTAGD

MKDFLVDEYKIYQARASGASSVLLITGVFLEAGIQKCRELSMEPLVECHTSLDIFRALEAGAEIIGVNNRDLETFEVDLERTHALAPILVSESGVRGPED

RKDFIIDEYMIYEAAVNNASGILLISGICIEEYLNISRNLGLDAVVECHTLEDIESVVDYNPEIIGINNRNLDDFTINLKTTKELKKYLISESGVKTIED

RKDFIIDLWQIYESRYIGADAILLIVSLLLKKFQIVANILGMQCLVEVHDERELERALESGARIIGINNRDLRTFEVDLKNTEKLMNVVVSESGIKDTED

MKDFIVKESQIDDAYNLGADTVLLIVKILLESLLEYARSYGMEPLIEINDENDLDIALRIGARFIGINSRDLETLEINKENQRKLISVKVAESGISERNE

ASDLLLYPYQLYKLRLAGADAVTMVVGALLLYLTKIASTINLQVIASVTSEVQIDMLSKLGISALVVSNRDLETFDFDNSGEQALSLLVEGRVGMVDGEG

ASDLILYPYQLYKLRLAGADAINLLVGALLSYLTKIASSLQLQSFATVTSEVQLLEVASLQIDGIIVSNRELEDFSFDMTGEQALYLLAEGRVGIIDRPQ

MKDIVVDEIQLGLAKDAGADGIVLISSVLLENFLNLATIIGLETIVECHTYNEVQAALDSLAQNIMVSNRDRITGQLLIKLAGMFPGITIAGGGISTPEQ

MKDIVVDEIQLGLAKEAGADGIVLMSSVLLENFLNLATMIGLETIVECHTHDEVQRAIDILAPNILVNNYDRVAQELHIKLAGMFPGICLAAGEIETTDQ

VDDVIIHPIQIALAVENKADGVILNLSYLLEDMLNYCVNLGTQAIVEVHDYNDIYYATQCGSYILMINEFDFINNRYEIKAISYSIPITIAKINVSDINY

VDDVIIHPIQIALAVENKADGVILNLSYLLEDMLNYCVNLGTQAIVEVHDYNDIYYATQCGSYILMINEFDFINNRYEIKAISYSIPITIAKINVSDINY

VDDIIIHPIQIALAVENQADGVILNLSYLMEEMLQYCVNVGTQAIVEVHDYNDIYFATQCGAYILMINEYDFYNNIYEIKAINYTIPITIAKVNTNEVNY

MKDIILHPIQIAQAVEMRADGVILNAAILLKDLLTSCITMGIEAIVEVHTTADALRSAEIGFTNFMINQWDRIKNILYLEIKEVLPETTIAAGGIMTMEQ

MKDIILHPIQIAQAVEMRADGVILNAAILLQDLLTACVTMGIEAIVEVHTTADALRSADIGFNHFMINQWDRIKNKLYLEIKEALPDITIAAGGIMTMEQ

YKDIIVHELQLAEAAEAGAKAACLVAGACLEKLLNAAYILGMEVLVEVHTEEELQYAFEAGAGIYVFTDVDRARRRVVVDLMDKYRVVCLGSGRIETLEQ

CKDIFVYPWQLYDARFYQADACVLIMKVLTLYFAKASVALGMAPIIEVHDAAELDALLTAKSDDGRISILLLAKRDLD-SLVPMLSDLLERFAPEARRLG

AWICRDKGFNSVWVSDALYKGNDPVEHP

AWSVRDQGFNSVWVGEALYKGADPNEQP

VGRYRSVGVGMCLIGESLMRSDPSLAIK

VDRYRQVGLGMCLIGESLMRTDPGQAIA

VVKYEKCGARGILVGEYLMKGDVATTVK

VVKYEKCGARGILVGEFLMKGDVATTVK

VVKYEKCGARGILVGEYLMKGDIATTVK

VLMNKRDGVNAVLVGEAIMKPDASVFIS

VLDCKRDGVNGILVGEAIMRPDATQFIR

VEAYKKEGVKAILVGEALMRSNTSAFVA

AEKYKKEGVHGFLVGEALMKTDVKKFIH

VAHYSSQGVSAVLVGESLMRSDPAAFAR

VAKYLKEGVGAVLVGEALMRKDKRAFIH

VERYKGQGVGAILVGESLMKKDVGEYIK

AKRMRELGFDAVLVGEALVRKDPALLIK

AKRMRELGFDAILVGEALVSKDPSLLIK

AKRYISAGYNAVLVGEALVKENPQQFIS

AWRLRDAGYSSILVGEALIRAESSKLPS

IIRVKKAGAKGVLVGEALMTSSIRTFFE

IAFVQEAGVKAVLVGESLIKSDPGKAIS

IAYVQAAGVKAVLVGESIVKNDPEKGIA

VAYVQNAGVSAILVGESLVKENPGQAIA

LSLVAEAGVRAVLVGESLVKSDVEQAVR

LEEVSSYGAKAVLVGESLMRPDIGLALK

LDRVTQAGAQAVLIGESLVKPDPGLALQ

VERMAQAGAQAILVGESLMRPAAMAHFS

VKRMRDADVHAFLVGEAFMRPEPGVELA

VALMRGHGVGIFLVGEAFMRPDPGTELA

VQRLRAAGVNAFLVGEAFMRTEPGESLR

VRKMKKNGVNAFLIGEAFMKEDPAAKLA

CLRLQKKDIGTFLVGESLMREDVTAATR

CLRLEKSGIGTFLIGESLMRHDVAAATR

CQRLEKSGISTFLVGESLMRDDVTAATK

CVRLSQAGVQTFLVGESLMRDDVEAATR

AALVAGAGVDAILVGEALMTGDVMAKMA

VLLMQDAGFDAVLIGEGLLAEELRQFSW

IAHLRDAGFHAVLIGEGLQTKELTGLTW

AEILAGYGADALLIGTAPMSQNPRELLE

AKKLKEYGADGILIGTSILQNTEKEIEK

LKYLKELGVDAVLIGETFMRRSISEKIR

IEELRKLGVNAFLIGSSLMRPEKIKEFI

VKTLKDAGAFGAIVGGGLAFSDDEASNV

ITELREAGAVGAIMGGALAVG------

MKKHLAVGYDGVVVGKTVMGARAPEFIR

MKRHLAVGYDGVVVGKAVMGPAAPEFIR

VEKLGSLGYDSICLEKKLIDEDLEVFVQ

VEKLGSIGYDSICLEKKLIDEDLEVFVQ

IEKLGSLGYDSICLEKKLIDDDLQQFVT

VHMLALAGIDAVCFGRRLVYPDVPDFIN

VHMLALAGIDGVCFGRRLVCPDVPDFIN

VEQLRDVGADGIVIGGKLMAIARGDVEA

IVELPREHAPNASLLRGLREGADAVLAS

Indole-3-glycerol phosphate synthase nucleotide sequence alignment

52 686

Thaps1 TTGCAAAAGATTACGGCTGCCGACGTGGAGGCATACACGTCGTCATCCTCCTCGATCGAAACAGCTGCACACAGACTGCACGGGCCTCTTCCATTGCAGC

Thaps2 GGTGTCTTTGTACGCAAGGCCGACGTGCGCAAATACTGTGAGGAGGGACCCTCGCTCCTGAAACAAATCCCCGACGAGGCTGCTGCAACAACAACGGTGG

Thaps3 GCCCAAGAGGGTATTGAACGCCTCGCTGCCGCATTACGTCGAGTCTACGATGATCCTTCAAATCCCAATGCCAAGCAAAAGACATTGGAGAAGGAAGAAC

Thaps4 TTGGAGGGATACATGTGGGAGCAGGTGGATCGTCTGAGGGAACGTATTTCGTTGATGAGTCAGTGTAAAGTGGATCCAGCAAAACCTCGCGACTGGATTA

Phatr1 CTTCAAACGATTACAGCAACGGACTATCAATCCCTTTTGGAAAGTAGCACGTCTGCCCAAGAAGCAAAAGCCTTTGATCATGGAATATTGAATTTGCAGA

Phatr2 GGTGCGCTCGTAAAGAAAGCCGAACTCCGGAAATACATTGAAAGTGGCGTTGAAAAGTACAATCTACTCCTGGCAACGATCAACGCCGATCAAGTTGTGG

Phatr3 GCACAGGAATCGGTCTCGCCGCTCGCCAAAGCCTTACGAAGAGTTTACGAGGATCCAGCCAATCCCGACCATGTTCCTCTATCTAAGAAGCGCGCTCAGA

Phatr4 CTGGAAGGATTTATGTGGGACGAAGTTGACCGCTTCCGAGAGCGGGTACCTCTGGTATCTCAGTGCAGAATGGATCCAAAAGCTCCTCGAGGCTTCGTTG

CME1 CTCGAAAAGGACGTCATGCGC------GAGTACGTGCGCGATGAACTGAGCCCCCTGTCGTTTCGATTAAGCTACATGTCATCGCGAGATATCCGTG

CME2 CTCGAGGAGCTCGTTTGGGCCGAAATTGAAGTGCGTCGGCAGCGCATGCCTCTGAAAAGCCGCTTAGAGGTCGCTCCGGTGCGC------CTCA

CME3 CTCAAAGAAATACTACGGCGAGAAGTTGCCAGTCTGAAGGCTGAGGTCGCCCAGCCGCTGCAGCGTCGCCTCGATGCAGCCGCGTCACGTCAGTTTAGCA

Pram CTGGAGGAGATCGCGGCGCAGGACGTGGAGGCCGCCAAGCAGGTCGTGACGCCCGTGAAGAAGATCGAAGAGAGCGTCTACGGCGCCCTGCGCGTGCTCG

Psoj CTGGAGGAGATCGCGGCGCAGGACGTGGAGGCCGCCAAGCAGGTCGTGTCGGCGACCAAGAAGATCGAGGAGGCCGTCTACGGCGCGCTGCCCGTGCTCG

PparOK CTCGAGGAGATCGCGGCGCAGGACGTGGCGGCCGCCAAGCAGGTCGTGTCAACTGCCAAGAAGATCGAGGAGAGCGTCTACGGCGCCCTGCCCGTGCTCG

AgabioOK TTGGACAAAATCTACAAACAGGACGTTGACCAAGCGCAGGCAACACCAGGGTCTGATCTTCAGACTCTCTTGAATATCTCCCCACAGATCGACTTCACTG

Ustma CTCGAGAGAATTCATGTGCAGGACATTGCTGCCACCAAGGCCATTCCAGGCTTCGATCTTGATATAGCTCTGCACCTCGCACCTCTGATCAACTTCCCAG

Gize CTGCAGCGGATTTACGCAAACGCAGTAGACGCTCAGAAGCAGATTCCTTCTCAGGACCTCCAGGTCGCTCTGAATGCCGCCCCTCAGATTCCTCTTGTTA

Schipo CTTGAGAAAATCCACGCGCAGGACATTGCTGAAAGTAAAAGAAAGCCTGGTCTTGATTTGCAAACATATCTTAACATAGCACCTTGTATAAATTTTTATG

Asory TTAGAGAAGATATATGATCACGCCGTTGCCATCCAGAAGACTATTCCTTCACAAGATCTTCAGGCTGCTCTCAACATTGCTCCTCAAGTCTCATTCCCTG

Sachce TTGGACCGTATCTATGCTCGGGACGTCAATGAGCAGTCTAAAATCCCAGGTTTCGACTTACAATCTAACTTAGGTCTTGCCCCATTACAGGATTTCTACA

Neucr CTTCAAAAGATTTACGCCCACGCTGTGGATGCTCAGAAGCAGATTCCTTCCCTGGACCTCCAAGCCGCTCTGAGCATCGCCCCTCAAATCTCTCTTGTCG

Arab1 TTGGAAGAGATCGTATGGCACGAAGTTGCTCAGATGAAAGAGAGAAAGCCTCTTAGCTTGAAGAAGGCTCTTGATAATGTTCCTGCTAAAGACTTCATTG

Arab2 TTGGAGGAGATCACATGGTACGAAGTTTCCCGGATGAAGGAGCTAAATCCGCTTGTGCTGAAGAAAGCTGTAGAGGATGCTCCTACTAGGGATTTTGTTG

Oryz CTCGAGCAGATCATCTGGGACGAGGTGTCCCAGAGGAAGGCGAAGAAGCCGCTGAAGGTGATCGAGTCGAGCCAGCACGCGCCAGCCAGGGACTTCGTCG

Coxbur TTAAAAGCAATTCTTAAAAATGAAATTGCGCATTTGAAAGCGAATTTCTCTTCCTCTCTTTCA------AAAAAATCCTTTAAAA

Chlafe TTAGCAAACATCGTTGCTTATGAAGTAGAACGGCTCAAAGAAGAAGTTTGCACATATCTGAGTCAAATTCTAAACCAAAATCATAGAGAGCATTTTGCCA

Chlaca TTAACAAACATTATTGCTTATGAAGTAGAACGGCTCAAAGAAGAAGTTAGCTCATATCTGAGTCAAATTCTAAAGCAAAATCACAAAGAGCAGTTTGCCA

ChloteOK CTG---CCTCAACTCCTCGCTACCCTCGCCGATGAGCACAGCGTCAAAGCCGGCCATGAGCAGCACATCTTTCAGGCCGCTTTCGGAGAGCACGCCTTCA

PelluOK CTTGCAAAGATCCTGAAAGAGGAGGTCGCGGCCATAAAGGCTGAGCGCCCTCTTCGTTACCTTGAGCTTCGCAGTTCGCTTGCTGCCAGAGGCTTTCAGG

Nitoc TTGAAAAAGATTCTCCAGCGGGAGATCAGCGAACGTGCCCAAAAGCTTTCAATCGAACTTAGCCAGAAAGTGGAAGAATCGCCCGTGCGCGGTTTCTTTA

MoortOK CTGGCGGAGATCCTGGACTACGAAGTGGCGGCCGCCAGGGAACAACACCCCCTGGACCTGGAGAAGGCAGCGGCAGCCATGCCCACCCGGGACTTCGGGG

Clothe CTGGATGAAATTGTAGCGCAACAGCTTAAAAAAGATATGAGCAGGATTACCATA---TGGAAGCAAAAAATTAAAAGACCGGGACCTCTGGACTTTTACG

Xylfa CTCACTAAGATTATTGCGTGGGAGATTGCCGAGCGTCTTTTGCATGTCTCACAGGAATTGGTTGCGCGTTGTGCCGATTTGCCGCCCCGTGGTTTTGCTG

Bacce TTAGATAAAATTGTAGAACAGGAAGTTGCGGAGTTATATGAAATATATACACCA------GTAAAAACAAAAAGAACACATTCACTTGTAG

Ralme CTGGAAAAGATCCTGGCCGTGGAGGTGGCAGCCGCGCGCAAGAAGCGCGACCTGAGCCTGCGTGCCGAGGCCGAGAGCCTGCGTCCGCGTGGCTTCGAGC

Nitros CTCGGTCCCGGGATCGGGAGCGGCTTCGCCCACGAGGAAGATGCCTACCCCGTGCATCAGCGCAACATCGAGTACCCCGCTTTCGGCCATACGGTCTGGG

Agrob CTCAAAAAGATCGAGACCTACGAAATCGCCGCCGCCAAGGCCAAGGTTTCGCTGGATCTGAAGGCCATGGCCGCAGACCAGAGCCCGCGCGGTTTCTATA

Meslo CTGCGCAAGATCGAGGCCTACGAGATCGCCGCCGCCAAGGCACACGTGCCGCTGGAGATCAAGGCGCGGGCGAAGGACGCCGACCCACGCGGCTTTCTTG

Brume CTTCGCAAGATCGAAGCCTATGAAATAGCCGCCGCCAAGGCGCGTCTTGCCCTTGAACTGAAGGCCCGGACGAGGGATCAGTCACCGCGCGGTTTTCTGA

Eryli CTTGAAGAGATATGCGCCGCCGAAGTCGCGACGCGCAAGGCTGCGCTGTCGGATGAATTGGCCGATCGCATCGCAGCCCAATCCCCGCGCGGGTTCGAAG

Proma CTTGAAAAAATCTTGTGGGAGGAAGTAAAAGTTTCAAGAGAGAGAGTTCCTCTTGAATTAAAGGCTCAGATCAATAATTTGCCTACTAAAGATTTCTTAG

Nospu CTTGAAGAGATTGTGTTGCATGAAGTTGCACAGATGCAGCAGGAACTTCCTCTATCTTTGCAGCAGCAGTTAATTACGGCTCCAGTCCGAAATTTCTTAG

Synel CTCGAAGAGATTGTTTGGTATGAGGTGAATGCTTGGCGAGAGCAGCTGCCGCTGCAGCTCCAGAACCAAGTCCGCGGCTTGACCCCCCGTGACTTTTTGG

Theel CTTGAGAAGATTGTTTGGCACGAGGTGGAGCAACTGCGGGAAGTCCTGCCCTTGGATTTACAGCGTCAGGTGCTGGAGGCGCCAGTGCGTTCTTTTTTGG

Sulso ATGCCACGTTATCTTAAAGGAGACGTCGTACAATTATCTTTAAGGAGGCCCTCA---TTTAGGGCTTCA---AGACAAAGGCCAATTATTTCCTTAAACG

Metth CTCAGGGATATTATAAGGTCAGAAGTTAAGGAACTCATGAAAAAAACACCCCTCATCCTAAGGGATGAT---ATAACAGTCATGCCAGTGAGCTTCCCAG

Mesta ATAGATAGAATAATAGAAAATACACTCACACAAACTAAAAAACAAAAACCCCTAAAACTTAAAGAAGAGATAGTTTCTACAAAATTATTTAGATTTCAAG

Plabe ATAAACAGAATTATTGAAAATGAAGTAACTAAGTTATTAGAAGAAAATGCTGATCCATTGCAAATAAGGTTAAAGTATCTCCAGAATAATAAATTATCTG

Thean TTTTATAAAATGATTACGGACGAAGTAGATCAACTAATAGAACAACATAAAGACAAACTTCAGTTGAGGTTAAATTATTTGCAAAACTTGAAGCTTTCCG

Playo ATAAACAGGATTATTGAAAATGAAGTAACTAAGTTATTAGAAGAAAATGCCGATCCATTGCAAATACGATTAAAATATCTTCAGAATAATAAATTATCTG

Thepa ------

Plafa ATAAATAAGATTATTGAAAATGAAATTACGAAATTATTAGAAGAAAATTGTGATCCATTACAAATAAGAATGAAATATTTACAAAATAATAAATTATCCG

ATCAAATCCAACAAGCACTTGCGGCGGAGTTCAAGCGCGCCTCTCCCTCCAAAGGAGATATTGCACCCCATTTGAACGCCGGCGAACAAGCGAGCGTGTA

GTGAGGTGCAGGGTTCAATCATTGCCGAGTACAAACGTAAATTGGAAGGAAGTGGCTTTCTCTCTGAAATCCTCCCTCCAGAGATTTTATCTCCCGTCTT

GCATGTTGGAGAATTCGTTTATTGTGGACATTAAACGAAAGTCCCCAGGA---GAGCAGTTTGCCCGATACGATGATGCAGGAATGGTAGCTGAGGCGAT

GTCCTGTGAAGCAAGTCATCATTCCGGAGTGCAAAAGAATG---CCGACTTCTGGCAGCTTACGCAAACGATACGACATTCCAAAGCTTGTGAAACAACT

CTGTCATCCAATCCGCGTTGGCGGCAGAGTTTAAACGGGCGTCGCCGAGCAAGGGTGACATTGCTACCCATCTGAATGCCGGTGAGCAAGCCGTCAAGTA

GTCCCTTGCAAGAAACGGTGATTGCCGAATACAAGCGCAAGTTCAATCCAACCGGACTTATTCACGAAATGCATGTACCGGAATTGCTCTCCCCCTCCTT

CACTGCAACAAGTTAGTTTCATTGTCGACATTAAGCGCAAGAGTCCAGGC---GAAGTTTTTTGTAATTATGATGATGCTGGTATGGTAGCAGAGGCTAT

TACCAATCCAGCAAGTTGTTATTCCAGAATGCAAGCGAATG---CCCACGATCGGGAGTTTGCGACGTCGCTATGATTTGAGTAAGCTTGCTCGCGATTT

CCAAGTTGCGAAGATCCGTGGTGGTCGACCTCAAGCGACGTTCTCCTGCGTACGCCGCTGCCGAGATACTTTCGGATCTAAGAGCGGAAATACCACTTGG

ACTCGGTTTTCTGGGTGCTCTTTTTCCAGTACCAGCGCTGC---CCGGAAGAC------ACAGAGTCTGCACCTATTCTGGCAAATGTGGCTGAAGAGAA

GCGCAATTCGTAAATCTGTTATTGCGGAGATTAAGCGACGGTCACCGAGCTCTGGTTTAATTGCCGAAATCCCGGACGTGCGACAGCTCAGTGACCTGTA

ACCGCCTCAACGCGGCGCTGGCAGCCGAGTTCAAGCGCGCCAGCCCCAGCAAGGGAGATATCGCCACGGGACTCAACCTGCGCGAGCAAGTCAAGTCGTA

ACCGTCTGAACGCGGCGCTGGCCGCCGAGTTCAAGCGCGCGAGCCCCAGCAAGGGAGACATCGCCACGGAGCTCAACCTCCAGGAACAAGTGAAGGCCTA

AGCGTCTTAACGCG------GAGCAGGTTCAGGCCTA

TACGCATCAGGCAAGCCATGTTCGCTGAGATAAAGCGCGCATCGCCCTCGAAAGGCCCTATTGCATTAAATATCAATCCTGCCGCACAGGCTCTCAAATA

AGCGTCTCCAGCGAGGAGTCATGGCCGAGATGAAGCGTGCCAGTCCCAGTAAGGGCGACATCGATCCCACCGCCCACGCTGGCGCCCAAGCGCTTGCATA

GCCGGCTACGGGAGGCTCTCATGGCTGAAATCAAGCGGGGTTCTCCTTCAAAGGGTATATTTGCTCTTGATATCTCTGCTCCTGCCCAGGCTCGAAAGTA

AGAGATTAAAGCAAGCTTTAATGGCTGAAGTCAAACGTGCTTCCCCTTCTAAAGGTGACATTAAGTTAGACGCAAATGCTGCTATCCAAGCTCTTACTTA

CCCGTCTGAGACAATCTCTTATGGCTGAAATAAAGCGCGCTTCCCCTTCCAAGGGAATGATTGCGGAGAACGCATGTGCCCCTGCCCAGGCTAGAGAGTA

CGGTGTTGTCATCAGTTGTTCTTGCTGAAGTCAAGCGTGCCTCTCCATCGAAGGGACCCATTTGTTTAAAAGCTGTTGCTGCTGAACAGGCTCTCAAATA

ACCGTCTTCGCAATGCTCTTTGCGCCGAGATCAAGAGGGCATCTCCCTCCAAGGGTGTCTTTGCGCTTGATATTGACGCTCCGTCGCAAGCTCGCAAGTA

GTGCTCTTAGATCGGGTTTAATAGCTGAGGTTAAGAAAGCTTCTCCAAGTAGAGGAATCCTGAGAGAGGATTTTAACCCTGTTGAAATTGCACAAGCTTA

GGGCTCTTAGGATGGGCTTGATAGCTGAGGTTAAGAAGGCTTCTCCAAGTAGAGGAATCTTAAAAGAGAATTTTGACCCGGTCGAGATTGCTCAAGCTTA

GCGCGCTAACGGCGGCGCTGATCGCCGAGGTGAAGAAGGCGTCGCCGAGCAGAGGCGTGCTCCGGGAGGACTTCAATCCGGTTGAGATCGCGCAATCCTA

GAATAATTTCTTCTACTATCATTGCCGAAATTAAACGGCGTTCTCCTTCAAAAGGACATTTAGCCGAAATTGCTGATCCAGTCGCCTTGGCAAAGCAATA

CAGCCCTAAAAGGATCCATTATTGGCGAAATAAAAAGACAATCTCCAACACGTGGAAAGATTGGATGCATAGATAATCCAGCAGATCTTGCATTAAAATA

CAGCTCTAAAGGAATCCATTATTGGTGAAATAAAAAGACAGTCTCCAACACGTGGAAAAATTAGAAGTATAGATAGTCCAGCAGATCTTGCATTAAAATA

GGATACTCCCGCTTGGTCATGAGATCGACGGTGAAGTCGCG------CAGATCGCGGTTGTTGACCCCGACAATCGTAGAACCCTGCTCGATGGCAATAT

AAGCGCTTCGGGGACGCCTCATTGCTGAAGTCAAGAAAGCCTCTCCCTCAAGGGGAGTCATTGTCCATGATTTCGATCCGGTCCGGATCGCCCTCCACTA

AAGCCCTTGCTAAAGCTGTCATCGCTGAAATCAAAAAGGCCTCTCCTAGTAAGGGGCTTTTATGCGAATCTTTTCATCCGGAGGCGATTGCCCAAAGCTA

CCGCCCTGCGGCGCAAGGTTATAGCTGAGCTAAAACAGGCTTCTCCCTCCCGGGGCCTTATCAGGGAGGATTTTGACCCTGAAGGTCAGGCCAGGAGTTA

GAGCTTTAAAGAACTCCATTATTGCCGAGGTAAAGAAGGCTTCACCGTCAAAAGGGATTATAAAGGAGGACTTTGATCCGCTCAAAATTGCAAAAGAATA

GTGCGTTGCAGGCGGCAGTGATTGCTGAGATTAAGAAGGCCAGTCCTTCAAAAGGGGTGCTTCGCGAGGATTTTCGTCCTGCAGAGATCGCTATCTCATA

AAGCGTTACAGCAG---GTCATTGCAGAAGTAAAGCGAGCATCACCATCAAAAGGAGATATCAATTTACACGTTGATGTACGAAAACAAGTGAAAACATA

GCGCACTGCGCGACGGCGTGATCGCCGAGGTCAAGAAAGCCTCGCCGTCGAAGGGCGTGCTGCGCGAGAACTTCGTGCCGGAGGCCATCGCCGAAAGCTA

GGAATGCGGTCTAGCGTATCCAGGCGGGTTTCAAACGTATG------CAGGTTGCGGTTATTGATGCCGATCAGGGGAGACGCCAACTCCAGCGCCAGCT

AGGCGCTGCGCGCAGGCCTCATCGCCGAAATCAAGAAGGCCAGCCCCTCCAAGGGCCTCATCCGCCCGGATTTCGACCCGCCGGCGCTGGCCGCTGCCTA

CAGCCCTGGAGGCAGCCCTGATCGCCGAAATCAAGAAGGCGAGCCCCTCCAAAGGCCTGATCCGTGCCGATTTCGATCCGCCGGCACTGGCGCAGGCCTA

AGGCACTTGAGGCCGCACTGATCGCGGAGATAAAGAAAGCAAGCCCGTCCAAAGGCTTGATCCGCCCCGATTTCGACCCGCCAGCGCTTGCAAAGGCTTA

CCGCTCTGCGTGCCGCACTGATTGCCGAGATCAAGAAAGCCTCACCGTCGAAGGGTCTCATCCGCGCAGACTTTCATCCCGCAGACCACGCCCGCGCCTA

GGGCTTTGAGACAAGCTGTTATAGCAGAAATAAAGAAAGCAAGTCCTAGCAAGGGAGTGATTCGCGAAAACTTTGATCCAATAGAAATAGCACTTGCTTA

CTGCTTTACAACAAAGCTTAATTGCTGAGGTAAAAAAGGCATCACCTAGCCGTGGGATTATTCGGGCGGATTTTGATCCAGTTGCGATCGCTCAAGCTTA

CAGCACTGCGACAGGCCGTGATTGCCGAAGTCAAAAAAGCATCTCCTAGCAAAGGCGTACTGCGAGAAGATTTCGATCCCGTGGCGATCGCCCAAGCCTA

CGGCTGTGCAAAACGCGCTGATTGCTGAGGTCAAGAAGGCCTCCCCCAGTAAAGGCATTATTCGCCCCAATTTCGACCCGGTGGCGATCGCCCAAGCCTA

AAAGAATTTTAGAAGCTATAATAGCCGAATATAAACGCAAATCTCCCTCT---GGA---TTAGATGTTGAAAGGGATCCAATAGAATATTCAAAATTCAT

CTGCAGTGGGAGGATCTCTCATATGCGAATACAAGAGGGCATCACCATCAATGGGCAGAATATCAGAG---AGAGGCCTTGAAGAGATGATGGAGGTATA

AAAAACTAAAAAATCAACTCATTGCAGAATACAAACCTGCAAGTCCATCAAAAGGAAATATCAGTACA---TTAAAAGCAGAAGATGTTATTCCAATATA

AAAGTTTAAAAATATCAATGGTAGCAGATATAAAAAGAAAGAGTAGTAAACAAAATAATTTTCTTAATTTAACAAATCCTGGTGAAGCAAGTTTAATGCT

ATATGCTTAGTAGATGCGTAATAGCTGACATGAAGAGAAGAACACCAAGGGATAATAACGTACTTTCCTATACGGACGCGGGTGAAGTGGCCCTGAATAT

AAAGTTTAAAAAGATCAATGGTAGCAGATATAAAAAGAAAGAGTAGTAAACAAAATAATTTTCTTAATTTAACAAACCCTGGTGAAGCAAGTTTAATGTT

--ATGCTTAGGAGATGTGTAATTGCTGATATGAAGAGAAGAACACCAAAGGATAATAACGTACTTTCCTACACCGACGCGGGTGAAGTGGCCATGAATAT

AAAGTTTAAAAAGATCATTAATTGCTGATATGAAGAGGAAAAGCGAAAAACAACATAATTTTTTAAATTTAAGTAACCCAGGAAATGTCAGTTTGTTATT

TTACAAAGCGGGCGCTAGTGTGATTAGTGTCTTGACGGAGGGGAGGTGGTTTAAGGGCAGTTTGGCCGATTTGAGGGAGGCTAGACTGAGTACGGGTGAT

TCGCGAGTTTGGTGCTACTGCTGTTGCCGTTCTAGCTGATGAGCGTACGGGTGGATGTACCTACGATGACATCGTCGAGATGGTGGGTGAAGTCCTGCCC

GGTACGATTGGGTGCCGACGTGGTGTTTGTGAATGTGGATTATCATTCATATGGAGGAGATCTGTCCGAGTTGAAATCAGCAGTGCGAGGTGTGGAGGCT

CCTCTCTGCAGGAGCTCCTGCTATCTCTGTCAACAGTGATGGAGTCCTCTTCGGTGGATCAATGGAAGACATTACAATAGCGAGGGAAGCATTACCTCCC

TACCAAAGCAGGGGCCAATATCATTTCTGTATTGACCGAGTCCCACTGGTTTAAAGGGAGTCTGGACGACATGACACAAGCCAGACTCGAAACAGTTTCC

TCGCGAGTACGGTGCGTCGGCCATTGCCGTCATGGCCGATCCACGCATGGGGGGTTGCGACTACGATGACATCCGACACTTTGTGATGGAAGTCTTGCCG

GGTACGCTTGGGAGCCGACGCCGTCTTTGTAAACACCGACTATCAGGCCTACGGCGGTGACATGACGGAATTGAAATCGGCTGTCCGCGCCGTTTCGGCG

CACTTTTGACGGCGCTGTAGCCATTAGTGTGAATTGCGATGCAGTCCTCTTTGGCGGGTCTCTGGGCGACGTCACTGCAGCACGTGAAGCTGCTCCCCCA

AAGCAAATTTGGACTCGACGGGATTATGGTTAGCGTCGATTCGGAACTCTACGATCAAGACTGTGCAAGTCTATCAGACTTTGCGAGGTATTTTGCACCC

G------AACGATGCCAGAGCAGTCGTGGTGAATGCCGAGTCGCGCCGCTTCTACGGCAGCTACGAGGATATTCGGAGGGTTCATGAGGTCGTCTGTCCG

CTACAACGGAGGCGCAGCCGCAATCTCGGTACTAACAGAT---GCTGCCTTCGATGGAACACTGGATGATCTGCAAACCGTGGTGGGAAGATATTGTCCG

TGCGGACGCGGGCGCCAGCATGATCTCCGTGCTGACGGAGCCCAAGTGGTTCAAGGGGTCGTTGGAGGACATGATGGCGGCCAGAGAAGTGGTGATGAGC

CGCGGACGCAGGCGCCAGCATGATCTCCGTGCTGACGGAGCCCAAGTGGTTCAAGGGCTCGCTTGAGGACATGCAGGCGGCCAGGGACGTCGTGATGAGC

CGCTAACGCAGGCGCCAGTATGATCTCCGTGCTGACGGAGCCTAAGTGGTTCAAGGGCTCACTAGACGACATGATGGAAGCTCGCGAGGTGGTTATGAGT

TGCTCTGGCTGGAGCCCACACAATCTCTGTCTTGACTGAACCTAAATGGTTCCTTGGATCACTTCATGACATGCTGCATGCCCGTCAAGCAGTTCTCCCC

CGCTCGCGGAGGTGCCAGCGTTATCAGTGTTCTCACTGAGCCCAAGTGGTTCAAGGGTACGATGCATGACCTATCGCTCGCAAGACGAGCGGTCCTACCG

CGCCTTGGCGGGTGCGAGTGTCATTTCTGTTCTGACAGAGCCTGAGTGGTTCAAGGGCAGCATTGAAGATTTGAGGGCAGTCCGACAAGTTTTAATGCCT

TGCTCAGGTTGGCGCCTCAGTCATATCTGTGTTAACAGAACCAAAATGGTTCAAAGGCTCATTGAATGATTTGTTTGTTGCGCGAAAAGCTGTTGTAGCT

CGCTAAGGCCGGCGCCAGCGTTATTTCTGTCCTCACTGAGCCAGAATGGTTCAAAGGTAGCATCGACGATTTGCGTGCCGTTCGTCAGAGCTTAGTTACA

CGCAGAGGCTGGTGCATCCGCAATTTCCGTATTGACCGAACCTCATTGGTTTCACGGTTCGTTACAGGATTTAGTAAATGTGAGGAAAATCCTATTTCCT

TGCGCTTGCCGGCGCGAGTGTCATCTCGGTCCTGACCGAGCCAGAGTGGTTCAAGGGCAGCATCGATGACCTCCGTGCTGTCCGTCAGGTCCTTATGCCC

TGAGAAGGGGGGAGCAGCATGTCTTAGTGTTTTGACAGATGACAAATACTTCAAGGGAAGCTATGAGAACCTGCAAGCTATAATGGCTGGTGTGTGCCCT

TGAAAAAGGCGGAGCAGCATGCCTCAGCGTTTTGACAGACCAGAAGTATTTCCAGGGAGGCTTTGAAAACTTGGAAGCAATAAGGGCTGGTGTGTGTCCA

TGAGAAGAACGGCGCGGCTTGCCTCAGTATCCTGACCGACGAAAAACACTTTCAGGGTAGCTTTGAGAATCTCGAGACCGTTCGCTCCGGAGTATGCCCT

TGTTCAAGGAGGTGCTGCGGGAGTATCAGTCCTTACTGATAAACTTGCCTTTGATGGTTCCATTCGTGATTTACAGCAGGTCAGCGACAGGCCTGTAGCT

TTGTTGCGGGGGGGCTGCTGCTATTTCAGTTCTTACTGACACCCGAGGCTTCGGTGGATCTTTTCTAGATATGCAACAGGTAAGCTCACAATACGTTTCT

TTGTTGTGGAGGGGCCTCTGCTATTTCAGTTCTTACTGATACTCAAGGCTTTGGCGGATCTTTTTTAGATATGCAACAGGTAAATTCACAATACGTTTCT

CAAGTTCACGGCGATCGTGCACTTCGACCAGCGC---GTGCAGGCCAAGTTCAGCGAAAAGCTGGAGGTAGTCGCGGAGTTGCGACGGTTC------

CGAGGAGATCGGCGCTTCTGCACTGTCGGTTTTGACCGACAGGCAGTTCTTCCAGGGCTCCCCGGACTATCTCCGGGCGGTCTCGGCTGCTGTACTTCCC

CGAAAAAGGAGGAGCAGCTTGCCTTTCCGTGCTCACAGACCAGGATTTTTTTAGGGGTAGTGAGGCGGATCTACAGAAGGCCCGCAGCGCCTGTCTGCCT

TATAGCCGCCGGGGCGGCGGCCATCTCCGTCCTGACGGACCAGAGGTTTTTCCGGGGTCGCCCTGAGTACCTGATCCTGGTGCGCCGGGTAACCCTGCCC

CGTTGAGTCGGATATTCAGGCTATATCGGTGCTTACGGAAAGGAACTTTTTCAAGGGGGATGAAGACTACCTTGTAAAGATTCGTCAGTTCTGTCTTCCT

TGAGCTTGGTGGTGCTAGCTGTCTATCAGTGCTGACGGATGTGCATTTCTTCAAGGGTCACGACGATTATTTGAGTCAGGCGCGTGATGCTTGCCTTCCG

CGAAGAATGCGGCGCAGGAGCAGTTTCTGTTTTAACAGACGGTCAATTTTTTAAAGGATCTTTTTATGATTTACAAACAGCACGAGAAGAAAGTATTCCG

CGCCGCCCATGGCGCGGCGTGCCTGTCCGTGCTGACCGACGTGAATTTCTTCCAGGGTCATGCCGAGTACCTGAAGCGCGCGCGTGGCGCCTGCCTGCCG

CGAGTTCCGCCCCATCATGAACCTCCACCAGAACGGCCATCCCGAGCGCATGAGCCAACGACTCCAGCTTGCGCATTTCCACTACCGCTTCTTCCTGCCG

TGAGGCGGGTGGTGCCGCCTGCCTCTCGGTCCTGACGGATACGCCGAGCTTTCAGGGCGCGCCGGAATTTCTGACCGCCGCCCGTAATGCCTGCCTGCCG

TGAGAAAGGCGGCGCCGCCTGTCTTTCGGTATTGACCGATGCGCCGTCCTTCCAGGGCGCGCCGGAATTCCTCACCAAGGCAAGGGCAGCCGTCCTGCCC

TGAAGAAGGCGGTGCGGCCTGTCTTTCCGTGCTGACCGACACGCCTTCCTTTCAGGGCGCGCCGGAATTCCTCACGGCCGCACGCCAGGCTTGCCTGCCC

TGAGGCAGGCGGCGCGGCGTGCCTGTCCGTCCTGACCGATGAAGACTATTTCCAGGGCCACGCCGATTACCTGCGCGAAGCGCGCGATGCGTGCCTCCCC

TAAGTTAGGAGGTGCAACATGCTTATCAGTGTTGACAGATAAAAGTTTTTTTCAAGGAGGCTTTGAGGTACTTGTTCAAGTCAGAAAGACTGTTTTACCA

TGAACGAGGTGGTGCAGCTTGTCTATCAGTCCTCACCGATCGTAAGTTTTTTCATGGCAGTTTTGACAATTTGCGTAATGTGCGATCGCACGTTTTACCC

TGCGGCTAATGGGGCGGCTTGCATTTCGGTGCTGACCGACGAGAAGTTTTTCCAAGGCGGCTTCGAAAATCTACAGCGGGTGCGAGCAGCAGTCGTACCG

CGTTGCCGCGGGTGCAACCTGCCTTTCCGTCCTAACGGACAGTGAGTTTTTCCAGGGCAGCTTTGAGTACCTCGCTCAAATTCGCCAAGAGGTGGTGCCG

GGAAAGGTAT---GCAGTAGGTCTTAGCATATTAACTGAGGAGAAGTACTTTAATGGTTCATATGAAACTTTGAGAAAGATAGCCAGTTCAGTTATTCCC

C---CAGGACCTCGCCGATGCAGTATCCATAGTAACCGACGGCAAATACTTCAAAGGCTCCCTGGATCTGCTGAGCGGGGCCACAGATTACGGTAAACCA

TGATAAAAATAACGTAGACATGATGTCAATTTTAACTGAAGAAACATACTTTAAAAGTAATCTAAAAAATTTTAACATAGCAAACAACTTAACAAAACCA

GCATAAAATAGGATTCGATGTATTAATAGTTAACATTGATAGCTTATCAGCACAAGGAACATTAAATGATTTATCTGATGTCATTAGAACATTGAGACCA

GGCCTCAGTAGGTTTTGACGTAATATTTGTGAACGTTGACCAGAAAAACTACGGTGGCCACATCAACGACTTGAATAAAGTTTTTAGAAAGCTAAGACCA

ACATAAAATAGGATTTGATGTATTAATAGTTAACATTGATAGCTTATCAACACAAGGAACATTAAATGATTTATCTGATGTAATCAGAACATTGAGACCA

GGCCTCAGTGGGGTTTGACGTAATATTTGTAAATGTTGACCAGATTAACTACGGTGGCCATATCAACGACCTGAATAAAGTTTTCAGAAAGCTAAGACCG

ACATGAAATAGGATTTGATGTATTAATAGTTAATATTGATGAATTATCAACTCAAGGAAATATAAATGATTTAAAAGATATAATAAGAACATTAAGACCA

ATTTTGAGAAAGGACTTCATCACGTCGAGGTATCAAATTGCTCAAGCGGCAGCGAGTGGGGCTGATACAGTCTTGCTCATTGTTGCAACTACTCCATTGC

GTGATTTCGTCAGATTTGATTGTGGACGAGATTCAAATTGCACGGGCTGCCGATGCAGGTGCCAAAGCTGTCACTGTCACGCATGGGGTGGTCGGTGAAG

GTGGTTATGAAGGATATCGTCGTGGATGAGATTCAATTGGGATTGGCGAAAGATGCTGGTGCTGATGGGATCGTGCTGATATCATCTGTTCTTGGACCAC

GTCCTTGCATCTGATCTCCTTCTTTATCCATACCAACTTTACAAACTACGTCTCGCTGGTGCTGATGCTGTAACTATGGTGGTGGGTGCTTTGGAGAGCT

ATTTTACGCAAAGATTTCGTGATCAACGAATACATGATTGCGGAGGCGGCCGCAAAAGGTGCCGATACTGTCCTGTTGATCGTCGCTGTTCTACCACAAC

GTCATTAACAACGACCTGATTGTGGACGAACTGCAGGTGGCGCGCTCCGCCGCCTACGGATGTGCCGCACTCGTTCTCAATTTACCTACCCTCGGTGTTA

GTCGTGATGAAAGATATTGTAGTGGATGAGATTCAATTGGGACTCGCGAAAGAGGCCGGTGCTGACGGAATCGTTCTTATGTCATCAGTGTTGGGGCCTC

ATTCTCGCATCCGACTTGATCCTTTATCCGTATCAGCTTTATAAGCTGCGTTTGGCTGGCGCCGATGCAATTAACCTCTTGGTTGGAGCTCTAGAGAAGC

ATCATCTACAAGGACATTATCGTCCACGAACTGCAGCTGGCGGAGGCCGCAGAGGCCGGTGCCAAGGCCGCCTGTTTGGTCGCCGGTGCATGCCTGCCGC

GTTATCTGCAAGGACATTTTCGTGTATCCATGGCAACTGTACGACGCGCGCTTTTATCAAGCTGATGCTTGTGTGCTTATCATGAAGGTGCTTGGGTTGA

GTCCTCCGCAAGGACTTTATCATTGATGAAATCCAAATCGCTGAAGCTGCAGAGCGAGGTGCAGCAGCAGTGCTTTTGATTGTTGCTGCAATTGGGGAAC

ATCCTCCGCAAGGACTTCATTATCGACGTGTACCAGCTGCTGGAGGCTCGCGCCTACGGAGCCGACTTAGCTGCGTTAATAATGAAT------

ATCCTGCGCAAGGACTTCATCATCGACGTGTACCAGCTGCTGGAGGCCCGCGCCTACGGTGCCGACTGCGTGCTGCTCATCGTGGCGCTGCTGTCCCAGC

ATCCTGCGCAAGGACTTCATCATCGACGTGTATCAATTATTGGAGGCCCGTGCCTACGGGGCTGACTGCGTGCTGCTCATCGTGACGTTGCTGTCCAAGC

ATCCTTCGCAAAGACTTTATTCTCAGCAGGTACCAAGTACTTGAATCGAGGATTTGGGGTGCAGATAGTATTCTTCTCATTGCCAGTATGCTCTCGGAGC

ATTCTGAGAAAGGACTTCATCGTCGACACTTACCAAATTGCCGAAGCCAGGCTTGCTGGTGCCGACACGGTGCTGCTCATCGTTGCCATGCTCACGGACC

ATTCTGCGAAAGGAATTCATCTTTGATGAGTATCAAATTCTCGAGGCCCGATTGGCTGGTGCTGATACTGTTCTTCTCATTGTCAAGATGCTTGACACTC

ATTTTGAGAAAGGATTTCATTATTGACCCTTATGAAATTATGGAGGCTCGTTTGAATGGTGCTGATAGTGTATTGCTAATCGTTGCGATGTTAAGCCGTT

ATCTTAAGAAAGGAGTTTGTTTTTGACGAATATCAGATTCTTGAAGCCCGTCTTGCTGGGGCTGATACTATCCTGCTCATCGTGAAGATGCTAAGCGTCC

GTTTTGAGAAAAGAATTTATTTTCAGCAAGTATCAAATACTAGAAGCAAGATTAGCTGGAGCTGACACTGTCCTTCTTATAGTCAAGATGCTATCTCAAT

GTCCTGCGCAAGGAGTTCATCTTTGACGAGTACCAGATCCTCGAAGCCAGACTTGCCGGTGCTGACACTGTTCTCCTCATTGTCAAGATGCTCGAGTATC

TTGCTATTGAAAGAGTTCATTGTTGAGGCATGGCAGATATACTATGGTAGAAGTAAAGGCGCAGATGCAGTTTTGTTGATCGCTTCTGTGTTACCTGACA

CTATTATGCAAAGAGTTTGTTGTAGATCCATGGCAGATCTACTATGCTCGGACTAAAGGCGCAGATGCAGTACTGCTTATTGCTGCTGTATTGGCTGACA

CTTCTTTGCAAGGAGTTTGTGATAGATATCTGGCAAATCTATTACGCTCGGTCAAAGGGTGCGGATGCAATTCTGTTGATTGCTGCCGTGTTACCTGATA

GTCCTTCGCAAAGATTTTATTTTAGACCCTCTACAAATTGAAGAAGCTGCTGTCGCCGGTGCTGATGCGATATTATTAATTGTAGCAATTTTAAAAGACA

GTATTAAGAAAAGATTTTATCCTAGATCCTTTACAACTAGCGGAAGCTATTTTCTTCGGAGCAAACGCAGTTCTTTTAATCGTGAGCGTTGTTGGAGAAT

GTATTGAGAAAAGATTTCATCCTACATCCTTTACAACTAGCCGAAGCTATTTTCTTCGGAGCACATGCTGTTCTTTTAATCGTGAGCGTTGTTGGAGAAT

------GAGCGCCGCCACGATGAGGAGCGCCGCATCCGCGCCCATGAGGCGGGTTTCGTAAATCTGGCTCTCGTCGATAATC

GTGCTCCGCAAGGATTTCATTGTCGATGAATCGCAGATTTTCGAGTCGCGCCTTATGGGTGCTGACGCCATCCTTCTGATTGTTGCGGCCCTCGAGCCCC

GTTTTGCGAAAAGACTTTATTATCGACCCCTATCAGGTCTATGAAGCACGGGTAATAGGCGCCGACTGCATTCTGCTAATTGTAGCAGCGTTGGGTGATA

CTTCTGCGCAAGGATTTTATCATTGATCCCTACCAGATTTATGAGGCCCGGGCCCTGGGAGCGGATGCCATCCTACTGATTGTCGCTGCCCTGGAACCTC

ATTTTGAGAAAGGATTTTATAATCGATTTGTGGCAGATATATGAGTCGCGGTATATTGGGGCCGATGCAATATTGCTTATTGTATCACTGCTTTCCGATC

GTGTTGCGCAAGGATTTCACGATAGACCCGTATCAGGTGTATGAGGCACGTGTGCTTGGTGCTGATTGTATTTTGCTGATTGTCGCTGCATTGGACGATT

CTTTTATGTAAAGATTTCATAATTGATAAGATTCAAATTGATAGAGCATATGAAGCTGGTGCAGATATTATTTTATTAATTGTAGCAGCTTTAACGAAAT

GCGCTGCGCAAGGACTTCATGGTCGATATGTACCAGGTCTACGAAGCCCGCACCTGGGGCGCCGACTGCATCCTGCTGATCGTCTCCGCGCTCGACCACA

CCCGGCCGAAATGAAAGGATCGCGGAAGGCGGCGACAATGATGAGGATGCAATCCGCACCCATGAGCCTTGCCTCGGCGACCTGATATTCGTCCAGCATC

GCACTGCGCAAGGATTTCATGTTCGATACCTATCAGGTGCATGAGGCCCGCGCCTGGGGTGCGGATTGCATCCTGCTGATCATGGCATCGCTTTCCGACG

GCCCTGCGCAAGGATTTCCTGTTCGATCCCTATCAGGTCTATGAGGCGCGCGCCTGGGGCGCGGATGCTATCCTGATCATCATGGCCAGCGTCGACGATG

GCATTGCGCAAGGATTTCCTGTTCGACCCCTATCAGGTCTATGAAGCGCGTAGCTGGGGAGCGGATTGCATTCTCATCATCATGGCCAGCGTCGATGACG

GCCCTGCGCAAGGACTTCATGGTCGATCCGTGGCAATGCGCCGAAGCCCGGAGCATGGGTGCCGACGCGATCCTGATTATCGTTGCGGCACTCTCCAATA

TTGTTATGCAAGGAATTCATTATTCAGCCTTACCAGATTTATCAAGCAAGAGTTGCTGGTGCTGATGCGGTTTTATTGATTGCAGCCATACTTTCTGATC

CTACTGTGCAAAGAATTCATCATTGACCCTTATCAAATCTATCTAGCACGGACAGCAGGCGCAGATGCGGTTTTGTTGATTGCTGCTATTTTATCAGACC

CTGCTCTGCAAAGACTTTGTGATCTACCCCTATCAGATCTACAAAGCCCGCTTACTGGGGGCCGATGCCGTGCTGTTGATCGCGGCGATTCTTTCGGATC

CTGTTGTGCAAGGACTTTATCCTCTATCCCTACCAGATGTTTCTGGCAAGGGTGCGGGGCGCCGACGCAGTGCTACTCATCGCGGCTATCCTCTCGGACT

ATACTAATGAAGGATTTTATCGTTAAGGAATCGCAAATTGATGATGCATATAACCTAGGTGCTGATACTGTATTGCTAATAGTCAAAATACTAACTGAAT

CTGCTCATGAAGGACTTCCTGGTCGACGAGTACAAGATCTACCAGGCAAGGGCATCGGGGGCATCCTCGGTTCTCCTAATCACAGGCGTATTCCCTGACC

TTACTTAGAAAAGATTTTATTATAGATGAATACATGATCTATGAAGCTGCTGTTAATAATGCTAGTGGAATATTATTAATAAGTGGAATCTGTCCTAATA

GTTGTAGTAGATGATGTAATAATACACCCAATCCAAATAGCGTTAGCAGTTGAAAATAAAGCTGATGGAGTCATATTAAATTTGTCTTATTTAAAAAATT

ATAGTCATGAAAGATATAATTTTACATCCAATACAAATAGCACAAGCTGTGGAAATGAGAGCTGACGGAGTCATTTTGAATGCAGCAATACTGGGCAATT

GTTGTAGTAGATGATGTAATAATACACCCAATCCAAATAGCATTAGCAGTTGAAAATAAAGCTGATGGAGTCATATTAAATTTGTCTTATTTAAAAAGTT

ATAGTCATGAAAGATATCATTCTACATCCAATCCAAATAGCACAAGCTGTGGAAATGAGAGCGGACGGAGTTATTTTGAATGCAGCAATACTCGGGAATT

ATTGTTGTAGATGACATAATTATCCATCCTATACAAATAGCTTTAGCCGTAGAAAACCAAGCAGATGGTGTTATATTAAATTTATCTTATTTAAAAAATA

TCAAGGATTTGATCTTGTACTCTCGTTCACTCAACATGGAGCCATTGGTGGAGGTACACGCCTTGGTAGAATTGGATGTGGCACTTGAGGCTGGGGCGAA

TGTCGCAATTCATCAAGAATGCACGTGTACTTGGATTGGAAACGATTGTGAATGTCGGTACAGCCGAAGAAGCTCAGAGTGCGGTTGACGTGGGAGCTTC

TTGAAAACTTTTTGAACTTGGCGACCATTATTGGATTAGAGACTATTGTGGAATGCCACACGTACAATGAAGTTCAGGCGGCGTTGGATTCTTTGGCTCA

TGCTTTACCTCACAAAGATTGCATCGACGATCAATTTACAAGTGATTGCCAGTGTGACGTCCGAGGTACAAATAGACATGCTCTCCAAACTAGGTATCAG

TCACTCGATTAATCGGGTTTTCTCGTTCCCTCGGGATGGAGCCACTAGTCGAAGTCCACGCGGATACCGAACTTGATGTGGCCATCAAAGCCGGTGCCAA

CCAAAGTCCTTCTCAAAGCAACCAAGGCTGTGGACCTGGAAGCTATTGTGGCCGTGTCTTCCAAGGAAGAAGCGCAAACCGCAATCGATAGCGGGGCTCG

TCGAGAATTTCTTGAACCTGGCAACCATGATTGGTTTGGAGACGATCGTTGAGTGTCATACACACGATGAAGTACAGAGAGCCATCGACATCCTGGCACC

TGTCATACCTTACCAAGATAGCGTCTAGTCTTCAGCTTCAGTCATTTGCCACCGTAACTTCCGAAGTGCAATTACTGGAAGTGGCAAGTCTGCAGATTGA

TCGAGAAACTCCTGAACGCGGCATACATTCTCGGGATGGAGGTGCTCGTAGAGGTCCATACGGAAGAGGAGCTGCAGTACGCTTTCGAGGCAGGAGCCGG

CGCTGTATTTTGCCAAGGCAAGCGTTGCGCTGGGGATGGCGCCCATCATCGAAGTGCACGATGCCGCCGAGTTGGATGCGCTTTTGACAGCAAAGATTAG

TGCGCAAACTGCTTGAGGCAACACATCGGCACGGTCTTGAAGCACTCGTAGAGATTCATGACGAAGAAGAGCTGGAGATCGCAAAGGCTGCCGGTGCAGA

--GTTGACGTTGAGTTTGCAACTCACAACCTCGGCATGTGCGCCTTGGTGGAGGTGAACAGCGTTGAGGAGCTGGACATCGCGCTGGCTGCCAGGTCGCG

TCAACGAGCTCATTGAGGTGACCCACAAGCTCGGCATGTGCGCCCTGGTGGAGGTGAACAGCGTCAAGGAGCTGGATATCGCGCTGGCTGCCCGCGCCAG

TTATCGAGCTCATTGACGCAACTCACAATCTCGGTATGTGCGCTCTGGTCGAGGTGAACAGCGTCCAGGAGCTGGACATCGCTCTGGCTGCCAAGGCCCG

TCCGCGATCTTTACAGTTACGCACTAGAGTTGGGCATGGAACCTCTTGTTGAAGTCAATAATGCGCAGGAAATGGAGCTCGCCCTGTCCTTACCTGCCAA

TCAAGGAGCTCTACGACTACAGTCTTAGCCTGGGCATGGAGCCGCTCGTGGAAGTCAACAATCCCGAAGAAATGACGCGCGCCATTCGACTCGGTGCAAA

TGCATCGTCTATACAACTACTCTCTTCAGCTGGGAATGGAGCCCCTCGTCGAGGTTCAGAATGCTGAAGAGATGACAACAGCAGTCAAGCTGGGTGCCAA

TAGAGTCCCTCTATAAATTTTCTAAATCTTTGGGCATGGAACCGTTAGTAGAGGTTAACTGTGCTGAGGAAATGAAGACTGCTATAGAACTTGGTGCTAA

TTACAAGATTGTATCACTATTCTCGCAGCTTGGGAATGGAGCCTCTCGTAGAGGTTAATACTCCGGAGGAAATGAAGATCGCGGTGCAGCTTGGAGCCGA

TGAAGGAACTGTACAGCTACAGTAAAGATTTGAACATGGAACCTCTCGTTGAGGTGAACTCCAAAGAGGAATTACAAAGGGCTCTAGAAATTGGTGCTAA

TCGAGCGCCTATACAAGTACTCCTTGTCTCTCGGCATGGAGCCCCTAGTCGAGGTCCAGAACACCGAGGAGATGGCCACAGCCATCAAGCTCGGCGCCAA

TCAAATACATGATTAAGATTTGCAAAATACTTGGAATGGCTACACTTGTGGAGGTCCATGACGAAAGGGAGATGGATCGTGTTCTTGCAATTGAAGTCGA

TAACCTTCTTGCTTAAGATTTGCAAGAAGCTTAGCTTGGCTGCCCTTGTTGAGGTACATGATGAGAGAGAGATGGGTCGTGTACTTGGAATAGAAATCGA

TCAAGTACATGCTTCGCATCTGCAAGAACCTCGGAATGACAGCTCTCATAGAGGTTCATGATGAGAGGGAGCTGGATCGTGTGCTCAAAATAGATGTTCA

CAAAAATTTTATTACAAAAAGCGCATGAATGCGGTCTTGAAGCGTTAGTGGAAGTGCACAATCGTCAAGAATTGGATCAAGCTATTGAAATCGGTGCTGA

TAAAATTTCTCATACAAGAAGCGCATAGATTAGGTTTAGAAGTTTTAACCGAAATTCATGACTTTTCAGAACTTGAATTAGCTTTAGAAGCAGAAGCTTC

TAAAATTTCTCATACAAGAAGCCCAGAGATTAGGTTTAGAAGTTTTAACCGAAATTCATGATTTGTCAGAACTTGAGTTAGCTTTAGAAGCAGAAGCTCC

TTGCGGAGCACAGGAATGGAGAACTGCTGCGTGATCGCCTTCAAGTAGTCTGGCGACCCCTGAAAGTAGTGGGAGTCGGTAAGGACGGAGAAGGCCGCGC

TTCGCGACTACCTCCAGACGGCCTCTGAAACGGGGCTCGATGTGCTCGTTGAGGTGCATGACCGGCGGGAACTCGACACGGCGGCCGAAGAGGGTGCCAC

TGTTAGAGCTAGCTCAACTCGCCGAACACTTAAATATGGATGTACTCATAGAAGTCCATGATTCCGAAGAGCTAGAGCGAGCGCTGGTTCTCGATACACC

TGGTGGAATATCTGGACCTGGCCCGGAGGTTAGGCCTGGCAGCCCTGGTCGAGGTTCATACGGCCCCGGAACTGGAGGTAGCCCTCCGGGCAGGAGCCGG

TGAAAAAATTCCAGATCGTTGCAAATATTTTAGGGATGCAGTGCCTGGTTGAGGTGCATGATGAAAGGGAACTTGAGCGGGCTTTGGAATCCGGCGCAAG

TGGTTGATTTGTCCGGCTTGGCATTGCAATTGGGAATGGATGTGCTGGTTGAGGTTCACGATATTGACGAACTTGAGCGTGCAATACAGATTTCTGCGCC

TAAAAGAGCTGTATAGCTACGTACGAGAAAAAGGACTAGAGGCAATTGTTGAAGTTCATGATGAAGAGGAATTAGAAATTGCAATCGAATTAAATCCGCA

TGGCCGAGCTGGAAGCGTGCGCGCACGAACTCGGCATGGATGTGCTCGTGGAAGTCCACGGCGGCGAGGAACTCGATAGTGCGCTGCGACTGAAGACGCC

TTGCGCAGGACCGGCAGGTCGCAGGCGCTTCTGGCCTGCTTCAGATATTCCGCGCTGCCACCGAAATACTGCTGGTCCGTCAGAACCGACAGACACGCGC

CGAAGCGGCTCGAGGACGAGGCTTTTGCGCTTGGCATGGACGTGCTGATCGAAGTTCATGATGCGGAGGAAACCGAACGGGCGCTGAAACTCACTTCCCC

CGAGCGCATTGGAAGCGACCGCATTCGAACTCGGCATGGATGCGCTGATCGAGGTACATGACGAAGCCGAGACGCAACGAGCGCTAAAGCTGTCGTCGCG

CAAAAGAGCTGGAAGACACGGCTTTCGCACTGGGCATGGATGCGCTGATCGAAGTGCATGACGAAGCTGAAATGGAACGCGCCCTGAAGCTTTCCTCGCG

TGCACGAGATCGAAGCTGCTGCGCTGGAGCATCGGATGGATGCGCTGGTCGAAGTGCACGACGAAGAAGAAATGAGTCGCGCCCATGCGCTGCAATCGCG

TTCTTTATCTGAGAAAAGTTGCAATTAGCCTTGGATTAACAATATTGGTTGAAGTGCATGACTCTAATGAGTTGAAAAGAGTACTAGATTTAGAATTTCC

TGCAAAACTTTTTGCAAGTAATTCACGATTTGGGCATGAATGCACTGGTAGAAGTTCATACTTTGGCTGAATTGGATAGAGTTCTTAAGCTTGACTTACA

TGCGCTACTTCCTGAAGATCGCCCACAGTCTGGGACTCAATGCGCTGGTGGAAGTGCATAGCCTGCCGGAACTTGAGCGTGTCCTTGCCTTGGATCTGCG

TGCGCTATTTTCTGCGGATTGCCCATGGGTTGGGTCTAACCGCCCTTGTGGAGGTGCACACCGCCGAGGAGATGGAACGGGTGCTGGCACTAGAGGTGCA

TAGAGAGTTTATTGGAATATGCCAGAAGTTATGGTATGGAACCATTGATAGAAATTAATGACGAAAATGATTTAGATATAGCCCTAAGGATAGGGGCTAG

TTGAGGCAGGGATTCAAAAATGCAGGGAGCTCTCAATGGAGCCACTGGTGGAGTGTCACACATCGCTTGACATCTTCAGGGCCCTTGAAGCAGGTGCAGA

TAGAAGAATATTTGAATATTTCTAGAAATCTTGGATTAGATGCAGTAGTTGAATGTCATACATTAGAGGATATTGAATCTGTAGTAGATTATAACCCTGA

TAGAAGATATGCTAAATTATTGTGTCAATCTAGGTACCCAAGCTATAGTAGAAGTACATGATTATAACGATATATATTATGCTACTCAATGTGGTAGTTA

TGAAAGACCTATTAACCTCGTGCATTACCATGGGAATTGAGGCAATAGTTGAAGTTCATACCACTGCTGATGCACTTAGATCTGCAGAAATAGGCTTCAC

TAGAAGATATGCTAAATTATTGTGTCAATCTAGGTACCCAAGCTATAGTAGAAGTACATGATTATAACGATATATATTATGCTACTCAATGTGGTAGTTA

TGCAAGATTTACTAACCGCATGTGTGACCATGGGAATTGAGGCAATAGTTGAAGTTCATACCACAGCCGATGCGCTCAGATCAGCAGATATAGGGTTTAA

TGGAAGAAATGTTACAATATTGTGTTAATGTTGGTACTCAAGCTATTGTTGAAGTACATGATTATAATGATATATATTTTGCTACACAATGTGGAGCTTA

AGTGCTTGGAGTGAACAATCGCAATCTACACACGTTTCAATTAGATTTATTACTCTATGTGCTCTGTCGGGTATGTCATCTGCACATGATGTGGACAGGT

CATCATTAGTGTCACCGGAGTGGATGGTGCCGATAACAAGTTTGCTGTATCCTCGCCAAGGACAACAAGGCTCTGGAAGAAGTGGAAGAGGCGTGGATCT

GAATATAATGGTCTCAAATCGTGATCGTATCACAGGTCAGTTGTTACCATTATCACTATTGCAGGAGGTGGTATATCAACACCTGAACAGATGAAGAAGC

CGCTCTTGTCGTATCCAATAGAGATTTGGAGACTTTTGACTTTGATAA------

GGTTATCGGAGTGAACAATCGCAATTTGCACACCTTCCAAATGGATTTTATACGGTTTGCGCGCTTTCGGGAATGTCTACGGCTATGGACGTGGACCGTT

AATGCTCAGTATCTTGCATCTGGGAACGGTAGAGGATATGGTGGCCGCATCCTTGCCAATAATGACCAGCAACTGCAAGAGATTGAGGATGCTTGGTCGG

CAATATTTTGGTCAACAACTACGATCGAGTTGCACAGGAACTTCACCCATTATTTGTTTGGCTGCGGGAGAGATCGAAACCACCGATCAAATGAAGCGCC

CGGAATAATCGTATCCAATCGTGAGCTTGAAGACTTTTCCTTCGATATTTGAAAAGCAATGCTCTGGCGAAAGTCCGCGCAAAACATGGTGAAGACCTTC

TATCTACGTGTTTACGGATGTGGATCGCGCTCGTCGACGAGTTGTTAAGCAGTGTGCCTCGGCAGTGGACGCATTGAGACGCTTGAACAAGTTGAACAGT

CATTCTGCTTCTCGCGAAGAGAGACCTCGACAGTCTTGTGCCTATGCTGAGCGGTTCGCACCGGAAGCTAGGAGACTAGGGCTTCGA------

GATCATCGGCACTAACGCACGCGACCTGCGCACTTTCAACGTCGACCTGTGATTCCAGTGGCGGAAAGCGGTATCCACGATGTGATGGATGCTTGGCGTT

CCTGATTGGCGTCAATAACCGAGACCTCCGTTCGTTCAAGGTGGACATGTCACGCTCTTTGCGCTCAGCGGCATTCGGTCTCACGCAGACGTGGTCAAGT

GCTGATCGGAGTCAACAACCGCGACCTCCGCACGTTTAAGGTGGACCTGTCACGCTCTTCGCGCTGAGCGGCATTCGCTCGCACGCGGACGTCGTCAAGT

ACTGATCGGCGTCAACAACCGCGATCTCCGCACGTTCAAGGTGGACATGTCGCCCTCTTTGCTCTCAGTGGCATTCGCTCGCACACAGACGTGGTCAAAT

AGTTATCGGCGTCAATAATCGCAATCTCCATGACTTTAAGGTTGACATGTCGTTCTTTGTGCGCTTAGTGGTATTGCGTCGAGGAATGATGTTGAACGGT

GGTTGTCGGCGTCAACAATCGCAACCTCCATGACTTTAATGTCGACATGTCATCCTTTGTGCTCTTAGCGGTATCAGCGGTCGCAGCGATGTAGCCAAGT

GGTCATCGGCGTCAACAACCGTAACCTGGAGAGCTTTGAGGTGGATCTACAATCATCTGCGCACTTAGCGGCATCAACACTCACGATGACGTTCTAATGA

AGTTATCGGCGTTAATAACAGGAATTTGCATAGTTTTGAAGTAGACCTGTTATTCTTGCAGCTCTCAGTGGAATTAGTAGTCCTGCTGATGTTGCCCATT

GGTGATTGGCGTGAACAACAGAGACTTGACGAGCTTCGAGGTTGACCTACTATCGTTTGCGCTCTTAGTGGAATTTCCGGACCCAAGGATGTCGAGGCTT

AGTTGTAGGTGTCAATAATAGGGACCTGCATTCATTCAACGTAGACCTGTTCTTCTAATTGCTCTATCGGGAATTACCACCAGGGACGATGCTGAAAAAT

GGTTATCGGCGTCAACAACCGCAATCTCGAGAGCTTCGAAGTCGACCTACCTTCCTCTGCGCTCTCAGCGGCATCAACACTCACCAAGATGTTCTTGACT

GCTCATTGGCATCAATAACCGTAACCTTGAAACATTTGAGGTAGATCTATCCTTGTGGTTGGAGAATCTGGGTTATTCACTCCCGAAGATATCGCCTTTG

GCTTGTTGGCATCAATAACCGAAGTCTTGAAACATTTGAAGTGGACATATGATTGTGGTTGGCGAATCTGGTCTGTTTACACCAGATGACATTGCCTATG

GCTTATCGGCATCAATAACCGAAGTTTAGAGACGTTTAAGGTTGATACATTCTGGTTGTTGGTGAATCTGGTCTTTTCACGCCTGATGATGTTGCATACG

AATTATTGGGGTCAACAATCGAAACTTAACTACCTTTTCGGTTGACCCATCATAAGCGTTGCTGAATCGGGAATTCACACCGTAAGCGATGCAAAACGTT

CATTATTGGCATTAACCACCGTAATCTTAAAACTTTTGAAATCGATCTATCATTACCGTAGCTGAATCAGGAATCCATCACCCTATACAAGCCAAACGGA

CATTATTGGGGTTAACCACCGCAATCTTCAAACTTTTGAAATCGATCTATCATTACTGTAGCTGAATCAGGAATCCATCACCCTACACAAGCCAAACGGA

CAAGTTCGGCGTAGCGCGCAGCGATGTCAAGCGGACGAAAATCCTCGAGAGATTGATGCCGCCGTCACGGCTCGTAATGGCCGAGCGGAAATCGCGCGTC

AATGATTGGCGTCAACAACCGCAACCTGAAGGACTTCAGCGTGGATCCACTATCGCGGTGTCGGAGAGCGGCCTGAAAACGGCTGGCGACATCGCTCATT

ACTTATCGGCATTAATAATCGCAATTTAAAAAGTTTCGAAACGAGATTCGCTTAGTGGTCACCGAAAGCGGAATTCATAGTCCCGCAGACGTAAGGAAAA

TATTATCGGGATTAACAACCGGGACCTGCACACCTTCAAGGTTGACCTCCGGTGGTAGTGAGCGAGAGTGGCATCCGGAGCCGGGCCGATGCCGCCCTGG

AATAATCGGAATAAACAACAGGGATCTTAGAACTTTTGAAGTGGATTTCGGGTGGTGGTAAGCGAAAGTGGAATTAAGGACACCGAAGATTTAAAGTATC

ATTGATTGGTATCAACAACCGTAATCTGAGTACCTTCAACGTGTCATTCGTTTGCTCGTAAGCGAGAGCGGTATCCTTACCTCTGCCGATGTACAACGGC

CGTTATCGGTATTAACAACCGTAATTTAAAAACATTCGAAGTCGATTTTTACTTTGGATTAGCGAAAGCGGGATTCATTCAAAAGAGGATATTATTCGTG

GCTGCTGGGCGTGAACAACCGCAACCTGCGCACCTTCGAGGTCTCGCTCGGCTCGTGGTAACCGAATCGGGCATCCTCGGGCCCGACGACGTCAAGCGGA

CATGCTGGCCATAGCTCACGGCGATGGCGGCAGGGTCGAACTGCTCACGACGGCCGGTAATCCCAGGGCGAGCTTCTTGCGAAGAATGAAATCCCTTGGC

GCTTCTCGGTATCAACAACCGCAATCTGCGCACCTTCGAAGTCGGCCTAAGCTGCTGGTCGGCGAAAGCGGCATCTTCACCTTCGAGGATTGCCAGCGCC

GCTGATCGGCATAAACAACCGCAATCTGAGAACGTTCGAGACCAGCCTCGCTTGCTGGTCAGTGAGAGCGGCATCTTCACACATGACGATTGCCTCAGGC

CCTGCTCGGCGTCAACAATCGCAATCTGCGCAGCTTCGAGGTCAATCTCGTCTGCTGGTTGGCGAAAGCGGCATCTTCACGCATGAGGACTGCCTGCGGC

GCTGGTTGGCGTCAATAATCGCGACTTGAAAACCTTCACCACCGATCTGCGCTTCTGGTCGGAGAAAGCGGTATCGACAACCACGCCGATTGCGTGCGGC

TCTGGTTGGAATTAATAATCGCGACCTTAAGACTTTCAATACTGATTTGTTCTATTGGTAAGTGAGTCTGGTTTATTTAACTCTGCAGATCTAGAAGAAG

TTTAGTAGGAATCAACAATCGCAATCTAGAAGATTTTACGGTTGATTTATCACTGTTGTCAGTGAATCTGGACTGTATACACCTGCTGATTTATCTCTTG

GCTGGTTGGCATTAACAACCGCAACCTCAAGACTTTTGTGACCGATCTTTGCTCTTGGTCAGTGAATCGGGTTTGTTCACCGGCGCCGATCTCGATCGCG

GCTTGTGGGGATCAACAATCGCAACCTGATGGACTTTAGTGTTGATCTATCCTTGTGGTCAGCGAATCGGGCATTCATCAGCGCGCCGATGTCGAACGGA

ATTTATAGGAATTAATTCAAGAGATCTAGAAACCCTTGAGATAAATAAGTGGTAAAGGTGGCAGAAAGTGGAATTTCTGAGAGGAATGAAATAGAAGAAT

GATAATAGGTGTGAACAACAGGGACCTTGAAACCTTCGAGGTGGACCTCTCATACTCGTATCAGAGAGTGGTGTCAGGGGCCCCGAGGATGCTGAGATAC

AATTATTGGTATAAACAATAGAAATCTAGATGATTTTACTATTAATCT---TACTTAATATCAGAAAGTGGAGTAAAAACCATAGAAGATGCTAAAAAAT

TATTCTTATGATAAACGAATTTGATTTTATTAATAATCGCTATGAATACTAATAACTATAGCTAAAATAAATGTTAGTGATATAAATTATGTAGAAAAAT

TAACTTCATGATAAACCAGTGGGATAGAATAAAAAATATTCTATACCCGCCACAACAATTGCAGCTGGTGGCATAATGACAATGGAGCAGGTTCATATGT

TATTCTTATGATAAACGAATTTGATTTTATTAATAATCGTTATGAATACTAATAACTATAGCTAAAATAAATGTTAGTGATATAAATTATGTAGAAAAAT

TCACTTCATGATAAATCAATGGGATAGAATAAAAAATAAACTATACCCGCTATAACAATTGCAGCGGGCGGTATAATGACGATGGAGCAAGTACATATGT

TATACTTATGATCAATGAATATGATTTTTATAATAATATATATGAATAATTATTACTATTGCTAAAGTTAATACTAATGAAGTGAATTATATAGAAAAAT

ATAGATCTGTTGGTGTTGGAATGTGTCTTATTGGTGAATCACTCATGAGGGCTTCGGACCCATCCTTGGCAATCAAGGGACTGTGC

GTCGTGATAGGGGATTCAACTGCGTGTGGGTGAGTGATGCTCTGTACAAAAGTGGAGATCCAGTAGAACACCCCGGTGCAATCATC

ATTTAGCAGTGGGATATGATGGAGTGGTGGTTGGTAAAACAGTCATGGGATCGGCACGAGCACCTGAGTTTATTAGGACTGTGAGA

------

ATCGTCAGGTCGGTCTTGGTATGTGTTTAATCGGTGAGAGTTTGATGCGTGCGACTGATCCGGGACAAGCGATTGCTGCTCTATGT

TTCGGGATCAAGGCTTCAACTCGGTATGGGTTGGGGAAGCCCTGTACAAGGGTGGGGATCCCAACGAACAACCCGGCGGTATCATC

ATTTGGCGGTTGGGTACGACGGAGTTGTGGTCGGTAAAGCAGTCATGGGAAGTCCGGCAGCTCCCGAGTTCATTCGAGCGGTTCGG

TTATCTTGGCTGAAGGAAGAGTCGGTATAATCGATCGTCCTCAGGCAGACAGCACAAGAAGTGCTAAGTATATTACCGAATTAAGG

TACGTGACGTTGGCGCCGACGGCATCGTTATCGGTGGCAAGCTCATG------GCTTCGATCGCAAGA

------GTCATCGTGGAGCTGCCACGTGAGCACGCCCCCAATGCCTCA---TTGCTGAGAGGTCTGCGT

TGCGGGACGCTGGCTACTCGTCGATTCTAGTTGGTGAGGCACTCATTCGCGCCGCAGAATCCTCGAAACTACCGAGTAATGCTTAC

ACGAGAAGTGCGGCGCTCGCGGCATCTTGGTTGGCGAGTACTTGATGAAGAGTGGCGACGTAGCGACTACGGTGAAGGACCTCCTG

ACGAGAAGTGCGGCGCCCGCGGCATCTTGGTCGGCGAGTTCTTGATGAAGAGCGGAGACGTTGCCACGACGGTGAAGGACCTCCTG

ACGAGAAATGCGGCGCACGTGGCATTTTGGTCGGCGAGTACTTGATGAAGAGCGGCGATATCGCCACGACAGTGAAGGATCTACTG

ACAAAGGTCAAGGCGTCGGCGCGATACTCGTGGGCGAGAGCTTGATGAAAGCAAAAGATGTAGGTGAATACATCAAGGAGCTAATG

ATCTCAAGGAGGGCGTTGGCGCCGTCCTGGTAGGCGAAGCGCTGATGCGTGCCAAGGACAAGCGCGCTTTCATTCACGATCTGCTA

ACAAGAGGGATGGTGTCAATGCTGTTCTTGTCGGTGAGGCCATTATGAAGGCGCCTGACGCCAGTGTTTTTATCAGTCAGCTGTGC

ATAGTAGTCAAGGTGTATCAGCAGTTCTTGTTGGTGAATCTCTTATGAGAGCTTCGGATCCTGCCGCTTTTGCACGAGAGTTACTT

ACAAGAAGGAAGGTGTCAAAGCAATTCTCGTGGGAGAGGCACTTATGCGGGCATCTAACACATCTGCTTTTGTTGCAGAGCTGCTC

ACAAAAAAGAAGGTGTCCATGGATTTTTAGTGGGTGAAGCCCTAATGAAATCAACCGATGTGAAGAAGTTCATTCATGAATTATGC

GCAAGCGCGACGGTGTCAACGGCATTCTTGTCGGCGAGGCCATCATGCGTGCCCCTGATGCCACCCAGTTCATCCGTGAGCTCTGC

TTCAAGAAGCCGGTGTCAAAGCAGTTTTAGTCGGCGAATCTCTTATTAAACAAAGCGATCCCGGGAAGGCAATCAGCACCCTATTT

TACAAGCAGCCGGAGTCAAAGCAGTTTTGGTTGGAGAATCCATTGTGAAGCAGAACGACCCTGAGAAAGGAATAGCTGGACTTTTT

TGCAGAATGCTGGTGTTTCTGCAATTTTGGTAGGAGAGTCCCTGGTGAAACAGGAAAATCCCGGGCAAGCCATTGCTGGACTATAT

ATATTTCTGCTGGTTACAACGCCGTTCTCGTGGGAGAGGCATTAGTTAAATCAGAAAATCCTCAACAATTTATTAGC---GCTATT

TGCGTGAGCTTGGATTTGACGCCGTTTTGGTCGGAGAAGCTCTTGTGCGATCTAAAGACCCTGCTTTGTTAATTAAA---CAAATG

TGCGCGAGCTTGGATTTGACGCTATTTTGGTCGGAGAAGCCCTTGTGAGCTCTAAAGATCCTTCTTTGTTAATTAAA---CAAATG

GCCGG------AAGATCGCCGCACGCCTCACGATAGCGGCGCTCCGGTTTCAGCAGTTCGGCAACTCTTGCCTTGGTCTCGAG

TGCGCGATGCCGGTTTCCATGCTGTGCTCATCGGAGAAGGCCTCCAGACGAGCAAAGAACTGACGGGCCTTACCTGGCCCGTCAGC

TGAAGAAAAATGGAGTTAACGCCTTTCTTATCGGTGAAGCCTTCATGAAAGCGGAAGACCCCGCAGCTAAACTTGCAGCATTATTC

TGGCCGGCGCCGGCGTGGATGCCATCTTGGTGGGCGAGGCCCTGATGACAGCCGGGGACGTTATGGCTAAGATGGCCGAACTACGG

TGAAAGAGCTTGGCGTGGATGCTGTTTTAATTGGCGAGACCTTTATGCGGGCCCGGTCCATAAGTGAAAAAATAAGAGAGTTTAAA

TGCGTGCTGCGGGTGTGAATGCGTTTCTTGTTGGTGAGGCGTTTATGCGTGCGACCGAGCCTGGTGAGTCACTCAGAGAGATGTTC

TTAAAAAAGCAGGAGCAAAAGGTGTATTAGTTGGGGAAGCACTTATGACAGCATCTTCTATTCGTACCTTTTTTGAAGATTGTAAG

TGCGCGACGCCGACGTCCACGCCTTCCTGGTCGGCGAAGCGTTCATGCGCGCGCCGGAGCCGGGCGTGGAACTGGCCCGCCTGTTC

GGCGGTAACGCTTCCGCCTGGCGGCGAAGCACGGCGAGAGAACCCTGCGCTTTCAAAATCTCCTGTCGTGTCGCCACAATCTCCTG

TGGAAAAGAGCGGCATCAGCACATTCCTCGTCGGCGAAAGCCTGATGCGCAAGGACGATGTGACGGCGGCGACCAAGGCCCTGCTG

TGCAGAAGAAGGACATCGGCACCTTCCTTGTCGGCGAGAGCCTGATGCGGCAGGAGGATGTCACCGCGGCGACCCGCATTCTGCTG

TTGAAAAGTCCGGCATCGGCACTTTTCTGATAGGCGAAAGCCTTATGCGACAACATGACGTTGCGGCAGCCACCCGCGCGCTTTTG

TGTCGCAAGCGGGGGTCCAGACCTTCCTCGTTGGTGAGAGCCTGATGCGGCAGGACGACGTCGAAGCCGCCACCCGCGAACTTCTC

TTAGTTCCTATGGAGCAAAGGCTGTTCTTGTTGGTGAGTCTTTGATGAGACAACCTGATATTGGATTGGCGTTAAAAAACTTGCAA

TCGCTGAAGCTGGTGTGCGTGCGGTTTTAGTCGGAGAGTCTTTAGTTAAACAAAGCGATGTAGAACAAGCTGTGCGTAGTCTTTTA

TTACCCAAGCTGGCGCACAGGCCGTCTTGATCGGTGAATCGCTGGTCAAGCAACCGGATCCAGGCTTGGCACTGCAGCAGTTAGTT

TGGCACAAGCCGGCGCCCAAGCCATTTTAGTCGGAGAATCCCTAATGCGCCCCCCAGAACTCCGAGCGCGGATTCAGCAGTTATTT

TAAGGAAATTAGGTGTTAACGCTTTCCTAATCGGATCATCACTGATGCGAAACCCA------GAAAAGATTAAAGAATTTATA

TGGCAGGTTACGGTGCAGATGCTCTGCTCATCGGCACAGCACCCATGTCAGCCCAGAACCCACGGGAACTCCTTGAGGAGATAGTA

TAAAGGAATATGGAGCAGATGGTATTTTAATAGGAACAAGTATCCTACAAAACAACACAGAAAAAGAAATTGAAAAATTTATTGAA

TAGGTAGTCTAGGTTATGATAGTATATGCTTAGAAAAAAAACTTATT---GACGAGGATTTGGAGGTTTTTGTGCAATCATGTAAA

TAGCACTGGCAGGTATAGATGCCGTTTGTTTCGGAAGAAGACTTGTA---TACCCAGACGTACCAGACTTCATCAACCAAGTCAAA

TAGGTAGTATAGGTTATGATAGTATATGTTTAGAAAAAAAACTTATT---GATGAAGATTTGGAGGTTTTTGTGCAATCATGTAAA

TAGCGCTCGCGGGCATAGACGGCGTTTGTTTCGGAAGAAGACTTGTA---TGCCCGGACGTACCAGATTTCATTAACCAAGTGAAG

TAGGAAGCTTGGGATATGATAGTATATGTTTAGAAAAAAAATTAATT---GATGATGACCTTCAACAATTTGTTACCTCATGTAAA

Tryptophan synthase Alpha subunit amino acid sequence alignment

64 227

Desred NATFNALVTYVTAGDPDLKTTGRLICSMDRAGADIIEIGIPFSDPSADGPVIQRASARALKEGTNPPAILELVKEVQVLAPLILMSYYNPILQYGCRDAA

Mooth TAAFAALIVYLCAGDPSLEVTGQAVRELAGAGVDLIELGVPFSDPVADGPVIQAASKRALAAGVTLPEILELVKSLGLAVPLILMSYYNPLLQYGTADLA

Bacce QAAFEAFIPYVMGGDGGLEILKERIRFLDEAGASIVEIGIPFSDPVADGPTIQRAGKRALDSGVTVKGIFQALIEAEVQIPFVLMTYLNPVLAFGIENCM

Grate NAITNALIPFITAGYPNIDICIKALKVLDREGADLIELGIPYSDALADGPIIQEASQAALKQGIYIEQVLSILTKVDLHAPIIIFTYYNPVLVRGICEIS

Antit NIISEALIPFITAGYPDINTTIQALYELDSQGADIIELGIPYSDALADGSVIQHSSLIALQGGTYIDQVLHILEVVKLNTPIIILPYYNPILKRGIKQIS

Porye NTISSALIPFITAGDPDLVSTGKALQILDSYGADIIELGLPYSDPLADGPIIQEASNRALKQGINLNKILSMVKTVTIKAPIVLFTYYNPVLHLGIYAIS

Porpu TTISSALIPFITAGDPDLVSTSKALKILDQHGADIIELGLPYSDPLADGPIIQAASSRALKQSINLNNILDMVNITNIVAPIVLFTYYNPVLNLGISAIS

Cyacal NSIKNLLLPFVSLGTPNTQINKQAIIAMDKNGANIIELGIPYSDPVADGPVIQDAYNKAIKNGVNIRKAFKILMNLKIKSPIIVFIYYNQLLNYGLEKLI

Cyame -----MLIAYLTAGAPDINTTKEAVMKLAKKGADVIEIGVPYSDALADGAILQKASKQALMNGFHLDHLWNLLSEVEIEVPLVILAYYNQIWHYGVKKLV

Crowa SDCFQALIPFITAGDPDLDTTAKALRVLDASGADIIELGVPYSDPLADGPVIQAAATRALGRGVKLEDVLKIVKEVEIKAPIILFTYYNPIFYRGLQQIK

Trier SDCFEALIPFITAGDPDLETTAKALEVLDRSGANMIELGVPYSDPLADGPVIQAAATRSLNRGTTLESVLEVVQTVKLRSPIILFTYYNPILYRGLKKIY

Nostpu SDCFEALIPFITAGDPDLETTAKALQVLDQSGADIIELGIPYSDPLADGPVIQAAATRALQRGTKLEHVLEMLQGIKLRSPIVLFTYYNPILHRGLQEIA

Anava SDRFEALIPFITAGDPDLETTAAALKILDSNGADIIELGIPYSDPLADGPVIQAAATRALQNGTKLESVLEMLKVTSLQAPIVLFTYYNSILHRGLEQVA

Theel SERFEALIPFLTAGDPDLETTVAALKILDDHGADLIELGMPYSDPLADGPVIQAAATRALQRGTRLEAVLEMTTDLQLTAPLILFSYYNPIYHRGLKAVA

Synel SDCFAALIPFLTAGDPDLETTRQALLALDREGADLIELGVPYSDPLADGPVIQAAATRALQAGTRLDDVLALLKDVQIKAPIVLFTYCNPILNRGLDQIA

arab1 ADTFTAFIPYITAGDPDLSTTAEALKVLDACGSDIIELGVPYSDPLADGPVIQAAATRSLERGTNLDSILEMLDKVQISCPISLFTYYNPILKRGMSSIR

Braol ------DACGSDIIELGVPYSDPLADGPVIQAAATRSLEKGTNLDSILDMLDKVELSCPVSLFTYYNPILKRGMSSIR

Arab2 SETFAALIPYITAGDPDLSTTAKALKVLDSCGSDIIELGVPYSDPLADGPAIQAAARRSLLKGTNFNSIISMLKEVQLSCPIALFTYYNPILRRGMTVIK

Oryz AETFSAFIPFITASDPDLATTSKALKILDSCGSDVIELGVPYSDPLADGPVIQAAATRALKKGATFDSVIAMLKGVELSCPIVIFTYYNPILKRGMAIIK

Zeam SDTMAAFIPYITAGDPDLATTAEALRLLDGCGADVIELGVPCSDPYIDGPIIQASVARALASGTTMDAVLEMLREVELSCPVVLLSYYKPIMSRSLAEMK

triae SDTMAALIPYITAGDPDLATTAEALRLLDACGADVIELGVPCSDPYVDGPIIQASSARALAGGATMDGVLAMLKEVELSCPVVLFSYYRPILCRGLAEIK

Ostlu SEAFKAFIPFICAGDPDLESTKKALKILDDAGADVIELGVPYSDPLADGPVIQAAATRALENGATLNKVIDLVREMQIKAPIVMFTYYNPIYQRGCADIA

Ostta SEQFGAFIPFICAGDPDLESTKKALKILDDAGADIIELGVPYSDPLADGPVIQAAATRALEAGATLDKVIALVKEMQIKAPIVMFTYFNPIYQRGCADIA

Chlre SGTMNAFIPFICAGDPDLDTTSLALRKLDEVGADVIELGVPYSDPLADGPVIQGAATRALDKHTTLDKVIEMVRRTAMKAPLVMFTYYNPIMRKGARTIK

Glovi ARVFAAFIPFITAGDPDLETTAEALLTLDRNGADLLELGLPYSDPLADGPTIQAAATRALARGTTPGAVLDLVARLELRAPLIVFTYFNLILAVGVERLA

Clothe ERAFSAFIPFITAGDPSLEITEQLVYRMAEAGADLIELGIPFSDPVAEGPVIQEADYRALSAGTTTDKIFDMVGRISCDIPIAFMTYANPIFTYGLKRCG

Laccas ADVFKVFIPFIVADDPDFETTVKNVVALAKGGADIVELGIPFSDPVADGPVIQAADLRAFAANVRTKTVFDIVEAAETAVPIVFLTYLNIVFKYGLKRCA

Theet DKKFEALITFITAGDPDIETTYDIVLAIEEVGADIIELGIPYSDPLADGPTIQASSQRALNKGVKIPDIMRIVEKIKSDIPLVYLVYYNSIFKYGLKESK

Metbur ADKFNALLAYVCAGDPDIDSTPRIVDSLIKGGADIIELGLPFSDPVADGPTIQAASERALTAGMNPDRYFELVANLDVQVPLVCMTYYNLIYKRGVKDCI

Metmaz SEKFDALIGYVMAGDPTFEASSEVVKALAKGGADIIELGFPFSDPVADGPTIQVAGQRALAEGMDIERYFAFARALEVDVPLVCMTYYNPVFRYGVENAA

Natpha EDAFDAFVPYLAAGDPDFESSLAYVEALARGGADVIELGLPFSEPIAEGPTIQQAVVRSLEGGMTPERFFEFVETLDVDVPLVCMTYYNLIYQYGVERAA

Methun KEAFELLMTFTVAGDPDFETSLEIIKALENGGADIIELGLPFSDPVADGPVIQQADQRALASGMNTDRFFDLVREVSSDIPLVVLTYTNLILQRDYQDAA

Metjan AEKFEAFVAFYVGGDPNLEISEKALEVICK-HADIVEIGIPFSDPVADGITIQKADVRALNSGMNPLKAFELAKKLAPNVPKVFLTYYNIIFKMGVKKCK

geome TGTFAALVTFITAGDPDLATTEELIPLLAENGADIIELGVPFSDPMADGPTIQLSSERALAAGTTLSRILATVKSVRTQVPIVLMGYFNPIFSYGAADAA

Pelpro THCFNALVTFITAGDPDLATTQAMIPLLQQAGADIIELGMPFSDPMADGPTIQLSSERALAASTTLERILAMVRAVSCQVPIVLMGYLNPIHAYGANDAA

Desac EETFAALIPFITAGDPNMDTTEKIIATLVDAGADLIELGVPFSDPMADGPTIQAASERALAAGATLDSVLDLVERVFSQVPIVLMGYYNPVFCYGAARAA

Ralso AQTFSGLIPFITAGDPYPELTVDLMHALVKGGANVIELGVPFSDPMADGPVIQRASERALAKKIGLRTVLDYVRAFDKTTPVVLMGYANPIERMGAKAAS

Burce QQTFAGLIPFITAGDPDPAKTVEFMHALAEGGADVIELGVPFSDPMADGPVIQRSSERALARGVTLKSVLADVKCFNQTTPVVLMGYANPIERMGATEAQ

Polna ESTFSALIPYVTAGFPFADVTPELMHGMVAGGADVIELGMPFSDPSADGPVIQKAGEKALSFGIGLVQVLEMVRIFDHTTPVVLMGYANPVERYDIRDAA

Nitmu STLFGALIPFITAGDPEPGMMVPLMHELVQAGADVIELGVPFSDPMADGPTIQRSSERALKHRVSLQDVLAMVGEFDSSTPVVLMGYANPVEAMGTARSK

Xylfas DETFRALIPFITAGDPSLEAAVPVMHALVRAGADVIELGVPFSDPMADGPVIQHSSERALQRGVGLAYVLQTVDVFDAVTPVVLMGYLNPLEIYGTQQAL

psefl QTRFAALVTFVTAGDPDYDTSLAILKGLPKAGADVIELGMPFTDPMADGPAIQLANIRALGAKQNLTKTLQMVREFNSDTPLVLMGYFNPIHKFGIAEAK

azovin QTRFAALVTFVTAGDPDYETSLAILKGLPEAGADVIELGMPFTDPMADGPAIQLANIRALGAGQNLVKTLRMVRAFDRTTPLVLMGYFNPIHYYGIAEAR

Meslo DRRMAALVTYFMGGDPDYDTSLSIMKALPGAGSDIIELGMPFSDPMADGPAIQAAGLRALKGGQTLVKTLKMASEFDNETPIVLMGYYNPIYIYGLKDAL

Agrob DKRFAALITYFMGGDPDFQTSLGIMKALPEAGADVIELGMPFSDPMADGPAIQLAGQRALKGGQTLKTTLDLAREFDNATPIVMMGYYNPIYIYGLDDAI

rhopa DTRFAAFVTFVMAGDPDLATSLQVLKALPAAGADIIEIGMPFTDPMADGPAIQAAGLRALHSGATLSHTLGLVRDFDDTTPMVLMGYYNPIYIYGLADAK

Azobr ARRFAGLVTFITAGDPDLETCRAVLHGLPAAGADLIELGLPFSDPMADGPAIQAASLRALHAGTTARKTLDLVRGFDADTPVILMGYYNPIHAYGLADAI

Oceal APTFAGFVAYVMAGDPDADTTLKMMQGLADKGADVLELGVPFTDPMADGPTIQRAAIRALESGMTLKGVLALVKRFHKDTPVVLMGYANPFFAYGASDAA

Pellu ENRITLLLAYYMPEFPVAGSTLPVLEALQDGGADIIELGIPFSDPVGDGPVIQNAAHIAIRNGVSVRSLLELVRKAKITVPILLMGYSNPLIAYGLHDAV

provib ENRITLLLAYYMPEFPVAGATLPVLEALQESGADIIELGIPFSDPVGDGPVIQEAAHRSIANGVSLHRLLDIVGRAKITVPILLMGYCNPLIAYGLTDAT

Clote ENRITLLIAYYMPEFPVPGATLPVLEALQESGVDLIELGMPYSDPIGDGSVIQDAAHKAISHGVHVGSIFELVRRAKITTPILLMGYCNPLIAYGMADAV

Phatr EDAFAAFVTFVTAGYPTAADTPAILMAMQEGGAALIELGIPYTDPQADGATIQHTNQVAIKGGSEIHQCLDMVKKSGLTVPVVLMGYYNPFLQYDCEETK

thaps EQAFAAFVTFVTAGFPVKEDTPAILLAMQAGGASVIELGIPYTDPQADGTTIQQTNQVAIKAGSDITQCLSMLESAGLTVPVVLMGYYNPFFQYGCKKAK

Lacbi RRVFEALVTFVTAGYPRKDDTVPILRAMQAGGADIIELGIPFSDPIADGPVIQETSTIALKNGIDYVTVLGQLREAGLTAPVLLMGYYNPLLAYGIQDAA

copci DCVLTCLRCFRDRRYPKKEDTVPVLLALQAGGADIIELGIPFSDPIADGPVIQEANTVALKNDIDYPTVLGQIREAGLTAPVLLMGYYNPMLAYGIQDAA

Ustma KAVFAVFVSFVTAGFPTKDDTVEVLLALEQGGADVIELGVPFSDPQADGPAIQESNQVALEQGVGYTQCLDYIRQAGLKAPVLLMGYYNPTLAYGVQDAK

Neucr KQTFQALVTYVTAGFPHPEQTPDILLAMEKGGA-VIELGVPFTDPIADGPTIQTANTIALQHGVTLQSTLQMVRDAGLKAPVMLMGYYNPLLSYGLNDCK

Asfum KSTFAALVAYITAGYPTVEETVDILLGLENGGAGIIELGIPFTDPIADGPTIQRANTQALANGVTVTTVLNMVRQAGLKAPLLLMGYYNPVLRYGLKDCK

schipo KKTFLVLVTFVTCGFPNVDETIKIMQGLQNGGAGIIELGIPFSDAVADGPTICKGNEIALKNNITLEKVFETVKLAGVTIPIILMGYYNPIFSYGIQKAK

Canal KETFAALVNFITAGFPTIDDTIPILQNMQNAGVDIIELGVPFSDPIADGPTIQQANNIALDNGITVPKCLELLSQAGVTVPIILMGYYNPILKYGLKDAA

Sacce RQTFAALVTFMTAGYPTVKDTVPILKGFQDGGVDIIELGMPFSDPIADGPTIQLSNTVALQNGVTLPQTLEMVSQAGVTVPIILMGYYNPILNYGIQDAA

Pram AEVIAAFITFVPCGFKTKADTVDILLGLQRGGANIIEVGIPYSDPQADGPTIQRAHQVGVDQGITLHDVLATVSEAGLITPVVLMGYYNNIMQYGCPDAQ

Psoj AEVIAAFITFVPCGFKTKADTVEILLGLQRGGANIIEVGIPYSDPQADGPTIQRAHQVGVDQGITLTDVLATVSEAGLTTPVVLMGYYNNILQYGCPDAQ

Pinf SEVIAAFITFVPCGFKTKADTVDILLGLQRGGANIIEVGIPYSDPQADGPTIQRAHQVGVDQGITLHDVLATVSEAGLTTPVVLMGYYNNILQYGCPDAQ

NAGAAGLIVPDLPLEESTELLLAAGQVGLALIPLVAPTTRRRLARITAAAQAFVYCVTVTGITGTSQNVTGEIEELSKEVREELPMVAGFGIASPEQAVK

AAGVDGLIVPDLPLEENPPLRQTLEPAGLALIPLVAPTTGERLARIAATARGFIYCVSLTGVTGVREGLPPGIDEYLAGVRADLPLGIGFGIGSPDQARL

EAGVDGIIVPDLPYEEQDIIAPLLREANIALIPLVTVTSIERIKKITSESEGFVYAVTVAGVTGVRQNFKDEIHSYLEKVKSHLPVVAGFGISTKEHVEE

QAGAKGLIIPDLPLEEVDYILELCNLYSIELILFVAPTSQSRIQLIASKSPGCIYLVSSCGVTGLRDNFDVKIQHLANNIKSNKLIMLGFGINNPDQISQ

LMGAKGLIVPDLPLEETDELIVICNDNQIELVLFVAPTSMKRINSISKKSPGCIYLVSSTGVTGVRDDIDIKVMELSNYIKKNKFIMLGFGISTPEHIKK

NAGIRGLLIPDLPIEESEYVISVCNLFNIELILLLAPTSRERISKIIKRAPGCIYLVSTTGVTGQKSQLTSQLKELTETVKTNKSIILGFGISTTEQIKE

RAGIKGLLIPDLPIEESDYIISVCKLFNIELIFLLSPTSIERINKIVEQAPGCIYLVSTTGVTGQKPELTGKLKRLTETIKKQKPIILGFGISTAEQIKE

QLEVQGIIVPDLPYDESQILKKKCTINNIALISLIALTSFSRIKKIARNAEGFLYLISKTGVTGGTGKLMNKLKIIIKTIQKSKPVVVGFGINSRRQIKQ

AHNVKGLIVPDLPYEESKTLRQICDRYGLNIIWLISPTTKTRAQELARACKDWIYVISRTGVTGLETEFDKQIPKLIGELKKKAPIALGFGISKSEQVKL

AAGVQGLVVPDLPLEEAETLLKPAAAMGIEVTLLVAPTSIERIQAIATQSQGFIYLVSVTGVTGMRTQVGSRVEELLKNLRSNKPIGVGFGISEPEHALQ

DVGARGLVVPDLPLEEADILLEPAKDIGIELTLLVAPTSKERIKAIAHQSQGFIYLVSVTGVTGMRAQMQTRVEDLLAQMREDKPIGVGFGISQPEQALQ

AAGVAGLVVPDLPLEEAAGLLEPAKEMGIDVILLVAPTSAKRIEAIAHSSQGFIYLVSVTGVTGVRSQLESRVSDLLKQIRGEKPIGVGFGISDAAQARQ

AAGVAGLVVPDLPLEEAAGLLKPATERGIDLILLIAPTSSERIEAIARSSQGFIYLVSVTGVTGMRSQVEGRVLDLLQKVRQDKPLGVGFGISQPAQATQ

QAGIKGLVIPDLPLEEAEPVLAETANLGLELTLLIAPTTPERMRAIATASQGFIYLVSTTGVTGMRQEMASRVQELLHTLRQPKPIGVGFGIASPEHARQ

AAGANGLVVPDLPLEESQRLSEVAAERGIDLILLIAPTSADRIAAISKQARGFIYLVSVTGVTGMRQGMQSRVADLLQEIRQDKPIGVGFGISGAEQARQ

AVGVQGLVVPDVPLEETEMLRKEALNNDIELVLLTTPTTTERMKRIVDASEGFIYLVSSIGVTGARSSVSGKVQSLLKDIKEDKPVAVGFGISKPEHVKQ

DVGVQGLVVPDVPLEETEFLRKEALNNSIELVLLTTPTTTERMKRIVDASEGFIYLVSSIGVTGARASVSGKVQSLLKDIKEDKPVAVGFGISKPEHVKQ

NAGVHGLLVPDVPLEETETLRNEARKHQIELVLLTTPTTKERMNAIVEASEGFIYLVSSVGVTGTRESVNEKVQSLLQQIKESKPVAVGFGISKPEHVKQ

QAGVHGLVVPDLPLEETALLRNEAVMHGIELVLLTTPTTTERMKEIAKASEGFIYLVSSVGVTGARSNVNLRVEYLLQEIKKDKPVAVGFGISTPEHVKQ

EAGVHGLIVPDLPYVAAHSLWSEAKNNNLELVLLTTPAIEDRMKEITKASEGFVYLVSVNGVTGPRANVNPRVESLIQEVKKNKPVAVGFGISKPEHVKQ

EAGVHGLIVPDLPYVAAHALWSEAKKNNLELVLLTTPAIEERMKEITKASEGFIYLVSVNGVTGPRENVNLRVESLIQEIKKDKPVAVGFGISKPEHVKQ

AAGAKGLLVPDIPLEETYDVSEIASKHGIELVLLSTPTTVERAKKIAQATKGFVYLVSVTGVTGVQSNVATRVEQLVEELRSDKPIAVGFGVSEAKHAKQ

AAGAKGLLVPDIPLEETYSMSEIASTHGIELVLLSTPTTVERAKKIAQATKGFVYLVSVTGVTGVQTQVASRVESLVEELRADKPIAVGFGVSQAAQAKQ

EAGAAGLLVPDLPLEETVSVRAACEKAGIELVLLATPTTQARMRAIAQASQGFVYLVSVTGVTGMKEQVSGRVEGLVSELKADKPVCVGFGVSRAEHAKQ

ASGASGLLVPDLPVEEGDALQTAANVQGLDVIWLVAPTSPERLRRIAERTTGFVYLVSTTGVTGARTQVASSVRTSLAQLRATRPVAVGFGISTPEQAHE

ETGIDALIVPDIPFEEKEELAPFCKEYDVRFISMIAPTSKERIRMIAREAEGFIYCVSSMGVTGVREKIGDDAKEMIKIVKEDIPCAVGFGISTPEQAAQ

DLNVAGLVIPDLPYESRDEIVPIAEKYGIDIIPLITPTSGHRIEKIAKSASGFIYVVSSMGITGERDEFFAGLKALVAEIKQNVPTAIGFGIHTPEQAQT

DVGIDGLIIPDLPLEERKDILEEADKYGIYLIPLVAPTSKERIKLITENGKGFVYCVSITGVTGAREDIETDIEEYMKTVSQNMPKAIGFGISTPEMAKK

SSGISGLIIPDLPAEESADLANCCSQEGVDLIFLVAPITDERIEMILSKTSGFVYIVSRSGVTGTRSDVTAATSDIISRVRTDIPKAVGFGISNAEQAAK

EAGISGLIIPDIPVEEAADLKTGCDAHGLDLIFLVAPTTEARIRKILQRGSGFIYLVSRLGVTGARDDVAGSTKELLSRVNTDIPKAVGFGISTGEQAAE

EVGLKGFVVPDLPAEEAGPLREACDEYGLDLVFIVAPTTGDRLERMMEQVSGYVYVQARLGVTGAREDVSDRTAETLARLEADVPKAVGFGISSGEQAEA

DAGIDAVVVADLPYEEAGPYITAAETAGVAPVMMVSTTTPERLSKILTVKSGFIYLVAALGVTGMRQKTDPVAQKLLADLKNDIPIAPGFGISDREQVRE

EAGVSGIIVPDLPIEEADSLYNYCKKYGVDLIFLVAPTTDERLKKILEKCSGFVYVVSVTGITGAREKVAEETKELIKRVKKKIPACVGFGISKREHVEE

AAGVDGVLLVDLPPEEAGEFKACADRHGIDVIFLLTPTSETRIRTVTNRARGFIYYVSVTGVTGVRSGIEASVAGNVNIIKEKVPVAVGFGIATPEQAGE

EAGVDGVLVVDMPPEEAESFLNHANARDLQVAFLLTPTSDSRIATVGRLGRGFVYYVTVTGVTGARQQVSTTLGGELAKVRASVPIVAGFGISTPQQAAD

QAGVDGLLLVDLPSEEREELHIHLKPKGIHLITLLAPTTPDRAAQLLKQAQGFVYYVSMTGVTGTSKVDGSAIESQVVQLREPVPVAVGFGITTEQDAAA

EAGVDGVLVVDYPPEECEAFAKTMRAAGIDPIFLLAPTSEARMAQIARVASGYIYYVSLKGVTGAATLDLDSVAARIPQIRQRLPVGVGFGIRDAATARA

AAGVDGVLVVDYPPEEAGVFAEKMRAAQIDPIFLLAPTSDERIADVGKIASGYVYYVSLKGVTGAGNLDVSSIAGKIPAIKSPVPVGVGFGIRDAETARA

AAGVDGMLIVDYPPEECVEFSARLKAHGMDLIFLLAPTSDARMAQVAQVASGYVYYVSLKGVTGAGTLDVDAVEAMLPRIRRNVPVGVGFGIRDAATAKA

ACGVDGVLIVDYPPEESVKWVEYLKRQNIAPIFLLSPTTQQRVERVASLAEGYVYYVSLKGVTGSLHLDLHDVAEKLDGLRSSIPIGVGFGIRDGATARA

ASGVDGVLLVDLPPEEADEIRAIFSAAGLALIVLASPTTASRLATLSGVAQGYLYYVSFAGVTGADRLDAQSAGDRLRGLRAQVPVVVGFGIRDAASAVV

EAGVDGLIVVDMPPEHNSELCDPAQAAGIDFIRLTTPTTDARLPKVLNGSSGFVYYVSVAGVTGAGAATLEHVEEAVARLRRDLPISIGFGIRTPEQAAS

EAGVDGLIVVDLPPEHNEDLCDPAQAAGLDFIRLTTPTTDKRLPRVLAGSSGFVYYVSVAGVTGAHAASLEHVEQAVARLRRDLPLCIGFGIRSPEHAGS

ASGIDGLIVVDLPPEMDEELCIPALKAGINFIRLATPTTDKRLPKVLQNTSGFVYYVSMTGITGSALADTGKVAAAVNRIKGDLPVCVGFGVKTAEQARV

ASGIDGLIVVDLPPEMDDELCIPALARGINFIRLATPTTGKRLPAVLKNTSGFVYYVSMNGITGSALPDPSLISGAVGRIKAELPVCVGFGVKTADHAKA

AAGVDGLIIVDLPPEEDSELCLPAMKAGLNFIRLATPTTEKRLPAVLANTSGFVYYVSITGITGSASADSAAVGDAVARIKRDLPVCVGFGIRTPEAARA

EAGVDGLIVVDLPPEEDEELCIPALKAGVNFIRLATPTTDKRLPAVLQNTSGFVYYVSIAGITGAASADNAAVGAAVERLKRDLPVAVGFGIKTPEQAAE

AAGADGVICVDIPPEEDTEFRSALDANGLSFVRLAAPTTDKRLPQVVAHTSGFVYYVSTTGVTGAGSGATGDIEAAVARVRAGLPVAVGFGVKTPERAQE

KAGVDGLLIPDLPPEESADFLQRAKSLGLTVVYLISPVTPERIEWIDSLSTDFSYCLAVNATTGADASTEASVDRYLERVRLRKKFVVGFGIRDRARVEH

EAGIDGLLLPDLPPEEAGEFLERAKAFSMTVVFLISPVTPERIEMIDGMSTDFSYCLAVNATTGSEGDTEASVDEYLKRVRRKKKFVVGFGIKDRERVEH

KAGVDGLLIPDLPPEESEDFLERAKHFGLSVIYLISPVTPDRIELIDSMSTDFSYCLAVNATTGDVAGMDEKIAEYLKRVRQKKKFVVGFGIKDRERVRK

AAGADGFIVVDLPPEEGIALNKACIANGLSNIPLVAPTSDKRIASLTDMASTFLYCVSVTGVTGARESLPPDLEEFITRVRSELPLAVGFGISNPEMVNG

ECGADGFIVVDLPPEEGADLAKACNKYGLSNIPLIAPTTDERIGHLAKTASTFIYCVSTTGVTGARSELPSDLDDFIKRVRSDLPLAVGFGISNATMVQS

EAGANGFIMVDLPPEEAISFREKCRKYRLSYVPLIAPSTLHRIKFLASIADTFIYVVSKMGTTGEKIAMNNALPDILARIRSSIPLAVGFGVATRDHFNV

EAGANGFIMVDLPPEEAIAFRQKCAASNLSYVPLIAPSTLKRIQFLASIADSFIYVVSKMGTTGANVAVNEELPTILSRIREHVPLAVEFGVATRDQFNY

AAGANGFIMVDLPPEEAADFRASCTKHGLSYVPLIAPSTTKRIEHLASLADSFIYVVSKMGTTGATAAVSSSLPDLISKIRSSIPLAVGFGVSTRQHFIE

EAGVNGFIIVDLPPEEAVSFRQLCTRGGLSYVPLIAPATDARMRVLCQLADSFIYVVSRQGVTGASGTLNANLPELLARVKKNKPAAVGFGVSTHDHFTQ

EAGVNGFIMVDLPPEEAVRFRDLCASNGMSYVPLIAPATEARMKLLCKIADSFIYVVSRMGVTGATGKLSSNLPELLKRVHQNVPAALGFGVSTREHFLS

EVGANGFIIVDLPPEEAVGFREECKKQGVSFVPLVAPSTDRRMELLASVADSFIYVVSRMGSTGATGVINTALPQLCQRVRKDTPLAVGFGVNTSEHFHQ

EAGANGFIVVDLPPEEAIKFRTECTKYGLSYVPLVAPATNDRLKILGEIADSFIYVVSKMGTTGASTKVSTGIQELCDRVRKDTPLAVGFGVSTREHFLT

KAGANGFIIVDLPPEEALKVRNYINDNGLSLIPLVAPSTDERLELLSHIADSFVYVVSRMGTTGVQSSVASDLDELISRVRKDTPLAVGFGVSTREHFQS

KAGVDGFIIVDLPPEEAKVLSDDAAKHGLSYIPLVSPTTEERMKLIDSVAHGFVYCVSLTGVTGARTELPPNLDSFMTKIRAKHPLALGFGLSTRQHFVQ

KAGVDGFIIVDLPPEEAKPLSDDAAKHGLAYIPLVSPTTEDRMKLIDSVAHGFVYCVSLTGVTGARTDLPPNLDAFMATIRAKHPLALGFGLSTRQHFVL

KAGVDGFIIVDLPPEEAKTLSDDAAKHGLAYIPLVSPTTEERMKLIDSVAHGFVYCVSLTGVTGARNELPPNLDAFMAKIRAKHPLALG------

VAKYCDGVVVGSALVKLVETHLTRQLK

LAPMGDGIIVGSALVDVLY--LVERLR

MVTICDGVVVGSKVIELLENEATKQKE

IINWIDGIVVGSAIITHIVGEFCKKLK

IMKWIDGVVVGSAFVKKLSALLCKSLK

IKGWINGIVIGSAFVKRLSGNFCQDAK

IKGWINGIVIGSAFVKRLSSDFCTTAK

LIEWSNGIVIGSPCVQILLQSLIKQIK

VKSWADGVIIGSACMQILL--WISVMK

VKNWSDAVIVGSACVKRLAQGFCQSLK

VKKWSDAVIVGSAVVKRLAEGFCQNLK

VKEWADAAIVGSAVVKRLAEGLCQSLK

VRDWADAAIVGSAFVQRLATGFCQSLK

VRDWADAAIVGSAFVKRLAE-FCQSLR

VRDWADGVIVGSAFVNRLQE-LCRELR

IAGWADGVIVGSAMVKLLGDALTKSLK

IAGWADGVIVGSAMVRLLGDALTKSLK

VAEWADGVIVGSAMVKILGESFTKSLK

IAGWADGVIIGSAIVRQLGEAYAKNMK

IAQWADGVIIGSAMVRQLGEAYARGMK

IAGWADGVIIGSAMVRQLGEAYARSMK

IVDWADGVIVGSALVRALGEAKAEEIR

IVDWADGVIVGSALVRALGEAKADEIR

IVSWADGVICGSALVKALGEALARELR

VASLADGVIVGSACVQLLATAFCRQLK

MAGFSDGVIVGSAIVKIIAQ-YVRKMK

MAGIADGVIIGSAIVDLVAK-FTKQIR

LKDFSDGIIVGSALVERIAKGFVSILK

IIDAADGVIVGSAFVDIIASGLTAEIK

VRKAADGVIVGSAFVRIIEEGLARELK

VVAAADGVIVGSALVDIVAEGLARELK

WTDAADAVIVGSALVREIEDSLIPRIT

ITEIADGAIVGSAIVKIVEKHFLKELE

VAATADGVVVGSAIVKLFEKHFVSSLK

VAAMADGVVVGSALVKLFQLHFVASLR

IARFSDAVVVGSALVKVIQQHFVAELK

IGGVADAVVIGSRIVQLLEEAFIADIR

VAEVSDAVVIGSRLVQLLESAFIAELR

IGKVADAVVIGSKIIQLIENQFLKEIR

VAELADAVVVGSRIIEEIERSLVKSLR

MAVDADGVVVGSALVTALSDAFLAPLR

IARLADGVVVGSALIDHIANALCSALS

VARLAEGVVVGSALVDRIAKALCRELA

IGANADGVVVGTAIVNAVANVLVSGLA

IGAVADGVVVGSAIVNQIAGSLVKGLS

IAAEADGAVVGSALIDALQKSLVASLA

VARVADAAVVGSAIVTRLAGGFVRELA

IGKVADTVVVGSAIVEELATKLAGTLA

MWRLADGAVVGTALLEHIAGAFWRGLR

MWRFADGAVVGTALLQNIASAFWRTLR

MWELADGAVVGSALLQHVATAFWKSLR

VANMADGVVVGSAILKAMDSLFLAELD

VANIGDGVVVGSAILRAVQSA------

VSDAADGVVIGSRLVSVIKDAFWKEFR

VADAADGVVIGSRIVNAIKAAFWEEVR

VGEHADGVVIGSKLIAKLREAFWKEFE

VGAIADGVVVGSMIITTIQKAFWEEYR

VQELAEGVVIGSQIITVLGQAFWEEFR

VGSVSDGVVVGSKIIDLILKAFWEEFR

VGEVADGVVIGSKIITLIGDSFWKEFR

VGSVADGVVIGSKIVTLCGDAFWEDFK

ASALADGVVIGSKIVKIIEDA------

ASALADGVVIGSKIVKIIEDA------

------

Tryptophan synthase Beta subunit amino acid sequence alignment

66 347

Crypa YKYGGYVPEVINNAMKEIEDAYKISKSEDFINELKKIRKFQPTPIYYAKNLTAEIYLKREDLNHTGAHKLNHCMGEALLAKYMGKKKLIAETGAGQHGVA

Cryho YKYGGYVPEVINNAMKEIEDAYKISKSEDFINELKKIRKFQPTPIYYAKNLTAEIYLKREDLNYTGAHKLNHCMGEALLAKYMGKKKLIAETGAGQHGVA

PhatrB YEYGGQLPPQLVEIMNEISESYKLIRTDAFQIELDSLNKFIPSPIFYARRLTARIFLKREDLNHTGAHKINHCLGEALLAKHMGKTKVLAETGAGQHGVA

Salty YEFGGYVPQILMPALNQLEEAFRAQKDPEFQAQFADLLKYAPTALTKCQNITTTLYLKREDLLHGGAHKTNQVLGQALLAKRMGKSEIIAETGAGQHGVA

Ecoli YEFGGYVPQILMPALRQLEEAFSAQKDPEFQAQFNDLLKYAPTALTKCQNITTTLYLKREDLLHGGAHKTNQVLGQALLAKRMGKTEIIAETGAGQHGVA

Arab1 RKFGGYVPETLMHALSELESAFALATDDDFQRELAGILKYVESPLYFAERLTPLIYLKREDLNHTGAHKINNAVAQALLAKRLGKKRIIAETGAGQHGVA

Arab2 RKFGGYVPETLMHALSELETAFSLATDEDFQRELAEILKYVESPLYFAERLTPLIYLKREDLNHTGAHKINNAVAQALLAKRLGKKRIIAETGAGQHGVA

Oryz1 RTFQGYVGETLMHALSELESAFKLATDDDFQRELAGILKYVLSPLYFVESLTPLIYLKREDNNHTGAHKINNAVAQALSAKRLGKKRIIAETGAGQHGVA

Polyg RKFGGYVPETLMHALDELETAFSLATDVEFQKELDGILKYVETPLYFAERLTPQIYLKREDLNHTGAHKINNAVAQALLAKRLGKTRIIAETGAGQHGVA

Camp RKFGGYVPETLMYALTELESAFSLSGDQVFQKELDGILKYVESPLYFAERLTPEIYLKREDLNHTGAHKINNAVAQALLAKRLGKKRIIAETGAGQHGVA

Oryz2 RKFGGYVPETLMHALTELEAAFALAGDEDFQKELDGILKYVETPLYFAERLTPMIYLKREDLNHTGAHKINNAVAQVLLAKRLGKERIIAETGAGQHGVA

Sorgh RKFGGYVPETLMHALTELENAFALATDEEFQKELDGILKYVESPLYFAERLTPLIYLKREDLNHTGAHKINNAVAQALLAKRLGKQRIIAETGAGQHGVA

Nost RRFGGYVPETLMPALAELETAYQYRNDPGFQAELQQLLRYVATPLYFAERLTAQIYLKREDLNHTGAHKINNALGQVLLAKRMGKQRIIAETGAGQHGVA

Anab RRFGGYVPETLMPALAELETAYKYRHDPGFQAELQQLLRYVATPLYFAERLTAQIYLKREDLNHTGAHKINNALGQVLLAKRMGKQRIIAETGAGQHGVA

Croco RKYGGYVPETLMPALSELETAYRYKNAPEFQAELSQLLKYVPSPLYFAERLTPQIYLKREDLNHTGAHKINNALGQVLLAKRMGKKRIIAETGAGQHGVA

Proma RRFGGYVPETLMPALAELEKKAEAWQDSSFTNELSHLLKYVATPLYEAKRLSPRIWLKREDLNHTGAHKINNALGQALLAIRMGKKRIIAETGAGQHGVA

Chlre RQFGGYVPETLIPALEQLEKDYEAIADPAFKAEMEAILKYVETPLYHAERLSAEIYLKREDLNHTGAHKINNSLGQALLCKRLNKQRIIAETGAGQHGVA

CME KPFGGFVPETLISCLEELEEAYKVSKDPTFQQELAQQLRFAPTPLYFAERLTMQIYLKREDLLHTGAHKINNALGQVLLARRMGRTRIIAETGAGQHGVA

Gloeo RRFGGYVPETLMSALAQLEAAFQYRHDPQFLAEFAGHLRFVPTPLYFAERLTGRIFLKREDLNHTGAHKINNALGQALLALRMGKRRIIAETGAGQHGVA

Exisi QIYGGYIPETLMQAVLELEQAYEVKEDPAFRERMNDLLEYVQTPLYYAEHLTAKIYLKREDLNHTGAHKINNTIGQALLAERMGKRKIVAETGAGQHGVA

Lismo FKFGGFVPETLMKAVKELDEAYASKTDPAFQKELNYYLKYVETPLYFAEQLTAKIYLKREDLNHTGAHKINNTIGQALLARQMGKQKVVAETGAGQHGVA

Asory REFGGYVPESLMDCLAELERGFEALNDPKFWEEYRSYYPYMPSSLHLANRLTANIWLKREDLNHTGSHKINNALGQILLARRLGKTRIIAETGAGQHGVA

Neucr REFGGYVPEALMDCLSELEEGFKIKDDPAFWEEYRSYYPWMPGQLHKAERLTANIWLKREDLNHTGSHKINNALGQLLLARRLGKKKIIAETGAGQHGVA

Sachce RDFGGYVPEALHACLRELEKGFEAVADPTFWEDFKSLYSYIPSSLHKAERLTAQIWLKREDLNHTGSHKINNALAQVLLAKRLGKKNVIAETGAGQHGVA

Cangla RDFGGYVPEALHTCLKELEEGFDAIADPTFWEEFKSLYSYIPSSLHKAERLTAQIWLKREDLNHTGSHKINNALAQVLIARRLGKTEIIAETGAGQHGVA

Ustma RKFGGYIPEALFDAHQELEKAYDALDDPEFWKEFEGYYEYIPSELYHAERMSAQIWFKREDLNHTGSHKINNAVGQILLARRLGKTRIIAETGAGQHGVA

Phatr1 RKFGGYIPETLSVAFEEIEASYELKDDPSFLAELDEYRRFVPTPLHRADRLTATIWLKREDLAHTGAHKINNAIGQALLAKRIGKPRIIAETGAGQHGVA

Thaps RKFGGFIPETLSEAFREIEAEYKVKNDPAFLAELDVYRRFVPTPLHKAERLTATIWLKREDLAHTGAHKINNAVGQALLAKRIGKPRIIAETGAGQHGVA

Pram YEFGGFVAETLIQAHHNLIDEYRATQDASFREELEHLGRYIPTPLYHAKRLTAQIWLKREDLAHTGAHKINNALGQAVLAKRLGKTRIIAETGAGQHGVA

Psoj YEFGGFVAETLIQAHHNLIDEYKATQDPKFREELEHLGRYIPTPLYHAKRLTAQIWLKREDLAHTGAHKINNALGQAVLAKRLGKTRIIAETGAGQHGVA

Pyrab2 WKFGGYVPETLMEPLRELEKAYRLKNDEEFNRQLDYYLRWAPTPLYYAERLTAKIYLKREDLLHGGAHKTNNAIGQALLAKFMGKTRLIAETGAGQHGVA

Pyrfu2 WEFGGYVPETLIEPLKELEKAYRFKDDEEFNRQLNYYLKWAPTPLYYAKRLTAKIYLKREDLVHGGAHKTNNAIGQALLAKFMGKTRLIAETGAGQHGVA

Theko2 FRFGGFVPETLIEPLKKLERAYKFKDDPEFNETLEYYLRWAPTPLYYAERLSAKIYLKREDLLHGGAHKTNNGIGQALLAKFMGKERLIAETGAGQHGVA

Archa2 KEFGGFVPEVLIPPLEELEKAYRFKDDEEFKARLEYYLKYAPTPLYFAENLSVKIYLKREDLLHGGAHKINNTIGQALLAKFMGKKRVIAETGAGQHGVA

Aerpe1 EREESIALLSRILPSRLIEDEYILARWVDIPGEVRKALAIGPTPLIRAEGLEVRIYYKSEAVLPTGSHKINTAIAQAYYAKLDGAKEIVTETGAGQWGLA

Pyroa1 DEGESIGLLSKILPSALIDQEFTAERWVSIPEEVREAYRVGPTPLFRAEGLEVRIYYKYEGVLPVGSHKLNTALAQAYYAKADGAVEVATETGAGQWGMA

Pyrab1 EEPIDIEKLKRIFAEELVKQEISRERYIEIPGELRKLYSIGPTPLFRATNLEARIYFKYEGATVTGSHKINTALAQAYYAKKQGIERLVTETGAGQWGTA

Pyrfu1 EEPVDPKKLERIFAKELVKQEMSTKRYIKIPEEVRKMYSIGPTPLFRATNLEARIYFKFEGATVTGSHKINTALAQAYYAKKEGIERLVTETGAGQWGTA

Pyrho EETINPEKLTRIFAKELIKQEFSEKRYIKIPKEVRELYAIGPTPLFRATNLEARIYFKYEGATVTGSHKINTALAQAYYAKKEGIKRLVTETGAGQWGTA

Theko1 EEPMEPEKLLRIFAEELVKQEMSTDRYIEIPKEVREIYSIGPTPLFRATNLEARIYFKYEGATVTGSHKINTALAQAYYAKRQGIERLVTETGAGQWGTA

AERPE2 KEPVDPSALAKLFPKALIEQEVSRERYIEIPGEVHEAYIFAPTPLLRAVNLEAEIYYKYEGVTPTGSHKINTALAQAYYNKLEGVERLVTETGAGQWGSA

Pyroa2 ------MREVYLWRPTPLIRAKRLEARIYYKFEGVSPPGSHKPNTAVAQLYYISREGVDRVTTETGAGQWGSS

Archa1 AEPVKPEDLEPIFPKGLIQQEMSGERWIRIPEDVREIYRWRPTPLVRAERLEARIYFKYEGASPPGSHKPNTAVAQAYYNAKEGVERLTTETGAGQWGSA

Carhy FQPISPADLESLFPKSLIEQEISTERFIEIPEKVREAYAYRPTPLKRAKKLEARIYFKYEGTNASGSHKLNTALAQAYFNKLDGTEQLTTETGAGQWGSA

Magma GQPIGPDDLAPIFPMAVIAQEVTAERWVDIPEPVREVYRWRPSPLIRARRLEAKIYFKYEGVSPAGSHKPNTSVAQAFYNKQEGVKRIATETGAGQWGSS

Rhoru GKPLVADDLAPIFPRAVIEQEMSAERWIDIPEPVREVYRWRPSPLIRARRLEAHIYFKYEGVSPAGSHKPNTSVPQAFYNQQEGVKKLATETGAGQWGCS

Marib GQPVGPTDLEPLFPASLIEQEMTTEREVEIPEPVRDVYRWRPAPMFRAHRLEAKIFYKYEGGSPAGSHKPNSAIPQAFFNREAGIRRLTTETGAGQWGTS

Rhofer GQPVGPDDLAPLFPMALIMQEVSQEREIEIPEPVRDVFRWRPAPLFRAHRLEAKIYYKYEGGSPAGSHKPNTAVPQAFYNQEAGIKRLVTETGAGQWGSS

Rhopa GQPVGPSDLEPLFPMELILQEVATERYIEIPEPVRDVFRWRPAPLIRARRLEAKIYFKYEGVSPAGSHKPNTAVPQAWYNKQAGIKKLSTETGAGQWGSS

Arab3 KEPIKPEDLAHLFPNELIKQEATQERFIDIPEEVLEIYKWRPTPLIRAKRLEARIYFKYEGGSPAGSHKPNTAVPQAYYNAKEGVKNVVTETGAGQWGSS

Dechar -NPVPPEAMGAIFPGPILEQEMSAERWIAIPEEVRQIYAWRPAPLCRALRLEAKIFYKYEGVSPAGSHKPNSAVPQAYFNKIAGTKRLTTETGAGQWGSS

Magco -KPVTPEMMGMIFPPDILQQEMSTERWIDIPDEVRQILSWRPSPLYRAHRLEAKIYYKYEGVSPAGSHKPNSAVPQAYYNKKAGIKRLTTETGAGQWGSS

Geomet RQPVTPDDLLPIFPMAIIEQEVSGERWIPIPEEVREIYRWRPSPLYRARRLEAKIYYKYEGVSPAGSHKPNSAIPQAFYNKQAGIRRLATETGAGQWGSS

Clothe KKPVTLDELQAIFPMELIQQENSQERWIDIPEEVREMYRWRPSPLYRARALEARIYYKYEGTNATGSHKLNTSLPQAYYNKIAGIKRLSTETGAGQWGSA

Methm RQPINPEALEPIFAKELIRQEMSSDRYIDIPAEILDVYRWRPSPLFRAHQLEAKIYYKYEGVSPAGSHKTNTSIAQAYYNMKEGTERLTTETGAGQWGSA

Chloph MTPISPDDLAQVFPMNLIEQEMSTERWIDIPEEVLSILKWRPSPLYRAKRLEAKIFYKNEGVSPAGSHKPNTAVPQAWYNKQFGIKYLTTETGAGQWGSA

Chloli MLPISPDDLAKVFPMNLIEQEMSTERWIDIPEEVLGILKWRPSPLYRAKRLEAKIYYKNEGVSPAGSHKPNTAVAQAWYNREFGIKYLTTETGAGQWGSA

Provi MVPIGPDDLARVFPMNLIEQEMSTERWIDIPDEIQGILKWRPSPLYRARRLEAKIYYKNEGVSPAGSHKPNTAVAQAWYNKQFGIKYLTTETGAGQWGSA

Pelol LTPIGPDDLARVFPMNLIEQEMSTQRWIDIPEEILGILKWRPSPLYRAHRLEAKIYYKNEGVSPAGSHKPNTAVAQAWYNKQFGIKYLTTETGAGQWGSA

Praes LSPIKPEDLARVFPMNIIEQEVSTERWITIPEPVQDILKWRPSPLYRAKRLEAKIYYKNEGVSPAGSHKPNTAIAQAYYNKEFGIKYLTTETGAGQWGSA

Synfu GKPVEPQDLAAVFPMNLIEQEVSTQSWIDIPEAVLEKYLWRPTPLCRARNFEVKIYYKNEGVSAPGSHKPNTAIPQAYYNKVFGIRRISTETGAGQWGSA

Deshaf GKPCTIEDLQPVFCDELIEQELTKDLYIEIPEDIRDFYKYRPSPLVRAYCLEAEIYYKFEGTNTSGSHKLNSAAAQVYYAKKQGLTSLTTETGAGQWGTA

Sulso1 QAYFSIDLLRSILPKEVLRQQFTIERYIKIPEEVRDRYLIGPTPLFRAKRLEARIYFKYEGATPTGSHKINTAIPQAYFAKEEGIEHVVTETGAGQWGTA

Ferac KSDISINLLNKILPKEVLKQEFTFKRYEKIPDEVMEAYEIGPTPLIRARNLEGKIYYKYEGATATGSHKINTAIPQAYYAMKEGVKGVTTETGAGQWGSA

Sulso2 T--GKLELLKEVLPSKVLELEFAKERYVKIPDEVLERYLVGPTPIIRAKRLEIKIYLKMESYTYTGSHKINSALAHVYYAKLDNAKFVTTETGAGQWGSS

Thevo T--GEFETLKKAVPTKVLEYEFSGERYPKIPGEIYEKYMVGPTPIIRAKNLEIKIYLKMESYTYSGSHKINSALAHVFFAKQDNAKFVSTETGAGQWGSA

LATAAAYFGLECEIHMGEVDVKKEYPNVIRMKILGAKVVCVEFGDKTLKEAVDSAFEAYIKDITFYAIGSVVGMVRDFQSIVGIESREQFLIPDIVTACV

LATAAAYFGLECEIHMGEVDVKKEYPNVIRMKILGAKVVCVEFGDKTLKEAVDSAFEAYIKDITFYAIGSVVGMVRDFQSIVGIESREQFIIPDIVTACV

LATACALIGIECEIHMGQVDVEKEAPNVTKMRILGCKLITVTRGTRTLKDAVDSVFEEYLKDPYFYAIGSVI-----FKASWDLEG-----APDAIVACG

SALASALLGLKCRIYMGAKDVERQSPNVFRMRLMGAEVIPVHSGSATLKDACNEALRDWSGSYAHYMLGTAAGIVREFQRMIGEETKAQILRPDAVIACV

SALASALLGLKCRIYMGAKDVERQSPNVFRMRLMGAEVIPVHSGSATLKDACNEALRDWSGSYAHYMLGTAAGIVREFQRMIGEETKAQILRPDAVIACV

TATVCARFGLECIIYMGAQDMERQALNVFRMRLLGAEVRGVHSGTATLKDATSEAIRDWVTNVTHYILGSVAGMVRDFHAVIGKETRKQALGPDVLVACV

TATVCARFGLQCIIYMGAQDMERQALNVFRMRLLGAEVRGVHSGTATLKDATSEAIRDWVTNVTHYILGSVAGMVRDFHAVIGKETRKQAMGPDVLVACV

TATVCASFGLECIIYMGAQDMERQALNVFRMKLLGAEVREVHSGTATLKDATSEAIRDWVTNVTHYILGSVAGMVREFHKVIGKETRRQALGPDVLVACV

TATVCARFGLECIVYMGAQDMERQSLNVFRMRLLGAEVRGVHSGTATLKDATSEAIRDWVTNVTHYILGSVAGMVREFHAVIGKETRKQALGPDVLVACV

TATVCARFGLQCVIYMGAQDMERQALNVFRMRLLGAEVRAVHSGTATLKDATSEAIRDWVTNVTHYILGSVAGMVREFHAVIGKETRKQALGPDVLVACV

TATVCARFGLQCIIYMGAQDMERQALNVFRMKLLGAEVRAVHSGTATLKDATSEAIRDWVTNVTHYILGSVAGMVREFHKVIGKETRRQAMGPDVLVACV

TATVCARFGLQCIIYMGAQDMERQALNVFRMRLLGAEVRAVHSGTATLKDATSEAIRDWVTNVTHYILGSVAGMVREFHKVIGKETRRQAMGPDVLVACV

TATVCARFGLECVIYMGVHDMERQALNVFRMRLMGAEVRPVAAGTGTLKDATSEAIRDWVTNVTHYILGSVAGMVRDFHAVIGQETRAQALGPDILLACV

TATVCARFGLECVIYMGVHDMERQALNVFRMRLMGAEVRPVEAGTGTLKDATSEAIRDWVTNVTHYILGSVAGMVRDFHAVIGQETRAQALGPDILLACV

TATVCARFGLECVIYMGIHDMERQKLNVFRMKLLGATVQPVSAGTGTLKDATSEAIRDWVTNVTHYILGSVAGIVRDFHDIIGQETRRQCQSPHILLACV

TATVCARFGLECVIYMGQEDMERQALNVFRMKLLGAKVQSVTAGTATLKDATSEAIRDWVTNVTHYILGSVAGLVRDFHSVIGEETKQQCKRPDVLLACV

TATICARLGLKCIVYMGAKDMERQALNVFRMRLCGAEVRPVHSGTATLKDATSEAIRDWVTNVTHYILGSAAGMVREFQSVIGRETKVQAQGPDIVMACV

TATVCARFGLPCVVYMGARDMERQKLNVFRMRLLGAEVRPVHTGTATLKDALSDAIRDWVTNPTHYVVGSVAGMVRDFHYVIGEEVYQQVQRPDILVACV

TATVCARFGLECIIYMGVHDIERQKLNVYRMKLLGAEVRPVAAGTGTLKDATSEAIRDWVTHVTHYILGSAAGMVREFQAVIGRETRVQCLRPDVLIACV

TATVCALLDLECVIFMGEEDIRRQELNVFRMELLGAKVVSVTQGSRTLKDAVNEALRYWVKHVTHYLMGSVLGIVRDFQAVIGQETRTQILKPEAIVACV

TATVAALFNMKCTIFMGEEDVKRQSLNVFRMELLGAKVVSVKAGSRTLKDAVNEALRFWVANVTHYIMGSVLGIVRDYQSVIGIEARKQHLKPDAIVACV

TATVCAKFGMECVVYMGAEDVRRQALNVFRMKLLGASVVAVDAGSRTLRDAVNEALRAWVVDLTHYIIGSAIGIVRTFQSVIGDETKQQMMKPDAVVACV

TATVCAKFGMECTVFMGAEDVRRQALNVFRMKLLGAKVVAVEAGSRTLRDAVNEALRYWVVNLTHYIIGSAIGIVRTFQSVIGNETKQQMLKPDAVVACV

TATACAKFGLTCTVFMGAEDVRRQALNVFRMRILGAKVIAVTNGTKTLRDATSEAFRFWVTNLTYYVVGSAIGLVRTFQSVIGKETKEQFAKPDAVVACV

TATACAKFGLKCTVFMGAEDVRRQALNVFRMRILGAKVVSVTNGTKTLRDATSEAFRFWVTNLTHYVVGSAIGLVRTFQSVIGNETKEQFAKPDVVVACV

TATACAKFGMECVVYMGSEDVRRQSLNAFRMKMLGAKVVAVESGSKTLKDAINEANRDWVTNLTHYIVGSAIGIVRDFQSIIGKEVKQQLQKPDAIVACV

TATICAKLGLDCTVYMGAVDCERQKLNVFRMNTLGAKVVPVQDGQRTLKDAINEAMRDWVTNVTHYLIGSAVGIVRDFQSVMGREMRAQMLKPDAVVACV

TATICAKLGLDCTVYMGAVDCERQKLNVFRMNTLGAKVVPVKEGQATLKDAINEAMRDWVTNVTHYLIGSAVGIVRDFQSVMGKEMRAQMLKPDVVIACV

TATACALLGLDCVVYMGYVDTQRQSLNVFRMKMLGAKVVAVKSGSQTLKDAVNEAIRDWVTNVTHYIIGSAIGIVRDFQAIIGREIKAQLQRPDAIVACV

TATACALLGLDCVVYMGYVDTQRQSLNVFRMKMLGAKVIAVKSGSQTLKDAVNEAIRDWVTNVTHYIIGSAIGIVRDFQAIIGREIKAQLQRPDAIVACV

TAMAGALLGMKVDIYMGAEDVERQKMNVFRMKLLGANVIPVHTGSKTLKDAINEALRDWVATFSHYLIGSVVGIVRDFQSVIGREAREQILDPDVIVACV

TAMAGALLGMKVDIYMGAEDVERQKMNVFRMKLLGANVIPVNSGSRTLKDAINEALRDWVATFTHYLIGSVVGIVRDFQSVIGREAKAQILQPDVIVACV

TAMAGALLGMKVDVYMGAEDVERQKMNVFRMGLLGARVIPVESGSRTLKDAINEALRDWVATFSHYLIGSVVGIVRDFQSVIGREAREQILTPDAVVACV

TAMAAALLGLEAEIYMGAEDYERQKMNVFRMELLGAKVTAVESGSRTLKDAINEALRDWVESFTHYLIGSVVGIVRDFQAVIGKEARRQIIGPDAIIACV

ASTAAALMGLKATVFMTASSFKSKIQRRLLMEAQGARVISSPSGRESLGLAIAEAVEYTLESGRRYLPGSVLEAVLMHQTVIGLEALDQLPEPDVVVACV

VSLAAALFGLRAVVFMTRSSYNSKRQRLTFMRAYGAVVYPSPSGRRSLGIAISEAVEYVLSGERKYLPGSVMEFVLLHQTVIGLEAVRQLPEPDVAVACV

LSLAGALLGLKVRVYMARASYQQKPYRKTLMRIYGAEVFPSPSGKRSLGIAISEAIEDVLKDEARYSLGSVLNHVLMHQTVIGLEAKEQMEEPDVIIGCV

LSLAGALMGIKVRVYMARASYEQKPYRKVLMRIYGAEVFPSPSGKRSLGIAISEAIEDVLKDEARYSLGSVLNHVLMHQTVIGLEAKQQMEEPDVIIGCV

LSLAGALIGLKVRVYMTRASYYQKPYRKILMEIYGAEVFPSPSGKRSLGIAISEAIEDVLSDEARYSLGSVLNHVLMHQTVIGLEAKKQVKEPDVIIGCV

LSLAGALLGLNVRVYMARASYQQKPYRKTIMRLYGAEIYPSPSGRKGLGIAISEAIEDVLRDEARYALGSVLNHVLMHQTVIGLEAQEQMKEPDVIIGCV

LSAAGAYFGVKVRVYMVRVSYLQKPYRRTLMELYGAEVYPSPSGRKSLGIAISEAIEDVINSGAKYSLGSVLNHVLLHQTVIGLEAEKQFRVPDIMIGAV

VAFAASLFGVKATVYMVRASYLQKPYRRVLMELWGAEVVPSPSGRKSLGIAISEAVEDAVRSGAKYVLGSVLNHVLIHQTVIGLEALEQIRDPDYVVGAC

LCFATKLFEMACTVYMVKVSFMQKPYRRVMMETWGGEVIPSPSGRKSLGIAISEAIEDAAKNETKYSLGSVLNHVLLHQTVIGLETKAQLEEPDVLIGCV

LAYAANFFGLKLTVYMVGISYDQKPYRRIFMETFGARVVKSPSGRKSLGIAISEAVEVAMSDPTKYALGSVLDHVLLHQTVIGQEVYEELSEPDVLIACV

MAFAGSLFGLEVEVFMVKVSYDQKPYRRALMETYGATCIASPSGRASLGIAISEAVEIAASRDTKYALGSVLNHVCLHQTIIGEEAMMQMEDPDVVIACT

LAFAGSLFGLEVQVFMVKVSFNQKPYRRALMETYGATCIPSPSGRASLGIAISEAVEVAIAHDTKYALGSVLNHVCLHQTVIGEEALLQMEDPDVVIACT

LSFAGSLFDIDVTVFQVRVSYNQKPYRRAVMETYGANCVASPSGRRSLGIAISEAVELAVQDPTKYALGSVLNHVLLHQTVIGLESMKQMECPDVIVGCT

LAFAGSLYGLEVEVFQVRVSYDQKPYRRALMETYGARCVASPSGRSSLGIAISEAVELAVQRETKYALGSVLNHVLLHQTIIGQEAMLQMQDPDVVVGCA

LAFAGSLFGLDVLVFQVRVSYDQKPYRRALMETYGARCIASPSGRASLGIAISEAVEVAAKNPIKYALGSVLNHVMLHQTIIGEEAIKQFEDPDVVIGCA

LAFASSLFGLDCEVWQVANSYHTKPYRRLMMQTWGAKVHPSPSGRRSLGIAISEAVEVAARNETKYCLGSVLNHVLLHQTIIGEECIQQMEEPDLIIGCT

IAFAGQMFGLPVRVFMVKVSYEQKPFRRSMMQTWGAEVFASPTGRASLGLAISEAVEEAAADPTCYTLGSVLNHVLLHQSVIGLEAKKQFDLPDVIFGPC

IAFAGQMFGLEVEVYMVKVSYHQKPFRRSMMETWGAQVFASPSGRKSLGIAISEAVELAMQDPTNYSLGSVLNHVCLHQTIIGLEAKKQFADPDVVIGCC

LALACSMFGLECTVYMVKVSCTQKPYRKSMMQLWGANVIPSPSGRASLGIAISEAVEDAVSRPTNYALGSVLNHVCLHQTVIGQEAKEQLADPDVVIACC

LSLACNHFGLECTVYMVKVSYEQKPYRRSFMKTFGAQVYASPTGRASLGIAISEAVEDAATHDTNYALGSVLNHVCLHQTIIGLEAKKQLEEPDVVFACC

LSLACNYFDLECKVYMVRSSYYQKPYRKSLMTLWGGNVVPSPSGRKSLGIAISEAVEDAIAHDTKYTLGSVLNHVVLHQTVIGAECKKQLEEPDVVIGCC

LAMSCKLIGIECKVFMVRISFDQKPFRKIMMKTWGAECIASPSGRRSLGIAISEAIEQAVERETRYALGSVLNHVMLHQSIIGLEAQKQFELPDVVIGCA

LAMSCKLIGIECKVFMVRISFDQKPFRKIMMKTWGAECIASPSGRASLGIAISEAIEQAVERDTRYALGSVLNHVMLHQTIIGLEAKKQFDLPDVVIGCA

LAMSCKLVGIECKVFMVRISFDQKPFRKIMMKTWGADCIPSPSGRKSLAIAISEAIEQAVERETRYALGSVLNHVMMHQTIIGLEAKKQLERPDVVIGCA

LAMSCKLVGIECKVFMVRISFDQKPFRKIMMKTWGADCIPSPSGRKSLAIAISEAIEQAVERETRYALGSVLNHVMMHQTIIGLEAKKQLELPDVVIGCA

LAMSCKLIGIECKVFMVRISFDHKPFRKIMMKTWGADCIPSPSGRKSLGIAISEAIELAVERETRYALGSVLNHVMLHQSIIGLEAKKQFEAPDIVIGCA

LSMACRMFGLQCRVFMVRISYDQKPYRRMMMSTWGAECIPSPSGRRSLGIAISEAIEDAVGDEARYSLGSVLNHVLLHQTIIGLEAHRQFEEPDVVMGCA

LSMACAYYQIDLTVYMVKISSEQKPYRKAVIETYGGKVIPSPSGKKSLGCAISEAIEVALKSECRYVLGSVLDHVVLHQTVIGEETKIACEIPDIMIGCV

VALAASMYNMKSTIFMVKVSYEQKPMRRSIMQLYGANVYASPTGRKSLGIAMSEAIEYALKNEFRYLVGSVLDVVLLHQSVIGQETITQLDEADILIGCV

TALAASLYGLPSQIFMVKISFEQKPLRKTVMNLYNGNVVASPSGRKTLGIGISEAVEYALDNDYRYMVASVMNVSLTHQSVIGQETKKQLEEADVLIGCV

VALASALFRMKAHIFMVRTSYYAKPYRKYMMQMYGAEVHPSPSGRQSLGIAISDAVEYAHKNGGKYVVGSVVNSDIMFKTIAGMEAKKQMEEPDYIIGVV

VALASALFGVDSHIFMVRTSFYAKPYRKYMMYMYGAHPHPSPSGKESLGLAISEAIHYALDNGGKYIAGSVINSDILFKTIAGMEAKKQMEEPDYVVGVV

GGGSNAMGIFSGFISDKVGKGNKIGEHAASITYGSEGIMHGFNSIMLKDEEGNPSKVHSIASGLDYPSVGPEIAYLNSIGRTKTVCITDQEAINGFFELS

GGGSNAMGIFSGFISDKVGKGNKIGEHAASITYGSEGIMHGFNSIMLKDEEGNPSKVHSIASGLDYPSVGPEIAYLNSIGRTKTVCITDQEAINGFFELS

GGGCNARGIFTAFLEDPVGRGLETSDHAATMTLGVKGSIHGMNCYNLQDETGEPLPVYSIASGLDYPGVGPQHCLLKDIGRTKYVAVTDQECLDAFMQLS

GGGSNAIGMFADFINDTVGHGIETGEHGAPLKHGRVGIYFGMKAPMMQTADGQIEESYSISAGLDFPSVGPQHAYLNSIGRADYVSITDDEALEAFKTLC

GGGSNAIGMFADFINETVGHGIETGEHGAPLKHGRVGIYFGMKAPMMQTEDGQIEESYSISAGLDFPSVGPQHAYLNSTGRADYVSITDDEALEAFKTLC

GGGSNAMGLFHEFVNDTVGFGLDSGKHAATLTKGDVGVLHGAMSYLLQDDDGQIIEPHSISAGLDYPGVGPEHSFFKDMGRAEYYSITDEEALEAFKRVS

GGGSNAMGLFHEFVDDTVGFGLDSGKHAATLTKGDVGVLHGAMSYLLQDDDGQIIEPHSISAGLDYPGVGPEHSFLKDVGRAEYFSVTDEEALEAFKRVS

GGGSNAMGLFHEFVDDQVGFGVDSVKHAATLTKGDVGVLHGAMSYLLQDDDGQVIEPHSISAGLDYPGVGPEHSFVKDMGRAEYDSATDEEALAGFKRVS

GGGSNAMGLFHEFVDDKVGFGLDSGKHAATLTKGEVGVLHGAMSYLLQDDDGQVIEPHSISAGLDYPGVGPEHSFLKDMKRAEYYSITDEEALEAFRRLS

GGGSNAMGLFHEFVDDKVGFGLDSGKHAATLTKGEVGVLHGAMSYLLQDDDGQIIEPHSISAGLDYPGVGPEHSFLKDIGRAEYYCCTDEEALEAFKRLS

GGGSNAMGLFHEFVDDQIGYGVDTDKHAATLTKGEVGVLHGSLSYVLQDDDGQVIEPHSISAGLDYPGVGPEHSFLKDIGRAEYDSVTDQEALDAFKRVS

GGGSNAMGLFHEFVEDQVGHGVDTDKHAATLTKGEVGVLHGSMSYLLQDDDGQVIEPHSISAGLDYPGVGPEHSFLKDIGRAEYDSVTDQEALDAFKRVS

GGGSNAMGLFYEFVNESIGEGVNTEKHAATLTKGRVGVLHGAMSYLLQDEDGQVIEAHSISAGLDYPGVGPEHSYLKDVGRAEYYSVTDEEALAAFQRLS

GGGSNAMGLFYEFVNESIGEGVNTEKHAATLTKGRVGVLHGAMSYLLQDEDGQVIEAHSISAGLDYPGVGPEHSYLKDVGRAEYYSVTDEQALAAFQRLS

GGGSNAMGLFYEFVKDTIGESIASGKHAATLTKGRPGVLHGAMSYLLQDEEGQVIEAHSISAGLDYPGVGPEHSYLKDSGRAEYYSVTDAEAIAAFQRLS

GGGSNAMGLFHSFIEDLVGDGVNTKRHAATITQGSVGVLHGAMSLLLQDSDGQVQEAHSISAGLDYPGVGPEHSYLNEIGRAEYVAVTDKEALNALELVS

GGGSNAIGIFNEFINDTVGEGVNTTKHAATLTMGTPGVLHGSYSYLLQDDDGQIIDPHSISAGLDYPGIGPEHSFLKDVKRAEYYAVTDAEALEGFQLLS

GGGSNAIGLFRRFVDEPVGRGLDTHEHAATLTKGSVGVLHGSMSYLLQDDDGQIVEAHSISAGLDYPGVGPEHAFMRDTGRAEYLSVTDEEALDAFQLVS

GGGSNAIGLFHDFLDERVGEGIETGKHAATLTAGRAGVLHGAMSYVLQDEQGQVQEAHSLSAGLDYPGVGPEHSYLKDIGRAEYYSVTDSEALAALSLVC

GGGSNAIGMFHPFIDDEVGSGLETEKHAATMSKGEVGVLHGSMMYLLQDEHGQVTEAHSISAGLDYPGVGPEHSLLKDIGRVQYEAVTDQQALDALQLLC

GGGSNAMGLFYPFVDDAVGHGLETEFHAATISKGEIGILHGAMMDVLQDENGQILEAFSISAGLDYPGIGPEHSFFRDLGRAAYHSVTDDEAVEAFQLLC

GGGSNAVGMFYPFSKDTVGDGVDTDRHSATLSGGSKGVLHGVRTYVLQDEHGQISETHSISAGLDYPGVGPELSNWKDSDRAQFVAATDAQALAGFRALA

GGGSNAVGMFYPFSNDPVGDGVDTPRHSATLTAGSKGVLHGVRTYILQNQYGQIEDTHSISAGLDYPGVGPELSNWKDTERAKFVAATDAQAFEGFRLMS

GGGSNSTGMFSPFEHDTVGDGVDTKFHSATLTAGRPGVFHGVKTYVLQDSDGQVHDTHSVSAGLDYPGVGPELAYWKSTGRAQFIAATDAQALLGFKLLS

GGGSNSTGMFSPFEHDTVGDGIDTDYHSATLTAGRPGVFHGVKTYVLQDQDGQVHDTHSVSAGLDYPGVGPELAFWKATGRAEFIAATDAQALEGFKLIS

GGGSNAIGIFHPFVQDKVGDGIDTDRHSATLSRGTPGVLHGVRTYLLQDKFGQITETHSISAGLDYPGVGPEHSFLKDSGRAEYIAATDEEALRGFKLCT

GGGSNAIGAFHPFVNDEVGYGIDKDEHCATLTKGTPGVLQGAMTYVIQQKSGQTLNTHSISAGLDYPGVGPEHAFLKDSGRAVYEAVTDDEALEGFKLMC

GGGSNAIGAFHPFVEDKVGHGIDKDQHCATLTKGTPGVLQGAFTYVIQEKSGQTLNTHSVSAGLDYPGVGPEHAFLKDSGRAKYTAVTDDEALEGFKMMC

GGGSNAIGMFHPFVKDNVGDGVDTPRHSSTLLGGRPGVLHGTKTYIMQDAAGQVMETHSVSAGLDYAGVGPEHAFLKDSGRAKYVSVTDKEALEAFQLIS

GGGSNAIGMFHPFVKDNVGDGVDTPRHSSTLLGGRPGVLHGTKTYIMQDDAGQVLETHSVSAGLDYAGVGPEHAFLKDSGRAKYVSVTDKEALEAFQLIS

GGGSNAMGIFYPFVKDKVGKGIESGKHSASLNAGEIGVFHGMLSYFLQDEEGQIRTTHSIAPGLDYPGVGPEHAYLKESGRAEYVTVTDEEALRAFHELS

GGGSNAMGIFYPFVNDKVGKGLESGKHSASLNAGQVGVFHGMLSYFLQDEEGQIKPTHSIAPGLDYPGVGPEHAYLKKIQRAEYVTVTDEEALKAFHELS

GGGSNAMGIFYPFVNDRVGKGLETGLHAASLNAGELGVFHGMLSYFLQNEEGQITPTHSVSAGLDYPGVGPEHAYLKDSGRAEYVTVTDEEALRAFHELS

GGGSNAMGIFHPFLNDDVGEGIESGRHSASLTAGSKGVLHGMLSYFLQDEEGMMLDTHSVSAGLDYPGVGPEHAYLKETGRCEYVTVNDEEALRAFKTLS

GGGSNFGGFTYPMIGARRTRFIAAETAAPKLTRGEYRYDGLLPLAKMYTLGHRYTPPPSHAAGLRYHGVSPSLSILRRLGLVEAEAIPQEEALASILLMA

GGGSNFAGFTYPMIGMKRTRFVAVEEAAPKLTRGEYKYDFPLPMLKMYTLGHDYVPPAIHAAGLRYHGAAPSLSLLRKLGIVEAVAYPQEEVMRAALLFA

GGGSNFAGLAYPFVKDVDYEFIAVEKAAPTMTRGVYTYDYGTPKLKMHTLGHRYYVPPIHAGGLRYHGLAPTLSVLINHGIVKPIAYHQTEVFEAAVLFA

GGGSNFAGLAYPFVKEVDYEFIAVEKAAPSMTRGVYTYDFGTPKLKMHTLGHRYHVPPIHAGGLRYHGVAPTLSVLVNNGIVKPIAYHQTEVFEAAALFA

GGGSNFAGLSYPFIKDVDYEFIAVERAVPTMTKGVYTYDYGTPKIKMYTLGHTYYVPPIHAGGLRYHGLAPTLSVLMNHGIVKPMAYHQTEVFEAAVLFA

GGGSNFAGLAYPFVRDVKYEFIAVEKAAPSMTRGVYKYDYGTPKMKMHTLGHTYYVPPIHAGGLRYHGLAPTLSVLINHGIVKPVAYHQNEVFQAAHLFA

GGGSNFAGFTYPFIRHRKTRFIAVEKASPSMTRGVYTYDYGTPLLKMHTLGHTYQVPPIHAGGLRYHGVAPTLSVLLKHGIVEARAYHQREVFRAAHMFA

GGGSSFSGLFWPFYYEKSVKFIAVEAAVPTLTRGKYTYDLGTPLIKMYTVGHGYKPPPIHAGGLRYHGCAPALSLLVAEGEVGAVAYKQTEVFEAARLFA

GGGSNFAGLTYPFVNDANLEIIAVEAACPTLTAGEYKYDFGTPLLKMYTLGHDFIPPPIHAGGLRYHGDAPTLCMLVKHGVIKARAVKQLPTFEAGLLFA

GGGSNFGGFTFPFVREKKIEIIAVEKACPTLTEGEYKYDFGTPKMPMYTLGYDFIPPGIHAGGLRYHGDSPLVSLLYKNKIISAKAYPQLKVFEAGVTFA

GGGSNFAGLAFPYIRENKTRVIGVEASCPTLTKGLYAYDFGTPLVKMHTLGATFMPPGSHAGGLRYHGMSPMVSHVKELGLMEARAYHQTECFAAAVQFA

GGGSNFAGLAFPYIREKKIRIVGVEHSCPTLTKGRYAYDFGTPLMKMHTLGATFMPPGSHSGGLRYHGMSPMVSHVKELGLMEARSYHQTECFAAALQFA

GGGSNFAGVAFPFMGHARSRIVAVESACPTLTRGKYAYDYGTPLTKMHTLGSGFTPPGFHAGGLRYHGMAPMVSHAKELGLFDAVSYTQRECFEAGVLFA

GGGSNLAGIAFPFIGHGRPRIVAVEAACPTLTRGKYAYDFGTPLTKMYTLGSQFTPPGFHAGGLRYHGMAPLVSHCKALGLLEAVAYDQVACFEAGVLFA

GGGSNFAGLAFPFLGLQRRRIIAVEAACPTLTRGTYAYDFGTPLVKMHTLGSTFMPPGFHAGGLRYHGMSGMVSHAYELGLIEARAYKQVGCFEAGVQFA

GGGSNFAGLSFPFIREKKPVIRAVESACPSLTKGVYAYDFGTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYEQGFMEAISIPQIECFQGAIQFA

GGGSSFGGIAFPFLADKALRCVAVETSCPTLTKGHYAYDYGTPIMKMYTLGHDFMPPGIHAGGLRYHGDSPLVSQLLHEGQVEALAVPQVATFEAGVQFA

GGGSNFAGTAFPFLADKALRLLAMESSCPTLTRGHFAYDYGTPMMMQYTLGHDFTPPGIHAGGLRYHGDSSQLSQLVHDKIIEARSVNQLDTFRAGVTFA

GGGSNFAGTAFPFLADRAVRCLAVEASCPTLTKGVYAFDYGAPIAMMYTLGHDFMPPGIHAGGLRYHGESALVSQLHHAGLIEAKSYRQNACFEAAHLFA

GGGSNFAGIAFPFLMDKKVRAVAVETACPTLTKGVYAYDYSGPLAKMYTVGHDFVPAGIHAGGLRYHGVSPIVSQLYEDKLIEAKAYGQSSVFEAAVIFA

GGGSNLGGIGLEFIKDREARVVAVESACPSLTKGEYRYDFGTPLLKMYTLGHKHIPPAIHAGGLRYHGDSPIISKLCAEGLMEAVSYGQKEVFDAAVQFA

GGGSNFAGISFPFICDKHIQIIATEEACPTLTKGPYIYDSGTPLLAMHSLGHGFIPPAIHAGGLRYHGMAPLVSHVKQLGLIEATALPQTECYEAALLFA

GGGSNFAGISFPFICDKHVQVIATEEACPTLTRGPYVYDTGTPLLPMHSLGHRFIPPAIHAGGLRYHGMAPLVSHVKQLGLIEATALPQSECYEAALLFA

GGGSNFAGISFPFICDKHVRIIATEEACPTLTRGPYVYDAGTPLLPMHSLGHGFIPPAIHAGGLRYHGMAPLVSHVRQLGLIEANSLPQTECYEAALLFA

GGGSNFAGISFPFLCDKHVQVIATEEACPTLTRGPYVYDSGTPLLPMHSLGHGFIPPAIHAGGLRYHGMAPLVSHVRQLGLIEATALPQTECYQAALLFA

GGGSNFAGISFPFLYDKHIRVIATEEACPTLTRGPYVYDAGTPLLAMHSLGHGFIPPAIHAGGLRYHGMAPLVSHVLHNGLIEATALPQTECYEAALLFA

GGGSNFAGLSLPFVRDKHVSIIAAEASCPTLTRGPFAYDFGTPLLPMYTLGHGFIPASIHAGGLRYHGMAPIVSHLVKEGIVQAQAYDQIETFTAGLKWA

GGGSNYAGLIAPFFGDKQITFIGVEASCPSLTRGKYAYDFGTPLLKMYTLGSGFIPSPNHSGGLRYHGMSGIVSKLYHDGLMEARAVEQSKIFDAATLFA

GGGSNFGGFTYPFIGNKGKRYIAVSAEIPKFSKGEYKYDFPLPLVKMITLGKDYVPPPIYAGGLRYHGVAPTLSLLTKEGIVEWREYNEREIFEAAKIFI

GGGSNFGGFTFPFIPDGDTEIIATTNEVPKFSKGDYKYDLMLPQVRMYSLGADFVPPTIYAGGLRYHGASPSLSLLINKGRIKSDEVTEEEVKEALRTFA

GGGSNYAALAYPFLGDERRKYIASGSEVPKMTKGVYKYDYPLPMLKMYTIGSDFVPPPVYAGGLRYHGVAPTLSLLISKGIVQARDYSQEESFKWAKLFS

GGGSNYAALAFPFLADEQRTYIASGKEVPKMTEGEYRYDYPLPLLKMYTIGYDFIPPAVYAGGLRYHAVAPTLSLLMNKGIVQARDYDQEEAFKWARIFS

RTEGIIPAIESSHAIGYVLKIAKEMK-KILINLSGRGDKDLDFVVQN

RTEGIIPAIESSHAIGYVLKIAKEMK-KILINLSGRGDKDLDCVYKI

RVEGIIPALESAHAVAYATKLAVEIGPTILVNLSGRGDKDADFVANR

RHEGIIPALESSHALAHALKMMREQPELLVVNLSGRGDKDIFTVHDI

LHEGIIPALESSHALAHALKMMRENPDLLVVNLSGRGDKDIFTVHDI

RLEGIIPALETSHALAYLEKLCPTLSDRVVLNFSGRGDKDVQTVAKY

RLEGIIPALETSHALAHLEKLCPTLPDRVVLNFSGRGDKDVQTAIKY

RLEGIIPALETSHALAYLEKLCPTLFDRVVFNFSGRGDKDVDTVAKS

RLEGIIPALETSHALAYLEKLCPTLADRVVLNCSGRGDKDVHTAIKH

RLEGIIPALETSHALAFLEKLCPTLPNKVVLNCSGRGDKDVHTAIKH

RLEGIIPALETSHALAYLEKLCPTLPDRVVVNCSGRGDKDVHTASKY

RLEGIIPALETSHALAYLEKLCPTLPDRVVVNCSGRGDKDVHTASKY

RLEGIIPALETAHAIAYLETLCPQLDGRIVINCSGRGDKDVQTVAKF

RLEGIIPALETAHAIAYLETLCPQLDGRIIINCSGRGDKDVQTVAKF

ELEGIIPALETSHAIAYLETLCPQLEGRIIINCSGRGDKDVQTVAKY

KLEGIIPALETAHAFAWLDTLCPSLAPEIVINCSGRGDKDVNTVAKK

KLEGIIPALETSHAIAYLEKLIPTLKSRVVINCSGRGDKDVNNAMKY

GLEGILPALETSHAFAALKRINDTIPAIVVVNCSGRGDKDVNTVITA

STEGIIPALETAHAFAYLGILASQLHSIVVLNCSGRGDKDMGTVARA

QKEGIIPALESAHAVAHAKELARGMQPVVVICLSGRGDKDVMTVRNA

RTEGIIPALESSHAISYAVKLASKMRPSMVVCLSGRGDKDVNQLKER

QYEGIIPALESSHAIHGAMELAKTMKKNIVLNLSGRGDKDVQSVADE

QLEGIIPALESSHGIWGALELAKTMKPDVVICLSGRGDKDVQSVADE

QLEGIIPALESSHAVYGACELAKTMKPHLVINISGRGDKDVQSVAEV

QLEGIIPALESSHAVYGACERAKTMKPHLIINISGRGDKDVQSVAEV

QLEGIIPALETSHALWSAFQIAKTMKPDIVVSLSGRGDKDVEQIANA

EYEGIIPALETSHAIYYAVKLAKKLGPDIVINMSGRGDKDMPQIAKI

QYEGIIPALETSHAIYYAIQLAKTLGKDIVINMSGRGDKDMPQVAKI

LQEGIIPALESSHAVYYGVKLASTLSAVVVINISGRGDKDMLQVAKE

LHEGIIPALESSHAVHYGVKLASTLAPVVVINISGRGDKDMLQVANV

RTEGIIPALESAHAVAYAIKLAREMSRVIIVNLSGRGDKDLDIVLKV

RTEGIIPALESAHAVAYAMKLAKEMSRIIIVNLSGRGDKDLDIVLKV

RTEGILPALESAHAVAYAMKIAPEMDKIIIVNLSGRGDKDLDIVRRV

KLEGIIPALESAHAIAYAMKMAEEMQRVLVVNLSGRGDKDMDIVRRR

RSEGVVPAPESSHAVALAARIARKLPDVVAFNLSGHGLLDLDALQKA

RTEGIVPAPESAHAIKAAVDLAKKLPRVIAFNLSGHGLLDSDAYEKF

KAEGIVPAPESAHAVKAVIDKALEARRVILFNLSGHGLLDLKGYEDY

KLEGIVPAPESAHAIKATIDKAIEAKRVILFNLSGHGLLDLHGYEEY

KAEGIVPAPESAHAVRAVIDKALEAKRVILFNLSGHGLLDLKGYEDY

KTEGIVPAPESAHAIKGAIDRALEAKRVILFNLSGHGFLDLKGYEDY

KAEGIVPAPESAHAVKAAIDEAIKARDVIAFNLSGHGLLDLQGYREY

ATEGVVPAPESAHAVKAAVDLALQAKRTILFNMSGHGLLDLIAYDEY

RTEGIIPAPETNHAVRAAIDEAIKAREVIVFGFSGHGLLDLQAYDDY

RTEGIIPAPESSHAIAAAIDEAIKCRETIVFNLSGHGYFDLSAYEAY

RAEGIVPAPESSHAVKGAIDEALRCKATILFNLSGHGHFDMQAYTDY

RNEGIVPAPESSHAVKAAIDEALLAKATILFNLSGHGHFDMQAYTDY

RNEGIVPAPEANHAVKGAIDEALRCKRVILFNLCGHGHFDMAAYTSY

RNEGIVPAPESNHAIKGAIEEAMRCKRTILFCLSGHGHFDMTSYQKY

RTEGIVPAPESTHAVRCAIDEALRCKETILFNLSGHGHFDMQAYINY

RTEGIIPAPEPTHAIAATIREALRCKEVILMAMCGHGHFDLTSYDKY

RAEGIIPAPESCHAIRAAIDEALKCKVTILFSLTGHGHFDMASYDKF

QAEGIIPAPESNHAIRGAIEEALRCKETIFFTLSGHGHFDMTSYDRF

RSEGIVPAPESSHAVRAAIDEAVLAKETILFCLSGHGQLDMGAYDAY

RTEGIVPAPESSHAIRAAIDEALLCKEVILFNLSGHGYFDMAAYDNY

RTEGIVPAPESSHAIRCAIDEALEAKQVILFNLSGHGHFDMASYDKY

HTEGFIPAPETSHAIAQTIREAKKAKEVILMNWSGHGLMDLQGYDAF

HTEGFIPAPETSHAIAQTIREAKKAKEVILMNWSGHGLMDLQGYDAF

HTEGFIPAPETSHAIAETIREAKKAKEVILMNWSGHGLMDLQGYDAF

HTEGFIPAPETSHAIAQTIREAKKAKEVILMNWSGHGLMDLQGYDAY

HTEGFIPAPETSHAIAQTIREARHAKEVILMNWSGHGLMDLQGYDAY

QSEGFIPAPETNHVIAAVVREAELARQVILFNWSGHGIIDLPAYDAF

RSEGILPAPESSHALRVAIDKP------

ENQGIVPAPESAHAIRAVVDEAIEARKVIVFNLSGHGLLDLSNYESM

NTQGIIAAPESGHAIASAI-KYVKAHITIVVNVSGHGYLDLSIFGEK

ELEGYIPAPETSHALPILAEIAEEAKKTVLVSFSGHGLLDLGNYASV

EKEGYIPAPETSHALPILKEIADSNRGTVLVSFSGHGLLDLGNYAEA