Amino acid sequence alignments of enzymes involved in the tryptophan biosynthesis pathway
Anthranilate synthase-Alpha subunit amino acid sequence alignment
69 351
Anav2 LDTPVSAWYKVYFLLESVEGIGRYSLLGDPLWILEAGQTPFTALPVKGGLFGFWGYELIRWIEHSQDERIPDGLWMQVDHLLIFDQVKRKIWAIADLAYQ
Nost3 LDTPVSAWYKVYFLLESVEGIGRYSLLGDPLWVLEAGQTPFTALPVKGGLFGFWGYELIRWIEHPQDERIPDGLWMQVDHLLIFDQVKRKIWAIADLAYQ
Syncys LETPVSAWYKVYFLLESVEGIGRYSFLGDPMWVLEAGQVPLDILPVNGGLFGVWGYELIRWMEYEPQPEPPDGIWMQVDHLLIFDQVKRKIWAIADLAYR
Glovi LETPVSAWYRVEFLLESVEGLGRYSFLGEPLWTLAAGQVPFAVLPVHGGLFGYWGYELIRWIEYS-DPDLPDGFWMQVDSLLIFDQIKRKIWVIADLAYR
Syncoc LETPVSAWYRVYFLLESVEGVARYSFLGDPLWVLEIGQRPFAILPVHGGLFGYWGYELIRWIEHPLQPGPPDAVLMQVDSILLFDQVKRKIWVVADTAYA
Silpo LDTPVSLMLKLDFMLESVTGRGRYSIIGKPDLIWRCGLNPLDNLIALAGLFGYLGYDMVRLVEVNPDPLLPDAVMMRPSVVAVLDGVKGEVTVVSWVAYA
Lokve LDTPVSLMLKLNFMLESVTGRGRYSIIGNPDLIWDCGVNPLDALINLAGLFGYLGYDMIRLVEVNPDPLLPDAIMLRPSVVAVLDGVKGDVILVAYVAYA
Oceal LETPVSAYLKLNFLLESVEGRGRYSAIGDPDLIWRCGIAPLQSLLTLAGLFGYLGYDTIRQVEAGPDPLLPEGQLLRPRVMVVFDALRQEILVAARPQVD
Eryli TETPVGAALKLGFLLESVEGRGRYSLLGDPDLVFRAAINSLDALIDVACLVGYFAYETIGQVEAPESELLPDMVFTRPTLLLVFDSLTDNLFIIAWPAIE
Psefl FDTPLSIYLKLNYLLESVQGWGRYSIIGPCRTVLRVGVTPLAFVVPTGGLVGYFGYDCVRYVEPNPDPIVPDILLMVSDAVVVFDNLAGKVHAIVDPAFE
Azovi FDTPLSIYLKLNYLLESVQGWGRYSIIGPARTVLRIGVSPLAFVVPDGGLVGYFGYDSVRYVEPNPDPLTPDILLMVSDAVVVFDNLAGKMHLIVDPAFE
Xanor LDTPLSVYLKLYYLFESVEGFGRYSIIGPARRVYSFGVSPLAEVVPQGGLVGWFGFECIQYIEDKADELTPDILLMLSEELAVFDNLKGRLYLIVDPAYV
Dechar LDTPLSIYLKLYYLLESVQGFGRYSIIGAAQTRIVVGVLPLEFIAPPGGLVGCFGYDTVRYVENKPDEITPDIGLLLSEEIAVVDNLSGKLTLIVEPAYQ
Niteu LDTPLSIYLKLYYLLESILGFGRYSVIGPAEIRLEASVIVLGFIVAPGGLAGYFSYDTIRYIEARPDTITPDILLLLSEELVVMDNLSGKLYLIIDPAYQ
Metfl LDTPLSLYLKLYYLLESVQGFGRYSIIGPARVRIEVGVLPLDFIIPPGGLAGYFGYDTIRYIEAKQDALVPDVLLMVSEEIAVVDNLSGKLYFIVDPAYT
Mothe TETPISLYLKFDFLLESVEGLGRYSLIGDPLLTFTASLSPFKALLPPGGLVGYLGYDMVRELEGPGNDLIPDTHLTLHRCYLVYDHILRTVRITCRGGYE
Nicta HLTPVLAYRCLPFLFESVEPVGRYSVVGQPSMEIVAEILPMTIPPRLGGWVGYFSYDTVRYVEAPEDDRLADIQLGLYEDVIVFDHVEKKAHVIHQLAYL
Camac HLTPVLAYRCLPFLYESVEPVGRYSVVGQPAMEIVAEIVPMTVPPQLGGWVGFFSYDTVRYVEAPQDDRLADIHLGLYDDVLVFDHVEKKVYVIHRLAYM
Catro HLTPVLAYRCLPFLFESVEPVGRYSVIGQPTMEIVAEVMPMVVPPQRGGWVGYFSYDTVRYVEAPVDDRLPDIHLGLYDDVIVFDHVEKKAYVIHRLAYK
Rutgr HLTPVLAYRCLPFLFESVEPIGRYSVVGQPAIEIVAEILPMDVPPQLGGWVGYFSYDTVRYVEAPTDDRLPDVHLGLYDDVIVFDHVEKKAFVIHRLAYN
Arab HLTPILAYRCLPFLFESVEPIGRYSVVGQPTIEIVAGVMPMMVPPQGGGWVGYFSYDTVRYVEAPEDDRLPDVNLGLYDDVIVFDHVEKKAYVIHRINFR
Oryz HLTPVLAYRCLPFLFESVEQVGRYSMVGHPVMEVVAEIMPMQIPPQQGGWVGFFSYDTVRYVEAPQDDRLPDVHLGLYDDVLVFDNVEKKVYVIHNLAFQ
Zeam HLTPVLAYRCLPFLFESVEQVGRYSMVGHPVMEIVADIMPMQIPPQQGGWVGFFSYDTVRYVEAPQDDRLPDVHLGLYDDVLVFDNVEKKVYVIHNVAYQ
Chlre HLTPVSAYRCLPFLLESVVNQGRYSFLGSPALEVVAQLLPMQLPPAAGGWVGYAGYDTEASDP------AYE
Cme DLTPVMAYRRLPFLFESVVGIGRFSYVGSPCMQIIAQIIPWVFMVAEGGWVGFGGYDTVRYSEAPRDDRLPDLHFALYQEVVIFDNVTKTVYIVVPLARA
Cloth METPISLFKRFCFLLESVEGWARYSIIGNPFLVVESYIIPVEIIGANGGAVGYFGYDLIRHYEVPEDDMLPECHFMFTDEVLVYDHLKQKIHIIVHVAYI
Alkme METPITLFKKLNYLLESVERKGRYSYVGDPFMTIKGHEIPLEIVTPFGGAVGFIGYDTIRNYEVNEEDIIPEIHLLLTKEVIVYDHLKHKIIILVVLSYE
Theret ELTPINIFYSLNFLLESANGWGRYSFIGDPYLSILSYKLILDEINSLGGAIGYASYDLIRLYEKNPDEIIPDVYFMFYKSFICYDHLKHRIYVVYYPEYE
Bacha AVTPIHIVQQLAFILESKDDWARFSFIGNPFVELIENSIKSCLQQPLGGAVGYMAYDAIETIEANARESAPLYHFLFCETLIVYDHTEKKLAVIYTERYK
Sachce DLTPHVAYLKLEFLLESAKTLDRYSFIGSPRKTIKTPETTFKVAPKLGGAIGYISYDCVRYFEPLKDVLLPEAYLMLCDTIIAFDNVFQRFQIIHNTGYQ
Cangl DLTPHVAYLKLEFLLESAKTLDRYSFIGSPRKIIKTPETQFKLAPKLGGAIGYISYDCIRYFEDLKDVLLPEAYLMMCDTIIAFDHVFQRFQIMHNTGYS
Yarli ALTPHTAYLKLPFLFESALGLSRYSFIGNPRKMIKTADVQYRQFPTLGGAIGYISYDCIKYFEDLKDVLVPESALMLYDTIVAFDHVYQRLQVITKVAYE
Asfum LLTPTLAYLKILFLYESAATIGRYSFVGEPYKVLKTPECQFRVAPPLGGAIGYVGYDCVRYFEPLRDVLIPESLFMMFKTIVAFDHFFQVIKVVTPIEYR
Schipo MLTPSVAYLKLYFILESVTQVSRYSFIGSPYRILMAGKTEVKTAPSFGGAVGYVSYDCIKYFEPLEDTLLPEAMFFMTDDLVAFDHAYQTVKIISCIAYE
Neucr LLTPSAIYLKLYFLLESATGVGRYSFIGNPRKVLETPVGQDKVLPKLGGAVGYISYDCIKYFEELKDNLIPEALFMLYDTMVAFDHFSSAFTVVTRLAYN
Ustma LLTPVTAYLKLRFLFESVTGIGRYSFIGDPLKIIRTPEGPYKYIPTFGGAIGYISYDCVQYFEELKDVVIPESVFMVADSLIVFDHVFSTVRCVSHLLYA
Pram LDTPVSVLLKLHFLLESVAPIARYSFIGGPLKVVATEQGQFRTVVPMGGAVGYCGFDAIRHFEAQKDVLVPESIYMFFDTVVIFDHVFHSLKIVSRLAYK
Psoj LDTPVSVLLKLHFLLESVAPIARYSFIGGPLKVVATPEGQFRTVVPMGGAVGYCGFDAIRHFEAQKDVLVPESIYMFFDTVVIFDHVFHSLKIVSRLAYK
Natph AVEPLEAYAALHFLLESAEKHARYSFVGDPAAVVTVATVTVATLDVPGGLVGFLAYDAVYDLWER-PDSFPDAQFMLTTKTLVFDDAAGTVSLVCLVLYD
Metac ACSPLELYGALYYLLESVEKSPLFEAICKMEEVCGPTNEVFDALGIEGGAIGYTAYDAIYDSWEKGFESIPDLQYLLVSKSFVLDHLTEEVYIVLFVVYE
Pyrab RVEPIKLYSVINFIFTIIGKEAKLTYISSPEFTVEIGLDKVSNESLKGGFMGYIAYDAVHNYIEE------PSVFGYYPWVFIYNHGKGELRFYYLRIVE
Theco PVDPLKLYSALMFMLRSAEKKARFTYISEPEFVVEVGIDRVSDEALKGGFVGYVSYDSVHSIIEE------PSVFGYYPWTFIYDHSTGALSFFYLRLVE
Nost2 LLTSSYEYPGRPLQLTTRENAFTISSLNRGQVLLPTFAQQRSKQEILLGLYGAFGYDLVFQFEKIARPAQRDLVLYLPDELIVVDYYLQKAYRHQFALPR
Anava LLTSSYEYPGRPLQLTTRENAFTISSLNRGQVLLPTFAQQRSKQEILLGLYGAFGYDLVFQFEKIARPAQRDLVLYLPDELIVVDYYLQKAYRHQFALPR
Nost1 LLTSSYEYPGRPVELSTSGNTFTLTALNRGYVLLPVFKSERSKQEILLGLYGAFGYDLVFQFECLERPQQRDLVLYLPDELIVVDYYQQQAFRLEFILPR
Legpn VFASSFEYPGRPLVMICQSNTICFEALNRGEILLSFYTTERSHQLLLLGLYGAFGFDLIYQFECKPRNEQRDMVLYLPDEIYIVNHRKEEAFVRRFQLSR
Synwol LFSSSYDYPGRPLELRAQGRDFCLEALKSGLKLLFGYGRERSRQVIRLGLYGAFGYDLVYQFEKRPRPEQCDLLLYLPDELTVVDHRMARAYQLNFRIPQ
Bruab VFSSNYEYPGRPVVITSRARTMRIEALNRGVILLRPLALERSRMAIVLGLYGAFGYDLAFQFDKLKRPDQRDLVLFIPDEIFVADHYAARAWVDRFRLDR
Brume VFSSNYEYPGRPVVITSRARTMRIEALNRGVILLRPLALERSRMAIVLGLYGAFGYDLAFQFDKLKRPDQRDLVLFIPDEIFVADHYAARAWVDRFRLDR
Brusu VFSSNYEYPGRPVVITSRARTMRIEALNRGVILLRPLALERSRMAIVLGLYGAFGYDLAFQFDKLKRPDQRDLVLFIPDEIFVADHYAARAWVDRFRLDR
Rhile VFSSNYEYPGRPLGISSFGRDVWIEAYNRGEVILGFTTVERSKMAVTLGFYGAFGYDIAFQFDKLTRPSQRDMVLYLPDEILVVDNYAAKAWVDRFEKAG
Rhiet VFSSNYEYPGRPLGISSFGREVWIEAYNRGEVLLGFTTVERSKMAVNLGLYGAFGYDIAFQFDKLTRPSQRDMVLYLPDEILVVDNYAAKAWIDRFAKAG
Sinme VFSSNYEYPGRPLAISSFGRSLWIEAYNRGEVLLALASVERSKMAVTLGLYGAFGYDLAFQFDKLSRPDQRDMVLFLPDEILVVDHYAAKAWIDRFAKAA
Simed VFSSNYEYPGRPLAISSFGRSLWIEAYNRGEVLLALASVERSKMAVTLGLYGAFGYDLAFQFDKLTRPEQRDMVLFLPDEILVVDHYAAKAWIDRFAKAA
Agrob VFSSNYEYPGRPLGISCFGRKMWIEAYNRGEVLLDFTATERSKIAIVIGLFGAFGYDLAFQFDSLARPEQRDMVLFLPDEILVVDHYSAKAWIDRFEKSS
Meslo VFSSNYEYPGRPLVISARGRAMRIEALNRGEALLPVGGLERSRVAITLGLYGAFGYDLSFQFDKLERKPQRDLVLFLPDEILVVDHYSAKAWTDRYSLPR
Auran VLFSNYEYPGRPLVISSDRRDMRIEALNRGRVLLGFDGNERSRMTIVLGLYGAFGYDLAYQFEVLERDARRDLVLYLPDELLEVDHYSASAFRTRFALEG
Fulpe VLFSNYEYPGRPLVLTSAGRRMVIEALNKGEILLSMAGHERSRIAIVCGLYGAFGYDLAFQFDSLERPAQRDLVLYMPDEVLEVDHYSASAVLSTFAVKG
Oceba LFSSNYEYPGRPVMIEARGRAMRIAALNRGRVLLAMRELERSRAAIVLGLYGAFGYDLAFQFEHLERPAQRDLVLFLADDILVADHYSASAVRYRFDLQG
Azobr LLSSGVEAPGRPLAATARGRTLRIDALNRGRVLLPAAGLERSRQAVLLGLYGAFAYDLAFQFERLERPDQRDLVLYLPDRLVALDPVAGLARLVEFALER
Nitwi VLSSGTTVPGRPLKIETTGFNFTIEAANRGEVLIAFGEPQRTRRALVLGLYGAFAYDLVFQIEKRAREPQRDIVLFIPDRLLAYDRAAGHGVVLSFAQPR
Nitham VLSSGTTVPGRPLKIETTGLNFAIEATNRGEVLIAFGEPQRTRRALVLGLYGAFAYDLVFQMEKRAREAQRDIVLFVPDRLLAYDRATGRGVTLNFALAT
Braja VLSSGTTVPGRPLKLETTGVNFKLEALNRGQVLIAFAEPQRTRRDLVLGLFGAFAYDLVFQIEKRARESQRDIVLYVPDRLLAYDRATGRGVVLSFTLPR
Rhopa MLSSGTTVPGRPLRLVSTGDNFALTALNRGEVLLAFAEPQRTRRALVLGLFGAFAYDLVFQFEKRAREAQRDIVLYVPDRLLAYDRATGRGVHLAFSLSH
Rhoru VIASTYEYPGRPLVLEGKGREAHLLALNRGRALLPAAGAERSRQALAFGLVGAFGYDLGLAFEARPRSAHRDLVLYLPDRLLIDDPEAGGLAERLITLAR
Phatr LLTSSYEFPGRPLEISGRGQACTITALNRGRVLMPAETLERSRQALVLGLYGAFGYDLTFQFEAQERDSQRDLLLYLPDTMLVVDQDKRDAWRVCFNIPR
thaps VLTSGYEFPGRPLEISGNGNKCKITALNRGQVLLQPLNLERSRQALVLGLYGSFGYDLTFQFEAQERDPQRDLLLYLADELVVVDQSRHSAWTVSFSLPH
Thefu VLSSGMEYPGRPLEVVARGRTIGATALNRGRVLLPAAAHERSRRALVLGLYGAFGYDLAFQFEVLPRDPDRDLVLHLPDEIIVHDRKREICQRYSFTLPR
Solus YLSSGYEYPGRPLEIIAYDRRFELRPLNRGREILRLHKLERSKQALILGLVGAFGYDLLFQFEKLPRHGHKDLHLFLCDDIWYMDRKKEQVERFQFQLPR
QACFCASVQKAKEYIKAGDIFQVVISQRLSTQYPFSLYRSLRQINPSPYMYFNFWQIIGSSPEVMVKAEVATVRPIAGTRPRGKTTKEDAELAADLLQDP
QACFCASVQKAKEYIKAGDIFQVVISQRLSTQYPFSLYRSLRQINPSPYMYFNFWQIIGSSPEVMVKAEIATVRPIAGTRPRGKTTKEDAELAADLLQDP
NACFLEEVAIAKDYITAGDIFQVVLSQRLSTIYPFKLYRSLRLINPSPYMYYNFWQIIGSSPEVMVKADMATVRPIAGTRPRGKTHPEDEQLAEELLNDP
LAGYCGAVERAREYIRAGDIFQVVLAQRFSTPFSFDLYRMLRAINPSPYMYYQFFQLIGSSPEVMVRLDLATVRPIAGTRRRGQSEAEDRWLEKDLLADP
QACYCQAVERAKDYIRAGDIFQVVLSQRFTVSLPFRLYRSLRLINPSPYMFLQFLCLIGSSPEVMVKLSIATVRPIAGTRPRGTTPLEDRQLEQELLADP
QAAYKAAVEKAKDYIRAGDIFQVVPAQRWTQEFPFALYRSLRRTNPSPFMYFNFFQVVGASPEILVRVFEVTIRPIAGTRPRGATPEEDRANEADLLADK
QAAYLQAVEKAKDYIRAGDIFQVVPSQRWTQEFPFALYRSLRRTNPSPFMFFNFFQVIGASPEILVRVFEVTIRPIAGTRPRGATPAEDDANEADLLADQ
AARYRQNVETAKDYIRAGDIFQVVPSQRFSAPYPLSLYRSLRRTNPSPFLFFNLFAIVGSSPEILVRLRTVTVRPIAGTRPRGNSEEADQAHEADLRADP
RAGYSSMVAKAKEYITAGDIFQVVLAQRFTCPFPLALYRALRRVNPSPFLFLDLFAVVGSSPEILVRLREVTIRPIAGTRPRGATPEADREAEASLLADA
AGQYERAVDTIKEYILAGDCMQVVPSQRMSIDFPIDLYRALRCFNPTPYMFFNFFHVVGSSPEVLVRVELITVRPIAGTRPRGASEEADLALEQDLLSDA
RGRFERTVARIKDYILAGDCMQVVISQRMSIPFPIDLYRALRCFNPTPYMFFDFFHVVGSSPEVLVRVELVTVRPIAGTRPRGASEEADLALERDLLSDA
RANYHAVVRKAQEYVRAGDIFQVVPSQRLRVPFPVDVYRALRALNPSPYMFLDVTQVVGSSPEILARLRVVTVRPIAGTRPRGATPELDKALEEELLADP
KARFKQAVLKAKAYITEGDIMQVVLSQRMTKPFPLALYRTLRSLNPSPYMYFDFFHVVGASPEILVRLERVTVRPIAGTRKRGASPEEDAALAVELLADE
TSKFIAAVEKAKHYILEGDIMQVVLSQRTSKPYPLALYRALRSLNPSPYMNYHLFHIVGASPEILVRLETVTVRPIAGTRPRGQDTQADLALAADLLADP
QGIFKEAVAKSKQYIFDGDIMQVVLSQRMAKPFPLSLYRALRSLNPSPYMYYDMHHVVGASPEILVRLETVTSRPIAGTRPRGKTREEDIALAEELLADP
EAVFTGMVTKAKEYIAAGDIFQVVLSQRLSLPFALVVYRHLRALNPSPYMYLNFVQLVGASPEMLVRVETIDYRPIAGTRRRGRTAAEDRALAAELLASE
DGKYKNAVLQAKEHIAAGDIFQIVLSQRFERRTPFEVYRALRIVNPSPYMYIQACILVASSPEILTRVKRIVNRPLAGTSRRGKTPDEDVMLEMQMLKDE
DGMYKNAVLQAKDHILAGDIFQIVLSQRFERRTPFEVYRALRVVNPSPYMYLQACILVASSPEILTRVNKIVNRPLAGTTRRGRTPHEDEMLKKQLLNDK
DGMYEKAVLQAKEHILAGDIFQLVLSQRFERRTPFEVYRALRIVNPSPYMYLQACILVASSPEILTRVKTITNRPLAGTTRRGKTPKEDYMLEQQLLNDE
DGMYKEAVLEAKEHILAGDIFQIVLSQRFERRTPFEIYRSLRIVNPSPYMYLQACILVASSPEILTRVKKITNRPLAGTIRRGKTRKEDLVFEKELLNDE
EGMYKEAVVEAKEHILAGDIFQIVLSQRFERRTPFEIYRALRIVNPSPYMYLQVCILVASSPEILLRSKKITNRPLAGTVRRGKTPKEDLMLEKELLSDE
DGKYKNAVMQAKEHIMAGDIFQIVLSQRFERRTPFEVYRALRIVNPSPYMYVQACVLVASSPEILTRVRKIINRPLAGTVRRGKTEKEDEMQEQQLLSDE
DGRYKNAVLQAKEHIMAGDIFQIVLSQRFERRTPFEVYRALRIVNPSPYMYVQACVLVASSPEILTRVSKIINRPLAGTVRRGKTEKEDQMQEQQLLSDE
AGQFMDAVGATKEHIQAGDIFQLVLSQRFERRTPFEIYRALRVVNPSPYMYMQACHIAD------LLADQ
HGWFFHALERISEYIYLGDTFQTVFSQRFERDTPFQVYRALRIVNPSPYMYMRACILISSSPEILCKTRKVWNRPLAGTRPRGSSPEEDARLEQELLADE
SAVFCRNVLKAKQYIRDGDIFQVVLSQRLCVETPFNIYRALRVINPSPYMYLKFYRIIGSSPEMLVRVEIVETCPIAGTRKRGRTKEEDEALEKELLSDE
AGVFMGKVLRAKEYIQNGDIFQVVLSQRLQVEVPFQIYRNLRSISPSPYMYINFYQVVGASPELLVKVKKVETCPIAGTRPRGKTSQEDERLAKELLEDE
EVLFCKIVEKAKEYIEKGDIFQVVLSQRLKAAVPFEIYRRLRSKNPSPYLYIDFFQLLGSSPESLVSVFKVTTNPIAGTRRRGKDEEEDLRLKEELLKDE
CAVFLRDVERIKEYIRAGDVFQAVLSQRFERELALDVYRVLRMINPSPYLYIKFVEVVGSSPERLVQVQHVEIHPIAGTRKRGATREEDEALAKELLADE
AAAYENHVSTLKKHIKKGDIIQGVPSQRVARPTPFNIYRHLRTVNPSPYLYIDCFQIIGASPELLCKSDRVITHPIAGTVKRGATTEEDDALADQLRGSL
HAVYENHVGNLIEHIKKGDIIQAVPSQRVARPTPFNIYRHLRTVNPSPYLYIDCFQIIGASPELLCKSDKVITHPIAGTIKRGATTEEDDELGNQLRNSL
DAKYEQHVTTLKERIKKGDIIQAVPSQRVARPTPFNVYRHLRTVNPSPYLYIDYFQLVGASPELLVKIERIVSHPIAGTVKRGATPAEDNQLAQELLNSE
NGQYERHVTRLKEHISKGDIFQTVPSQRLSRPTPFNLFRHLRTVNPSPYLYIDCFQLVGASPELLVKEERIITHPIAGTVKRGKSPEEDDALAAELRGSL
AAVYKAFVSNLKEHIFNGDIFQAVPSQRIARRTPFNLYRHLRTVNPSPYMYIHCFDIIGASPELLVKSERIINHPIAGTVPRGKTKEEDEAYAKDLLASV
AACYETFVTELKKNIVKGDIIQAVPSQRFSRSTPFNIYRTLRTLNPSPYLFLSCFHIVGASPECLMKTDRIVNHAIAGTIKRGLNSEEDDELAAVLLAST
KANYEAFVTSLRKNIVKGDIIQAVPSQRLKRRTAFNCYRHLRQVNPSPYMYVDCCQLVGASPETLCKIEKVAVHAIAGTVKRGANDVEDAELASQLASSV
EATYMGFVEKLKEHIVDGDIFQAVPSQRLAVPKPLDLYRQMRAINPSPYMYLEMFQIVGASPEMLVKVDIVETHPIAGTRHRGANDEEDAALVAELLADE
EATYMGFVDKLKEHIVDGDIFQAVPSQRLAVPKPLDLYRQMRAINPSPYMFLDMFQIVGASPEMLVKVDVVETHPIAGTRHRGANDEEDEALVKDLLADE
DLTYEDAVETAKEHVLDGDIYQGVISRTRELDGTLGLYRALRDINPSPYMYLLDLAVVGASPETLVSVREVMSNPIAGTCPRGESPVEDRRLAGEMLADD
EALFEESVLQAKEHIFAGDIFQVVLSRKCEFKMPFELYIQLRAINPSPYMYIFELAIVGASPETLLTVHTVIINPIAGTCPRGKSEAEDETLASHMLNDE
RAKFMEMVEKGKEYIFAGDVFQVVLSREYVLRSPVELYRKLISINPSPYTFILEKILVGASPETMGSVETVRINPIAGTAPRGKTPEEDEEIRRRLLSDE
RARFVEIVRAGKEYIYSGDVFQVVLSREYRVRTALEIYKRLVELNPSPYTFILEKTVVGASPETMGSVETFKINPIAGTAPRGRTGEEDRELEKALLSDE
TGQYANLVEQALDYFRRGDLFEVVPSQNFFTEQPSQLFQTLRQINPSPYGLLNLEYLIGASPEMFVRVDRVETCPISGTIRRGEDALGDAVQIRQLLNSH
TGQYANLVEQALDYFRRGDLFEVVPSQNFFTEQPSQLFQTLRQINPSPYGLLNLEYLIGASPEMFVRVDRVETCPISGTIRRGEDALGDAVQIRQLLNSH
TGEYAKLVEFALDYFRRGDLFEVVPSQNFFTEAPSQLFETLKQINPSPYGIFNLEYIIGASPEMFVRVERVETCPISGTITRGHDAIDDAVQIRQLLNSH
EGKYAEVVKKAKEKFSCGDLFEVVPSQTFYTAEPSILFRQMREINPSPYGFVNLEYLVGASPEMYVRVQRVETCPISGTIKRGADAIEDAHNIQTLLDSE
YSQYTALVEKAKQSFQRGDLFEVVPSYSLLEPIPSEAFKNLKRINPSPYGIINLEQLVGASPEMYVRVERVETCPISGTIKRGKDAIEDALQIRKLLNSG
ATPYARLVERAKESFKRGDLFEVVPGQTFYEHTPSEIFRRLKSINPSPYSFINLEYLVGASPEMFVRVNRIETCPISGTIKRGEDAISDSEQILKLLNSK
ATPYARLVERAKESFKRGDLFEVVPGQTFYEHTPSEIFRRLKSINPSPYSFINLEYLVGASPEMFVRVNRIETCPISGTIKRGEDAISDSEQILKLLNSK
ATPYARLVERAKESFKRGDLFEVVPGQTFYEHTPSEIFRRLKSINPSPYSFINLEYLVGASPEMFVRVNRIETCPISGTIKRGEDAISDSEQILKLLNSK
DIAYAELVTKAKESFRKGDLFEVVPGQKFMEDSPSDISKRLKAINPSPYSFINLEYLVGASPEMFVRVSRIETCPISGTIKRGDDPIADSEQILKLLNSK
DIAYADLVVKAKESFRKGDLFEVVPGQKFMEESPSDISKRLKAINPSPYSFINLEYLVGASPEMFVRVSRIETCPISGTIKRGDDPIADSEQILKLLNSK
DIAYAELVVKAKESFRRGDLFEVVPGQKFYEESPSEISNRLKAINPSPYSFINLEYLVGASPEMFVRVSRIETCPISGTIKRGDDPIADSEQILKLLNSK
DIAYAELVVKAKESFRRGDLFEVVPGQKFYEDSPSEISNRLKAINPSPYSFINLEYLVGASPEMFVRVSRIETCPISGTIKRGDDPIADSEQILKLLNSK
DITYSELVVKAKESFRRGDLFEVVPGQKFMEESPSAISRRLKAINPSPYSFINLEYLVGASPEMFVRVSRIETCPISGTIKRGDDPIADSEQILKLLNSK
DAIYANLVRRAMDSFKRGDLFEVVPGQMFYEETPSDISRKLKSINPSPYSFINLEYLIGASPEMFVRVNRVETCPISGTIKRGDDAISDSEQILKLLNSK
GGPYADLVRSAKEKFMRGDLFEVVPGQVFYEEHPSEISRRLKAVNPSPYSFVNLEFLIGASPEMFVRVNRVETCPISGTIKRGDDAIADSEQILKLLNSK
GGSYADLVRSAKERFRRGDLFEVVPGQVFYEEHPSEISRRLKAVNPSPYSFVNLEFLVGASPEMFVRVARVETCPISGTVPRGQDAIGDAEQVLKLLNSK
GGTYAELVREAKKSFRRGDLFEVVPGQVFYEKDPSAIFKRLKKINPAPFGIMNLEYLVGASPEMFVRVTRVETCPISGTIRRGADAIEDSEQILKLLQSK
AGRYQRVVETAKAAFRRGDLFEVVPGQTFAEADPSAVFRRLRAANPAPYEFVNLEFLVAASPEMYVRVARVETCPISGTVARGADALGDSSQILRLLTSA
DTPYQATVETARAAFARGDLFEAVPGQLFAEERPAEVFQRLCRINPSPYGLVNLEFLVSASPEMFVRSDRIETCPISGTIARGADAIGDAEQIRELLNSE
DTGYQATVETARAAFARGDLFEAVPGQLFAEERPAEVFQRLCRINPSPYGLVNLEFLVSASPEMFVRSDRIETCPISGTIARGADAIGDAEQIRELLNSE
ETAYQATVETARAAFARGDLFEAVPGQLFAEDRPAEVFQRLCVINPSPYGLMNLEFLVSASPEMFVRSDRVETCPISGTIARGTDAIGDAEQIRQLLNSE
DTPYQATVEVARAAFARGDLFEAVPGQLFAEERPAEVFQRLCRINPSPYGLMNLEFLVAASPEMFVRSDRIETCPISGTIARGVDAIGDAEQIRQLLNSE
ETAYGAIVRGLKEAFAAGDLFEAVPSRALRRAEPSRLYRRLRAANPAPYLLANLEHLIGASPEMFVRVGRVETCPISGTIARGPDALGDAEAIRTLLNST
TGTFANSVLKAKEEFKVGNLFEAVLSQTFRETVPSTLFRRLRARNPAPYGLINLEYLVGASPEMFVRCERVETCPISGTVARGADALEDAQRVKSLMMNA
EGNFKESVAKAKHEFKMGNLFEAVLSQTFRKEKPSKLFRRLRAKNPSPYGFFNLEYLVGASPEMFVRCERVETCPISGTVARGVDALEDAARVKSLIMNL
HTEYARIVAEAKERFRRGDLFEVVPSHRLYAASPARFYERLRERNPAPYEFLNLEYLVGASPEMFVRVTRVETCPISGTIKRGADAVGDAENIKELLSSA
EGEYMANVEKVREGMRRGDYYEVVLRQTFRTSGASELFRRVQTASPSPYELLQFEQLVGASPEMFVRVERVETCPISGTAQRTGDPLRDADNIRELLVST
KEIAEHVMLVDLGRNDLGRVCASGTVKVDELMVVERYSHVMHIVSNVVGKLAANKNAWDLLKACFPAGTVSGAPKIRAMEIINELEPSRRGVYSGVYGYY
KEIAEHVMLVDLGRNDLGRVCASGTVKVDELMVVERYSHVMHIVSNVVGKLAANKNAWDLLKACFPAGTVSGAPKIRAMEIINELEPSRRGVYSGVYGYY
KEIAEHVMLVDLGRNDLGRVCVQGSVKVNELMVIERYSHVMHIVSNVVGELASDKTAWDLLKACFPAGTVSGAPKIRAMEIINELEPERRGPYSGVYGYY
KEVAEHVMLVDLGRNDLGRVCKPGSVRVDELLTVERYSHVMHIVSNVTGELASGRGAWDLLRATFPAGTVSGAPKIRAMEIIHALEPFRRGPYAGAYGYY
KERAEHVMLVDLARNDLGRVCQLGSVQVDDLMQIERYSHVMHIVSNVVGRLDPQYSAWDLLRATFPAGTVTGAPKIRAMQIIHELEGCRRGPYAGAYGYY
KELAEHLMLLDLGRNDAGRVSKIGTVRPTEKFIIERYSHVMHIVSNVVGELDPDKDALDAFFAGMPAGTVSGAPKVRAMEIIDELEPEKRGIYGGGVGYF
KELAEHLMLLDLGRNDTGKVSKIGSVRPTEQFIVERYSHVMHIVSNVVGELADDQDALSAFFAGMPAGTVSGAPKVRAMQIIDELEPEKRGVYGGGCGYF
KERAEHLMLLDLGRNDVGRVAQAGTVRVTEREIIERYSHVMHIVSNVEGQLADGEDAISALMAGFPAGTVSGAPKVRAMEIIDELEPHRRGIYAGAVGYF
KERAEHLMLLDLGRNDVGRVAAKGTVEVTDSFTVERYSHVMHIVSNVVGQLDPAKDALDALFAGFPAGTVSGAPKIRACEIIAELEPETRGPYAGGVGYF
KEIAEHLMLIDLGRNDTGRVSEIGSVKLTEKMVIERYSNVMHIVSNVTGQLKAGLTAMDALRAILPAGTLSGAPKIRAMEIIDELEPVKRGIYGGAVGYF
KELAEHLMLIDLGRNDVGRVADTGSVKLTEKMVIERYSNVMHIVSNVTGHLRQGLTAMDALRAILPAGTLSGAPKVRAMEIIDELEPVKRGVYGGAVGYL
KERAEHVMLIDLGRNDVGRVAEPGTVKVGEQFVIERYSHVMHIVSEVTGTLKAGLNYSDVLRATFPAGTVSGAPKIRALEIIRELEPVKRNVYSGAVGYI
KERAEHTQLLDLGRNDCGRVARVGSVKLTENMIVERYSHVMHIVSNVEGKLQPGLDALDVLRATFPAGTVSGAPKVRAMEIIDELEPVKRGIYAGSVGYL
KERAEHIMLMDLGRNDIGRVAQTGSVKVTENMQIEYYSHVMHIVSNVEGKLKSGLNAIDVLRATFPAGTVSGAPKVRAMEIIDELEISKRGIYAGAVGYL
KEIAEHVQLMDLGRNDVGRVAQVGSVAVTEKMVIERYSHVMHIVSNVEGRLKSGLHAIDVLKATFPAGTLSGAPKVRAMEIIDELEPSKRGIYGGAVGYL
KERAEHLMLLDLGRNDVGRIAVPGSLQVTRQMVVEYYSHVMHLVSSITARLAPGRSALDALLACFPAGTVTGAPKVRAMEIITELEPVNRGPYAGAVGYL
KQRAEHIMLVDLGRNDVGKVSKPGSVNVEKLMSVERYSHVMHISSTVSGELLDHLTCWDALRAALPVGTVSGAPKVKAMELIDQLEVARRGPYSGGFGGI
KQCAEHIMLVDLGRNDVGKVSKSGSVNVERLMNVERYSHVMHISSTVTGELLDHLTCWDALRAALPVGTVSGAPKVKAMELIDQLEATRRGPYSGGFGGI
KQCAEHIMLVDLGRNDVGKVSKPGSEKVEKLMNIEPYSHVMHISSTVTGELLDNLTSWDVLRAALPVGTVSGAPKVKAMELIDELEVTRRGPYSGGFGGI
KQCAEHIMLVDLGRNDVGKVSEPGSVKVEKLMNIEHYSHVMHISSTVTGELLDHLTSWDALRAALPVGTVSGAPKVKAMEIIDKLEVTRRGPYGGGFGGI
KQCAEHIMLVDLGRNDVGKVSKPGSVEVKKLKDIEWFSHVMHISSTVVGELLDHLTSWDALRAVLPVGTVSGAPKVKAMELIDELEVTRRGPYSGGFGGI
KQCAEHIMLVDLGRNDVGKVSKPGSVKVEKLMNIERYSHVMHISSTVSGELDDHLQSWDALRAALPVGTVSGAPKVKAMELIDELEVTRRGPYSGGLGGI
KQCAEHIMLVDLGRNDVGKVSKPGSVKVEKLMNIERYSHVMHISSTVSGQLDDHLQSWDALRAALPVGTVSGAPKVKAMELIDKLEVTRRGPYSGGLGGI
KEIAEHVMLVDLGRNDVGKVAVSGSVVVQKLMEVERYSHVMHISSTVTGELLPQLDSWDALRAALPAGTVSGAPKVRAMQIIDELEVNKRGPYGGGVGHV
KDRAEHVMLVDLGRNDVGRIAELGSVQVEKLFEIERYSHVMHISSTVTGKLRAELSCWDALRATLPAGTISGAPKIRSMQIIDELEPTKRGPYGGGIGYV
KEIAEHVMLVDLGRNDIGRVSKFGTVAVKNLMHIERYSHVMHVVTNVQGEIREDKTPFDALMSILPAGTLSGAPKVRAMEIIDELETVKRGPYGGAIGYL
KERAEHLMLVDLARNDIGKIANFGTVELKEYMEVYYYSHVMHIVSIVTGSLQEKKDMYDALISCLPAGTLSGAPKIRAMEIIDELENKKRGIYGGAVGYF
KERAEHVMLVDLGRNDIGKVSEFGSVKIERFMEVDFYSHVMHIVSTVSGKLKRGLTAFDALIACLPAGTVSGTPKIRAMEIIDELENVRRSFYAGAVGYF
KERAEHYMLVDLARNDIGRIAEYGTVKTPTLLEIGKFSHVMHIISKVTGELKQTLHPLDALRYGFPAGTVSGAPKIRAMEILNELEPTKRGIYAGAIAYL
KDRAEHVMLVDLARNDINRICDPLTTSVDKLLTIQKFSHVQHLVSQVSGVLRPEKTRFDAFRSIFPAGTVSGAPKVRAMELIAELEGERRGVYAGAVGHW
KDRAEHVMLVDLARNDINRVCDPKTTNVDKLLTIQKFSHVQHLVSQVSGTLRPDKTRFDAFRSIFPAGTVSGAPKVKAMELISELEGERRGVYAGAVGNW
KDRAEHVMLVDLARNDVNRVCDPRSTSVDRLLGVETFSHVQHLVSQVSGVLRPDQTRFDAFRSIFPAGTVSGAPKVKAMELVGELEKEKRGVYAGAVGSF
KDRAEHVMLVDLARNDVNRVCDPTTTQVDRLMVVEKFSHVQHLVSQVSGILRPDKTRFDAFRSIFPAGTVSGAPKVRAMQLIAELEGEKRGVYAGAVGYF
KDRAEHVMLVDLARNDVSRVCDLDTTSVDKLMTIEKFSHVQHLVSQVSGVLRPDKTRFDAFRSIFPAGTVSGSPKVRAIQLVYGLEKEKRGIYAGAVGRW
KDRAEHVMLVDLARNDVNRVCHPSTVKVDRLMRIDRFSHVQHITSEVSGLLRPECTRWDALRSIFPAGTVSGAPKIRAMELIYDLEKEKRGIYAGAAGWF
KDQAEHVMLVDLARNDINRVCDPATTQVESFMNVEKFSHVMHLTSRITGQLRAGKSRFDALRSIFPAGTVSGAPKIRAIELVSELEQEKRGVYAGAVGRI
KERAEHIMLVDLGRNDVGRVAKPGSVRVEHLMQIEKYSHVMHIVSVVKGDLREDRTVYDAYRAMFPAGTLSGAPKVRAMELICSLETERRGVYSGSVGYF
KERAEHIMLVDLGRNDVGRVAKPGSVRVERLMQIEKYSHVMHIVSVVKGDLREDRTVYDAYRAMFPAGTLSGAPKVRAMELICSLETERRGVYSGSVGYF
KERAEHTMLVDLARNDVRRVSEAGSVRVEEFMNVLKYSHVQHIESTVTGRLREDCDAFDATRASFPAGTLSGAPKIRAMEIIDDLERTPRGVYGGGVGYY
KERAEHVMLVDLGRNDVRMVSESGSVKVSGFMKVLKYSHVQHIESTVSGTLRPECDQFDAFRAVFPAGTLSGAPKIRAMEIISEREAVPRGIYGGGVGYY
KERAEHVMLVDLARNDVRKVSKPGSVKLVRFFDVIKYSHVQHIESEVIGELADDKDMFDAIEASFPAGTLTGAPKIRAMEIIDELEKSRRRVYGGAIGYF
KERAEHVMLVDLARNDVRRVSKPGSVRLTRFFDVLKYSHVQHIESEVVGELDEGKNAFDAMEAAFPAGTLTGAPKIRAMEIIDELERSRRKVYGGAVGYF
KDEAELTMCTDVDRNDKSRICEPGSVRVIGRRQIELYSHLIHTVDHVEGILRPEFDALDAFLSHTWAVTVTGAPKRAAMQFIEQHERSARRWYGGAVGYL
KDEAELTMCTDVDRNDKSRICEPGSVRVIGRRQIELYSHLIHTVDHVEGILRPEFDALDAFLSHTWAVTVTGAPKRAAMQFIEQHERSARRWYGGAVGYL
KDEAELTMCTDVDRNDKSRICEPGSVKVIGRRQIELYSHLIHTVDHVEGILRPEFDALDAFLSHTWAVTVTGAPKRAAIQFIEKNERSVRRWYGGAVGYL
KEESELTMCTDVDRNDKSRICEAGSVKVIGRRQIEMYSRLIHTVDHVEGVLRDGFDAVDAFLTHMWVVTVTGAPKIWAMNFIEQHEKSPRKWYAGAVGWF
KDEAELTMCTDVDRNDKSRICEPGSVKVIGRRQIELYSHLIHTVDHVEGILRPDFDALDAFLTHMWAVTVTGAPKRAAIKWLEENEESPRGWYGGAVGYL
KDESELTMCSDVDRNDKSRVCEPGSVRVIGRRQIEMYSRLIHTVDHIEGRLRDGMDAFDGFLSHAWAVTVTGAPKLWAMRFLEENERSPRAWYGGAIGMM
KDESELTMCSDVDRNDKSRVCEPGSVRVIGRRQIEMYSRLIHTVDHIEGRLRDGMDAFDGFLSHAWAVTVTGAPKLWAMRFLEENERSPRAWYGGAIGMM
KDESELTMCSDVDRNDKSRVCEPGSVRVIGRRQIEMYSRLIHTVDHIEGRLRDGMDAFDGFLSHAWAVTVTGAPKLWAMRFLEENERSPRAWYGGAIGMM
KDESELTMCSDVDRNDKSRVCEPGSVKVIGRRQIEMYSRLIHTVDHIEGRLRDDMDAFDGFLSHAWAVTVTGAPKLWAMRFIESHEKSPRAWYGGAIGMV
KDESELTMCSDVDRNDKSRVCEPGSVKVIGRRQIEMYSRLIHTVDHIEGRLRDDMDAFDGFLSHAWAVTVTGAPKLWAMRFIESHEKSPRAWYGGAIGMV
KDESELTMCSDVDRNDKSRVCVPGSVKVIGRRQIEMYSRLIHTVDHIEGRLRDDMDAFDGFLSHAWAVTVTGAPKLWAMRFIESHEKSPRAWYGGAIGMV
KDESELTMCSDVDRNDKSRVCVPGSVKVIGRRQIEMYSRLIHTVDHIEGRLRDDMDAFDGFLSHAWAVTVTGAPKLWAMRFIESHEKSPRAWYGGAIGMV
KDESELTMCSDVDRNDKSRVCEPGSVKVIGRRQIEMYSRLIHTVDHIEGRLRDDMDAFDGFLSHAWAVTVTGAPKLWAMRFIEGHEKSPRAWYGGAIGMV
KDESELTMCSDVDRNDKSRVCEPGSVRVIGRRQIEMYSRLIHTVDHIEGRLREGMDAFDAFLSHAWAVTVTGAPKLWAMRFIEQNEKSPRAWYGGAIGMV
KDESELTMCSDVDRNDKSRVCDPGSVRVIGRRQIEMYSRLIHTVDHIEGRLRDGMDAFDAFLSHAWAVTVTGAPKLWAMRFIENNEKSPRAWYGGAVGMV
KDESELTMCSDVDRNDKSRVCDPGSVKVIGRRQIEMYSRLIHTVDHITGVLRDGMDAFDAFLSHAWAVTVTGAPKLWAMRFIEKHEKSPRAWYGGAVGMV
KDESELTMCSDVDRNDKSRVCDPGSVRVIGRRQIEMYSRLIHTVDHIEGRLREGMDAFDAFLSHAWAVTVTGAPKLWAMRFIERHEKSTRFWYGGAVGAM
KDAAELTMCTDVDRNDKARVCEPGSVRVIGRRMIELYSRLIHTVDHVEGRLRPGLDALDAFLTHTWAVNGTGAPKRWAMQFLEDTEQSPRRWYGGAFGRL
KDEFELNMCTDVDRNDKARICMPGTIKVLARRQIETYSKLFHTVDHVEGILRSGFDALDGFLTHAWAVTVTGAPKKWAIQFVEDNERSTRRWYAGAFGVV
KDEFELNMCTDVDRNDKARICMPGTIKVLARRQIETYSKLFHTVDHVEGILRPGFDALDAFLTHAWAVTVTGAPKKWAIQFVEDNERSSRRWYAGAFGVV
KDEFELNMCTDVDRNDKARVCVPGTIKVLARRQIETYSKLFHTVDHVEGMLRPGFDALDAFLTHAWAVTVTGAPKLWAMQFVEDHERSPRRWYAGAIGAV
KDEFELNMCTDVDRNDKARVCVPGTIKVLARRQIETYSKLFHTVDHVEGMLRPGFDALDAFLTHAWAVTVTGAPKLWAMQFVEDHERSSRRWYAGAIGCV
KDEAELTMCTDVDRNDKARVCVAGSVTVIGRRQIELYSRLIHTVDHVEGRLRPELDALDAFLSHCWAVTVTGAPKRAAMAAVEAVERAPRAWYGGAIGRL
KEESELTMCTDVDRNDKSRICEPGSVQVIGRRQIEMYSRLIHTVDHVEGYLRPEFDALDAFLCHTWAVTVTGAPKTWAIQFVEDNERSPRCWYGGAVGMV
KEESELTMCTDVDRNDKSRICEPGSVKVIGRRQIEMYSRLIHTVDHVEGYLRPEFDALDAFLCHTWAVTVTGAPKTWAIRFVEENERSPRCWYGGAVGLV
KEESELTMCTDVDRNDKSRVCVPGSVRVIGRRQIEMYSRLIHTVDHIEGILRPELDAIDAFLTHMWAVTVTGAPKTWAMRFIEQHESSPRRWYGGAVGVI
KEESELTMCTDVDRNDKSRVCEPGTVKVIGRRLIESYAGVFHTVDHVEGILQEGFDALDAFLSHMWAVTVIGAPKKAAAQTVEALERNARGWYGGAVGMI
DFEGQLNSAIAIRTMVVRVTVQAGAGLVADSDPEKEYEETLNKARGLLLAI
DFEGQLNSAIAIRTMVVRVTVQAGAGLVADSEPEKEYEETLNKARGLLLAI
DFEGQLNTAIAIRTMVVQVSVQTGAGIVADSDPQKEYEETLNKARGLLEAI
SFDGQLNTAITIRTLVVHANIQAGAGLVADSVPETEYEETLNKARGMLETI
DFSGQLNTAITIRTLLVHVSLQAGAGIVADSDPEREYQECLNKARGMLMAV
SAGGDMDMCIALRTAIVKLYIQAGGGVVYDSDPEAEFMETVHKSNAIRRAA
SAGGDMDMCIALRTAVLQLYIQAGGGVVYDSDPEAEYQETVHKSNAIRKAA
GANGDMDMAIALRTAIVKMHVQAGAGVVLDSDPESEHQETVNKARALFRAA
APDGSFDSCIVLRTAVLKMHVQAGAGIVADSDPAYEQRECEAKAGALIAAA
AWNGNMDTAIAIRTAVIKLHVQAGGGIVADSVPTLEWEETLNKRRAMFRAV
AWNGNMDTAIAIRTAVIKLHVQAGAGIVADSVPALEWEETLNKRRAMFRAV
GWHGDADTAIAIRTAVIQLYVQAGGGVVYDSDPDLEWQETMNKGRALFRAV
GFNGDMDVAIAIRTAVLKLYVQAGAGIVADSDPNSEWTETLNKARAVLRAA
EFNGDMDLAIAIRTGLIKLHVQAGAGIVADSVPQSEWTETCNKARAVLRAA
GFNGDMDLAIAIRTGVIKLYSQAGAGIVADSIPENEWIETQNKARAVLRAA
GLHGNLDTCIAIRTIVFAAFIQAGAGIVADSDPEAEYEETLNKARALLQVL
SFSGDMDIALALRTMVFLAHLQSGAGIVADSNPDEEQIECENKVAGLCRAI
SFSGDLDIALALRTIVFPAYLQAGAGIVADSDPDDEQRECENKAAGLARAI
SFSGDMDVALALRTIVFSAHLQAGAGIVADSDPADEQRECENKSAALARAI
SFTGDLDIALALRTMVFQAHLQAGAGIVADSDPADEQRECENKAAALARAI
SFNGDMDIALALRTMVFPAHIQAGAGIVADSNPDDEHRECENKAAALARAI
SFDGDMLIALALRTIVFSAHLQAGAGIVADSSPDDEQRECENKAAALARAI
SFDGDMQIALSLRTIVFSAHLQAGAGIVADSSPDDEQRECENKAAALARAI
SFTGAMDMALGLRTMIIPVHIQAGAGIVADSKPEAEYEETVNKAAALGRAV
SFDGEMDVALALRTMVIPVHIQAGAGIVLDSNPESEYLETINKAAALGRAI
SFNGNLDSCITIRTIILKAYVQAGAGIVADSVPEREYEECYNKAMALLKAI
GFDGNMDMCIAIRTLLIKAYLQAGAGIVADSNPEAEYKETLRKLDALVETI
SYNGNMDMCIAIRTILFKAYVQAGAGIVYDSIPEMEYCETLNKAMALKEVL
GFDGNIDSCIAIRTMIVKAYIQAGAGIVADSVPENEYEETRNKAKALLKAV
SYDGTMDNCIALRTMVYKAYLQAGGGIVYDSDEYDEYVETMNKMMANHSTI
SYDGTMDTCIALRTMVYKAYLQAGGGIVFDSDKYDEYIETMNKMMANHNTI
GYNGAMDTCIALRTMLLKAYLQAGGGIVFDSDEYDEYVETINKMMANNRCI
GFDSAMDTCIALRTMLLKAYLQAGGGIVFDSDPYDEYVETLNKLGANIQCI
GYEDNMDTCIAIRTMVYKVYLQAGGGIVFDSDEQDEYVETLNKLRSNVTAI
AYDVQMDTCIAIRTMLVKAYLQAGGGIVFDSEKTEEWMETMNKLAANLRCI
DFAHEMDVCIAIRTMTFKAYLQAGGGIVYDSIEEDEYIETINKLRANIRCI
SFSGFLDTAIAIRTMVVKVYSQAGGGIVYDSDPQAEYMETVNKLGSAIKTL
SFSGFLDTAIAIRTMVVKVYSQAGGGIVYDSDPQAEYMETVNKLGSAVKTL
SWSGDADFAIVIRTATVEITVQAGAGIVADSDPESEYEETEQKMDGVLEAI
SWNGDADFAIVIRTLLIQASVQAGAGIVADSDPAYEFRETDRKMAAMLTAI
SITGYADFAIAIRMAEIEAHVRAGAGIVADSIPEKEFYETENKMKAVLKAF
SLTGDADMAIAIRMAEIEASVRAGAGIVADSVPEKEFFETENKMRAVLKAL
GFNGNLNTGLTLRTIRLQAEVRVGATVLYDSIPSAEEEETITKATALFETI
GFNGNLNTGLTLRTIRLQAEVRVGATVLYDSVPSAEEEETITKATALFETI
NFNGNLNTGLILRTIRLQAEVRVGATLLYDSIPQAEEQETITKAAAAFETI
GFDGNLNTGLVLRTVRIEAEIRVGATLLYDSIPEAEEEETRLKASAFLDIL
SFNGDLNTGLTLRSIRIKAEIRVGATLLMDSIPEEEEAETLVKAAAMLKAI
HFNGDMNTGLTLRTIRIKAEIRAGATLLFDSNPDEEEAETELKASAMIAAV
HFNGDMNTGLTLRTIRIKAEIRAGATLLFDSNPDEEEAETELKASAMIAAV
HFNGDMNTGLTLRTIRIKAEIRAGATLLFDSNPDEEEAETELKASAMIAAV
GFNGDMNTGLTLRTVRIKAEVRAGATLLNDSIPDEEEAETELKASAMLSAI
GFNGDMNTGLTLRTVRIKAEVRAGATLLNDSIPEEEEAETELKASAMLSAI
GFNGDMNTGLTLRTIRIKAEVRAGATLLYDSNPEEEEAETELKASAMIAAI
GFNGDMNTGLTLRTIRIKAEVRAGATLLYDSNPEEEEAETELKASAMIAAI
GFNGDMNTGLTLRTIRIKAEVRAGATLLNDSNPQEEEAETELKASAMISAI
NFNGDMNTGLTLRTIRIKAEVRAGATLLFDSIPEEEEAETELKASAMLSAI
HFNGDMNTGLTLRTIRLKAEVRAGATLLYDSNPEDEEAETELKASALIGAI
HFNGDLNTGLTLRTIRLKAEVRAGATLLFDSDPDAEEAETELKASALLGAI
GFDGDMNTGLTLRTIRIKAEVRAGATLLYDSDPDEEEAETELKASAMRAAI
GFDGGMDTGLTLRTIRMAAYVRAGATLLSDSDPDAEDAECRLKAAAFRDAI
GFDGSINTGLTIRTIRMKAEVRVGATCLFDSKPELEDRECQTKAAALFQAL
GFDGSINTGLTIRTIRMKAEVRVGATCLFDSKPELEDRECQTKAAALFQAL
NFDGSINTGLTIRTIRMKAEVRVGATCLFDSDPAAEDRECQVKAAALFQAL
NFDGSINTGLTIRTIRMKAEVRVGATLLFDSDPVAEEKECQTKAAALFQAL
GFDGTLDTGLVLRTIRLRAEVRVGATLLHRSDPEEEEAETLLKASALLALL
GFDGGLNTGLTLRTVRVKAEVRAGATLLFDSEPEAEEKETELKASAMIDAI
GFDGSMNTGLTLRTVRVKAEVRAGATLLYDSDAEAEELETELKASAMLDAI
NFDGSMNTGLTLRTAHIRATVRAGATLLYDSDPEAEERETFLKARALLETL
SLGGDINTGILIRTTYLRASYPVGATLLFDSVPVMEERETRLKATGFFRTL
Anthranilate phosphoribosyl transferase amino acid sequence alignment
49 301
Thaps PYIETLIAGRLTSDETYDAFSLILSTIASLLTLLRARRENPQEIAGMVRAMNDACVKFELLDIVGTGGDGADTINISTASVVLAAACGCTVAKAGNRSVS
Phatr PYIEILIQGPLTADETEAAFSEILQAVGSLLTLLRARGETPSEIAGMVRAMNKACVDLGLLDIVGTGGDGADTINISTASVVLAAACGCIVAKAGNRSVS
Oryz KVLETLIGGHFSEEEAEATLRLLLEEIAAFLVLLRAKGETYEEIVGLAKAMIGCCVDGLAVDIVGTGGDGADTVNISTGSTILAAAAGAKVAKQGSRASS
Arab QLIETLIDRDLSETEAESSLEFLLNAISAFLVLLRAKGETYEEIVGLARAMMKHAVEGLAVDIVGTGGDGANTVNISTGSSILAAACGAKVAKQGNRSSS
CHlre EVIEKLIVRDLTEKQAEEALGTLLDFAAAFLVLLRAKGETPAEIAGLAKAMLDKAVKTSVVDIVGTGGDGIGSVNISTGASILAAAAGAKVAKHGNRSVS
Pram TFAVQQIASAVENPVAVGVLLALLAEVAAFAKHMRSEAVS------VTSGTLDIVGTGGDGANTVNLSTAAAVLAASCGALVAKHGNRSVS
Psoj ------MRSEAVS------VSSGTLDIVGTGGDGANTVNLSTAAAVLAASCGALVAKHGNRSVS
CME RLVERLIAAELSFDEAADAMHCMLEEVAAFLVLLRRNGAETGQLAGMAAALLERAVSTGTLDIVGTGGDGSNTVNISTAAAIVAAACGARVAKHGNRSAS
Desre TEAIQKVVANLSEAEAMETMQEVMEAIASLLTALHLKGETVPEITGFARTMRTKVVQTKLVDTCGTGGDGANTFNISTACAFVLAGAGLPVAKHGNRSVS
Moorth KAQISKVVAHLSEAEAAEAMDIIMAAIAAFLTALRLKGEMVDEITGFARSMRRRALTTSFVDTCGTGGDGRQTFNISTTAAFVVAGAGVAVAKHGNRSVS
Geosu KKAIAKVVEDLTEAEMIEVMDQIMSAIAAFITALRMKGETVEEITGAARVMRDRAIRVGILDVVGTGGDGTNTFNISTTVSFVVASCGVKVAKHGNRAVS
Cloth KKAISKLVENLSESEIIEALDCIMEAIGSFITALRIKGETIEEITGCAKVMRAKAICPNYIDTCGTGGDGTNTFNISTATAFVAAAGGVYVAKHGNRSVS
Alkam QQAIDKVIRDLAETEMMAVMQGIMEVIGGFLTALRMKGETVEEITASAKVMRSKAVEVNSIDTCGTGGDQANTFNISTAVAFVAAAAGVTVVKHGNRSVS
Metha KEYIKKLEEDLSSEEAEAAIGEILSAIGAFLLALRAKGEKPQEIAGFVRGMKQAGIKPVVIDTCGTGGDGLNTINVSTAAAIVTAAAGVPVAKHGNRAAT
MethM KEYIKKLEEDLSSEEAEAALEEVLSAIETFLLALKAKGEKPQEIVGFVRGMKKAGIKPNIVDTCGTGGDGLNTINVSTAAAIVTAAAGVPVAKHGNRAAT
MethS MNYLARLIENLTIEEAESLLGAFFDAIASALTALRMKGETAEELAGMAKRMRESAIRPRLVDTCGTGGDSTNTINVSTAAAIVAAACGVPVAKHGNYAVS
Nathp KDYIRRVSDDLTQAEAREAASLVFEAIGALLSALRAKGETEPEIAGFAEGMRAAAIDPDLVDTCGTGGDGHDTINVSTTSSFVVAGAGVPVAKHGNYSVS
Halmar KEYIERVTDDLTQAEARAVATTVFEAIGALLTALRAKGETEAEIAGFAEGMRDAAIRPDLVDTCGTGGDDYNTINVSTTSAIVAAGAGVPIAKHGNYSVS
Lacca KQAIEKVVNNLTFEESEAVLDEIMNATASLLTALTAKNPTIDEIAGAAASMRSHAFPETVLEIVGTGGDHANTFNISTTSAIVVAATGTQVAKHGNRAAS
Rhopa KAIIAKVATTLTRDEAASAFDGMMSAMGALLMGLRVRGETVDEITGAVSTMRAKMVDAPAVDIVGTGGDGSGSVNVSTCASFIVAGCGVPVAKHGNRALS
Braja KSIIGKVATSLSRDEAASAFDAMMSAMGGLLMALRVRGETVDEITGAVAAMRSKMVTAPAVDIVGTGGDGSGSVNVSTCASFIVSGAGVPVAKHGNRALS
Nitwi KVLIGKVATTLTREEAAAAFNSMMSAMGGLLMALRVRGETVDEITGAVSAMRSMMVKAPAVDVVGTGGDGSGSVNVSTCASFIVAGAGVTVAKHGNRALS
Brusu KPYIAKAASPLSLGDAKAAFDIMMSAIGGFLMALRVRGETVPEIAGAVASMRSRMVIAPAMDIVGTGGDQSGSYNVSSCTAFVVAGAGVPVAKHGNRALS
Agrous KPLIAKVANSLNREDARTAFDILMSAIGGFLMALRVRGETVDEIVGAVSSMRARMVSAPAIDIVGTGGDGIGTYNISTLASIITAGTGLPVAKHGNRALS
Lockve KPLIYAASEPLSRAQAEEAFGYLFAAIGGLLMALRARGEAVSEYAAAAAVMRAHCVTAPAMDIVGTGGDGKHTLNISTATAFVVAGAGVPVAKHGNRNLS
Rhods KPLIGTAATPLSREEAEFAFECLFEAMGGLLMALRTRGETVDEYAAAASVMRAKCVRAPAIDIVGTGGDGKGTLNISTATAFVVAGAGVPVAKHGNRNLS
Burmu QEALQRTIEEIFHDEMLHLMRLIMRMAAAIITGLRVKKETIGEIAAAATVMREFAVEVQFVDIVGTGGDGSHTFNISTASMFVTAAAGAKVAKHGNRGVS
Ralso QDALTRCIEEIFHDEMLHLMRQIMRMASALIMGLRVKKETIGEIAAAATVMREFAVEVPFVDIVGTGGDGANTFNISTASMFVAAAAGARVAKHGGRGVS
Neigo QQAIERLISELFYDEMTDLMRQMMSVIAAILTGLRIKVETVSEITAAAAVMCEFAVPLELVDIVGTGGDGAKTFNISTTSMFVAAAAGAKVAKHGGRSVS
Psaer KGALNRIVNDLTTEEMQAVMRQIMTCIGAFLMGMRMKSETIDEIVGAVAVMRELAVQLPVVDVVGTGGDGANIFNVSSAASFVVAAAGGKVAKHGNRAVS
Azovi KQALARVAEDLNTAEMQGVMRQIMTCIGAFLMGMRMKSESIDEIVGAALVMRELAVTIGLVDTCGTGGDGMNIFNVSTAAAFVVAAAGGRVAKHGNRAVS
Ecoli QPILEKLYQTLSQQESHQLFSAVVRLLAAALVSMKIRGEHPNEIAGAATALLENAFPRPFADIVGTGGDGSNSINISTASAFVAAACGLKVAKHGNRSVS
Shiboy QPILEKLYQTLSQQESHQLFSAVVRLLAAALVSMKIRGEHPNEIAGAATALLENAFPRPFADIVGTGGDGSNSINISTASAFVAAACGLKVAKHGNRSVS
Vibvul EAIINKLYQSLTEQESQQLFDTIIRLMASALTALKIKGETPDEIAGAAKALLANAFPRPFADIVGTGGDGHNTINISTTAAFVAAACGLKVAKHGNRSVS
Prmar SQILEMLLENLPEVEATALMEAWLALTGAFLAALRAKGVTGNELSGMAQVLRGACPCPLMVDTCGTGGDGADTFNISTAVAFTAAACGANVAKHGNRSAS
Synech PRLLDRLLEQLLPSDAAVLMEAWLALTGAFLAALRARGAQGGELAAMAGVLRQACPCARLVDTCGTGGDGADTFNISTAVAFTAAACGAVVAKHGNRSAS
Anava YLLLQQLIDSLSRSQAAELMQGWLSVSGAILTALNFKGVSADELTGMAEVLQSQSGSGEIIDTCGTGGDGSSTFNISTAVAFVAAAYGVPVAKHGNRSAS
Triery PSLLQQLLDSLSSSQASNLMQGWLQISGAILAALQAKGVSAQELAGMAKVLQSLSTKEYIIDTCGTGGDGASTFNISTAVAFVLAAAGVPVAKHGNRSAS
Crowa QSLLQQLLDSLSQTQAGQLMQGWLDISGAILVTLQGKGVSGDELAGMARVLQQQSETAIVIDTCGTGGDGASTFNISTAVAFVAAAAGVKVAKHGNRSAS
Nost SAILQQLLKSLTVAQATDLMQGWLTISGAILAAIQAKGVSSEELVGMARVLQSQSSPPHLIDTCGTGGDGASTFNISTAVAFVAAAAGVKVAKHGNRSAS
Syncys PPFLQQLLDSLTRQQAVQLMEGWLDISGAILAAIQAKGLDPEELTGMAQVLQEQSNQGRLVDTCGTGGDGSSTFNISTAVAFVVAAAGVKVAKHGNRSAS
Eugl KKWIASLQGPVSAEDQRAAIHELLAVKAALLALLPPASLGEDTLQLFVDTLLEHGVAVPVADIVGSGGDGQNTWNVPTPAAIVAAGAGIRMAKHGNRSAS
Sachce LSYTKKLLAQLSSTDLHDALLVILSKVSSFLTALRVTKLDHKAIAEAAKAVLRHSLVDLILDIVGTGGDGQNTFNVSTSAAIVASGIQLKICKHGGKAST
Cangl VQLTKTLLETLTPKQLYRAMVIILESIASFLSCLKASRLDHRAIAEAAKAVLGFSVVELVLDIVGTGGDGQNTFNVSTSAAIVAAGIPLKVCKHGGKAST
CanAl TPYLKKLVVTLEPKDLSEALELIFSSTAAFLSCLRLRGLDQEAIAAAVTTVLHFATIPPYIDIVGTGGDGQNTFNVSTSSAIVAAGMGLPVCKHGGKAST
Yarli NTQLKAILDTFTPEDLAEVLALVAQPIITFLALLHAKGLDMNALAAAANTLRHAALPDTYVDIVGTGGDGQNTFNVSTSAAIVAAGMGIKVGKHGGKAST
Schipo RLAIHDLDKAIPLENYEAALRAILTSTASFLASLHLTKAEEVPLMQTVQILKSYSIANIFVDIVGTGGDGHNTFNVSTASAIVAAGAGLWVCKHGNKAST
Ustma RPLLKALAVSGTSTITHKQLEQILEDIGSALTCLKFCRLDIQAFALAARIFLNCCVHVPTLDLVGTGGDGKDTFNVSTTASMVAAGVRVRVCKHGAKASS
ASfum SPLLQKLAYPVDPAEIASAFALIFESTAALLTLLHSTGKDRDAIALCSLRMREAAIEKSLCDIVGTGGDSHSTFNISTTASIIAS--PLMMAKHGNRAQT
SCCGSADVLEALGVQVLNPTQVVECVEQCDIAFLFAPVNHPAMKAVAPVRKQLGVRTCFNILGPMTNAAGQHAVIGVFHEELLELMAETLKEVGVDHVIH
SACGSADVLEALGVKVLTPEQVVKCVDAVRMAFMFAPVNHPSMKYVAPIRKKLGVRTCFNILGPMTNAAGQHAVIGVFHPELLSLMAGALKEVGVDHVIH
SACGSADVLEAFGVNILGPEGIKRCVNEVGVGFMMSANYHPAMKIVKPVRKKLKIKTVFNILGPLLNPARPYAVIGVYHENIVTKMAKAAQKFGMKRVVH
SACGSADVLEALGVVLLGPEGIKRCVEEGGIGFMMSPMYHPAMKIVGPVRKKLKIKTVFNILGPMLNPARSYAVVGVYHKDLVVKMAKALQRFGMKRVVH
SLCGSADVLEALGIAILGPAGVNHCLDQAGIAFMYAPRYHPGMKAVRPVRSALKVRTALNMLGPLLNPAESYGLVGVYDTSISELMAGSLLRMGVQKVVH
SKSGSADVLEELGVPMLKPEHVASCLEEAQIAFMYAPHFHPAMRYVGPVRKAIGIRSMFNILGPLLNPAGKRVVIGVYTPTLLDVFGEVLLALGVEHVVH
SRSGSADVLEELGVPMLKPEHVGPCLEEAQIAFMFAPHFHPAMRHVAPVRKAIGIRSVFNILGPLLNPAGKRVVIGVYTPKLLDVFGEVLLALGVEHVVH
SKCGSADVLEALGVAILGPEQVARCIAETGISFLMAPRFHPQLATVAPLRRSLRVRTVFNNLGPLLNPARRYQVIGVAAPELMEPMAEAVARLGTERIVH
SKCGSADVLEQLGVFVLTPEEAGLCLDQVGIAFLFAPLLHGAMKYAAAPRKEIGIRTVFNILGPLTNPAFENQVLGVYSSDLAPVLAQVLANLGTKRVIH
SRCGSADMLEALGIKVLPPDAVARCLDEVGMAFLFAPVFHGAMKYAAGPRREIGIRTAFNLLGPLTNPAGPCQLVGVYDPDLTETVAAVLGRLGSRRVVH
SACGSADVLESLGVNLVTPETVEQAIAKIGIGFLFAPALHGAMKHAIGPRKEIGIRTIFNILGPLTNPAGDCQVLGVYREELVEPLARVLHKLGCRRVVH
SKSGSADVLEALGVNILDPVQVKECIEKVGIGFIYAPVFHKSMKHAAGPRKELGIRTIFNILGPLTNPSNKGQVLGVFNPNLTELMANVLLNLGIERVIH
SQCGSADVLEKLGVNILTPKQVETCVEQVNMGFMFAPKFHQAMKYAAAARRELGVRTIFNILGPLTNPAKKGQVLGVFDESLTEVMAQVLKELGVERVVH
SMSGSSDVLEALGIKVLTPGQVRKTIEKIGIGFMFAPVFHPAMKRVAGVRKKLGVRTVFNILGPLTNPAGKGQVVGVFDKKLCEPIAYALAELGTEHVVH
SMTGSSDVLEALGIKVLSPEYVRKTIEKIGIGFMFAPVFHPAMKRVAGVRKKLGVRTVFNILGPLTNPAGKGQVVGVFDKNLCEPIAYALAELGTEHVVH
SRCGSANVLEALGVNICPPERVESIIESVGIGFMLAPLFHPAMKRVAHIRKEMGIRTVFNVLGPLTNPAGEAQVVGVYSPALCEKIANVLNLLGTKRVVH
SSSGSADVLEEIGVTIAEPPAVETCIEETGMGFMLAPVFHPAMKAVIGPRKELGMRTIFNVLGPLTNPADDAQVVGVYDEALVPTLADALSRMSVDRVVH
SSSGSADVLEEVGVDIAEPPDVEETIERDGIGFMLAPVFHPAMKAVIGPRQELGMRTVFNILGPLTNPADDAQVLGVYDPDLVPVMAEALARLDVERVVH
SKSGAADVLEALGLDIETPAVSYESLQENNLAFLFAQEYHKSMKYVATVRKQLGFRTIFNILGPLANPAHTHQLLGVYDETLLEPLANVLKKLGVTNVVH
SKSGAADVLNALGVRIITPEHVGRCVTEAGIGFMFAPTHHPAMKNVGPTRVELATRTIFNLLGPLSNPAGKRQMIGVFSRQWVQPLAQVLKNLGSESVVH
SRSGAADVLASLGVRILRPEQVGRCVRECGIGFMFAPAHHPAMKNVGPTRVELATRTIFNLLGPLSNPAGKRQMVGVFSRQWVQPLAQVLKNLGSESVVH
SRSGAADCLAALGIRILTPEQVGRCINEAGIGFMFAPAHHPAMKNVGPTRVELATRTIFNLLGPLSNPAGRRQMVGVFSRQWVQPLAQVLKNHGSESVVH
SRSGAADALAALGINIADADTIGRSISEAGLGFMFAPMHHSAMRHVSPSRVELGTRTIFNLLGPLSNPASKRQLVGVFAPQWLEPLAHVLKELGSETVVY
SKSGTADALSALGVRLIGPDLIARCIAEAGLGFMFAQMHHSAMRHVGPSRVELGTRTIFNLLGPLSNPAGKRQLLGVFSPRWLVPLAEVLRDLGSESVVH
SKSGTADVQSALGINVVTPDVVERAIAQAGIGFMMAPLHHPAMRHVGPVRLELGCKTIFNILGPLTNPAGKRQLTGAFAIDLIFPMAETLQQLGTEKLVH
SKSGAADALTEMGLNVIGPEQVEACLMEAGIGFMMAPMHHPAMRHVGPVRAELGTRTIFNILGPLTNPAGKRQLTGAFSPDLIRPMAEVLSALGSEKLVH
SKSGSADVLEALGVNILQPDQVAASIAETGMGFMFAPNHHPAMKNIAAVRRELGVRTIFNILGPLTNPAGPNQLMGVFHPDLVGIQVRVMQRLGAQHVVY
SKSGSADVLEALGVNILTPEQVAESVATVGIGFMFAPNHHPAMKSIAPIRKELGVRTIFNILGPLTNPAGPNILMGVFHPDLVGIQVRVMQRLGAKHVVY
SSSGAADVMEQMGANLLTPEQIAQSIRQTGIGFMFAPNHHSAMRHVAPVRRSLGFRSIFNILGPLTNPAGPNQLLGVFHTDLCGILSRVLQQLGSKHVVC
GKSGSADLLEAAGIYLLTSEQVARCIDTVGVGFMFAQVHHKAMKYAAGPRRELGLRTLFNMLGPLTNPAGRHQVVGVFTQELCKPLAEVLKRLGSEHVVH
GKSGSADLLEAAGVYLLSPEQVARCVESVGVGFMFAPAHHGAMKHAIGPRRELGLRTLFNMLGPMTNPAGKRQVLGVFSQALCRPLAEVMARLGSVHVVH
SKSGSSDLLAAFGINLMNADKSRQALDELGVCFLFAPKYHTGFRHAMPVRQQLKTRTLFNVLGPLINPAHPLALIGVYSPELVLPIAETLRVLGYQRVVH
SKSGSSDLLAAFGINLMNADKSRQALDELGVCFLFAPKYHTGFRHAMPVRQQLKTRTLFNVLGPLINPAHPLALIGVYSPELVLPIAETLRVLGYQRVVH
SKSGSSDLLDSFGINLMSAEDTRSAVDNIGVAFLFAPQYHSGVKHAMPVRQTMKTRTIFNILGPLINPARNIELMGVYSQELVRPIAETMLKMGMKRVVH
GKVGSADVLEGLGLQLAPLVSVVEALVEVGVTFLFAPAWHPALVNLAPLRRSLGVRTVFNLLGPLVNPLQNAQVLGVAKAELLNPMAEALQRLGLQRVVH
GRVGSADVLEGLGLRLAEARQVVEALPAVGVTFLFAPAWHPALVNLAPLRRSLGVRTVFNLLGPLVNPLLDGQVLGVARPDLLDPMAEALLQLGQRRVVH
SLTGSADVLEALGVNLASSEKVQAALQEVGITFLFAPGWHPALKAVATLRRTLRIRTVFNLLGPLVNPLRTGQVVGLFTPKLLTTVAQALDNLGKQKVLH
GKVGSADVLEALGIRLAPTEKVISALSEVGITFLFAPGWHPAMKCVVPLRRTLKVRTVFNLLGPLVNPLRQAQIIGVYNSTLVKTVAQALGILGVEYALH
SKVGSADVLEYLGVKLATPEKVAEALEEVGITFLFAPGWHPAMKHVAPLRKTLKVRTIFNLLGPLVNPLRTGQIIGVYHPRFIRPMAEALHQLGIAQVLH
SKTGSADVLEALGINLANADKVQAAVSEVGITFLFAPGWHPALKTVATLRKTLKVRTIFNLLGPLVNPLRTGQIIGVNDPLLIEEIALALSHLGCRKALH
SKVGSADVLEALGLNLAGADQVAAAVSAVGITFLFAPGWHPALKSVAPIRKTLKVRTVFNLLGPLVNPLRTGQVIGVYSPDFLSVMAIALKNLGTARVLH
SNSGSADLLEQFGAYLVSPTVVPELVDRTNFSFLYAPAFHPALRNVAKVRQDLGTKTVFNFLGPLLNPAAQYNVVGVSDRAMAEVMARCLSRRPGVRVVH
SNSGAGDLIGTLGDMFVNSSTVPKLWPDNTFMFLLAPFFHHGMGHVSKIRKFLGIPTVFNVLGPLLHPVSNKRILGVYSKELAPEYAKAAALVYGSEIVW
SNSGSGDLINTLGQSSVTAETVPALW-ENKFMFLLAPYFHYGFGKTSNLRKLMHIPTIFNILGPLLHPVADKRVLGVYSKDLAPEYAKAASIVYNSEVVW
SSSGSGDLLKSLGDLSVNEVTTPEIVKKSKFCFLFAPSFHPGMGLVAHIRSQLGVPTIFNILGPLINPIPRARVLGVYSEKLGESYAQAASILAKNEVVF
SMSGAGDLLTCLGDMSVHQSTVPDIVAKGPFCFLFAPMFHPVMNRVAPVRKSMGIPTLFNVLGPLVNPIPKARIIGVYSEALGQIFAEAITHINVQGVVW
SASGSADLLMSFGDLLVTPKNIVSITEQCKFSFLFAPMCHPTLKNVAPIRKQLGLPTIFNLVGPLLNPIPYARIIGVSKLSLGEVVAKTLLKLGGRSVVC
STSGSADLLMSLGPLLLPASQLPNVLRKSKFSFLFAQLFHPALAPLGPIRRSLGFPTIFNVLGPLINPAKERCILGVHSYYLGRIFAEALRKRGERAIVC
SFSGSADVLNAISKISVTAENLAQVYEATSYAFLFAPNFHPGMMYANPVRRGLGLRTIFNLMGPLANPVDEARVVGVAYQSLGPVFAEALRSSGTKAVVC
GVLDEISPMGPATIVEVS-TRKYQFDPLSVNIPRCVVDLKGDPEQNAKEFENVLEGNAKKDAIVLNAGVGCYVYGMTIEEGCLARKTLVEGEVLKKWKAVS
GLLDEISPLGPSTILEIKNEREFEFDPLSIGIARCELDLKGGPEENAQKFRDVLLGDAKRDSIVLNAGVGCYVFGLAIEDGCLARATLESGSLLESWVKVS
SKLDEISPLGPGYILDVTPIEKMLFDPLDFGIPRCTLDLKGDPAFNAKVLQDVLAGGSIADALVLNAAASLLVSGKVLHDGVLAQETQRSGNTLESWIKIS
SCLDEMSPLGGGLVYDVTPIEEFSFDPLDFGIPRCTLDLRGGPDYNADVLRRVLSGGAIADSLILNAAAALLVSNRVLAEGVVAREVQSSGKTLDSWINIS
SMLDELTPMGPADVVEVTALKRYSLDPKEVGIPRCEVDLKGDAQLNAAILRDVFAGGAVADALCLNAGYALAACRVAPAEGVMAQEVQRRGATLQRWAEAS
CALDELNSIGPAEVVEVRLLRHYQLRPEEVGVPAVTLQLKGDATENAAILREAFTGGPVGNTIAYNAGAGLYVYGLAIKSGYMAKKQLASGATLDKWAQVA
CALDELNSMGPADVVEVRQFRRYELRPEDVGVPAVTLQLKGDATENATILRKVFSGCPVGNTIAYNAGAGLYVYGLAIKSGYMAKKQLTSGATLDNWAQVA
CALDEMAPIGATEVIEVRKPCRFRVEPETLGMPLCTVDLVDDCAANAATITSILQGGPISNAIIMNAGAGLVVYGLALQDACMAANALMSGETLEKWKSLS
GGLDEISLAGEALVYEVKDVKEMIIDPMDYGLDRAPIALAGDAKRNARMIKNILSGGPQRDTIIINAALGLIAGGLVLAMGILAEQIIDEGKKLNLLVEFS
GDLDEVTITGPSKITCLDKIRTYTFTPEDVGLPRANLDLAGTATDNAAIARAVLSGGPARDVVLINAAFALLAAGAALQQALLAESSIDSGAKLQAMVAWV
GDMDEITLTRETRIAEVTRVSVRTITPEEFGFASCPAELRGDAAGNARIVRGILEGGPRRDVVLLNAAFGLVAAGKAPAEGVIAAEAIDSGAKLEELITLT
GDMDEITTTTSTKVSEVRNVITYELFPENYGIALASPDLTGNAEENAQIIRRIFNGGPKRDIVVLNSAAALYVGKVVIEEGILAEEVIDSGQKLDEFVEFT
GDLDEITTTTKTKVSELKNISNYVIDPRQFDIPLTDKDLAGDAEKNASIILNIVKGGGKRNMVLINAGAAIYVGNAALQEGIRAAEVIDMGDKLNQLIKLS
GDMDEISNTGETFVAELKDVSTYTITPEAMGMLRAKPDIKGTPKENARDLLCIFKGGPKRDLVILNAAAALYVSGIVIRQAIIAEDAIDSGVKFNQFRNFT
GDMDEITNTGETYVAELKNVSTYTLTPESLGMLRASPDTKGSPKENARDLLCIFKGGPKRDLIILNAAAALYVSGIVIRQAIIAEDAIDSGVKFNQFRTFT
GSLDEISNTGSTFVSELCDVRNYVVDPRDLGYPLADLEIAGTPDENAERLVRILKGSRARELVAMNAGAAVYVSGIALREGCIAEGAISSGSALETLKTLV
GALDEIGVHGESTVAEVDGVEQYTVTPSDLGVDSHDLAVAGSPTENAADMRGILEGGAKRDIILANAGAAIYIAGEALAAGVAARESIDTGAAAAQLASLR
GDLDEIAIHGETVVAEVTDIAEYTITPEDMGLETRDIAISGSPEENAADLRGIVTGGAKRDIILANAGAAIYVAGVAHEAGVQARQAIESGAAADKLDDLI
GDLDEMTTADETAVVELQDLTKYTVTPEQFGLKRRQRDLVGTPEANADITRRILAGGPQRDIVLLNAGAALHVAHPAIQAGILAAKTIDDGEELNRLLAFS
GDLDEITLSGPTAVAELKNIRTFEVTPDEAGLPRAHAALRGDAEANAVALRSVLEGSPYRDVALLNAAAALVVAGKALKEGVLGAKSIDSGGRLRRLIAVT
GDLDEITLTGPTFVSSLHNIRNFEVTPEEAGLPRCEPALKGDADANAIALQSVLNGSAYRDVALMNAAAALVVAGRALKEGVLGAKSLDSGARLKHLIAVS
GDLDEITLAGPTFVAALEDIRTFEVTPEDAGLERADGALKGDAEANAASLRAVLEGGAFRDVALLNAAAALIVAGKALKEGVLGAKSLDGGNKLKQLIAVS
GDLDEMTTAGTTQVAALENIRTFEITPEEVGLRRCSPELKGEAAENAKALLGVLEGSAYRDIVLLNSGAALVVAGKALKDGIQAVQSIDSGAVLQKVIAVS
GDMDEVTTTGVTHVAALEDIRTFDLTPKDFGVEPALMDLKGDGIANAAALREVLSGNAYRDISLCNAAAALVIAGKALSQAMIASDALDSGAALDRLVAVS
GDTDEITICGPTSVAVLDGITSRQIHPEDAGLPEHAFDIIGSPQDNAQALRALLDGGAYRDAVLFNAAAALVVADKALPDGVIARDSIDSGAALDALVRVT
GDTDELAISAASKVAALEGIREFELHPEEAGLPVHPFEIVGTPAENAQAFRALLDGGAYRDAVLLNAAAALVVADRALREGVIATDSILSGAKVALLARLT
GDMDEVSLGAATLVGELRDVHEYEIHPEDFGLQMSNRTLKENAEESRAMLLGALDNGVAREIVTLNAGTALYAANVAIADGILAREAIASGAKVDELVRFT
GDMDEVSLGAATLVGELKDVSEYEIHPEDYGLQMSNRGLKADAEESKAMLIGALENGTPREIVTLNAGAALYAANLAIGDGMLAREAIASGAKLDELVRVT
GGLDEITLTGKTRVAELKDISEYDIRPEDFGIETRNLEIKANTQESLLKMNEVLDGGAARDIVLLNTAAALYAGNIALSDGIAAREGIDSGAKKEEFVGFT
SDLDEFSLAAATHIAELKDVREYEVRPEDFGIKSQTLGLEDSPQASLELIRDALGRQKAAELIVMNAGPALYAADLALHEGILAHDALHTGEKMDELVAFT
ADLDEISLAAPTHIAELRNISEYLVQPEDFGIKSQSLGLDEGPQESLALIRDALGRQKAAEMIVLNAGAALYAADQALKEGVLAHDALHTGDKLEELASIT
SGMDEVSLHAPTIVAELHDIKSYQLTAEDFGLTPYHQQLAGTPEENRDILTRLLQGAAHEAAVAANVAMLMRLHGHEQANAQTVLEVLRSGDRVTALAARG
SGMDEVSLHAPTIVAELHDIKSYQLTAEDFGLTPYHQQLAGTPEENRDILTRLLQGAAHEAAVAANVAMLMRLHGHEQANAQTVLEVLRSGDRVTALAARG
GGLDEVAIHGDTLVAEIKDIHEYTLTPADFGVNTHPLAIKGDPEENKAIITHLLTGEAQLSAVAVNVALLMRLFGHEKANTQQAIDVMNSGQLVEKLAQHA
GGLDEASLEGANAMRLLENLRQASIDSAELGLTRAPLALQGDLATNQAILSAVLQGAPQRDVVALNTALVLWAAGLQDDLQATAKTCLQEGQRLEGLRMAL
GGLDEASLAGPNAVRIVEDIRAEILAPADFGLREAPLALKGDLELNQAILRELLQGEAQRDVVAFNTALVLWVAGVEMDLRSRAITALAEGERLEQLRQAL
GELDEAGLGDLTDLAVLSDLQLTTINPQEVGVTPAPIALRGDVQENAEILKAVLQGQAQQDAVALNAALALQVAGAVLDHAKVAKEILQTGAKLEQLVHFL
GELDEAGLGDITDIAILSHVKATSINPQYLGLNYAPITLQGDVEQNAEILKNVLQGSQQTDVVALNSSLALQVAGVVEAHQEKAKDILQSGLKLEQLVQFL
GELDEAGLGDVTDVAFLKEVNLGEINPQSLGLTSTPLALKGNVEENASILTDVLQGSAQQDAVAVNAGLALQVGDVVCEHQKKAKDILKSGDKLTSLVEFL
GELDEAGLADVTDLAILQDVSCLALNPQELGLNHAPTVLRGDVAENAEILKAILQGQAQQDVVALNTALALQVGEAITDIVEIAREVLQSGTKLEQLAEFL
GELDEAGLGAPTDIASFNQVTPQVLDPQNFGLAPAPLALKGDLAENVTILSQVLQGQAQIDAVALNASLALQVGDAVGDHGQLAKDILSQGDKLQQLVAFL
SDMDKISPVRNTHSWRVEGPVYQLLRPEDFGLVSAKDGAGGSPAHNAAALLRVLQG---PDPLL------
GVLDEVSPIGKTTVWHIDPLKTFQLEPSMFGLEEELSCASYGPKENARILKEEVLSNPIYDYILMNTAVLYCLSQGHWKEGIKAEESIHSGRSLEHFIDSV
GVLDEVSPIGKTTVWHVTTVDIFELEPAMFGLQEPLDCKSYGPERNAEILRDDILSHPVYDYILLNTAVLYCLSQGHWKQGVVANESIQSGKALEHFIKDV
GVLDEISPIGYTKTWTIDKIERNRISPKDFGLPEDLSVKSGTPQQNAEILSHILNQHPLVDYILMNSAAVAVVSGIAWVDGVLAKESIVSGKALEDFQNSS
GELDEISPAGRTKVWRVTPIEELYLTPEDFGLSRPLSVSSGTPTENATVLKQLLSNHPISDYVIMNAAALAVIDGHAWKHGVLARESIKSGNALETFVEAS
GELDEISPAGPTHTWLVRDITHEVYTPESFHLQSPLSVASGTPSANAILLEELLSNHPILDYVLMNTAALLHVAGMALREGVIAQQSISSGRELSNFSTIS
GELDEISPAGKTDVWELKDIEEFTIEPEDFGLKKPLEVGSHSADENAAIVLKMFSTQAIKDYTLLQTAALLYVGSYALEDAALARESIESGRAMETFRDES
GELDEISCAGPTNCWKLTGIETFQLHPSDFGLPAPLSVYGKMPKENAAKIMSILRNDPILTFVLINVAALLVISGICWKEGVRARWAVESGKCLEQFIEVT
Phosphoribosyl anthranilate isomerase amino acid sequence alignment
47 177
Thaps KLIKICGLTQPDDALVACRAGANLLGVIFAESKRRVSVEQAKAIVDAPSVVGVFQNPLEFVKEMVEVCGLDLVQLHGSEGMEAANAKNALRVVDIESGEG
Phatr QVVKVCGITSSEDALVACQAGANLIGVIFAPSARKVTPEQAKAVVQAPLVVGVFQNDSSFVREMVDSCGLDLVQLHGQEGFAAAKVESAIRVVDIVSGR-
Pram PLAKVCGVTKVEYALAALRNGANMIGIIMAESPRYVQVEQAKAIAQAPLVVGVFANTAAEMNAAAEEIGLDLVQLHGDEGYEICNDIKTIRALHLPSDGV
Psoj PLAKVCGVTTVEYALAALRNGANMIGIIMAESPRYVQAEQAKAIAKAPLVVGVFANTAAEMNAAAEDIGLDLVQLHGDEGFEICKDIKTIRALHLPSDGV
Ppar PLAKVCGITTVEYALAALRNGANMIGIIMAESPRYVEKEEAKAIAKAPLVVGVFVNTATEMNAAAEEIGLDLVQLHGDEGFEICKDIKTIRALHLPCDGV
Gibze LYVKICGTRSAEAARRAAESGADFVGICLVPAKRCISHETALAISEAPQLVGIFQNPLSEVLEKQKQYNLDLVQLHGDEPIEWANLIPVVRCFKPS----
Neucr LLVKICGTRSAEAAAEAIKAGADLVGMIMVPTKRCVDHETALSISQAPLLVGVFMNPLEEVLEKQHLYDLDIVQLHGDEPLEWANLIPVVRKFKPG----
Asory PLVKICGTRSEDGARAAIEAGADLVGIIQVQRKRTVSDDVALRISQVALLVGVFQNPLSYILEQQQKLELDVVQLHGSEPLEWAKLIPVIRKFGLD----
Schipo PLVKVCGTRSLLAAKTIVESGGDLIGLIFVESKRKVDLSVAKEISHFPLLVGVFQNPLEYIRSIIAEVNLDIVQLHGQEPFEWIHMLDVIKVFPLN----
Ustma SLVKICGLSTVEAAVTAAEAGADMLGMILAPTKRTVSLEQAAEIIKAPLLVGVFRDPLAEVASTAERLGLDVVQLHGSEGTEWAKFLPVIRVFSVKLG--
Agabi PLVKICGVKTKDQALAIADAGADLLGLMFAKSKRRIDRQVAKEIAAAPLLVGIFQNPLEEILETVADVQLDIVQLHGNEPVEWATQIPVIRAFHLGRI--
Sachce PLVKVCGLQSTEAAECALDSDADLLGIICVPRKRTIDPVIARKISSLKYLVGVFRNPKEDVLALVNDYGIDIVQLHGESWQEYQEFLGVIKRLVFP----
Canal ---KICGIKTVEAASVAIDNGANLLGCILVPRARTIDLEVAKQIARMPFLVGVFRNPKEEVFRIAREVGLDFIQLHGE---DKLEFLGLIPRYVVP----
Stret TKVKICGLSTPEAVATAVKAGADYIGFVFAKSKRQVSLEQAHELAKGTKIVGVFVSSLKELEEAISQVPLDIVQIHG--TFDEDLIPKVIRAIQIS----
Chlpha VKIKICGITRLSDALDACFAGADALGFNFSSSPRAIAPERAKEIIEKVESTGIFVDQSPEEINALCYCRLQIAQLHSQYSPEQAR-SIVIKVFRPEVEEV
Pellu TRIKICGITRMEDARAAALFGADALGFNFSPSPRCVTKDAARTMVCAIEGVGVFVEQDPREIIEICYCGLSVAQLHGRYSEEETKLVLVLKVFRPEPAAV
Cloth TCVKICGLRRKEDIDYVNLYKPQFAGFVFAESRRKVSKETARMLVKAIKSVGIFVNEKKETVAEIVYTGLDCVQLHGETPEYVEKLKEIWKAVRVK---S
Mothe VRVKICGIRDWEEARMVLDAGVDTLGFVFARSPRAIKPEAAREIITKTTTVGVFVNEPRYSLMEIAFCRLDVLQLHGEPPEYCHGLSQ-IKAIRVR---S
Bacce MKVKICGITDMETAKRACEYGADALGFVFAESKRKITPGLAKEIIQEVLKIGVFVNESVEMIQKIAECGLTHVQLHGEDNYQIRRLNI-IKSLGVTMKNA
Desha PRIKICGIRTGEEARWAVEAGADALGFIFVPSKRYIQPETAREIILNISKVGVFAQASPEHVGRIVECSLDTIQLHGEDPRLYRHLSV-IKAFSFPPGNS
theko EFVKICGVKTMDELRLVERY-ADATGVVVNSSKRKVPLKTAAELIEMLVSTMKTFPEWAN----AVKTGAEYIQVHSMHPKAVNRLKDVMKAFMVPSDDP
Pyrfur MFVKICGIKSLEELEIVEKH-ADATGVVVNSSKRRIPLEKAREIIENLVSTMVGFSEWAM----AIRTGAQYIQVHSALPQTIDTLKKVMKAFRVPSKNP
Metmaz TRIKICGMCSPEDMEMAALYGADAVGFITEVIESPRKLDSDTAASLILDSVMVIMPENSSRALELIKVRPDIVQIHSLPSVELEVIREIIKTLSVPASRV
Brad LLVKICGLTTPETLGAALDAGAEMVGFVFFPSPRHVGLTAARELGQQALKVALTVDDDATFENIVETLRPDLLQLHGESVARIRDLKQVMKAIAVATTAD
Nitwi LIVKICGLSTPSTLDVALQAGADMVGFVFFPSPRHLELARAQELGAQAAKVALTADDDETLCGIIEALRPDLLQLHGETVPRIREIKRVMKAIGVEIAAD
Rhopa LDVKICGLSTRATCDAALAAGADMVGLVFFSSPRHVDLGTAADLARAASVVALTVDDDAQLAAIVETVRPDLLQLHGESPARVAEIKRVMKALPIATRDD
Roseb IRVKICGLKTPQDVSAAAAAGAAYVGFVFFPSPRNVSIEQAVNLALELCKVALTVNDDALLDALTDAVPLDMLQLHGESAERVAQVKAVMKAIGVAGAED
Eryt IQIKICGLSTPETIAASADAGASHIGFNFYPSPRSVAPELASELAAILLKVGVFVNDDTLLNEAVRHGALDAIQLHGETPARVQEVKAVWKVLSVETDED
Natph TRVKVCGITNRSDLDTAVDAGVDAVGLIVDVTPREISPQQAAELAAAVTAVLVTMAVPDAATELVEAVRPDAVQVHGSTPDALASLGAVIKAT------D
Proma TAVKICGLTNIDQAKSIAALGVEAIGVIGVASPRFVAEQQRRDLFAQLQRVWVVADDDTDLSEALQEGAPSAIQLHGETPEHCANLRIWWKALRIRSHED
Synec PAVKICGLTDTEQALAIAAMGADAIGVIGVATPRYLEDSPRRGLFSQLQRVWVVADSNAMLDASLQEGTPTVIQLHGEPPAQCQALRQVWKALRLRSQDD
Nospu MRVKICGITQPQQSIAIASLGATALGFICVPSPRYVTTSQIRAAVAEIDTIGVFANSIVEISQIVVDSGLTGVQLHGELPDFCYQLRQIIKALRIRSLEH
Anava MRVKICGITQPQQSVAIASLGATALGFICVPSPRYVTAAQIWAAVAPIDKIGVFANSIAEIKQTVIDCGLTGVQLHGETPEFCDQLRRILKALRVRSLEH
Crowa MRVKICGITQVQQGQAIAELGATALGFICVESPRYINSQQIQEIIRVIDLIGVFANSLTKIKGVLQVAQLTAIQLHGETPEFCAQVKQIIKAFRIKTVGS
Glowa MRVKICGFTDPGQAQAAARLGVHALGFVCVPTPRYVDAARLREIAAATFKVGVFVDPVEAMRAAVEAGELQGVQLHGESPETCAHLARRIKALRVREPAD
Xylfa TRIKFCGMTRVGDVRLASELGVDAVGLIFASSSRLLTVSAACAIRRTVNVVALFQNSADEIHTVVRTVRPTLLQFHGEEDAFCRTFNVYLKAIPMAKRIC
Legpn IRVKMCGMTRSEDIQYAIDLGVDAIGLIFYPSARNVSLEKARIIVNNVDIVAVLVNEQSFVQLIINEIPVQLLQFHGESSEFCRQFNKFIKAIHPKTAIQ
Ralme TRIKICGLTREEDVRAAVDAGADAIGLVFYASPRHVDVAHAAALAELVSVVGLFVNDAEEVAHVAERVPLTLLQFHGETPQQCTEIARFMRAARVRPGLD
Decha TRIKICGLTREEDVDAAVAAGADAIGFVFYPSPRYVSPQRAAELVKRVDVVGLFVNAPEVVRIACEALPINVLQFHGEDAAYCSQFARYLRAARVRPGLD
Psent VRSKICGITRLEDALAAVEAGADAIGFVFYASPRAVDVRQARAIIAEVTTVGLFVNSRCELNEILEVVPLDLLQFHGETPADCEGYHRWIKALRVRPDDD
Nitoc TRVKFCGITRREDAIQAIRLGADAIGLVFYPSPRAVSPQQAYQIVREVTVVGLFVNASCYLQQILDKVPIDILQFHGESPEECGYYGRYIKAIRMAEGVD
Niesm IRTKICGITTPEDALYAAHAGADALGLVFYPSPRAVDIIKAQKITAAVSVVALFVNSAQNIRRILAEVPIHIIQFHGEDDAFCRQFHRYIKAIRVQTASD
Medtr PLVKMCGITSAKDAAMAAEAGANFIGMIVWPSKRSVSLSVAKEISKVAEPVGVFVDDAETILRASDASNLEFVQLHGGSRAAFPSLIQ--RVIYVLDGSL
Arab PLVKMCGITSARDAAMAVEAGADFIGMIIWPSKRSISLSVAKDISKVAKPVGVFVEDENTILRAADSSDLELVQLHGGSRAAFSRLVR--RVIYVLDGKL
Oryz PVVKMCGITSAKDAETALEAGAKLIGMILWPSKRSVALAEAKEISRVAESVGVFVDDEETILRVSDSCDLNLVQLHGESRSLLHVLSK--RIIYVLDGKL
chlre PLVKICGITNAEDAQHAVQSGADLLGMIMWQAKRAVSADTARAIAAVAKAVGVFVDDAATISARCRDAQIPIAQLHGGARAALPDLDP--EVVYVLDGTP
CME KSLKVCGVTSAEDATWILETIQRRLSVQVWISRRCVSVREARDIAAAARTVAVFVDDLSRIRQVCTDASINVAQLHGLDWQEHVLRAVYARVLRPDEPSK
KSADIAAAIIAILLDGGGGTGQAFDWTPVIIAGGLTPENIGEAVLNVKPWGVDVAGGVEAAGKDTEKVEKFVGGAKR
AAENAVETIIAILLDEGGGTSRSFDWTPVIIAGGLSPENVKDAVAGTRPFGVDVSSGTEAPGKDHQKVRDFVQNAKQ
DAEAILQQVNYILLDQQGGTGVAFDWKPCLMAGGLTPENVVKALSVGHPVGVDVSSGVEVGSKDLDKVTAFLRAVKD
DAEAVLQQVNYILLDQQGGTGVTFDWKPCLMAGGLTPENVVKALSVGHPVGVDVSSGVEVGSKDLDKVAAFLKAVKD
DAEAVLQQVNYILLDQQGGTGVAFDWKPCLMAGGLTPENVVKALSVGHPVGVDVSSGVEVGSKDLDKVAAFLKAVKD
-----VGIGTVPLLDGSG-SGKLLNVARVFLAGGLNPDNVAESVKALGPIGVDVSSGVEEGKQSLDKITAFIKAAKE
-----VGLAAVPLLDGAG-SGTLLDLETVLLAGGLEPSNVVETVKSLGPIGVDVSSGVEEGKQSLEKIREFVKAAKS
-----TGIATLPLLDGAGGSGELLDQSRVILAGGLDPTNVADTIKKLGSVGVDVSSGVESGVQDPSKIHAFVQAVRG
-----SEISIVPLIDYVGGESGGLG-KSYVLAGGLTPKNVQDAISVSRPAVVDVSSGVETGKQDLEKIKAFINAVKE
-KDLLREVSHIAALDGDGGTGVSFDWSPVLLAGGLTAENVAQAIKVAQPFAVDTSSGVETGNKDLAKIKAFVLAART
-RG-VENITHFILFDLSGGSGKIVDWAPIILAGGLTPENVAEAVKQVRPWAVDVSGGVETDGKDLEKVRLFIARVKL
-ILLS-AASFIPLFDEAGGTGELLDWNHFMLAGGLTPENVGDALRLNGVIGVDVSGGVETGV-DSNKIANFVKNAKK
-LLEEQSLSSLPLLDEVGGEGKLLDWT—ILAGGLTPENLP----TFDNILGYDVSGGVETGV-DSLKIIKFIQKGHA
-----DSQV-YLLFD-IAGSGQTFDWQDYFIAGGLAVDNVAEAKETFHPYALDVSSGVETGY-DLKKIKAFIERVKA
FAFAQNSG-NAFLFDMAGGTGETIEASYAILAGGLNATNVGEAIRRIQPYGVDTASGVESRPKDSREIRSFVKAVHH
LRFREATG-SAFLFDMEGGTGEQMEGSYGVLAGGLNPGNVREAVQSLRPYALDTASGVEESPKSHEKMQAFVQAVRA
LEIISEF—DAFLLDSYGGAGAVFDWQLRIILAGGLNPENVKTAVAKVKPYGVDVSSGVETD-KDAEKIRDFIMKVRE
LASLEAYR-QGFLLDKAGGTGTTFNWEPVILAGGLTPENVGAAIQLVHPYAVDVSSGVEVD-KNPARIAAFLEAVRK
QEYETDY----ILFDFHGGNGKTFSWEKTILAGGLNALNIEEAIRTVRPYMVDVSSGVETE-KDVEKIKQFIIKTKE
SEAAPDFSLHGILLDRTGGTGIPLPWHPLILAGGLNPDNILEAIRLTRPYGVDVSSGVERN-KDREKIQHFISQARK
AEDAERLLEDKILLD----SGRRHDYRPIVLAGGLTPENVGEAIRWVKPAGVDVSSGVERN-KDRVLIEAFMAVVRN
EEDANRLLSDMVLLD----SGKLHDLRPVIVAGGLNAENVEEVIKVVKPYGVDVSSGVEKY-KDPKLVEEFVRRAKN
QSPVKRLLDDSILLDKTGGTGYVHDWDPLILAGGLKPENVQEAIRIVSPYAVDAASGVEIL-KDAVKIRSFIEEVRC
LVPLAGYADDRILFDRPGGLGATFDWHPFMVSGGLSADNVAEAVRITRAGGVDVSSGVERAPKDCDMIRNFIRAARA
LADLPRYAADRLLFDRPGGLGVPFDWRPFMLSGGLAAGNVDDAVRITRAGGVDVSSGVESAPKDAGMVRDFIRAARA
LAALPDYAADRILFDRPGGLGVAFDWTPFMVSGGLTLDNVADALRITRAGGVDISSGVESAPKDPELIRAFIRAARA
LPQIDLYSQDQLLIDLPGGNGLAFDWRPWMLAGGLTPDNVAEAVRMTGARQVDVSSGVETAPKDAALIAQFNAAARS
IAAADAYSSDLILFDLPGGMGMGFDWSPWGLAGGLNSENVADAIRATGAPLVDTSSGVESAPKDVDRIAAFCKAALD
PGTAATYDGDAVLVDGAGGTGTVHDWDPVVLAGGLTPENVADAVEAAAPFAVDVASGVEAESKDADAVSAFVDAAGG
LSLAHTYAGDALLLDQLGGTGHRLPLNPWWLAGGVSAEWIPELLSQVNPWGLDASSRLEISPKDLKLVEALVEAVR-
LHAVKGYVQDGLLLDQLGGTGHRLPLDPWWLAGGISAEWIPELLDRVTPDGLDASSRLEVRPKDLEKVNALLSAVR-
LGTAADYTKDTLLLDQLGGTGKTLDWKPWFLAGGLTADNIVEALSQVSPSGVDLSSGVERAPKDLDKVSKLFEKLG-
LEQAIIYTQNTLLLDQLGGTGQTLDWQPWLLAGGLTPDNILEALSQLNPDGIDLSSGVERKPKDLDKVALLFEKLG-
LENIPQYVEDTLLLDQLGGTGKTLNWDPWFLAGGLNPNNILQALDNLAPDGIDLSSGVERSPKEIGKVAQLFTKLNQ
LEKIALYVDEAVLLDQAGGTGRTLDWKPWMLSGGLRADNLAQALDILTPDAVDLSSGVENVPKDLSKITQILHIAQG
TRTLYLKYPAGFIFDLKGGTGQTFDWSPFLLAGGITPENVFDAIAATVPWGVDVSSGIELQPKDGDKMRQFVEEVRR
IQSAVDEFFSAILLDGRGGTGLTFDWNPYILAGGLNESNILEAITMCHPYAVDVCSGIEASPKDHLKMSRFIKAIWG
LVEFANQYRSGLLLDYGGG-GHVFDWTRIVLSGGLNAQNVAGAIERVRPYAVDVSSGVEASKKDHARIAAFVRAVRQ
LVEFAGSFPRGLLLDYGGG-GHVFDWTYLVLSGGLTADNVGDAVRRVRPVAVDISSGVEASKKDHSKIAAFVAAVRK
LQAACKHYASGILLDVPGGTGESFDWSPIILAGGLSADNVAEAIRQVRPYAVDVSGGVEQRKKDAAKIEAFMRAVRQ
LPSLARSYESALLLDVPGGTGRAFDWRAVILAGGLTPENIAQAVRQVRPYAVDVSGGVERIKKDAAKMAAFMRGVDS
IRNAADRFPQALLFDEYGGTGHRFDWTPWVLAGGLTPENVDEAIRITGAEAVDVSGGVEASKKDPAKVAAFIATANR
LNTIPDE--DWVLVD--GGSGEAFDWAGWLLAGGVNPENVGEALSSLKPGGVDVSSGICADGKDQSRIASFMDAVHS
LNEIPEE—-DWILVD--GGSGHGFNWAGWLLAGGINPTNVSEALSILQPDGIDVSSGICGDGKDKSKISSFITAVRS
INALPDE--DWFLVD--GGSGKGFNWQGWLLAGGLHADNVCDAFYALKPNGVDVSSGICADGKDPTRISSFMRNVKS
LTPPPSELLDWVLVD--GGSGQALDWRGWLLAGGLNPDNVATAAGLAQPSAVDVSSGVCGDGKDHGKVSSFISSAKA
RAITTADGRCYTLID---GDGKPFDWRLWMIAGGLHAGNVVEAVRVTNA-GVDVASGVEVTDKDPARLEAFLDALCS
Indole-3-glycerol phosphate synthase amino acid sequence alignment
52 228
Thaps2 FVRKAKEADVRKYCEESVLLKQINDAPTTTVGEVQGLISIIAEYKRKLEGSGFLSEILPPEILSPVFREFGATAVAVLADERTGGCTYDDIVEMVPLPVI
Phatr2 LVKKAKEQELRKYIEEHVKYNLLKENLDQVVGPLQESITVIAEYKRKFNPTGLIHEMHVPELLSPSFREYGASAIAVMADPRMGGCDYDDIRHFVPLPVI
Thaps1 KITAATILDVEAYTSPTILIETAAEFHLPLQHQIQQTMALAAEFKRASPSKGDIAPHLNAGEQASVYYKAGASVISVLTEGRWFKGSLADLREAR-----
Phatr1 TITATRLLDYQSLLEPSLLAQEAKSFALNLQTVIQSQMALAAEFKRASPSKGDIATHLNAGEQAVKYTKAGANIISVLTESHWFKGSLDDMTQARRPAIL
Pram EIAAQRRLDVEAAKQVTPLVKKIESTELRVLDRLNAPVALAAEFKRASPSKGDIATGLNLREQVKSYADAGASMISVLTEPKWFKGSLEDMMAARRPAIL
Psoj EIAAQRRLDVEAAKQVSALTKKIESAELPVLDRLNAPIALAAEFKRASPSKGDIATELNLQEQVKAYADAGASMISVLTEPKWFKGSLEDMQAARRPAIL
Ppar EIAAQRRLDVAAAKQVSTLAKKIEHTELPVLERLNAP------EQVQAYANAGASMISVLTEPKWFKGSLDDMMEARRPAIL
Gibze RIYANRKAAVDAQKQPSQEDLQVAYDLIPLVSRLRESVALMAEIKRGSPSKGIFALDISAPAQARKYALAGASVISVLTEPEWFKGSIEDLRAVRRPAIL
Neucr KIYAHRKAAVDAQKQPSLSDLQAAYNLISLVDRLRNSVALCAEIKRASPSKGVFALDIDAPSQARKYALAGASVISVLTEPEWFKGSIDDLRAVRRPAVL
Asory KIYDHRRAAVAIQKTPSQADLQAAYDLVSFPARLRQSLSLMAEIKRASPSKGMIAENACAPAQAREYAKAGASVISVLTEPEWFKGSIDDLRAVRRPAIL
Sachce RIYARRKIDVNEQSKPGFQDLQSNYDLQDFYTVLSSSAVVLAEVKRASPSKGPICLKAVAAEQALKYAEAGASAISVLTEPHWFHGSLQDLVNVRRPCVL
Schipo KIHAQRLIDIAESKRPGLGDLQTYLNLINFYERLKQSPALMAEVKRASPSKGDIKLDANAAIQALTYAQVGASVISVLTEPKWFKGSLNDLFVARRPAIL
Ustma RIHVQRLKDIAATKAPGFRDLDIALSLINFPERLQRHPGVMAEMKRASPSKGDIDPTAHAGAQALAYARGGASVISVLTEPKWFKGTMHDLSLARRPAIL
Agabio KIYKQRLLDVDQAQAPGSQDLQTLHSLIDFTVRIRQGPAMFAEIKRASPSKGPIALNINPAAQALKYALAGAHTISVLTEPKWFLGSLHDMLHARRPAIL
Chlaf NIVAYKKQEVERLKEVCTPDHYLSQILEHFATALKGPLSIIGEIKRQSPTRGKIGCIDNPADLALKYCCGGAAAISVLTDTRGFGGSFLDMQQVSHVSVL
Chlac NIIAYKKQEVERLKEVSSKNHYLSQILEQFATALKEPLSIIGEIKRQSPTRGKIRSIDSPADLALKYCCGGASAISVLTDTQGFGGSFLDMQQVNHVSVL
Coxbur AILKNKKQEIAHLKAFSSDAFSLS---KSFKRIISSTTTIIAEIKRRSPSKGHLAEIADPVALAKQYVQGGAAGVSVLTDKLAFDGSIRDLQQVS-VAVL
Cme3 EILRRKQQEVASLKAVAQAEHPLQRRLRQFSSAIRKPISVIAEIKRRSPSSGLIAEIPDVRQLSDLYYNGGAAAISVLTD-AAFDGTLDDLQTVVPCPVL
Bacce KIVEQKKKEVAELYEYTP------KRMTHSLVEAFTVIAEVKRASPSKGDINLHVDVRKQVKTYEECGAGAVSVLTDGQFFKGSFYDLQTAR-IPLL
Arab1 EIVWHKDKEVAQMKEKPLLKKALDNVPKDFIGALRSAPGLIAEVKKASPSRGILREDFNPVEIAQAYEKGGAACLSVLTDDKYFKGSYENLQAIMKCPLL
Arab2 EITWYKDVEVSRMKENPLLKKAVEDAPRDFVGALRMAPGLIAEVKKASPSRGILKENFDPVEIAQAYEKGGAACLSVLTDQKYFQGGFENLEAIRKCPLL
Oryza QIIWDKEVEVSQRKAKPLVIESSQHAPRDFVGALTAAPALIAEVKKASPSRGVLREDFNPVEIAQSYEKNGAACLSILTDEKHFQGSFENLETVRKCPLL
Nostpu EIVLHKRQEVAQMQQLPLLQQQLITAPRNFLAALQQNPSLIAEVKKASPSRGIIRADFDPVAIAQAYERGGAACLSVLTDRKFFHGSFDNLRNVRALPLL
Proma KILWEKDREVKVSREVPLLKAQINNLPKDFLGALRQSPAVIAEIKKASPSKGVIRENFDPIEIALAYKLGGATCLSVLTDKSFFQGGFEVLVQVRDLPLL
Synel EIVWYKEREVNAWRELPLLQNQVRGLTRDFLAALRQAPAVIAEVKKASPSKGVLREDFDPVAIAQAYAANGAACISVLTDEKFFQGGFENLQRVRDVPLL
Theel KIVWHKEQEVEQLRELPLLQRQVLEAPRSFLAAVQNPPALIAEVKKASPSKGIIRPNFDPVAIAQAYVAAGATCLSVLTDSEFFQGSFEYLAQIRAVPLL
Ralme KILAVKADEVAAARKRDLLRAEAESLRRGFERALRDKAGVIAEVKKASPSKGVLRENFVPEAIAESYAAHGAACLSVLTDVNFFQGHAEYLKRARPLPAL
Nitmu EIVATKRQEIAALKAGSLLRRQAEALPRDFIGALRKKPAVIAEIKKASPSKGVLREQFDPAAIAVSYGQHGAACLSVLTDQQYFGGSAEYLKQARDLPVL
Xylfa KIIAWKVEEIAERLLVSQLVARCADLPRGFAGALQATPAVIAEIKKASPSKGVLREDFRPAEIAISYELGGASCLSVLTDVHFFKGHDDYLSQARTLPVL
Nitoc KILQRKAEEISERAQLSILSQKVEESPRGFFKALAKKSAVIAEIKKASPSKGLLCESFHPEAIAQSYEKGGAACLSVLTDQDFFRGSEADLQKARNLPVL
Mesloti KIEAYKRDEIAAAKAVPLIKARAKDADRGFLAALEAKFALIAEIKKASPSKGLIRADFDPPALAQAYEKGGAACLSVLTDAPSFQGAPEFLTKARALPAL
Brume KIEAYKREEIAAAKALALLKARTRDQSRGFLKALEAKFALIAEIKKASPSKGLIRPDFDPPALAKAYEEGGAACLSVLTDTPSFQGAPEFLTAARSLPAL
Agrous KIETYKLEEIAAAKAVSLLKAMAADQSRGFYKALRAKFGLIAEIKKASPSKGLIRPDFDPPALAAAYEAGGAACLSVLTDTPSFQGAPEFLTAARALPAL
Eryli EICAAKLDEVATRKALSDLADRIAAQSRGFEAALRAKFALIAEIKKASPSKGLIRADFHPADHARAYEAGGAACLSVLTDEDYFQGHADYLREARTLPAL
Mothe EILDYKRREVAAAREHPLLEKAAAAMPRDFGAALRRRLKVIAELKQASPSRGLIREDFDPEGQARSYIAAGAAAISVLTDQRFFRGRPEYLILVRTLPLL
Chloro RILETKAREVAELKKKPEYREACGDLPRDFRSAITSRINLIAEVKKASPSRGVLVEDFRPLDIAARYAELGASAFSVLTDSHYFQGSPDYLKAITSIPVL
Pelodi KILKEKRREVAAIKARPLYLELRSSLARGFQEALRGELRLIAEVKKASPSRGVIVHDFDPVRIALHYEEIGASALSVLTDRQFFQGSPDYLRAVSRLPVL
Metth DIIRSKKLEVKELMKTPLLRDD----IVSFPAAVGG-VSLICEYKRASPSMGRISE-RGLEEMMEVY-QDLADAVSIVTDGKYFKGSLDLLSGAT-KPLL
Metst RIIENKKVTLTQTKKKPLLKEEAIDIVFRFQEKLKN-TQLIAEYKPASPSKGNIST-LKAEDVIPIYDKNNVDMMSILTEETYFKSNLKNFNIANNKPLL
Clothe EIVAQKKIQLKKDMSITIWKQKIK--RLDFYGALKNNISIIAEVKKASPSKGIIKEDFDPLKIAKEYVESDIQAISVLTERNFFKGDEDYLVKIRPLPIL
Sulso RYLKGWLKDVVQLSLPSFSRQRP----ISLNERILEFTAIIAEYKRKSPSGLDVERDPIEYSKFMERYAVG---LSILTEEKYFNGSYETLRKIASIPIL
Thaps4 GYMWEKETQVDRLREISLLMSQCKAAMRDWISPVKQAFVIIPECKRMEPTSGSLRKRYDIPKLVKQLLSAGAPAISVNSDGVLFGGSMEDITIARSPPVL
Phatr4 GFMWDKETEVDRFREVPLLVSQCRLSQRGFVVPIQQMFVVIPECKRMEPTIGSLRRRYDLSKLARDFTFDGAVAISVNCDAVLFGGSLGDVTAARVPPIL
Thaps3 IERKRLAAALRRVYDPYNNPNAKQKTRERMLENG—VGSFIVDIKRK------DAGMVAEAMVRLGADVVFVNVDYHSYGGDLSELKSAVEAAVV
Phatr3 VSPMKLAKALRRVYEPSNNPDHVPLSDQTLQQVG--MGSFIVDIKRKSRPGEVFCNYDDAGMVAEAMVRLGADAVFVNTDYQAYGGDMTELKSAVSAAVV
Plber RIIENKKYEVTKLLENADSPLQIRLKYNKLSESLKIILSMVADIKRKCSTNKAFLNLTNPGEASLMLHKIGFDVLIVNIDSLSAQGTLNDLSDVIRPAVV
Plyoe RIIENKKYEVTKLLENADSPLQIRLKYNKLSESLKRILSMVADIKRKCSTNKNFLNLTNPGEASLMLHKIGFDVLIVNIDSLSTQGTLNDLSDVIRPAVV
Plafa KIIENKKYEITKLLENCDNPLQIRMKYNKLSESLKRSLSLIADMKRKCSIEKEFLNLSNPGNVSLLLHEIGFDVLIVNIDELSTQGNINDLKDIIRPAIV
Tann KMITDKYSEVDQLIEHKDDKLQLRLNYLKLSDMLSRNLCVIADMKRRTHNPRCVLSYTDAGEVALNMASVGFDVIFVNVDQKNYGGHINDLNKVFRPAIV
Theipa ------MLRRNLCVIADMKRRTHNPKCVLSYTDAGEVAMNMASVGFDVIFVNVDQINYGGHINDLNKVFRPAIV
Cme1 EPLAAQQEELVDWLPRKFVRDELSNREEAPSRDIRAKLSVVVDLKRRTSEPARLNMYAAAEILSDLRAEIPLDGIMVSVDSELYDQDCASLSDFAAPPII
Cme2 ELVWAKEREIEVRRQMPLMKSRLEKSD-DLNSVFWDS-VLFFQYQRC------PEDTESAPILANVAEEKNARAVVVNAESRRFYGSYEDIRRVHRCPVI
SSDLIVDEIQIARAADAGAKAVTVTHGVVVSQFIKNARVLGLETIVNVGTAEEAQSAVDVGISVTGVDGADNKFAVIEPEGRT----LAKDNKALEEVEE
NNDLIVDELQVARSAAYGCAALVLNLPTLTKVLLKATKAVDLEAIVAVSSKEEAQTAIDSGLSILHLGTVEDMVAAVDPEGQQ----LANNDQQLQEIED
AKDFITSRYQIAQAAASGADTVLLIVATTLKDLILYSRSLNMEPLVEVHALVELDVALEAGAKVLGVNNRNLHTFQLDLANTDKIAGTLCALSGMSSAHD
RKDFVINEYMIAEAAAKGADTVLLIVAVLLTRLIGFSRSLGMEPLVEVHADTELDVAIKAGAKVIGVNNRNLHTFQMDLGTSERVARTVCALSGMSTAMD
RKDFIIDVYQLLEARAYGADLAALIMN---VDVEFATHNLGMCALVEVNSVEELDIALAARSRLIGVNNRDLRSFKVDMSTTARVADTLFALSGIRSHAD
RKDFIIDVYQLLEARAYGADCVLLIVALLLNELIEVTHKLGMCALVEVNSVKELDIALAARARLIGVNNRDLRTFKVDLNTTARVADTLFALSGIRSHAD
RKDFIIDVYQLLEARAYGADCVLLIVTLLLIELIDATHNLGMCALVEVNSVQELDIALAAKARLIGVNNRDLRTFKVDMNTTARVADALFALSGIRSHTD
RKEFIFDEYQILEARLAGADTVLLIVKMLLHRLYNYSLQLGMEPLVEVQNAEEMTTAVKLGAKVIGVNNRNLESFEVDLNTTGRLRSIICALSGINTHDD
RKEFIFDEYQILEARLAGADTVLLIVKMLLERLYKYSLSLGMEPLVEVQNTEEMATAIKLGAKVIGVNNRNLESFEVDLGTTGRLRSFLCALSGINTHQD
RKEFVFDEYQILEARLAGADTILLIVKMLLTRLYHYSRSLGMEPLVEVNTPEEMKIAVQLGAEVIGVNNRDLTSFEVDLGTTSRLMDIVCALSGISGPKD
RKEFIFSKYQILEARLAGADTVLLIVKMLLKELYSYSKDLNMEPLVEVNSKEELQRALEIGAKVVGVNNRDLHSFNVDLNTTSNLVELLIALSGITTRDD
RKDFIIDPYEIMEARLNGADSVLLIVAMLLESLYKFSKSLGMEPLVEVNCAEEMKTAIELGAKVIGVNNRNLHSFEVDLSTTSKLAEILAALSGISSPAD
RKDFIVDTYQIAEARLAGADTVLLIVAMLLKELYDYSLSLGMEPLVEVNNPEEMTRAIRLGAKVVGVNNRNLHDFNVDMETTSRLADILCALSGISGRSD
RKDFILSRYQVLESRIWGADSILLIASMLLRDLYSYALELGMEPLVEVNNAQEMELALSLPAKVIGVNNRNLHDFKVDMNTTSRLSDVLCALSGIASRND
RKDFILDPLQLAEAIFFGANAVLLIVSVVLKFLIQEAHRLGLEVLTEIHDFSELELALEAEASIIGINHRNLKTFEIDLNLSESLKPITVAESGIHHPIQ
RKDFILHPLQLAEAIFFGAHAVLLIVSVVLKFLIQEAQRLGLEVLTEIHDLSELELALEAEAPIIGVNHRNLQTFEIDLNLSEILKPITVAESGIHHPTQ
RKDFILDPLQIEEAAVAGADAILLIVAILTKILLQKAHECGLEALVEVHNRQELDQAIEIGAEIIGVNNRNLTTFSVDPNNALKLKPISVAESGIHTVSD
RKDFIIDEIQIAEAAERGAAAVLLIVAAILRKLLEATHRHGLEALVEIHDEEELEIAKAAGAEIIGTNARDLRTFNVDLNRCFELIGIPVAESGIHDVMD
CKDFIIDKIQIDRAYEAGADIILLIVAALLKELYSYVREKGLEAIVEVHDEEELEIAIELNPHVIGINNRNLKTFEVDLGQTEKLGKLWISESGIHSKED
LKEFIVEAWQIYYGRSKGADAVLLIASVLIKYMIKICKILGMATLVEVHDEREMDRVLAIEVELIGINNRNLETFEVDLGITKKLL?LVVGESGLFTPED
CKEFVVDPWQIYYARTKGADAVLLIAAVLITFLLKICKKLSLAALVEVHDEREMGRVLGIEIELVGINNRSLETFEVDISNTKKLLAIVVGESGLFTPDD
CKEFVIDIWQIYYARSKGADAILLIAAVLIKYMLRICKNLGMTALIEVHDERELDRVLKIDVQLIGINNRSLETFKVDTSNTKTLLELVVGESGLFTPDD
CKEFIIDPYQIYLARTAGADAVLLIAAILLQNFLQVIHDLGMNALVEVHTLAELDRVLKLDLHLVGINNRNLEDFTVDLGITQQLLATVVSESGLYTPAD
CKEFIIQPYQIYQARVAGADAVLLIAAILLLYLRKVAISLGLTILVEVHDSNELKRVLDLEFPLVGINNRDLKTFNTDLRTTKEVVKLLVSESGLFNSAD
CKDFVIYPYQIYKARLLGADAVLLIAAILLRYFLKIAHSLGLNALVEVHSLPELERVLALDLRLVGINNRNLKTFVTDLAVTEHLAALLVSESGLFTGAD
CKDFILYPYQMFLARVRGADAVLLIAAILLRYFLRIAHGLGLTALVEVHTAEEMERVLALEVQLVGINNRNLMDFSVDLATTEKLLALVVSESGIHQRAD
RKDFMVDMYQVYEARTWGADCILLIVSALMAELEACAHELGMDVLVEVHGGEELDSALRLKTPLLGVNNRNLRTFEVSLDNTLDLLPLVVTESGILGPDD
RKDFMLDEYQVAEARLMGADCILIIVAAFMRKLESLAHALGMAVLVEVHDGAELELALELASPLIGINNRNLHTFETRLDTTLHLLDMAITESGVLTPDD
RKDFTIDPYQVYEARVLGADCILLIVAALLVDLSGLALQLGMDVLVEVHDIDELERAIQISAPLIGINNRNLSTFNVSLETTLTMKGLLVSESGILTSAD
RKDFIIDPYQVYEARVIGADCILLIVAALMLELAQLAEHLNMDVLIEVHDSEELERALVLDTPLIGINNRNLKSFETRLETTFELLSLVVTESGIHSPAD
RKDFLFDPYQVYEARAWGADAILIIMASVASALEATAFELGMDALIEVHDEAETQRALKLSSRLIGINNRNLRTFETSLETSERLATLLVSESGIFTHDD
RKDFLFDPYQVYEARSWGADCILIIMASVAKELEDTAFALGMDALIEVHDEAEMERALKLSSRLLGVNNRNLRSFEVNLAVSERLAKLLVGESGIFTHED
RKDFMFDTYQVHEARAWGADCILLIMASLAKRLEDEAFALGMDVLIEVHDAEETERALKLTSPLLGINNRNLRTFEVGLETSEKLAGLLVGESGIFTFED
RKDFMVDPWQCAEARSMGADAILIIVAALMHEIEAAALEHRMDALVEVHDEEEMSRAHALQSRLVGVNNRDLKTFTTDLATTERLAPLLVGESGIDNHAD
RKDFIIDPYQIYEARALGADAILLIVAALLVEYLDLARRLGLAALVEVHTAPELEVALRAGAGIIGINNRDLHTFKVDLQTTGRLRSVVVSESGIRSRAD
RKEFIIDESQIYETRLMGADAALLIVAALLRDYLQLFAELGLHALVEVHDRRELDIAIEQGSTIVGVNNRDLRDFTVDLMTSVNLKRLSVAESGLKRRDD
RKDFIVDESQIFESRLMGADAILLIVAALLRDYLQTASETGLDVLVEVHDRRELDTAAEEGATMIGVNNRNLKDFSVDPATSSDLRPIAVSESGLKTAGD
MKDFLVDEYKIYQARASGASSVLLITGVFLEAGIQKCRELSMEPLVECHTSLDIFRALEAGAEIIGVNNRDLETFEVDLERTHALAPILVSESGVRGPED
RKDFIIDEYMIYEAAVNNASGILLISGICIEEYLNISRNLGLDAVVECHTLEDIESVVDYNPEIIGINNRNLDDFTINLKTTKELKKYLISESGVKTIED
RKDFIIDLWQIYESRYIGADAILLIVSLLLKKFQIVANILGMQCLVEVHDERELERALESGARIIGINNRDLRTFEVDLKNTEKLMNVVVSESGIKDTED
MKDFIVKESQIDDAYNLGADTVLLIVKILLESLLEYARSYGMEPLIEINDENDLDIALRIGARFIGINSRDLETLEINKENQRKLISVKVAESGISERNE
ASDLLLYPYQLYKLRLAGADAVTMVVGALLLYLTKIASTINLQVIASVTSEVQIDMLSKLGISALVVSNRDLETFDFDNSGEQALSLLVEGRVGMVDGEG
ASDLILYPYQLYKLRLAGADAINLLVGALLSYLTKIASSLQLQSFATVTSEVQLLEVASLQIDGIIVSNRELEDFSFDMTGEQALYLLAEGRVGIIDRPQ
MKDIVVDEIQLGLAKDAGADGIVLISSVLLENFLNLATIIGLETIVECHTYNEVQAALDSLAQNIMVSNRDRITGQLLIKLAGMFPGITIAGGGISTPEQ
MKDIVVDEIQLGLAKEAGADGIVLMSSVLLENFLNLATMIGLETIVECHTHDEVQRAIDILAPNILVNNYDRVAQELHIKLAGMFPGICLAAGEIETTDQ
VDDVIIHPIQIALAVENKADGVILNLSYLLEDMLNYCVNLGTQAIVEVHDYNDIYYATQCGSYILMINEFDFINNRYEIKAISYSIPITIAKINVSDINY
VDDVIIHPIQIALAVENKADGVILNLSYLLEDMLNYCVNLGTQAIVEVHDYNDIYYATQCGSYILMINEFDFINNRYEIKAISYSIPITIAKINVSDINY
VDDIIIHPIQIALAVENQADGVILNLSYLMEEMLQYCVNVGTQAIVEVHDYNDIYFATQCGAYILMINEYDFYNNIYEIKAINYTIPITIAKVNTNEVNY
MKDIILHPIQIAQAVEMRADGVILNAAILLKDLLTSCITMGIEAIVEVHTTADALRSAEIGFTNFMINQWDRIKNILYLEIKEVLPETTIAAGGIMTMEQ
MKDIILHPIQIAQAVEMRADGVILNAAILLQDLLTACVTMGIEAIVEVHTTADALRSADIGFNHFMINQWDRIKNKLYLEIKEALPDITIAAGGIMTMEQ
YKDIIVHELQLAEAAEAGAKAACLVAGACLEKLLNAAYILGMEVLVEVHTEEELQYAFEAGAGIYVFTDVDRARRRVVVDLMDKYRVVCLGSGRIETLEQ
CKDIFVYPWQLYDARFYQADACVLIMKVLTLYFAKASVALGMAPIIEVHDAAELDALLTAKSDDGRISILLLAKRDLD-SLVPMLSDLLERFAPEARRLG
AWICRDKGFNSVWVSDALYKGNDPVEHP
AWSVRDQGFNSVWVGEALYKGADPNEQP
VGRYRSVGVGMCLIGESLMRSDPSLAIK
VDRYRQVGLGMCLIGESLMRTDPGQAIA
VVKYEKCGARGILVGEYLMKGDVATTVK
VVKYEKCGARGILVGEFLMKGDVATTVK
VVKYEKCGARGILVGEYLMKGDIATTVK
VLMNKRDGVNAVLVGEAIMKPDASVFIS
VLDCKRDGVNGILVGEAIMRPDATQFIR
VEAYKKEGVKAILVGEALMRSNTSAFVA
AEKYKKEGVHGFLVGEALMKTDVKKFIH
VAHYSSQGVSAVLVGESLMRSDPAAFAR
VAKYLKEGVGAVLVGEALMRKDKRAFIH
VERYKGQGVGAILVGESLMKKDVGEYIK
AKRMRELGFDAVLVGEALVRKDPALLIK
AKRMRELGFDAILVGEALVSKDPSLLIK
AKRYISAGYNAVLVGEALVKENPQQFIS
AWRLRDAGYSSILVGEALIRAESSKLPS
IIRVKKAGAKGVLVGEALMTSSIRTFFE
IAFVQEAGVKAVLVGESLIKSDPGKAIS
IAYVQAAGVKAVLVGESIVKNDPEKGIA
VAYVQNAGVSAILVGESLVKENPGQAIA
LSLVAEAGVRAVLVGESLVKSDVEQAVR
LEEVSSYGAKAVLVGESLMRPDIGLALK
LDRVTQAGAQAVLIGESLVKPDPGLALQ
VERMAQAGAQAILVGESLMRPAAMAHFS
VKRMRDADVHAFLVGEAFMRPEPGVELA
VALMRGHGVGIFLVGEAFMRPDPGTELA
VQRLRAAGVNAFLVGEAFMRTEPGESLR
VRKMKKNGVNAFLIGEAFMKEDPAAKLA
CLRLQKKDIGTFLVGESLMREDVTAATR
CLRLEKSGIGTFLIGESLMRHDVAAATR
CQRLEKSGISTFLVGESLMRDDVTAATK
CVRLSQAGVQTFLVGESLMRDDVEAATR
AALVAGAGVDAILVGEALMTGDVMAKMA
VLLMQDAGFDAVLIGEGLLAEELRQFSW
IAHLRDAGFHAVLIGEGLQTKELTGLTW
AEILAGYGADALLIGTAPMSQNPRELLE
AKKLKEYGADGILIGTSILQNTEKEIEK
LKYLKELGVDAVLIGETFMRRSISEKIR
IEELRKLGVNAFLIGSSLMRPEKIKEFI
VKTLKDAGAFGAIVGGGLAFSDDEASNV
ITELREAGAVGAIMGGALAVG------
MKKHLAVGYDGVVVGKTVMGARAPEFIR
MKRHLAVGYDGVVVGKAVMGPAAPEFIR
VEKLGSLGYDSICLEKKLIDEDLEVFVQ
VEKLGSIGYDSICLEKKLIDEDLEVFVQ
IEKLGSLGYDSICLEKKLIDDDLQQFVT
VHMLALAGIDAVCFGRRLVYPDVPDFIN
VHMLALAGIDGVCFGRRLVCPDVPDFIN
VEQLRDVGADGIVIGGKLMAIARGDVEA
IVELPREHAPNASLLRGLREGADAVLAS
Indole-3-glycerol phosphate synthase nucleotide sequence alignment
52 686
Thaps1 TTGCAAAAGATTACGGCTGCCGACGTGGAGGCATACACGTCGTCATCCTCCTCGATCGAAACAGCTGCACACAGACTGCACGGGCCTCTTCCATTGCAGC
Thaps2 GGTGTCTTTGTACGCAAGGCCGACGTGCGCAAATACTGTGAGGAGGGACCCTCGCTCCTGAAACAAATCCCCGACGAGGCTGCTGCAACAACAACGGTGG
Thaps3 GCCCAAGAGGGTATTGAACGCCTCGCTGCCGCATTACGTCGAGTCTACGATGATCCTTCAAATCCCAATGCCAAGCAAAAGACATTGGAGAAGGAAGAAC
Thaps4 TTGGAGGGATACATGTGGGAGCAGGTGGATCGTCTGAGGGAACGTATTTCGTTGATGAGTCAGTGTAAAGTGGATCCAGCAAAACCTCGCGACTGGATTA
Phatr1 CTTCAAACGATTACAGCAACGGACTATCAATCCCTTTTGGAAAGTAGCACGTCTGCCCAAGAAGCAAAAGCCTTTGATCATGGAATATTGAATTTGCAGA
Phatr2 GGTGCGCTCGTAAAGAAAGCCGAACTCCGGAAATACATTGAAAGTGGCGTTGAAAAGTACAATCTACTCCTGGCAACGATCAACGCCGATCAAGTTGTGG
Phatr3 GCACAGGAATCGGTCTCGCCGCTCGCCAAAGCCTTACGAAGAGTTTACGAGGATCCAGCCAATCCCGACCATGTTCCTCTATCTAAGAAGCGCGCTCAGA
Phatr4 CTGGAAGGATTTATGTGGGACGAAGTTGACCGCTTCCGAGAGCGGGTACCTCTGGTATCTCAGTGCAGAATGGATCCAAAAGCTCCTCGAGGCTTCGTTG
CME1 CTCGAAAAGGACGTCATGCGC------GAGTACGTGCGCGATGAACTGAGCCCCCTGTCGTTTCGATTAAGCTACATGTCATCGCGAGATATCCGTG
CME2 CTCGAGGAGCTCGTTTGGGCCGAAATTGAAGTGCGTCGGCAGCGCATGCCTCTGAAAAGCCGCTTAGAGGTCGCTCCGGTGCGC------CTCA
CME3 CTCAAAGAAATACTACGGCGAGAAGTTGCCAGTCTGAAGGCTGAGGTCGCCCAGCCGCTGCAGCGTCGCCTCGATGCAGCCGCGTCACGTCAGTTTAGCA
Pram CTGGAGGAGATCGCGGCGCAGGACGTGGAGGCCGCCAAGCAGGTCGTGACGCCCGTGAAGAAGATCGAAGAGAGCGTCTACGGCGCCCTGCGCGTGCTCG
Psoj CTGGAGGAGATCGCGGCGCAGGACGTGGAGGCCGCCAAGCAGGTCGTGTCGGCGACCAAGAAGATCGAGGAGGCCGTCTACGGCGCGCTGCCCGTGCTCG
PparOK CTCGAGGAGATCGCGGCGCAGGACGTGGCGGCCGCCAAGCAGGTCGTGTCAACTGCCAAGAAGATCGAGGAGAGCGTCTACGGCGCCCTGCCCGTGCTCG
AgabioOK TTGGACAAAATCTACAAACAGGACGTTGACCAAGCGCAGGCAACACCAGGGTCTGATCTTCAGACTCTCTTGAATATCTCCCCACAGATCGACTTCACTG
Ustma CTCGAGAGAATTCATGTGCAGGACATTGCTGCCACCAAGGCCATTCCAGGCTTCGATCTTGATATAGCTCTGCACCTCGCACCTCTGATCAACTTCCCAG
Gize CTGCAGCGGATTTACGCAAACGCAGTAGACGCTCAGAAGCAGATTCCTTCTCAGGACCTCCAGGTCGCTCTGAATGCCGCCCCTCAGATTCCTCTTGTTA
Schipo CTTGAGAAAATCCACGCGCAGGACATTGCTGAAAGTAAAAGAAAGCCTGGTCTTGATTTGCAAACATATCTTAACATAGCACCTTGTATAAATTTTTATG
Asory TTAGAGAAGATATATGATCACGCCGTTGCCATCCAGAAGACTATTCCTTCACAAGATCTTCAGGCTGCTCTCAACATTGCTCCTCAAGTCTCATTCCCTG
Sachce TTGGACCGTATCTATGCTCGGGACGTCAATGAGCAGTCTAAAATCCCAGGTTTCGACTTACAATCTAACTTAGGTCTTGCCCCATTACAGGATTTCTACA
Neucr CTTCAAAAGATTTACGCCCACGCTGTGGATGCTCAGAAGCAGATTCCTTCCCTGGACCTCCAAGCCGCTCTGAGCATCGCCCCTCAAATCTCTCTTGTCG
Arab1 TTGGAAGAGATCGTATGGCACGAAGTTGCTCAGATGAAAGAGAGAAAGCCTCTTAGCTTGAAGAAGGCTCTTGATAATGTTCCTGCTAAAGACTTCATTG
Arab2 TTGGAGGAGATCACATGGTACGAAGTTTCCCGGATGAAGGAGCTAAATCCGCTTGTGCTGAAGAAAGCTGTAGAGGATGCTCCTACTAGGGATTTTGTTG
Oryz CTCGAGCAGATCATCTGGGACGAGGTGTCCCAGAGGAAGGCGAAGAAGCCGCTGAAGGTGATCGAGTCGAGCCAGCACGCGCCAGCCAGGGACTTCGTCG
Coxbur TTAAAAGCAATTCTTAAAAATGAAATTGCGCATTTGAAAGCGAATTTCTCTTCCTCTCTTTCA------AAAAAATCCTTTAAAA
Chlafe TTAGCAAACATCGTTGCTTATGAAGTAGAACGGCTCAAAGAAGAAGTTTGCACATATCTGAGTCAAATTCTAAACCAAAATCATAGAGAGCATTTTGCCA
Chlaca TTAACAAACATTATTGCTTATGAAGTAGAACGGCTCAAAGAAGAAGTTAGCTCATATCTGAGTCAAATTCTAAAGCAAAATCACAAAGAGCAGTTTGCCA
ChloteOK CTG---CCTCAACTCCTCGCTACCCTCGCCGATGAGCACAGCGTCAAAGCCGGCCATGAGCAGCACATCTTTCAGGCCGCTTTCGGAGAGCACGCCTTCA
PelluOK CTTGCAAAGATCCTGAAAGAGGAGGTCGCGGCCATAAAGGCTGAGCGCCCTCTTCGTTACCTTGAGCTTCGCAGTTCGCTTGCTGCCAGAGGCTTTCAGG
Nitoc TTGAAAAAGATTCTCCAGCGGGAGATCAGCGAACGTGCCCAAAAGCTTTCAATCGAACTTAGCCAGAAAGTGGAAGAATCGCCCGTGCGCGGTTTCTTTA
MoortOK CTGGCGGAGATCCTGGACTACGAAGTGGCGGCCGCCAGGGAACAACACCCCCTGGACCTGGAGAAGGCAGCGGCAGCCATGCCCACCCGGGACTTCGGGG
Clothe CTGGATGAAATTGTAGCGCAACAGCTTAAAAAAGATATGAGCAGGATTACCATA---TGGAAGCAAAAAATTAAAAGACCGGGACCTCTGGACTTTTACG
Xylfa CTCACTAAGATTATTGCGTGGGAGATTGCCGAGCGTCTTTTGCATGTCTCACAGGAATTGGTTGCGCGTTGTGCCGATTTGCCGCCCCGTGGTTTTGCTG
Bacce TTAGATAAAATTGTAGAACAGGAAGTTGCGGAGTTATATGAAATATATACACCA------GTAAAAACAAAAAGAACACATTCACTTGTAG
Ralme CTGGAAAAGATCCTGGCCGTGGAGGTGGCAGCCGCGCGCAAGAAGCGCGACCTGAGCCTGCGTGCCGAGGCCGAGAGCCTGCGTCCGCGTGGCTTCGAGC
Nitros CTCGGTCCCGGGATCGGGAGCGGCTTCGCCCACGAGGAAGATGCCTACCCCGTGCATCAGCGCAACATCGAGTACCCCGCTTTCGGCCATACGGTCTGGG
Agrob CTCAAAAAGATCGAGACCTACGAAATCGCCGCCGCCAAGGCCAAGGTTTCGCTGGATCTGAAGGCCATGGCCGCAGACCAGAGCCCGCGCGGTTTCTATA
Meslo CTGCGCAAGATCGAGGCCTACGAGATCGCCGCCGCCAAGGCACACGTGCCGCTGGAGATCAAGGCGCGGGCGAAGGACGCCGACCCACGCGGCTTTCTTG
Brume CTTCGCAAGATCGAAGCCTATGAAATAGCCGCCGCCAAGGCGCGTCTTGCCCTTGAACTGAAGGCCCGGACGAGGGATCAGTCACCGCGCGGTTTTCTGA
Eryli CTTGAAGAGATATGCGCCGCCGAAGTCGCGACGCGCAAGGCTGCGCTGTCGGATGAATTGGCCGATCGCATCGCAGCCCAATCCCCGCGCGGGTTCGAAG
Proma CTTGAAAAAATCTTGTGGGAGGAAGTAAAAGTTTCAAGAGAGAGAGTTCCTCTTGAATTAAAGGCTCAGATCAATAATTTGCCTACTAAAGATTTCTTAG
Nospu CTTGAAGAGATTGTGTTGCATGAAGTTGCACAGATGCAGCAGGAACTTCCTCTATCTTTGCAGCAGCAGTTAATTACGGCTCCAGTCCGAAATTTCTTAG
Synel CTCGAAGAGATTGTTTGGTATGAGGTGAATGCTTGGCGAGAGCAGCTGCCGCTGCAGCTCCAGAACCAAGTCCGCGGCTTGACCCCCCGTGACTTTTTGG
Theel CTTGAGAAGATTGTTTGGCACGAGGTGGAGCAACTGCGGGAAGTCCTGCCCTTGGATTTACAGCGTCAGGTGCTGGAGGCGCCAGTGCGTTCTTTTTTGG
Sulso ATGCCACGTTATCTTAAAGGAGACGTCGTACAATTATCTTTAAGGAGGCCCTCA---TTTAGGGCTTCA---AGACAAAGGCCAATTATTTCCTTAAACG
Metth CTCAGGGATATTATAAGGTCAGAAGTTAAGGAACTCATGAAAAAAACACCCCTCATCCTAAGGGATGAT---ATAACAGTCATGCCAGTGAGCTTCCCAG
Mesta ATAGATAGAATAATAGAAAATACACTCACACAAACTAAAAAACAAAAACCCCTAAAACTTAAAGAAGAGATAGTTTCTACAAAATTATTTAGATTTCAAG
Plabe ATAAACAGAATTATTGAAAATGAAGTAACTAAGTTATTAGAAGAAAATGCTGATCCATTGCAAATAAGGTTAAAGTATCTCCAGAATAATAAATTATCTG
Thean TTTTATAAAATGATTACGGACGAAGTAGATCAACTAATAGAACAACATAAAGACAAACTTCAGTTGAGGTTAAATTATTTGCAAAACTTGAAGCTTTCCG
Playo ATAAACAGGATTATTGAAAATGAAGTAACTAAGTTATTAGAAGAAAATGCCGATCCATTGCAAATACGATTAAAATATCTTCAGAATAATAAATTATCTG
Thepa ------
Plafa ATAAATAAGATTATTGAAAATGAAATTACGAAATTATTAGAAGAAAATTGTGATCCATTACAAATAAGAATGAAATATTTACAAAATAATAAATTATCCG
ATCAAATCCAACAAGCACTTGCGGCGGAGTTCAAGCGCGCCTCTCCCTCCAAAGGAGATATTGCACCCCATTTGAACGCCGGCGAACAAGCGAGCGTGTA
GTGAGGTGCAGGGTTCAATCATTGCCGAGTACAAACGTAAATTGGAAGGAAGTGGCTTTCTCTCTGAAATCCTCCCTCCAGAGATTTTATCTCCCGTCTT
GCATGTTGGAGAATTCGTTTATTGTGGACATTAAACGAAAGTCCCCAGGA---GAGCAGTTTGCCCGATACGATGATGCAGGAATGGTAGCTGAGGCGAT
GTCCTGTGAAGCAAGTCATCATTCCGGAGTGCAAAAGAATG---CCGACTTCTGGCAGCTTACGCAAACGATACGACATTCCAAAGCTTGTGAAACAACT
CTGTCATCCAATCCGCGTTGGCGGCAGAGTTTAAACGGGCGTCGCCGAGCAAGGGTGACATTGCTACCCATCTGAATGCCGGTGAGCAAGCCGTCAAGTA
GTCCCTTGCAAGAAACGGTGATTGCCGAATACAAGCGCAAGTTCAATCCAACCGGACTTATTCACGAAATGCATGTACCGGAATTGCTCTCCCCCTCCTT
CACTGCAACAAGTTAGTTTCATTGTCGACATTAAGCGCAAGAGTCCAGGC---GAAGTTTTTTGTAATTATGATGATGCTGGTATGGTAGCAGAGGCTAT
TACCAATCCAGCAAGTTGTTATTCCAGAATGCAAGCGAATG---CCCACGATCGGGAGTTTGCGACGTCGCTATGATTTGAGTAAGCTTGCTCGCGATTT
CCAAGTTGCGAAGATCCGTGGTGGTCGACCTCAAGCGACGTTCTCCTGCGTACGCCGCTGCCGAGATACTTTCGGATCTAAGAGCGGAAATACCACTTGG
ACTCGGTTTTCTGGGTGCTCTTTTTCCAGTACCAGCGCTGC---CCGGAAGAC------ACAGAGTCTGCACCTATTCTGGCAAATGTGGCTGAAGAGAA
GCGCAATTCGTAAATCTGTTATTGCGGAGATTAAGCGACGGTCACCGAGCTCTGGTTTAATTGCCGAAATCCCGGACGTGCGACAGCTCAGTGACCTGTA
ACCGCCTCAACGCGGCGCTGGCAGCCGAGTTCAAGCGCGCCAGCCCCAGCAAGGGAGATATCGCCACGGGACTCAACCTGCGCGAGCAAGTCAAGTCGTA
ACCGTCTGAACGCGGCGCTGGCCGCCGAGTTCAAGCGCGCGAGCCCCAGCAAGGGAGACATCGCCACGGAGCTCAACCTCCAGGAACAAGTGAAGGCCTA
AGCGTCTTAACGCG------GAGCAGGTTCAGGCCTA
TACGCATCAGGCAAGCCATGTTCGCTGAGATAAAGCGCGCATCGCCCTCGAAAGGCCCTATTGCATTAAATATCAATCCTGCCGCACAGGCTCTCAAATA
AGCGTCTCCAGCGAGGAGTCATGGCCGAGATGAAGCGTGCCAGTCCCAGTAAGGGCGACATCGATCCCACCGCCCACGCTGGCGCCCAAGCGCTTGCATA
GCCGGCTACGGGAGGCTCTCATGGCTGAAATCAAGCGGGGTTCTCCTTCAAAGGGTATATTTGCTCTTGATATCTCTGCTCCTGCCCAGGCTCGAAAGTA
AGAGATTAAAGCAAGCTTTAATGGCTGAAGTCAAACGTGCTTCCCCTTCTAAAGGTGACATTAAGTTAGACGCAAATGCTGCTATCCAAGCTCTTACTTA
CCCGTCTGAGACAATCTCTTATGGCTGAAATAAAGCGCGCTTCCCCTTCCAAGGGAATGATTGCGGAGAACGCATGTGCCCCTGCCCAGGCTAGAGAGTA
CGGTGTTGTCATCAGTTGTTCTTGCTGAAGTCAAGCGTGCCTCTCCATCGAAGGGACCCATTTGTTTAAAAGCTGTTGCTGCTGAACAGGCTCTCAAATA
ACCGTCTTCGCAATGCTCTTTGCGCCGAGATCAAGAGGGCATCTCCCTCCAAGGGTGTCTTTGCGCTTGATATTGACGCTCCGTCGCAAGCTCGCAAGTA
GTGCTCTTAGATCGGGTTTAATAGCTGAGGTTAAGAAAGCTTCTCCAAGTAGAGGAATCCTGAGAGAGGATTTTAACCCTGTTGAAATTGCACAAGCTTA
GGGCTCTTAGGATGGGCTTGATAGCTGAGGTTAAGAAGGCTTCTCCAAGTAGAGGAATCTTAAAAGAGAATTTTGACCCGGTCGAGATTGCTCAAGCTTA
GCGCGCTAACGGCGGCGCTGATCGCCGAGGTGAAGAAGGCGTCGCCGAGCAGAGGCGTGCTCCGGGAGGACTTCAATCCGGTTGAGATCGCGCAATCCTA
GAATAATTTCTTCTACTATCATTGCCGAAATTAAACGGCGTTCTCCTTCAAAAGGACATTTAGCCGAAATTGCTGATCCAGTCGCCTTGGCAAAGCAATA
CAGCCCTAAAAGGATCCATTATTGGCGAAATAAAAAGACAATCTCCAACACGTGGAAAGATTGGATGCATAGATAATCCAGCAGATCTTGCATTAAAATA
CAGCTCTAAAGGAATCCATTATTGGTGAAATAAAAAGACAGTCTCCAACACGTGGAAAAATTAGAAGTATAGATAGTCCAGCAGATCTTGCATTAAAATA
GGATACTCCCGCTTGGTCATGAGATCGACGGTGAAGTCGCG------CAGATCGCGGTTGTTGACCCCGACAATCGTAGAACCCTGCTCGATGGCAATAT
AAGCGCTTCGGGGACGCCTCATTGCTGAAGTCAAGAAAGCCTCTCCCTCAAGGGGAGTCATTGTCCATGATTTCGATCCGGTCCGGATCGCCCTCCACTA
AAGCCCTTGCTAAAGCTGTCATCGCTGAAATCAAAAAGGCCTCTCCTAGTAAGGGGCTTTTATGCGAATCTTTTCATCCGGAGGCGATTGCCCAAAGCTA
CCGCCCTGCGGCGCAAGGTTATAGCTGAGCTAAAACAGGCTTCTCCCTCCCGGGGCCTTATCAGGGAGGATTTTGACCCTGAAGGTCAGGCCAGGAGTTA
GAGCTTTAAAGAACTCCATTATTGCCGAGGTAAAGAAGGCTTCACCGTCAAAAGGGATTATAAAGGAGGACTTTGATCCGCTCAAAATTGCAAAAGAATA
GTGCGTTGCAGGCGGCAGTGATTGCTGAGATTAAGAAGGCCAGTCCTTCAAAAGGGGTGCTTCGCGAGGATTTTCGTCCTGCAGAGATCGCTATCTCATA
AAGCGTTACAGCAG---GTCATTGCAGAAGTAAAGCGAGCATCACCATCAAAAGGAGATATCAATTTACACGTTGATGTACGAAAACAAGTGAAAACATA
GCGCACTGCGCGACGGCGTGATCGCCGAGGTCAAGAAAGCCTCGCCGTCGAAGGGCGTGCTGCGCGAGAACTTCGTGCCGGAGGCCATCGCCGAAAGCTA
GGAATGCGGTCTAGCGTATCCAGGCGGGTTTCAAACGTATG------CAGGTTGCGGTTATTGATGCCGATCAGGGGAGACGCCAACTCCAGCGCCAGCT
AGGCGCTGCGCGCAGGCCTCATCGCCGAAATCAAGAAGGCCAGCCCCTCCAAGGGCCTCATCCGCCCGGATTTCGACCCGCCGGCGCTGGCCGCTGCCTA
CAGCCCTGGAGGCAGCCCTGATCGCCGAAATCAAGAAGGCGAGCCCCTCCAAAGGCCTGATCCGTGCCGATTTCGATCCGCCGGCACTGGCGCAGGCCTA
AGGCACTTGAGGCCGCACTGATCGCGGAGATAAAGAAAGCAAGCCCGTCCAAAGGCTTGATCCGCCCCGATTTCGACCCGCCAGCGCTTGCAAAGGCTTA
CCGCTCTGCGTGCCGCACTGATTGCCGAGATCAAGAAAGCCTCACCGTCGAAGGGTCTCATCCGCGCAGACTTTCATCCCGCAGACCACGCCCGCGCCTA
GGGCTTTGAGACAAGCTGTTATAGCAGAAATAAAGAAAGCAAGTCCTAGCAAGGGAGTGATTCGCGAAAACTTTGATCCAATAGAAATAGCACTTGCTTA
CTGCTTTACAACAAAGCTTAATTGCTGAGGTAAAAAAGGCATCACCTAGCCGTGGGATTATTCGGGCGGATTTTGATCCAGTTGCGATCGCTCAAGCTTA
CAGCACTGCGACAGGCCGTGATTGCCGAAGTCAAAAAAGCATCTCCTAGCAAAGGCGTACTGCGAGAAGATTTCGATCCCGTGGCGATCGCCCAAGCCTA
CGGCTGTGCAAAACGCGCTGATTGCTGAGGTCAAGAAGGCCTCCCCCAGTAAAGGCATTATTCGCCCCAATTTCGACCCGGTGGCGATCGCCCAAGCCTA
AAAGAATTTTAGAAGCTATAATAGCCGAATATAAACGCAAATCTCCCTCT---GGA---TTAGATGTTGAAAGGGATCCAATAGAATATTCAAAATTCAT
CTGCAGTGGGAGGATCTCTCATATGCGAATACAAGAGGGCATCACCATCAATGGGCAGAATATCAGAG---AGAGGCCTTGAAGAGATGATGGAGGTATA
AAAAACTAAAAAATCAACTCATTGCAGAATACAAACCTGCAAGTCCATCAAAAGGAAATATCAGTACA---TTAAAAGCAGAAGATGTTATTCCAATATA
AAAGTTTAAAAATATCAATGGTAGCAGATATAAAAAGAAAGAGTAGTAAACAAAATAATTTTCTTAATTTAACAAATCCTGGTGAAGCAAGTTTAATGCT
ATATGCTTAGTAGATGCGTAATAGCTGACATGAAGAGAAGAACACCAAGGGATAATAACGTACTTTCCTATACGGACGCGGGTGAAGTGGCCCTGAATAT
AAAGTTTAAAAAGATCAATGGTAGCAGATATAAAAAGAAAGAGTAGTAAACAAAATAATTTTCTTAATTTAACAAACCCTGGTGAAGCAAGTTTAATGTT
--ATGCTTAGGAGATGTGTAATTGCTGATATGAAGAGAAGAACACCAAAGGATAATAACGTACTTTCCTACACCGACGCGGGTGAAGTGGCCATGAATAT
AAAGTTTAAAAAGATCATTAATTGCTGATATGAAGAGGAAAAGCGAAAAACAACATAATTTTTTAAATTTAAGTAACCCAGGAAATGTCAGTTTGTTATT
TTACAAAGCGGGCGCTAGTGTGATTAGTGTCTTGACGGAGGGGAGGTGGTTTAAGGGCAGTTTGGCCGATTTGAGGGAGGCTAGACTGAGTACGGGTGAT
TCGCGAGTTTGGTGCTACTGCTGTTGCCGTTCTAGCTGATGAGCGTACGGGTGGATGTACCTACGATGACATCGTCGAGATGGTGGGTGAAGTCCTGCCC
GGTACGATTGGGTGCCGACGTGGTGTTTGTGAATGTGGATTATCATTCATATGGAGGAGATCTGTCCGAGTTGAAATCAGCAGTGCGAGGTGTGGAGGCT
CCTCTCTGCAGGAGCTCCTGCTATCTCTGTCAACAGTGATGGAGTCCTCTTCGGTGGATCAATGGAAGACATTACAATAGCGAGGGAAGCATTACCTCCC
TACCAAAGCAGGGGCCAATATCATTTCTGTATTGACCGAGTCCCACTGGTTTAAAGGGAGTCTGGACGACATGACACAAGCCAGACTCGAAACAGTTTCC
TCGCGAGTACGGTGCGTCGGCCATTGCCGTCATGGCCGATCCACGCATGGGGGGTTGCGACTACGATGACATCCGACACTTTGTGATGGAAGTCTTGCCG
GGTACGCTTGGGAGCCGACGCCGTCTTTGTAAACACCGACTATCAGGCCTACGGCGGTGACATGACGGAATTGAAATCGGCTGTCCGCGCCGTTTCGGCG
CACTTTTGACGGCGCTGTAGCCATTAGTGTGAATTGCGATGCAGTCCTCTTTGGCGGGTCTCTGGGCGACGTCACTGCAGCACGTGAAGCTGCTCCCCCA
AAGCAAATTTGGACTCGACGGGATTATGGTTAGCGTCGATTCGGAACTCTACGATCAAGACTGTGCAAGTCTATCAGACTTTGCGAGGTATTTTGCACCC
G------AACGATGCCAGAGCAGTCGTGGTGAATGCCGAGTCGCGCCGCTTCTACGGCAGCTACGAGGATATTCGGAGGGTTCATGAGGTCGTCTGTCCG
CTACAACGGAGGCGCAGCCGCAATCTCGGTACTAACAGAT---GCTGCCTTCGATGGAACACTGGATGATCTGCAAACCGTGGTGGGAAGATATTGTCCG
TGCGGACGCGGGCGCCAGCATGATCTCCGTGCTGACGGAGCCCAAGTGGTTCAAGGGGTCGTTGGAGGACATGATGGCGGCCAGAGAAGTGGTGATGAGC
CGCGGACGCAGGCGCCAGCATGATCTCCGTGCTGACGGAGCCCAAGTGGTTCAAGGGCTCGCTTGAGGACATGCAGGCGGCCAGGGACGTCGTGATGAGC
CGCTAACGCAGGCGCCAGTATGATCTCCGTGCTGACGGAGCCTAAGTGGTTCAAGGGCTCACTAGACGACATGATGGAAGCTCGCGAGGTGGTTATGAGT
TGCTCTGGCTGGAGCCCACACAATCTCTGTCTTGACTGAACCTAAATGGTTCCTTGGATCACTTCATGACATGCTGCATGCCCGTCAAGCAGTTCTCCCC
CGCTCGCGGAGGTGCCAGCGTTATCAGTGTTCTCACTGAGCCCAAGTGGTTCAAGGGTACGATGCATGACCTATCGCTCGCAAGACGAGCGGTCCTACCG
CGCCTTGGCGGGTGCGAGTGTCATTTCTGTTCTGACAGAGCCTGAGTGGTTCAAGGGCAGCATTGAAGATTTGAGGGCAGTCCGACAAGTTTTAATGCCT
TGCTCAGGTTGGCGCCTCAGTCATATCTGTGTTAACAGAACCAAAATGGTTCAAAGGCTCATTGAATGATTTGTTTGTTGCGCGAAAAGCTGTTGTAGCT
CGCTAAGGCCGGCGCCAGCGTTATTTCTGTCCTCACTGAGCCAGAATGGTTCAAAGGTAGCATCGACGATTTGCGTGCCGTTCGTCAGAGCTTAGTTACA
CGCAGAGGCTGGTGCATCCGCAATTTCCGTATTGACCGAACCTCATTGGTTTCACGGTTCGTTACAGGATTTAGTAAATGTGAGGAAAATCCTATTTCCT
TGCGCTTGCCGGCGCGAGTGTCATCTCGGTCCTGACCGAGCCAGAGTGGTTCAAGGGCAGCATCGATGACCTCCGTGCTGTCCGTCAGGTCCTTATGCCC
TGAGAAGGGGGGAGCAGCATGTCTTAGTGTTTTGACAGATGACAAATACTTCAAGGGAAGCTATGAGAACCTGCAAGCTATAATGGCTGGTGTGTGCCCT
TGAAAAAGGCGGAGCAGCATGCCTCAGCGTTTTGACAGACCAGAAGTATTTCCAGGGAGGCTTTGAAAACTTGGAAGCAATAAGGGCTGGTGTGTGTCCA
TGAGAAGAACGGCGCGGCTTGCCTCAGTATCCTGACCGACGAAAAACACTTTCAGGGTAGCTTTGAGAATCTCGAGACCGTTCGCTCCGGAGTATGCCCT
TGTTCAAGGAGGTGCTGCGGGAGTATCAGTCCTTACTGATAAACTTGCCTTTGATGGTTCCATTCGTGATTTACAGCAGGTCAGCGACAGGCCTGTAGCT
TTGTTGCGGGGGGGCTGCTGCTATTTCAGTTCTTACTGACACCCGAGGCTTCGGTGGATCTTTTCTAGATATGCAACAGGTAAGCTCACAATACGTTTCT
TTGTTGTGGAGGGGCCTCTGCTATTTCAGTTCTTACTGATACTCAAGGCTTTGGCGGATCTTTTTTAGATATGCAACAGGTAAATTCACAATACGTTTCT
CAAGTTCACGGCGATCGTGCACTTCGACCAGCGC---GTGCAGGCCAAGTTCAGCGAAAAGCTGGAGGTAGTCGCGGAGTTGCGACGGTTC------
CGAGGAGATCGGCGCTTCTGCACTGTCGGTTTTGACCGACAGGCAGTTCTTCCAGGGCTCCCCGGACTATCTCCGGGCGGTCTCGGCTGCTGTACTTCCC
CGAAAAAGGAGGAGCAGCTTGCCTTTCCGTGCTCACAGACCAGGATTTTTTTAGGGGTAGTGAGGCGGATCTACAGAAGGCCCGCAGCGCCTGTCTGCCT
TATAGCCGCCGGGGCGGCGGCCATCTCCGTCCTGACGGACCAGAGGTTTTTCCGGGGTCGCCCTGAGTACCTGATCCTGGTGCGCCGGGTAACCCTGCCC
CGTTGAGTCGGATATTCAGGCTATATCGGTGCTTACGGAAAGGAACTTTTTCAAGGGGGATGAAGACTACCTTGTAAAGATTCGTCAGTTCTGTCTTCCT
TGAGCTTGGTGGTGCTAGCTGTCTATCAGTGCTGACGGATGTGCATTTCTTCAAGGGTCACGACGATTATTTGAGTCAGGCGCGTGATGCTTGCCTTCCG
CGAAGAATGCGGCGCAGGAGCAGTTTCTGTTTTAACAGACGGTCAATTTTTTAAAGGATCTTTTTATGATTTACAAACAGCACGAGAAGAAAGTATTCCG
CGCCGCCCATGGCGCGGCGTGCCTGTCCGTGCTGACCGACGTGAATTTCTTCCAGGGTCATGCCGAGTACCTGAAGCGCGCGCGTGGCGCCTGCCTGCCG
CGAGTTCCGCCCCATCATGAACCTCCACCAGAACGGCCATCCCGAGCGCATGAGCCAACGACTCCAGCTTGCGCATTTCCACTACCGCTTCTTCCTGCCG
TGAGGCGGGTGGTGCCGCCTGCCTCTCGGTCCTGACGGATACGCCGAGCTTTCAGGGCGCGCCGGAATTTCTGACCGCCGCCCGTAATGCCTGCCTGCCG
TGAGAAAGGCGGCGCCGCCTGTCTTTCGGTATTGACCGATGCGCCGTCCTTCCAGGGCGCGCCGGAATTCCTCACCAAGGCAAGGGCAGCCGTCCTGCCC
TGAAGAAGGCGGTGCGGCCTGTCTTTCCGTGCTGACCGACACGCCTTCCTTTCAGGGCGCGCCGGAATTCCTCACGGCCGCACGCCAGGCTTGCCTGCCC
TGAGGCAGGCGGCGCGGCGTGCCTGTCCGTCCTGACCGATGAAGACTATTTCCAGGGCCACGCCGATTACCTGCGCGAAGCGCGCGATGCGTGCCTCCCC
TAAGTTAGGAGGTGCAACATGCTTATCAGTGTTGACAGATAAAAGTTTTTTTCAAGGAGGCTTTGAGGTACTTGTTCAAGTCAGAAAGACTGTTTTACCA
TGAACGAGGTGGTGCAGCTTGTCTATCAGTCCTCACCGATCGTAAGTTTTTTCATGGCAGTTTTGACAATTTGCGTAATGTGCGATCGCACGTTTTACCC
TGCGGCTAATGGGGCGGCTTGCATTTCGGTGCTGACCGACGAGAAGTTTTTCCAAGGCGGCTTCGAAAATCTACAGCGGGTGCGAGCAGCAGTCGTACCG
CGTTGCCGCGGGTGCAACCTGCCTTTCCGTCCTAACGGACAGTGAGTTTTTCCAGGGCAGCTTTGAGTACCTCGCTCAAATTCGCCAAGAGGTGGTGCCG
GGAAAGGTAT---GCAGTAGGTCTTAGCATATTAACTGAGGAGAAGTACTTTAATGGTTCATATGAAACTTTGAGAAAGATAGCCAGTTCAGTTATTCCC
C---CAGGACCTCGCCGATGCAGTATCCATAGTAACCGACGGCAAATACTTCAAAGGCTCCCTGGATCTGCTGAGCGGGGCCACAGATTACGGTAAACCA
TGATAAAAATAACGTAGACATGATGTCAATTTTAACTGAAGAAACATACTTTAAAAGTAATCTAAAAAATTTTAACATAGCAAACAACTTAACAAAACCA
GCATAAAATAGGATTCGATGTATTAATAGTTAACATTGATAGCTTATCAGCACAAGGAACATTAAATGATTTATCTGATGTCATTAGAACATTGAGACCA
GGCCTCAGTAGGTTTTGACGTAATATTTGTGAACGTTGACCAGAAAAACTACGGTGGCCACATCAACGACTTGAATAAAGTTTTTAGAAAGCTAAGACCA
ACATAAAATAGGATTTGATGTATTAATAGTTAACATTGATAGCTTATCAACACAAGGAACATTAAATGATTTATCTGATGTAATCAGAACATTGAGACCA
GGCCTCAGTGGGGTTTGACGTAATATTTGTAAATGTTGACCAGATTAACTACGGTGGCCATATCAACGACCTGAATAAAGTTTTCAGAAAGCTAAGACCG
ACATGAAATAGGATTTGATGTATTAATAGTTAATATTGATGAATTATCAACTCAAGGAAATATAAATGATTTAAAAGATATAATAAGAACATTAAGACCA
ATTTTGAGAAAGGACTTCATCACGTCGAGGTATCAAATTGCTCAAGCGGCAGCGAGTGGGGCTGATACAGTCTTGCTCATTGTTGCAACTACTCCATTGC
GTGATTTCGTCAGATTTGATTGTGGACGAGATTCAAATTGCACGGGCTGCCGATGCAGGTGCCAAAGCTGTCACTGTCACGCATGGGGTGGTCGGTGAAG
GTGGTTATGAAGGATATCGTCGTGGATGAGATTCAATTGGGATTGGCGAAAGATGCTGGTGCTGATGGGATCGTGCTGATATCATCTGTTCTTGGACCAC
GTCCTTGCATCTGATCTCCTTCTTTATCCATACCAACTTTACAAACTACGTCTCGCTGGTGCTGATGCTGTAACTATGGTGGTGGGTGCTTTGGAGAGCT
ATTTTACGCAAAGATTTCGTGATCAACGAATACATGATTGCGGAGGCGGCCGCAAAAGGTGCCGATACTGTCCTGTTGATCGTCGCTGTTCTACCACAAC
GTCATTAACAACGACCTGATTGTGGACGAACTGCAGGTGGCGCGCTCCGCCGCCTACGGATGTGCCGCACTCGTTCTCAATTTACCTACCCTCGGTGTTA
GTCGTGATGAAAGATATTGTAGTGGATGAGATTCAATTGGGACTCGCGAAAGAGGCCGGTGCTGACGGAATCGTTCTTATGTCATCAGTGTTGGGGCCTC
ATTCTCGCATCCGACTTGATCCTTTATCCGTATCAGCTTTATAAGCTGCGTTTGGCTGGCGCCGATGCAATTAACCTCTTGGTTGGAGCTCTAGAGAAGC
ATCATCTACAAGGACATTATCGTCCACGAACTGCAGCTGGCGGAGGCCGCAGAGGCCGGTGCCAAGGCCGCCTGTTTGGTCGCCGGTGCATGCCTGCCGC
GTTATCTGCAAGGACATTTTCGTGTATCCATGGCAACTGTACGACGCGCGCTTTTATCAAGCTGATGCTTGTGTGCTTATCATGAAGGTGCTTGGGTTGA
GTCCTCCGCAAGGACTTTATCATTGATGAAATCCAAATCGCTGAAGCTGCAGAGCGAGGTGCAGCAGCAGTGCTTTTGATTGTTGCTGCAATTGGGGAAC
ATCCTCCGCAAGGACTTCATTATCGACGTGTACCAGCTGCTGGAGGCTCGCGCCTACGGAGCCGACTTAGCTGCGTTAATAATGAAT------
ATCCTGCGCAAGGACTTCATCATCGACGTGTACCAGCTGCTGGAGGCCCGCGCCTACGGTGCCGACTGCGTGCTGCTCATCGTGGCGCTGCTGTCCCAGC
ATCCTGCGCAAGGACTTCATCATCGACGTGTATCAATTATTGGAGGCCCGTGCCTACGGGGCTGACTGCGTGCTGCTCATCGTGACGTTGCTGTCCAAGC
ATCCTTCGCAAAGACTTTATTCTCAGCAGGTACCAAGTACTTGAATCGAGGATTTGGGGTGCAGATAGTATTCTTCTCATTGCCAGTATGCTCTCGGAGC
ATTCTGAGAAAGGACTTCATCGTCGACACTTACCAAATTGCCGAAGCCAGGCTTGCTGGTGCCGACACGGTGCTGCTCATCGTTGCCATGCTCACGGACC
ATTCTGCGAAAGGAATTCATCTTTGATGAGTATCAAATTCTCGAGGCCCGATTGGCTGGTGCTGATACTGTTCTTCTCATTGTCAAGATGCTTGACACTC
ATTTTGAGAAAGGATTTCATTATTGACCCTTATGAAATTATGGAGGCTCGTTTGAATGGTGCTGATAGTGTATTGCTAATCGTTGCGATGTTAAGCCGTT
ATCTTAAGAAAGGAGTTTGTTTTTGACGAATATCAGATTCTTGAAGCCCGTCTTGCTGGGGCTGATACTATCCTGCTCATCGTGAAGATGCTAAGCGTCC
GTTTTGAGAAAAGAATTTATTTTCAGCAAGTATCAAATACTAGAAGCAAGATTAGCTGGAGCTGACACTGTCCTTCTTATAGTCAAGATGCTATCTCAAT
GTCCTGCGCAAGGAGTTCATCTTTGACGAGTACCAGATCCTCGAAGCCAGACTTGCCGGTGCTGACACTGTTCTCCTCATTGTCAAGATGCTCGAGTATC
TTGCTATTGAAAGAGTTCATTGTTGAGGCATGGCAGATATACTATGGTAGAAGTAAAGGCGCAGATGCAGTTTTGTTGATCGCTTCTGTGTTACCTGACA
CTATTATGCAAAGAGTTTGTTGTAGATCCATGGCAGATCTACTATGCTCGGACTAAAGGCGCAGATGCAGTACTGCTTATTGCTGCTGTATTGGCTGACA
CTTCTTTGCAAGGAGTTTGTGATAGATATCTGGCAAATCTATTACGCTCGGTCAAAGGGTGCGGATGCAATTCTGTTGATTGCTGCCGTGTTACCTGATA
GTCCTTCGCAAAGATTTTATTTTAGACCCTCTACAAATTGAAGAAGCTGCTGTCGCCGGTGCTGATGCGATATTATTAATTGTAGCAATTTTAAAAGACA
GTATTAAGAAAAGATTTTATCCTAGATCCTTTACAACTAGCGGAAGCTATTTTCTTCGGAGCAAACGCAGTTCTTTTAATCGTGAGCGTTGTTGGAGAAT
GTATTGAGAAAAGATTTCATCCTACATCCTTTACAACTAGCCGAAGCTATTTTCTTCGGAGCACATGCTGTTCTTTTAATCGTGAGCGTTGTTGGAGAAT
------GAGCGCCGCCACGATGAGGAGCGCCGCATCCGCGCCCATGAGGCGGGTTTCGTAAATCTGGCTCTCGTCGATAATC
GTGCTCCGCAAGGATTTCATTGTCGATGAATCGCAGATTTTCGAGTCGCGCCTTATGGGTGCTGACGCCATCCTTCTGATTGTTGCGGCCCTCGAGCCCC
GTTTTGCGAAAAGACTTTATTATCGACCCCTATCAGGTCTATGAAGCACGGGTAATAGGCGCCGACTGCATTCTGCTAATTGTAGCAGCGTTGGGTGATA
CTTCTGCGCAAGGATTTTATCATTGATCCCTACCAGATTTATGAGGCCCGGGCCCTGGGAGCGGATGCCATCCTACTGATTGTCGCTGCCCTGGAACCTC
ATTTTGAGAAAGGATTTTATAATCGATTTGTGGCAGATATATGAGTCGCGGTATATTGGGGCCGATGCAATATTGCTTATTGTATCACTGCTTTCCGATC
GTGTTGCGCAAGGATTTCACGATAGACCCGTATCAGGTGTATGAGGCACGTGTGCTTGGTGCTGATTGTATTTTGCTGATTGTCGCTGCATTGGACGATT
CTTTTATGTAAAGATTTCATAATTGATAAGATTCAAATTGATAGAGCATATGAAGCTGGTGCAGATATTATTTTATTAATTGTAGCAGCTTTAACGAAAT
GCGCTGCGCAAGGACTTCATGGTCGATATGTACCAGGTCTACGAAGCCCGCACCTGGGGCGCCGACTGCATCCTGCTGATCGTCTCCGCGCTCGACCACA
CCCGGCCGAAATGAAAGGATCGCGGAAGGCGGCGACAATGATGAGGATGCAATCCGCACCCATGAGCCTTGCCTCGGCGACCTGATATTCGTCCAGCATC
GCACTGCGCAAGGATTTCATGTTCGATACCTATCAGGTGCATGAGGCCCGCGCCTGGGGTGCGGATTGCATCCTGCTGATCATGGCATCGCTTTCCGACG
GCCCTGCGCAAGGATTTCCTGTTCGATCCCTATCAGGTCTATGAGGCGCGCGCCTGGGGCGCGGATGCTATCCTGATCATCATGGCCAGCGTCGACGATG
GCATTGCGCAAGGATTTCCTGTTCGACCCCTATCAGGTCTATGAAGCGCGTAGCTGGGGAGCGGATTGCATTCTCATCATCATGGCCAGCGTCGATGACG
GCCCTGCGCAAGGACTTCATGGTCGATCCGTGGCAATGCGCCGAAGCCCGGAGCATGGGTGCCGACGCGATCCTGATTATCGTTGCGGCACTCTCCAATA
TTGTTATGCAAGGAATTCATTATTCAGCCTTACCAGATTTATCAAGCAAGAGTTGCTGGTGCTGATGCGGTTTTATTGATTGCAGCCATACTTTCTGATC
CTACTGTGCAAAGAATTCATCATTGACCCTTATCAAATCTATCTAGCACGGACAGCAGGCGCAGATGCGGTTTTGTTGATTGCTGCTATTTTATCAGACC
CTGCTCTGCAAAGACTTTGTGATCTACCCCTATCAGATCTACAAAGCCCGCTTACTGGGGGCCGATGCCGTGCTGTTGATCGCGGCGATTCTTTCGGATC
CTGTTGTGCAAGGACTTTATCCTCTATCCCTACCAGATGTTTCTGGCAAGGGTGCGGGGCGCCGACGCAGTGCTACTCATCGCGGCTATCCTCTCGGACT
ATACTAATGAAGGATTTTATCGTTAAGGAATCGCAAATTGATGATGCATATAACCTAGGTGCTGATACTGTATTGCTAATAGTCAAAATACTAACTGAAT
CTGCTCATGAAGGACTTCCTGGTCGACGAGTACAAGATCTACCAGGCAAGGGCATCGGGGGCATCCTCGGTTCTCCTAATCACAGGCGTATTCCCTGACC
TTACTTAGAAAAGATTTTATTATAGATGAATACATGATCTATGAAGCTGCTGTTAATAATGCTAGTGGAATATTATTAATAAGTGGAATCTGTCCTAATA
GTTGTAGTAGATGATGTAATAATACACCCAATCCAAATAGCGTTAGCAGTTGAAAATAAAGCTGATGGAGTCATATTAAATTTGTCTTATTTAAAAAATT
ATAGTCATGAAAGATATAATTTTACATCCAATACAAATAGCACAAGCTGTGGAAATGAGAGCTGACGGAGTCATTTTGAATGCAGCAATACTGGGCAATT
GTTGTAGTAGATGATGTAATAATACACCCAATCCAAATAGCATTAGCAGTTGAAAATAAAGCTGATGGAGTCATATTAAATTTGTCTTATTTAAAAAGTT
ATAGTCATGAAAGATATCATTCTACATCCAATCCAAATAGCACAAGCTGTGGAAATGAGAGCGGACGGAGTTATTTTGAATGCAGCAATACTCGGGAATT
ATTGTTGTAGATGACATAATTATCCATCCTATACAAATAGCTTTAGCCGTAGAAAACCAAGCAGATGGTGTTATATTAAATTTATCTTATTTAAAAAATA
TCAAGGATTTGATCTTGTACTCTCGTTCACTCAACATGGAGCCATTGGTGGAGGTACACGCCTTGGTAGAATTGGATGTGGCACTTGAGGCTGGGGCGAA
TGTCGCAATTCATCAAGAATGCACGTGTACTTGGATTGGAAACGATTGTGAATGTCGGTACAGCCGAAGAAGCTCAGAGTGCGGTTGACGTGGGAGCTTC
TTGAAAACTTTTTGAACTTGGCGACCATTATTGGATTAGAGACTATTGTGGAATGCCACACGTACAATGAAGTTCAGGCGGCGTTGGATTCTTTGGCTCA
TGCTTTACCTCACAAAGATTGCATCGACGATCAATTTACAAGTGATTGCCAGTGTGACGTCCGAGGTACAAATAGACATGCTCTCCAAACTAGGTATCAG
TCACTCGATTAATCGGGTTTTCTCGTTCCCTCGGGATGGAGCCACTAGTCGAAGTCCACGCGGATACCGAACTTGATGTGGCCATCAAAGCCGGTGCCAA
CCAAAGTCCTTCTCAAAGCAACCAAGGCTGTGGACCTGGAAGCTATTGTGGCCGTGTCTTCCAAGGAAGAAGCGCAAACCGCAATCGATAGCGGGGCTCG
TCGAGAATTTCTTGAACCTGGCAACCATGATTGGTTTGGAGACGATCGTTGAGTGTCATACACACGATGAAGTACAGAGAGCCATCGACATCCTGGCACC
TGTCATACCTTACCAAGATAGCGTCTAGTCTTCAGCTTCAGTCATTTGCCACCGTAACTTCCGAAGTGCAATTACTGGAAGTGGCAAGTCTGCAGATTGA
TCGAGAAACTCCTGAACGCGGCATACATTCTCGGGATGGAGGTGCTCGTAGAGGTCCATACGGAAGAGGAGCTGCAGTACGCTTTCGAGGCAGGAGCCGG
CGCTGTATTTTGCCAAGGCAAGCGTTGCGCTGGGGATGGCGCCCATCATCGAAGTGCACGATGCCGCCGAGTTGGATGCGCTTTTGACAGCAAAGATTAG
TGCGCAAACTGCTTGAGGCAACACATCGGCACGGTCTTGAAGCACTCGTAGAGATTCATGACGAAGAAGAGCTGGAGATCGCAAAGGCTGCCGGTGCAGA
--GTTGACGTTGAGTTTGCAACTCACAACCTCGGCATGTGCGCCTTGGTGGAGGTGAACAGCGTTGAGGAGCTGGACATCGCGCTGGCTGCCAGGTCGCG
TCAACGAGCTCATTGAGGTGACCCACAAGCTCGGCATGTGCGCCCTGGTGGAGGTGAACAGCGTCAAGGAGCTGGATATCGCGCTGGCTGCCCGCGCCAG
TTATCGAGCTCATTGACGCAACTCACAATCTCGGTATGTGCGCTCTGGTCGAGGTGAACAGCGTCCAGGAGCTGGACATCGCTCTGGCTGCCAAGGCCCG
TCCGCGATCTTTACAGTTACGCACTAGAGTTGGGCATGGAACCTCTTGTTGAAGTCAATAATGCGCAGGAAATGGAGCTCGCCCTGTCCTTACCTGCCAA
TCAAGGAGCTCTACGACTACAGTCTTAGCCTGGGCATGGAGCCGCTCGTGGAAGTCAACAATCCCGAAGAAATGACGCGCGCCATTCGACTCGGTGCAAA
TGCATCGTCTATACAACTACTCTCTTCAGCTGGGAATGGAGCCCCTCGTCGAGGTTCAGAATGCTGAAGAGATGACAACAGCAGTCAAGCTGGGTGCCAA
TAGAGTCCCTCTATAAATTTTCTAAATCTTTGGGCATGGAACCGTTAGTAGAGGTTAACTGTGCTGAGGAAATGAAGACTGCTATAGAACTTGGTGCTAA
TTACAAGATTGTATCACTATTCTCGCAGCTTGGGAATGGAGCCTCTCGTAGAGGTTAATACTCCGGAGGAAATGAAGATCGCGGTGCAGCTTGGAGCCGA
TGAAGGAACTGTACAGCTACAGTAAAGATTTGAACATGGAACCTCTCGTTGAGGTGAACTCCAAAGAGGAATTACAAAGGGCTCTAGAAATTGGTGCTAA
TCGAGCGCCTATACAAGTACTCCTTGTCTCTCGGCATGGAGCCCCTAGTCGAGGTCCAGAACACCGAGGAGATGGCCACAGCCATCAAGCTCGGCGCCAA
TCAAATACATGATTAAGATTTGCAAAATACTTGGAATGGCTACACTTGTGGAGGTCCATGACGAAAGGGAGATGGATCGTGTTCTTGCAATTGAAGTCGA
TAACCTTCTTGCTTAAGATTTGCAAGAAGCTTAGCTTGGCTGCCCTTGTTGAGGTACATGATGAGAGAGAGATGGGTCGTGTACTTGGAATAGAAATCGA
TCAAGTACATGCTTCGCATCTGCAAGAACCTCGGAATGACAGCTCTCATAGAGGTTCATGATGAGAGGGAGCTGGATCGTGTGCTCAAAATAGATGTTCA
CAAAAATTTTATTACAAAAAGCGCATGAATGCGGTCTTGAAGCGTTAGTGGAAGTGCACAATCGTCAAGAATTGGATCAAGCTATTGAAATCGGTGCTGA
TAAAATTTCTCATACAAGAAGCGCATAGATTAGGTTTAGAAGTTTTAACCGAAATTCATGACTTTTCAGAACTTGAATTAGCTTTAGAAGCAGAAGCTTC
TAAAATTTCTCATACAAGAAGCCCAGAGATTAGGTTTAGAAGTTTTAACCGAAATTCATGATTTGTCAGAACTTGAGTTAGCTTTAGAAGCAGAAGCTCC
TTGCGGAGCACAGGAATGGAGAACTGCTGCGTGATCGCCTTCAAGTAGTCTGGCGACCCCTGAAAGTAGTGGGAGTCGGTAAGGACGGAGAAGGCCGCGC
TTCGCGACTACCTCCAGACGGCCTCTGAAACGGGGCTCGATGTGCTCGTTGAGGTGCATGACCGGCGGGAACTCGACACGGCGGCCGAAGAGGGTGCCAC
TGTTAGAGCTAGCTCAACTCGCCGAACACTTAAATATGGATGTACTCATAGAAGTCCATGATTCCGAAGAGCTAGAGCGAGCGCTGGTTCTCGATACACC
TGGTGGAATATCTGGACCTGGCCCGGAGGTTAGGCCTGGCAGCCCTGGTCGAGGTTCATACGGCCCCGGAACTGGAGGTAGCCCTCCGGGCAGGAGCCGG
TGAAAAAATTCCAGATCGTTGCAAATATTTTAGGGATGCAGTGCCTGGTTGAGGTGCATGATGAAAGGGAACTTGAGCGGGCTTTGGAATCCGGCGCAAG
TGGTTGATTTGTCCGGCTTGGCATTGCAATTGGGAATGGATGTGCTGGTTGAGGTTCACGATATTGACGAACTTGAGCGTGCAATACAGATTTCTGCGCC
TAAAAGAGCTGTATAGCTACGTACGAGAAAAAGGACTAGAGGCAATTGTTGAAGTTCATGATGAAGAGGAATTAGAAATTGCAATCGAATTAAATCCGCA
TGGCCGAGCTGGAAGCGTGCGCGCACGAACTCGGCATGGATGTGCTCGTGGAAGTCCACGGCGGCGAGGAACTCGATAGTGCGCTGCGACTGAAGACGCC
TTGCGCAGGACCGGCAGGTCGCAGGCGCTTCTGGCCTGCTTCAGATATTCCGCGCTGCCACCGAAATACTGCTGGTCCGTCAGAACCGACAGACACGCGC
CGAAGCGGCTCGAGGACGAGGCTTTTGCGCTTGGCATGGACGTGCTGATCGAAGTTCATGATGCGGAGGAAACCGAACGGGCGCTGAAACTCACTTCCCC
CGAGCGCATTGGAAGCGACCGCATTCGAACTCGGCATGGATGCGCTGATCGAGGTACATGACGAAGCCGAGACGCAACGAGCGCTAAAGCTGTCGTCGCG
CAAAAGAGCTGGAAGACACGGCTTTCGCACTGGGCATGGATGCGCTGATCGAAGTGCATGACGAAGCTGAAATGGAACGCGCCCTGAAGCTTTCCTCGCG
TGCACGAGATCGAAGCTGCTGCGCTGGAGCATCGGATGGATGCGCTGGTCGAAGTGCACGACGAAGAAGAAATGAGTCGCGCCCATGCGCTGCAATCGCG
TTCTTTATCTGAGAAAAGTTGCAATTAGCCTTGGATTAACAATATTGGTTGAAGTGCATGACTCTAATGAGTTGAAAAGAGTACTAGATTTAGAATTTCC
TGCAAAACTTTTTGCAAGTAATTCACGATTTGGGCATGAATGCACTGGTAGAAGTTCATACTTTGGCTGAATTGGATAGAGTTCTTAAGCTTGACTTACA
TGCGCTACTTCCTGAAGATCGCCCACAGTCTGGGACTCAATGCGCTGGTGGAAGTGCATAGCCTGCCGGAACTTGAGCGTGTCCTTGCCTTGGATCTGCG
TGCGCTATTTTCTGCGGATTGCCCATGGGTTGGGTCTAACCGCCCTTGTGGAGGTGCACACCGCCGAGGAGATGGAACGGGTGCTGGCACTAGAGGTGCA
TAGAGAGTTTATTGGAATATGCCAGAAGTTATGGTATGGAACCATTGATAGAAATTAATGACGAAAATGATTTAGATATAGCCCTAAGGATAGGGGCTAG
TTGAGGCAGGGATTCAAAAATGCAGGGAGCTCTCAATGGAGCCACTGGTGGAGTGTCACACATCGCTTGACATCTTCAGGGCCCTTGAAGCAGGTGCAGA
TAGAAGAATATTTGAATATTTCTAGAAATCTTGGATTAGATGCAGTAGTTGAATGTCATACATTAGAGGATATTGAATCTGTAGTAGATTATAACCCTGA
TAGAAGATATGCTAAATTATTGTGTCAATCTAGGTACCCAAGCTATAGTAGAAGTACATGATTATAACGATATATATTATGCTACTCAATGTGGTAGTTA
TGAAAGACCTATTAACCTCGTGCATTACCATGGGAATTGAGGCAATAGTTGAAGTTCATACCACTGCTGATGCACTTAGATCTGCAGAAATAGGCTTCAC
TAGAAGATATGCTAAATTATTGTGTCAATCTAGGTACCCAAGCTATAGTAGAAGTACATGATTATAACGATATATATTATGCTACTCAATGTGGTAGTTA
TGCAAGATTTACTAACCGCATGTGTGACCATGGGAATTGAGGCAATAGTTGAAGTTCATACCACAGCCGATGCGCTCAGATCAGCAGATATAGGGTTTAA
TGGAAGAAATGTTACAATATTGTGTTAATGTTGGTACTCAAGCTATTGTTGAAGTACATGATTATAATGATATATATTTTGCTACACAATGTGGAGCTTA
AGTGCTTGGAGTGAACAATCGCAATCTACACACGTTTCAATTAGATTTATTACTCTATGTGCTCTGTCGGGTATGTCATCTGCACATGATGTGGACAGGT
CATCATTAGTGTCACCGGAGTGGATGGTGCCGATAACAAGTTTGCTGTATCCTCGCCAAGGACAACAAGGCTCTGGAAGAAGTGGAAGAGGCGTGGATCT
GAATATAATGGTCTCAAATCGTGATCGTATCACAGGTCAGTTGTTACCATTATCACTATTGCAGGAGGTGGTATATCAACACCTGAACAGATGAAGAAGC
CGCTCTTGTCGTATCCAATAGAGATTTGGAGACTTTTGACTTTGATAA------
GGTTATCGGAGTGAACAATCGCAATTTGCACACCTTCCAAATGGATTTTATACGGTTTGCGCGCTTTCGGGAATGTCTACGGCTATGGACGTGGACCGTT
AATGCTCAGTATCTTGCATCTGGGAACGGTAGAGGATATGGTGGCCGCATCCTTGCCAATAATGACCAGCAACTGCAAGAGATTGAGGATGCTTGGTCGG
CAATATTTTGGTCAACAACTACGATCGAGTTGCACAGGAACTTCACCCATTATTTGTTTGGCTGCGGGAGAGATCGAAACCACCGATCAAATGAAGCGCC
CGGAATAATCGTATCCAATCGTGAGCTTGAAGACTTTTCCTTCGATATTTGAAAAGCAATGCTCTGGCGAAAGTCCGCGCAAAACATGGTGAAGACCTTC
TATCTACGTGTTTACGGATGTGGATCGCGCTCGTCGACGAGTTGTTAAGCAGTGTGCCTCGGCAGTGGACGCATTGAGACGCTTGAACAAGTTGAACAGT
CATTCTGCTTCTCGCGAAGAGAGACCTCGACAGTCTTGTGCCTATGCTGAGCGGTTCGCACCGGAAGCTAGGAGACTAGGGCTTCGA------
GATCATCGGCACTAACGCACGCGACCTGCGCACTTTCAACGTCGACCTGTGATTCCAGTGGCGGAAAGCGGTATCCACGATGTGATGGATGCTTGGCGTT
CCTGATTGGCGTCAATAACCGAGACCTCCGTTCGTTCAAGGTGGACATGTCACGCTCTTTGCGCTCAGCGGCATTCGGTCTCACGCAGACGTGGTCAAGT
GCTGATCGGAGTCAACAACCGCGACCTCCGCACGTTTAAGGTGGACCTGTCACGCTCTTCGCGCTGAGCGGCATTCGCTCGCACGCGGACGTCGTCAAGT
ACTGATCGGCGTCAACAACCGCGATCTCCGCACGTTCAAGGTGGACATGTCGCCCTCTTTGCTCTCAGTGGCATTCGCTCGCACACAGACGTGGTCAAAT
AGTTATCGGCGTCAATAATCGCAATCTCCATGACTTTAAGGTTGACATGTCGTTCTTTGTGCGCTTAGTGGTATTGCGTCGAGGAATGATGTTGAACGGT
GGTTGTCGGCGTCAACAATCGCAACCTCCATGACTTTAATGTCGACATGTCATCCTTTGTGCTCTTAGCGGTATCAGCGGTCGCAGCGATGTAGCCAAGT
GGTCATCGGCGTCAACAACCGTAACCTGGAGAGCTTTGAGGTGGATCTACAATCATCTGCGCACTTAGCGGCATCAACACTCACGATGACGTTCTAATGA
AGTTATCGGCGTTAATAACAGGAATTTGCATAGTTTTGAAGTAGACCTGTTATTCTTGCAGCTCTCAGTGGAATTAGTAGTCCTGCTGATGTTGCCCATT
GGTGATTGGCGTGAACAACAGAGACTTGACGAGCTTCGAGGTTGACCTACTATCGTTTGCGCTCTTAGTGGAATTTCCGGACCCAAGGATGTCGAGGCTT
AGTTGTAGGTGTCAATAATAGGGACCTGCATTCATTCAACGTAGACCTGTTCTTCTAATTGCTCTATCGGGAATTACCACCAGGGACGATGCTGAAAAAT
GGTTATCGGCGTCAACAACCGCAATCTCGAGAGCTTCGAAGTCGACCTACCTTCCTCTGCGCTCTCAGCGGCATCAACACTCACCAAGATGTTCTTGACT
GCTCATTGGCATCAATAACCGTAACCTTGAAACATTTGAGGTAGATCTATCCTTGTGGTTGGAGAATCTGGGTTATTCACTCCCGAAGATATCGCCTTTG
GCTTGTTGGCATCAATAACCGAAGTCTTGAAACATTTGAAGTGGACATATGATTGTGGTTGGCGAATCTGGTCTGTTTACACCAGATGACATTGCCTATG
GCTTATCGGCATCAATAACCGAAGTTTAGAGACGTTTAAGGTTGATACATTCTGGTTGTTGGTGAATCTGGTCTTTTCACGCCTGATGATGTTGCATACG
AATTATTGGGGTCAACAATCGAAACTTAACTACCTTTTCGGTTGACCCATCATAAGCGTTGCTGAATCGGGAATTCACACCGTAAGCGATGCAAAACGTT
CATTATTGGCATTAACCACCGTAATCTTAAAACTTTTGAAATCGATCTATCATTACCGTAGCTGAATCAGGAATCCATCACCCTATACAAGCCAAACGGA
CATTATTGGGGTTAACCACCGCAATCTTCAAACTTTTGAAATCGATCTATCATTACTGTAGCTGAATCAGGAATCCATCACCCTACACAAGCCAAACGGA
CAAGTTCGGCGTAGCGCGCAGCGATGTCAAGCGGACGAAAATCCTCGAGAGATTGATGCCGCCGTCACGGCTCGTAATGGCCGAGCGGAAATCGCGCGTC
AATGATTGGCGTCAACAACCGCAACCTGAAGGACTTCAGCGTGGATCCACTATCGCGGTGTCGGAGAGCGGCCTGAAAACGGCTGGCGACATCGCTCATT
ACTTATCGGCATTAATAATCGCAATTTAAAAAGTTTCGAAACGAGATTCGCTTAGTGGTCACCGAAAGCGGAATTCATAGTCCCGCAGACGTAAGGAAAA
TATTATCGGGATTAACAACCGGGACCTGCACACCTTCAAGGTTGACCTCCGGTGGTAGTGAGCGAGAGTGGCATCCGGAGCCGGGCCGATGCCGCCCTGG
AATAATCGGAATAAACAACAGGGATCTTAGAACTTTTGAAGTGGATTTCGGGTGGTGGTAAGCGAAAGTGGAATTAAGGACACCGAAGATTTAAAGTATC
ATTGATTGGTATCAACAACCGTAATCTGAGTACCTTCAACGTGTCATTCGTTTGCTCGTAAGCGAGAGCGGTATCCTTACCTCTGCCGATGTACAACGGC
CGTTATCGGTATTAACAACCGTAATTTAAAAACATTCGAAGTCGATTTTTACTTTGGATTAGCGAAAGCGGGATTCATTCAAAAGAGGATATTATTCGTG
GCTGCTGGGCGTGAACAACCGCAACCTGCGCACCTTCGAGGTCTCGCTCGGCTCGTGGTAACCGAATCGGGCATCCTCGGGCCCGACGACGTCAAGCGGA
CATGCTGGCCATAGCTCACGGCGATGGCGGCAGGGTCGAACTGCTCACGACGGCCGGTAATCCCAGGGCGAGCTTCTTGCGAAGAATGAAATCCCTTGGC
GCTTCTCGGTATCAACAACCGCAATCTGCGCACCTTCGAAGTCGGCCTAAGCTGCTGGTCGGCGAAAGCGGCATCTTCACCTTCGAGGATTGCCAGCGCC
GCTGATCGGCATAAACAACCGCAATCTGAGAACGTTCGAGACCAGCCTCGCTTGCTGGTCAGTGAGAGCGGCATCTTCACACATGACGATTGCCTCAGGC
CCTGCTCGGCGTCAACAATCGCAATCTGCGCAGCTTCGAGGTCAATCTCGTCTGCTGGTTGGCGAAAGCGGCATCTTCACGCATGAGGACTGCCTGCGGC
GCTGGTTGGCGTCAATAATCGCGACTTGAAAACCTTCACCACCGATCTGCGCTTCTGGTCGGAGAAAGCGGTATCGACAACCACGCCGATTGCGTGCGGC
TCTGGTTGGAATTAATAATCGCGACCTTAAGACTTTCAATACTGATTTGTTCTATTGGTAAGTGAGTCTGGTTTATTTAACTCTGCAGATCTAGAAGAAG
TTTAGTAGGAATCAACAATCGCAATCTAGAAGATTTTACGGTTGATTTATCACTGTTGTCAGTGAATCTGGACTGTATACACCTGCTGATTTATCTCTTG
GCTGGTTGGCATTAACAACCGCAACCTCAAGACTTTTGTGACCGATCTTTGCTCTTGGTCAGTGAATCGGGTTTGTTCACCGGCGCCGATCTCGATCGCG
GCTTGTGGGGATCAACAATCGCAACCTGATGGACTTTAGTGTTGATCTATCCTTGTGGTCAGCGAATCGGGCATTCATCAGCGCGCCGATGTCGAACGGA
ATTTATAGGAATTAATTCAAGAGATCTAGAAACCCTTGAGATAAATAAGTGGTAAAGGTGGCAGAAAGTGGAATTTCTGAGAGGAATGAAATAGAAGAAT
GATAATAGGTGTGAACAACAGGGACCTTGAAACCTTCGAGGTGGACCTCTCATACTCGTATCAGAGAGTGGTGTCAGGGGCCCCGAGGATGCTGAGATAC
AATTATTGGTATAAACAATAGAAATCTAGATGATTTTACTATTAATCT---TACTTAATATCAGAAAGTGGAGTAAAAACCATAGAAGATGCTAAAAAAT
TATTCTTATGATAAACGAATTTGATTTTATTAATAATCGCTATGAATACTAATAACTATAGCTAAAATAAATGTTAGTGATATAAATTATGTAGAAAAAT
TAACTTCATGATAAACCAGTGGGATAGAATAAAAAATATTCTATACCCGCCACAACAATTGCAGCTGGTGGCATAATGACAATGGAGCAGGTTCATATGT
TATTCTTATGATAAACGAATTTGATTTTATTAATAATCGTTATGAATACTAATAACTATAGCTAAAATAAATGTTAGTGATATAAATTATGTAGAAAAAT
TCACTTCATGATAAATCAATGGGATAGAATAAAAAATAAACTATACCCGCTATAACAATTGCAGCGGGCGGTATAATGACGATGGAGCAAGTACATATGT
TATACTTATGATCAATGAATATGATTTTTATAATAATATATATGAATAATTATTACTATTGCTAAAGTTAATACTAATGAAGTGAATTATATAGAAAAAT
ATAGATCTGTTGGTGTTGGAATGTGTCTTATTGGTGAATCACTCATGAGGGCTTCGGACCCATCCTTGGCAATCAAGGGACTGTGC
GTCGTGATAGGGGATTCAACTGCGTGTGGGTGAGTGATGCTCTGTACAAAAGTGGAGATCCAGTAGAACACCCCGGTGCAATCATC
ATTTAGCAGTGGGATATGATGGAGTGGTGGTTGGTAAAACAGTCATGGGATCGGCACGAGCACCTGAGTTTATTAGGACTGTGAGA
------
ATCGTCAGGTCGGTCTTGGTATGTGTTTAATCGGTGAGAGTTTGATGCGTGCGACTGATCCGGGACAAGCGATTGCTGCTCTATGT
TTCGGGATCAAGGCTTCAACTCGGTATGGGTTGGGGAAGCCCTGTACAAGGGTGGGGATCCCAACGAACAACCCGGCGGTATCATC
ATTTGGCGGTTGGGTACGACGGAGTTGTGGTCGGTAAAGCAGTCATGGGAAGTCCGGCAGCTCCCGAGTTCATTCGAGCGGTTCGG
TTATCTTGGCTGAAGGAAGAGTCGGTATAATCGATCGTCCTCAGGCAGACAGCACAAGAAGTGCTAAGTATATTACCGAATTAAGG
TACGTGACGTTGGCGCCGACGGCATCGTTATCGGTGGCAAGCTCATG------GCTTCGATCGCAAGA
------GTCATCGTGGAGCTGCCACGTGAGCACGCCCCCAATGCCTCA---TTGCTGAGAGGTCTGCGT
TGCGGGACGCTGGCTACTCGTCGATTCTAGTTGGTGAGGCACTCATTCGCGCCGCAGAATCCTCGAAACTACCGAGTAATGCTTAC
ACGAGAAGTGCGGCGCTCGCGGCATCTTGGTTGGCGAGTACTTGATGAAGAGTGGCGACGTAGCGACTACGGTGAAGGACCTCCTG
ACGAGAAGTGCGGCGCCCGCGGCATCTTGGTCGGCGAGTTCTTGATGAAGAGCGGAGACGTTGCCACGACGGTGAAGGACCTCCTG
ACGAGAAATGCGGCGCACGTGGCATTTTGGTCGGCGAGTACTTGATGAAGAGCGGCGATATCGCCACGACAGTGAAGGATCTACTG
ACAAAGGTCAAGGCGTCGGCGCGATACTCGTGGGCGAGAGCTTGATGAAAGCAAAAGATGTAGGTGAATACATCAAGGAGCTAATG
ATCTCAAGGAGGGCGTTGGCGCCGTCCTGGTAGGCGAAGCGCTGATGCGTGCCAAGGACAAGCGCGCTTTCATTCACGATCTGCTA
ACAAGAGGGATGGTGTCAATGCTGTTCTTGTCGGTGAGGCCATTATGAAGGCGCCTGACGCCAGTGTTTTTATCAGTCAGCTGTGC
ATAGTAGTCAAGGTGTATCAGCAGTTCTTGTTGGTGAATCTCTTATGAGAGCTTCGGATCCTGCCGCTTTTGCACGAGAGTTACTT
ACAAGAAGGAAGGTGTCAAAGCAATTCTCGTGGGAGAGGCACTTATGCGGGCATCTAACACATCTGCTTTTGTTGCAGAGCTGCTC
ACAAAAAAGAAGGTGTCCATGGATTTTTAGTGGGTGAAGCCCTAATGAAATCAACCGATGTGAAGAAGTTCATTCATGAATTATGC
GCAAGCGCGACGGTGTCAACGGCATTCTTGTCGGCGAGGCCATCATGCGTGCCCCTGATGCCACCCAGTTCATCCGTGAGCTCTGC
TTCAAGAAGCCGGTGTCAAAGCAGTTTTAGTCGGCGAATCTCTTATTAAACAAAGCGATCCCGGGAAGGCAATCAGCACCCTATTT
TACAAGCAGCCGGAGTCAAAGCAGTTTTGGTTGGAGAATCCATTGTGAAGCAGAACGACCCTGAGAAAGGAATAGCTGGACTTTTT
TGCAGAATGCTGGTGTTTCTGCAATTTTGGTAGGAGAGTCCCTGGTGAAACAGGAAAATCCCGGGCAAGCCATTGCTGGACTATAT
ATATTTCTGCTGGTTACAACGCCGTTCTCGTGGGAGAGGCATTAGTTAAATCAGAAAATCCTCAACAATTTATTAGC---GCTATT
TGCGTGAGCTTGGATTTGACGCCGTTTTGGTCGGAGAAGCTCTTGTGCGATCTAAAGACCCTGCTTTGTTAATTAAA---CAAATG
TGCGCGAGCTTGGATTTGACGCTATTTTGGTCGGAGAAGCCCTTGTGAGCTCTAAAGATCCTTCTTTGTTAATTAAA---CAAATG
GCCGG------AAGATCGCCGCACGCCTCACGATAGCGGCGCTCCGGTTTCAGCAGTTCGGCAACTCTTGCCTTGGTCTCGAG
TGCGCGATGCCGGTTTCCATGCTGTGCTCATCGGAGAAGGCCTCCAGACGAGCAAAGAACTGACGGGCCTTACCTGGCCCGTCAGC
TGAAGAAAAATGGAGTTAACGCCTTTCTTATCGGTGAAGCCTTCATGAAAGCGGAAGACCCCGCAGCTAAACTTGCAGCATTATTC
TGGCCGGCGCCGGCGTGGATGCCATCTTGGTGGGCGAGGCCCTGATGACAGCCGGGGACGTTATGGCTAAGATGGCCGAACTACGG
TGAAAGAGCTTGGCGTGGATGCTGTTTTAATTGGCGAGACCTTTATGCGGGCCCGGTCCATAAGTGAAAAAATAAGAGAGTTTAAA
TGCGTGCTGCGGGTGTGAATGCGTTTCTTGTTGGTGAGGCGTTTATGCGTGCGACCGAGCCTGGTGAGTCACTCAGAGAGATGTTC
TTAAAAAAGCAGGAGCAAAAGGTGTATTAGTTGGGGAAGCACTTATGACAGCATCTTCTATTCGTACCTTTTTTGAAGATTGTAAG
TGCGCGACGCCGACGTCCACGCCTTCCTGGTCGGCGAAGCGTTCATGCGCGCGCCGGAGCCGGGCGTGGAACTGGCCCGCCTGTTC
GGCGGTAACGCTTCCGCCTGGCGGCGAAGCACGGCGAGAGAACCCTGCGCTTTCAAAATCTCCTGTCGTGTCGCCACAATCTCCTG
TGGAAAAGAGCGGCATCAGCACATTCCTCGTCGGCGAAAGCCTGATGCGCAAGGACGATGTGACGGCGGCGACCAAGGCCCTGCTG
TGCAGAAGAAGGACATCGGCACCTTCCTTGTCGGCGAGAGCCTGATGCGGCAGGAGGATGTCACCGCGGCGACCCGCATTCTGCTG
TTGAAAAGTCCGGCATCGGCACTTTTCTGATAGGCGAAAGCCTTATGCGACAACATGACGTTGCGGCAGCCACCCGCGCGCTTTTG
TGTCGCAAGCGGGGGTCCAGACCTTCCTCGTTGGTGAGAGCCTGATGCGGCAGGACGACGTCGAAGCCGCCACCCGCGAACTTCTC
TTAGTTCCTATGGAGCAAAGGCTGTTCTTGTTGGTGAGTCTTTGATGAGACAACCTGATATTGGATTGGCGTTAAAAAACTTGCAA
TCGCTGAAGCTGGTGTGCGTGCGGTTTTAGTCGGAGAGTCTTTAGTTAAACAAAGCGATGTAGAACAAGCTGTGCGTAGTCTTTTA
TTACCCAAGCTGGCGCACAGGCCGTCTTGATCGGTGAATCGCTGGTCAAGCAACCGGATCCAGGCTTGGCACTGCAGCAGTTAGTT
TGGCACAAGCCGGCGCCCAAGCCATTTTAGTCGGAGAATCCCTAATGCGCCCCCCAGAACTCCGAGCGCGGATTCAGCAGTTATTT
TAAGGAAATTAGGTGTTAACGCTTTCCTAATCGGATCATCACTGATGCGAAACCCA------GAAAAGATTAAAGAATTTATA
TGGCAGGTTACGGTGCAGATGCTCTGCTCATCGGCACAGCACCCATGTCAGCCCAGAACCCACGGGAACTCCTTGAGGAGATAGTA
TAAAGGAATATGGAGCAGATGGTATTTTAATAGGAACAAGTATCCTACAAAACAACACAGAAAAAGAAATTGAAAAATTTATTGAA
TAGGTAGTCTAGGTTATGATAGTATATGCTTAGAAAAAAAACTTATT---GACGAGGATTTGGAGGTTTTTGTGCAATCATGTAAA
TAGCACTGGCAGGTATAGATGCCGTTTGTTTCGGAAGAAGACTTGTA---TACCCAGACGTACCAGACTTCATCAACCAAGTCAAA
TAGGTAGTATAGGTTATGATAGTATATGTTTAGAAAAAAAACTTATT---GATGAAGATTTGGAGGTTTTTGTGCAATCATGTAAA
TAGCGCTCGCGGGCATAGACGGCGTTTGTTTCGGAAGAAGACTTGTA---TGCCCGGACGTACCAGATTTCATTAACCAAGTGAAG
TAGGAAGCTTGGGATATGATAGTATATGTTTAGAAAAAAAATTAATT---GATGATGACCTTCAACAATTTGTTACCTCATGTAAA
Tryptophan synthase Alpha subunit amino acid sequence alignment
64 227
Desred NATFNALVTYVTAGDPDLKTTGRLICSMDRAGADIIEIGIPFSDPSADGPVIQRASARALKEGTNPPAILELVKEVQVLAPLILMSYYNPILQYGCRDAA
Mooth TAAFAALIVYLCAGDPSLEVTGQAVRELAGAGVDLIELGVPFSDPVADGPVIQAASKRALAAGVTLPEILELVKSLGLAVPLILMSYYNPLLQYGTADLA
Bacce QAAFEAFIPYVMGGDGGLEILKERIRFLDEAGASIVEIGIPFSDPVADGPTIQRAGKRALDSGVTVKGIFQALIEAEVQIPFVLMTYLNPVLAFGIENCM
Grate NAITNALIPFITAGYPNIDICIKALKVLDREGADLIELGIPYSDALADGPIIQEASQAALKQGIYIEQVLSILTKVDLHAPIIIFTYYNPVLVRGICEIS
Antit NIISEALIPFITAGYPDINTTIQALYELDSQGADIIELGIPYSDALADGSVIQHSSLIALQGGTYIDQVLHILEVVKLNTPIIILPYYNPILKRGIKQIS
Porye NTISSALIPFITAGDPDLVSTGKALQILDSYGADIIELGLPYSDPLADGPIIQEASNRALKQGINLNKILSMVKTVTIKAPIVLFTYYNPVLHLGIYAIS
Porpu TTISSALIPFITAGDPDLVSTSKALKILDQHGADIIELGLPYSDPLADGPIIQAASSRALKQSINLNNILDMVNITNIVAPIVLFTYYNPVLNLGISAIS
Cyacal NSIKNLLLPFVSLGTPNTQINKQAIIAMDKNGANIIELGIPYSDPVADGPVIQDAYNKAIKNGVNIRKAFKILMNLKIKSPIIVFIYYNQLLNYGLEKLI
Cyame -----MLIAYLTAGAPDINTTKEAVMKLAKKGADVIEIGVPYSDALADGAILQKASKQALMNGFHLDHLWNLLSEVEIEVPLVILAYYNQIWHYGVKKLV
Crowa SDCFQALIPFITAGDPDLDTTAKALRVLDASGADIIELGVPYSDPLADGPVIQAAATRALGRGVKLEDVLKIVKEVEIKAPIILFTYYNPIFYRGLQQIK
Trier SDCFEALIPFITAGDPDLETTAKALEVLDRSGANMIELGVPYSDPLADGPVIQAAATRSLNRGTTLESVLEVVQTVKLRSPIILFTYYNPILYRGLKKIY
Nostpu SDCFEALIPFITAGDPDLETTAKALQVLDQSGADIIELGIPYSDPLADGPVIQAAATRALQRGTKLEHVLEMLQGIKLRSPIVLFTYYNPILHRGLQEIA
Anava SDRFEALIPFITAGDPDLETTAAALKILDSNGADIIELGIPYSDPLADGPVIQAAATRALQNGTKLESVLEMLKVTSLQAPIVLFTYYNSILHRGLEQVA
Theel SERFEALIPFLTAGDPDLETTVAALKILDDHGADLIELGMPYSDPLADGPVIQAAATRALQRGTRLEAVLEMTTDLQLTAPLILFSYYNPIYHRGLKAVA
Synel SDCFAALIPFLTAGDPDLETTRQALLALDREGADLIELGVPYSDPLADGPVIQAAATRALQAGTRLDDVLALLKDVQIKAPIVLFTYCNPILNRGLDQIA
arab1 ADTFTAFIPYITAGDPDLSTTAEALKVLDACGSDIIELGVPYSDPLADGPVIQAAATRSLERGTNLDSILEMLDKVQISCPISLFTYYNPILKRGMSSIR
Braol ------DACGSDIIELGVPYSDPLADGPVIQAAATRSLEKGTNLDSILDMLDKVELSCPVSLFTYYNPILKRGMSSIR
Arab2 SETFAALIPYITAGDPDLSTTAKALKVLDSCGSDIIELGVPYSDPLADGPAIQAAARRSLLKGTNFNSIISMLKEVQLSCPIALFTYYNPILRRGMTVIK
Oryz AETFSAFIPFITASDPDLATTSKALKILDSCGSDVIELGVPYSDPLADGPVIQAAATRALKKGATFDSVIAMLKGVELSCPIVIFTYYNPILKRGMAIIK
Zeam SDTMAAFIPYITAGDPDLATTAEALRLLDGCGADVIELGVPCSDPYIDGPIIQASVARALASGTTMDAVLEMLREVELSCPVVLLSYYKPIMSRSLAEMK
triae SDTMAALIPYITAGDPDLATTAEALRLLDACGADVIELGVPCSDPYVDGPIIQASSARALAGGATMDGVLAMLKEVELSCPVVLFSYYRPILCRGLAEIK
Ostlu SEAFKAFIPFICAGDPDLESTKKALKILDDAGADVIELGVPYSDPLADGPVIQAAATRALENGATLNKVIDLVREMQIKAPIVMFTYYNPIYQRGCADIA
Ostta SEQFGAFIPFICAGDPDLESTKKALKILDDAGADIIELGVPYSDPLADGPVIQAAATRALEAGATLDKVIALVKEMQIKAPIVMFTYFNPIYQRGCADIA
Chlre SGTMNAFIPFICAGDPDLDTTSLALRKLDEVGADVIELGVPYSDPLADGPVIQGAATRALDKHTTLDKVIEMVRRTAMKAPLVMFTYYNPIMRKGARTIK
Glovi ARVFAAFIPFITAGDPDLETTAEALLTLDRNGADLLELGLPYSDPLADGPTIQAAATRALARGTTPGAVLDLVARLELRAPLIVFTYFNLILAVGVERLA
Clothe ERAFSAFIPFITAGDPSLEITEQLVYRMAEAGADLIELGIPFSDPVAEGPVIQEADYRALSAGTTTDKIFDMVGRISCDIPIAFMTYANPIFTYGLKRCG
Laccas ADVFKVFIPFIVADDPDFETTVKNVVALAKGGADIVELGIPFSDPVADGPVIQAADLRAFAANVRTKTVFDIVEAAETAVPIVFLTYLNIVFKYGLKRCA
Theet DKKFEALITFITAGDPDIETTYDIVLAIEEVGADIIELGIPYSDPLADGPTIQASSQRALNKGVKIPDIMRIVEKIKSDIPLVYLVYYNSIFKYGLKESK
Metbur ADKFNALLAYVCAGDPDIDSTPRIVDSLIKGGADIIELGLPFSDPVADGPTIQAASERALTAGMNPDRYFELVANLDVQVPLVCMTYYNLIYKRGVKDCI
Metmaz SEKFDALIGYVMAGDPTFEASSEVVKALAKGGADIIELGFPFSDPVADGPTIQVAGQRALAEGMDIERYFAFARALEVDVPLVCMTYYNPVFRYGVENAA
Natpha EDAFDAFVPYLAAGDPDFESSLAYVEALARGGADVIELGLPFSEPIAEGPTIQQAVVRSLEGGMTPERFFEFVETLDVDVPLVCMTYYNLIYQYGVERAA
Methun KEAFELLMTFTVAGDPDFETSLEIIKALENGGADIIELGLPFSDPVADGPVIQQADQRALASGMNTDRFFDLVREVSSDIPLVVLTYTNLILQRDYQDAA
Metjan AEKFEAFVAFYVGGDPNLEISEKALEVICK-HADIVEIGIPFSDPVADGITIQKADVRALNSGMNPLKAFELAKKLAPNVPKVFLTYYNIIFKMGVKKCK
geome TGTFAALVTFITAGDPDLATTEELIPLLAENGADIIELGVPFSDPMADGPTIQLSSERALAAGTTLSRILATVKSVRTQVPIVLMGYFNPIFSYGAADAA
Pelpro THCFNALVTFITAGDPDLATTQAMIPLLQQAGADIIELGMPFSDPMADGPTIQLSSERALAASTTLERILAMVRAVSCQVPIVLMGYLNPIHAYGANDAA
Desac EETFAALIPFITAGDPNMDTTEKIIATLVDAGADLIELGVPFSDPMADGPTIQAASERALAAGATLDSVLDLVERVFSQVPIVLMGYYNPVFCYGAARAA
Ralso AQTFSGLIPFITAGDPYPELTVDLMHALVKGGANVIELGVPFSDPMADGPVIQRASERALAKKIGLRTVLDYVRAFDKTTPVVLMGYANPIERMGAKAAS
Burce QQTFAGLIPFITAGDPDPAKTVEFMHALAEGGADVIELGVPFSDPMADGPVIQRSSERALARGVTLKSVLADVKCFNQTTPVVLMGYANPIERMGATEAQ
Polna ESTFSALIPYVTAGFPFADVTPELMHGMVAGGADVIELGMPFSDPSADGPVIQKAGEKALSFGIGLVQVLEMVRIFDHTTPVVLMGYANPVERYDIRDAA
Nitmu STLFGALIPFITAGDPEPGMMVPLMHELVQAGADVIELGVPFSDPMADGPTIQRSSERALKHRVSLQDVLAMVGEFDSSTPVVLMGYANPVEAMGTARSK
Xylfas DETFRALIPFITAGDPSLEAAVPVMHALVRAGADVIELGVPFSDPMADGPVIQHSSERALQRGVGLAYVLQTVDVFDAVTPVVLMGYLNPLEIYGTQQAL
psefl QTRFAALVTFVTAGDPDYDTSLAILKGLPKAGADVIELGMPFTDPMADGPAIQLANIRALGAKQNLTKTLQMVREFNSDTPLVLMGYFNPIHKFGIAEAK
azovin QTRFAALVTFVTAGDPDYETSLAILKGLPEAGADVIELGMPFTDPMADGPAIQLANIRALGAGQNLVKTLRMVRAFDRTTPLVLMGYFNPIHYYGIAEAR
Meslo DRRMAALVTYFMGGDPDYDTSLSIMKALPGAGSDIIELGMPFSDPMADGPAIQAAGLRALKGGQTLVKTLKMASEFDNETPIVLMGYYNPIYIYGLKDAL
Agrob DKRFAALITYFMGGDPDFQTSLGIMKALPEAGADVIELGMPFSDPMADGPAIQLAGQRALKGGQTLKTTLDLAREFDNATPIVMMGYYNPIYIYGLDDAI
rhopa DTRFAAFVTFVMAGDPDLATSLQVLKALPAAGADIIEIGMPFTDPMADGPAIQAAGLRALHSGATLSHTLGLVRDFDDTTPMVLMGYYNPIYIYGLADAK
Azobr ARRFAGLVTFITAGDPDLETCRAVLHGLPAAGADLIELGLPFSDPMADGPAIQAASLRALHAGTTARKTLDLVRGFDADTPVILMGYYNPIHAYGLADAI
Oceal APTFAGFVAYVMAGDPDADTTLKMMQGLADKGADVLELGVPFTDPMADGPTIQRAAIRALESGMTLKGVLALVKRFHKDTPVVLMGYANPFFAYGASDAA
Pellu ENRITLLLAYYMPEFPVAGSTLPVLEALQDGGADIIELGIPFSDPVGDGPVIQNAAHIAIRNGVSVRSLLELVRKAKITVPILLMGYSNPLIAYGLHDAV
provib ENRITLLLAYYMPEFPVAGATLPVLEALQESGADIIELGIPFSDPVGDGPVIQEAAHRSIANGVSLHRLLDIVGRAKITVPILLMGYCNPLIAYGLTDAT
Clote ENRITLLIAYYMPEFPVPGATLPVLEALQESGVDLIELGMPYSDPIGDGSVIQDAAHKAISHGVHVGSIFELVRRAKITTPILLMGYCNPLIAYGMADAV
Phatr EDAFAAFVTFVTAGYPTAADTPAILMAMQEGGAALIELGIPYTDPQADGATIQHTNQVAIKGGSEIHQCLDMVKKSGLTVPVVLMGYYNPFLQYDCEETK
thaps EQAFAAFVTFVTAGFPVKEDTPAILLAMQAGGASVIELGIPYTDPQADGTTIQQTNQVAIKAGSDITQCLSMLESAGLTVPVVLMGYYNPFFQYGCKKAK
Lacbi RRVFEALVTFVTAGYPRKDDTVPILRAMQAGGADIIELGIPFSDPIADGPVIQETSTIALKNGIDYVTVLGQLREAGLTAPVLLMGYYNPLLAYGIQDAA
copci DCVLTCLRCFRDRRYPKKEDTVPVLLALQAGGADIIELGIPFSDPIADGPVIQEANTVALKNDIDYPTVLGQIREAGLTAPVLLMGYYNPMLAYGIQDAA
Ustma KAVFAVFVSFVTAGFPTKDDTVEVLLALEQGGADVIELGVPFSDPQADGPAIQESNQVALEQGVGYTQCLDYIRQAGLKAPVLLMGYYNPTLAYGVQDAK
Neucr KQTFQALVTYVTAGFPHPEQTPDILLAMEKGGA-VIELGVPFTDPIADGPTIQTANTIALQHGVTLQSTLQMVRDAGLKAPVMLMGYYNPLLSYGLNDCK
Asfum KSTFAALVAYITAGYPTVEETVDILLGLENGGAGIIELGIPFTDPIADGPTIQRANTQALANGVTVTTVLNMVRQAGLKAPLLLMGYYNPVLRYGLKDCK
schipo KKTFLVLVTFVTCGFPNVDETIKIMQGLQNGGAGIIELGIPFSDAVADGPTICKGNEIALKNNITLEKVFETVKLAGVTIPIILMGYYNPIFSYGIQKAK
Canal KETFAALVNFITAGFPTIDDTIPILQNMQNAGVDIIELGVPFSDPIADGPTIQQANNIALDNGITVPKCLELLSQAGVTVPIILMGYYNPILKYGLKDAA
Sacce RQTFAALVTFMTAGYPTVKDTVPILKGFQDGGVDIIELGMPFSDPIADGPTIQLSNTVALQNGVTLPQTLEMVSQAGVTVPIILMGYYNPILNYGIQDAA
Pram AEVIAAFITFVPCGFKTKADTVDILLGLQRGGANIIEVGIPYSDPQADGPTIQRAHQVGVDQGITLHDVLATVSEAGLITPVVLMGYYNNIMQYGCPDAQ
Psoj AEVIAAFITFVPCGFKTKADTVEILLGLQRGGANIIEVGIPYSDPQADGPTIQRAHQVGVDQGITLTDVLATVSEAGLTTPVVLMGYYNNILQYGCPDAQ
Pinf SEVIAAFITFVPCGFKTKADTVDILLGLQRGGANIIEVGIPYSDPQADGPTIQRAHQVGVDQGITLHDVLATVSEAGLTTPVVLMGYYNNILQYGCPDAQ
NAGAAGLIVPDLPLEESTELLLAAGQVGLALIPLVAPTTRRRLARITAAAQAFVYCVTVTGITGTSQNVTGEIEELSKEVREELPMVAGFGIASPEQAVK
AAGVDGLIVPDLPLEENPPLRQTLEPAGLALIPLVAPTTGERLARIAATARGFIYCVSLTGVTGVREGLPPGIDEYLAGVRADLPLGIGFGIGSPDQARL
EAGVDGIIVPDLPYEEQDIIAPLLREANIALIPLVTVTSIERIKKITSESEGFVYAVTVAGVTGVRQNFKDEIHSYLEKVKSHLPVVAGFGISTKEHVEE
QAGAKGLIIPDLPLEEVDYILELCNLYSIELILFVAPTSQSRIQLIASKSPGCIYLVSSCGVTGLRDNFDVKIQHLANNIKSNKLIMLGFGINNPDQISQ
LMGAKGLIVPDLPLEETDELIVICNDNQIELVLFVAPTSMKRINSISKKSPGCIYLVSSTGVTGVRDDIDIKVMELSNYIKKNKFIMLGFGISTPEHIKK
NAGIRGLLIPDLPIEESEYVISVCNLFNIELILLLAPTSRERISKIIKRAPGCIYLVSTTGVTGQKSQLTSQLKELTETVKTNKSIILGFGISTTEQIKE
RAGIKGLLIPDLPIEESDYIISVCKLFNIELIFLLSPTSIERINKIVEQAPGCIYLVSTTGVTGQKPELTGKLKRLTETIKKQKPIILGFGISTAEQIKE
QLEVQGIIVPDLPYDESQILKKKCTINNIALISLIALTSFSRIKKIARNAEGFLYLISKTGVTGGTGKLMNKLKIIIKTIQKSKPVVVGFGINSRRQIKQ
AHNVKGLIVPDLPYEESKTLRQICDRYGLNIIWLISPTTKTRAQELARACKDWIYVISRTGVTGLETEFDKQIPKLIGELKKKAPIALGFGISKSEQVKL
AAGVQGLVVPDLPLEEAETLLKPAAAMGIEVTLLVAPTSIERIQAIATQSQGFIYLVSVTGVTGMRTQVGSRVEELLKNLRSNKPIGVGFGISEPEHALQ
DVGARGLVVPDLPLEEADILLEPAKDIGIELTLLVAPTSKERIKAIAHQSQGFIYLVSVTGVTGMRAQMQTRVEDLLAQMREDKPIGVGFGISQPEQALQ
AAGVAGLVVPDLPLEEAAGLLEPAKEMGIDVILLVAPTSAKRIEAIAHSSQGFIYLVSVTGVTGVRSQLESRVSDLLKQIRGEKPIGVGFGISDAAQARQ
AAGVAGLVVPDLPLEEAAGLLKPATERGIDLILLIAPTSSERIEAIARSSQGFIYLVSVTGVTGMRSQVEGRVLDLLQKVRQDKPLGVGFGISQPAQATQ
QAGIKGLVIPDLPLEEAEPVLAETANLGLELTLLIAPTTPERMRAIATASQGFIYLVSTTGVTGMRQEMASRVQELLHTLRQPKPIGVGFGIASPEHARQ
AAGANGLVVPDLPLEESQRLSEVAAERGIDLILLIAPTSADRIAAISKQARGFIYLVSVTGVTGMRQGMQSRVADLLQEIRQDKPIGVGFGISGAEQARQ
AVGVQGLVVPDVPLEETEMLRKEALNNDIELVLLTTPTTTERMKRIVDASEGFIYLVSSIGVTGARSSVSGKVQSLLKDIKEDKPVAVGFGISKPEHVKQ
DVGVQGLVVPDVPLEETEFLRKEALNNSIELVLLTTPTTTERMKRIVDASEGFIYLVSSIGVTGARASVSGKVQSLLKDIKEDKPVAVGFGISKPEHVKQ
NAGVHGLLVPDVPLEETETLRNEARKHQIELVLLTTPTTKERMNAIVEASEGFIYLVSSVGVTGTRESVNEKVQSLLQQIKESKPVAVGFGISKPEHVKQ
QAGVHGLVVPDLPLEETALLRNEAVMHGIELVLLTTPTTTERMKEIAKASEGFIYLVSSVGVTGARSNVNLRVEYLLQEIKKDKPVAVGFGISTPEHVKQ
EAGVHGLIVPDLPYVAAHSLWSEAKNNNLELVLLTTPAIEDRMKEITKASEGFVYLVSVNGVTGPRANVNPRVESLIQEVKKNKPVAVGFGISKPEHVKQ
EAGVHGLIVPDLPYVAAHALWSEAKKNNLELVLLTTPAIEERMKEITKASEGFIYLVSVNGVTGPRENVNLRVESLIQEIKKDKPVAVGFGISKPEHVKQ
AAGAKGLLVPDIPLEETYDVSEIASKHGIELVLLSTPTTVERAKKIAQATKGFVYLVSVTGVTGVQSNVATRVEQLVEELRSDKPIAVGFGVSEAKHAKQ
AAGAKGLLVPDIPLEETYSMSEIASTHGIELVLLSTPTTVERAKKIAQATKGFVYLVSVTGVTGVQTQVASRVESLVEELRADKPIAVGFGVSQAAQAKQ
EAGAAGLLVPDLPLEETVSVRAACEKAGIELVLLATPTTQARMRAIAQASQGFVYLVSVTGVTGMKEQVSGRVEGLVSELKADKPVCVGFGVSRAEHAKQ
ASGASGLLVPDLPVEEGDALQTAANVQGLDVIWLVAPTSPERLRRIAERTTGFVYLVSTTGVTGARTQVASSVRTSLAQLRATRPVAVGFGISTPEQAHE
ETGIDALIVPDIPFEEKEELAPFCKEYDVRFISMIAPTSKERIRMIAREAEGFIYCVSSMGVTGVREKIGDDAKEMIKIVKEDIPCAVGFGISTPEQAAQ
DLNVAGLVIPDLPYESRDEIVPIAEKYGIDIIPLITPTSGHRIEKIAKSASGFIYVVSSMGITGERDEFFAGLKALVAEIKQNVPTAIGFGIHTPEQAQT
DVGIDGLIIPDLPLEERKDILEEADKYGIYLIPLVAPTSKERIKLITENGKGFVYCVSITGVTGAREDIETDIEEYMKTVSQNMPKAIGFGISTPEMAKK
SSGISGLIIPDLPAEESADLANCCSQEGVDLIFLVAPITDERIEMILSKTSGFVYIVSRSGVTGTRSDVTAATSDIISRVRTDIPKAVGFGISNAEQAAK
EAGISGLIIPDIPVEEAADLKTGCDAHGLDLIFLVAPTTEARIRKILQRGSGFIYLVSRLGVTGARDDVAGSTKELLSRVNTDIPKAVGFGISTGEQAAE
EVGLKGFVVPDLPAEEAGPLREACDEYGLDLVFIVAPTTGDRLERMMEQVSGYVYVQARLGVTGAREDVSDRTAETLARLEADVPKAVGFGISSGEQAEA
DAGIDAVVVADLPYEEAGPYITAAETAGVAPVMMVSTTTPERLSKILTVKSGFIYLVAALGVTGMRQKTDPVAQKLLADLKNDIPIAPGFGISDREQVRE
EAGVSGIIVPDLPIEEADSLYNYCKKYGVDLIFLVAPTTDERLKKILEKCSGFVYVVSVTGITGAREKVAEETKELIKRVKKKIPACVGFGISKREHVEE
AAGVDGVLLVDLPPEEAGEFKACADRHGIDVIFLLTPTSETRIRTVTNRARGFIYYVSVTGVTGVRSGIEASVAGNVNIIKEKVPVAVGFGIATPEQAGE
EAGVDGVLVVDMPPEEAESFLNHANARDLQVAFLLTPTSDSRIATVGRLGRGFVYYVTVTGVTGARQQVSTTLGGELAKVRASVPIVAGFGISTPQQAAD
QAGVDGLLLVDLPSEEREELHIHLKPKGIHLITLLAPTTPDRAAQLLKQAQGFVYYVSMTGVTGTSKVDGSAIESQVVQLREPVPVAVGFGITTEQDAAA
EAGVDGVLVVDYPPEECEAFAKTMRAAGIDPIFLLAPTSEARMAQIARVASGYIYYVSLKGVTGAATLDLDSVAARIPQIRQRLPVGVGFGIRDAATARA
AAGVDGVLVVDYPPEEAGVFAEKMRAAQIDPIFLLAPTSDERIADVGKIASGYVYYVSLKGVTGAGNLDVSSIAGKIPAIKSPVPVGVGFGIRDAETARA
AAGVDGMLIVDYPPEECVEFSARLKAHGMDLIFLLAPTSDARMAQVAQVASGYVYYVSLKGVTGAGTLDVDAVEAMLPRIRRNVPVGVGFGIRDAATAKA
ACGVDGVLIVDYPPEESVKWVEYLKRQNIAPIFLLSPTTQQRVERVASLAEGYVYYVSLKGVTGSLHLDLHDVAEKLDGLRSSIPIGVGFGIRDGATARA
ASGVDGVLLVDLPPEEADEIRAIFSAAGLALIVLASPTTASRLATLSGVAQGYLYYVSFAGVTGADRLDAQSAGDRLRGLRAQVPVVVGFGIRDAASAVV
EAGVDGLIVVDMPPEHNSELCDPAQAAGIDFIRLTTPTTDARLPKVLNGSSGFVYYVSVAGVTGAGAATLEHVEEAVARLRRDLPISIGFGIRTPEQAAS
EAGVDGLIVVDLPPEHNEDLCDPAQAAGLDFIRLTTPTTDKRLPRVLAGSSGFVYYVSVAGVTGAHAASLEHVEQAVARLRRDLPLCIGFGIRSPEHAGS
ASGIDGLIVVDLPPEMDEELCIPALKAGINFIRLATPTTDKRLPKVLQNTSGFVYYVSMTGITGSALADTGKVAAAVNRIKGDLPVCVGFGVKTAEQARV
ASGIDGLIVVDLPPEMDDELCIPALARGINFIRLATPTTGKRLPAVLKNTSGFVYYVSMNGITGSALPDPSLISGAVGRIKAELPVCVGFGVKTADHAKA
AAGVDGLIIVDLPPEEDSELCLPAMKAGLNFIRLATPTTEKRLPAVLANTSGFVYYVSITGITGSASADSAAVGDAVARIKRDLPVCVGFGIRTPEAARA
EAGVDGLIVVDLPPEEDEELCIPALKAGVNFIRLATPTTDKRLPAVLQNTSGFVYYVSIAGITGAASADNAAVGAAVERLKRDLPVAVGFGIKTPEQAAE
AAGADGVICVDIPPEEDTEFRSALDANGLSFVRLAAPTTDKRLPQVVAHTSGFVYYVSTTGVTGAGSGATGDIEAAVARVRAGLPVAVGFGVKTPERAQE
KAGVDGLLIPDLPPEESADFLQRAKSLGLTVVYLISPVTPERIEWIDSLSTDFSYCLAVNATTGADASTEASVDRYLERVRLRKKFVVGFGIRDRARVEH
EAGIDGLLLPDLPPEEAGEFLERAKAFSMTVVFLISPVTPERIEMIDGMSTDFSYCLAVNATTGSEGDTEASVDEYLKRVRRKKKFVVGFGIKDRERVEH
KAGVDGLLIPDLPPEESEDFLERAKHFGLSVIYLISPVTPDRIELIDSMSTDFSYCLAVNATTGDVAGMDEKIAEYLKRVRQKKKFVVGFGIKDRERVRK
AAGADGFIVVDLPPEEGIALNKACIANGLSNIPLVAPTSDKRIASLTDMASTFLYCVSVTGVTGARESLPPDLEEFITRVRSELPLAVGFGISNPEMVNG
ECGADGFIVVDLPPEEGADLAKACNKYGLSNIPLIAPTTDERIGHLAKTASTFIYCVSTTGVTGARSELPSDLDDFIKRVRSDLPLAVGFGISNATMVQS
EAGANGFIMVDLPPEEAISFREKCRKYRLSYVPLIAPSTLHRIKFLASIADTFIYVVSKMGTTGEKIAMNNALPDILARIRSSIPLAVGFGVATRDHFNV
EAGANGFIMVDLPPEEAIAFRQKCAASNLSYVPLIAPSTLKRIQFLASIADSFIYVVSKMGTTGANVAVNEELPTILSRIREHVPLAVEFGVATRDQFNY
AAGANGFIMVDLPPEEAADFRASCTKHGLSYVPLIAPSTTKRIEHLASLADSFIYVVSKMGTTGATAAVSSSLPDLISKIRSSIPLAVGFGVSTRQHFIE
EAGVNGFIIVDLPPEEAVSFRQLCTRGGLSYVPLIAPATDARMRVLCQLADSFIYVVSRQGVTGASGTLNANLPELLARVKKNKPAAVGFGVSTHDHFTQ
EAGVNGFIMVDLPPEEAVRFRDLCASNGMSYVPLIAPATEARMKLLCKIADSFIYVVSRMGVTGATGKLSSNLPELLKRVHQNVPAALGFGVSTREHFLS
EVGANGFIIVDLPPEEAVGFREECKKQGVSFVPLVAPSTDRRMELLASVADSFIYVVSRMGSTGATGVINTALPQLCQRVRKDTPLAVGFGVNTSEHFHQ
EAGANGFIVVDLPPEEAIKFRTECTKYGLSYVPLVAPATNDRLKILGEIADSFIYVVSKMGTTGASTKVSTGIQELCDRVRKDTPLAVGFGVSTREHFLT
KAGANGFIIVDLPPEEALKVRNYINDNGLSLIPLVAPSTDERLELLSHIADSFVYVVSRMGTTGVQSSVASDLDELISRVRKDTPLAVGFGVSTREHFQS
KAGVDGFIIVDLPPEEAKVLSDDAAKHGLSYIPLVSPTTEERMKLIDSVAHGFVYCVSLTGVTGARTELPPNLDSFMTKIRAKHPLALGFGLSTRQHFVQ
KAGVDGFIIVDLPPEEAKPLSDDAAKHGLAYIPLVSPTTEDRMKLIDSVAHGFVYCVSLTGVTGARTDLPPNLDAFMATIRAKHPLALGFGLSTRQHFVL
KAGVDGFIIVDLPPEEAKTLSDDAAKHGLAYIPLVSPTTEERMKLIDSVAHGFVYCVSLTGVTGARNELPPNLDAFMAKIRAKHPLALG------
VAKYCDGVVVGSALVKLVETHLTRQLK
LAPMGDGIIVGSALVDVLY--LVERLR
MVTICDGVVVGSKVIELLENEATKQKE
IINWIDGIVVGSAIITHIVGEFCKKLK
IMKWIDGVVVGSAFVKKLSALLCKSLK
IKGWINGIVIGSAFVKRLSGNFCQDAK
IKGWINGIVIGSAFVKRLSSDFCTTAK
LIEWSNGIVIGSPCVQILLQSLIKQIK
VKSWADGVIIGSACMQILL--WISVMK
VKNWSDAVIVGSACVKRLAQGFCQSLK
VKKWSDAVIVGSAVVKRLAEGFCQNLK
VKEWADAAIVGSAVVKRLAEGLCQSLK
VRDWADAAIVGSAFVQRLATGFCQSLK
VRDWADAAIVGSAFVKRLAE-FCQSLR
VRDWADGVIVGSAFVNRLQE-LCRELR
IAGWADGVIVGSAMVKLLGDALTKSLK
IAGWADGVIVGSAMVRLLGDALTKSLK
VAEWADGVIVGSAMVKILGESFTKSLK
IAGWADGVIIGSAIVRQLGEAYAKNMK
IAQWADGVIIGSAMVRQLGEAYARGMK
IAGWADGVIIGSAMVRQLGEAYARSMK
IVDWADGVIVGSALVRALGEAKAEEIR
IVDWADGVIVGSALVRALGEAKADEIR
IVSWADGVICGSALVKALGEALARELR
VASLADGVIVGSACVQLLATAFCRQLK
MAGFSDGVIVGSAIVKIIAQ-YVRKMK
MAGIADGVIIGSAIVDLVAK-FTKQIR
LKDFSDGIIVGSALVERIAKGFVSILK
IIDAADGVIVGSAFVDIIASGLTAEIK
VRKAADGVIVGSAFVRIIEEGLARELK
VVAAADGVIVGSALVDIVAEGLARELK
WTDAADAVIVGSALVREIEDSLIPRIT
ITEIADGAIVGSAIVKIVEKHFLKELE
VAATADGVVVGSAIVKLFEKHFVSSLK
VAAMADGVVVGSALVKLFQLHFVASLR
IARFSDAVVVGSALVKVIQQHFVAELK
IGGVADAVVIGSRIVQLLEEAFIADIR
VAEVSDAVVIGSRLVQLLESAFIAELR
IGKVADAVVIGSKIIQLIENQFLKEIR
VAELADAVVVGSRIIEEIERSLVKSLR
MAVDADGVVVGSALVTALSDAFLAPLR
IARLADGVVVGSALIDHIANALCSALS
VARLAEGVVVGSALVDRIAKALCRELA
IGANADGVVVGTAIVNAVANVLVSGLA
IGAVADGVVVGSAIVNQIAGSLVKGLS
IAAEADGAVVGSALIDALQKSLVASLA
VARVADAAVVGSAIVTRLAGGFVRELA
IGKVADTVVVGSAIVEELATKLAGTLA
MWRLADGAVVGTALLEHIAGAFWRGLR
MWRFADGAVVGTALLQNIASAFWRTLR
MWELADGAVVGSALLQHVATAFWKSLR
VANMADGVVVGSAILKAMDSLFLAELD
VANIGDGVVVGSAILRAVQSA------
VSDAADGVVIGSRLVSVIKDAFWKEFR
VADAADGVVIGSRIVNAIKAAFWEEVR
VGEHADGVVIGSKLIAKLREAFWKEFE
VGAIADGVVVGSMIITTIQKAFWEEYR
VQELAEGVVIGSQIITVLGQAFWEEFR
VGSVSDGVVVGSKIIDLILKAFWEEFR
VGEVADGVVIGSKIITLIGDSFWKEFR
VGSVADGVVIGSKIVTLCGDAFWEDFK
ASALADGVVIGSKIVKIIEDA------
ASALADGVVIGSKIVKIIEDA------
------
Tryptophan synthase Beta subunit amino acid sequence alignment
66 347
Crypa YKYGGYVPEVINNAMKEIEDAYKISKSEDFINELKKIRKFQPTPIYYAKNLTAEIYLKREDLNHTGAHKLNHCMGEALLAKYMGKKKLIAETGAGQHGVA
Cryho YKYGGYVPEVINNAMKEIEDAYKISKSEDFINELKKIRKFQPTPIYYAKNLTAEIYLKREDLNYTGAHKLNHCMGEALLAKYMGKKKLIAETGAGQHGVA
PhatrB YEYGGQLPPQLVEIMNEISESYKLIRTDAFQIELDSLNKFIPSPIFYARRLTARIFLKREDLNHTGAHKINHCLGEALLAKHMGKTKVLAETGAGQHGVA
Salty YEFGGYVPQILMPALNQLEEAFRAQKDPEFQAQFADLLKYAPTALTKCQNITTTLYLKREDLLHGGAHKTNQVLGQALLAKRMGKSEIIAETGAGQHGVA
Ecoli YEFGGYVPQILMPALRQLEEAFSAQKDPEFQAQFNDLLKYAPTALTKCQNITTTLYLKREDLLHGGAHKTNQVLGQALLAKRMGKTEIIAETGAGQHGVA
Arab1 RKFGGYVPETLMHALSELESAFALATDDDFQRELAGILKYVESPLYFAERLTPLIYLKREDLNHTGAHKINNAVAQALLAKRLGKKRIIAETGAGQHGVA
Arab2 RKFGGYVPETLMHALSELETAFSLATDEDFQRELAEILKYVESPLYFAERLTPLIYLKREDLNHTGAHKINNAVAQALLAKRLGKKRIIAETGAGQHGVA
Oryz1 RTFQGYVGETLMHALSELESAFKLATDDDFQRELAGILKYVLSPLYFVESLTPLIYLKREDNNHTGAHKINNAVAQALSAKRLGKKRIIAETGAGQHGVA
Polyg RKFGGYVPETLMHALDELETAFSLATDVEFQKELDGILKYVETPLYFAERLTPQIYLKREDLNHTGAHKINNAVAQALLAKRLGKTRIIAETGAGQHGVA
Camp RKFGGYVPETLMYALTELESAFSLSGDQVFQKELDGILKYVESPLYFAERLTPEIYLKREDLNHTGAHKINNAVAQALLAKRLGKKRIIAETGAGQHGVA
Oryz2 RKFGGYVPETLMHALTELEAAFALAGDEDFQKELDGILKYVETPLYFAERLTPMIYLKREDLNHTGAHKINNAVAQVLLAKRLGKERIIAETGAGQHGVA
Sorgh RKFGGYVPETLMHALTELENAFALATDEEFQKELDGILKYVESPLYFAERLTPLIYLKREDLNHTGAHKINNAVAQALLAKRLGKQRIIAETGAGQHGVA
Nost RRFGGYVPETLMPALAELETAYQYRNDPGFQAELQQLLRYVATPLYFAERLTAQIYLKREDLNHTGAHKINNALGQVLLAKRMGKQRIIAETGAGQHGVA
Anab RRFGGYVPETLMPALAELETAYKYRHDPGFQAELQQLLRYVATPLYFAERLTAQIYLKREDLNHTGAHKINNALGQVLLAKRMGKQRIIAETGAGQHGVA
Croco RKYGGYVPETLMPALSELETAYRYKNAPEFQAELSQLLKYVPSPLYFAERLTPQIYLKREDLNHTGAHKINNALGQVLLAKRMGKKRIIAETGAGQHGVA
Proma RRFGGYVPETLMPALAELEKKAEAWQDSSFTNELSHLLKYVATPLYEAKRLSPRIWLKREDLNHTGAHKINNALGQALLAIRMGKKRIIAETGAGQHGVA
Chlre RQFGGYVPETLIPALEQLEKDYEAIADPAFKAEMEAILKYVETPLYHAERLSAEIYLKREDLNHTGAHKINNSLGQALLCKRLNKQRIIAETGAGQHGVA
CME KPFGGFVPETLISCLEELEEAYKVSKDPTFQQELAQQLRFAPTPLYFAERLTMQIYLKREDLLHTGAHKINNALGQVLLARRMGRTRIIAETGAGQHGVA
Gloeo RRFGGYVPETLMSALAQLEAAFQYRHDPQFLAEFAGHLRFVPTPLYFAERLTGRIFLKREDLNHTGAHKINNALGQALLALRMGKRRIIAETGAGQHGVA
Exisi QIYGGYIPETLMQAVLELEQAYEVKEDPAFRERMNDLLEYVQTPLYYAEHLTAKIYLKREDLNHTGAHKINNTIGQALLAERMGKRKIVAETGAGQHGVA
Lismo FKFGGFVPETLMKAVKELDEAYASKTDPAFQKELNYYLKYVETPLYFAEQLTAKIYLKREDLNHTGAHKINNTIGQALLARQMGKQKVVAETGAGQHGVA
Asory REFGGYVPESLMDCLAELERGFEALNDPKFWEEYRSYYPYMPSSLHLANRLTANIWLKREDLNHTGSHKINNALGQILLARRLGKTRIIAETGAGQHGVA
Neucr REFGGYVPEALMDCLSELEEGFKIKDDPAFWEEYRSYYPWMPGQLHKAERLTANIWLKREDLNHTGSHKINNALGQLLLARRLGKKKIIAETGAGQHGVA
Sachce RDFGGYVPEALHACLRELEKGFEAVADPTFWEDFKSLYSYIPSSLHKAERLTAQIWLKREDLNHTGSHKINNALAQVLLAKRLGKKNVIAETGAGQHGVA
Cangla RDFGGYVPEALHTCLKELEEGFDAIADPTFWEEFKSLYSYIPSSLHKAERLTAQIWLKREDLNHTGSHKINNALAQVLIARRLGKTEIIAETGAGQHGVA
Ustma RKFGGYIPEALFDAHQELEKAYDALDDPEFWKEFEGYYEYIPSELYHAERMSAQIWFKREDLNHTGSHKINNAVGQILLARRLGKTRIIAETGAGQHGVA
Phatr1 RKFGGYIPETLSVAFEEIEASYELKDDPSFLAELDEYRRFVPTPLHRADRLTATIWLKREDLAHTGAHKINNAIGQALLAKRIGKPRIIAETGAGQHGVA
Thaps RKFGGFIPETLSEAFREIEAEYKVKNDPAFLAELDVYRRFVPTPLHKAERLTATIWLKREDLAHTGAHKINNAVGQALLAKRIGKPRIIAETGAGQHGVA
Pram YEFGGFVAETLIQAHHNLIDEYRATQDASFREELEHLGRYIPTPLYHAKRLTAQIWLKREDLAHTGAHKINNALGQAVLAKRLGKTRIIAETGAGQHGVA
Psoj YEFGGFVAETLIQAHHNLIDEYKATQDPKFREELEHLGRYIPTPLYHAKRLTAQIWLKREDLAHTGAHKINNALGQAVLAKRLGKTRIIAETGAGQHGVA
Pyrab2 WKFGGYVPETLMEPLRELEKAYRLKNDEEFNRQLDYYLRWAPTPLYYAERLTAKIYLKREDLLHGGAHKTNNAIGQALLAKFMGKTRLIAETGAGQHGVA
Pyrfu2 WEFGGYVPETLIEPLKELEKAYRFKDDEEFNRQLNYYLKWAPTPLYYAKRLTAKIYLKREDLVHGGAHKTNNAIGQALLAKFMGKTRLIAETGAGQHGVA
Theko2 FRFGGFVPETLIEPLKKLERAYKFKDDPEFNETLEYYLRWAPTPLYYAERLSAKIYLKREDLLHGGAHKTNNGIGQALLAKFMGKERLIAETGAGQHGVA
Archa2 KEFGGFVPEVLIPPLEELEKAYRFKDDEEFKARLEYYLKYAPTPLYFAENLSVKIYLKREDLLHGGAHKINNTIGQALLAKFMGKKRVIAETGAGQHGVA
Aerpe1 EREESIALLSRILPSRLIEDEYILARWVDIPGEVRKALAIGPTPLIRAEGLEVRIYYKSEAVLPTGSHKINTAIAQAYYAKLDGAKEIVTETGAGQWGLA
Pyroa1 DEGESIGLLSKILPSALIDQEFTAERWVSIPEEVREAYRVGPTPLFRAEGLEVRIYYKYEGVLPVGSHKLNTALAQAYYAKADGAVEVATETGAGQWGMA
Pyrab1 EEPIDIEKLKRIFAEELVKQEISRERYIEIPGELRKLYSIGPTPLFRATNLEARIYFKYEGATVTGSHKINTALAQAYYAKKQGIERLVTETGAGQWGTA
Pyrfu1 EEPVDPKKLERIFAKELVKQEMSTKRYIKIPEEVRKMYSIGPTPLFRATNLEARIYFKFEGATVTGSHKINTALAQAYYAKKEGIERLVTETGAGQWGTA
Pyrho EETINPEKLTRIFAKELIKQEFSEKRYIKIPKEVRELYAIGPTPLFRATNLEARIYFKYEGATVTGSHKINTALAQAYYAKKEGIKRLVTETGAGQWGTA
Theko1 EEPMEPEKLLRIFAEELVKQEMSTDRYIEIPKEVREIYSIGPTPLFRATNLEARIYFKYEGATVTGSHKINTALAQAYYAKRQGIERLVTETGAGQWGTA
AERPE2 KEPVDPSALAKLFPKALIEQEVSRERYIEIPGEVHEAYIFAPTPLLRAVNLEAEIYYKYEGVTPTGSHKINTALAQAYYNKLEGVERLVTETGAGQWGSA
Pyroa2 ------MREVYLWRPTPLIRAKRLEARIYYKFEGVSPPGSHKPNTAVAQLYYISREGVDRVTTETGAGQWGSS
Archa1 AEPVKPEDLEPIFPKGLIQQEMSGERWIRIPEDVREIYRWRPTPLVRAERLEARIYFKYEGASPPGSHKPNTAVAQAYYNAKEGVERLTTETGAGQWGSA
Carhy FQPISPADLESLFPKSLIEQEISTERFIEIPEKVREAYAYRPTPLKRAKKLEARIYFKYEGTNASGSHKLNTALAQAYFNKLDGTEQLTTETGAGQWGSA
Magma GQPIGPDDLAPIFPMAVIAQEVTAERWVDIPEPVREVYRWRPSPLIRARRLEAKIYFKYEGVSPAGSHKPNTSVAQAFYNKQEGVKRIATETGAGQWGSS
Rhoru GKPLVADDLAPIFPRAVIEQEMSAERWIDIPEPVREVYRWRPSPLIRARRLEAHIYFKYEGVSPAGSHKPNTSVPQAFYNQQEGVKKLATETGAGQWGCS
Marib GQPVGPTDLEPLFPASLIEQEMTTEREVEIPEPVRDVYRWRPAPMFRAHRLEAKIFYKYEGGSPAGSHKPNSAIPQAFFNREAGIRRLTTETGAGQWGTS
Rhofer GQPVGPDDLAPLFPMALIMQEVSQEREIEIPEPVRDVFRWRPAPLFRAHRLEAKIYYKYEGGSPAGSHKPNTAVPQAFYNQEAGIKRLVTETGAGQWGSS
Rhopa GQPVGPSDLEPLFPMELILQEVATERYIEIPEPVRDVFRWRPAPLIRARRLEAKIYFKYEGVSPAGSHKPNTAVPQAWYNKQAGIKKLSTETGAGQWGSS
Arab3 KEPIKPEDLAHLFPNELIKQEATQERFIDIPEEVLEIYKWRPTPLIRAKRLEARIYFKYEGGSPAGSHKPNTAVPQAYYNAKEGVKNVVTETGAGQWGSS
Dechar -NPVPPEAMGAIFPGPILEQEMSAERWIAIPEEVRQIYAWRPAPLCRALRLEAKIFYKYEGVSPAGSHKPNSAVPQAYFNKIAGTKRLTTETGAGQWGSS
Magco -KPVTPEMMGMIFPPDILQQEMSTERWIDIPDEVRQILSWRPSPLYRAHRLEAKIYYKYEGVSPAGSHKPNSAVPQAYYNKKAGIKRLTTETGAGQWGSS
Geomet RQPVTPDDLLPIFPMAIIEQEVSGERWIPIPEEVREIYRWRPSPLYRARRLEAKIYYKYEGVSPAGSHKPNSAIPQAFYNKQAGIRRLATETGAGQWGSS
Clothe KKPVTLDELQAIFPMELIQQENSQERWIDIPEEVREMYRWRPSPLYRARALEARIYYKYEGTNATGSHKLNTSLPQAYYNKIAGIKRLSTETGAGQWGSA
Methm RQPINPEALEPIFAKELIRQEMSSDRYIDIPAEILDVYRWRPSPLFRAHQLEAKIYYKYEGVSPAGSHKTNTSIAQAYYNMKEGTERLTTETGAGQWGSA
Chloph MTPISPDDLAQVFPMNLIEQEMSTERWIDIPEEVLSILKWRPSPLYRAKRLEAKIFYKNEGVSPAGSHKPNTAVPQAWYNKQFGIKYLTTETGAGQWGSA
Chloli MLPISPDDLAKVFPMNLIEQEMSTERWIDIPEEVLGILKWRPSPLYRAKRLEAKIYYKNEGVSPAGSHKPNTAVAQAWYNREFGIKYLTTETGAGQWGSA
Provi MVPIGPDDLARVFPMNLIEQEMSTERWIDIPDEIQGILKWRPSPLYRARRLEAKIYYKNEGVSPAGSHKPNTAVAQAWYNKQFGIKYLTTETGAGQWGSA
Pelol LTPIGPDDLARVFPMNLIEQEMSTQRWIDIPEEILGILKWRPSPLYRAHRLEAKIYYKNEGVSPAGSHKPNTAVAQAWYNKQFGIKYLTTETGAGQWGSA
Praes LSPIKPEDLARVFPMNIIEQEVSTERWITIPEPVQDILKWRPSPLYRAKRLEAKIYYKNEGVSPAGSHKPNTAIAQAYYNKEFGIKYLTTETGAGQWGSA
Synfu GKPVEPQDLAAVFPMNLIEQEVSTQSWIDIPEAVLEKYLWRPTPLCRARNFEVKIYYKNEGVSAPGSHKPNTAIPQAYYNKVFGIRRISTETGAGQWGSA
Deshaf GKPCTIEDLQPVFCDELIEQELTKDLYIEIPEDIRDFYKYRPSPLVRAYCLEAEIYYKFEGTNTSGSHKLNSAAAQVYYAKKQGLTSLTTETGAGQWGTA
Sulso1 QAYFSIDLLRSILPKEVLRQQFTIERYIKIPEEVRDRYLIGPTPLFRAKRLEARIYFKYEGATPTGSHKINTAIPQAYFAKEEGIEHVVTETGAGQWGTA
Ferac KSDISINLLNKILPKEVLKQEFTFKRYEKIPDEVMEAYEIGPTPLIRARNLEGKIYYKYEGATATGSHKINTAIPQAYYAMKEGVKGVTTETGAGQWGSA
Sulso2 T--GKLELLKEVLPSKVLELEFAKERYVKIPDEVLERYLVGPTPIIRAKRLEIKIYLKMESYTYTGSHKINSALAHVYYAKLDNAKFVTTETGAGQWGSS
Thevo T--GEFETLKKAVPTKVLEYEFSGERYPKIPGEIYEKYMVGPTPIIRAKNLEIKIYLKMESYTYSGSHKINSALAHVFFAKQDNAKFVSTETGAGQWGSA
LATAAAYFGLECEIHMGEVDVKKEYPNVIRMKILGAKVVCVEFGDKTLKEAVDSAFEAYIKDITFYAIGSVVGMVRDFQSIVGIESREQFLIPDIVTACV
LATAAAYFGLECEIHMGEVDVKKEYPNVIRMKILGAKVVCVEFGDKTLKEAVDSAFEAYIKDITFYAIGSVVGMVRDFQSIVGIESREQFIIPDIVTACV
LATACALIGIECEIHMGQVDVEKEAPNVTKMRILGCKLITVTRGTRTLKDAVDSVFEEYLKDPYFYAIGSVI-----FKASWDLEG-----APDAIVACG
SALASALLGLKCRIYMGAKDVERQSPNVFRMRLMGAEVIPVHSGSATLKDACNEALRDWSGSYAHYMLGTAAGIVREFQRMIGEETKAQILRPDAVIACV
SALASALLGLKCRIYMGAKDVERQSPNVFRMRLMGAEVIPVHSGSATLKDACNEALRDWSGSYAHYMLGTAAGIVREFQRMIGEETKAQILRPDAVIACV
TATVCARFGLECIIYMGAQDMERQALNVFRMRLLGAEVRGVHSGTATLKDATSEAIRDWVTNVTHYILGSVAGMVRDFHAVIGKETRKQALGPDVLVACV
TATVCARFGLQCIIYMGAQDMERQALNVFRMRLLGAEVRGVHSGTATLKDATSEAIRDWVTNVTHYILGSVAGMVRDFHAVIGKETRKQAMGPDVLVACV
TATVCASFGLECIIYMGAQDMERQALNVFRMKLLGAEVREVHSGTATLKDATSEAIRDWVTNVTHYILGSVAGMVREFHKVIGKETRRQALGPDVLVACV
TATVCARFGLECIVYMGAQDMERQSLNVFRMRLLGAEVRGVHSGTATLKDATSEAIRDWVTNVTHYILGSVAGMVREFHAVIGKETRKQALGPDVLVACV
TATVCARFGLQCVIYMGAQDMERQALNVFRMRLLGAEVRAVHSGTATLKDATSEAIRDWVTNVTHYILGSVAGMVREFHAVIGKETRKQALGPDVLVACV
TATVCARFGLQCIIYMGAQDMERQALNVFRMKLLGAEVRAVHSGTATLKDATSEAIRDWVTNVTHYILGSVAGMVREFHKVIGKETRRQAMGPDVLVACV
TATVCARFGLQCIIYMGAQDMERQALNVFRMRLLGAEVRAVHSGTATLKDATSEAIRDWVTNVTHYILGSVAGMVREFHKVIGKETRRQAMGPDVLVACV
TATVCARFGLECVIYMGVHDMERQALNVFRMRLMGAEVRPVAAGTGTLKDATSEAIRDWVTNVTHYILGSVAGMVRDFHAVIGQETRAQALGPDILLACV
TATVCARFGLECVIYMGVHDMERQALNVFRMRLMGAEVRPVEAGTGTLKDATSEAIRDWVTNVTHYILGSVAGMVRDFHAVIGQETRAQALGPDILLACV
TATVCARFGLECVIYMGIHDMERQKLNVFRMKLLGATVQPVSAGTGTLKDATSEAIRDWVTNVTHYILGSVAGIVRDFHDIIGQETRRQCQSPHILLACV
TATVCARFGLECVIYMGQEDMERQALNVFRMKLLGAKVQSVTAGTATLKDATSEAIRDWVTNVTHYILGSVAGLVRDFHSVIGEETKQQCKRPDVLLACV
TATICARLGLKCIVYMGAKDMERQALNVFRMRLCGAEVRPVHSGTATLKDATSEAIRDWVTNVTHYILGSAAGMVREFQSVIGRETKVQAQGPDIVMACV
TATVCARFGLPCVVYMGARDMERQKLNVFRMRLLGAEVRPVHTGTATLKDALSDAIRDWVTNPTHYVVGSVAGMVRDFHYVIGEEVYQQVQRPDILVACV
TATVCARFGLECIIYMGVHDIERQKLNVYRMKLLGAEVRPVAAGTGTLKDATSEAIRDWVTHVTHYILGSAAGMVREFQAVIGRETRVQCLRPDVLIACV
TATVCALLDLECVIFMGEEDIRRQELNVFRMELLGAKVVSVTQGSRTLKDAVNEALRYWVKHVTHYLMGSVLGIVRDFQAVIGQETRTQILKPEAIVACV
TATVAALFNMKCTIFMGEEDVKRQSLNVFRMELLGAKVVSVKAGSRTLKDAVNEALRFWVANVTHYIMGSVLGIVRDYQSVIGIEARKQHLKPDAIVACV
TATVCAKFGMECVVYMGAEDVRRQALNVFRMKLLGASVVAVDAGSRTLRDAVNEALRAWVVDLTHYIIGSAIGIVRTFQSVIGDETKQQMMKPDAVVACV
TATVCAKFGMECTVFMGAEDVRRQALNVFRMKLLGAKVVAVEAGSRTLRDAVNEALRYWVVNLTHYIIGSAIGIVRTFQSVIGNETKQQMLKPDAVVACV
TATACAKFGLTCTVFMGAEDVRRQALNVFRMRILGAKVIAVTNGTKTLRDATSEAFRFWVTNLTYYVVGSAIGLVRTFQSVIGKETKEQFAKPDAVVACV
TATACAKFGLKCTVFMGAEDVRRQALNVFRMRILGAKVVSVTNGTKTLRDATSEAFRFWVTNLTHYVVGSAIGLVRTFQSVIGNETKEQFAKPDVVVACV
TATACAKFGMECVVYMGSEDVRRQSLNAFRMKMLGAKVVAVESGSKTLKDAINEANRDWVTNLTHYIVGSAIGIVRDFQSIIGKEVKQQLQKPDAIVACV
TATICAKLGLDCTVYMGAVDCERQKLNVFRMNTLGAKVVPVQDGQRTLKDAINEAMRDWVTNVTHYLIGSAVGIVRDFQSVMGREMRAQMLKPDAVVACV
TATICAKLGLDCTVYMGAVDCERQKLNVFRMNTLGAKVVPVKEGQATLKDAINEAMRDWVTNVTHYLIGSAVGIVRDFQSVMGKEMRAQMLKPDVVIACV
TATACALLGLDCVVYMGYVDTQRQSLNVFRMKMLGAKVVAVKSGSQTLKDAVNEAIRDWVTNVTHYIIGSAIGIVRDFQAIIGREIKAQLQRPDAIVACV
TATACALLGLDCVVYMGYVDTQRQSLNVFRMKMLGAKVIAVKSGSQTLKDAVNEAIRDWVTNVTHYIIGSAIGIVRDFQAIIGREIKAQLQRPDAIVACV
TAMAGALLGMKVDIYMGAEDVERQKMNVFRMKLLGANVIPVHTGSKTLKDAINEALRDWVATFSHYLIGSVVGIVRDFQSVIGREAREQILDPDVIVACV
TAMAGALLGMKVDIYMGAEDVERQKMNVFRMKLLGANVIPVNSGSRTLKDAINEALRDWVATFTHYLIGSVVGIVRDFQSVIGREAKAQILQPDVIVACV
TAMAGALLGMKVDVYMGAEDVERQKMNVFRMGLLGARVIPVESGSRTLKDAINEALRDWVATFSHYLIGSVVGIVRDFQSVIGREAREQILTPDAVVACV
TAMAAALLGLEAEIYMGAEDYERQKMNVFRMELLGAKVTAVESGSRTLKDAINEALRDWVESFTHYLIGSVVGIVRDFQAVIGKEARRQIIGPDAIIACV
ASTAAALMGLKATVFMTASSFKSKIQRRLLMEAQGARVISSPSGRESLGLAIAEAVEYTLESGRRYLPGSVLEAVLMHQTVIGLEALDQLPEPDVVVACV
VSLAAALFGLRAVVFMTRSSYNSKRQRLTFMRAYGAVVYPSPSGRRSLGIAISEAVEYVLSGERKYLPGSVMEFVLLHQTVIGLEAVRQLPEPDVAVACV
LSLAGALLGLKVRVYMARASYQQKPYRKTLMRIYGAEVFPSPSGKRSLGIAISEAIEDVLKDEARYSLGSVLNHVLMHQTVIGLEAKEQMEEPDVIIGCV
LSLAGALMGIKVRVYMARASYEQKPYRKVLMRIYGAEVFPSPSGKRSLGIAISEAIEDVLKDEARYSLGSVLNHVLMHQTVIGLEAKQQMEEPDVIIGCV
LSLAGALIGLKVRVYMTRASYYQKPYRKILMEIYGAEVFPSPSGKRSLGIAISEAIEDVLSDEARYSLGSVLNHVLMHQTVIGLEAKKQVKEPDVIIGCV
LSLAGALLGLNVRVYMARASYQQKPYRKTIMRLYGAEIYPSPSGRKGLGIAISEAIEDVLRDEARYALGSVLNHVLMHQTVIGLEAQEQMKEPDVIIGCV
LSAAGAYFGVKVRVYMVRVSYLQKPYRRTLMELYGAEVYPSPSGRKSLGIAISEAIEDVINSGAKYSLGSVLNHVLLHQTVIGLEAEKQFRVPDIMIGAV
VAFAASLFGVKATVYMVRASYLQKPYRRVLMELWGAEVVPSPSGRKSLGIAISEAVEDAVRSGAKYVLGSVLNHVLIHQTVIGLEALEQIRDPDYVVGAC
LCFATKLFEMACTVYMVKVSFMQKPYRRVMMETWGGEVIPSPSGRKSLGIAISEAIEDAAKNETKYSLGSVLNHVLLHQTVIGLETKAQLEEPDVLIGCV
LAYAANFFGLKLTVYMVGISYDQKPYRRIFMETFGARVVKSPSGRKSLGIAISEAVEVAMSDPTKYALGSVLDHVLLHQTVIGQEVYEELSEPDVLIACV
MAFAGSLFGLEVEVFMVKVSYDQKPYRRALMETYGATCIASPSGRASLGIAISEAVEIAASRDTKYALGSVLNHVCLHQTIIGEEAMMQMEDPDVVIACT
LAFAGSLFGLEVQVFMVKVSFNQKPYRRALMETYGATCIPSPSGRASLGIAISEAVEVAIAHDTKYALGSVLNHVCLHQTVIGEEALLQMEDPDVVIACT
LSFAGSLFDIDVTVFQVRVSYNQKPYRRAVMETYGANCVASPSGRRSLGIAISEAVELAVQDPTKYALGSVLNHVLLHQTVIGLESMKQMECPDVIVGCT
LAFAGSLYGLEVEVFQVRVSYDQKPYRRALMETYGARCVASPSGRSSLGIAISEAVELAVQRETKYALGSVLNHVLLHQTIIGQEAMLQMQDPDVVVGCA
LAFAGSLFGLDVLVFQVRVSYDQKPYRRALMETYGARCIASPSGRASLGIAISEAVEVAAKNPIKYALGSVLNHVMLHQTIIGEEAIKQFEDPDVVIGCA
LAFASSLFGLDCEVWQVANSYHTKPYRRLMMQTWGAKVHPSPSGRRSLGIAISEAVEVAARNETKYCLGSVLNHVLLHQTIIGEECIQQMEEPDLIIGCT
IAFAGQMFGLPVRVFMVKVSYEQKPFRRSMMQTWGAEVFASPTGRASLGLAISEAVEEAAADPTCYTLGSVLNHVLLHQSVIGLEAKKQFDLPDVIFGPC
IAFAGQMFGLEVEVYMVKVSYHQKPFRRSMMETWGAQVFASPSGRKSLGIAISEAVELAMQDPTNYSLGSVLNHVCLHQTIIGLEAKKQFADPDVVIGCC
LALACSMFGLECTVYMVKVSCTQKPYRKSMMQLWGANVIPSPSGRASLGIAISEAVEDAVSRPTNYALGSVLNHVCLHQTVIGQEAKEQLADPDVVIACC
LSLACNHFGLECTVYMVKVSYEQKPYRRSFMKTFGAQVYASPTGRASLGIAISEAVEDAATHDTNYALGSVLNHVCLHQTIIGLEAKKQLEEPDVVFACC
LSLACNYFDLECKVYMVRSSYYQKPYRKSLMTLWGGNVVPSPSGRKSLGIAISEAVEDAIAHDTKYTLGSVLNHVVLHQTVIGAECKKQLEEPDVVIGCC
LAMSCKLIGIECKVFMVRISFDQKPFRKIMMKTWGAECIASPSGRRSLGIAISEAIEQAVERETRYALGSVLNHVMLHQSIIGLEAQKQFELPDVVIGCA
LAMSCKLIGIECKVFMVRISFDQKPFRKIMMKTWGAECIASPSGRASLGIAISEAIEQAVERDTRYALGSVLNHVMLHQTIIGLEAKKQFDLPDVVIGCA
LAMSCKLVGIECKVFMVRISFDQKPFRKIMMKTWGADCIPSPSGRKSLAIAISEAIEQAVERETRYALGSVLNHVMMHQTIIGLEAKKQLERPDVVIGCA
LAMSCKLVGIECKVFMVRISFDQKPFRKIMMKTWGADCIPSPSGRKSLAIAISEAIEQAVERETRYALGSVLNHVMMHQTIIGLEAKKQLELPDVVIGCA
LAMSCKLIGIECKVFMVRISFDHKPFRKIMMKTWGADCIPSPSGRKSLGIAISEAIELAVERETRYALGSVLNHVMLHQSIIGLEAKKQFEAPDIVIGCA
LSMACRMFGLQCRVFMVRISYDQKPYRRMMMSTWGAECIPSPSGRRSLGIAISEAIEDAVGDEARYSLGSVLNHVLLHQTIIGLEAHRQFEEPDVVMGCA
LSMACAYYQIDLTVYMVKISSEQKPYRKAVIETYGGKVIPSPSGKKSLGCAISEAIEVALKSECRYVLGSVLDHVVLHQTVIGEETKIACEIPDIMIGCV
VALAASMYNMKSTIFMVKVSYEQKPMRRSIMQLYGANVYASPTGRKSLGIAMSEAIEYALKNEFRYLVGSVLDVVLLHQSVIGQETITQLDEADILIGCV
TALAASLYGLPSQIFMVKISFEQKPLRKTVMNLYNGNVVASPSGRKTLGIGISEAVEYALDNDYRYMVASVMNVSLTHQSVIGQETKKQLEEADVLIGCV
VALASALFRMKAHIFMVRTSYYAKPYRKYMMQMYGAEVHPSPSGRQSLGIAISDAVEYAHKNGGKYVVGSVVNSDIMFKTIAGMEAKKQMEEPDYIIGVV
VALASALFGVDSHIFMVRTSFYAKPYRKYMMYMYGAHPHPSPSGKESLGLAISEAIHYALDNGGKYIAGSVINSDILFKTIAGMEAKKQMEEPDYVVGVV
GGGSNAMGIFSGFISDKVGKGNKIGEHAASITYGSEGIMHGFNSIMLKDEEGNPSKVHSIASGLDYPSVGPEIAYLNSIGRTKTVCITDQEAINGFFELS
GGGSNAMGIFSGFISDKVGKGNKIGEHAASITYGSEGIMHGFNSIMLKDEEGNPSKVHSIASGLDYPSVGPEIAYLNSIGRTKTVCITDQEAINGFFELS
GGGCNARGIFTAFLEDPVGRGLETSDHAATMTLGVKGSIHGMNCYNLQDETGEPLPVYSIASGLDYPGVGPQHCLLKDIGRTKYVAVTDQECLDAFMQLS
GGGSNAIGMFADFINDTVGHGIETGEHGAPLKHGRVGIYFGMKAPMMQTADGQIEESYSISAGLDFPSVGPQHAYLNSIGRADYVSITDDEALEAFKTLC
GGGSNAIGMFADFINETVGHGIETGEHGAPLKHGRVGIYFGMKAPMMQTEDGQIEESYSISAGLDFPSVGPQHAYLNSTGRADYVSITDDEALEAFKTLC
GGGSNAMGLFHEFVNDTVGFGLDSGKHAATLTKGDVGVLHGAMSYLLQDDDGQIIEPHSISAGLDYPGVGPEHSFFKDMGRAEYYSITDEEALEAFKRVS
GGGSNAMGLFHEFVDDTVGFGLDSGKHAATLTKGDVGVLHGAMSYLLQDDDGQIIEPHSISAGLDYPGVGPEHSFLKDVGRAEYFSVTDEEALEAFKRVS
GGGSNAMGLFHEFVDDQVGFGVDSVKHAATLTKGDVGVLHGAMSYLLQDDDGQVIEPHSISAGLDYPGVGPEHSFVKDMGRAEYDSATDEEALAGFKRVS
GGGSNAMGLFHEFVDDKVGFGLDSGKHAATLTKGEVGVLHGAMSYLLQDDDGQVIEPHSISAGLDYPGVGPEHSFLKDMKRAEYYSITDEEALEAFRRLS
GGGSNAMGLFHEFVDDKVGFGLDSGKHAATLTKGEVGVLHGAMSYLLQDDDGQIIEPHSISAGLDYPGVGPEHSFLKDIGRAEYYCCTDEEALEAFKRLS
GGGSNAMGLFHEFVDDQIGYGVDTDKHAATLTKGEVGVLHGSLSYVLQDDDGQVIEPHSISAGLDYPGVGPEHSFLKDIGRAEYDSVTDQEALDAFKRVS
GGGSNAMGLFHEFVEDQVGHGVDTDKHAATLTKGEVGVLHGSMSYLLQDDDGQVIEPHSISAGLDYPGVGPEHSFLKDIGRAEYDSVTDQEALDAFKRVS
GGGSNAMGLFYEFVNESIGEGVNTEKHAATLTKGRVGVLHGAMSYLLQDEDGQVIEAHSISAGLDYPGVGPEHSYLKDVGRAEYYSVTDEEALAAFQRLS
GGGSNAMGLFYEFVNESIGEGVNTEKHAATLTKGRVGVLHGAMSYLLQDEDGQVIEAHSISAGLDYPGVGPEHSYLKDVGRAEYYSVTDEQALAAFQRLS
GGGSNAMGLFYEFVKDTIGESIASGKHAATLTKGRPGVLHGAMSYLLQDEEGQVIEAHSISAGLDYPGVGPEHSYLKDSGRAEYYSVTDAEAIAAFQRLS
GGGSNAMGLFHSFIEDLVGDGVNTKRHAATITQGSVGVLHGAMSLLLQDSDGQVQEAHSISAGLDYPGVGPEHSYLNEIGRAEYVAVTDKEALNALELVS
GGGSNAIGIFNEFINDTVGEGVNTTKHAATLTMGTPGVLHGSYSYLLQDDDGQIIDPHSISAGLDYPGIGPEHSFLKDVKRAEYYAVTDAEALEGFQLLS
GGGSNAIGLFRRFVDEPVGRGLDTHEHAATLTKGSVGVLHGSMSYLLQDDDGQIVEAHSISAGLDYPGVGPEHAFMRDTGRAEYLSVTDEEALDAFQLVS
GGGSNAIGLFHDFLDERVGEGIETGKHAATLTAGRAGVLHGAMSYVLQDEQGQVQEAHSLSAGLDYPGVGPEHSYLKDIGRAEYYSVTDSEALAALSLVC
GGGSNAIGMFHPFIDDEVGSGLETEKHAATMSKGEVGVLHGSMMYLLQDEHGQVTEAHSISAGLDYPGVGPEHSLLKDIGRVQYEAVTDQQALDALQLLC
GGGSNAMGLFYPFVDDAVGHGLETEFHAATISKGEIGILHGAMMDVLQDENGQILEAFSISAGLDYPGIGPEHSFFRDLGRAAYHSVTDDEAVEAFQLLC
GGGSNAVGMFYPFSKDTVGDGVDTDRHSATLSGGSKGVLHGVRTYVLQDEHGQISETHSISAGLDYPGVGPELSNWKDSDRAQFVAATDAQALAGFRALA
GGGSNAVGMFYPFSNDPVGDGVDTPRHSATLTAGSKGVLHGVRTYILQNQYGQIEDTHSISAGLDYPGVGPELSNWKDTERAKFVAATDAQAFEGFRLMS
GGGSNSTGMFSPFEHDTVGDGVDTKFHSATLTAGRPGVFHGVKTYVLQDSDGQVHDTHSVSAGLDYPGVGPELAYWKSTGRAQFIAATDAQALLGFKLLS
GGGSNSTGMFSPFEHDTVGDGIDTDYHSATLTAGRPGVFHGVKTYVLQDQDGQVHDTHSVSAGLDYPGVGPELAFWKATGRAEFIAATDAQALEGFKLIS
GGGSNAIGIFHPFVQDKVGDGIDTDRHSATLSRGTPGVLHGVRTYLLQDKFGQITETHSISAGLDYPGVGPEHSFLKDSGRAEYIAATDEEALRGFKLCT
GGGSNAIGAFHPFVNDEVGYGIDKDEHCATLTKGTPGVLQGAMTYVIQQKSGQTLNTHSISAGLDYPGVGPEHAFLKDSGRAVYEAVTDDEALEGFKLMC
GGGSNAIGAFHPFVEDKVGHGIDKDQHCATLTKGTPGVLQGAFTYVIQEKSGQTLNTHSVSAGLDYPGVGPEHAFLKDSGRAKYTAVTDDEALEGFKMMC
GGGSNAIGMFHPFVKDNVGDGVDTPRHSSTLLGGRPGVLHGTKTYIMQDAAGQVMETHSVSAGLDYAGVGPEHAFLKDSGRAKYVSVTDKEALEAFQLIS
GGGSNAIGMFHPFVKDNVGDGVDTPRHSSTLLGGRPGVLHGTKTYIMQDDAGQVLETHSVSAGLDYAGVGPEHAFLKDSGRAKYVSVTDKEALEAFQLIS
GGGSNAMGIFYPFVKDKVGKGIESGKHSASLNAGEIGVFHGMLSYFLQDEEGQIRTTHSIAPGLDYPGVGPEHAYLKESGRAEYVTVTDEEALRAFHELS
GGGSNAMGIFYPFVNDKVGKGLESGKHSASLNAGQVGVFHGMLSYFLQDEEGQIKPTHSIAPGLDYPGVGPEHAYLKKIQRAEYVTVTDEEALKAFHELS
GGGSNAMGIFYPFVNDRVGKGLETGLHAASLNAGELGVFHGMLSYFLQNEEGQITPTHSVSAGLDYPGVGPEHAYLKDSGRAEYVTVTDEEALRAFHELS
GGGSNAMGIFHPFLNDDVGEGIESGRHSASLTAGSKGVLHGMLSYFLQDEEGMMLDTHSVSAGLDYPGVGPEHAYLKETGRCEYVTVNDEEALRAFKTLS
GGGSNFGGFTYPMIGARRTRFIAAETAAPKLTRGEYRYDGLLPLAKMYTLGHRYTPPPSHAAGLRYHGVSPSLSILRRLGLVEAEAIPQEEALASILLMA
GGGSNFAGFTYPMIGMKRTRFVAVEEAAPKLTRGEYKYDFPLPMLKMYTLGHDYVPPAIHAAGLRYHGAAPSLSLLRKLGIVEAVAYPQEEVMRAALLFA
GGGSNFAGLAYPFVKDVDYEFIAVEKAAPTMTRGVYTYDYGTPKLKMHTLGHRYYVPPIHAGGLRYHGLAPTLSVLINHGIVKPIAYHQTEVFEAAVLFA
GGGSNFAGLAYPFVKEVDYEFIAVEKAAPSMTRGVYTYDFGTPKLKMHTLGHRYHVPPIHAGGLRYHGVAPTLSVLVNNGIVKPIAYHQTEVFEAAALFA
GGGSNFAGLSYPFIKDVDYEFIAVERAVPTMTKGVYTYDYGTPKIKMYTLGHTYYVPPIHAGGLRYHGLAPTLSVLMNHGIVKPMAYHQTEVFEAAVLFA
GGGSNFAGLAYPFVRDVKYEFIAVEKAAPSMTRGVYKYDYGTPKMKMHTLGHTYYVPPIHAGGLRYHGLAPTLSVLINHGIVKPVAYHQNEVFQAAHLFA
GGGSNFAGFTYPFIRHRKTRFIAVEKASPSMTRGVYTYDYGTPLLKMHTLGHTYQVPPIHAGGLRYHGVAPTLSVLLKHGIVEARAYHQREVFRAAHMFA
GGGSSFSGLFWPFYYEKSVKFIAVEAAVPTLTRGKYTYDLGTPLIKMYTVGHGYKPPPIHAGGLRYHGCAPALSLLVAEGEVGAVAYKQTEVFEAARLFA
GGGSNFAGLTYPFVNDANLEIIAVEAACPTLTAGEYKYDFGTPLLKMYTLGHDFIPPPIHAGGLRYHGDAPTLCMLVKHGVIKARAVKQLPTFEAGLLFA
GGGSNFGGFTFPFVREKKIEIIAVEKACPTLTEGEYKYDFGTPKMPMYTLGYDFIPPGIHAGGLRYHGDSPLVSLLYKNKIISAKAYPQLKVFEAGVTFA
GGGSNFAGLAFPYIRENKTRVIGVEASCPTLTKGLYAYDFGTPLVKMHTLGATFMPPGSHAGGLRYHGMSPMVSHVKELGLMEARAYHQTECFAAAVQFA
GGGSNFAGLAFPYIREKKIRIVGVEHSCPTLTKGRYAYDFGTPLMKMHTLGATFMPPGSHSGGLRYHGMSPMVSHVKELGLMEARSYHQTECFAAALQFA
GGGSNFAGVAFPFMGHARSRIVAVESACPTLTRGKYAYDYGTPLTKMHTLGSGFTPPGFHAGGLRYHGMAPMVSHAKELGLFDAVSYTQRECFEAGVLFA
GGGSNLAGIAFPFIGHGRPRIVAVEAACPTLTRGKYAYDFGTPLTKMYTLGSQFTPPGFHAGGLRYHGMAPLVSHCKALGLLEAVAYDQVACFEAGVLFA
GGGSNFAGLAFPFLGLQRRRIIAVEAACPTLTRGTYAYDFGTPLVKMHTLGSTFMPPGFHAGGLRYHGMSGMVSHAYELGLIEARAYKQVGCFEAGVQFA
GGGSNFAGLSFPFIREKKPVIRAVESACPSLTKGVYAYDFGTPLMKMHTLGHDFIPDPIHAGGLRYHGMAPLISHVYEQGFMEAISIPQIECFQGAIQFA
GGGSSFGGIAFPFLADKALRCVAVETSCPTLTKGHYAYDYGTPIMKMYTLGHDFMPPGIHAGGLRYHGDSPLVSQLLHEGQVEALAVPQVATFEAGVQFA
GGGSNFAGTAFPFLADKALRLLAMESSCPTLTRGHFAYDYGTPMMMQYTLGHDFTPPGIHAGGLRYHGDSSQLSQLVHDKIIEARSVNQLDTFRAGVTFA
GGGSNFAGTAFPFLADRAVRCLAVEASCPTLTKGVYAFDYGAPIAMMYTLGHDFMPPGIHAGGLRYHGESALVSQLHHAGLIEAKSYRQNACFEAAHLFA
GGGSNFAGIAFPFLMDKKVRAVAVETACPTLTKGVYAYDYSGPLAKMYTVGHDFVPAGIHAGGLRYHGVSPIVSQLYEDKLIEAKAYGQSSVFEAAVIFA
GGGSNLGGIGLEFIKDREARVVAVESACPSLTKGEYRYDFGTPLLKMYTLGHKHIPPAIHAGGLRYHGDSPIISKLCAEGLMEAVSYGQKEVFDAAVQFA
GGGSNFAGISFPFICDKHIQIIATEEACPTLTKGPYIYDSGTPLLAMHSLGHGFIPPAIHAGGLRYHGMAPLVSHVKQLGLIEATALPQTECYEAALLFA
GGGSNFAGISFPFICDKHVQVIATEEACPTLTRGPYVYDTGTPLLPMHSLGHRFIPPAIHAGGLRYHGMAPLVSHVKQLGLIEATALPQSECYEAALLFA
GGGSNFAGISFPFICDKHVRIIATEEACPTLTRGPYVYDAGTPLLPMHSLGHGFIPPAIHAGGLRYHGMAPLVSHVRQLGLIEANSLPQTECYEAALLFA
GGGSNFAGISFPFLCDKHVQVIATEEACPTLTRGPYVYDSGTPLLPMHSLGHGFIPPAIHAGGLRYHGMAPLVSHVRQLGLIEATALPQTECYQAALLFA
GGGSNFAGISFPFLYDKHIRVIATEEACPTLTRGPYVYDAGTPLLAMHSLGHGFIPPAIHAGGLRYHGMAPLVSHVLHNGLIEATALPQTECYEAALLFA
GGGSNFAGLSLPFVRDKHVSIIAAEASCPTLTRGPFAYDFGTPLLPMYTLGHGFIPASIHAGGLRYHGMAPIVSHLVKEGIVQAQAYDQIETFTAGLKWA
GGGSNYAGLIAPFFGDKQITFIGVEASCPSLTRGKYAYDFGTPLLKMYTLGSGFIPSPNHSGGLRYHGMSGIVSKLYHDGLMEARAVEQSKIFDAATLFA
GGGSNFGGFTYPFIGNKGKRYIAVSAEIPKFSKGEYKYDFPLPLVKMITLGKDYVPPPIYAGGLRYHGVAPTLSLLTKEGIVEWREYNEREIFEAAKIFI
GGGSNFGGFTFPFIPDGDTEIIATTNEVPKFSKGDYKYDLMLPQVRMYSLGADFVPPTIYAGGLRYHGASPSLSLLINKGRIKSDEVTEEEVKEALRTFA
GGGSNYAALAYPFLGDERRKYIASGSEVPKMTKGVYKYDYPLPMLKMYTIGSDFVPPPVYAGGLRYHGVAPTLSLLISKGIVQARDYSQEESFKWAKLFS
GGGSNYAALAFPFLADEQRTYIASGKEVPKMTEGEYRYDYPLPLLKMYTIGYDFIPPAVYAGGLRYHAVAPTLSLLMNKGIVQARDYDQEEAFKWARIFS
RTEGIIPAIESSHAIGYVLKIAKEMK-KILINLSGRGDKDLDFVVQN
RTEGIIPAIESSHAIGYVLKIAKEMK-KILINLSGRGDKDLDCVYKI
RVEGIIPALESAHAVAYATKLAVEIGPTILVNLSGRGDKDADFVANR
RHEGIIPALESSHALAHALKMMREQPELLVVNLSGRGDKDIFTVHDI
LHEGIIPALESSHALAHALKMMRENPDLLVVNLSGRGDKDIFTVHDI
RLEGIIPALETSHALAYLEKLCPTLSDRVVLNFSGRGDKDVQTVAKY
RLEGIIPALETSHALAHLEKLCPTLPDRVVLNFSGRGDKDVQTAIKY
RLEGIIPALETSHALAYLEKLCPTLFDRVVFNFSGRGDKDVDTVAKS
RLEGIIPALETSHALAYLEKLCPTLADRVVLNCSGRGDKDVHTAIKH
RLEGIIPALETSHALAFLEKLCPTLPNKVVLNCSGRGDKDVHTAIKH
RLEGIIPALETSHALAYLEKLCPTLPDRVVVNCSGRGDKDVHTASKY
RLEGIIPALETSHALAYLEKLCPTLPDRVVVNCSGRGDKDVHTASKY
RLEGIIPALETAHAIAYLETLCPQLDGRIVINCSGRGDKDVQTVAKF
RLEGIIPALETAHAIAYLETLCPQLDGRIIINCSGRGDKDVQTVAKF
ELEGIIPALETSHAIAYLETLCPQLEGRIIINCSGRGDKDVQTVAKY
KLEGIIPALETAHAFAWLDTLCPSLAPEIVINCSGRGDKDVNTVAKK
KLEGIIPALETSHAIAYLEKLIPTLKSRVVINCSGRGDKDVNNAMKY
GLEGILPALETSHAFAALKRINDTIPAIVVVNCSGRGDKDVNTVITA
STEGIIPALETAHAFAYLGILASQLHSIVVLNCSGRGDKDMGTVARA
QKEGIIPALESAHAVAHAKELARGMQPVVVICLSGRGDKDVMTVRNA
RTEGIIPALESSHAISYAVKLASKMRPSMVVCLSGRGDKDVNQLKER
QYEGIIPALESSHAIHGAMELAKTMKKNIVLNLSGRGDKDVQSVADE
QLEGIIPALESSHGIWGALELAKTMKPDVVICLSGRGDKDVQSVADE
QLEGIIPALESSHAVYGACELAKTMKPHLVINISGRGDKDVQSVAEV
QLEGIIPALESSHAVYGACERAKTMKPHLIINISGRGDKDVQSVAEV
QLEGIIPALETSHALWSAFQIAKTMKPDIVVSLSGRGDKDVEQIANA
EYEGIIPALETSHAIYYAVKLAKKLGPDIVINMSGRGDKDMPQIAKI
QYEGIIPALETSHAIYYAIQLAKTLGKDIVINMSGRGDKDMPQVAKI
LQEGIIPALESSHAVYYGVKLASTLSAVVVINISGRGDKDMLQVAKE
LHEGIIPALESSHAVHYGVKLASTLAPVVVINISGRGDKDMLQVANV
RTEGIIPALESAHAVAYAIKLAREMSRVIIVNLSGRGDKDLDIVLKV
RTEGIIPALESAHAVAYAMKLAKEMSRIIIVNLSGRGDKDLDIVLKV
RTEGILPALESAHAVAYAMKIAPEMDKIIIVNLSGRGDKDLDIVRRV
KLEGIIPALESAHAIAYAMKMAEEMQRVLVVNLSGRGDKDMDIVRRR
RSEGVVPAPESSHAVALAARIARKLPDVVAFNLSGHGLLDLDALQKA
RTEGIVPAPESAHAIKAAVDLAKKLPRVIAFNLSGHGLLDSDAYEKF
KAEGIVPAPESAHAVKAVIDKALEARRVILFNLSGHGLLDLKGYEDY
KLEGIVPAPESAHAIKATIDKAIEAKRVILFNLSGHGLLDLHGYEEY
KAEGIVPAPESAHAVRAVIDKALEAKRVILFNLSGHGLLDLKGYEDY
KTEGIVPAPESAHAIKGAIDRALEAKRVILFNLSGHGFLDLKGYEDY
KAEGIVPAPESAHAVKAAIDEAIKARDVIAFNLSGHGLLDLQGYREY
ATEGVVPAPESAHAVKAAVDLALQAKRTILFNMSGHGLLDLIAYDEY
RTEGIIPAPETNHAVRAAIDEAIKAREVIVFGFSGHGLLDLQAYDDY
RTEGIIPAPESSHAIAAAIDEAIKCRETIVFNLSGHGYFDLSAYEAY
RAEGIVPAPESSHAVKGAIDEALRCKATILFNLSGHGHFDMQAYTDY
RNEGIVPAPESSHAVKAAIDEALLAKATILFNLSGHGHFDMQAYTDY
RNEGIVPAPEANHAVKGAIDEALRCKRVILFNLCGHGHFDMAAYTSY
RNEGIVPAPESNHAIKGAIEEAMRCKRTILFCLSGHGHFDMTSYQKY
RTEGIVPAPESTHAVRCAIDEALRCKETILFNLSGHGHFDMQAYINY
RTEGIIPAPEPTHAIAATIREALRCKEVILMAMCGHGHFDLTSYDKY
RAEGIIPAPESCHAIRAAIDEALKCKVTILFSLTGHGHFDMASYDKF
QAEGIIPAPESNHAIRGAIEEALRCKETIFFTLSGHGHFDMTSYDRF
RSEGIVPAPESSHAVRAAIDEAVLAKETILFCLSGHGQLDMGAYDAY
RTEGIVPAPESSHAIRAAIDEALLCKEVILFNLSGHGYFDMAAYDNY
RTEGIVPAPESSHAIRCAIDEALEAKQVILFNLSGHGHFDMASYDKY
HTEGFIPAPETSHAIAQTIREAKKAKEVILMNWSGHGLMDLQGYDAF
HTEGFIPAPETSHAIAQTIREAKKAKEVILMNWSGHGLMDLQGYDAF
HTEGFIPAPETSHAIAETIREAKKAKEVILMNWSGHGLMDLQGYDAF
HTEGFIPAPETSHAIAQTIREAKKAKEVILMNWSGHGLMDLQGYDAY
HTEGFIPAPETSHAIAQTIREARHAKEVILMNWSGHGLMDLQGYDAY
QSEGFIPAPETNHVIAAVVREAELARQVILFNWSGHGIIDLPAYDAF
RSEGILPAPESSHALRVAIDKP------
ENQGIVPAPESAHAIRAVVDEAIEARKVIVFNLSGHGLLDLSNYESM
NTQGIIAAPESGHAIASAI-KYVKAHITIVVNVSGHGYLDLSIFGEK
ELEGYIPAPETSHALPILAEIAEEAKKTVLVSFSGHGLLDLGNYASV
EKEGYIPAPETSHALPILKEIADSNRGTVLVSFSGHGLLDLGNYAEA