Supplementary Figure 1. Multiple sequence alignment of mucin domains 4 through 16 of the extremely large
Cryptosporidium mucin, cgd3_720.
Dom16 PFFPVGIIPAPISGNYHNTPSGYYYNKTSNLFVKYP--NNTSNELHPYQFQELSPGTKECDLYILSICNSTIVNLKPTGSLF-QSGWTLFASINTGIPESEQISSSSTNTGTYN YGF
Dom15 PHFPPGIQAAPLNGSLINTPSGYYYNTATGLFEKYP--NNTNIGINPFQPKELIEGLSECSLHMFDICNSTSISLKPKDTFF-KQGWTLFVMLNNGIPTS------VSEKNDFN YKY
Dom13 SFFPVGINPAPIFGDYHNTPPGYFYNTTSGLFEIEDPTIIKQDQINPYQPREFIKGISECDLELLSVCNS-SSIIIKDNLIF-KQDWTLFVMVSTGIPEYE---DNSNNIDLYN YKY
Dom12 AFYPPGIKPAPIIGNYHTTPTNYHYNTTTGQFEKNE-DHPNPDEINPFQPNHIVDGISECDLYLFSTCNATSAKIMPEDTIF-KQDWTLFVIISTGIPEYD---NNSNNIDLYN YKY
Dom14 PFLPVGIHSAPYFGTSENTPPGYHYNSTTSQFEKNE-DHPNPDEINPFQPNHIVDGISECDLFMFDTCNATSAQIMPEDTTF-KQGWTLFILISTGIPIRDP--NNVNNNDIHH5IRF
Dom10 PFFPPGIKPAPIDGDENNTPPNYHYNTTTGQFEKNE-DHPNLDEINPFQPTHIVDGISECDLYLFSTCNATSAKIMPEDTTF-NEDWTLFVILSTGVPEYG----DNSNNILFN YKY
Dom8 PFFPPGVQPAPIDGDENNTPPNYHYNTTTGQFEKNE-DHPNPDEINPFQPNHIVDGISECDLYLFSTCNGTSAKIMPEDATF-NEGWTLFVILSTGVPEYG----DNSNNIAFN YKY
Dom7 PFFPPGIKPAPIDGDENNTPPGYHYNTTTGQFEKDE-DNTNLDEINPFQPNHIVDGISECELYLFSTCNGTSAKIMPEDATF-NEGWTLFVILSTGVPEYG----DNSNNILFN YKY
Dom6 EEFPLGLKPAPIDGDESNTPPGYHYNTTTGQFEKNE-DHSDPDEINPFQPTHIVDGISECNLYLFSTCNGTSAKIMPEDATF-NEGWTLFVILSTGVPEYG----DNSNNIAFN YKY
Dom9 PFFPPGIKPAPIDGDESNTPPGYHYNTTTGQFEKDE-DNTNLDEINPFQPNHIVDGISECDLYLFSTCNATSAKIMPEDVTF-KEGWTLFVILSTGVPDY-----DETKYNGFN YKY
Dom5 PFYPPGVHPAPIVGDQLNTPPNYHYNTTTGQFEKNE-DHSDPDEINPFQPNHIVNGISECNLDLFTTCNASSIKIMPEDITF-KEFWTLFVISSTGIPNKG----SESD--PSN YKY
Dom11 PFFPPGVQPAPIDGDENNTPPNYHYNTTTGQFEKNE-DHPNPDEINPFQPNHIVDGISECDLYLFSTCNATSTKFMPDGTIF-KQGWTLFVMINTGVPDYD----PNNVNNNYN YRY
Dom4 PFLPVGVPVTDDQG--LVVPPGFYFNKTTSQYEPLG-SAEDSNSLRPPVPFQLVQGIEECELEIYSFCNSTKVTMESLMTNLDKKDWTLFAMVSTGIPTYT-----PDSSPNFN FQF
Consensus/100% ..hP.Gl.sss..G...ssPssaaaNpsssba...... p...lpP....ph..G.pECpL.hhs.CNu....hb.....h.pp.WTLFh..ssGlP...... s.....p...h.a
Dom16 QFINTFTS---- -KIVLSINLYDDFTEIKNEITGEKAVSVSPQCGPLCSTYKGGYFTFWMSYDSLNEEISISINRNRQILLKLRNAQAVSS----FDRIQPCCSNSVM------
Dom15 EFFNSKQGN--- -KVVLSLYYNNETVVFRNEVSGEFSVAGSPQCGPYCSTYPNGFFTFWLSYDYSSNNYIISVSSNQKYLLHLDAHGAK------FDHIKPCCGSEIN------
Dom13 DFINYDKDTGN- EELILSLKLYNDSIIFRNEKTGDASIAGSPQCGPYCSTYPKGFFTFWLTYNNDSKKYLISVNSNKEHLVEVSGYKSF------FNKVKGEIRSNNNHI-----NI
Dom12 DFVNFDKDTGN- EEIILSLLLRNETVTLKNHITNIEKTVGSPQCGPYCSTYPKGYFAFWLTYDEDNDLYEISINSNTEILLKLPASKQK------FNKIISRSSSNTNTNPNPNPNN
Dom14 DFINVEIGGNNQ QTLVLSLELFNDTVILRNEKTGDSSIAGSPQCGPYCSTYPKGYFAFWLTYDPEIHVYTVSVDYNQQKLVSINSANMK------FNLIKPICYNGSQNNN----CN
Dom10 NFVKSDSTSS-- DDLVLSLSFTNNTVTLKNEKTNVEIVVGSPQCGPYCSTYPKGYFAFWLTYDSITNKYIISVENNTKKLVEIEADITSDSV---FNKIIPVIDTNGGAGVD---IS
Dom8 NFIKSYSTGS-- DDLVLSLSFTNNTVTLKNEKTSTESVVGSPQCGPYCSTYPKGYFAFWLTYDSVTSKYIISVENNSKKLVEIEADITSDSV---FNKITPVIDTNGGAGVD---IS
Dom7 NFIKTDSANS-- DDIILSLSFTNNTVTFKNEKTSTESVVGSPQCGPYCSTYPKGYFAFWLTYDSATSKYIISVENNTKKLVEIEADITGNLV---FNKITPVIDTN-GVGVE---VS
Dom6 NFVKSDSTKI-- DNVVLSLSFTNSTVTFKNEKTSTESVVGSPQCGPYCSTYPKGYFAFWLTYDSATSKYIISVENNTKKLVEIEAEITGSDDDPIFNKITPMIDGNSGAGVE---IS
Dom9 EFKKSDNSNGGL EETVLSLEYSNDTVTFKNLKTNEEEVVGSPQCGPYCSTYPKGYFAFWLTYEPIRKMYSIGIENNSKYLMEIKAEDNVV-----FNKIVP------CCGDM---VD
Dom5 EFMKTDTGVE-- -ELVLSLTISNETVILKNEKTNEEEVVGSPQCGPYCSTYPKGYFAFWLTLDSINKKYIVSIENNSKKLVEINSEGAD------FNQVKP---CKSCQGTP---DS
Dom11 EFISSTNSNTGG4DKLILSLEYNNETVTFRNEITGEEDVVGSPQCGPYCSTYPKGYFAFWLTYDSKYNYFIISVNSNNDYLTSISADNTDMI----FNKIIPRGNN------VN
Dom4 KFKDKQDQ------IKFSIDISNSSIVISGSDSSQIAEAVSPQCGPYCSTYPKGYFTFWLQKKFSENKFVVGVNYNNLLLVEFDSDNRD------FTKIDTGGDPGATLSPS-----
Consensus/100% pF.p...... hSl.h.sp...hps..ss....ssSPQCGPhCSTY..GaFsFWhpbp...p.h.lulp.Np..L.ph.s...... Fs.l.s......
Dom16 NSFSLWQLSNVAHIPQ------IEATTTTTTTSPPWQDSVHNECLLVENI- HCKGFNATLKDPPV-FHHENILWMNFKLPT- QNCNSPFNLN- EKFWY
Dom15 NSFSLWQLSNTVIYPL------RISSTTTTTTLPPWFETIYDECQLQTSST5PCRGHSASLVDED--FSYGYLISLNFTLSRV5NNVLSPILPIS33KYIWN
Dom13 NTYSIWQLTNIVDNPI------SISTTTTTTTSPPWDSITFDNCNL-EINI PCKGYAATIKDE---LKMGNIIWISTKVHNG NIGITNFNNE- EKYWI
Dom12 TTFNVWYLTNVKQTPK------SISTTTTTTTTPPWDKVEKEVCDLSEIFM PCQGYNATIDPY---INLGDLISMKFIIEKE CNCINKINHLG NNYWL
Dom14 NVFNIFQLTNKQKIPR------SLSTTTTTTTAIPWFSQVLDECKLNIDI- PCRGIHASIDNNNIPFSQGNILWFNTTIISS KDKIHGLDSS- TLYWF
Dom10 KTFSIWQLTNIKEIPIDTATTTTTTTTTTT18TTTSTTSTTTERPWYEIELDECPLELNK- PCRGINGKLDKP---FNHGNLLWINGTLGK- DINGRLLLD-- DTNLM
Dom8 KTFSIWQLTNIKEIPIDTATTTTT------TTTSTTSTTTEQPWFEKDHDECPLELNK- PCRGKDGNLDSP---FEKGQMVWLNRTVSI- SNPNLDFEN-- GKYWF
Dom7 KTFSIWQLTNIKEIPADTATTTTTTTTTTT 4TTTSTTSTTTEQPWYEIELDECPVKLEK- PCRGLDAVINDPE--FQLGNLLWIQLKMEV- SNPLIEENG-- RKYWM
Dom6 KTFSVWHLFNKKIINNVVATT------TTTSTTSTTTEQPWHEIELEECPLEFKK- PCKGHNAIIKDSQ--FNKGNLIWISMKTVK- STPEIELAP-- NKYWH
Dom9 KSYNIWQLTDSVSIPKSQATTTTTTTTTTS TASTESTTTTKTPWEDVELDDCPLDMSS- PCKGINSTLTIPDK-FEEGNFIWINSSIAD- HGKTININN-- IEMTK
Dom5 EMYSIWKLFNSKIQPTEVA------TTTTTTTKTPWQDQIIENCIVSYDSD PCKGTNVEIDPE---LQEGDILWINTTLQI- SDPKVEING-- KSYWQ
Dom11 NSYNIWHLNSQNRLPVSIATTT----350(S/T)--TTTTTTTNPPWDQIVQDNCELEPNNQ PCKGFNSTIPSI---VNQGNLFWIDSNIAYY5NIRKLYFTENN 6GYYWM
Dom4 KTYVLWNLLDISNPPK------QSATTTTTKTPWDQDIKDECHLELDQ- PCRGNDSILDKP---LNQGDIVWINTLLGK- SDQPIEVDG-- KNYWF
Consensus/100% p.asla.L.s....s...... s.ssTTT..PW.p...p.C.l...... CpG..s.l...... hp...hh.hp...... p......
Dom16 GYFFTNK-DKAVLSILFNETLISLYDWKNQQEFYSPYTDGSLVYQGQNLTIGLGWSRLGFFLLNKDSESLINIKSM-TDFSFDKVSNHGES-LNPSIFLLKS--GFLYPNEVLVHGYREC
Dom15 GFEFKDKNDQPVLSLKFNETTITLYDHKDDLEYISPYTNESTAYSGRFESLSLGWSRLGLFLMNSDFNSLIKIPTR-NNFEFNKITKLVHD-NIPLNFTLKQ--DFMYPNEILFQGYHSC
Dom13 GINLKSSSSESLMNLYFNETFITLIDDLQGRQYISRYTEEILIYEGFELTIGIAWSKFGVFILNEKMNSLITFQTH-QEYPISKILPLHGS-RRQIYHFLFS-GGFLFPGNTLYEGYTTC
Dom12 GFDFN-HNDDKILSIFLNESFILMYDWKNHQEYISPFTNESLSYDHMDLEILIGWSRLGLFLMDNNHNGLIHIQNI-PDNKINKIKQHSNE-QKTLIRFELINKQFLYPNEILYQGYHTC
Dom14 GYQFLFD-SNPIFSLLFNETFVTFTDLISRIEYSSKYTNESLVYPGMEMIFGIGWSRLGIFVVNSRNEALIQINTL-TDITFNQISLYGSE-TIPTNFFLTN--DFVFPKQILYKGYESC
Dom10 GYDIMNDNK-KIMTLLLNETFIGVYDLTKDQEYYSYYTNESLAYNGMDFKIGIGWSKLGLFILNEYSSSLIEIQTH-SDYKFNKVLLRGPTMDKYVNYILLD--NFLYPNGILYLGYDTC
Dom8 GFRFEDENNNKLLSLYFNESFITVVDWRQDLKYITPYTNESLSYSGREENIGVGWSKMGLFILNEDLNSLVELQTD-LDFGFKKIIQE-SNPQVPIKYILEE--GFFYPNNILFIGYETC
Dom7 GYNFH-KDTKEVMKIRLNETHILLQDQLNNKDYSSPLTNSSLGYIERELTIGIGWSRLGLFIINDDHNSYIDIKGR-DDYSFNKIEQILVPDKKASNYILED--SFLYPINIMYLGYETC
Dom6 GFSFYNKQNEKVLSILFNETHLLLEDKESGMDFTSRYTNSSLNYVGETITIGLGWSRLGLFLINENKDSFIEVKNN-LDYTFVKIVHDSTQ-KSSSNYYLEE--NFLYPNIFLFKGYETC
Dom9 IYHFKDNNQDEILILGFNMTHLTLIDNQSSKVYSSYYTNQTTSFPTKSFSIGIGWSRLGLFILNEYGDGLIQLKSD-KKYEFVKIEQAQVL-ETLTIYEWSN--KFIYPREMLFLGYSIC
Dom5 GIQLK-KDDKKVFSILFDELFISINIEDSGNEYFSPYTNQTLGYSGREISIGIGKSKYGYFILNEILGSLITLNANGLDLSFNKASPLSSH-DIVSKFELQK--GFVLPANQLFIGYETC
Dom11 GYNFMKD-NDPVFTLLFNETYLTIKDWKGKQEYYSPYTNEKLAYNNMNFTIGIGLSRLGLYLMNSKLQSLIQIQPQ-YDYSFNHVKQLLYDNQVITNYLLQD--GFLYPNTLLYEGYEKC
Dom4 EYKFQ-KESDSVISLLFNETSVAIYFYDRAETLSSKYTNQHLLYEGMDITLGITSNRFGFFLVNKDLNALIHDPSL-KILDYNKILQESVP-NHISNFLLEE--GFTYPKNILFRGYETC
Consensus/100% .h.h..p.p..lh.l.hsb..l.h...... h.s.hTp....a..b.b.h.ls.s+hGhalhsp...uhlp..s...... h.ph...... b.....F.hP...hh.GYp.C
Dom16 SFFENCV-SNSLSCNSQVLSQICSNPTAGMSWEIETEIMNTQVNNGTWGKRFESQDIMNVFALSPN12GNLEDPGFIYILKDRLVLQNHLG-SVCSGPLPGAKEVNF-GDKIKLSIGI
Dom15 SLFNDCK-SKSTSCESQVVSETCNRNMNKTQWIIETNITDTQINNGTWGPNFELTELLSAFNIN-- NGLEDVLSIYIFNQRIAIQNHIGHSICDGIYPESRSLNI-GDTIVWSIGV
Dom13 SLYKDCD-STSTGCSAQTLTNPCSKSQPGISWVIESEILDTKVNNGTWGSDLELNNLLNVYLVN-- NGIEDTLAIYFFNNRLALQDLKENIVCNGPYPKDISQNF-GSVLNWILSV
Dom12 SFYNDCS-TEIISCSSQALIDTCNKPHPGMNWIIDTDISETNVNNGTWGSKLEFPTLINLFKVN-- NLTDDIYTIYIFNNRISIQDEHNKISCDGYYPNFKKLNINGDHLTWSLGI
Dom14 SFFEDCP-MN-ATCKSQSLTGICNRKDPGSYYQIETVIKETFLTNSTWGHRLELNSLLAEFVINGG DFEEDLLAIYIFENRIAMQDLKHNVVCSGPYPKDSIIDY-GDTLKWSIGF
Dom10 TFNGECN-VKSLECSSQVLEGLCLKKLPGITWKMETILAQTNVNNGTWGFDYSINGLINLYTLN-- NGIEDVLVIYFFNNRVALQDLINSNSCSGPYPENSQVNV-GDLIKWSIGI
Dom8 SLRNECR-LDSTSCNSQVLKDTCDKRIDEMSWTFETSFSETNVNNGTWSQDLVIPDLLGVYLIN-- NGHIDISIIYLFKNRLVLKGLILDEVCSGPYPGNQIKQI-GDKITWSLAI
Dom7 SLFSDCS-QNIVHCGSQVLKEICTKRIPNMEWTFEGLNAETNVNNGTWGEELKMDELMNLYIVNSG EEEKNIFGIYIFKNRMALQDFENFQSCSGIYPKGKNLSF-GDKMKWSLGL
Dom6 GLYEECS-TSPVNCYSQVLTEICPKKVPGMYWTFENIIHETNVNNGTWGSEFILDGLLNVYLIN-- DGDSDIATIYFFNNRMVLLDSKNKVACSGPYPNNHIISV-SDKIKWSIGF
Dom9 SAYNDCTDDNSLSCSAQAFGDLCNRRKTNISWDIEFKVTETYVNNGTWGNEFELPGLVNIYFVS-- NDHQNILSIYFFQSYLALRDLENQNICSGPYPESMNISQ-DDKINWSLNI
Dom5 SLNDDCS-LETTSCNSQVVKGLCPSPSNTRIWTIEVEISKTNINNGTWGESLVMDELMNIYFLS-- SSSEILYNVLFFENRVVIEDADTQVTCSGAYPGGVSISY-GEKFIWSFGV
Dom11 SLYNNCL-SNKLECNSQVLTGICSTLSPGLNLEFETEIKSTNLNNGTWGEIFKQSDLLNTYIIS-- SNNEDQYIIYIYSNRIAIQDLNNYISCSGPYPNGNLIHI-GSKLIWKIGF
Dom4 TLYGDCK-TVSKSCESQVLIEMCNPIQAGTEISIQTSISQTNINNGTWGAPLIQSDLINIFHIG-- NEDKDVYTVYIYTNRMVLEDMVNFISCGGPYPGGKTLSS-SSNLEWSIAL
Consensus/100% sh..pC...... C.uQsh...C....s...h.hp....pT.lsNuTWu..h....lhs.a.ls...... lhhhppbhshbs...... CsG.hP.....p..sp.h.h.hsh
Dom16 DSMS-MLYLNVFNE---DDNEYNTVCTIGKIE- NGWRFKYIFPIGHSPSVSSFVQIKLG-FPDGGF
Dom15 DDNK-LLYLNVMNE---NKSKFYSVCTLKYNE- DFNYFKFIYPRGYSPSRSIFTQKIGD-FPNGGF
Dom13 DENQ-MLYLNIKQK---DQS-IFSVCVIPFGD- HLKPIRYIYPRGYAPSTSNFTQILNS-IPKGGF
Dom12 DHSY-LLYLNIHDI---NNGKNYTVCVLPMDI- SSGALSIIYPKGYSPLKSNFTQQIFG-FPNGGY
Dom14 DISN-QMYLNVHNFK--NIDGNFTICTLSTGI- QNSDFLYLYPQGYAPKPSTFKEFKNG-FPKGGF
Dom10 DSLM-NLYLNILDDN--VTGKNYTVCYIKNLNQ FGGGFKYLSPLGYRPNYSRFIQEISPKFQDGGY
Dom8 DSND-LLYLNIIDDSDPKNKEYYTICPISYDN- RFGRFKYIYPLGHAPSKVTFTQDLKG-FPQGGY
Dom7 DSDN-LVYLNYIDID--DNTK-YTVCVLKHNP- GFTRVKYIHPLGYSPSYMIMTQTKNG-LDEGGY
Dom6 GKNY-LLYLNVYDLN--NNNKEYSVCTIQYGS- GFFNIEYVYPLGYAPSMSKFNQYKDG-FPNGGY
Dom9 GHSG-EIYLNIESWKGSNKGTKFTICMIGGGSN 4YIQDVKYIHPLGYSPSFNKYTQINSLLIPNGGY
Dom5 NENSRMIYFNSAPKD--NPNKLNTICSMPYMG- KRSQITTVYPLGYAPSKALFTQKKDG-LPTGGF
Dom11 DSYY-SIYLNVYDQ---INHEYYSICQLKPVIH34GIDSIDYIYTRGYNPSNTKFIQYINN-FPRGGY
Dom4 DSKK-FLYLNVLDE---ENTEMKTICSMSLIG- GIHSLDYISPSGFKPNNSVFTQKINS-FKQGGY
Consensus/100% s.....hYhN...... slC.h...... h..l.s.Ga.P....h.p.....h.pGGa