Supplementary Figure 1. Multiple sequence alignment of mucin domains 4 thru 16 of the Cryptosporidium extremely

large mucin, cgd3_720.

Dom16 PFFPVGIIPAPISGNYHNTPSGYYYNKTSNLFVKYP--NNTSNELHPYQFQELSPGTKECDLYILSICNSTIVNLKPTGSLF-QSGWTLFASINTGIPESEQISSSSTNTGTYN YGF

Dom15 PHFPPGIQAAPLNGSLINTPSGYYYNTATGLFEKYP--NNTNIGINPFQPKELIEGLSECSLHMFDICNSTSISLKPKDTFF-KQGWTLFVMLNNGIPTS------VSEKNDFN YKY

Dom13 SFFPVGINPAPIFGDYHNTPPGYFYNTTSGLFEIEDPTIIKQDQINPYQPREFIKGISECDLELLSVCNS-SSIIIKDNLIF-KQDWTLFVMVSTGIPEYE---DNSNNIDLYN YKY

Dom12 AFYPPGIKPAPIIGNYHTTPTNYHYNTTTGQFEKNE-DHPNPDEINPFQPNHIVDGISECDLYLFSTCNATSAKIMPEDTIF-KQDWTLFVIISTGIPEYD---NNSNNIDLYN YKY

Dom14 PFLPVGIHSAPYFGTSENTPPGYHYNSTTSQFEKNE-DHPNPDEINPFQPNHIVDGISECDLFMFDTCNATSAQIMPEDTTF-KQGWTLFILISTGIPIRDP--NNVNNNDIHH5IRF

Dom10 PFFPPGIKPAPIDGDENNTPPNYHYNTTTGQFEKNE-DHPNLDEINPFQPTHIVDGISECDLYLFSTCNATSAKIMPEDTTF-NEDWTLFVILSTGVPEYG----DNSNNILFN YKY

Dom8 PFFPPGVQPAPIDGDENNTPPNYHYNTTTGQFEKNE-DHPNPDEINPFQPNHIVDGISECDLYLFSTCNGTSAKIMPEDATF-NEGWTLFVILSTGVPEYG----DNSNNIAFN YKY

Dom7 PFFPPGIKPAPIDGDENNTPPGYHYNTTTGQFEKDE-DNTNLDEINPFQPNHIVDGISECELYLFSTCNGTSAKIMPEDATF-NEGWTLFVILSTGVPEYG----DNSNNILFN YKY

Dom6 EEFPLGLKPAPIDGDESNTPPGYHYNTTTGQFEKNE-DHSDPDEINPFQPTHIVDGISECNLYLFSTCNGTSAKIMPEDATF-NEGWTLFVILSTGVPEYG----DNSNNIAFN YKY

Dom9 PFFPPGIKPAPIDGDESNTPPGYHYNTTTGQFEKDE-DNTNLDEINPFQPNHIVDGISECDLYLFSTCNATSAKIMPEDVTF-KEGWTLFVILSTGVPDY-----DETKYNGFN YKY

Dom5 PFYPPGVHPAPIVGDQLNTPPNYHYNTTTGQFEKNE-DHSDPDEINPFQPNHIVNGISECNLDLFTTCNASSIKIMPEDITF-KEFWTLFVISSTGIPNKG----SESD--PSN YKY

Dom11 PFFPPGVQPAPIDGDENNTPPNYHYNTTTGQFEKNE-DHPNPDEINPFQPNHIVDGISECDLYLFSTCNATSTKFMPDGTIF-KQGWTLFVMINTGVPDYD----PNNVNNNYN YRY

Dom4 PFLPVGVPVTDDQG--LVVPPGFYFNKTTSQYEPLG-SAEDSNSLRPPVPFQLVQGIEECELEIYSFCNSTKVTMESLMTNLDKKDWTLFAMVSTGIPTYT-----PDSSPNFN FQF

Consensus/100% ..hP.Gl.sss..G...ssPssaaaNpsssba...... p...lpP....ph..G.pECpL.hhs.CNu....hb.....h.pp.WTLFh..ssGlP...... s.....p...h.a

Dom16 QFINTFTS---- -KIVLSINLYDDFTEIKNEITGEKAVSVSPQCGPLCSTYKGGYFTFWMSYDSLNEEISISINRNRQILLKLRNAQAVSS----FDRIQPCCSNSVM------

Dom15 EFFNSKQGN--- -KVVLSLYYNNETVVFRNEVSGEFSVAGSPQCGPYCSTYPNGFFTFWLSYDYSSNNYIISVSSNQKYLLHLDAHGAK------FDHIKPCCGSEIN------

Dom13 DFINYDKDTGN- EELILSLKLYNDSIIFRNEKTGDASIAGSPQCGPYCSTYPKGFFTFWLTYNNDSKKYLISVNSNKEHLVEVSGYKSF------FNKVKGEIRSNNNHI-----NI

Dom12 DFVNFDKDTGN- EEIILSLLLRNETVTLKNHITNIEKTVGSPQCGPYCSTYPKGYFAFWLTYDEDNDLYEISINSNTEILLKLPASKQK------FNKIISRSSSNTNTNPNPNPNN

Dom14 DFINVEIGGNNQ QTLVLSLELFNDTVILRNEKTGDSSIAGSPQCGPYCSTYPKGYFAFWLTYDPEIHVYTVSVDYNQQKLVSINSANMK------FNLIKPICYNGSQNNN----CN

Dom10 NFVKSDSTSS-- DDLVLSLSFTNNTVTLKNEKTNVEIVVGSPQCGPYCSTYPKGYFAFWLTYDSITNKYIISVENNTKKLVEIEADITSDSV---FNKIIPVIDTNGGAGVD---IS

Dom8 NFIKSYSTGS-- DDLVLSLSFTNNTVTLKNEKTSTESVVGSPQCGPYCSTYPKGYFAFWLTYDSVTSKYIISVENNSKKLVEIEADITSDSV---FNKITPVIDTNGGAGVD---IS

Dom7 NFIKTDSANS-- DDIILSLSFTNNTVTFKNEKTSTESVVGSPQCGPYCSTYPKGYFAFWLTYDSATSKYIISVENNTKKLVEIEADITGNLV---FNKITPVIDTN-GVGVE---VS

Dom6 NFVKSDSTKI-- DNVVLSLSFTNSTVTFKNEKTSTESVVGSPQCGPYCSTYPKGYFAFWLTYDSATSKYIISVENNTKKLVEIEAEITGSDDDPIFNKITPMIDGNSGAGVE---IS

Dom9 EFKKSDNSNGGL EETVLSLEYSNDTVTFKNLKTNEEEVVGSPQCGPYCSTYPKGYFAFWLTYEPIRKMYSIGIENNSKYLMEIKAEDNVV-----FNKIVP------CCGDM---VD

Dom5 EFMKTDTGVE-- -ELVLSLTISNETVILKNEKTNEEEVVGSPQCGPYCSTYPKGYFAFWLTLDSINKKYIVSIENNSKKLVEINSEGAD------FNQVKP---CKSCQGTP---DS

Dom11 EFISSTNSNTGG4DKLILSLEYNNETVTFRNEITGEEDVVGSPQCGPYCSTYPKGYFAFWLTYDSKYNYFIISVNSNNDYLTSISADNTDMI----FNKIIPRGNN------VN

Dom4 KFKDKQDQ------IKFSIDISNSSIVISGSDSSQIAEAVSPQCGPYCSTYPKGYFTFWLQKKFSENKFVVGVNYNNLLLVEFDSDNRD------FTKIDTGGDPGATLSPS-----

Consensus/100% pF.p...... hSl.h.sp...hps..ss....ssSPQCGPhCSTY..GaFsFWhpbp...p.h.lulp.Np..L.ph.s...... Fs.l.s......

Dom16 NSFSLWQLSNVAHIPQ------IEATTTTTTTSPPWQDSVHNECLLVENI- HCKGFNATLKDPPV-FHHENILWMNFKLPT- QNCNSPFNLN- EKFWY

Dom15 NSFSLWQLSNTVIYPL------RISSTTTTTTLPPWFETIYDECQLQTSST5PCRGHSASLVDED--FSYGYLISLNFTLSRV5NNVLSPILPIS33KYIWN

Dom13 NTYSIWQLTNIVDNPI------SISTTTTTTTSPPWDSITFDNCNL-EINI PCKGYAATIKDE---LKMGNIIWISTKVHNG NIGITNFNNE- EKYWI

Dom12 TTFNVWYLTNVKQTPK------SISTTTTTTTTPPWDKVEKEVCDLSEIFM PCQGYNATIDPY---INLGDLISMKFIIEKE CNCINKINHLG NNYWL

Dom14 NVFNIFQLTNKQKIPR------SLSTTTTTTTAIPWFSQVLDECKLNIDI- PCRGIHASIDNNNIPFSQGNILWFNTTIISS KDKIHGLDSS- TLYWF

Dom10 KTFSIWQLTNIKEIPIDTATTTTTTTTTTT18TTTSTTSTTTERPWYEIELDECPLELNK- PCRGINGKLDKP---FNHGNLLWINGTLGK- DINGRLLLD-- DTNLM

Dom8 KTFSIWQLTNIKEIPIDTATTTTT------TTTSTTSTTTEQPWFEKDHDECPLELNK- PCRGKDGNLDSP---FEKGQMVWLNRTVSI- SNPNLDFEN-- GKYWF

Dom7 KTFSIWQLTNIKEIPADTATTTTTTTTTTT 4TTTSTTSTTTEQPWYEIELDECPVKLEK- PCRGLDAVINDPE--FQLGNLLWIQLKMEV- SNPLIEENG-- RKYWM

Dom6 KTFSVWHLFNKKIINNVVATT------TTTSTTSTTTEQPWHEIELEECPLEFKK- PCKGHNAIIKDSQ--FNKGNLIWISMKTVK- STPEIELAP-- NKYWH

Dom9 KSYNIWQLTDSVSIPKSQATTTTTTTTTTS TASTESTTTTKTPWEDVELDDCPLDMSS- PCKGINSTLTIPDK-FEEGNFIWINSSIAD- HGKTININN-- IEMTK

Dom5 EMYSIWKLFNSKIQPTEVA------TTTTTTTKTPWQDQIIENCIVSYDSD PCKGTNVEIDPE---LQEGDILWINTTLQI- SDPKVEING-- KSYWQ

Dom11 NSYNIWHLNSQNRLPVSIATTT----350(S/T)--TTTTTTTNPPWDQIVQDNCELEPNNQ PCKGFNSTIPSI---VNQGNLFWIDSNIAYY5NIRKLYFTENN 6GYYWM

Dom4 KTYVLWNLLDISNPPK------QSATTTTTKTPWDQDIKDECHLELDQ- PCRGNDSILDKP---LNQGDIVWINTLLGK- SDQPIEVDG-- KNYWF

Consensus/100% p.asla.L.s....s...... s.ssTTT..PW.p...p.C.l...... CpG..s.l...... hp...hh.hp...... p......

Dom16 GYFFTNK-DKAVLSILFNETLISLYDWKNQQEFYSPYTDGSLVYQGQNLTIGLGWSRLGFFLLNKDSESLINIKSM-TDFSFDKVSNHGES-LNPSIFLLKS--GFLYPNEVLVHGYREC

Dom15 GFEFKDKNDQPVLSLKFNETTITLYDHKDDLEYISPYTNESTAYSGRFESLSLGWSRLGLFLMNSDFNSLIKIPTR-NNFEFNKITKLVHD-NIPLNFTLKQ--DFMYPNEILFQGYHSC

Dom13 GINLKSSSSESLMNLYFNETFITLIDDLQGRQYISRYTEEILIYEGFELTIGIAWSKFGVFILNEKMNSLITFQTH-QEYPISKILPLHGS-RRQIYHFLFS-GGFLFPGNTLYEGYTTC

Dom12 GFDFN-HNDDKILSIFLNESFILMYDWKNHQEYISPFTNESLSYDHMDLEILIGWSRLGLFLMDNNHNGLIHIQNI-PDNKINKIKQHSNE-QKTLIRFELINKQFLYPNEILYQGYHTC

Dom14 GYQFLFD-SNPIFSLLFNETFVTFTDLISRIEYSSKYTNESLVYPGMEMIFGIGWSRLGIFVVNSRNEALIQINTL-TDITFNQISLYGSE-TIPTNFFLTN--DFVFPKQILYKGYESC

Dom10 GYDIMNDNK-KIMTLLLNETFIGVYDLTKDQEYYSYYTNESLAYNGMDFKIGIGWSKLGLFILNEYSSSLIEIQTH-SDYKFNKVLLRGPTMDKYVNYILLD--NFLYPNGILYLGYDTC

Dom8 GFRFEDENNNKLLSLYFNESFITVVDWRQDLKYITPYTNESLSYSGREENIGVGWSKMGLFILNEDLNSLVELQTD-LDFGFKKIIQE-SNPQVPIKYILEE--GFFYPNNILFIGYETC

Dom7 GYNFH-KDTKEVMKIRLNETHILLQDQLNNKDYSSPLTNSSLGYIERELTIGIGWSRLGLFIINDDHNSYIDIKGR-DDYSFNKIEQILVPDKKASNYILED--SFLYPINIMYLGYETC

Dom6 GFSFYNKQNEKVLSILFNETHLLLEDKESGMDFTSRYTNSSLNYVGETITIGLGWSRLGLFLINENKDSFIEVKNN-LDYTFVKIVHDSTQ-KSSSNYYLEE--NFLYPNIFLFKGYETC

Dom9 IYHFKDNNQDEILILGFNMTHLTLIDNQSSKVYSSYYTNQTTSFPTKSFSIGIGWSRLGLFILNEYGDGLIQLKSD-KKYEFVKIEQAQVL-ETLTIYEWSN--KFIYPREMLFLGYSIC

Dom5 GIQLK-KDDKKVFSILFDELFISINIEDSGNEYFSPYTNQTLGYSGREISIGIGKSKYGYFILNEILGSLITLNANGLDLSFNKASPLSSH-DIVSKFELQK--GFVLPANQLFIGYETC

Dom11 GYNFMKD-NDPVFTLLFNETYLTIKDWKGKQEYYSPYTNEKLAYNNMNFTIGIGLSRLGLYLMNSKLQSLIQIQPQ-YDYSFNHVKQLLYDNQVITNYLLQD--GFLYPNTLLYEGYEKC

Dom4 EYKFQ-KESDSVISLLFNETSVAIYFYDRAETLSSKYTNQHLLYEGMDITLGITSNRFGFFLVNKDLNALIHDPSL-KILDYNKILQESVP-NHISNFLLEE--GFTYPKNILFRGYETC

Consensus/100% .h.h..p.p..lh.l.hsb..l.h...... h.s.hTp....a..b.b.h.ls.s+hGhalhsp...uhlp..s...... h.ph...... b.....F.hP...hh.GYp.C

Dom16 SFFENCV-SNSLSCNSQVLSQICSNPTAGMSWEIETEIMNTQVNNGTWGKRFESQDIMNVFALSPN12GNLEDPGFIYILKDRLVLQNHLG-SVCSGPLPGAKEVNF-GDKIKLSIGI

Dom15 SLFNDCK-SKSTSCESQVVSETCNRNMNKTQWIIETNITDTQINNGTWGPNFELTELLSAFNIN-- NGLEDVLSIYIFNQRIAIQNHIGHSICDGIYPESRSLNI-GDTIVWSIGV

Dom13 SLYKDCD-STSTGCSAQTLTNPCSKSQPGISWVIESEILDTKVNNGTWGSDLELNNLLNVYLVN-- NGIEDTLAIYFFNNRLALQDLKENIVCNGPYPKDISQNF-GSVLNWILSV

Dom12 SFYNDCS-TEIISCSSQALIDTCNKPHPGMNWIIDTDISETNVNNGTWGSKLEFPTLINLFKVN-- NLTDDIYTIYIFNNRISIQDEHNKISCDGYYPNFKKLNINGDHLTWSLGI

Dom14 SFFEDCP-MN-ATCKSQSLTGICNRKDPGSYYQIETVIKETFLTNSTWGHRLELNSLLAEFVINGG DFEEDLLAIYIFENRIAMQDLKHNVVCSGPYPKDSIIDY-GDTLKWSIGF

Dom10 TFNGECN-VKSLECSSQVLEGLCLKKLPGITWKMETILAQTNVNNGTWGFDYSINGLINLYTLN-- NGIEDVLVIYFFNNRVALQDLINSNSCSGPYPENSQVNV-GDLIKWSIGI

Dom8 SLRNECR-LDSTSCNSQVLKDTCDKRIDEMSWTFETSFSETNVNNGTWSQDLVIPDLLGVYLIN-- NGHIDISIIYLFKNRLVLKGLILDEVCSGPYPGNQIKQI-GDKITWSLAI

Dom7 SLFSDCS-QNIVHCGSQVLKEICTKRIPNMEWTFEGLNAETNVNNGTWGEELKMDELMNLYIVNSG EEEKNIFGIYIFKNRMALQDFENFQSCSGIYPKGKNLSF-GDKMKWSLGL

Dom6 GLYEECS-TSPVNCYSQVLTEICPKKVPGMYWTFENIIHETNVNNGTWGSEFILDGLLNVYLIN-- DGDSDIATIYFFNNRMVLLDSKNKVACSGPYPNNHIISV-SDKIKWSIGF

Dom9 SAYNDCTDDNSLSCSAQAFGDLCNRRKTNISWDIEFKVTETYVNNGTWGNEFELPGLVNIYFVS-- NDHQNILSIYFFQSYLALRDLENQNICSGPYPESMNISQ-DDKINWSLNI

Dom5 SLNDDCS-LETTSCNSQVVKGLCPSPSNTRIWTIEVEISKTNINNGTWGESLVMDELMNIYFLS-- SSSEILYNVLFFENRVVIEDADTQVTCSGAYPGGVSISY-GEKFIWSFGV

Dom11 SLYNNCL-SNKLECNSQVLTGICSTLSPGLNLEFETEIKSTNLNNGTWGEIFKQSDLLNTYIIS-- SNNEDQYIIYIYSNRIAIQDLNNYISCSGPYPNGNLIHI-GSKLIWKIGF

Dom4 TLYGDCK-TVSKSCESQVLIEMCNPIQAGTEISIQTSISQTNINNGTWGAPLIQSDLINIFHIG-- NEDKDVYTVYIYTNRMVLEDMVNFISCGGPYPGGKTLSS-SSNLEWSIAL

Consensus/100% sh..pC...... C.uQsh...C....s...h.hp....pT.lsNuTWu..h....lhs.a.ls...... lhhhppbhshbs...... CsG.hP.....p..sp.h.h.hsh

Dom16 DSMS-MLYLNVFNE---DDNEYNTVCTIGKIE- NGWRFKYIFPIGHSPSVSSFVQIKLG-FPDGGF

Dom15 DDNK-LLYLNVMNE---NKSKFYSVCTLKYNE- DFNYFKFIYPRGYSPSRSIFTQKIGD-FPNGGF

Dom13 DENQ-MLYLNIKQK---DQS-IFSVCVIPFGD- HLKPIRYIYPRGYAPSTSNFTQILNS-IPKGGF

Dom12 DHSY-LLYLNIHDI---NNGKNYTVCVLPMDI- SSGALSIIYPKGYSPLKSNFTQQIFG-FPNGGY

Dom14 DISN-QMYLNVHNFK--NIDGNFTICTLSTGI- QNSDFLYLYPQGYAPKPSTFKEFKNG-FPKGGF

Dom10 DSLM-NLYLNILDDN--VTGKNYTVCYIKNLNQ FGGGFKYLSPLGYRPNYSRFIQEISPKFQDGGY

Dom8 DSND-LLYLNIIDDSDPKNKEYYTICPISYDN- RFGRFKYIYPLGHAPSKVTFTQDLKG-FPQGGY

Dom7 DSDN-LVYLNYIDID--DNTK-YTVCVLKHNP- GFTRVKYIHPLGYSPSYMIMTQTKNG-LDEGGY

Dom6 GKNY-LLYLNVYDLN--NNNKEYSVCTIQYGS- GFFNIEYVYPLGYAPSMSKFNQYKDG-FPNGGY

Dom9 GHSG-EIYLNIESWKGSNKGTKFTICMIGGGSN 4YIQDVKYIHPLGYSPSFNKYTQINSLLIPNGGY

Dom5 NENSRMIYFNSAPKD--NPNKLNTICSMPYMG- KRSQITTVYPLGYAPSKALFTQKKDG-LPTGGF

Dom11 DSYY-SIYLNVYDQ---INHEYYSICQLKPVIH34GIDSIDYIYTRGYNPSNTKFIQYINN-FPRGGY

Dom4 DSKK-FLYLNVLDE---ENTEMKTICSMSLIG- GIHSLDYISPSGFKPNNSVFTQKINS-FKQGGY

Consensus/100% s.....hYhN...... slC.h...... h..l.s.Ga.P....h.p.....h.pGGa