Mrs. Pershad's BLAST Search Questions
For this part of the project you will need to access the following
1. Here is the DNA sequence. First log on to the BLAST website given above; then copy and paste the sequence into the box and perform a BLAST nucleotide search against the mouse database.
1 gatccttact caattgattt ctaatagtta caagtgtcca cgctgtgagg gctgtacctt
61 ctccttgaag taaacaatat ttaagtgatg tccagcatcg gtgcgatgat tactctaact
121 gatttacttt agaagcactg ctttctcatc aaaggccact aatgcctgca ctctagagag
181 aggcagactt ggaatctact tttaggccta cggacatcct cgcccaaaaa taagtctgag
241 tgtgtcctcc ttagaccgca ctagcagctg cctcttcagc tcactagctc agaactgcta
301 aagcattaca agcaccagac tgagtcgctg aagagaaatg ccaaaaaatg acatctttgc
361 tttaaaatag caacgctagg cttacctaca ggcacatatc tttagtggta acgtttgcat
421 ttttatttgg tttgtcaaaa gtaatgaaga agccgccaag tcccctttta ctttctccat
481 ttctgcctct ttgttgtgcc gtaccatacc aataaatgga aattgatgtt tgcaggtcct
541 ttaggcttaa acaatctaag caataaatgt agaggggaga aaagatgata tttaaaaaca
601 taaaaggaaa acacgtaaaa cgtagactta aggggatgaa ccctccagaa gagattgaaa
661 gataaaaaag aaaaagacta gctccttgga ttctaatcta tgaagtagag agtgtgggca
721 attcttgtgc taatagagtt ttcctttcaa cgggaatatc tccaagttca aaacagttgc
781 atttagtaga attctacctt cctatgacag cttctgatat gagtcacgct aggccctcaa
841 ggcctactta cttttcaaat ccagtgctat actggaaaga gagatccttc cagatttttt
901 ctaacacttc aagataaaaa caccagtgct gctaggctct gagtctttat ctgcgcctct
961 ccaactttaa tgtgttcagg ttttgatttg gtacattcag gttgaggctc ttgggggagc
1021 gtggggtgga ctttctaaga agtcttctgg tgatggtgaa actgttagtt ctagaaggac
1081 attttaagaa tcagtgctat aatatcaaag gttgtttctt aagaatccag ggtagccggg
1141 tattgtggtg tacaccttta atcctagcaa tggggaggaa gaaacaggtg gatatctgtg
1201 agttcaagac cagcctggtc tgatctactt agtgagttct aggacagtca gaactatgtg
1261 agagaccctg tctcaaaaaa taaataaata aagattcata gactcaggat aagtttgatt
1321 ttttttcctg tcctacttct ttaattttca gttgaggaag acaagggtgt agaagggagg
1381 tgaatacttc tcatgtgatt attttttcct ttcattttct aaaaagaaaa cgccaagccc
1441 acacactaat tatctacgat gaggtcattt gcacctaggt catttattta ggcaacaagt
1501 agccttgcca gaagattttt aaggggatgt tggcaagagg ggatgtggct tccaggttct
1561 tgggtctagc ttcaactcct ttcgatcttt ctttatgtca ccacccaccc cccatttctg
1621 ttctgctttg aaagggctca ctgccctgtc cctggctatc caggttccca ggtccttagg
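If you would rather paste a clean sequence into the search box, the position numbers and spaces can be stripped first. The following is a minimal Python sketch, not part of the assignment; the function name `genbank_to_fasta` and the header text `question_1` are made up for illustration. It keeps only the letters and wraps them under a FASTA header line.

```python
import re

def genbank_to_fasta(block, header="query", width=60):
    """Turn GenBank-style numbered sequence lines into FASTA text."""
    # Keep letters only: this drops position numbers, spaces, and newlines.
    residues = re.sub(r"[^A-Za-z]", "", block)
    # Wrap the sequence into fixed-width lines under a ">" header.
    lines = [residues[i:i + width] for i in range(0, len(residues), width)]
    return ">" + header + "\n" + "\n".join(lines)

# First two lines of the sequence above, used as a short sample.
sample = """1 gatccttact caattgattt ctaatagtta caagtgtcca cgctgtgagg gctgtacctt
61 ctccttgaag taaacaatat ttaagtgatg tccagcatcg gtgcgatgat tactctaact"""

print(genbank_to_fasta(sample, header="question_1"))
```

The same function should work for the protein sequence in question 2, since it only removes characters that are not letters.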
Provide the answers to the following questions:
b) What is the accession number for this DNA sequence?
c) Which chromosome is this located on?
d) Who discovered this sequence and where was the data published? Cite the full journal title and year and provide a summary of the findings.
2. Enter the following protein sequence into BLASTp.
1 mrfgskmmpm fltvylsnne qhftevpvtp eticrdvvdl ckepgesdch laevwcgser
61 pvadnermfd vlqrfgsqrn evrfflrher ppgrdivsgp rsqdpslkrn gvkvpgeyrr
121 kengvnsprm dltlaelqem asrqqqqiea qqqllatkeq rlkflkqqdq rqqqqvaeqe
181 klkrlkeiae nqeaklkkvr alkghveqkr lsngklveei eqmnnlfqqk qrelvlavsk
241 veeltrqlem lkngridshh dnqsavaeld rlykelqlrn klnqeqnakl qqqreclnkr
301 nsevavmdkr vnelrdrlwk kkaalqqken lpvssdgnlp qqaasapsrv aavgpyiqss
361 tmprmpsrpe llvkpalpdg slviqasegp mkiqtlpnmr sgaasqtkgs kihpvgpdws
421 psnadlfpsq gsasvpqstg naldqvddge vplrekekkv rpfsmfdavd qsnappsfgt
481 lrknqssedi lrdaqvankn vakvpppvpt kpkqinlpyf gqtnqppsdi kpdgssqqls
541 tvvpsmgtkp kpagqqprvl lspsipsvgq dqtlspgskq esppaaavrp ftpqpskdtl
601 lppfrkpqtv aassiysmyt qqqapgknfq qavqsaltkt htrgphfssv ygkpviaaaq
661 nqqqhpeniy snsqgkpgsp epetepvssv qenhenerip rplsptkllp flsnpyrnqs
721 dadlealrkk lsnaprplkk rssitepegp ngpniqklly qrttiaamet isvpsypsks
781 asvtassesp veiqnpylhv epekevvslv peslspedvg nastensdmp apspgldyep
841 egvpdnspnl qnnpeepnpe aphvldvyle eyppyppppy psgepegpge dsvsmrppei
901 tgqvslppgk rtnlrktgse riahgmrvkf nplallldss legefdlvqr iiyevddpsl
961 pndegitalh navcaghtei vkflvqfgvn vnaadsdgwt plhcaascnn vqvckflves
1021 gaavfamtys dmqtaadkce emeegytqcs qflygvqekm gimnkgviya lwdyepqndd
1081 elpmkegdcm tiihredede iewwwarlnd kegyvprnll glyprikprq rsla
a) What protein is this the sequence for?
b) Provide the answers to the following questions:
i) What organism did this sequence come from?
ii) What is the accession number for this sequence?
iii) Which chromosome is this located on?
iv) Who discovered this sequence and where was the data published? Cite the full journal title and year and provide a summary of the findings.
6. Input the following DNA sequence into BLAST. Upload the human sequence and see what gene you have identified.
1 gaagttatca gtcgacgtga gctcgctgag acttcctgga cgggggacag gctgtggggt
61 ttctcagata actgggcccc tgcgctcagg aggccttcac cctctgctct gggtaaagtt
121 cattggaaca gaaagaaatg gatttatctg ctcttcgcgt tgaagaagta caaaatgtca
181 ttaatgctat gcagaaaatc ttagagtgtc ccatctgtct ggagttgatc aaggaacctg
241 tctccacaaa gtgtgaccac atattttgca aattttgcat gctgaaactt ctcaaccaga
301 agaaagggcc ttcacagtgt cctttatgta agaatgatat aaccaaaagg agcctacaag
361 aaagtacgag atttagtcaa cttgttgaag agctattgaa aatcatttgt gcttttcagc
421 ttgacacagg tttggagtat gcaaacagct ataattttgc aaaaaaggaa aataactctc
481 ctgaacatct aaaagatgaa gtttctatca tccaaagtat gggctacaga aaccgtgcca
541 aaagacttct acagagtgaa cccgaaaatc cttccttgca ggaaaccagt ctcagtgtcc
601 aactctctaa ccttggaact gtgagaactc tgaggacaaa gcagcggata caacctcaaa
661 agacgtctgt ctacattgaa ttgggatctg attcttctga agataccgtt aataaggcaa
721 cttattgcag tgtgggagat caagaattgt tacaaatcac ccctcaagga accagggatg
781 aaatcagttt ggattctgca aaaaaggctg cttgtgaatt ttctgagacg gatgtaacaa
841 atactgaaca tcatcaaccc agtaataatg atttgaacac cactgagaag cgtgcagctg
901 agaggcatcc agaaaagtat cagggtagtt ctgtttcaaa cttgcatgtg gagccatgtg
961 gcacaaatac tcatgccagc tcattacagc atgagaacag cagtttatta ctcactaaag
1021 acagaatgaa tgtagaaaag gctgaattct gtaataaaag caaacagcct ggcttagcaa
1081 ggagccaaca taacagatgg gctggaagta aggaaacatg taatgatagg cggactccca
1141 gcacagaaaa aaaggtagat ctgaatgctg atcccctgtg tgagagaaaa gaatggaata
1201 agcagaaact gccatgctca gagaatccta gagatactga agatgttcct tggataacac
1261 taaatagcag cattcagaaa gttaatgagt ggttttccag aagtgatgaa ctgttaggtt
1321 ctgatgactc acatgatggg gagtctgaat caaatgccaa agtagctgat gtattggacg
1381 ttctaaatga ggtagatgaa tattctggtt cttcagagaa aatagactta ctggccagtg
1441 atcctcatga ggctttaata tgtaaaagtg aaagagttca ctccaaatca gtagagagta
1501 atattgaaga caaaatattt gggaaaacct atcggaagaa ggcaagcctc cccaacttaa
1561 gccatgtaac tgaaaatcta attataggag catttgttac tgagccacag ataatacaag
1621 agcgtcccct cacaaataaa ttaaagcgta aaaggagacc tacatcaggc cttcatcctg
1681 aggattttat caagaaagca gatttggcag tttaaaagac tcctgaaatg ataaatcagg
1741 gaactaacca aacggagcag aatggtcaag tgatgaatat tactaatagt ggtcatgaga
1801 ataaaacaaa aggtgattct attcagaatg agaaaaatcc taacccaata gaatcactcg
1861 aaaaagaatc tgctttcaaa acgaaagctg aacctataag cagcagtata agcaatatgg
1921 aactcgaatt aaatatccac aattcaaaag cacctaaaaa gaataggctg aggaggaagt
1981 cttctaccag gcatattcat gcgcttgaac tagtagtcag tagaaatcta agcccaccta
2041 attgtactga attgcaaatt gatagttgtt ctagcagtga agagataaag aaaaaaaagt
2101 acaaccaaat gccagtcagg cacagcagaa acctacaact catggaaggt aaagaacctg
2161 caactggagc caagaagagt aacaagccaa atgaacagac aagtaaaaga catgacagtg
2221 atactttccc agagctgaag ttaacaaatg cacctggttc ttttactaag tgtccaaata
2281 ccagtgaact taaagaattt gtcaatccta gccttccaag agaagaaaaa gaagagaaac
2341 tagaaacagt taaagtgtct aataatgctg aagaccccaa agatctcatg ttaagtggag
2401 aaagggtttt gcaaactgaa agatctgtag agagtagcag tatttcactg gtacctggta
2461 ctgattatgg cactcaggaa agtatctcgt tactggaagt tagcactcta gggaaggcaa
2521 aaacagaacc aaataaatgt gtgagtcagt gtgcagcatt tgaaaacccc aagggactaa
2581 ttcatggttg ttccaaagat aatagaaatg acacagaagg ctttaagtat ccattgggac
2641 atgaagttaa ccacagtcgg gaaacaagca tagaaatgga agaaagtgaa ctcgatgctc
2701 agtatttgca gaatacattc aaggtttcaa agcgccagtc atttgctctg ttttcaaatc
2761 caggaaatgc agaagaggaa tgtgcaacat tctctgccca ctctgggtcc ttaaagaaac
2821 aaagtccaaa agtcactttt gaatgtgaac aaaaggaaga aaatcaagga aagaatgagt
2881 ctaatatcaa gcctgtacag acagttaata tcactgcagg ctttcctgtg gttggtcaga
2941 aagataagcc agttgataat gccaaatgta gtatcaaagg aggctctagg ttttgtctat
3001 catctcagtt cagaggcaac gaaactggac tcattactcc aaataaacat ggacttttac
3061 aaaacccata tcgtatacca ccactttttc ccatcaagtc atttgttaaa actaaatgta
3121 agaaaaatct gctagaggaa aactttgagg aacattcaat gtcacctgaa agagaaatgg
3181 gaaatgagaa cattccaagt acagtgagca caattagccg taataacatt agagaaaatg
3241 tttttaaagg agccagctca agcaatatta atgaagtagg ttccagtact aatgaagtgg
3301 gctccagtat taatgaaata ggttccagtg atgaaaacat tcaagcagaa ctaggtagaa
3361 acagagggcc aaaattgaat gctatgctta gattaggggt tttgcaacct gaggtctata
3421 aacaaagtct tcctggaagt aattgtaagc atcctgaaat aaaaaagcaa gaatatgaag
3481 aagtagttca gactgttaat acagatttct ctccatatct gatttcagat aacttagaac
3541 agcctatggg aagtagtcat gcatctcagg tttgttctga gacacctgat gacctgttag
3601 atgatggtga aataaaggaa gatactagtt ttgctgaaaa tgacattaag gaaagttctg
3661 ctgtttttag caaaagcgtc cagagaggag agcttagcag gagtcctagc cctttcaccc
3721 atacacattt ggctcagggt taccgaagag gggccaagaa attagagtcc tcagaagaga
3781 acttatctag tgaggatgaa gagcttccct gcttccaaca cttgttattt ggtaaagtaa
3841 acaatatacc ttctcagtct actaggcata gcaccgttgc taccgagtgt ctgtctaaga
3901 acacagagga gaatttatta tcattgaaga atagcttaaa tgactgcagt aaccaggtaa
3961 tattggcaaa ggcatctcag gaacatcacc ttagtgagga aacaaaatgt tctgctagct
4021 tgttttcttc acagtgcagt gaattggaag acttgactgc aaatacaaac acccaggatc
4081 ctttcttgat tggttcttcc aaacaaatga ggcatcagtc tgaaagccag ggagttggtc
4141 tgagtgacaa ggaattggtt tcagatgatg aagaaagagg aacgggcttg gaagaaaata
4201 atcaagaaga gcaaagcatg gattcaaact taggtgaagc agcatctggg tgtgagagtg
4261 aaacaagcgt ctctgaagac tgctcagggc tatcctctca gagtgacatt ttaaccactc
4321 agcagaggga taccatgcaa cataacctga taaagctcca gcaggaaatg gctgaactag
4381 aagctgtgtt agaacagcat gggagccagc cttctaacag ctacccttcc atcataagtg
4441 actcctctgc ccttgaggac ctgcgaaatc cagaacaaag cacatcagaa aaagattcgc
4501 atatacatgg ccaaagggac aactccatgt tttctaaaag gcctagagaa catatatcag
4561 tattaacttc acagaaaagt agtgaatacc ctataagcca gaatccagaa ggcctttctg
4621 ctgacaagtt tgaggtgtct gcagatagtt ctaccagtaa aaataaagaa ccaggagtgg
4681 aaaggtcatc cccttctaaa tgcccatcat tagatgatag gtggtacatg cacagttgct
4741 ctgggagtct tcagaataga aactacccat ctcaagaggg gctcattaag gttgttgatg
4801 tggaggagca acagctggaa gagtctgggc cacacgattt gacggaaaca tcttacttgc
4861 caaggcaaga tctagaggga accccttacc tggaatctgg aatcagcctc ttctctgatg
4921 accctgaatc tgatccttct gaagacagag ccccagagtc agctcgtgtt ggcaacatac
4981 catcttcaac ctctgcattg aaagttcccc aattgaaagt tgcagaatct gcccagggtc
5041 cagctgctgc tcatactact gatactgctg ggtataatgc aatggaagaa agtgtgagca
5101 gggagaagcc agaattgaca gcttcaacag aaagggtcaa caaaagaatg tccatagtgg
5161 tgtctggcct gaccccagaa gaatttatgc tcgtgtacaa gtttgccaga aaacaccaca
5221 tcactttaac taatctaatt actgaagaga ctactcatgt tgttatgaaa acagatgctg
5281 agtttgtgtg tgaacggaca ctgaaatatt ttctaggaat tgcgggagga aaatgggtag
5341 ttagctattt ctgggtgacc cagtctatta aagaaagaaa aatgctgaat gagcatgatt
5401 ttgaagtcag aggagatgtg gtcaatggaa gaaaccacca aggtccaaag cgagcaagag
5461 aatcccagga cagaaagatc ttcagggggc tagaaatctg ttgctatggg cccttcacca
5521 acatgcccac agatcaactg gaatggatgg tacagctgtg tggtgcttct gtggtgaagg
5581 agctttcatc attcaccctt ggcacaggtg tccacccaat tgtggttgtg cagccagatg
5641 cctggacaga ggacaatggc ttccatgcaa ttgggcagat gtgtgaggca cctgtggtga
5701 cccgagagtg ggtgttggac agtgtagcac tctaccagtg ccaggagctg gacacctacc
5761 tgatacccca gatcccccac agccactact gactgcagcc agccacaggt acagagccac
5821 aggaccccaa gaatgagctt aagctttcta acca
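With a sequence this long, it is easy to lose or reorder a line while copying. The sketch below is one possible sanity check in Python, offered as an assumption about how you might verify your copy rather than a required step; `check_numbering` is a hypothetical helper name. It relies on the GenBank-style convention that each line's leading number is the position of its first residue, so each number should equal the previous line's number plus the residues on that line.

```python
def check_numbering(block):
    """Return a list of problems found in GenBank-style numbered sequence lines."""
    problems = []
    expected = 1  # position at which the next line should start
    for line_no, line in enumerate(block.strip().splitlines(), start=1):
        parts = line.split()
        if parts and parts[0].isdigit():
            start, chunks = int(parts[0]), parts[1:]
            if start != expected:
                problems.append(
                    "line %d: starts at %d, expected %d" % (line_no, start, expected)
                )
        else:
            # No leading number: count every chunk as sequence.
            start, chunks = expected, parts
            problems.append("line %d: missing position number" % line_no)
        # A digit inside a chunk usually means two lines were fused together.
        if any(not chunk.isalpha() for chunk in chunks):
            problems.append("line %d: non-letter characters in sequence" % line_no)
        expected = start + sum(len(chunk) for chunk in chunks)
    return problems

# Small made-up example: the second line is out of order.
print(check_numbering("1 acgtacgtac\n61 acgt"))
```

An empty list means the numbering is consistent; any message points at the line to recheck against the original.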
2. Provide the answers to the following questions:
a) What are the names of this sequence?
b) What is the location of this DNA sequence?
c) Who discovered this sequence and where was this data published? Cite the full journal title and year and provide a summary of the findings.