Mrs. Pershad's BLAST Search Questions

For this part of the project you will need to access the following

1. Here is the DNA sequence. First log on to the BLAST website given above; then copy and paste the sequence into the box and perform a BLAST nucleotide search against the mouse database.

1 gatccttact caattgattt ctaatagtta caagtgtcca cgctgtgagg gctgtacctt

61 ctccttgaag taaacaatat ttaagtgatg tccagcatcg gtgcgatgat tactctaact

121 gatttacttt agaagcactg ctttctcatc aaaggccact aatgcctgca ctctagagag

181 aggcagactt ggaatctact tttaggccta cggacatcct cgcccaaaaa taagtctgag

241 tgtgtcctcc ttagaccgca ctagcagctg cctcttcagc tcactagctc agaactgcta

301 aagcattaca agcaccagac tgagtcgctg aagagaaatg ccaaaaaatg acatctttgc

361 tttaaaatag caacgctagg cttacctaca ggcacatatc tttagtggta acgtttgcat

421 ttttatttgg tttgtcaaaa gtaatgaaga agccgccaag tcccctttta ctttctccat

481 ttctgcctct ttgttgtgcc gtaccatacc aataaatgga aattgatgtt tgcaggtcct

541 ttaggcttaa acaatctaag caataaatgt agaggggaga aaagatgata tttaaaaaca

601 taaaaggaaa acacgtaaaa cgtagactta aggggatgaa ccctccagaa gagattgaaa

661 gataaaaaag aaaaagacta gctccttgga ttctaatcta tgaagtagag agtgtgggca

721 attcttgtgc taatagagtt ttcctttcaa cgggaatatc tccaagttca aaacagttgc

781 atttagtaga attctacctt cctatgacag cttctgatat gagtcacgct aggccctcaa

841 ggcctactta cttttcaaat ccagtgctat actggaaaga gagatccttc cagatttttt

901 ctaacacttc aagataaaaa caccagtgct gctaggctct gagtctttat ctgcgcctct

961 ccaactttaa tgtgttcagg ttttgatttg gtacattcag gttgaggctc ttgggggagc

1021 gtggggtgga ctttctaaga agtcttctgg tgatggtgaa actgttagtt ctagaaggac

1081 attttaagaa tcagtgctat aatatcaaag gttgtttctt aagaatccag ggtagccggg

1141 tattgtggtg tacaccttta atcctagcaa tggggaggaa gaaacaggtg gatatctgtg

1201 agttcaagac cagcctggtc tgatctactt agtgagttct aggacagtca gaactatgtg

1261 agagaccctg tctcaaaaaa taaataaata aagattcata gactcaggat aagtttgatt

1321 ttttttcctg tcctacttct ttaattttca gttgaggaag acaagggtgt agaagggagg

1381 tgaatacttc tcatgtgatt attttttcct ttcattttct aaaaagaaaa cgccaagccc

1441 acacactaat tatctacgat gaggtcattt gcacctaggt catttattta ggcaacaagt

1501 agccttgcca gaagattttt aaggggatgt tggcaagagg ggatgtggct tccaggttct

1561 tgggtctagc ttcaactcct ttcgatcttt ctttatgtca ccacccaccc cccatttctg

1621 ttctgctttg aaagggctca ctgccctgtc cctggctatc caggttccca ggtccttagg
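
The sequences in this worksheet are printed with position numbers and spaces between blocks of ten letters. If you would rather paste a bare sequence into the search box, the short Python sketch below strips everything except the letters. The function name `clean_sequence` and the two-line sample are made up for illustration; they are not part of BLAST.

```python
import re

def clean_sequence(numbered_text: str) -> str:
    """Return only the sequence letters from a numbered, spaced listing."""
    # Digits, spaces, and line breaks are layout, not sequence, so drop them.
    return re.sub(r"[^A-Za-z]", "", numbered_text).lower()

# Sample: the first two lines of the DNA sequence for question 1.
example = """
1 gatccttact caattgattt ctaatagtta caagtgtcca cgctgtgagg gctgtacctt
61 ctccttgaag taaacaatat ttaagtgatg tccagcatcg gtgcgatgat tactctaact
"""

cleaned = clean_sequence(example)
print(len(cleaned))   # number of bases kept
print(cleaned[:20])   # first twenty bases, with no spaces or numbers
```

The same function works for the protein sequence in question 2, since it keeps every letter and removes only the numbering and spacing.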

Provide the answers to the following questions:

b) What is the accession number for this DNA sequence?

c) Which chromosome is this located on?

d) Who discovered this sequence and where was the data published? Cite the full journal title and year and provide a summary of the findings.

2. Enter the following protein sequence into BLASTp.

1 mrfgskmmpm fltvylsnne qhftevpvtp eticrdvvdl ckepgesdch laevwcgser

61 pvadnermfd vlqrfgsqrn evrfflrher ppgrdivsgp rsqdpslkrn gvkvpgeyrr

121 kengvnsprm dltlaelqem asrqqqqiea qqqllatkeq rlkflkqqdq rqqqqvaeqe

181 klkrlkeiae nqeaklkkvr alkghveqkr lsngklveei eqmnnlfqqk qrelvlavsk

241 veeltrqlem lkngridshh dnqsavaeld rlykelqlrn klnqeqnakl qqqreclnkr

301 nsevavmdkr vnelrdrlwk kkaalqqken lpvssdgnlp qqaasapsrv aavgpyiqss

361 tmprmpsrpe llvkpalpdg slviqasegp mkiqtlpnmr sgaasqtkgs kihpvgpdws

421 psnadlfpsq gsasvpqstg naldqvddge vplrekekkv rpfsmfdavd qsnappsfgt

481 lrknqssedi lrdaqvankn vakvpppvpt kpkqinlpyf gqtnqppsdi kpdgssqqls

541 tvvpsmgtkp kpagqqprvl lspsipsvgq dqtlspgskq esppaaavrp ftpqpskdtl

601 lppfrkpqtv aassiysmyt qqqapgknfq qavqsaltkt htrgphfssv ygkpviaaaq

661 nqqqhpeniy snsqgkpgsp epetepvssv qenhenerip rplsptkllp flsnpyrnqs

721 dadlealrkk lsnaprplkk rssitepegp ngpniqklly qrttiaamet isvpsypsks

781 asvtassesp veiqnpylhv epekevvslv peslspedvg nastensdmp apspgldyep

841 egvpdnspnl qnnpeepnpe aphvldvyle eyppyppppy psgepegpge dsvsmrppei

901 tgqvslppgk rtnlrktgse riahgmrvkf nplallldss legefdlvqr iiyevddpsl

961 pndegitalh navcaghtei vkflvqfgvn vnaadsdgwt plhcaascnn vqvckflves

1021 gaavfamtys dmqtaadkce emeegytqcs qflygvqekm gimnkgviya lwdyepqndd

1081 elpmkegdcm tiihredede iewwwarlnd kegyvprnll glyprikprq rsla

//

a) What protein is this the sequence for?

b) Provide the answers to the following questions:

i) What organism did this sequence come from?

ii) What is the accession number for this sequence?

iii) Which chromosome is this located on?

iv) Who discovered this sequence and where was the data published? Cite the full journal title and year and provide a summary of the findings.

6. Input the following DNA sequence into BLAST. Upload the human sequence and see what gene you have identified.

1 gaagttatca gtcgacgtga gctcgctgag acttcctgga cgggggacag gctgtggggt

61 ttctcagata actgggcccc tgcgctcagg aggccttcac cctctgctct gggtaaagtt

121 cattggaaca gaaagaaatg gatttatctg ctcttcgcgt tgaagaagta caaaatgtca

181 ttaatgctat gcagaaaatc ttagagtgtc ccatctgtct ggagttgatc aaggaacctg

241 tctccacaaa gtgtgaccac atattttgca aattttgcat gctgaaactt ctcaaccaga

301 agaaagggcc ttcacagtgt cctttatgta agaatgatat aaccaaaagg agcctacaag

361 aaagtacgag atttagtcaa cttgttgaag agctattgaa aatcatttgt gcttttcagc

421 ttgacacagg tttggagtat gcaaacagct ataattttgc aaaaaaggaa aataactctc

481 ctgaacatct aaaagatgaa gtttctatca tccaaagtat gggctacaga aaccgtgcca

541 aaagacttct acagagtgaa cccgaaaatc cttccttgca ggaaaccagt ctcagtgtcc

601 aactctctaa ccttggaact gtgagaactc tgaggacaaa gcagcggata caacctcaaa

661 agacgtctgt ctacattgaa ttgggatctg attcttctga agataccgtt aataaggcaa

721 cttattgcag tgtgggagat caagaattgt tacaaatcac ccctcaagga accagggatg

781 aaatcagttt ggattctgca aaaaaggctg cttgtgaatt ttctgagacg gatgtaacaa

841 atactgaaca tcatcaaccc agtaataatg atttgaacac cactgagaag cgtgcagctg

901 agaggcatcc agaaaagtat cagggtagtt ctgtttcaaa cttgcatgtg gagccatgtg

961 gcacaaatac tcatgccagc tcattacagc atgagaacag cagtttatta ctcactaaag

1021 acagaatgaa tgtagaaaag gctgaattct gtaataaaag caaacagcct ggcttagcaa

1081 ggagccaaca taacagatgg gctggaagta aggaaacatg taatgatagg cggactccca

1141 gcacagaaaa aaaggtagat ctgaatgctg atcccctgtg tgagagaaaa gaatggaata

1201 agcagaaact gccatgctca gagaatccta gagatactga agatgttcct tggataacac

1261 taaatagcag cattcagaaa gttaatgagt ggttttccag aagtgatgaa ctgttaggtt

1321 ctgatgactc acatgatggg gagtctgaat caaatgccaa agtagctgat gtattggacg

1381 ttctaaatga ggtagatgaa tattctggtt cttcagagaa aatagactta ctggccagtg

1441 atcctcatga ggctttaata tgtaaaagtg aaagagttca ctccaaatca gtagagagta

1501 atattgaaga caaaatattt gggaaaacct atcggaagaa ggcaagcctc cccaacttaa

1561 gccatgtaac tgaaaatcta attataggag catttgttac tgagccacag ataatacaag

1621 agcgtcccct cacaaataaa ttaaagcgta aaaggagacc tacatcaggc cttcatcctg

1681 aggattttat caagaaagca gatttggcag tttaaaagac tcctgaaatg ataaatcagg

1741 gaactaacca aacggagcag aatggtcaag tgatgaatat tactaatagt ggtcatgaga

1801 ataaaacaaa aggtgattct attcagaatg agaaaaatcc taacccaata gaatcactcg

1861 aaaaagaatc tgctttcaaa acgaaagctg aacctataag cagcagtata agcaatatgg

1921 aactcgaatt aaatatccac aattcaaaag cacctaaaaa gaataggctg aggaggaagt

1981 cttctaccag gcatattcat gcgcttgaac tagtagtcag tagaaatcta agcccaccta

2041 attgtactga attgcaaatt gatagttgtt ctagcagtga agagataaag aaaaaaaagt

2101 acaaccaaat gccagtcagg cacagcagaa acctacaact catggaaggt aaagaacctg

2161 caactggagc caagaagagt aacaagccaa atgaacagac aagtaaaaga catgacagtg

2221 atactttccc agagctgaag ttaacaaatg cacctggttc ttttactaag tgtccaaata

2281 ccagtgaact taaagaattt gtcaatccta gccttccaag agaagaaaaa gaagagaaac

2341 tagaaacagt taaagtgtct aataatgctg aagaccccaa agatctcatg ttaagtggag

2401 aaagggtttt gcaaactgaa agatctgtag agagtagcag tatttcactg gtacctggta

2461 ctgattatgg cactcaggaa agtatctcgt tactggaagt tagcactcta gggaaggcaa

2521 aaacagaacc aaataaatgt gtgagtcagt gtgcagcatt tgaaaacccc aagggactaa

2581 ttcatggttg ttccaaagat aatagaaatg acacagaagg ctttaagtat ccattgggac

2641 atgaagttaa ccacagtcgg gaaacaagca tagaaatgga agaaagtgaa ctcgatgctc

2701 agtatttgca gaatacattc aaggtttcaa agcgccagtc atttgctctg ttttcaaatc

2761 caggaaatgc agaagaggaa tgtgcaacat tctctgccca ctctgggtcc ttaaagaaac

2821 aaagtccaaa agtcactttt gaatgtgaac aaaaggaaga aaatcaagga aagaatgagt

2881 ctaatatcaa gcctgtacag acagttaata tcactgcagg ctttcctgtg gttggtcaga

2941 aagataagcc agttgataat gccaaatgta gtatcaaagg aggctctagg ttttgtctat

3001 catctcagtt cagaggcaac gaaactggac tcattactcc aaataaacat ggacttttac

3061 aaaacccata tcgtatacca ccactttttc ccatcaagtc atttgttaaa actaaatgta

3121 agaaaaatct gctagaggaa aactttgagg aacattcaat gtcacctgaa agagaaatgg

3181 gaaatgagaa cattccaagt acagtgagca caattagccg taataacatt agagaaaatg

3241 tttttaaagg agccagctca agcaatatta atgaagtagg ttccagtact aatgaagtgg

3301 gctccagtat taatgaaata ggttccagtg atgaaaacat tcaagcagaa ctaggtagaa

3361 acagagggcc aaaattgaat gctatgctta gattaggggt tttgcaacct gaggtctata

3421 aacaaagtct tcctggaagt aattgtaagc atcctgaaat aaaaaagcaa gaatatgaag

3481 aagtagttca gactgttaat acagatttct ctccatatct gatttcagat aacttagaac

3541 agcctatggg aagtagtcat gcatctcagg tttgttctga gacacctgat gacctgttag

3601 atgatggtga aataaaggaa gatactagtt ttgctgaaaa tgacattaag gaaagttctg

3661 ctgtttttag caaaagcgtc cagagaggag agcttagcag gagtcctagc cctttcaccc

3721 atacacattt ggctcagggt taccgaagag gggccaagaa attagagtcc tcagaagaga

3781 acttatctag tgaggatgaa gagcttccct gcttccaaca cttgttattt ggtaaagtaa

3841 acaatatacc ttctcagtct actaggcata gcaccgttgc taccgagtgt ctgtctaaga

3901 acacagagga gaatttatta tcattgaaga atagcttaaa tgactgcagt aaccaggtaa

3961 tattggcaaa ggcatctcag gaacatcacc ttagtgagga aacaaaatgt tctgctagct

4021 tgttttcttc acagtgcagt gaattggaag acttgactgc aaatacaaac acccaggatc

4081 ctttcttgat tggttcttcc aaacaaatga ggcatcagtc tgaaagccag ggagttggtc

4141 tgagtgacaa ggaattggtt tcagatgatg aagaaagagg aacgggcttg gaagaaaata

4201 atcaagaaga gcaaagcatg gattcaaact taggtgaagc agcatctggg tgtgagagtg

4261 aaacaagcgt ctctgaagac tgctcagggc tatcctctca gagtgacatt ttaaccactc

4321 agcagaggga taccatgcaa cataacctga taaagctcca gcaggaaatg gctgaactag

4381 aagctgtgtt agaacagcat gggagccagc cttctaacag ctacccttcc atcataagtg

4441 actcctctgc ccttgaggac ctgcgaaatc cagaacaaag cacatcagaa aaagattcgc

4501 atatacatgg ccaaagggac aactccatgt tttctaaaag gcctagagaa catatatcag

4561 tattaacttc acagaaaagt agtgaatacc ctataagcca gaatccagaa ggcctttctg

4621 ctgacaagtt tgaggtgtct gcagatagtt ctaccagtaa aaataaagaa ccaggagtgg

4681 aaaggtcatc cccttctaaa tgcccatcat tagatgatag gtggtacatg cacagttgct

4741 ctgggagtct tcagaataga aactacccat ctcaagaggg gctcattaag gttgttgatg

4801 tggaggagca acagctggaa gagtctgggc cacacgattt gacggaaaca tcttacttgc

4861 caaggcaaga tctagaggga accccttacc tggaatctgg aatcagcctc ttctctgatg

4921 accctgaatc tgatccttct gaagacagag ccccagagtc agctcgtgtt ggcaacatac

4981 catcttcaac ctctgcattg aaagttcccc aattgaaagt tgcagaatct gcccagggtc

5041 cagctgctgc tcatactact gatactgctg ggtataatgc aatggaagaa agtgtgagca

5101 gggagaagcc agaattgaca gcttcaacag aaagggtcaa caaaagaatg tccatagtgg

5161 tgtctggcct gaccccagaa gaatttatgc tcgtgtacaa gtttgccaga aaacaccaca

5221 tcactttaac taatctaatt actgaagaga ctactcatgt tgttatgaaa acagatgctg

5281 agtttgtgtg tgaacggaca ctgaaatatt ttctaggaat tgcgggagga aaatgggtag

5341 ttagctattt ctgggtgacc cagtctatta aagaaagaaa aatgctgaat gagcatgatt

5401 ttgaagtcag aggagatgtg gtcaatggaa gaaaccacca aggtccaaag cgagcaagag

5461 aatcccagga cagaaagatc ttcagggggc tagaaatctg ttgctatggg cccttcacca

5521 acatgcccac agatcaactg gaatggatgg tacagctgtg tggtgcttct gtggtgaagg

5581 agctttcatc attcaccctt ggcacaggtg tccacccaat tgtggttgtg cagccagatg

5641 cctggacaga ggacaatggc ttccatgcaa ttgggcagat gtgtgaggca cctgtggtga

5701 cccgagagtg ggtgttggac agtgtagcac tctaccagtg ccaggagctg gacacctacc

5761 tgatacccca gatcccccac agccactact gactgcagcc agccacaggt acagagccac

5821 aggaccccaa gaatgagctt aagctttcta acca

2. Provide the answers to the following questions:

a) What are the names of this sequence?

b) What is the location of this DNA sequence?

c) Who discovered this sequence and where was this data published? Cite the full journal title and year and provide a summary of the findings.