Jian-Min Chen · Claude Férec · David N. Cooper
A systematic analysis of disease-associated variants in the 3’ regulatory regions of human protein-coding genes I: General principles and overview
Online Supplementary Materials
1. Supplementary Text
An unusually short, discrete 17-bp poly(A) tail was identified in both the mature mRNA and the nuclear pre-mRNA of the Xenopus serum albumin gene (Schoenberg et al. 1989; Rao et al. 1996). This short poly(A) tail also appears to be present in a subset of human mRNAs (Gu et al. 1999; Choi and Hagedorn 2003) and may be conferred by cis-acting poly(A)-limiting elements (Das Gupta et al. 1998). Although mRNA with <20-nt poly(A) tails imparted by the poly(A)-limiting element is translated in an in vitro model system as efficiently as mRNA species with long poly(A) tails (Peng and Schoenberg 2005), mRNA with short poly(A) tails in vivo may be intrinsically correlated with the state of gene expression (Weber et al. 2005).
References
Choi YH, Hagedorn CH (2003) Purifying mRNAs with a high-affinity eIF4E mutant identifies the short 3' poly(A) end phenotype. Proc Natl Acad Sci USA 100:7033-7038
Das Gupta J, Gu H, Chernokalskaya E, Gao X, Schoenberg DR (1998) Identification of two cis-acting elements that independently regulate the length of poly(A) on Xenopus albumin pre-mRNA. RNA 4:766-776
Gu H, Das Gupta J, Schoenberg DR (1999) The poly(A)-limiting element is a conserved cis-acting sequence that regulates poly(A) tail length on nuclear pre-mRNAs. Proc Natl Acad Sci USA 96:8943-8948
Peng J, Schoenberg DR (2005) mRNA with a <20-nt poly(A) tail imparted by the poly(A)-limiting element is translated as efficiently in vivo as long poly(A) mRNA. RNA 11:1131-1140
Rao MN, Chernokalskaya E, Schoenberg DR (1996) Regulated nuclear polyadenylation of Xenopus albumin pre-mRNA. Nucleic Acids Res 24:4078-4083
Schoenberg DR, Moskaitis JE, Smith LH Jr, Pastori RL (1989) Extranuclear estrogen-regulated destabilization of Xenopus laevis serum albumin mRNA. Mol Endocrinol 3:805-814
Weber M, Hagedorn CH, Harrison DG, Searles CD (2005) Laminar shear stress and 3' polyadenylation of eNOS mRNA. Circ Res 96:1161-1168
2. Supplementary Tables
Note: gene symbols and the source of DNA sequence data for each variant are listed. Point mutations are indicated in red below the wild-type sequences and deletion variants are barred and highlighted in red. All the variants are annotated in the context of genomic sequences. The coding sequence of the last exon is shaded; the translational termination codon is in upper case and bold; the UCPAS hexamer is in bold and highlighted in blue; the vertical line in red indicates the cleavage site wherever it was possible to ascertain it.
Table S1 Genomic sequence of the NAT1 gene*
NC_000008.9 REGION: 18111000..18127000
12721 atacttataa ccattgtatt tttacatgtt taaaatatag ccataattag cctactcaaa
12781 tccaagtgta aaagtaaaat gatttgcttt cgttttgttt tccttgctta ggggatcatg
12841 gacattgaag catatcttga aagaattggc tataagaagt ctaggaacaa attggacttg
12901 gaaacattaa ctgacattct tcaacaccag atccgagctg ttccctttga gaaccttaac
12961 atccattgtg gggatgccat ggacttaggc ttagaggcca tttttgatca agttgtgaga
13021 agaaatcggg gtggatggtg tctccaggtc aatcatcttc tgtactgggc tctgaccact
13081 attggttttg agaccacgat gttgggaggg tatgtttaca gcactccagc caaaaaatac
13141 agcactggca tgattcacct tctcctgcag gtgaccattg atggcaggaa ctacattgtc
13201 gatgctgggt ttggacgctc ataccagatg tggcagcctc tggagttaat ttctgggaag
13261 gatcagcctc aggtgccttg tgtcttccgt ttgacggaag agaatggatt ctggtatcta
13321 gaccaaatca gaagggaaca gtacattcca aatgaagaat ttcttcattc tgatctccta
13381 gaagacagca aataccgaaa aatctactcc tttactctta agcctcgaac aattgaagat
13441 tttgagtcta tgaatacata cctgcagaca tctccatcat ctgtgtttac tagtaaatca
13501 ttttgttcct tgcagacccc agatggggtt cactgtttgg tgggcttcac cctcacccat
13561 aggagattca attataagga caatacagat ctaatagagt tcaagactct gagtgaggaa
13621 gaaatagaaa aagtgctgaa aaatatattt aatatttcct tgcagagaaa gcttgtgccc
13681 aaacatggtg atagattttt tactattTAG aataaggagt aaaacaatct tgtctatttg
13741 tcatccagct caccagttat caactgacga cctatcatgt atcttctgta cccttacctt
13801 attttgaaga aaatcctaga catcaaatca tttcacctat aaaaatgtca tcatatataa
13861 ttaaacagct ttttaaagaa acataaccac aaaccttttc aaataataat aataataata
13921 ataaaaaatg tattttaaag atggcctgtg gttatcttgg aaattggtga tttatgctag
13981 aaagctttta atgttggttt attgttgaat tcctagaaaa gttttattgg tagatgagta
14041 aataaaatat tgtaaaaaaa cttattgtct ataaagtata ttaaaacatt gttggctaat
14101 ataatttgaa aaaaagtggt tttttggaag acttaggata ttatggtgct acataatttt
14161 tcctcgatgc tctcttcctc tcatctttct tgtctcttaa attgctttac ttccttgcac
* Previously assigned UCPAS is in red, whereas the newly assigned UCPAS is in blue.
Table S2 FGG, γA
K02569.1
2761 tttttaaatg gagaaaatta tgtcttttta atatggtttt tgttttgtta tatattcaca
2821 ggctggagac gttTAAaaga ccgtttcaaa agagatttac ttttttaaag gactttatct
2881 gaacagagag atataatatt tttcctattg gacaatggac ttgcaaagct tcacttcatt
2941 ttaagagcaa aagaccccat gttgaaaact ccataacagt tttatgctga tgataattta
3001 tctacatgca tttcaataaa ccttttgttt cctaagacta gatacatggt acctttattg
                                                            aTctttattg
3061 accattaaaa accaccactt tttgccaatt taccaattac aattgggcaa ccatcagtag
3121 taattgagtc ctcattttat gctaaatgtt atgcctaact ctttgggagt tacaaaggaa
3181 atagcaatta tggcttttgc cctctaggag atacaggaca aatacaggaa aatacagcaa
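As an illustration of how the point mutation shown beneath the wild-type sequence in Table S2 can be read, the following minimal Python sketch maps a variant block onto the numbered line above it. It assumes that the variant block lines up with exactly one 10-base block of the wild-type line and differs from it at a single position; the function names are hypothetical and are not part of the original analysis.

```python
def parse_numbered_line(line):
    """Split a GenBank-style line into (start coordinate, list of sequence blocks)."""
    fields = line.split()
    return int(fields[0]), fields[1:]


def locate_substitution(wild_type_line, variant_block):
    """Return (coordinate, reference base, variant base) for a single-base change.

    Assumption: the variant block corresponds to exactly one block of the
    wild-type line and differs from it at exactly one position.
    """
    start, blocks = parse_numbered_line(wild_type_line)
    hits = []
    offset = 0  # number of bases preceding the current block on this line
    for block in blocks:
        if len(block) == len(variant_block):
            # Compare case-insensitively; upper case only marks the changed base.
            diffs = [i for i, (ref, alt) in enumerate(zip(block, variant_block))
                     if ref.lower() != alt.lower()]
            if len(diffs) == 1:
                i = diffs[0]
                hits.append((start + offset + i, block[i], variant_block[i]))
        offset += len(block)
    if len(hits) != 1:
        raise ValueError("variant block does not match a unique wild-type block")
    return hits[0]


# Wild-type line and variant block as they appear in Table S2 (K02569.1).
wild_type = "3001 tctacatgca tttcaataaa ccttttgttt cctaagacta gatacatggt acctttattg"
print(locate_substitution(wild_type, "aTctttattg"))  # (3052, 'c', 'T')
```

Under these assumptions, the variant block matches only the last block of the line, which places the substitution at position 3052 of the listed sequence.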
Table S3
CTLA4
AF411058.1
89101 taaattttaa tagctgaatc aagaaaatct cctgaggttt ataattctgt atgctgtgaa
89161 cattcatttt taaccagcta gggacccaat atgtgttgag ttctattatg gttagaagtg
89221 gcttccgtat tcctcagtag taattactgt ttctttttgt gtttgacagc taaagaaaag
89281 aagccctctt acaacagggg tctatgtgaa aatgccccca acagagccag aatgtgaaaa
89341 gcaatttcag ccttatttta ttcccatcaa tTGAgaaacc attatgaaga agagagtcca
89401 tatttcaatt tccaagagct gaggcaattc taactttttt gctatccagc tatttttatt
89461 tgtttgtgca tttgggggga attcatctct ctttaatata aagttggatg cggaacccaa
89521 attacgtgta ctacaattta aagcaaagga gtagaaagac agagctggga tgtttctgtc
89581 acatcagctc cactttcagt gaaagcatca cttgggatta atatggggat gcagcattat
89641 gatgtgggtc aaggaattaa gttagggaat ggcacagccc aaagaaggaa aaggcaggga
89701 gcgagggaga agactatatt gtacacacct tatatttacg tatgagacgt ttatagccga
89761 aatgatcttt tcaagttaaa ttttatgcct tttatttctt aaacaaatgt atgattacat
89821 caaggcttca aaaatactca catggctatg ttttagccag tgatgctaaa ggttgtattg
89881 catatataca tatatatata tatatatata tatatatata ttttaatttg atagtattgt
89941 gcatagagcc acgtatgttt ttgtgtattt gttaatggtt tgaatataaa cactatatgg
90001 cagtgtcttt ccaccttggg tcccagggaa gttttgtgga ggagctcagg acactaatac
90061 accaggtaga acacaaggtc atttgctaac tagcttggaa actggatgag gtcatagcag
90121 tgcttgattg cgtggaattg tgctgagttg gtgttgacat gtgctttggg gcttttacac
90181 cagttccttt caatggtttg caaggaagcc acagctggtg gtatctgagt tgacttgaca
90241 gaacactgtc ttgaagacaa tggcttactc caggagaccc acaggtatga ccttctagga
90301 agctccagtt cgatgggccc aattcttaca aacatgtggt taatgccatg gacagaagaa
90361 ggcagcaggt ggcagaatgg ggtgcatgaa ggtttctgaa aattaacact gcttgtgttt
90421 ttaactcaat attttccatg aaaatgcaac aacatgtata atatttttaa ttaaataaaa
90481 atctgtggtg gtcgttttcc ggagttgtct ttatcatcct tgcatttgaa tattgtgttt
90541 aaatttttga ttgattcatt cagtatctgg tggagtctcc aatattagaa atactggaaa
90601 caaactgaaa aaccacaaaa ggacaaataa tgcttcatga gtcagctttg caccagccat
90661 tacctgcaag tcattcttgg aaggtatcca tcctctttcc ttttgatttc ttcaccacta
90721 tttgggatat aacgtgggtt aacacagaca tagcagtcct ttataaatca attggcatgc
                 aacAtgggtt
90781 tgtttaacac aggttcttca cctccccttt cttaccgcct gctttctcag ctcaactatc
90841 acaggcatta cagttgtcat ggcaacccca atgttggcaa ccacgtccct tgcagccatt
90901 ttgatctgcc ttcctgaaat atagagcttt tccctgtggc ttccaaatga actattttgc
90961 aaatgtgggg aaaacacaca cctgtggtcc tatgttgcta tcagctggca cacctaggcc
91021 tggcacacta agccctctgt gattcttgct taaccaatgt atagtctcag cacatttggt
CYP1A1
D12525.1
1 acaatccttc tattctagcc tgcattgagc ttgcatgctt gcataagagc ttaagaaacc
61 attgatttaa tgtaataggg aaaattctaa cccaggtatc caaaaatgtg taagaacaac
121 tacctgagct aaataaagat attgttcaga aaatcctata ggtggagatt ttttgaatca
181 taaatgattc atcactcgtc taaatactca ccctgaaccc cattctgtgt tgggttttac
241 tgtagggagg aagaagagga ggtagcagtg aagaggtgta gccgctgcac ttaagcagtc
301 tgtttgaggg acaagactct attttttgag acagggtccc caggtcatcc aggctggagt
361 gcactggtac cattttgttt cactgtaacc tccacctccc gggctcacac gattctccca
                                     tccacctccT
421 cctcagcctc tgagtagttg gggccgccag acgccaccac agcttttttt tttttttttt
481 tttttttttg tagagatggg gtttcaccat gttgcccagg ctggtctcaa actcctgagc
541 tcaagtgatc cacctgcctc agcctcccaa agtgctggga ttacaggcat gagacaagac
601 tcctaatcac tgtgctgtct tagcgccctc tctaacttat cacaaattga
FABP3
U57623.1
8281 atgctcgcaa tggtgttcct ggctcccacc ccccatctca ctctgtcttt ccttccagac
8341 actcacccac ggcactgcag tttgcactcg cacttacgag aaagaggcaT GAcctgactg
8401 cactgttgct gactactact ctgccaatcg gctacccctc gactcagcac cacattgcct
8461 catttcttcc tctgcatttt gtacaaatcc acgaattctt ctggggtcag gtgccactga
8521 ccgggatcca gttccagttc ccatggtgta tgtggttttt tttttttttt tttaactgca
8581 ctcatagggt gctctgaggt caataaagca gagccaaggc cacccagttg ccttttggcc
8641 tttggtaaca taactctggg agtcttggtt tatcctgtgt gtcagagagt gggcagaaat
8701 aacggcctga aggttactga ggaagaagca ctggatggga gactgaaatg gacagtctcg
8761 gagcctgtta atcagctgat caccttacac atttaataat aaaagagctg tacctacacg
8821 ttgcctttac actgcccccc ctccatggtc aaatgaccta gttcagtcag tgatggggct
8881 tccccaggtt tggctattga actgtcactt caggcccatc ctacactgaa agctcttggg
8941 tctggctgtt ctctgtgaaa tgctgtagtc tctccctttc cagaattcag gttcagggca
9001 cagaacccag gcttgtacca tggtggtggg agaaaatgac cactggccaa gaggactgct
9061 gacctgtgca ccaggctagt acttatgact acaaattctt actgctTctc taatcaactc
9121 tgagggaaga gggcatctga tcattacaaa agggagggct tataagtgat
KCNS3
AC093731.2
175681 caaaatccta agtttttttg ggaaaaatct gttttgtgtt gattttcttc tgaacaaaat
175741 gcataataat gtattcattt ttattgcatt tcctgctttg cttttaatgt agcccatcct
175801 cctggaaagg gcagattaaa aataagcttg gctgggcacg gcaggacaga gtgctaatat
175861 catcttgtgc tctttccagg tgcagcctga tcttcctctt ctcccttgcc agccagcact
175921 ctgccttctg tatccaccat ggtgtttggt gagtttttcc atcgccctgg acaagacgag
Translational initiation codon
175981 gaacttgtca acctgaatgt ggggggcttt aagcagtctg ttgaccaaag caccctcctg
176041 cggtttcctc acaccagact ggggaagctg cttacttgcc attctgaaga ggccattctg
176101 gagctgtgtg atgattacag tgtggccgat aaggagtact actttgatcg gaatccctcc
176161 ttgttcagat atgttttgaa tttttattac acggggaagc tgcatgtcat ggaggagctg
176221 tgcgtattct cattctgcca ggagatcgag tactggggca tcaacgagct cttcattgat
176281 tcttgctgca gcaatcgcta ccaggaacgc aaggaggaaa accacgagaa ggactgggac
176341 cagaaaagcc atgatgtgag taccgactcc tcgtttgaag agtcgtctct gtttgagaaa
176401 gagctggaga agtttgacac actgcgattt ggtcagctcc ggaagaaaat ctggattaga
176461 atggagaatc cagcgtactg cctgtccgct aagcttatcg ctatctcctc cttgagcgtg
176521 gtgctggcct ccatcgtggc catgtgcgtt cacagcatgt cggagttcca gaatgaggat
176581 ggagaagtgg atgatccggt gctggaagga gtggagatcg cgtgcattgc ctggttcacc
176641 ggggagcttg ccgtccggct ggctgccgct ccttgtcaaa agaaattctg gaaaaaccct
176701 ctgaacatca ttgactttgt ctctattatt cccttctatg ccacgttggc tgtagacacc
176761 aaggaggaag agagtgagga tattgagaac atgggcaagg tggtccagat cctacggctt
176821 atgaggattt tccgaattct aaagcttgcc cggcactcgg taggacttcg gtctctaggt
176881 gccacactga gacacagcta ccatgaagtt gggcttctgc ttctcttcct ctctgtgggc
176941 atttccattt tctctgtgct tatctactcc gtggagaaag atgaccacac atccagcctc
177001 accagcatcc ccatctgctg gtggtgggcc accatcagca tgacaactgt gggctatgga
177061 gacacccacc cggtcacctt ggcgggaaag ctcatcgcca gcacatgcat catctgtggc
177121 atcttggtgg tggcccttcc catcaccatc atcttcaaca agttttccaa gtactaccag
177181 aagcaaaagg acattgatgt ggaccagtgc agtgaggatg caccagagaa gtgtcatgag
177241 ctaccttact ttaacattag ggatatatat gcacagcgga tgcacacctt cattaccagt
177301 ctctcttctg taggcattgt ggtgagcgat cctgactcca cagatgcttc aagcattgaa
177361 gacaatgagg acatttgtaa caccacctcc ttggagaatt gcacagcaaa aTGAgcgggg
177421 gtgtttgtgc ctgtttctct tatcctttcc cgacattagg ttaacacagc tttataaacc
177481 tcagtgggtt cgttaaaatc atttaattct cagggtgtac ctttcagcca tagttggaca
177541 ttcattgctg aattctgaaa tgatagaatt gtctttattt ttctctgtga ggtcaattaa
177601 atgccttgtt ctgaaattta ttttttacaa gagagagttg tgatatagtt tggaatataa
177661 gataaatggt attgggtggg gtttgtggct acagcttatg catcattctg tgtttgtcat
177721 ttactcacat tgagctaact ttaaattact gacaagtaga atcaaaggtg cagctgactg
177781 agacgacatg catgtaagat ccacaaaatg agacaatgca tgtaaatcca tgctcatgtt
177841 ctaaacatgg aaactaggag cctaataaac ttcctaattc agtatggaga atgtcttggt
177901 tgtatgtttt tatgttgagt aactacattt tagcatgttc aggattggtt tggaagaaaa
177961 atgttctttt tggagtacca agagcccttc ctttctttat gcttttgaat tttaggtatg
178021 taccaagcat gtgctagctc ttttttcagt agacttgatg gtagttggca gatagaagaa
178081 catgtccatg actaaattgc gctgtccagc ataaactgat taaacatgta ggtgtttggg
178141 acacactcaa aattggcata tatgaatgaa gcgtgtactc ttacaaatat ttcttcagta
178201 tatcttttga attttatact aatttatact gctaggacaa aatctcacta cctaaaaaat
178261 attgcagagc agagtaaacc tacacattca caggcaggga tactttcagt gaatggaaga
178321 agggagaaat gttgtacaaa tacacagatg tgattcatta tgaccggagg agtgaaactt
178381 aaatctctgg aaaaatctat gccagcagct ataaaatgag ccaacatttt ccaatagccg
178441 tagagatggt aatttctgca tctcatgtag aaacttctca caaagaaaga agacatacat
178501 acgcattttc agacagtggc agagcaattc tatcccatgt ggagagaatt tagctaaagt
178561 gttaattttg gagatgtgtg gagactgtca cgcacgccac caggggctgg ttatggattc
                                        cgcaTgccac
178621 tttacctgaa gtctttccag tggaagagcc acatggattt gttatgtttg tagatgcatt
178681 tcattgaaag aagggcatta tcagatatag gtaactcttt gactcatcac tcctcttaat
178741 gtttctatgg cttcggagat ggatataact aaacagggtt tatggccact tttaaatctg
178801 gaggtttatt acatgcatgt gtgttacccc ctgctcatta aaaaaaaatc ttttctccaa
178861 aatgcatttt tgatattgga ataagtatag ttttgaggca cagcaataga atttgggatc
Table S4 SERPINA1
NC_000014.7 REGION: complement (93900000..93930000)
15241 acaacgtgtc tctgcttctc tcccctccag gccgtgcata aggctgtgct gaccatcgac
15301 gagaaaggga ctgaagctgc tggggccatg tttttagagg ccatacccat gtctatcccc
15361 cccgaggtca agttcaacaa accctttgtc ttcttaatga ttgaacaaaa taccaagtct
15421 cccctcttca tgggaaaagt ggtgaatccc acccaaaaaT AActgcctct cgctcctcaa
15481 cccctcccct ccatccctgg ccccctccct ggatgacatt aaagaagggt tgagctggtc
15541 cctgcctgca tgtgactgta aatccctccc atgttttctc tgagtctccc tttgcctgct
15601 gaggctgtat gtgggctcca ggtaacagtg ctgtcttcgg gccccctgaa ctgtgttcat
15661 ggagcatctg gctgggtagg cacatgctgg gcttgaatcc aggggggact gaatcctcag
15721 cttacggacc tgggcccatc tgtttctgga gggctccagt cttccttgtc ctgtcttgga
15781 gtccccaaga aggaatcaca ggggaggaac cagataccag ccatgacccc aggctccacc
15841 aagcatcttc atgtccccct gctcatcccc cactcccccc cacccagagt tgctcatcct
15901 gccagggctg gctgtgccca ccccaaggct gccctcctgg gggccccaga actgcctgat
15961 cgtgccgtgg cccagttttg tggcatctgc agcaacacaa gagagaggac aatgtcctcc
16021 tcttgacccg ctgtcaccta accagactcg ggccctgcac ctctcaggca cttctggaaa
16081 atgactgagg cagattcttc ctgaagccca ttctccatgg ggcaacaagg acacctattc
16141 tgtccttgtc cttccatcgc tgccccagaa agcctcacat atctccgttt agaatcaggt
16201 cccttctccc cagatgaaga ggagggtctc tgctttgttt tctctatctc ctcctcagac
16261 ttgaccaggc ccagcaggcc ccagaagacc attaccctat atcccttctc ctccctagtc
16321 acatggccat aggcctgctg atggctcagg aaggccattg caaggactcc tcagctatgg
16381 gagaggaagc acatcaccca ttgacccccg caacccctcc ctttcctcct ctgagtcccg
16441 actggggcca catgcagcct gacttctttg tgcctgttgc tgtccctgca gtcttcagag
16501 ggccaccgca gctccagtgc cacggcagga ggctgttcct gaatagcccc tgtggtaagg
16561 gccaggagag tccttccatc ctccaaggcc ctgctaaagg acacagcagc caggaagtcc
16621 cctgggcccc tagctgaagg acagcctgct ccctccgtct ctaccaggaa tggccttgtc
16681 ctatggaagg cactgcccca tcccaaacta atctaggaat cactgtctaa ccactcactg
16741 tcatgaatgt gtacttaaag gatgaggttg agtcatacca aatagtgatt tcgatagttc
OCT A
16801 aaaatggtga aattagcaat tctacatgat tcagtctaat caatggatac cgactgtttc
16861 ccacacaagt ctcctgttct cttaagctta ctcactgaca gcctttcact ctccacaaat
16921 acattaaaga tatggccatc accaagcccc ctaggatgac accagacctg agagtctgaa
16981 gacctggatc caagttctga cttttccccc tgacagctgt gtgaccttcg tgaagtcgcc
17041 aaacctctct gagccccagt cattgctagt aagacctgcc tttgagttgg tatgatgttc
Table S5 HBD
NG_000007.3
64501 atctgcctac ctcttctccg cagctcttgg gcaatgtgct ggtgtgtgtg ctggcccgca
64561 actttggcaa ggaattcacc ccacaaatgc aggctgccta tcagaaggtg gtggctggtg
64621 tggctaatgc cctggctcac aagtaccatT GAgatcctgg actgtttcct gataaccata
64681 agaagaccct atttccctag attctatttt ctgaacttgg gaacacaatg cctacttcaa
64741 gggtatggct tctgcctaat aaagaatgtt cagctcaact tcctgattaa tttcacttat
64801 ttcatttttt tgtccaggtg tgtaagaagg ttcctgaggc tctacagata gggagcactt
                                                             Aggagcactt
64861 gtttatttta caaagagtac atgggaaaag agaaaagcaa gggaaccgta caaggcatta
64921 atgggtgaca cttctacctc caaagagcag aaattatcaa gaactcttga tacaaagata
64981 atactggcac tgcagaggtt ctagggaaga cctcaaccct aagacatagc ctcaagggta
Table S6 SLC9A3R1
NC_000017.9 REGION: 70250000..70280000
26161 agggactgac tcccaacttc ctgcccccac ttctctttac agctgaattc ccaagacagc
26221 cccccaaaac aggactccac agcgccctcg tctacctcct cctccgaccc catcctagac
26281 ttcaacatct ccctggccat ggccaaagag agggcccacc agaaacgcag cagcaaacgg
26341 gccccgcaga tggactggag caagaaaaac gaactcttca gcaacctcTG Agcgccctgc
26401 tgccacccag tgactggcag ggccgagcca gcattccacc ccaccttttt ccttctcccc
26461 aattactccc ctgaatcaat gtacaaatca gcacccacat cccctttctt gacaaatgat
26521 ttttctagag aactatgttc ttccctgact ttagggaagg tgaatgtgtt cccgtcctcc
26581 cgcagtcaga aaggagactc tgcctccctc ctcctcactg agtgcctcat cctaccgggt
26641 gtccctttgc caccctgcct gggacatcgc tggaacctgc accatgccag gatcatggga
26701 ccaggcgaga gggcaccctc ccttcctccc ccatgtgata aatgggtcca gggctgatca
26761 aagaactctg actgcagaac tgccgctctc agtggacagg gcatctgtta ccctgagacc
26821 tgtggcagac acgtcttgtt ttcatttgat ttttgttaag agtgcagtat tgcagagtct
26881 agaggaattt ttgtttcctt gattaacatg attttcctgg ttgttacatc cagggcatgg
26941 cagtggcctc agccttaaac ttttgttcct actcccaccc tcagcgaact gggcagcacg
27001 gggagggttt ggctacccct gcccatccct gagccaggta ccaccattgt aaggaaacac
27061 tttcagaaat tcagctggtt cctccaaacc cttcagcctc cgtgtgttcc ttggaagttt
27121 tgtcctctgg ccttggaccc cttataggta gaaattgaga aatggtaagc caaggtggtc
27181 tttggctggg agggtggggt acactggagg gagggccatc aagggctccc tgtgacccca
27241 agcctgggta gctttagcta gagggcctag ctgcagtcct gtaggaagga agatgcatgc
27301 acccagccgg gtattcagct tggtgtggtc agtgtgcctg tgtgctgggc tgcaagcacc
RUNX1 A
27361 gattgtgggc tggggacccc ttgtctaacg gggatattta caaggggaag tgggagctca
27421 gaccaacgtt ctcagaggac tctgggaggt tcctttaatt ccagaagcgt ggaaagtgtg
27481 tcccaggatg gagctgggtt tggaatgtga ggacttggct ttactctttc tgcctatagc
27541 cagtggggtg cagaattccc aggggcaggc tgggctggtg ccagatcctc tatcttatct