Jian-Min Chen · Claude Férec · David N. Cooper

A systematic analysis of disease-associated variants in the 3’ regulatory regions of human protein-coding genes I: General principles and overview

Online Supplementary Materials

1. Supplementary Text

An unusually short, discrete 17-bp poly(A) tail was identified in both the mature mRNA and the nuclear pre-mRNA of the Xenopus serum albumin gene (Schoenberg et al. 1989; Rao et al. 1996). This short poly(A) tail also appears to be present in a subset of human mRNAs (Gu et al. 1999; Choi and Hagedorn 2003) and may be conferred by cis-acting poly(A)-limiting elements (Das Gupta et al. 1998). Although in an in vitro model system, mRNA with <20-nt poly(A) tails imparted by the poly(A)-limiting element is translated as efficiently as mRNA species with long poly(A) tails (Peng and Schoenberg 2005), mRNA with short poly(A) tails in vivo may be intrinsically correlated with the state of gene expression (Weber et al. 2005).

References

Choi YH, Hagedorn CH (2003) Purifying mRNAs with a high-affinity eIF4E mutant identifies the short 3' poly(A) end phenotype. Proc Natl Acad Sci USA 100:7033-7038

Das Gupta J, Gu H, Chernokalskaya E, Gao X, Schoenberg DR (1998) Identification of two cis-acting elements that independently regulate the length of poly(A) on Xenopus albumin pre-mRNA. RNA 4:766-776

Gu H, Das Gupta J, Schoenberg DR (1999) The poly(A)-limiting element is a conserved cis-acting sequence that regulates poly(A) tail length on nuclear pre-mRNAs. Proc Natl Acad Sci USA 96:8943-8948

Peng J, Schoenberg DR (2005) mRNA with a <20-nt poly(A) tail imparted by the poly(A)-limiting element is translated as efficiently in vivo as long poly(A) mRNA. RNA 11:1131-1140

Rao MN, Chernokalskaya E, Schoenberg DR (1996) Regulated nuclear polyadenylation of Xenopus albumin pre-mRNA. Nucleic Acids Res 24:4078-4083

Schoenberg DR, Moskaitis JE, Smith LH Jr, Pastori RL (1989) Extranuclear estrogen-regulated destabilization of Xenopus laevis serum albumin mRNA. Mol Endocrinol 3:805-814

Weber M, Hagedorn CH, Harrison DG, Searles CD (2005) Laminar shear stress and 3' polyadenylation of eNOS mRNA. Circ Res 96:1161-1168

2. Supplementary Tables

Note: gene symbols and the source of DNA sequence data for each variant are listed. Point mutations are indicated in red below the wild-type sequences and deletion variants are barred and highlighted in red. All the variants are annotated in the context of genomic sequences. The coding sequence of the last exon is shaded; the translational termination codon is in upper case and bold; the UCPAS hexamer is in bold and highlighted in blue; the vertical line in red indicates the cleavage site wherever it was possible to ascertain it.

Table S1 Genomic sequence of the NAT1 gene*

NC_000008.9 REGION: 18111000..18127000

12721 atacttataa ccattgtatt tttacatgtt taaaatatag ccataattag cctactcaaa

12781 tccaagtgta aaagtaaaat gatttgcttt cgttttgttt tccttgctta ggggatcatg

12841 gacattgaag catatcttga aagaattggc tataagaagt ctaggaacaa attggacttg

12901 gaaacattaa ctgacattct tcaacaccag atccgagctg ttccctttga gaaccttaac

12961 atccattgtg gggatgccat ggacttaggc ttagaggcca tttttgatca agttgtgaga

13021 agaaatcggg gtggatggtg tctccaggtc aatcatcttc tgtactgggc tctgaccact

13081 attggttttg agaccacgat gttgggaggg tatgtttaca gcactccagc caaaaaatac

13141 agcactggca tgattcacct tctcctgcag gtgaccattg atggcaggaa ctacattgtc

13201 gatgctgggt ttggacgctc ataccagatg tggcagcctc tggagttaat ttctgggaag

13261 gatcagcctc aggtgccttg tgtcttccgt ttgacggaag agaatggatt ctggtatcta

13321 gaccaaatca gaagggaaca gtacattcca aatgaagaat ttcttcattc tgatctccta

13381 gaagacagca aataccgaaa aatctactcc tttactctta agcctcgaac aattgaagat

13441 tttgagtcta tgaatacata cctgcagaca tctccatcat ctgtgtttac tagtaaatca

13501 ttttgttcct tgcagacccc agatggggtt cactgtttgg tgggcttcac cctcacccat

13561 aggagattca attataagga caatacagat ctaatagagt tcaagactct gagtgaggaa

13621 gaaatagaaa aagtgctgaa aaatatattt aatatttcct tgcagagaaa gcttgtgccc

13681 aaacatggtg atagattttt tactattTAG aataaggagt aaaacaatct tgtctatttg

13741 tcatccagct caccagttat caactgacga cctatcatgt atcttctgta cccttacctt

13801 attttgaaga aaatcctaga catcaaatca tttcacctat aaaaatgtca tcatatataa

13861 ttaaacagct ttttaaagaa acataaccac aaaccttttc aaataataat aataataata

13921 ataaaaaatg tattttaaag atggcctgtg gttatcttgg aaattggtga tttatgctag

13981 aaagctttta atgttggttt attgttgaat tcctagaaaa gttttattgg tagatgagta

14041 aataaaatat tgtaaaaaaa cttattgtct ataaagtata ttaaaacatt gttggctaat

14101 ataatttgaa aaaaagtggt tttttggaag acttaggata ttatggtgct acataatttt

14161 tcctcgatgc tctcttcctc tcatctttct tgtctcttaa attgctttac ttccttgcac

* Previously assigned UCPAS is in red, whereas the newly assigned UCPAS is in blue.

Table S2 FGG, γA

K02569.1

2761 tttttaaatg gagaaaatta tgtcttttta atatggtttt tgttttgtta tatattcaca

2821 ggctggagac gttTAAaaga ccgtttcaaa agagatttac ttttttaaag gactttatct

2881 gaacagagag atataatatt tttcctattg gacaatggac ttgcaaagct tcacttcatt

2941 ttaagagcaa aagaccccat gttgaaaact ccataacagt tttatgctga tgataattta

3001 tctacatgca tttcaataaa ccttttgttt cctaagacta gatacatggt acctttattg

aTctttattg

3061 accattaaaa accaccactt tttgccaatt taccaattac aattgggcaa ccatcagtag

3121 taattgagtc ctcattttat gctaaatgtt atgcctaact ctttgggagt tacaaaggaa

3181 atagcaatta tggcttttgc cctctaggag atacaggaca aatacaggaa aatacagcaa

Table S3

CTLA4

AF411058.1

89101 taaattttaa tagctgaatc aagaaaatct cctgaggttt ataattctgt atgctgtgaa

89161 cattcatttt taaccagcta gggacccaat atgtgttgag ttctattatg gttagaagtg

89221 gcttccgtat tcctcagtag taattactgt ttctttttgt gtttgacagc taaagaaaag

89281 aagccctctt acaacagggg tctatgtgaa aatgccccca acagagccag aatgtgaaaa

89341 gcaatttcag ccttatttta ttcccatcaa tTGAgaaacc attatgaaga agagagtcca

89401 tatttcaatt tccaagagct gaggcaattc taactttttt gctatccagc tatttttatt

89461 tgtttgtgca tttgggggga attcatctct ctttaatata aagttggatg cggaacccaa

89521 attacgtgta ctacaattta aagcaaagga gtagaaagac agagctggga tgtttctgtc

89581 acatcagctc cactttcagt gaaagcatca cttgggatta atatggggat gcagcattat

89641 gatgtgggtc aaggaattaa gttagggaat ggcacagccc aaagaaggaa aaggcaggga

89701 gcgagggaga agactatatt gtacacacct tatatttacg tatgagacgt ttatagccga

89761 aatgatcttt tcaagttaaa ttttatgcct tttatttctt aaacaaatgt atgattacat

89821 caaggcttca aaaatactca catggctatg ttttagccag tgatgctaaa ggttgtattg

89881 catatataca tatatatata tatatatata tatatatata ttttaatttg atagtattgt

89941 gcatagagcc acgtatgttt ttgtgtattt gttaatggtt tgaatataaa cactatatgg

90001 cagtgtcttt ccaccttggg tcccagggaa gttttgtgga ggagctcagg acactaatac

90061 accaggtaga acacaaggtc atttgctaac tagcttggaa actggatgag gtcatagcag

90121 tgcttgattg cgtggaattg tgctgagttg gtgttgacat gtgctttggg gcttttacac

90181 cagttccttt caatggtttg caaggaagcc acagctggtg gtatctgagt tgacttgaca

90241 gaacactgtc ttgaagacaa tggcttactc caggagaccc acaggtatga ccttctagga

90301 agctccagtt cgatgggccc aattcttaca aacatgtggt taatgccatg gacagaagaa

90361 ggcagcaggt ggcagaatgg ggtgcatgaa ggtttctgaa aattaacact gcttgtgttt

90421 ttaactcaat attttccatg aaaatgcaac aacatgtata atatttttaa ttaaataaaa

90481 atctgtggtg gtcgttttcc ggagttgtct ttatcatcct tgcatttgaa tattgtgttt

90541 aaatttttga ttgattcatt cagtatctgg tggagtctcc aatattagaa atactggaaa

90601 caaactgaaa aaccacaaaa ggacaaataa tgcttcatga gtcagctttg caccagccat

90661 tacctgcaag tcattcttgg aaggtatcca tcctctttcc ttttgatttc ttcaccacta

90721 tttgggatat aacgtgggtt aacacagaca tagcagtcct ttataaatca attggcatgc

aacAtgggtt

90781 tgtttaacac aggttcttca cctccccttt cttaccgcct gctttctcag ctcaactatc

90841 acaggcatta cagttgtcat ggcaacccca atgttggcaa ccacgtccct tgcagccatt

90901 ttgatctgcc ttcctgaaat atagagcttt tccctgtggc ttccaaatga actattttgc

90961 aaatgtgggg aaaacacaca cctgtggtcc tatgttgcta tcagctggca cacctaggcc

91021 tggcacacta agccctctgt gattcttgct taaccaatgt atagtctcag cacatttggt

CYP1A1

D12525.1

1 acaatccttc tattctagcc tgcattgagc ttgcatgctt gcataagagc ttaagaaacc

61 attgatttaa tgtaataggg aaaattctaa cccaggtatc caaaaatgtg taagaacaac

121 tacctgagct aaataaagat attgttcaga aaatcctata ggtggagatt ttttgaatca

181 taaatgattc atcactcgtc taaatactca ccctgaaccc cattctgtgt tgggttttac

241 tgtagggagg aagaagagga ggtagcagtg aagaggtgta gccgctgcac ttaagcagtc

301 tgtttgaggg acaagactct attttttgag acagggtccc caggtcatcc aggctggagt

361 gcactggtac cattttgttt cactgtaacc tccacctccc gggctcacac gattctccca

tccacctccT

421 cctcagcctc tgagtagttg gggccgccag acgccaccac agcttttttt tttttttttt

481 tttttttttg tagagatggg gtttcaccat gttgcccagg ctggtctcaa actcctgagc

541 tcaagtgatc cacctgcctc agcctcccaa agtgctggga ttacaggcat gagacaagac

601 tcctaatcac tgtgctgtct tagcgccctc tctaacttat cacaaattga

FABP3

U57623.1

8281 atgctcgcaa tggtgttcct ggctcccacc ccccatctca ctctgtcttt ccttccagac

8341 actcacccac ggcactgcag tttgcactcg cacttacgag aaagaggcaT GAcctgactg

8401 cactgttgct gactactact ctgccaatcg gctacccctc gactcagcac cacattgcct

8461 catttcttcc tctgcatttt gtacaaatcc acgaattctt ctggggtcag gtgccactga

8521 ccgggatcca gttccagttc ccatggtgta tgtggttttt tttttttttt tttaactgca

8581 ctcatagggt gctctgaggt caataaagca gagccaaggc cacccagttg ccttttggcc

8641 tttggtaaca taactctggg agtcttggtt tatcctgtgt gtcagagagt gggcagaaat

8701 aacggcctga aggttactga ggaagaagca ctggatggga gactgaaatg gacagtctcg

8761 gagcctgtta atcagctgat caccttacac atttaataat aaaagagctg tacctacacg

8821 ttgcctttac actgcccccc ctccatggtc aaatgaccta gttcagtcag tgatggggct

8881 tccccaggtt tggctattga actgtcactt caggcccatc ctacactgaa agctcttggg

8941 tctggctgtt ctctgtgaaa tgctgtagtc tctccctttc cagaattcag gttcagggca

9001 cagaacccag gcttgtacca tggtggtggg agaaaatgac cactggccaa gaggactgct

9061 gacctgtgca ccaggctagt acttatgact acaaattctt actgctTctc taatcaactc

9121 tgagggaaga gggcatctga tcattacaaa agggagggct tataagtgat

KCNS3

AC093731.2

175681 caaaatccta agtttttttg ggaaaaatct gttttgtgtt gattttcttc tgaacaaaat

175741 gcataataat gtattcattt ttattgcatt tcctgctttg cttttaatgt agcccatcct

175801 cctggaaagg gcagattaaa aataagcttg gctgggcacg gcaggacaga gtgctaatat

175861 catcttgtgc tctttccagg tgcagcctga tcttcctctt ctcccttgcc agccagcact

175921 ctgccttctg tatccaccat ggtgtttggt gagtttttcc atcgccctgg acaagacgag

Translational intiatin codon

175981 gaacttgtca acctgaatgt ggggggcttt aagcagtctg ttgaccaaag caccctcctg

176041 cggtttcctc acaccagact ggggaagctg cttacttgcc attctgaaga ggccattctg

176101 gagctgtgtg atgattacag tgtggccgat aaggagtact actttgatcg gaatccctcc

176161 ttgttcagat atgttttgaa tttttattac acggggaagc tgcatgtcat ggaggagctg

176221 tgcgtattct cattctgcca ggagatcgag tactggggca tcaacgagct cttcattgat

176281 tcttgctgca gcaatcgcta ccaggaacgc aaggaggaaa accacgagaa ggactgggac

176341 cagaaaagcc atgatgtgag taccgactcc tcgtttgaag agtcgtctct gtttgagaaa

176401 gagctggaga agtttgacac actgcgattt ggtcagctcc ggaagaaaat ctggattaga

176461 atggagaatc cagcgtactg cctgtccgct aagcttatcg ctatctcctc cttgagcgtg

176521 gtgctggcct ccatcgtggc catgtgcgtt cacagcatgt cggagttcca gaatgaggat

176581 ggagaagtgg atgatccggt gctggaagga gtggagatcg cgtgcattgc ctggttcacc

176641 ggggagcttg ccgtccggct ggctgccgct ccttgtcaaa agaaattctg gaaaaaccct

176701 ctgaacatca ttgactttgt ctctattatt cccttctatg ccacgttggc tgtagacacc

176761 aaggaggaag agagtgagga tattgagaac atgggcaagg tggtccagat cctacggctt

176821 atgaggattt tccgaattct aaagcttgcc cggcactcgg taggacttcg gtctctaggt

176881 gccacactga gacacagcta ccatgaagtt gggcttctgc ttctcttcct ctctgtgggc

176941 atttccattt tctctgtgct tatctactcc gtggagaaag atgaccacac atccagcctc

177001 accagcatcc ccatctgctg gtggtgggcc accatcagca tgacaactgt gggctatgga

177061 gacacccacc cggtcacctt ggcgggaaag ctcatcgcca gcacatgcat catctgtggc

177121 atcttggtgg tggcccttcc catcaccatc atcttcaaca agttttccaa gtactaccag

177181 aagcaaaagg acattgatgt ggaccagtgc agtgaggatg caccagagaa gtgtcatgag

177241 ctaccttact ttaacattag ggatatatat gcacagcgga tgcacacctt cattaccagt

177301 ctctcttctg taggcattgt ggtgagcgat cctgactcca cagatgcttc aagcattgaa

177361 gacaatgagg acatttgtaa caccacctcc ttggagaatt gcacagcaaa aTGAgcgggg

177421 gtgtttgtgc ctgtttctct tatcctttcc cgacattagg ttaacacagc tttataaacc

177481 tcagtgggtt cgttaaaatc atttaattct cagggtgtac ctttcagcca tagttggaca

177541 ttcattgctg aattctgaaa tgatagaatt gtctttattt ttctctgtga ggtcaattaa

177601 atgccttgtt ctgaaattta ttttttacaa gagagagttg tgatatagtt tggaatataa

177661 gataaatggt attgggtggg gtttgtggct acagcttatg catcattctg tgtttgtcat

177721 ttactcacat tgagctaact ttaaattact gacaagtaga atcaaaggtg cagctgactg

177781 agacgacatg catgtaagat ccacaaaatg agacaatgca tgtaaatcca tgctcatgtt

177841 ctaaacatgg aaactaggag cctaataaac ttcctaattc agtatggaga atgtcttggt

177901 tgtatgtttt tatgttgagt aactacattt tagcatgttc aggattggtt tggaagaaaa

177961 atgttctttt tggagtacca agagcccttc ctttctttat gcttttgaat tttaggtatg

178021 taccaagcat gtgctagctc ttttttcagt agacttgatg gtagttggca gatagaagaa

178081 catgtccatg actaaattgc gctgtccagc ataaactgat taaacatgta ggtgtttggg

178141 acacactcaa aattggcata tatgaatgaa gcgtgtactc ttacaaatat ttcttcagta

178201 tatcttttga attttatact aatttatact gctaggacaa aatctcacta cctaaaaaat

178261 attgcagagc agagtaaacc tacacattca caggcaggga tactttcagt gaatggaaga

178321 agggagaaat gttgtacaaa tacacagatg tgattcatta tgaccggagg agtgaaactt

178381 aaatctctgg aaaaatctat gccagcagct ataaaatgag ccaacatttt ccaatagccg

178441 tagagatggt aatttctgca tctcatgtag aaacttctca caaagaaaga agacatacat

178501 acgcattttc agacagtggc agagcaattc tatcccatgt ggagagaatt tagctaaagt

178561 gttaattttg gagatgtgtg gagactgtca cgcacgccac caggggctgg ttatggattc

cgcaTgccac

178621 tttacctgaa gtctttccag tggaagagcc acatggattt gttatgtttg tagatgcatt

178681 tcattgaaag aagggcatta tcagatatag gtaactcttt gactcatcac tcctcttaat

178741 gtttctatgg cttcggagat ggatataact aaacagggtt tatggccact tttaaatctg

178801 gaggtttatt acatgcatgt gtgttacccc ctgctcatta aaaaaaaatc ttttctccaa

178861 aatgcatttt tgatattgga ataagtatag ttttgaggca cagcaataga atttgggatc

Table S4 SERPINA1

NC_000014.7 REGION: complement (93900000..93930000)

15241 acaacgtgtc tctgcttctc tcccctccag gccgtgcata aggctgtgct gaccatcgac

15301 gagaaaggga ctgaagctgc tggggccatg tttttagagg ccatacccat gtctatcccc

15361 cccgaggtca agttcaacaa accctttgtc ttcttaatga ttgaacaaaa taccaagtct

15421 cccctcttca tgggaaaagt ggtgaatccc acccaaaaaT AActgcctct cgctcctcaa

15481 cccctcccct ccatccctgg ccccctccct ggatgacatt aaagaagggt tgagctggtc

15541 cctgcctgca tgtgactgta aatccctccc atgttttctc tgagtctccc tttgcctgct

15601 gaggctgtat gtgggctcca ggtaacagtg ctgtcttcgg gccccctgaa ctgtgttcat

15661 ggagcatctg gctgggtagg cacatgctgg gcttgaatcc aggggggact gaatcctcag

15721 cttacggacc tgggcccatc tgtttctgga gggctccagt cttccttgtc ctgtcttgga

15781 gtccccaaga aggaatcaca ggggaggaac cagataccag ccatgacccc aggctccacc

15841 aagcatcttc atgtccccct gctcatcccc cactcccccc cacccagagt tgctcatcct

15901 gccagggctg gctgtgccca ccccaaggct gccctcctgg gggccccaga actgcctgat

15961 cgtgccgtgg cccagttttg tggcatctgc agcaacacaa gagagaggac aatgtcctcc

16021 tcttgacccg ctgtcaccta accagactcg ggccctgcac ctctcaggca cttctggaaa

16081 atgactgagg cagattcttc ctgaagccca ttctccatgg ggcaacaagg acacctattc

16141 tgtccttgtc cttccatcgc tgccccagaa agcctcacat atctccgttt agaatcaggt

16201 cccttctccc cagatgaaga ggagggtctc tgctttgttt tctctatctc ctcctcagac

16261 ttgaccaggc ccagcaggcc ccagaagacc attaccctat atcccttctc ctccctagtc

16321 acatggccat aggcctgctg atggctcagg aaggccattg caaggactcc tcagctatgg

16381 gagaggaagc acatcaccca ttgacccccg caacccctcc ctttcctcct ctgagtcccg

16441 actggggcca catgcagcct gacttctttg tgcctgttgc tgtccctgca gtcttcagag

16501 ggccaccgca gctccagtgc cacggcagga ggctgttcct gaatagcccc tgtggtaagg

16561 gccaggagag tccttccatc ctccaaggcc ctgctaaagg acacagcagc caggaagtcc

16621 cctgggcccc tagctgaagg acagcctgct ccctccgtct ctaccaggaa tggccttgtc

16681 ctatggaagg cactgcccca tcccaaacta atctaggaat cactgtctaa ccactcactg

16741 tcatgaatgt gtacttaaag gatgaggttg agtcatacca aatagtgatt tcgatagttc

OCT A

16801 aaaatggtga aattagcaat tctacatgat tcagtctaat caatggatac cgactgtttc

16861 ccacacaagt ctcctgttct cttaagctta ctcactgaca gcctttcact ctccacaaat

16921 acattaaaga tatggccatc accaagcccc ctaggatgac accagacctg agagtctgaa

16981 gacctggatc caagttctga cttttccccc tgacagctgt gtgaccttcg tgaagtcgcc

17041 aaacctctct gagccccagt cattgctagt aagacctgcc tttgagttgg tatgatgttc


Table S5 HBD

NG_000007.3

64501 atctgcctac ctcttctccg cagctcttgg gcaatgtgct ggtgtgtgtg ctggcccgca

64561 actttggcaa ggaattcacc ccacaaatgc aggctgccta tcagaaggtg gtggctggtg

64621 tggctaatgc cctggctcac aagtaccatT GAgatcctgg actgtttcct gataaccata

64681 agaagaccct atttccctag attctatttt ctgaacttgg gaacacaatg cctacttcaa

64741 gggtatggct tctgcctaat aaagaatgtt cagctcaact tcctgattaa tttcacttat

64801 ttcatttttt tgtccaggtg tgtaagaagg ttcctgaggc tctacagata gggagcactt

Aggagcactt

64861 gtttatttta caaagagtac atgggaaaag agaaaagcaa gggaaccgta caaggcatta

64921 atgggtgaca cttctacctc caaagagcag aaattatcaa gaactcttga tacaaagata

64981 atactggcac tgcagaggtt ctagggaaga cctcaaccct aagacatagc ctcaagggta

Table S6 SLC9A3R1

NC_000017.9 REGION: 70250000..70280000

26161 agggactgac tcccaacttc ctgcccccac ttctctttac agctgaattc ccaagacagc

26221 cccccaaaac aggactccac agcgccctcg tctacctcct cctccgaccc catcctagac

26281 ttcaacatct ccctggccat ggccaaagag agggcccacc agaaacgcag cagcaaacgg

26341 gccccgcaga tggactggag caagaaaaac gaactcttca gcaacctcTG Agcgccctgc

26401 tgccacccag tgactggcag ggccgagcca gcattccacc ccaccttttt ccttctcccc

26461 aattactccc ctgaatcaat gtacaaatca gcacccacat cccctttctt gacaaatgat

26521 ttttctagag aactatgttc ttccctgact ttagggaagg tgaatgtgtt cccgtcctcc

26581 cgcagtcaga aaggagactc tgcctccctc ctcctcactg agtgcctcat cctaccgggt

26641 gtccctttgc caccctgcct gggacatcgc tggaacctgc accatgccag gatcatggga

26701 ccaggcgaga gggcaccctc ccttcctccc ccatgtgata aatgggtcca gggctgatca

26761 aagaactctg actgcagaac tgccgctctc agtggacagg gcatctgtta ccctgagacc

26821 tgtggcagac acgtcttgtt ttcatttgat ttttgttaag agtgcagtat tgcagagtct

26881 agaggaattt ttgtttcctt gattaacatg attttcctgg ttgttacatc cagggcatgg

26941 cagtggcctc agccttaaac ttttgttcct actcccaccc tcagcgaact gggcagcacg

27001 gggagggttt ggctacccct gcccatccct gagccaggta ccaccattgt aaggaaacac

27061 tttcagaaat tcagctggtt cctccaaacc cttcagcctc cgtgtgttcc ttggaagttt

27121 tgtcctctgg ccttggaccc cttataggta gaaattgaga aatggtaagc caaggtggtc

27181 tttggctggg agggtggggt acactggagg gagggccatc aagggctccc tgtgacccca

27241 agcctgggta gctttagcta gagggcctag ctgcagtcct gtaggaagga agatgcatgc

27301 acccagccgg gtattcagct tggtgtggtc agtgtgcctg tgtgctgggc tgcaagcacc

RUNX1 A

27361 gattgtgggc tggggacccc ttgtctaacg gggatattta caaggggaag tgggagctca

27421 gaccaacgtt ctcagaggac tctgggaggt tcctttaatt ccagaagcgt ggaaagtgtg

27481 tcccaggatg gagctgggtt tggaatgtga ggacttggct ttactctttc tgcctatagc

27541 cagtggggtg cagaattccc aggggcaggc tgggctggtg ccagatcctc tatcttatct