Supplementary file S1

(A) Alignment of PMEPA1-NKILA promoter region sequences of representative animals.

A 1250-nucleotide human sequence from Ensembl (GRCh38.p2) is aligned with the corresponding Ensembl database sequences for representative animals. From top to bottom these are: zebrafish (Danio rerio, dataset Zv9), coelacanth (Latimeria chalumnae, LatCha1), turtle (Chrysemys picta bellii, ChrPicBel3.0.1), opossum (Monodelphis domestica, monDom5), cattle (Bos taurus, UMD3.1), mouse (Mus musculus, GRCm38.p3), lemur (Otolemur garnettii, OtoGar3), and chimpanzee (Pan troglodytes, CHIMP2.1.4). Letters shown below the alignment denote, in reverse orientation, the amino acids encoded by the human PMEPA1-001 and -002 open reading frames; each letter is placed below the 2nd nucleotide position of its codon. For sequences with no clear similarity to the respective human NKILA region, the 3' ends of the compared sequences were chosen somewhat arbitrarily. The sequence alignment was made with the Muscle program supplied within the GENETYX Ver.12 software package (Genetyx Corp., Tokyo, Japan) and slightly modified by hand. Intron border positions are indicated by downward triangles, transcriptional start points by arrows, and orientations of transcripts by (+) or (-) symbols. Only two regions are well conserved across all compared species: the coding part of human PMEPA1-001 and parts of the intergenic promoter region (shaded gray). Among the conserved promoter motifs are typical cAMP response element (CRE) and SMAD binding element (SBE) motifs (boxed), which bind cAMP response element-binding protein (CREB) (Conkright et al., 2003, Mol. Cell 11, 1101-8) and SMAD (Hua et al., 1999, P.N.A.S. 96, 13130-5) transcription factors, respectively. The finding of SMAD binding elements is consistent with the reported upregulation of PMEPA1 by TGF-β (e.g. Nakano et al., 2014, J. Biol. Chem. 289, 12680-92). The NF-κB binding motif (boxed) identified by Liu et al. is well conserved among eutherian mammals, but not beyond.
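As a rough check on the reverse-orientation amino acid letters described above, the first human row of the alignment can be reverse-complemented and translated. The following is a minimal Python sketch and not the procedure used to prepare the figure: the function names are hypothetical, the codon table is the standard genetic code, and the reading-frame offset of 1 was chosen by inspection of this one row and is an assumption.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an ungapped DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq, frame=0):
    """Translate complete codons of seq, starting at offset `frame`."""
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE[c] for c in codons)


# First human row of panel (A), positions 1-57, copied with its gap characters.
human_row = "GGTCACTCACTGATCTCCATGCTCTGGAACAAA---GAGCGTTTGCAGTTGCACGTGCAG"

ungapped = human_row.replace("-", "")
coding_strand = reverse_complement(ungapped)

# Assumption: frame offset 1; only the first 15 codons lie before the
# intron border, so the translation is truncated there.
peptide = translate(coding_strand, frame=1)[:15]

print(peptide)        # CTCNCKRSLFQSMEI
print(peptide[::-1])  # IEMSQFLSRKCNCTC (left-to-right order under the row)
```

Read in reverse, the peptide corresponds to the letters printed under this human row; the codons beyond the intron border are non-coding and are dropped here by the truncation.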
For the discussion of the extent to which NKILA equivalents are conserved through evolution, the definition of this gene is relevant. If, for example, functional NKILA were defined by the hairpin motifs that Liu et al. identified, then conservation would most likely be restricted to closely related primates only (in the figure below, the human NKILA hairpin A and B regions are underlined). However, in mice there is an orthologous, though spliced, transcript with a starting point similar to that of human NKILA (GenBank EST BY735079; not shown in the figure), and despite the minimal sequence similarity this can probably be considered a murine variant of an “NKILA” transcript. Although “NKILA-like” transcripts can be found for a few more eutherian mammals (not shown), database information is probably not extensive enough to determine whether they are common among eutherian mammals. When studying database information beyond eutherian mammals, we could not find conservation of transcripts orthologous to NKILA. In summary, we feel that, from an evolutionary point of view, the sequences compared in this figure primarily concern an ancient PMEPA1 promoter region, and that the NF-κB binding motif and NKILA transcription should be studied within that context. Interestingly, noncoding antisense transcripts have also been reported for a paralogue of PMEPA1, C18ORF1 (alias LDLRAD4) (see the Ensembl database).
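The conserved CRE mentioned in this legend can be located in individual alignment rows with a simple string search. The sketch below is illustrative only: it assumes the palindromic octamer TGACGTCA as the CRE consensus (whether this matches the boxed region exactly is an assumption), the function name is hypothetical, and the three rows are copied from the CRE block of panel (A) together with their starting position numbers.

```python
# Assumed full-site CRE consensus (palindromic octamer).
CRE = "TGACGTCA"

# species -> (position number of the first nucleotide in the row, gapped row)
ROWS = {
    "Zebrafish": (409, "ACAGCG-TGACGTCATTGCCAGACTTGGCC-----TTTCG--CAGAGCGT---GCGAGCC"),
    "Coelacanth": (364, "ACAGTCATGACGTCACTGCCAGACTGGGCCAGCCCCTCCCA-GGCACTGCAAAGTGGGC-"),
    "Human": (543, "ACGGTC-TGACGTCAGCGCTAGAC-GGGGCTGCCGGTTCC--CACCGCGC---GGGGGCC"),
}


def motif_positions(start, gapped_row, motif):
    """Return sequence positions (figure numbering) where motif begins."""
    seq = gapped_row.replace("-", "")  # numbering counts nucleotides, not gaps
    hits = []
    i = seq.find(motif)
    while i != -1:
        hits.append(start + i)
        i = seq.find(motif, i + 1)
    return hits


cre_hits = {
    species: motif_positions(start, row, CRE)
    for species, (start, row) in ROWS.items()
}
print(cre_hits)
# {'Zebrafish': [415], 'Coelacanth': [371], 'Human': [549]}
```

Positions are reported in each species' own sequence numbering, so they differ between species even though the motif occupies the same alignment columns.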

(B) Alignment of deduced PMEPA1 amino acid sequences for the species compared in (A). This figure shows the substantial conservation of PMEPA1 through evolution. Basic residues are indicated in red and acidic ones in blue; green residues are more hydrophilic than orange ones (Hopp and Woods, 1981, P.N.A.S. 78, 3824-8); cysteines are in purple. The depicted sequences were aligned by hand and are based on: zebrafish, GenBank AL925928 combined with Ensembl genomic sequence; coelacanth, GenBank XP_005990654; turtle, XP_005305886; opossum, XM_007475673; cattle, our prediction from Ensembl genomic sequence, corrected for some nucleotide identities by comparison with GenBank EST sequences; mouse, GenBank NP_075371; lemur, our prediction from Ensembl genomic sequence; chimpanzee, GenBank XP_009435732; human, PMEPA1-001 from Ensembl.
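One simple way to put a number on the conservation visible in panel (B) is pairwise percent identity over aligned columns. The following is a minimal sketch under stated assumptions: the function name is hypothetical, columns where both rows carry a gap are ignored, and the rows are the human, chimpanzee and mouse lines of the block starting at human residue 178. It requires rows of equal column count and was not part of the original analysis.

```python
def percent_identity(row_a, row_b, gap="-"):
    """Percent identical columns between two aligned rows of equal length.

    Columns where both rows have a gap are skipped; a gap opposite a
    residue counts as a mismatch.
    """
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same number of columns")
    compared = 0
    matches = 0
    for a, b in zip(row_a, row_b):
        if a == gap and b == gap:
            continue
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


# Rows copied from panel (B), block beginning at human residue 178.
human = "LNRESVRAPPNRTIFDSDLMDSARLGGPCPPSSNSGISATCYGSGGRMEGPPPTYSEVIG"
chimp = "LNRESVRAPPNRTIFDSDLMDSARLGGPCPPSSNSGISATCYGSGGRMEGPPPTYSEVIG"
mouse = "LNRESVRAPPNRTIFDSDLIDSTMLGGPCPPSSNSGISATCYSSGGRMEGPPPTYSEVIG"

print(round(percent_identity(human, chimp), 1))  # 100.0
print(round(percent_identity(human, mouse), 1))  # 93.3
```

These values describe only the 60-column block shown and should not be read as identity figures for the full-length proteins.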

(A)

intron border PMEPA

PMEPA1(-)

Zebrafish 1:AGATACTCACAGATCGCCATACTCTGGAATGAAGTGGATCTCTGGCAGTTACATGTACAG 60

Coelacanth 1:ATTTACTTACTGATCTCCATACTCGGGAACAAA---GACCTTTTACAGTTACATGTGCAG 57

Turtle 1:AAGTACTTACTGATCTCCATAGTCTGGAACAAA---GACCTTTTGCAGTTGCATGTGCAG 57

Opossum 1:GGATACTCACTGATCTCCATGCTCTGGAACAAA---GACCTTTTGCAGTTGCATGTGCAG 57

Cattle 1:GGTCACTCACTGATCTCCATGCTCTGGAACAAA---GAGCGTTTGCAGTTGCACGTGCAG 57

Mouse 1:GCACACTCACTGATCTCCATGCTGGGGAACAAA---GAGCGCTGGCAGTTGCACGCGCAG 57

Lemur 1:GGTCACTCACTGATCTCCATGCTCTGGAACAAA---GAGCGTTTGCAGTTGCACGTGCAG 57

Chimpanzee 1:GGTCACTCACTGATCTCCATGCTCTGGAACAAA---GAGCGTTTGCAGTTGCACGTGCAG 57

Human 1:GGTCACTCACTGATCTCCATGCTCTGGAACAAA---GAGCGTTTGCAGTTGCACGTGCAG 57

Human aa (rev) I E M S Q F L S R K C N C T C S

PMEPA1(-)

Zebrafish 61:GAGACATTGGCGAGGG------TGTCGGTGGTTCCGTTCGTCAGACCCATGAAG 108

Coelacanth 58:GAGACATTGGGCTGAA------TGGCTGCTGTTGTGCTGTTCAGACCCATCAAG 105

Turtle 58:GAGACATTGGGCTGTG------TGGCTGCGGCTGTGCTGTTCACACCCATCAAG 105

Opossum 58:GACACATTGGGCTGCC------CGGCGGCCGCGGTGCTGTTGAGACCCATTAAG 105

Cattle 58:GAGACATTGGGCTGCCCAGCGGCGGCGGCGGCGGCGGCGGTGCTGTTGACCCCCATCAAG 117

Mouse 58:GAGACATTGGGCTGCC------CGGCGGCGGCGGCGGCGGTGCCGTTGACCCCCATCAAG 111

Lemur 58:GAGACATTGGGCTGCC------CGGCGGCGGCGGCGGCGGTGCTGTTGAACCCCATCAAG 111

Chimpanzee 58:GAGACATTGGGCTGCC------CGGCGGCGGCGGCGGCGGTGCTGTTGACCCCCATCAAG 111

Human 58:GAGACATTGGGCTGCC------CGGCGGCGGCGGCGGCGGTGCTGTTGACCCCCATCAAG 111

Human aa (rev) S V N P Q G A A A A A T S N V G M L

PMEPA1(-)

Zebrafish 109:CTGAACATTGAGGAAATCG------ATGTGAAAAACGGACGGTTTGTCGGTTGGTTAGT 161

Coelacanth 106:TTGAACATTTAAAAAAAAA---GATATTACTAGAGATAATATGTTTAAACCAAAAAAAAA 162

Turtle 106:TGATACATTTAAAAAAAGG----GGAGAGCGAAGAACGGA------AAGTCAGC 149

Opossum 106:TTATACATTTAAAAAGGAG------GGAGAGGAAAACGGGGGACTTG-----GAGAAGAG 154

Cattle 118:CGGTGCATGGACGGCGCGGCGGAGCGGCGCGGGGCGCAGGGGGCTCG-----GGGGCGGC 172

Mouse 112:CGGT-CACGGGCGGCGCGG------GGCGCGAGGCGCGGGGCG------147

Lemur 112:CGGTGCATGGACGGCGTGGCGGCGCGGCACAGGACGCGGGGGGCGCG-----GGGACGGC 166

Chimpanzee 112:CGGTGCATGGACGGCGCGGCGGCGCGGCGCGGGGCGCGGGGGGCGCG-----GGGGCGGC 166

Human 112:CGGTGCATGGACGGCGCGGCGGCGCGGCGCGGGGCGCGGGGGGCTCG-----GGGGCGGC 166

Human aa (rev) R H M

PMEPA1(-)

Zebrafish 162:CGGTCGCCTCAGCGCGCGTTGTCGAATAAAAACATTGACGCAGGCTTTTCCTATTCTGTG 221

Coelacanth 163:CA-----CTTCTCAAGTAAGTCCAAATAAAAACAACTAAGAAGTCT------GAG 206

Turtle 150:CGAG---CTCTCCGGGCAGCACCACATAAAG------CCGGGCTCCTCCTG----GGG 194

Opossum 155:CGAGGGGGGCTCCGGGAGGCACCACATAAAGGAAGATCCTGCGGCTGCCGCTGCTTCAGC 214

Cattle 173:CGGGGGGGGCTCCGGCCGGCGCCCGGCG------CTCGGGCCCCGCATGCAG-GAG 221

Mouse 148:CGGGGGCGGCTCCGGCCGGCGC---GCGGGG------CTCGGGCCCCGCATACAGCGAG 197

Lemur 167:CGGGGGGGGCTCCGGCCGGCGCC---CGGAG------CTCGGGCCCCGCATGCAG-GAG 215

Chimpanzee 167:CGGGGGGGGCTCCGGCCGGCGCC---CGGAG------CTGGGGCCCCGCATGCAG-GAG 215

Human 167:CGGGGGGGGCTCCGGCCGGCGCC---CGGAG------CTGGGGCCCCGCATGCAG-GAG 215

PMEPA1(-)

Zebrafish 222:TC------TACCGTCCT------232

Coelacanth 207:TGAATGGAAGAAAGGAAGGA------226

Turtle 195:CGGGGGGGAGAGGGAACGGCGAACCACTCCTAAGG------229

Opossum 215:CCGGAGAGGGAAGAGAAGGCAATGCGCTTTTCTCGTTTAAAAAAAAAAATCAATTCTGGA 274

Cattle 222:GCGCGCGGCGGGGAAGGCGCGCCCCGGCTAGCCGGGCT------259

Mouse 198:CAGCGCGGCG------CGGGCT------213

Lemur 216:GCGCGCGACGGGGGAGGCGCGCCCCGGCTCTCCGGGCT------253

Chimpanzee 216:GCGCGCGGCGGGGGAGTCGCGCCCCGGCTCGCCGGGCT------253

Human 216:GCGCGCGGCGGGGGAGGCGCGCCCCGGCTCGCCGGGCT------253

PMEPA1(-)

Zebrafish 233:------CTAACCGCCGTCCCCC------248

Coelacanth 226:------226

Turtle 230:------CAGCCCCCCGAC------241

Opossum 275:GAAATTAAGTTATAAAACAAAACACCCCCCCCCCCAAAAAAAAAAACAGTGGAGGGAAGA 334

Cattle 260:------CCGGGCTCTGCC------AAGTTACCGGGG 283

Mouse 214:------CGGGGCGCCGCC------GAGGTCCC---- 233

Lemur 254:------CGGGTCGCAGCC------AAGTTCCCGAGG 277

Chimpanzee 254:------CGGGTCGCCGCC------AAGTTCCCGGGG 277

Human 254:------CGGGTCGCCGCC------AAGTTCCCGGGG 277

PMEPA1(-)

Zebrafish 249:------CGCCCCCGGCCCGCT---- 263

Coelacanth 227:------CTTGTTTCCCC--GT 239

Turtle 242:-----GGAAAGACAAGGAGACCCGAGCCGGGCGCTGCTGGCTTCCCTCCCCTCCCC---C 293

Opossum 335:AACAGTCAAACTGGGGGAGGACCTAACTGCTCTTTCTTCCCTCCCCTCGGTTTCCCTTCT 394

Cattle 284:CGCCGTGGGACTCAGTGCT--CGGGGCCGCGCTCCTCTGCGCTCCCCCGGCTGCCC--CT 339

Mouse 234:-----CGAGGCGAGCTGGG------TCCCCCGGCCGCCG---- 261

Lemur 278:TGCCGCGGGGCTCAGTGCG--CCGGGCCTCGCTCCTCTGCGC-CCCTCGGCCTCCC--CT 332

Chimpanzee 278:CGCCGCGGGGCTCAGTGCG--CGGGACCGCGCTCCGCTGCGCCCCCCCGGCCTCCC--CT 333

Human 278:CGCCGCGGGGCTCAGTGCG--CGGGACCGCGCTCCGCTGCGCCCCCCCGGCCTCCC--CT 333

PMEPA1(-)

Zebrafish 264:------GTCTGGC 270

Coelacanth 240:TAGCAGAAAGCCAGTGAG------GAAATGGAGTTGC 270

Turtle 294:CGCCAG---CCCGGGAAGTTTGCTCAGGGCTGG------GTGAAACAGACTTGC 338

Opossum 395:CTCCAGTCTCTCGGTAAGTTTCCCTTGCCCCAGCGCCGG------CGGAAAGCGCCTAGC 448

Cattle 340:CCGCAG---CCCCGGGGGCGTCCGCAGTGCCCGCGGGTGGCGTCCGGGAAATGGGCTGGC 396

Mouse 262:-AGCGGGAGCCCCGGGGCGGCGGGCCGTGGCCGCGGGCGGCGTGCGGGGCACGGGCGGGC 320

Lemur 333:CGGCTG---CCCCGGGGGCGTCGGCAGTGCCAATGGGTGGCGTCCGGAAAATGGGCTGGC 389

Chimpanzee 334:CGGCAG---CCCCGGGGGCGTCGGCAGTGCCCGCGGGTGGCGTCCGGAAAATGGGCTGGC 390

Human 334:CGGCAG---CCCCGGGGGCGTCGGCAGTGCCCGCGGGTGGCGTCCGGAAAATGGGCTGGC 390

PMEPA1(-)

Zebrafish 271:CGCTGTGC---TCCAGTATTCCCGTT------ATTCCCAGACGGTTTGTACTACTTTCC 320

Coelacanth 271:TACT------TGTTTCTCTGCCTGGG------CTCTTAGCGGCTTT---- 304

Turtle 339:TACTTTGTTTCTCTCTCGCTCCTGCCCAGG------CAGGCTCCTAGCGGCTCCCCCC 390

Opossum 449:TACTACTCTGTTTCTGCG---CTGGCCGGACGGAGCGACGGACTCCCGGCGGCGGCTTTC 505

Cattle 397:AGCCGGG----GCGCGCGCTGCCGGCGGGGCTGAGCCTCTG-CTGCTAGCTTCCCCCAGC 451

Mouse 321:GGCCGGG------GCACGGGGCCGCC-GGGCTCAGCCTGCG-CGCCGGCCGATCCGCAGC 372

Lemur 390:AGCGGGG-----CGCGCGCTGCCGCC-GGGCTGAGCCTCCG-CCGCTAGC-TTCCCTAGC 441

Chimpanzee 391:AGCGGGG-----CGCGCGCTGCCGCCGGGGCTGAGCCTCTG-CCGCTAGCTTTCCCCAGC 444

Human 391:AGCGGGG-----CGCGCGCTGCCGCCGGGGCTGAGCCTCTG-CCGCTAGCTTTCCCCAGC 444

human PMEPA1-001 transcript start

conserved motif with unknown function

Zebrafish 321:--AGCGTCTGCGC------G------CACCTCATTCAAGTCCAAGTTGG 355

Coelacanth 305:------CCCCTCATTCAAGTCCAAGGAGA 327

Turtle 391:------TCCCCGCCCCCTTCG------CTCCTCCTCATTCAAGTCCAAGGAGA 431

Opossum 506:CCAGCCACTCCGCCTCCTCCG------CCTCCTCCGCCTCCTCCTTCAAGTCCAAGGAGA 559

Cattle 452:CGAGCGCCTCCGCCGCCGCCG------CCTCCTCCTCCTCATTCAAGTCCAAGGAGA 502

Mouse 373:TGAGCGCCGCCGCCCG------CTCCTCATTCAAGTCCAAGGAGA 411

Lemur 442:CGAGCGCCTCCGCCGCCGCCG------CCTCCTCCTCATTCAAGTCCAAGGAGA 489

Chimpanzee 445:CGAGCGCCTCCGCCGCCGCCGCCGCCGCCGCCTCCTCCTCCTCATTCAAGTCCAAGGAGA 504

Human 445:CGAGCGCCTCCGCCGCCGCCGCCGCCGCCGCCTCCTCCTCCTCATTCAAGTCCAAGGAGA 504

SBE

Zebrafish 356:CTAGG------AAGCGGCAGGGGGCGGAGCCAACATTGAGGGGCGGGATCTCGAGGCAG 408

Coelacanth 328:TTCAGTTTCATGTGGATAC--TAGTCTGAAAC------AGGCAG 363

Turtle 432:TTCAGTTTCCTATGGAAAGGGGCGTCTGAGGC------AGACAG 469

Opossum 560:TTGGGTTTCATATGGAGATACGGTACTGAGGC------AGGCAG 597

Cattle 503:TCGGGTTTCGCTCCGAGACCGCGGTCGGAGGC------AGGCAG 540

Mouse 412:TCGGGTTGCGCCGCGCGCCCGCGGTCGGAGGC------AGGCAG 449

Lemur 490:TCGGGTTTCGCTCCGAGACCGCGGTCGGAGGC------AGGCAG 527

Chimpanzee 505:TCGGGTTTCGCTCCGAGACCGCGGTCGGAGGC------AGGCAG 542

Human 505:TCGGGTTTCGCTCCGAGACCGCGGTCGGAGGC------AGGCAG 542

CRE SBE

Zebrafish 409:ACAGCG-TGACGTCATTGCCAGACTTGGCC-----TTTCG--CAGAGCGT---GCGAGCC 457

Coelacanth 364:ACAGTCATGACGTCACTGCCAGACTGGGCCAGCCCCTCCCA-GGCACTGCAAAGTGGGC- 421

Turtle 470:ACAGGG-TGACGTCAGCGCTAGACTGGGACTGCCCTTTCCAGCGCCCCGCCAGGCGGGGC 528

Opossum 598:ACGGCC-TGACGTCAGCGCTAGACGGAACTGCCCTTTTCCACCGCACTGGCGGAGGGGGG 656

Cattle 541:ACGGTC-TGACGTCAGCGCTAGAC-GGGGCTGCCGGTTCC--CACCGCGC---GGGGGCC 593

Mouse 450:ACGGTC-TGACGTCAGCGCTAGAC-GGGGCTGCCGGTTCC--CACCGCGC---GGGGGCC 502

Lemur 528:ACGGTC-TGACGTCAGCGCTAGACGGGGTCC------CACCGCGC---GGGGGCC 572

Chimpanzee 543:ACGGTC-TGACGTCAGCGCTAGAC-GGGGCTGCCGGTTCC--CACCGCGC---GGGGGCC 595

Human 543:ACGGTC-TGACGTCAGCGCTAGAC-GGGGCTGCCGGTTCC--CACCGCGC---GGGGGCC 595

NF-B

Zebrafish 458:AATCGGAGTCACACGGGGATCAAACAACACACACACACACACACACACACACACACACAC 517

Coelacanth 422:------CACCC 426

Turtle 529:GGGGAGGGGAAGAGAGAGAAGGGGGCGGGTGAGT------GACGGCG 569

Opossum 657:GAGCGGGGGGAGTCTAGGAAGGAGGAGAGGAGCCCCCCTCTAGCCTTGGCGGCTGCACCA 716

Cattle 594:GGGGAGGGG------GCGCGCGC------TGCGCCA 617

Mouse 503:GCGGAGGGG------GCGCGCGC------TGCGCCA 526

Lemur 573:GGGGAGGGG------GCGCGCGC------TGCGCCA 596

Chimpanzee 596:GGGGAGGGG------GCGCGCGC------TGCGCCA 619

Human 596:GGGGAGGGG------GCGCGCGC------TGCGCCA 619

Zebrafish 518:ACACACACACACACACACACACACACACACACACACACACACACACACACACACACACAC 577

Coelacanth 427:GCTTTGACTGA------CTCCTCCCGCCACCAATCACAATTTTACTTGGAAAT 473

Turtle 570:CTCCTAGCCAA----TCCCAGCTCCGCCGCCCGGC------GGGAGG 606

Opossum 717:ATCCCTACCGAGGTTCCCCCGCACCCCCCCCTTTCTCCCTCCCAGCGCTAGGAAGGGTGG 776

Cattle 618:ATCCCCGCCGAGGTTCTCCCGCACCCCCTCCCCGGGCC------GCGGGG 661

Mouse 527:ATCCCCGCCGACGGCCCC---CGCACCCTCCTCGGCCC------GGGGGG 567

Lemur 597:ATCCCCGCCGAGGTTCCCCCGCACCCCCTCCCGGTCT------GGGAGG 639

Chimpanzee 620:ATCCCCGCCGAGGTTCCCCCGCACCCCCTCCCCGAGCC------GGGGGG 663

Human 620:ATCCCCGCCGAGGTTCCCCCGCACCCCCTCCCCGAGCC------GGGGGG 663

Zebrafish 578:TGTATTGATCGATTTCACTG------GGACAGTCAAATAAAGTGTGTG------619

Coelacanth 474:TTAAAGGGCTGTTTTTAAAG--GTACATTGCCCTTCTAAAGGCTTGTGCC------521

Turtle 607:CGGGGCGCCCGA------GCCCCTCCAGGCGCAGCTGCGGTGACCCGCA 649

Opossum 777:GGGGGGGAATGAATGAGGGGGGTGAAACTGACTCTTAAAGGGACAGAGTG------826

Cattle 662:CGGGGTGACGGGGAAGGGGG------CGGGCTCTTAAAGGGCCAGAGCA------704

Mouse 568:CGGGCCCGTCGCTCGCGGGG------CGGGCTCTTAAAGGGCCCGAGCC------610

Lemur 640:CGGGGTGACGGTAAACGGGG------CGGGCTCTTAAAGGGCCAGAGCC------682

Chimpanzee 664:CGGGGTGACGGGAAACGGGG------CGGGCTCTTAAAGGGCCAGAGCT------706

Human 664:CGGGGTGACGGGAAACGGGG------CGGGCTCTTAAAGGGCCAGAGCT------706

human NKILA (+)

Zebrafish 620:------TGTGTGTGGACGGAAAGACATCACAGAGAGCACAGACGGAATATTTTACG 669

Coelacanth 522:------AGTAAGTACAT------TTCCCCAAAATCAACCAAGAGGCAAATCT-- 561

Turtle 650:CCCGCAGTGTGGGGTGAGCGCGCGGCGAGCTCGGTCGGGCCGCGCGTGCGGCTCGCAGCT 709

Opossum 827:------CATTCGCGAATGCAG------CAAAGCCAC---CGCCACACGCCATC 864

Cattle 705:------GGGCGGTCCGCGTAGACCCGGGACCCGCGAAGCGAAAGAAGGGTGCAGCG 754

Mouse 611:------TAGC-GTCCATGTAGACCGGTCGCCGGCGCTGCGTAG-----ACCCAGCG 654

Lemur 683:------AAGCGGCCCGCGTAGACCCGAGACCAGCGCGACGGAGGAGGGGCGCTGTG 732

Chimpanzee 707:------AGGCGGCCCACGTAGACCCGGCACCCGCGCAACGGAGGAGGGGCGCTGTG 756

Human 707:------AGGCGGCCCACGTAGACCCGGCACCCGCGCAACGGAGGAGGGGCGCTGTG 756

human NKILA transcript start

human NKILA (+)

Zebrafish 670:CACTT------TTAGGTCTGATT------AAGTTTGA 694

Coelacanth 562:------GAAAGTGTAATTTTTTTTTAAATAATTTTAAAAAAATCAGCATTTC- 607

Turtle 710:CGCCG------GGGAGTCGCGTTGTTCCCTGGTGGGACGCTGCCGTGTCCGGGCCCG- 760

Opossum 865:CGC------AGACTTCAGACA----CACACAGAGACACAGACGTGCACAC------904

Cattle 755:TCCCCTCCCAAACGGCGGTCAGTTT------GGAAGCTCTGACCCTCTCAGGCCAG- 804

Mouse 655:CCCTC------GAGACTC------CGCAGGCCCG- 676

Lemur 733:TCCCCTCCCCAGCGAGGGTCAAGTT------AGTCCCGCACAAGCTTG- 774

Chimpanzee 757:CCCTCTCCCCAACGGCGGTCAGCTTG------GAACGCCTGCCCGGCGCACGCCCG- 806

Human 757:CCCTCTCCCCAACGGCGGTCAGCTTG------GAACGCCTGCCCGGCGCACGCCCG- 806

human NKILA (+)

Zebrafish 695:TGTGCTTGAAAGATGTAATT-----AATGTAAATAT---TATGTATTTTATATCATAATT 746

Coelacanth 608:-ATGTAAGAATGATGGAATCAAAAAAAGTTCCATACATCTATTGTAGGAGTATAAGAACA 666

Turtle 761:-ATCCAAAGCCCCAAGGCTCCCGCTGGCCCCGGCGG------GCTTAGGATCGGGCCC 811

Opossum 905:-AGACTCGGAAGGGAAAATACATACGGTGGCGGTGGTGGTGACTCCATTAACTTATATAG 963

Cattle 805:-GCCCGGGAGAGCAGGATTG-----GGTCCCAGCCTTTGTGG------840

Mouse 677:-GTCCGCGGGAGCCCC------CGTGCCTGCGGCGGTCACAACACAGAACAAGAATC 726

Lemur 775:-GGCCGCGGGAGGGGGGCTC-----GGTCTCAGCCCC------ACC 808

Chimpanzee 807:-GGGCCAAGGAGCCGAACTC-----GGTGCCAGCCGC------ACC 840

Human 807:-GGGCCGGGGAGCCGAACTC-----GGTGCCAGCCGC------ACC 840

human NKILA (+)

Zebrafish 747:TGAGTGAATAA----AGC------GCTCTCCT------768

Coelacanth 667:AAAGAGAAAGAGAATGGGGATAAAAGCTCACCCATAGTGACCTGGTTTCCGAATTTCACC 726

Turtle 812:CGGCCGGGCAGCGG------GCTCGGCTCTCCGGTGGCTGGCCGCGG------C 853

Opossum 964:TAGTTGCTTAATG--AAT------GTCATTCTGAGTTATGGAATTCCCAAG------C 1007

Cattle 841:------GTGCGTTCCCCACAACCCGGACCTCGG------G 868

Mouse 727:AAGACGAAAAGTTAAGGT------GTTCTCCTCTCCAACTCCTGCCCCTGG------G 772

Lemur 809:CTGGCGTGTTGCC--GGT------GCTCCCAACCTCTGC-CCGCGCCCCGG------G 851

Chimpanzee 841:CGGGCGGGTTGCT--GGT------GCGCCCTCCCCTCGCCCCCGTCCCTGG------G 884

Human 841:CGGGCGGGTTGCT--GGT------GCGCCCTCCCCTCGCCCCCGTCCCTGG------G 884

human NKILA (+)

Zebrafish 769:---CTCGTTTTG------GGATTTTAAAGTTTGTCT 795

Coelacanth 727:AACTCTGACCAGTTCCACTGTCCCTCAATAATCAATACCAGCGTCTCTTAAAAGTATTTT 786

Turtle 854:AGCCCCGAGCGG------TGGCGGTGC--- 874

Opossum 1008:GTCCCTATTCCT------GCCTTTTTGAAG------1031

Cattle 869:GTCCTTGACCGA------GGCTTTGGGGGGAAGCTT 898

Mouse 773:GCCCTCCACCCA------GGCTTTTGGGGGCA---- 798

Lemur 852:GTCCTTGACCCA------AGCTTTGGGGGTTAGCCT 881

Chimpanzee 885:GTCCTTGACCCA------GGCTCTTGGGGCTAGCCT 914

Human 885:GTCCTTGACCCA------GGCTCTTGGGGCTAGCCT 914

human NKILA (+)

Zebrafish 796:AAAACGACATCAAAACAATAAAAGTGTAATGG----AGATTTAGA------ATGC 840

Coelacanth 787:TTGACCATACTAAATTTT----AATGTTTACA----GCATTGAGAAATATTTCATTCTGC 838

Turtle 875:------AGGTGCAGGTGGTTCCCGGC----GAAGTGGGTAATAATGAACCGTCC 918

Opossum 1032:------GGGGTGGCGA----AGATGGTGG------1050

Cattle 899:GTCCCCCTTCAGAGGAGCACGAGGTCCCTCGGACTCAGGTCAAGGGAAATT------949

Mouse 799:------CCCGAAGACTATAAGGTTTCTGGG----GCGCCTTGGAAGAGTTTA--ATGC 844

Lemur 882:AT------CTAAAGCGCAGGAGGGTCTCGGG----GGCTCAGGGAAAGGAAAT------924

Chimpanzee 915:AT----CTTCTGAGGAGCACAAGGTCCCTGGG----GGCTCAGGGAAGAGAAAT------960

Human 915:AT----CTTCTGAGGAGCACAAGGTCCCTGGG----GGCTCAGGGAAGAGAAAT------960

human NKILA (+)

Zebrafish 841:ACAT------AAAATAATG 853

Coelacanth 839:ACATCTATTACATTGTGATTTCCATCTCCTGGTCTCTAAATCCGTTCTCCCGAAGTGGTG 898

Turtle 919:CTGG------GGAGCGGCG 931

Opossum 1051:------AGAATGGGG 1059

Cattle 950:------AGGGAAAGG 958

Mouse 845:ATGG------AGATCAGAG 857

Lemur 925:-TGG------AGAAAGGGG 936

Chimpanzee 961:-TGG------AGAAAGGGG 972

Human 961:-TGG------AGAAAGGGG 972

intron border of human PMEPA1-002

PMEPA1 (-) human NKILA (+)

Zebrafish 854:------ACTATTATCAATTTTTCTCACGGTTTCCAGAAGA------887

Coelacanth 899:AAAAAAGTCTGCCCCTGTGTAGATACTCTTATCTGCCTCAGTATT------943

Turtle 932:GTGAGCACCTTGC------CCTCCCATGCACCCGAA--TTGAAGGGGGG------972

Opossum 1060:AAAGGAG------AGTTGGTTTTGT------1078

Cattle 959:CGGGGAGCCCTTCAGA------TCATCCGT-TACCTGAT--TTCGCAGTCAACTCCCC 1007

Mouse 858:CATGGAGGGT-CTGGAGTAAAGACCCATCCAT-AGCTGTAT--TTCTCAGGAGA------907

Lemur 937:AGGGAAGCCC-CCAAGATCAA---TCACCCAT-TGCCTGAT--TTCAATGGAGACTCTCC 989

Chimpanzee 973:GAGGAAGCCC-CCAAGATGGA---TCACCCAT-TGCCTGAT--TTCGCAGGAGACTGTCC 1025

Human 973:GAGGAAGCCC-CCAAGATGGA---TCACCCAT-TGCCTGGT--TTCGCAGGAGACTGTCC 1025

Human aa (rev) M

PMEPA1 (-) human NKILA (+)

Zebrafish 888:--TCCTAAAATTTCACTGACATTGAGTCTAA---TAATTGAGTCTCCAACAGTTGTGTTC 942

Coelacanth 944:---TTTAGTACTCAAATTACCCCTCCTCACAATCTCCTTGTCTTTGGGAAGAAAATACTA 1000

Turtle 973:------AGCTAGCCACCTGCACACGGGGACACCCCCAGCAGGCCCCCTGAGGAGGCAACT 1026

Opossum 1079:------CAATGGCAAGCTTTTTCATATCTGGCACATTCCCCTCAGACTTATCC 1125

Cattle 1008:GATTCCAGATCTGCAGCTCCAAGAGGCCCCAAGCCCACTGCGCCCCCCGC----GCAATT 1063

Mouse 908:------CTGAGAATTAGTCTGAGCCCCTCCGCAACCCCCAACCCCGCGCGG 952

Lemur 990:GCCCTCGGTTCTGCTGCAGCACGCGG------CCACCGAGCCCTG----GGTGCACTT 1037

Chimpanzee 1026:GCCTTCAGTTCTCCAGCAGCTCGGGGATCATGGCCCACTGAACCCCC-----AAGCGCTT 1080

Human 1026:GCCTTCAGTTCTCCAGCAGCTCGGGGATCATGGCCCACTGAACCCCC-----AAGCGCTT 1080

human NKILA hairpin A region

PMEPA1 (-) human NKILA (+)

Zebrafish 943:CCTTTTATTCACCAGGGGGTCACCACAACGGAATGAACCACCAACTATTTCAGCATATGT 1002

Coelacanth 1001:TTTTTTTTTCGTTATAACAACAATTTAGATCTTGGAATCATTCTTTATGCTTTTTTGCGT 1060

Turtle 1027:CCTGCCAGGCCTGGGGCTGCAGGCTAGCGAGGGGTGGTGACACATCGCCCCCCTTCCCCT 1086

Opossum 1126:TATTT------1130

Cattle 1064:TATCTCGACCTAGAGGAGGTCGACTGGGGAAAA------C 1097

Mouse 953:CTCTTTAACT------GGGCCTTTTGGATAGA------C 979

Lemur 1038:TAACCCGAATCCAAGGAGGTCGACTTGGCAAGA------C 1071

Chimpanzee 1081:TCACCCGAACCCAAGGAGGACGACCAGGAAAGA------C 1114

Human 1081:TCACCCGAACCCAAGGAGGACGACCAGGAAAGA------C 1114

PMEPA1 (-) human NKILA (+)

Zebrafish 1003:TGTACGCAGCGCATACCCTTCCAGCTGCAACCCAGTACTGGGAAACACCCAAACACACAC 1062

Coelacanth 1061:GCTTTTTTGCGTACTTTTTCCGGTGTATCAATGCTTTTCTTGAAACAAGACGACCAAAAC 1120

Turtle 1087:GGGGTCTAGCGCAGATTCTCC------CCCCCCCCCCAAAAAAAAAGCGAGACTGCAGCC 1140

Opossum 1131:------GGAAACTTCCTCC------GCAGTGGGGTACAGCC 1159

Cattle 1098:TAGAACTCCTGTAGACACGCC------CCGGACTGCTCCGAGAAACGCC------C 1141

Mouse 980:TGGAATTTCTGTAGACACGCC------CCGAGAAGCGCC------C 1013

Lemur 1072:TGGAACTCGCATAGACACGCC------C 1093

Chimpanzee 1115:GGGAACTCGCGTAGACACGCC------C 1136

Human 1115:GGGAACTCGCGTAGACACGCC------C 1136

human NKILA hairpin B region

human PMEPA1-002 transcript start

intron border of human NKILA transcript DA866558

human NKILA (+)

Zebrafish 1063:TCACACTCATACACTACG---GCCAG------TGTAGTTGATCAGTTCCCCTATAGC 1110

Coelacanth 1121:TATATCCAATAT----TGTAAATGAGGCCTAAACAAGTAATTGTTCCATGCTAAT----- 1171

Turtle 1141:CCAAACTCACGCAGCACGTCGGTGAC------AGTCAGGAATCGAACCCTGGGGCCA 1191

Opossum 1160:TAGGATCCAGGC----TTTGGGGGGGAGGTGTGTGTGTGGTGTATTTGTGTTT------1208

Cattle 1142:GGGAACCCTTGT--CATGTAAATACG------TGTTGGGGACTTGCAC------1181

Mouse 1014:CCTGGTCCTTGC--CTTGTAAATTAG------TGCCCGGGACTGGGGGTGAGAGTGT 1062

Lemur 1094:GGGAGCCGTTGT--CATGTAAATAGA------GGTCCTTGACTA------1129

Chimpanzee 1137:GGAAGCCCTTGT--CATGTAAATAGC------TGTCGGGGACTG------1172

Human 1137:GGAAGCCCTTGT--CATGTAAATAGC------TGTCGGGGACTG------1172

human NKILA (+)

Zebrafish 1111:GCATGTGTTTG---GACTGTGGGGGAAACC------GGAGCACCCGGA-----GGAAAC 1155

Coelacanth 1172:------GTAAATTCTCTTGATTCAAATTTGATATTCCTGGC----TATAATT 1213

Turtle 1192:------AATTCTGATCTAAACGCTGCCAGTGTAAATCCAGA-----GTAACT 1232

Opossum 1209:------GTCTGTTTGTGTCTCTCTGCCTGTCTGTGTCTGTGTGTCTGTGTCT 1254

Cattle 1182:------GGTCCCGGCCAGCGAGGCCCGGG-----GCGAAT 1210

Mouse 1063:GTGTGTGTGTGTGTGTGTGTGTGTGTGTTACTATTATGGGACACCCAAGCAGTTACACAC 1122

Lemur 1130:------GTGTATTGTGGCCGCCCCACCCGGCGGGGCCCGGG-----GCGAAT 1170

Chimpanzee 1173:------GTGTATTGTCGCCGCCCCAGCCGGCGGGACCTGGG-----GCGAAT 1213

Human 1173:------GTGTATTGTCGCCGCCCCAGCCGGCGGGACCTGGG-----GCGAAT 1213

human NKILA (+)

Zebrafish 1156:CCACACCAACACGGGGAGAACATGCAAACTC-CAC------AC------AGAAACA 1198

Coelacanth 1214:TCAAGTAGTC------TGTTTGCTCATTCAACTGTGCAGAGTGTTTCAGTTGTA 1261

Turtle 1233:CCACAGTGGC------TACTCCAGCTC-CACAGTGGTGTAATGGAAATTAGGTGTG 1281

Opossum 1255:C------TGTGTGTTT-TGC------AGGAGGGGTGTGGGGCG 1284

Cattle 1211:CCACACCCTA------CGTCTGCTG-CCC------AA------AGGGCCA 1241

Mouse 1123:CCTCACCCTC------TATCTGCCG-CCC------CA------AGGGCCA 1153

Lemur 1171:CCACACCCTT------TGTCTGCTG-CCC------GA------GGGGCTA 1201

Chimpanzee 1214:CCACACCCAT------TGTCTGCTG-CCC------GA------GGGGCCA 1244

Human 1214:CCACACCCAT------TGTCTGCTG-CCC------AA------GGGGCCT 1244

human NKILA (+)

Zebrafish 1199:CCAACTGACCCAGCCGAGACTCAAACCTTCTTACTGCGCCACCGTGCAGCCCTTATTTTA 1258

Coelacanth 1262:TTATTT------AGAATGCGATAGTCCTTTTCTTTAATCTACCTT------1300

Turtle 1282:CTGGCTCCCAGCACCAGGT------1300

Opossum 1285:CTGCGTAACTCTGGGA------1300

Cattle 1242:CTGGCC------1247

Mouse 1154:CCGACT------1159

Lemur 1202:CCGGCT------1207

Chimpanzee 1245:CCGGCT------1250

Human 1245:CCGGCT------1250

(B)

Zebrafish 1:MFSFMGLTNGTTD----TLANVSCTCNCQRSTSFQSMAISQLEFVQILVIVVVMMMMVLV 56

Coelacanth 1:MFNLMGLNSTTAA----IQPNVSCTCNCKRSL-FPSMEITDLEFVQIIIIVVVMMVMVVV 55

Turtle 1:MYHLMGVNSTAAA----TQPNVSCTCNCKRSL-FQTMEITELEFVQIIIIVVVMMVMVVV 55

Opossum 1:MYNLMGLNSTAAA----GQPNVSCTCNCKRSL-FQSMEITELEFVQIIIIVVVMMVMVVV 55

Cattle 1:MHRLMGVNSTAAAAAAAGQPNVSCTCNCKRSL-FQSMEITELEFVEITIIVVVVMVMVVV 59

Mouse 1: MGVNGTAAAAA--GQPNVSCACNCQRSL-FPSMEITELEFVQIVVIVVVMMVMVVM 53

Lemur 1:MHRLMGFNSTAAAAA--GQPNVSCTCNCKRSL-FQSMEITELEFVQIIIIVVVMMVMVVV 57

Chimpanzee 1:MHRLMGVNSTAAAAA--GQPNVSCTCNCKRSL-FQSMEITELEFVQIIIIVVVMMVMVVV 57

Human 1:MHRLMGVNSTAAAAA--GQPNVSCTCNCKRSL-FQSMEITELEFVQIIIIVVVMMVMVVV 57

Zebrafish 57:ITCLLNHYRLSARSLMSRHTHERRRHLPLPSEGSLWSSDGPGSSSAMSE--VYTP-RAPD 113

Coelacanth 56:ITCLLNHYKLSARSFINRHSQGRRREENLSPEGSLWPSDSTVSGSGMTE-QIYTP-RSTE 113

Turtle 56:ITCLLNHYKLSARSFINRHSQGRRRDENLSSEGSLWPSESTVSGNGVTEQQIYTP-RPTD 114

Opossum 56:ITCLLNHYKLSARSFIHRHSQGRRREENLSSEGSLWPSESTVSGNGLTE-QIYAPSRSAD 114

Cattle 60:ITCLLSHYKLSARSFIGRHSQSRRREDALSSEGCLWPSESTVSGNGIPEPQVYAPPRPTD 119

Mouse 54:ITCLLSHYKLSARSFISRHSQARRRDDGLSSEGCLWPSESTVSG-GMPEPQVYAPPRPTD 112

Lemur 58:ITCLLSHYKLSARSFISRHGQGRRREDALSSEGCLWPSESTVSGSGIPEPQVYTPPRPAD 117

Chimpanzee 58:ITCLLSHYKLSARSFISRHSQGRRREDALSSEGCLWPSESTVSGNGIPEPQVYAPPRPTD 117

Human 58:ITCLLSHYKLSARSFISRHSQGRRREDALSSEGCLWPSESTVSGNGIPEPQVYAPPRPTD 117

Zebrafish 114:R--VPSFLQRERVSRFQPTFPFLPPVIELPPTIALSDGEEPPPYQGPCTLQLRDREQQLE 171

Coelacanth 114:RLTVPSFLQRDRFNRFQPTYPYLQHEIDLPPTISLSDGEEPPPYQGPCTLQLRDPEQQME 173

Turtle 115:RLSVPSFLQRDRFNRFQPTYPYMQHEIDLPPTISLSDGEEPPPYQGPCTLQLRDPEQQME 174

Opossum 115:RLTVPSFLQRDRFNRFQPTYPYLQHEIDLPPTISLSDGEEPPPYQGPCTLQLRDPEQQME 174

Cattle 120:RLAVPAFAQRDRFHRFQPTYPYLQHEIDLPPTISLSDGEEPPPYQGPCTLQLRDPEQQLE 179

Mouse 113:RLAVPPFIQR---SRFQPTYPYLQHEIALPPTISLSDGEEPPPYQGPCTLQLRDPEQQLE 169

Lemur 118:RLAVPAFAQRDRLHRFQPTYPYLQHEIDLPPTISLSDGEEPPPYQGPCTLQLRDPEQQLE 177

Chimpanzee 118:RLAVPPFAQRDRFHRFQPTYPYLQHEIDLPPTISLSDGEEPPPYQGPCTLQLRDPEQQLE 177

Human 118:RLAVPPFAQRERFHRFQPTYPYLQHEIDLPPTISLSDGEEPPPYQGPCTLQLRDPEQQLE 177

Zebrafish 172:LNRESVRPPPNRTVYDSAL------THTGIS------GGRLQGPPPAYSEVIG 212

Coelacanth 174:LNRESVRAPPNRTIFDSDLIDTSMYGGPCPPSSNSGISATCYSSNGRMEGPPPTYNEVIG 233

Turtle 175:LNRESVRAPPNRTIFDSDLIDNSVFGGPCPPSSNSGISATCYGSNGRMEGPPPTYSEVIG 234

Opossum 175:LNRESVRAPPNRTIFDSDLIDNSMYGGPCPPSSNSGISATCYGSNGRMEGPPPTYNEVIG 234

Cattle 180:LNRESVRAPPNRTIFDSDLMESAMLGGPCPPSSNSGISATCYGGGGRMEGPPPTYSEVIG 239

Mouse 170:LNRESVRAPPNRTIFDSDLIDSTMLGGPCPPSSNSGISATCYSSGGRMEGPPPTYSEVIG 229

Lemur 178:LNRESVRAPPNRTIFDSDLMDSARLGGPCPPSSNSGISATCYGSGGRMEGPPPTYSEVIG 237

Chimpanzee 178:LNRESVRAPPNRTIFDSDLMDSARLGGPCPPSSNSGISATCYGSGGRMEGPPPTYSEVIG 237

Human 178:LNRESVRAPPNRTIFDSDLMDSARLGGPCPPSSNSGISATCYGSGGRMEGPPPTYSEVIG 237

Zebrafish 213:HYYH------PTTLPQAAALVHGFI---QLSRADTRSKTQPKAQPV 249

Coelacanth 234:HYPG-STFYQHQQ-NNGMPSILEGTRL-HSQINGLESTIARNKDKEKSKGHPF 283

Turtle 235:HYPG-STFFQHQQNNNGMPSILEGSRLHHSQINGLESTTAWNKDKEKQKGQPLLKKNKKN 293

Opossum 235:HYPG-STFYQHQQ-NNGTPSILENNRLHQSQISSLESTIAWNKEKEKQKGHPF 285

Cattle 240:HYPGAST-FQHQQ-SSGPASLLEGTRLHPAHIAPLESTAAWSKEKDKQKGHPL 290

Mouse 230:HYPG-SS-FQHQQ-SNGPSSLLEGTRLHHSHIAPLE-----NKEKEKQKGHPL 274

Lemur 238:HYPG-SS-FQHQQ-SNGPPSLLEGTRLQHTHMAPLESAAIWSKEKDKQKGHPL 287

Chimpanzee 238:HYPG-SS-FQHQQ-SSGPPSLLEGTRLHHTHIAPLDSAAIWSKEKDKQKGHPL 287

Human 238:HYPG-SS-FQHQQ-SSGPPSLLEGTRLHHTHIAPLESAAIWSKEKDKQKGHPL 287

Turtle 294:QTRK 297
