Supplemental table 1. Sequence comparison of human and porcine collagen Type I α1 chain.

species / Number / Amino acid sequence / Number
human
porcine
human
porcine / 1
1
31
31 / MFSFVDLRLLLLLAATALLTHGQEEGQVEG
MFSFVDLRLLLLLAATALLTHGQEEGQEEG
Q------DEDIPPI TCVQNGLRYHDRDVWKPEP
QQGQEEDIPPVTCVQNGLRYHDRDVWKPVP / 30
30
57
60
human
porcine
human
porcine / 58
61
88
91 / CRICVCDNGKVLCDDVICDETKNCPGAEVP
CQICVCDNGNVLCDDVICDEI KNCPSARVP
EGECCPVCPDGSESPTDQET TGVEGPKGDT
AGECCPVCPEGEVSPTDQET TGVEGPKGDT / 87
90
117
120
human
porcine
human
porcine / 118
121
148
151 / GPRGPRGPAGPPGRDGIPGQPGLPGPPGPP
GPRGPRGPSGPPGRDGIPGQPGLPGPPGPP
GPPGPPGLGGNFAPQLSYGY DEKSTGGISV
GPPGPPGLGGNFAPQLSYGY DEKSA--GISV
a / 147
150
177
179
human
porcine
human
porcine / 178
180
208
210 / PGPMGPSGPR GLPGPPGAPG PQGFQGPPGE
PGPMGPSGPR GLSGPPGAPG PQGFQGPPGE
PGEPGASGPM GPRGPPGPPG KNGDDGEAGK
PGEPGASGPM GPRGPPGPPG KNGDDGEAGK / 207
209
237
239
Human
Porcine
human
porcine / 238
240
268
270 / PGRPGERGPP GPQGARGLPG TAGLPGMKGH
PGRPGERGPP GPQGARGLPG TAGLPGMKGH
RGFSGLDGAK GDAGPAGPKG EPGSPGENGA
RGFSGLDGAK GDAGPAGPKG EPGSPGENGA / 267
269
297
299
human
porcine
human
porcine / 298
300
328
330 / PGQMGPRGLP GERGRPGAPG PAGARGNDGA
PGQMGPRGLP GERGRPGPPG PAGARGNDGA
TGAAGPPGPT GPAGPPGFPG AVGAKGEAGP
TGAAGPPGPT GPAGPPGFPG AVGAKGEAGP / 327
329
357
359
human
porcine
human
porcine / 358
360
358
390 / QGPRGSEGPQ GVRGEPGPPG PAGAAGPAGN
QGARGSEGPQ GVRGEPGPPG PAGAAGPAGN
PGADGQPGAK GANGAPGIAG APGFPGARGP
PGADGQPGGK GANGAPGIAG APGFPGARGP / 387
389
417
419
human
porcine
human
porcine / 418
420
448
450 / SGPQGPGGPP GPKGNSGEPG APGSKGDTGA
SGPQGPSGPP GPKGNSGEPG APGSKGDTGA
KGEPGPVGVQ GPPGPAGEEG KRGARGEPGP
KGEPGPTGVQ GPPGPAGEEG KRGARGEPGP / 447
449
477
479
human
porcine
human
porcine / 478
480
508
510 / TGLPGPPGER GGPGSRGFPG ADGVAGPKGP
AGLPGPPGER GGPGSRGFPG ADGVAGPKGP
AGERGSPGPA GPKGSPGEAG RPGEAGLPGA
AGERGSPGPA GPKGSPGEAG RPGEAGLPGA / 507
509
537
539
human
porcine
human
porcine / 538
540
568
570 / KGLTGSPGSP GPDGKTGPPG PAGQDGRPGP
KGLTGSPGSP GPDGKTGPPG PAGQDGRPGP
PGPPGARGQA GVMGFPGPKG AAGEPGKAGE
PGPPGARGQA GVMGFPGPKG AAGEPGKAGE / 567
569
597
599
human
porcine
human
porcine / 598
600
628
630 / RGVPGPPGAV GPAGKDGEAG AQGPPGPAGP
RGVPGPPGAV GPAGKDGEAG AQGPPGPAGP
AGERGEQGPA GSPGFQGLPG PAGPPGEAGK
AGERGEQGPA GSPGFQGLPG PAGPPGEAGK / 627
629
657
659
human
porcine
human
porcine / 658
660
688
690 / PGEQGVPGDL GAPGPSGARG ERGFPGERGV
PGEQGVPGDL GAPGPSGARG ERGFPGERGV
QGPPGPAGPR GANGAPGNDG AKGDAGAPGA
QGPPGPAGPR GANGAPGNDG AKGDAGAPGA / 687
689
717
719
human
porcine
human
porcine / 718
720
748
750 / PGSQGAPGLQ GMPGERGAAG LPGPKGDRGD
PGSQGAPGLQ GMPGERGAAG LPGPKGDRGD
AGPKGADGSP GKDGVRGLTG PIGPPGPAGA
AGPKGADGAP GKDGVRGLTG PIGPPGPAGA / 747
749
777
779
human
porcine
human
porcine / 778
780
808
810 / PGDKGESGPS GPAGPTGARG APGDRGEPGP
PGDKGETGPS GPAGPTGARG APGDRGEPGP
PGPAGFAGPP GADGQPGAKG EPGDAGAKGD
PGPAGFAGPP GADGQPGAKG ------
b / 807
809
837
829
human
porcine
human
porcine / 838
830
868
853 / AGPPGPAGPAGPPGPIGNVG APGAKGARGS
------GPTGPPGPIGSVG APGPKGARGS
AGPPGATGFPGAAGRVGPPGPSGNAGPPGP
AGPPGATGFPGAAGRVGPPGPSGNAGPPGP / 867
852
897
882
human
porcine
human
porcine / 898
883
928
913 / PGPAGKEGGK GPRGETGPAG RPGEVGPPGP
PGPAGKEGSK GPRGETGPAG RPGEAGPPGP
PGPAGEKGSP GADGPAGAPG TPGPQGIAGQ
PGPAGEKGSP GADGPAGAPG TPGPQGIAGQ / 927
912
957
942
human
porcine
human
porcine / 958
943
988
973 / RGVVGLPGQR GERGFPGLPG PSGEPGKQGP
RGVVGLPGQR GERGFPGLPG PSGEPGKQGP
SGASGERGPP GPMGPPGLAG PPGESGREGA
SGPSGERGPP GPMGPPGLAG PPGESGREGA / 987
972
1017
1002
human
porcine
human
porcine / 1018
1003
1048
1033 / PGAEGSPGRD GS PGAKGDRG ETGPAGPPGA
PGAEGSPGRD GAPGP KGDRG ESGPAGPPGA
PGAPGAPGPVGPAGKSGDRG ETGPAGPAGP
PGAPGAPGPVGPAGKSGDRG ETGPAGPAGP / 1047
1032
1077
1062
human
porcine
human
porcine / 1078
1063
1108
1093 / VGPVGARGPAGPQGPRGDKGETGEQGDRGI
VGPVGARGPAGPQGPRGDKGETGEQGDRGI
KGHRGFSGLQGPPGPPGSPG EQGPSGASGP
KGHRGFSGLQGPPGPPGSPG EQGPSGASGP / 1107
1092
1137
1122
human
porcine
human
porcine / 1138
1123
1178
1153 / AGPRGPPGSAGAPGKDGLNG LPGPIGPPGP
AGPRGPPGSAGAPGKDGLNG LPGPIGPPGP
RGRTGDAGPVGPPGPPGPPGPPGPPSAGFD
RGRTGDAGPVGPPGPPGPPGPPGPPSGGFD / 1177
1152
1197
1182
human
porcine
human
porcine / 1198
1183
1228
1213 / FSFLPQPPQEKAHDGGRYYRADDANVVRDR
FSFLPQPPQEKAHDGGRYYRADDANVVRDR
DLEVDTTLKSLSQQIENIRSPEGSRKNPAR
DLEVDTTLKS LSQQIENIRS PEGSRKNPAR / 1227
1212
1257
1242
human
porcine
human
porcine / 1258
1243
1288
1273 / TCRDLKMCHSDWKSGEYWIDPNQGCNLDAI
TCRDLKMCHSDWKSGEYWIDPNQGCNLDAI
KVFCNMETGETCVYPTQPSVAQKNWYISKN
KVFCNMETGETCVYPTQPSVPQKNWYISKN / 1287
1272
1317
1302
human
porcine
human
porcine / 1318
1303
1348
1333 / PKDKRHVWFGESMTDGFQFEYGGQGSDPAD
PKDKRHVWYGESMTDGFQFEYGGEGSDPAD
VAIQLTFLRLMSTEASQNITYHCKNSVAYM
VAIQLTFLRLMSTEASQNITYHCKNSVAYM / 1347
1332
1377
1362
human
porcine
human
porcine / 1378
1363
1408
1393 / DQQTGNLKKALLLQGSNEIEIRAEGNSRFT
DQQTGNLKKALLLQGSNEIEIRAEGNSRFT
YSVTVDGCTSHTGAWGKTVIEYKTTKTSRL
YSVI YDGCTSHTGAWGKTVIEKKTTKTSRL / 1407
1392
1437
1422
human
porcine / 1438
1423 / PIIDVAPLDVGAPDQEFGFDVGPVCFL
PIIDVAPLDVGAPDQEFGI D LS PVCFL / 1464
1449

BLAST search result: sequence identity 1403/1464(96%), positives hits 1416/1467 (96%). Signal sequence, Von Willebrand factor type C domain and Fibrillar collagens C-terminal domain are absent in mature collagen and the difference locus in these region have no role in immunogenicity or antigenicity. Mature collagen was composed of three regions: N-telopeptide (non-helical) region, Gly-X-Y (helical) region and C-telopeptide (non-helical) region. Difference locus. Similar locus.

Sequence gaps (1%) may result in a slightly different secondary or tertiary structure

a: A difference locus and a sequence gap located in adjacent position of non-helical region, that is likely to be a antigenic determinant.

b: Consecutive deletion is likely to be a antigenic determinant.