Nebulin (NEB) - coding DNA reference sequence

(used for mutation description)

(last modified October 19, 2013)


This file was created to facilitate the description of sequence variants in the NEB gene based on a coding DNA reference sequence following the HGVS recommendations. The sequence was taken from NG_009382.2, covering transcript variant-4 (NM_001271208.1), containing all 183 exons. In reality exons 143 and 144 have thus far not been found together in one transcript (see variant-1 (NM_001164507.1) and variant-2 (NM_001164508.1)). Transcript variant-3 misses exons 82-105 (NM_004543.4).
NOTE: the sequence contains three identical blocks (exons 82-89, 90-97 and 98-105); variants found here can not be mapped exactly. The three repeated blocks roughly go from position c.12330+176_13789-968 (81i_89i), c.13789-969_15247-968 (89i_97i) and c.15247-969_16705-1485 (97i_105i). Blocks can be discriminated using intronic variants.

Please note that introns are available by clicking on the exon numbers above the sequence.


 (upstream sequence)
                                         .         .                g.5023
                                      aggcatcacgctggcttttgcct       c.-181

 .         .         .         .         .         .                g.5083
 ggggttctcaggaggggagagttgggagaggctttgctgctgaggaaatttatttggtag       c.-121

 .       | 02 .         .         .         .         .             g.5745
 attgaag | gtttgaacgagagctacagaaacgaaagaaaaagtctgtataagccaatggtg    c.-61

 .         .         .         . | 03       .         .             g.6331
 ttcgggaagaaaataaccccattgccttgag | tttgtaggtgccactactactctggaaaa    c.-1

          .         .         .       | 04 .         .         .    g.9855
 ATGGCAGATGACGAAGACTATGAGGAGGTGGTGGAG | TACTACACAGAAGAAGTGGTTTAC    c.60
 M  A  D  D  E  D  Y  E  E  V  V  E   | Y  Y  T  E  E  V  V  Y      p.20

          .         | 05         .         .         .         .    g.11623
 GAAGAGGTGCCGGGAGAG | ACAATAACAAAAATTTATGAGACTACGACAACAAGGACATCT    c.120
 E  E  V  P  G  E   | T  I  T  K  I  Y  E  T  T  T  T  R  T  S      p.40

          .         .         .         .         .         .       g.11683
 GACTATGAGCAATCAGAAACTTCCAAACCAGCTCTGGCACAGCCAGCACTGGCACAGCCA       c.180
 D  Y  E  Q  S  E  T  S  K  P  A  L  A  Q  P  A  L  A  Q  P         p.60

          .         .         .         .         .         .       g.11743
 GCATCAGCAAAGCCGGTGGAGAGGAGGAAGGTCATCCGGAAGAAAGTGGATCCTTCAAAG       c.240
 A  S  A  K  P  V  E  R  R  K  V  I  R  K  K  V  D  P  S  K         p.80

          .         .         .         .         .     | 06   .    g.13933
 TTCATGACCCCCTACATTGCACACAGTCAGAAAATGCAGGATCTTTTTAGCCCA | AATAAA    c.300
 F  M  T  P  Y  I  A  H  S  Q  K  M  Q  D  L  F  S  P   | N  K      p.100

          .         .         .         .         .         .       g.13993
 TACAAGGAGAAGTTTGAGAAAACAAAAGGACAGCCATACGCCAGCACAACAGATACTCCA       c.360
 Y  K  E  K  F  E  K  T  K  G  Q  P  Y  A  S  T  T  D  T  P         p.120

          .         .         .         .   | 07     .         .    g.14544
 GAACTTCGCAGAATCAAAAAAGTACAAGATCAACTCAGTGAG | GTTAAGTATCGAATGGAT    c.420
 E  L  R  R  I  K  K  V  Q  D  Q  L  S  E   | V  K  Y  R  M  D      p.140

          .         .         .         .         .         .       g.14604
 GGTGATGTTGCTAAGACTATATGTCACGTAGATGAAAAAGCAAAGGATATTGAACATGCA       c.480
 G  D  V  A  K  T  I  C  H  V  D  E  K  A  K  D  I  E  H  A         p.160

          .         .        | 08.         .         .         .    g.15156
 AAGAAAGTGTCGCAGCAAGTCAGTAAG | GTTTTATACAAGCAGAACTGGGAAGACACCAAG    c.540
 K  K  V  S  Q  Q  V  S  K   | V  L  Y  K  Q  N  W  E  D  T  K      p.180

          .         .         .         .         .         .       g.15216
 GATAAGTACCTGCTTCCTCCTGATGCCCCTGAACTTGTCCAGGCCGTTAAGAACACCGCC       c.600
 D  K  Y  L  L  P  P  D  A  P  E  L  V  Q  A  V  K  N  T  A         p.200

          .   | 09     .         .         .         .         .    g.16049
 ATGTTCAGCAAG | AAACTGTACACTGAAGACTGGGAAGCAGACAAAAGTTTGTTTTACCCC    c.660
 M  F  S  K   | K  L  Y  T  E  D  W  E  A  D  K  S  L  F  Y  P      p.220

          .         .         .         .         .        | 10.    g.21970
 TATAATGATAGCCCGGAACTGAGGAGAGTTGCCCAGGCCCAGAAAGCTCTCAGTGAT | GTT    c.720
 Y  N  D  S  P  E  L  R  R  V  A  Q  A  Q  K  A  L  S  D   | V      p.240

          .         .         .         .         .         .       g.22030
 GCCTACAAAAAAGGTCTCGCTGAACAGCAAGCTCAATTCACGCCTCTGGCTGATCCTCCA       c.780
 A  Y  K  K  G  L  A  E  Q  Q  A  Q  F  T  P  L  A  D  P  P         p.260

          .         .         .         .   | 11     .         .    g.28967
 GATATAGAATTTGCCAAGAAAGTAACCAATCAAGTGAGCAAG | CAAAAATACAAAGAAGAC    c.840
 D  I  E  F  A  K  K  V  T  N  Q  V  S  K   | Q  K  Y  K  E  D      p.280

          .         .         .         .         .         .       g.29027
 TATGAAAATAAAATCAAAGGCAAATGGAGTGAGACACCTTGCTTTGAAGTTGCAAATGCC       c.900
 Y  E  N  K  I  K  G  K  W  S  E  T  P  C  F  E  V  A  N  A         p.300

          .         .        | 12.         .         .         .    g.29757
 AGAATGAATGCTGATAACATTAGCACA | AGGAAATACCAGGAAGATTTTGAAAACATGAAA    c.960
 R  M  N  A  D  N  I  S  T   | R  K  Y  Q  E  D  F  E  N  M  K      p.320

          .         .         .         .         .         .       g.29817
 GACCAGATCTACTTCATGCAGACCGAAACACCAGAGTATAAAATGAATAAAAAAGCTGGT       c.1020
 D  Q  I  Y  F  M  Q  T  E  T  P  E  Y  K  M  N  K  K  A  G         p.340

          .      | 13  .         .         .         .         .    g.32535
 GTGGCAGCTAGCAAG | GTAAAATACAAAGAAGACTATGAAAAGAATAAAGGAAAAGCAGAT    c.1080
 V  A  A  S  K   | V  K  Y  K  E  D  Y  E  K  N  K  G  K  A  D      p.360

          .         .         .         .         .         .       g.32595
 TATAATGTGCTTCCTGCTTCAGAGAACCCACAGCTTAGGCAGCTGAAGGCAGCAGGAGAT       c.1140
 Y  N  V  L  P  A  S  E  N  P  Q  L  R  Q  L  K  A  A  G  D         p.380

          .   | 14     .         .         .         .         .    g.41887
 GCCCTAAGTGAC | AAACTATACAAGGAAAACTATGAAAAGACAAAAGCAAAGAGCATAAAT    c.1200
 A  L  S  D   | K  L  Y  K  E  N  Y  E  K  T  K  A  K  S  I  N      p.400

          .         .         .         .         .        | 15.    g.42033
 TACTGCGAGACCCCCAAATTCAAGCTCGATACTGTTCTGCAGAACTTCAGTAGTGAT | AAA    c.1260
 Y  C  E  T  P  K  F  K  L  D  T  V  L  Q  N  F  S  S  D   | K      p.420

          .         .         .         .         .         .       g.42093
 AAATATAAAGATTCCTACTTAAAAGATATTTTGGGACATTATGTAGGCAGCTTCGAGGAT       c.1320
 K  Y  K  D  S  Y  L  K  D  I  L  G  H  Y  V  G  S  F  E  D         p.440

          .         .         .         .      | 16  .         .    g.42250
 CCATACCATTCACACTGCATGAAAGTCACAGCTCAAAACAGTGAT | AAAAACTACAAAGCA    c.1380
 P  Y  H  S  H  C  M  K  V  T  A  Q  N  S  D   | K  N  Y  K  A      p.460

          .         .         .         .         .         .       g.42310
 GAATACGAAGAAGACAGAGGCAAAGGCTTCTTCCCTCAGACCATAACTCAAGAATATGAA       c.1440
 E  Y  E  E  D  R  G  K  G  F  F  P  Q  T  I  T  Q  E  Y  E         p.480

          .         .         . | 17       .         .         .    g.42782
 GCAATTAAGAAACTAGATCAGTGTAAAGAC | CACACCTACAAAGTCCATCCAGATAAGACA    c.1500
 A  I  K  K  L  D  Q  C  K  D   | H  T  Y  K  V  H  P  D  K  T      p.500

          .         .         .         .         .         .       g.42842
 AAATTCACCCAAGTTACAGACTCTCCTGTTCTGCTACAAGCCCAAGTCAATTCCAAACAA       c.1560
 K  F  T  Q  V  T  D  S  P  V  L  L  Q  A  Q  V  N  S  K  Q         p.520

           | 18        .         .         .         .         .    g.43856
 CTGAGTGAC | TTAAATTACAAAGCAAAACATGAAAGTGAAAAGTTCAAGTGCCATATCCCC    c.1620
 L  S  D   | L  N  Y  K  A  K  H  E  S  E  K  F  K  C  H  I  P      p.540

          .         .         .         .         .     | 19   .    g.44864
 CCTGATACTCCTGCTTTTATCCAGCACAAAGTCAATGCCTATAACTTGAGTGAT | AATCTT    c.1680
 P  D  T  P  A  F  I  Q  H  K  V  N  A  Y  N  L  S  D   | N  L      p.560

          .         .         .         .         .         .       g.44924
 TATAAGCAAGACTGGGAGAAGAGCAAAGCCAAAAAGTTTGACATTAAAGTGGATGCCATT       c.1740
 Y  K  Q  D  W  E  K  S  K  A  K  K  F  D  I  K  V  D  A  I         p.580

          .         .         .         .   | 20     .         .    g.45069
 CCCCTGCTGGCAGCCAAAGCCAACACCAAGAACACCAGCGAT | GTGATGTACAAGAAAGAC    c.1800
 P  L  L  A  A  K  A  N  T  K  N  T  S  D   | V  M  Y  K  K  D      p.600

          .         .         .         .         .         .       g.45129
 TATGAAAAAAACAAAGGGAAAATGATTGGAGTCCTCAGCATTAATGACGATCCCAAGATG       c.1860
 Y  E  K  N  K  G  K  M  I  G  V  L  S  I  N  D  D  P  K  M         p.620

          .         .         .       | 21 .         .         .    g.47149
 CTGCACTCCTTGAAGGTGGCCAAAAACCAGAGTGAT | AGATTATACAAGGAAAACTATGAG    c.1920
 L  H  S  L  K  V  A  K  N  Q  S  D   | R  L  Y  K  E  N  Y  E      p.640

          .         .         .         .         .         .       g.47209
 AAGACAAAGGCAAAGAGTATGAATTACTGTGAGACCCCAAAATATCAACTTGATACTCAG       c.1980
 K  T  K  A  K  S  M  N  Y  C  E  T  P  K  Y  Q  L  D  T  Q         p.660

          .         | 22         .         .         .         .    g.47363
 CTGAAGAACTTCAGTGAG | GCTAGATATAAAGACTTATATGTAAAGGATGTTTTGGGACAT    c.2040
 L  K  N  F  S  E   | A  R  Y  K  D  L  Y  V  K  D  V  L  G  H      p.680

          .         .         .         .         .         .       g.47423
 TATGTAGGCAGCATGGAGGACCCATATCACACACACTGCATGAAAGTTGCAGCTCAAAAC       c.2100
 Y  V  G  S  M  E  D  P  Y  H  T  H  C  M  K  V  A  A  Q  N         p.700

        | 23 .         .         .         .         .         .    g.47573
 AGTGAT | AAAAGTTACAAAGCAGAATATGAAGAAGATAAAGGAAAATGCTATTTCCCTCAG    c.2160
 S  D   | K  S  Y  K  A  E  Y  E  E  D  K  G  K  C  Y  F  P  Q      p.720

          .         .         .         .         .  | 24      .    g.48671
 ACAATAACACAAGAATATGAAGCAATCAAGAAGCTGGACCAGTGTAAAGAT | CATACCTAC    c.2220
 T  I  T  Q  E  Y  E  A  I  K  K  L  D  Q  C  K  D   | H  T  Y      p.740

          .         .         .         .         .         .       g.48731
 AAAGTTCATCCAGATAAGACCAAATTCACGGCAGTCACTGATTCTCCTGTACTGTTGCAA       c.2280
 K  V  H  P  D  K  T  K  F  T  A  V  T  D  S  P  V  L  L  Q         p.760

          .         .         . | 25       .         .         .    g.51121
 GCCCAGCTCAACACGAAACAGCTTAGTGAT | CTGAATTACAAAGCAAAACATGAAGGTGAG    c.2340
 A  Q  L  N  T  K  Q  L  S  D   | L  N  Y  K  A  K  H  E  G  E      p.780

          .         .         .         .         .         .       g.51181
 AAGTTCAAGTGCCATATACCAGCAGATGCTCCACAGTTTATCCAACACAGAGTCAATGCC       c.2400
 K  F  K  C  H  I  P  A  D  A  P  Q  F  I  Q  H  R  V  N  A         p.800

          .      | 26  .         .         .         .         .    g.51799
 TATAATCTGAGTGAT | AATGTTTATAAGCAAGACTGGGAGAAGAGCAAAGCCAAGAAGTTT    c.2460
 Y  N  L  S  D   | N  V  Y  K  Q  D  W  E  K  S  K  A  K  K  F      p.820

          .         .         .         .         .         .       g.51859
 GACATTAAAGTGGACGCCATTCCCCTGTTGGCAGCCAAAGCCAACACCAAGAACACCAGC       c.2520
 D  I  K  V  D  A  I  P  L  L  A  A  K  A  N  T  K  N  T  S         p.840

     | 27    .         .         .         .         .         .    g.52012
 GAT | GTGATGTACAAGAAAGACTATGAAAAGAGCAAAGGGAAAATGATTGGAGCCCTCAGC    c.2580
 D   | V  M  Y  K  K  D  Y  E  K  S  K  G  K  M  I  G  A  L  S      p.860

          .         .         .         .         .        | 28.    g.54515
 ATTAATGACGATCCAAAGATGCTGCACTCCTTGAAGACAGCCAAAAACCAGAGTGAT | CGC    c.2640
 I  N  D  D  P  K  M  L  H  S  L  K  T  A  K  N  Q  S  D   | R      p.880

          .         .         .         .         .         .       g.54575
 GAATATCGAAAAGATTATGAAAAGTCAAAAACTATCTACACGGCACCTCTTGATATGCTC       c.2700
 E  Y  R  K  D  Y  E  K  S  K  T  I  Y  T  A  P  L  D  M  L         p.900

          .         .         .         .         .         .       g.54635
 CAAGTCACTCAAGCTAAGAAATCTCAGGCAATTGCCAGCGACGTTGATTATAAGCACATC       c.2760
 Q  V  T  Q  A  K  K  S  Q  A  I  A  S  D  V  D  Y  K  H  I         p.920

          .         .         .         .         .         .       g.54695
 TTACACAGTTACAGCTACCCCCCTGATAGCATCAATGTGGACCTTGCCAAGAAGGCATAT       c.2820
 L  H  S  Y  S  Y  P  P  D  S  I  N  V  D  L  A  K  K  A  Y         p.940

          .      | 29  .         .         .         .         .    g.56763
 GCGCTGCAGAGCGAT | GTTGAATACAAAGCTGACTACAATAGCTGGATGAAAGGTTGTGGC    c.2880
 A  L  Q  S  D   | V  E  Y  K  A  D  Y  N  S  W  M  K  G  C  G      p.960

          .         .         .         .         .         .       g.56823
 TGGGTGCCTTTTGGGTCCTTAGAAATGGAAAAGGCAAAGCGAGCTTCAGACATCCTCAAT       c.2940
 W  V  P  F  G  S  L  E  M  E  K  A  K  R  A  S  D  I  L  N         p.980

     | 30    .         .         .         .         .         .    g.58716
 GAG | AAAAAATATCGCCAACATCCAGACACCCTCAAGTTTACCTCGATTGAAGATGCTCCA    c.3000
 E   | K  K  Y  R  Q  H  P  D  T  L  K  F  T  S  I  E  D  A  P      p.1000

          .         .         .         .   | 31     .         .    g.59483
 ATTACAGTACAGTCTAAAATTAACCAGGCCCAGAGGAGTGAT | ATCGCTTACAAAGCCAAA    c.3060
 I  T  V  Q  S  K  I  N  Q  A  Q  R  S  D   | I  A  Y  K  A  K      p.1020

          .         .         .         .         .         .       g.59543
 GGAGAGGAAATTATTCACAAATACAACCTGCCACCAGACCTGCCCCAGTTCATCCAGGCT       c.3120
 G  E  E  I  I  H  K  Y  N  L  P  P  D  L  P  Q  F  I  Q  A         p.1040

          .         .        | 32.         .         .         .    g.59692
 AAAGTTAATGCCTACAATATCAGTGAG | AATATGTACAAAGCAGACTTGAAAGACTTGAGC    c.3180
 K  V  N  A  Y  N  I  S  E   | N  M  Y  K  A  D  L  K  D  L  S      p.1060

          .         .         .         .         .         .       g.59752
 AAGAAGGGATATGACCTGAGAACTGATGCGATTCCCATCAGAGCTGCCAAAGCTGCCAGG       c.3240
 K  K  G  Y  D  L  R  T  D  A  I  P  I  R  A  A  K  A  A  R         p.1080

          .      | 33  .         .         .         .         .    g.61345
 CAGGCGGCGAGTGAC | GTTCAGTACAAAAAAGACTATGAAAAGGCTAAAGGGAAAATGGTT    c.3300
 Q  A  A  S  D   | V  Q  Y  K  K  D  Y  E  K  A  K  G  K  M  V      p.1100

          .         .         .         .         .         .       g.61405
 GGCTTCCAAAGTCTTCAAGATGACCCTAAACTGGTTCATTATATGAACGTGGCCAAGATA       c.3360
 G  F  Q  S  L  Q  D  D  P  K  L  V  H  Y  M  N  V  A  K  I         p.1120

          .         .         .         .         .         .       g.61465
 CAATCAGATCGGGAGTATAAAAAAGACTATGAGAAGACAAAGTCCAAATACAACACGCCC       c.3420
 Q  S  D  R  E  Y  K  K  D  Y  E  K  T  K  S  K  Y  N  T  P         p.1140

          .         .         .         .         .         .       g.61525
 CATGATATGTTCAATGTCGTGGCGGCTAAGAAAGCCCAGGATGTGGTCAGCAATGTCAAC       c.3480
 H  D  M  F  N  V  V  A  A  K  K  A  Q  D  V  V  S  N  V  N         p.1160

          .         .         .         .         .         .       g.61585
 TATAAGCATTCTCTCCATCATTACACCTACTTGCCTGACGCCATGGACCTGGAGCTGTCT       c.3540
 Y  K  H  S  L  H  H  Y  T  Y  L  P  D  A  M  D  L  E  L  S         p.1180

          .         .        | 34.         .         .         .    g.61749
 AAGAACATGATGCAGATACAGAGTGAT | AACGTCTACAAGGAAGACTACAACAACTGGATG    c.3600
 K  N  M  M  Q  I  Q  S  D   | N  V  Y  K  E  D  Y  N  N  W  M      p.1200

          .         .         .         .         .         .       g.61809
 AAAGGCATTGGCTGGATTCCTATTGGCAGTCTCGACGTCGAAAAAGTTAAAAAGGCCGGT       c.3660
 K  G  I  G  W  I  P  I  G  S  L  D  V  E  K  V  K  K  A  G         p.1220

          .         .         .         .         .         .       g.61869
 GATGCTCTGAATGAAAAGAAGTACAGGCAACATCCAGACACCCTCAAATTTACCAGCATT       c.3720
 D  A  L  N  E  K  K  Y  R  Q  H  P  D  T  L  K  F  T  S  I         p.1240

          .         .         .         .         .     | 35   .    g.64102
 GTGGACTCCCCAGTTATGGTCCAGGCAAAACAGAACACGAAGCAAGTCAGTGAT | ATCTTA    c.3780
 V  D  S  P  V  M  V  Q  A  K  Q  N  T  K  Q  V  S  D   | I  L      p.1260

          .         .         .         .         .         .       g.64162
 TACAAGGCTAAAGGAGAAGATGTGAAACATAAATACACCATGAGTCCTGATCTTCCTCAG       c.3840
 Y  K  A  K  G  E  D  V  K  H  K  Y  T  M  S  P  D  L  P  Q         p.1280

          .         .         .          | 36        .         .    g.64924
 TTTCTCCAGGCCAAGTGCAATGCTTACAATATAAGTGAC | GTCTGTTATAAACGGGATTGG    c.3900
 F  L  Q  A  K  C  N  A  Y  N  I  S  D   | V  C  Y  K  R  D  W      p.1300

          .         .         .         .         .         .       g.64984
 CATGACTTAATAGCCAAGGGCAACAATGTGCTGGGCGATGCTATTCCCATCACTGCAGCC       c.3960
 H  D  L  I  A  K  G  N  N  V  L  G  D  A  I  P  I  T  A  A         p.1320

          .         .        | 37.         .         .         .    g.66840
 AAGGCATCGAGAAACATTGCCAGTGAT | TATAAATACAAGGAAGCTTATGAGAAGTCAAAG    c.4020
 K  A  S  R  N  I  A  S  D   | Y  K  Y  K  E  A  Y  E  K  S  K      p.1340

          .         .         .         .         .         .       g.66900
 GGAAAGCATGTGGGTTTCAGAAGCCTCCAGGATGATCCCAAGCTGGTCCACTATATGAAT       c.4080
 G  K  H  V  G  F  R  S  L  Q  D  D  P  K  L  V  H  Y  M  N         p.1360

          .         .         .         .         .         .       g.66960
 GTGGCAAAGCTGCAGTCTGATCGTGAATACAAGAAGAACTATGAGAACACCAAAACCAGC       c.4140
 V  A  K  L  Q  S  D  R  E  Y  K  K  N  Y  E  N  T  K  T  S         p.1380

          .         .         .         .         .         .       g.67020
 TACCATACCCCTGGGGACATGGTTAGCATCACAGCTGCAAAGATGGCCCAGGATGTCGCT       c.4200
 Y  H  T  P  G  D  M  V  S  I  T  A  A  K  M  A  Q  D  V  A         p.1400

          .         .         .         .         .         .       g.67080
 ACCAATGTCAACTACAAACAGCCATTGCATCATTACACATACCTACCTGACGCCATGAGT       c.4260
 T  N  V  N  Y  K  Q  P  L  H  H  Y  T  Y  L  P  D  A  M  S         p.1420

          .         .         .          | 38        .         .    g.68279
 CTTGAGCATACGAGGAATGTCAATCAAATTCAGAGTGAT | AATGTGTATAAAGACGAGTAT    c.4320
 L  E  H  T  R  N  V  N  Q  I  Q  S  D   | N  V  Y  K  D  E  Y      p.1440

          .         .         .         .         .         .       g.68339
 AACAGCTTCTTGAAGGGCATCGGATGGATCCCTATTGGTTCCCTGGAGGTGGAGAAGGTC       c.4380
 N  S  F  L  K  G  I  G  W  I  P  I  G  S  L  E  V  E  K  V         p.1460

          .         .         .         .         .         .       g.68399
 AAGAAAGCAGGCGATGCATTAAATGAGAGGAAGTATCGACAGCACCCAGATACCGTCAAG       c.4440
 K  K  A  G  D  A  L  N  E  R  K  Y  R  Q  H  P  D  T  V  K         p.1480

          .         .         .         .         .         .       g.68459
 TTCACAAGTGTGCCTGATTCCATGGGCATGGTGTTGGCTCAGCATAACACAAAGCAGCTA       c.4500
 F  T  S  V  P  D  S  M  G  M  V  L  A  Q  H  N  T  K  Q  L         p.1500

        | 39 .         .         .         .         .         .    g.70410
 AGTGAT | TTGAACTACAAGGTAGAGGGAGAGAAACTGAAGCACAAGTATACTATTGACCCT    c.4560
 S  D   | L  N  Y  K  V  E  G  E  K  L  K  H  K  Y  T  I  D  P      p.1520

          .         .         .         .         .  | 40      .    g.71585
 GAATTGCCTCAGTTTATTCAAGCCAAAGTCAACGCCCTCAACATGAGTGAT | GCTCATTAT    c.4620
 E  L  P  Q  F  I  Q  A  K  V  N  A  L  N  M  S  D   | A  H  Y      p.1540

          .         .         .         .         .         .       g.71645
 AAAGCAGATTGGAAGAAAACCATTGCCAAGGGCTATGATTTGAGACCAGATGCCATCCCA       c.4680
 K  A  D  W  K  K  T  I  A  K  G  Y  D  L  R  P  D  A  I  P         p.1560

          .         .         .          | 41        .         .    g.73107
 ATTGTTGCTGCAAAAAGTTCAAGGAATATTGCTAGTGAT | TGCAAATATAAGGAGGCCTAC    c.4740
 I  V  A  A  K  S  S  R  N  I  A  S  D   | C  K  Y  K  E  A  Y      p.1580

          .         .         .         .         .         .       g.73167
 GAGAAAGCCAAAGGCAAGCAAGTTGGATTTCTCAGTCTTCAGGATGATCCTAAACTGGTT       c.4800
 E  K  A  K  G  K  Q  V  G  F  L  S  L  Q  D  D  P  K  L  V         p.1600

          .         .         .         .         .         .       g.73227
 CACTACATGAATGTGGCCAAAATCCAGTCTGATCGTGAGTACAAAAAGGGCTATGAAGCC       c.4860
 H  Y  M  N  V  A  K  I  Q  S  D  R  E  Y  K  K  G  Y  E  A         p.1620

          .         .         .         .         .         .       g.73287
 AGCAAGACCAAGTACCACACACCTCTGGATATGGTCAGTGTGACAGCTGCAAAGAAATCT       c.4920
 S  K  T  K  Y  H  T  P  L  D  M  V  S  V  T  A  A  K  K  S         p.1640

          .         .         .         .         .         .       g.73347
 CAGGAGGTTGCCACCAACGCCAACTACAGACAGTCATACCACCACTACACTCTCCTGCCC       c.4980
 Q  E  V  A  T  N  A  N  Y  R  Q  S  Y  H  H  Y  T  L  L  P         p.1660

          .         .         .         .         .  | 42      .    g.73957
 GATGCCTTGAATGTGGAGCACTCCAGGAATGCCATGCAGATTCAGAGTGAT | AATCTGTAC    c.5040
 D  A  L  N  V  E  H  S  R  N  A  M  Q  I  Q  S  D   | N  L  Y      p.1680

          .         .         .         .         .         .       g.74017
 AAATCTGACTTCACCAATTGGATGAAAGGGATCGGCTGGGTGCCCATAGAGTCCCTGGAG       c.5100
 K  S  D  F  T  N  W  M  K  G  I  G  W  V  P  I  E  S  L  E         p.1700

          .         .         .         .         .         .       g.74077
 GTGGAGAAGGCAAAGAAAGCAGGAGAGATTCTTAGTGAGAAGAAGTATCGCCAGCACCCC       c.5160
 V  E  K  A  K  K  A  G  E  I  L  S  E  K  K  Y  R  Q  H  P         p.1720

          .         .         .         .         .         .       g.74137
 GAGAAGCTGAAGTTCACTTACGCCATGGACACAATGGAACAGGCACTTAACAAGAGTAAC       c.5220
 E  K  L  K  F  T  Y  A  M  D  T  M  E  Q  A  L  N  K  S  N         p.1740

          .         | 43         .         .         .         .    g.74666
 AAACTGAACATGGACAAG | AGGCTCTACACTGAAAAATGGAACAAGGACAAGACCACCATT    c.5280
 K  L  N  M  D  K   | R  L  Y  T  E  K  W  N  K  D  K  T  T  I      p.1760

          .         .         .         .         .         .       g.74726
 CATGTCATGCCTGACACACCGGATATTTTACTCTCCAGAGTAAACCAAATCACCATGAGT       c.5340
 H  V  M  P  D  T  P  D  I  L  L  S  R  V  N  Q  I  T  M  S         p.1780

     | 44    .         .         .         .         .         .    g.74936
 GAT | AAACTGTACAAAGCTGGCTGGGAAGAGGAAAAGAAGAAAGGATATGACCTGAGGCCT    c.5400
 D   | K  L  Y  K  A  G  W  E  E  E  K  K  K  G  Y  D  L  R  P      p.1800

          .         .         .         .         .  | 45      .    g.75637
 GATGCCATTGCAATAAAGGCTGCAAGAGCCTCTAGAGACATTGCCAGTGAT | TACAAATAC    c.5460
 D  A  I  A  I  K  A  A  R  A  S  R  D  I  A  S  D   | Y  K  Y      p.1820

          .         .         .         .         .         .       g.75697
 AAGAAAGCCTATGAACAAGCCAAAGGGAAACACATTGGCTTCCGGAGCCTGGAAGATGAC       c.5520
 K  K  A  Y  E  Q  A  K  G  K  H  I  G  F  R  S  L  E  D  D         p.1840

          .         .         .         .         .         .       g.75757
 CCCAAGCTGGTGCACTTCATGCAAGTGGCCAAGATGCAGTCAGACCGGGAATACAAGAAG       c.5580
 P  K  L  V  H  F  M  Q  V  A  K  M  Q  S  D  R  E  Y  K  K         p.1860

          .         .         .         .         .         .       g.75817
 GGATATGAGAAATCCAAGACCTCCTTCCACACCCCGGTGGACATGCTCAGTGTGGTGGCA       c.5640
 G  Y  E  K  S  K  T  S  F  H  T  P  V  D  M  L  S  V  V  A         p.1880

          .         .         .         .         .         .       g.75877
 GCCAAGAAGTCTCAGGAAGTGGCCACCAATGCCAACTACAGGAACGTGATCCATACCTAC       c.5700
 A  K  K  S  Q  E  V  A  T  N  A  N  Y  R  N  V  I  H  T  Y         p.1900

          .         .         .         .         .         .       g.75937
 AACATGCTTCCTGATGCCATGAGCTTTGAATTGGCCAAAAATATGATGCAGATTCAAAGT       c.5760
 N  M  L  P  D  A  M  S  F  E  L  A  K  N  M  M  Q  I  Q  S         p.1920

     | 46    .         .         .         .         .         .    g.77203
 GAT | AATCAGTACAAGGCTGACTATGCTGACTTCATGAAGGGCATTGGATGGCTCCCTCTG    c.5820
 D   | N  Q  Y  K  A  D  Y  A  D  F  M  K  G  I  G  W  L  P  L      p.1940

          .         .         .         .         .         .       g.77263
 GGCTCCCTGGAAGCAGAGAAAAACAAGAAAGCCATGGAGATTATTAGTGAAAAGAAGTAC       c.5880
 G  S  L  E  A  E  K  N  K  K  A  M  E  I  I  S  E  K  K  Y         p.1960

          .         .         .         .         .         .       g.77323
 CGCCAGCACCCAGACACTTTGAAGTATTCCACACTCATGGACTCGATGAACATGGTTTTG       c.5940
 R  Q  H  P  D  T  L  K  Y  S  T  L  M  D  S  M  N  M  V  L         p.1980

          .         .         . | 47       .         .         .    g.80348
 GCCCAGAATAATGCAAAAATTATGAACGAA | CATCTCTACAAACAAGCATGGGAGGCTGAC    c.6000
 A  Q  N  N  A  K  I  M  N  E   | H  L  Y  K  Q  A  W  E  A  D      p.2000

          .         .         .         .         .         .       g.80408
 AAAACCAAAGTCCACATCATGCCTGATATCCCCCAGATTATTTTGGCAAAGGCAAATGCA       c.6060
 K  T  K  V  H  I  M  P  D  I  P  Q  I  I  L  A  K  A  N  A         p.2020

          .      | 48  .         .         .         .         .    g.81442
 ATTAATATGAGTGAT | AAACTCTACAAACTTTCCTTGGAAGAGTCTAAAAAGAAAGGCTAT    c.6120
 I  N  M  S  D   | K  L  Y  K  L  S  L  E  E  S  K  K  K  G  Y      p.2040

          .         .         .         .         .         .       g.81502
 GATCTCAGACCTGATGCAATTCCTATCAAAGCTGCCAAGGCTTCCAGAGATATTGCAAGT       c.6180
 D  L  R  P  D  A  I  P  I  K  A  A  K  A  S  R  D  I  A  S         p.2060

     | 49    .         .         .         .         .         .    g.83080
 GAT | TATAAATACAAGTACAATTATGAAAAAGGGAAGGGGAAAATGGTTGGTTTCCGCAGT    c.6240
 D   | Y  K  Y  K  Y  N  Y  E  K  G  K  G  K  M  V  G  F  R  S      p.2080

          .         .         .         .         .         .       g.83140
 CTCGAGGATGATCCCAAATTAGTCCATTCCATGCAAGTGGCTAAGATGCAATCTGATCGG       c.6300
 L  E  D  D  P  K  L  V  H  S  M  Q  V  A  K  M  Q  S  D  R         p.2100

          .         .         .         .         .         .       g.83200
 GAGTACAAGAAAAACTATGAGAACACAAAGACCAGCTACCACACCCCTGCCGACATGCTC       c.6360
 E  Y  K  K  N  Y  E  N  T  K  T  S  Y  H  T  P  A  D  M  L         p.2120

          .         .         .         .         .         .       g.83260
 AGTGTCACGGCTGCAAAGGATGCCCAAGCCAACATCACCAACACTAACTACAAGCACCTG       c.6420
 S  V  T  A  A  K  D  A  Q  A  N  I  T  N  T  N  Y  K  H  L         p.2140

          .         .         .         .         .         .       g.83320
 ATTCACAAGTACATCCTCCTTCCAGATGCAATGAACATTGAGCTGACCAGGAATATGAAT       c.6480
 I  H  K  Y  I  L  L  P  D  A  M  N  I  E  L  T  R  N  M  N         p.2160

          .      | 50  .         .         .         .         .    g.83509
 CGCATACAGAGTGAT | AATGAATATAAGCAAGATTACAATGAATGGTACAAAGGGCTTGGC    c.6540
 R  I  Q  S  D   | N  E  Y  K  Q  D  Y  N  E  W  Y  K  G  L  G      p.2180

          .         .         .         .         .         .       g.83569
 TGGAGTCCAGCAGGTTCTCTGGAAGTGGAGAAGGCCAAGAAAGCAACTGAATATGCCAGT       c.6600
 W  S  P  A  G  S  L  E  V  E  K  A  K  K  A  T  E  Y  A  S         p.2200

          .         .         .         .         .         .       g.83629
 GATCAGAAATACCGCCAGCACCCGAGCAACTTCCAGTTTAAGAAGCTGACTGATTCCATG       c.6660
 D  Q  K  Y  R  Q  H  P  S  N  F  Q  F  K  K  L  T  D  S  M         p.2220

          .         .         .         .   | 51     .         .    g.84131
 GACATGGTGCTTGCCAAGCAGAATGCACATACCATGAACAAG | CATTTATACACCATTGAT    c.6720
 D  M  V  L  A  K  Q  N  A  H  T  M  N  K   | H  L  Y  T  I  D      p.2240

          .         .         .         .         .         .       g.84191
 TGGAATAAAGATAAGACCAAGATTCATGTGATGCCTGATACACCAGATATTTTACAAGCC       c.6780
 W  N  K  D  K  T  K  I  H  V  M  P  D  T  P  D  I  L  Q  A         p.2260

          .         .        | 52.         .         .         .    g.85421
 AAGCAGAATCAAACACTGTATAGTCAG | AAACTCTATAAACTTGGATGGGAAGAAGCTTTG    c.6840
 K  Q  N  Q  T  L  Y  S  Q   | K  L  Y  K  L  G  W  E  E  A  L      p.2280

          .         .         .         .         .         .       g.85481
 AAGAAAGGCTATGATCTCCCAGTTGATGCAATTTCTGTACAGCTAGCTAAAGCTTCAAGA       c.6900
 K  K  G  Y  D  L  P  V  D  A  I  S  V  Q  L  A  K  A  S  R         p.2300

          .      | 53  .         .         .         .         .    g.88647
 GACATTGCTAGTGAT | TATAAATACAAACAAGGCTACCGAAAGCAACTTGGCCACCATGTT    c.6960
 D  I  A  S  D   | Y  K  Y  K  Q  G  Y  R  K  Q  L  G  H  H  V      p.2320

          .         .         .         .         .         .       g.88707
 GGATTCCGGAGTCTGCAAGATGACCCAAAACTTGTGTTGTCCATGAATGTAGCCAAAATG       c.7020
 G  F  R  S  L  Q  D  D  P  K  L  V  L  S  M  N  V  A  K  M         p.2340

          .         .         .         .         .         .       g.88767
 CAGAGTGAAAGAGAATACAAGAAGGACTTTGAGAAGTGGAAAACTAAGTTCTCCAGCCCA       c.7080
 Q  S  E  R  E  Y  K  K  D  F  E  K  W  K  T  K  F  S  S  P         p.2360

          .         .         .         .         .         .       g.88827
 GTGGACATGTTGGGAGTGGTACTGGCCAAGAAGTGTCAGGAGTTGGTTAGTGACGTGGAC       c.7140
 V  D  M  L  G  V  V  L  A  K  K  C  Q  E  L  V  S  D  V  D         p.2380

          .         .         .         .         .         .       g.88887
 TACAAGAACTACCTGCATCAGTGGACATGTCTGCCTGATCAGAACGATGTTGTGCAAGCT       c.7200
 Y  K  N  Y  L  H  Q  W  T  C  L  P  D  Q  N  D  V  V  Q  A         p.2400

          .         .        | 54.         .         .         .    g.89141
 AAGAAAGTTTATGAACTGCAAAGTGAG | AATCTATATAAATCTGACCTTGAGTGGCTGAGA    c.7260
 K  K  V  Y  E  L  Q  S  E   | N  L  Y  K  S  D  L  E  W  L  R      p.2420

          .         .         .         .         .         .       g.89201
 GGCATAGGATGGAGTCCCTTGGGTTCTTTAGAGGCAGAAAAGAACAAGCGGGCTTCGGAA       c.7320
 G  I  G  W  S  P  L  G  S  L  E  A  E  K  N  K  R  A  S  E         p.2440

          .         .         .         .         .         .       g.89261
 ATCATCAGTGAGAAGAAATATCGTCAGCCTCCAGACAGAAACAAGTTCACCAGCATTCCT       c.7380
 I  I  S  E  K  K  Y  R  Q  P  P  D  R  N  K  F  T  S  I  P         p.2460

          .         .         .         .         .  | 55      .    g.93262
 GATGCCATGGATATAGTTCTGGCAAAGACAAATGCCAAAAATAGGAGTGAT | AGACTTTAT    c.7440
 D  A  M  D  I  V  L  A  K  T  N  A  K  N  R  S  D   | R  L  Y      p.2480

          .         .         .         .         .         .       g.93322
 AGAGAAGCTTGGGACAAAGACAAGACTCAGATCCACATCATGCCTGATACACCTGACATT       c.7500
 R  E  A  W  D  K  D  K  T  Q  I  H  I  M  P  D  T  P  D  I         p.2500

          .         .         .       | 56 .         .         .    g.94936
 GTTCTGGCTAAAGCAAACTTAATCAACACAAGTGAT | AAACTCTACCGAATGGGTTATGAG    c.7560
 V  L  A  K  A  N  L  I  N  T  S  D   | K  L  Y  R  M  G  Y  E      p.2520

          .         .         .         .         .         .       g.94996
 GAGCTGAAGAGAAAAGGTTACGATCTTCCTGTTGATGCCATACCAATCAAAGCAGCAAAA       c.7620
 E  L  K  R  K  G  Y  D  L  P  V  D  A  I  P  I  K  A  A  K         p.2540

          .         .     | 57   .         .         .         .    g.95394
 GCCTCCCGGGAAATTGCCAGTGAA | TACAAGTACAAGGAAGGCTTTCGCAAGCAGCTCGGC    c.7680
 A  S  R  E  I  A  S  E   | Y  K  Y  K  E  G  F  R  K  Q  L  G      p.2560

          .         .         .         .         .         .       g.95454
 CACCACATTGGTGCCCGGAACATTGAAGATGACCCCAAGATGATGTGGTCCATGCATGTG       c.7740
 H  H  I  G  A  R  N  I  E  D  D  P  K  M  M  W  S  M  H  V         p.2580

          .         .         .         .         .         .       g.95514
 GCCAAGATCCAGAGTGACAGGGAGTACAAGAAGGACTTTGAGAAGTGGAAGACCAAGTTC       c.7800
 A  K  I  Q  S  D  R  E  Y  K  K  D  F  E  K  W  K  T  K  F         p.2600

          .         .         .         .         .         .       g.95574
 AGCAGCCCAGTGGACATGCTGGGGGTGGTGTTGGCCAAGAAGTGCCAGACCTTAGTCAGC       c.7860
 S  S  P  V  D  M  L  G  V  V  L  A  K  K  C  Q  T  L  V  S         p.2620

          .         .         .         .         .         .       g.95634
 GACGTGGACTACAAGAACTACCTGCACCAGTGGACATGCCTGCCCGACCAGAGCGATGTC       c.7920
 D  V  D  Y  K  N  Y  L  H  Q  W  T  C  L  P  D  Q  S  D  V         p.2640

          .         .         .       | 58 .         .         .    g.96158
 ATCCATGCTCGGCAGGCCTATGACCTCCAGAGCGAT | AATTTGTACAAGTCAGACCTTCAG    c.7980
 I  H  A  R  Q  A  Y  D  L  Q  S  D   | N  L  Y  K  S  D  L  Q      p.2660

          .         .         .         .         .         .       g.96218
 TGGCTAAAAGGCATTGGCTGGATGACTAGTGGTTCTCTCGAGGATGAGAAAAATAAACGA       c.8040
 W  L  K  G  I  G  W  M  T  S  G  S  L  E  D  E  K  N  K  R         p.2680

          .         .         .         .         .         .       g.96278
 GCCACCCAGATTTTGAGTGACCATGTTTACCGTCAGCACCCAGATCAATTTAAGTTTTCC       c.8100
 A  T  Q  I  L  S  D  H  V  Y  R  Q  H  P  D  Q  F  K  F  S         p.2700

          .         .         .         .         .         .       g.96338
 AGCCTTATGGATTCCATACCAATGGTTTTGGCAAAAAACAATGCTATTACCATGAATCAT       c.8160
 S  L  M  D  S  I  P  M  V  L  A  K  N  N  A  I  T  M  N  H         p.2720

  | 59       .         .         .         .         .         .    g.96678
  | CGCCTCTATACAGAAGCTTGGGATAAAGATAAAACCACTGTCCACATTATGCCAGATACC    c.8220
  | R  L  Y  T  E  A  W  D  K  D  K  T  T  V  H  I  M  P  D  T      p.2740

          .         .         .         .      | 60  .         .    g.96821
 CCTGAAGTTTTATTAGCTAAACAAAACAAAGTAAATTACAGTGAG | AAATTGTATAAGCTT    c.8280
 P  E  V  L  L  A  K  Q  N  K  V  N  Y  S  E   | K  L  Y  K  L      p.2760

          .         .         .         .         .         .       g.96881
 GGCCTAGAAGAAGCCAAGAGGAAAGGTTATGACATGCGGGTAGATGCCATTCCTATCAAG       c.8340
 G  L  E  E  A  K  R  K  G  Y  D  M  R  V  D  A  I  P  I  K         p.2780

          .         .         .    | 61    .         .         .    g.98848
 GCAGCCAAGGCCTCCAGAGATATTGCAAGTGAA | TTCAAGTACAAAGAAGGCTATCGTAAG    c.8400
 A  A  K  A  S  R  D  I  A  S  E   | F  K  Y  K  E  G  Y  R  K      p.2800

          .         .         .         .         .         .       g.98908
 CAGCTCGGCCACCACATTGGTGCCCGAGCTATACGTGATGACCCCAAGATGATGTGGTCC       c.8460
 Q  L  G  H  H  I  G  A  R  A  I  R  D  D  P  K  M  M  W  S         p.2820

          .         .         .         .         .         .       g.98968
 ATGCACGTGGCCAAGATCCAGAGTGACAGGGAGTACAAGAAGGACTTTGAGAAGTGGAAG       c.8520
 M  H  V  A  K  I  Q  S  D  R  E  Y  K  K  D  F  E  K  W  K         p.2840

          .         .         .         .         .         .       g.99028
 ACCAAGTTCAGCAGCCCAGTGGACATGCTGGGGGTGGTGCTGGCCAAGAAGTGCCAGACC       c.8580
 T  K  F  S  S  P  V  D  M  L  G  V  V  L  A  K  K  C  Q  T         p.2860

          .         .         .         .         .         .       g.99088
 TTAGTCAGCGATGTGGACTACAAGAACTACCTGCACCAGTGGACATGCCTGCCCGACCAG       c.8640
 L  V  S  D  V  D  Y  K  N  Y  L  H  Q  W  T  C  L  P  D  Q         p.2880

          .         .         .         .      | 62  .         .    g.99442
 AGCGACGTCATCCATGCTCGGCAGGCCTATGACCTCCAGAGCGAT | AATATGTACAAGTCT    c.8700
 S  D  V  I  H  A  R  Q  A  Y  D  L  Q  S  D   | N  M  Y  K  S      p.2900

          .         .         .         .         .         .       g.99502
 GATCTCCAGTGGATGAGAGGCATTGGCTGGGTGTCCATTGGCTCTTTGGATGTGGAAAAA       c.8760
 D  L  Q  W  M  R  G  I  G  W  V  S  I  G  S  L  D  V  E  K         p.2920

          .         .         .         .         .         .       g.99562
 TGCAAAAGGGCAACTGAAATTTTGAGTGATAAAATCTATCGCCAGCCTCCAGACAGATTC       c.8820
 C  K  R  A  T  E  I  L  S  D  K  I  Y  R  Q  P  P  D  R  F         p.2940

          .         .         .         .         .         .       g.99622
 AAATTTACCAGTGTGACTGACTCTCTGGAACAAGTGTTGGCCAAAAATAATGCCATCACT       c.8880
 K  F  T  S  V  T  D  S  L  E  Q  V  L  A  K  N  N  A  I  T         p.2960

           | 63        .         .         .         .         .    g.100154
 ATGAACAAG | CGTTTATACACAGAAGCCTGGGACAAAGACAAGACTCAGATCCACATAATG    c.8940
 M  N  K   | R  L  Y  T  E  A  W  D  K  D  K  T  Q  I  H  I  M      p.2980

          .         .         .         .         .     | 64   .    g.103159
 CCAGACACACCAGAAATTATGTTGGCAAGAATGAACAAAATCAACTACAGTGAG | AGTCTG    c.9000
 P  D  T  P  E  I  M  L  A  R  M  N  K  I  N  Y  S  E   | S  L      p.3000

          .         .         .         .         .         .       g.103219
 TACAAACTTGCTAATGAAGAAGCAAAGAAGAAAGGCTATGACTTGCGAAGCGACGCCATC       c.9060
 Y  K  L  A  N  E  E  A  K  K  K  G  Y  D  L  R  S  D  A  I         p.3020

          .         .         .         .   | 65     .         .    g.105540
 CCCATCGTGGCGGCCAAGGCCTCCAGGGACATCATCAGTGAC | TACAAATATAAAGATGGT    c.9120
 P  I  V  A  A  K  A  S  R  D  I  I  S  D   | Y  K  Y  K  D  G      p.3040

          .         .         .         .         .         .       g.105600
 TACTGCAAGCAACTTGGCCACCATATTGGAGCCCGGAACATTGAAGATGACCCCAAGATG       c.9180
 Y  C  K  Q  L  G  H  H  I  G  A  R  N  I  E  D  D  P  K  M         p.3060

          .         .         .         .         .         .       g.105660
 ATGTGGTCCATGCACGTAGCCAAGATCCAGAGTGACAGGGAGTACAAAAAGGACTTTGAG       c.9240
 M  W  S  M  H  V  A  K  I  Q  S  D  R  E  Y  K  K  D  F  E         p.3080

          .         .         .         .         .         .       g.105720
 AAGTGGAAGACCAAGTTCAGCAGCCCAGTGGACATGCTGGGGGTGGTGCTGGCCAAGAAG       c.9300
 K  W  K  T  K  F  S  S  P  V  D  M  L  G  V  V  L  A  K  K         p.3100

          .         .         .         .         .         .       g.105780
 TGCCAGACCTTAGTCAGTGACGTGGACTATAAGAACTACCTGCACGAGTGGACATGCCTG       c.9360
 C  Q  T  L  V  S  D  V  D  Y  K  N  Y  L  H  E  W  T  C  L         p.3120

          .         .         .         .         .     | 66   .    g.108147
 CCTGACCAGAGCGATGTCATCCATGCTCGGCAGGCCTATGACCTCCAGAGTGAC | AATATT    c.9420
 P  D  Q  S  D  V  I  H  A  R  Q  A  Y  D  L  Q  S  D   | N  I      p.3140

          .         .         .         .         .         .       g.108207
 TACAAGTCAGATCTCCAGTGGCTGAGAGGCATTGGCTGGGTCCCCATTGGGTCTATGGAT       c.9480
 Y  K  S  D  L  Q  W  L  R  G  I  G  W  V  P  I  G  S  M  D         p.3160

          .         .         .         .         .         .       g.108267
 GTGGTCAAGTGCAAGAGAGCTACCGAAATACTGAGTGATAACATCTACCGCCAGCCTCCG       c.9540
 V  V  K  C  K  R  A  T  E  I  L  S  D  N  I  Y  R  Q  P  P         p.3180

          .         .         .         .         .         .       g.108327
 GACAAGCTGAAATTTACCAGTGTGACTGATTCTCTAGAGCAGGTGCTGGCCAAGAACAAT       c.9600
 D  K  L  K  F  T  S  V  T  D  S  L  E  Q  V  L  A  K  N  N         p.3200

          .         | 67         .         .         .         .    g.108710
 GCTCTCAACATGAATAAG | CGTTTATACACAGAGGCCTGGGACAAAGACAAGACTCAAATT    c.9660
 A  L  N  M  N  K   | R  L  Y  T  E  A  W  D  K  D  K  T  Q  I      p.3220

          .         .         .         .         .         .       g.108770
 CACATAATGCCTGATACACCAGAGATTATGTTGGCAAGGCAGAACAAAATCAACTACAGT       c.9720
 H  I  M  P  D  T  P  E  I  M  L  A  R  Q  N  K  I  N  Y  S         p.3240

     | 68    .         .         .         .         .         .    g.109898
 GAG | ACTCTATACAAACTTGCCAATGAAGAAGCAAAAAAGAAAGGCTACGACTTGCGAAGT    c.9780
 E   | T  L  Y  K  L  A  N  E  E  A  K  K  K  G  Y  D  L  R  S      p.3260

          .         .         .         .         .  | 69      .    g.111662
 GACGCCATCCCCATCGTGGCTGCCAAGGCCTCCAGGGACGTTATCAGTGAT | TACAAATAC    c.9840
 D  A  I  P  I  V  A  A  K  A  S  R  D  V  I  S  D   | Y  K  Y      p.3280

          .         .         .         .         .         .       g.111722
 AAAGATGGTTACCGCAAGCAGCTCGGCCACCACATTGGAGCCCGGAACATTGAAGATGAC       c.9900
 K  D  G  Y  R  K  Q  L  G  H  H  I  G  A  R  N  I  E  D  D         p.3300

          .         .         .         .         .         .       g.111782
 CCCAAGATGATGTGGTCCATGCATGTGGCCAAGATCCAGAGTGACAGGGAGTATAAGAAG       c.9960
 P  K  M  M  W  S  M  H  V  A  K  I  Q  S  D  R  E  Y  K  K         p.3320

          .         .         .         .         .         .       g.111842
 GACTTTGAGAAGTGGAAGACCAAGTTCAGCAGCCCAGTGGACATGCTGGGAGTGGTGTTA       c.10020
 D  F  E  K  W  K  T  K  F  S  S  P  V  D  M  L  G  V  V  L         p.3340

          .         .         .         .         .         .       g.111902
 GCCAAGAAGTGCCAGACCTTAGTCAGCGATGTGGACTACAAGAACTACCTGCACGAGTGG       c.10080
 A  K  K  C  Q  T  L  V  S  D  V  D  Y  K  N  Y  L  H  E  W         p.3360

          .         .         .         .         .         .       g.111962
 ACGTGCCTGCCCGACCAGAATGATGTCATCCATGCTCGGCAGGCCTATGACCTCCAGAGC       c.10140
 T  C  L  P  D  Q  N  D  V  I  H  A  R  Q  A  Y  D  L  Q  S         p.3380

     | 70    .         .         .         .         .         .    g.112339
 GAT | AACATTTACAAATCTGATCTCCAGTGGCTGAGAGGCATTGGCTGGGTCCCCATTGGG    c.10200
 D   | N  I  Y  K  S  D  L  Q  W  L  R  G  I  G  W  V  P  I  G      p.3400

          .         .         .         .         .         .       g.112399
 TCTATGGATGTGGTCAAGTGCAAGAGAGCTGCTGAAATACTGAGTGATAACATCTACCGC       c.10260
 S  M  D  V  V  K  C  K  R  A  A  E  I  L  S  D  N  I  Y  R         p.3420

          .         .         .         .         .         .       g.112459
 CAGCCTCCGGACAAGCTGAAATTTACCAGTGTGACTGACTCTCTAGAGCAGGTGCTGGCC       c.10320
 Q  P  P  D  K  L  K  F  T  S  V  T  D  S  L  E  Q  V  L  A         p.3440

          .         .        | 71.         .         .         .    g.113882
 AAGAACAATGCTCTCAATATGAACAAG | CGCTTATACACAGAAGCCTGGGACAAAGACAAG    c.10380
 K  N  N  A  L  N  M  N  K   | R  L  Y  T  E  A  W  D  K  D  K      p.3460

          .         .         .         .         .         .       g.113942
 ACCCAAGTCCATATTATGCCTGATACACCTGAAATCATGTTGGCAAGACAAAATAAAATA       c.10440
 T  Q  V  H  I  M  P  D  T  P  E  I  M  L  A  R  Q  N  K  I         p.3480

          .   | 72     .         .         .         .         .    g.118509
 AATTATAGTGAG | AGCCTCTATCGTCAGGCCATGGAAGAAGCCAAGAAAGAAGGCTATGAC    c.10500
 N  Y  S  E   | S  L  Y  R  Q  A  M  E  E  A  K  K  E  G  Y  D      p.3500

          .         .         .         .         .         .       g.118569
 TTGAGAAGTGATGCCATTCCCATTGTGGCTGCCAAGGCCTCTCGGGATATTGCCAGTGAT       c.10560
 L  R  S  D  A  I  P  I  V  A  A  K  A  S  R  D  I  A  S  D         p.3520

  | 73       .         .         .         .         .         .    g.119785
  | TACAAATACAAAGAAGCATATCGTAAACAGTTGGGTCACCACATTGGCGCCCGAGCAGTA    c.10620
  | Y  K  Y  K  E  A  Y  R  K  Q  L  G  H  H  I  G  A  R  A  V      p.3540

          .         .         .         .         .         .       g.119845
 CACGATGACCCCAAGATAATGTGGTCCCTCCACATTGCCAAAGTGCAGAGTGACCGTGAG       c.10680
 H  D  D  P  K  I  M  W  S  L  H  I  A  K  V  Q  S  D  R  E         p.3560

          .         .         .         .         .         .       g.119905
 TACAAGAAAGATTTTGAGAAATACAAGACAAGGTACAGCAGCCCAGTGGACATGCTTGGT       c.10740
 Y  K  K  D  F  E  K  Y  K  T  R  Y  S  S  P  V  D  M  L  G         p.3580

          .         .         .         .         .         .       g.119965
 ATCGTTTTGGCCAAGAAGTGTCAGACCTTGGTCAGCGATGTGGACTATAAACATCCTCTG       c.10800
 I  V  L  A  K  K  C  Q  T  L  V  S  D  V  D  Y  K  H  P  L         p.3600

          .         .         .         .         .         .       g.120025
 CATGAATGGATCTGCCTGCCCGACCAGAATGACATCATTCATGCACGGAAAGCCTATGAC       c.10860
 H  E  W  I  C  L  P  D  Q  N  D  I  I  H  A  R  K  A  Y  D         p.3620

          .   | 74     .         .         .         .         .    g.121057
 CTCCAGAGTGAC | AATTTGTATAAGTCAGACCTTGAATGGATGAAAGGCATTGGCTGGGTT    c.10920
 L  Q  S  D   | N  L  Y  K  S  D  L  E  W  M  K  G  I  G  W  V      p.3640

          .         .         .         .         .         .       g.121117
 CCGATTGATTCCTTGGAAGTTGTTAGGGCCAAGAGAGCTGGAGAATTACTTAGTGATACT       c.10980
 P  I  D  S  L  E  V  V  R  A  K  R  A  G  E  L  L  S  D  T         p.3660

          .         .         .         .         .         .       g.121177
 ATCTACCGTCAGCGTCCAGAAACGCTGAAATTTACCAGTATAACGGACACTCCGGAGCAG       c.11040
 I  Y  R  Q  R  P  E  T  L  K  F  T  S  I  T  D  T  P  E  Q         p.3680

          .         .         .       | 75 .         .         .    g.122043
 GTGCTGGCAAAAAACAATGCTTTAAACATGAATAAG | CGCTTATATACTGAAGCCTGGGAC    c.11100
 V  L  A  K  N  N  A  L  N  M  N  K   | R  L  Y  T  E  A  W  D      p.3700

          .         .         .         .         .         .       g.122103
 AATGACAAGAAAACTATTCATGTCATGCCTGATACACCAGAAATCATGTTAGCCAAACTC       c.11160
 N  D  K  K  T  I  H  V  M  P  D  T  P  E  I  M  L  A  K  L         p.3720

          .         .  | 76      .         .         .         .    g.123417
 AACCGAATAAACTACAGTGAT | AAACTCTATAAACTTGCTTTGGAAGAGTCCAAGAAGGAA    c.11220
 N  R  I  N  Y  S  D   | K  L  Y  K  L  A  L  E  E  S  K  K  E      p.3740

          .         .         .         .         .         .       g.123477
 GGCTATGACTTGCGTCTGGATGCCATTCCAATCCAAGCAGCCAAGGCTTCAAGAGATATT       c.11280
 G  Y  D  L  R  L  D  A  I  P  I  Q  A  A  K  A  S  R  D  I         p.3760

           | 77        .         .         .         .         .    g.124951
 GCTAGTGAT | TACAAGTACAAGGAAGGCTACCGCAAACAGCTTGGCCACCATATTGGGGCC    c.11340
 A  S  D   | Y  K  Y  K  E  G  Y  R  K  Q  L  G  H  H  I  G  A      p.3780

          .         .         .         .         .         .       g.125011
 CGGAACATTAAGGATGACCCGAAGATGATGTGGTCCATCCATGTGGCCAAGATCCAGAGT       c.11400
 R  N  I  K  D  D  P  K  M  M  W  S  I  H  V  A  K  I  Q  S         p.3800

          .         .         .         .         .         .       g.125071
 GACAGGGAGTACAAGAAGGAGTTTGAGAAGTGGAAGACCAAGTTCAGCAGCCCAGTGGAC       c.11460
 D  R  E  Y  K  K  E  F  E  K  W  K  T  K  F  S  S  P  V  D         p.3820

          .         .         .         .         .         .       g.125131
 ATGCTGGGGGTGGTGCTGGCCAAGAAGTGTCAGATCCTTGTAAGCGACATAGACTACAAG       c.11520
 M  L  G  V  V  L  A  K  K  C  Q  I  L  V  S  D  I  D  Y  K         p.3840

          .         .         .         .         .         .       g.125191
 CATCCCCTGCATGAATGGACCTGCCTGCCTGATCAGAATGACGTCATTCAGGCTCGGAAG       c.11580
 H  P  L  H  E  W  T  C  L  P  D  Q  N  D  V  I  Q  A  R  K         p.3860

          .         .  | 78      .         .         .         .    g.127137
 GCCTATGACCTGCAGAGTGAT | GCTATTTACAAATCTGATCTTGAGTGGCTGAGAGGCATA    c.11640
 A  Y  D  L  Q  S  D   | A  I  Y  K  S  D  L  E  W  L  R  G  I      p.3880

          .         .         .         .         .         .       g.127197
 GGATGGGTTCCCATTGGCTCTGTAGAGGTCGAGAAAGTGAAGAGAGCTGGAGAAATCCTG       c.11700
 G  W  V  P  I  G  S  V  E  V  E  K  V  K  R  A  G  E  I  L         p.3900

          .         .         .         .         .         .       g.127257
 AGTGACAGGAAGTATCGCCAGCCTGCAGACCAGCTCAAATTCACATGCATTACCGACACT       c.11760
 S  D  R  K  Y  R  Q  P  A  D  Q  L  K  F  T  C  I  T  D  T         p.3920

          .         .         .         .      | 79  .         .    g.128636
 CCGGAAATTGTCCTAGCAAAGAATAATGCCCTGACAATGAGCAAG | CATTTATACACAGAA    c.11820
 P  E  I  V  L  A  K  N  N  A  L  T  M  S  K   | H  L  Y  T  E      p.3940

          .         .         .         .         .         .       g.128696
 GCTTGGGATGCTGACAAAACCTCCATCCACGTGATGCCAGACACCCCAGATATCCTGCTG       c.11880
 A  W  D  A  D  K  T  S  I  H  V  M  P  D  T  P  D  I  L  L         p.3960

          .         .         . | 80       .         .         .    g.128894
 GCCAAGAGTAATTCTGCCAATATCAGCCAA | AAACTTTACACCAAGGGATGGGATGAATCA    c.11940
 A  K  S  N  S  A  N  I  S  Q   | K  L  Y  T  K  G  W  D  E  S      p.3980

          .         .         .         .         .         .       g.128954
 AAGATGAAGGACTATGATCTGAGAGCAGATGCTATTTCCATCAAAAGTGCCAAGGCCTCC       c.12000
 K  M  K  D  Y  D  L  R  A  D  A  I  S  I  K  S  A  K  A  S         p.4000

          .         | 81         .         .         .         .    g.129409
 AGGGACATCGCCAGTGAC | TACAAATACAAGGAAGCCTATGAGAAACAGAAAGGCCACCAC    c.12060
 R  D  I  A  S  D   | Y  K  Y  K  E  A  Y  E  K  Q  K  G  H  H      p.4020

          .         .         .         .         .         .       g.129469
 ATTGGAGCCCAGAGCATTGAAGATGATCCCAAGATTATGTGTGCCATACATGCAGGAAAA       c.12120
 I  G  A  Q  S  I  E  D  D  P  K  I  M  C  A  I  H  A  G  K         p.4040

          .         .         .         .         .         .       g.129529
 ATTCAAAGTGAAAGGGAGTACAAGAAGGAATTCCAAAAGTGGAAAACCAAGTTCTCTAGC       c.12180
 I  Q  S  E  R  E  Y  K  K  E  F  Q  K  W  K  T  K  F  S  S         p.4060

          .         .         .         .         .         .       g.129589
 CCAGTGGACATGTTAAGCATCTTGCTGGCCAAGAAATGTCAGACTTTGGTCACTGACATT       c.12240
 P  V  D  M  L  S  I  L  L  A  K  K  C  Q  T  L  V  T  D  I         p.4080

          .         .         .         .         .         .       g.129649
 GATTATCGCAATTACCTGCATGAATGGACATGCATGCCGGATCAAAACGACATTATCCAA       c.12300
 D  Y  R  N  Y  L  H  E  W  T  C  M  P  D  Q  N  D  I  I  Q         p.4100

          .         .         . | 82       .         .         .    g.130841
 GCAAAAAAGGCCTATGACCTGCAGAGTGAT | AGTGTGTATAAGGCAGACCTGGAGTGGCTG    c.12360
 A  K  K  A  Y  D  L  Q  S  D   | S  V  Y  K  A  D  L  E  W  L      p.4120
                                ^  sequence block 1

          .         .         .         .         .         .       g.130901
 CGTGGCATCGGCTGGATGCCAGAAGGCTCAGTGGAAATGAACAGAGTGAAGGTTGCTCAA       c.12420
 R  G  I  G  W  M  P  E  G  S  V  E  M  N  R  V  K  V  A  Q         p.4140

          .         .         .         .         .         .       g.130961
 GACCTCGTGAATGAAAGACTCTATAGGACACGTCCAGAAGCTTTGTCATTCACCAGCATT       c.12480
 D  L  V  N  E  R  L  Y  R  T  R  P  E  A  L  S  F  T  S  I         p.4160

          .         .         .         .         .     | 83   .    g.131885
 GTCGACACTCCAGAAGTTGTCTTGGCAAAAGCCAATTCTCTGCAAATAAGTGAG | AAACTG    c.12540
 V  D  T  P  E  V  V  L  A  K  A  N  S  L  Q  I  S  E   | K  L      p.4180

          .         .         .         .         .         .       g.131945
 TATCAGGAAGCCTGGAATAAAGATAAAAGCAACATCACCATTCCTTCTGATACTCCGGAG       c.12600
 Y  Q  E  A  W  N  K  D  K  S  N  I  T  I  P  S  D  T  P  E         p.4200

          .         .         .          | 84        .         .    g.132795
 ATGCTGCAGGCCCACATCAATGCCTTGCAAATCAGCAAT | AAACTCTACCAAAAAGACTGG    c.12660
 M  L  Q  A  H  I  N  A  L  Q  I  S  N   | K  L  Y  Q  K  D  W      p.4220

          .         .         .         .         .         .       g.132855
 AATGACGCCAAGCAGAAAGGCTATGACATAAGGGCAGATGCCATTGAAATCAAGCACGCC       c.12720
 N  D  A  K  Q  K  G  Y  D  I  R  A  D  A  I  E  I  K  H  A         p.4240

          .         .        | 85.         .         .         .    g.134649
 AAGGCCTCCAGAGAAATTGCCAGTGAG | TACAAATACAAAGAAGGTTACCGTAAGCAACTG    c.12780
 K  A  S  R  E  I  A  S  E   | Y  K  Y  K  E  G  Y  R  K  Q  L      p.4260

          .         .         .         .         .         .       g.134709
 GGCCACCACATGGGTTTCCGCACCCTACAAGATGACCCCAAGTCAGTATGGGCTATACAT       c.12840
 G  H  H  M  G  F  R  T  L  Q  D  D  P  K  S  V  W  A  I  H         p.4280

          .         .         .         .         .         .       g.134769
 GCTGCCAAGATCCAGAGTGACAGAGAATATAAGAAAGCTTATGAGAAGTCTAAAGGAATT       c.12900
 A  A  K  I  Q  S  D  R  E  Y  K  K  A  Y  E  K  S  K  G  I         p.4300

          .         .         .         .         .         .       g.134829
 CACAACACACCGTTGGACATGATGTCAATTGTTCAAGCCAAGAAATGCCAGGTCCTGGTT       c.12960
 H  N  T  P  L  D  M  M  S  I  V  Q  A  K  K  C  Q  V  L  V         p.4320

          .         .         .         .         .         .       g.134889
 AGCGACATTGATTATCGCAATTATCTGCACCAGTGGACGTGTCTGCCAGATCAGAACGAT       c.13020
 S  D  I  D  Y  R  N  Y  L  H  Q  W  T  C  L  P  D  Q  N  D         p.4340

          .         .         .          | 86        .         .    g.135736
 GTGATCCAGGCCAAGAAAGCCTACGACCTGCAGAGCGAT | AACTTGTACAAGTCAGACCTG    c.13080
 V  I  Q  A  K  K  A  Y  D  L  Q  S  D   | N  L  Y  K  S  D  L      p.4360

          .         .         .         .         .         .       g.135796
 GAATGGCTGAAGGGTATTGGATGGCTGCCAGAGGGTTCTGTGGAAGTGATGAGAGTGAAG       c.13140
 E  W  L  K  G  I  G  W  L  P  E  G  S  V  E  V  M  R  V  K         p.4380

          .         .         .         .         .         .       g.135856
 AATGCCCAGAATCTCTTGAATGAAAGGCTGTATCGTATAAAGCCTGAGGCCCTCAAATTC       c.13200
 N  A  Q  N  L  L  N  E  R  L  Y  R  I  K  P  E  A  L  K  F         p.4400

          .         .         .         .         .         .       g.135916
 ACCAGCATTGTTGACACCCCGGAAGTAATCCAGGCAAAGATCAATGCTGTACAGATCAGT       c.13260
 T  S  I  V  D  T  P  E  V  I  Q  A  K  I  N  A  V  Q  I  S         p.4420

     | 87    .         .         .         .         .         .    g.136853
 GAG | CCATTGTATCGCGATGCCTGGGAGAAAGAGAAGGCTAATGTGAACGTGCCAGCTGAC    c.13320
 E   | P  L  Y  R  D  A  W  E  K  E  K  A  N  V  N  V  P  A  D      p.4440

          .         .         .         .         | 88         .    g.137491
 ACTCCCCTGATGCTGCAATCCAAAATCAATGCCCTGCAGATCAGCAAT | AAACGCTATCAG    c.13380
 T  P  L  M  L  Q  S  K  I  N  A  L  Q  I  S  N   | K  R  Y  Q      p.4460

          .         .         .         .         .         .       g.137551
 CAAGCTTGGGAAGATGTCAAGATGACTGGTTATGACCTGCGAGCAGATGCCATTGGGATC       c.13440
 Q  A  W  E  D  V  K  M  T  G  Y  D  L  R  A  D  A  I  G  I         p.4480

          .         .         .       | 89 .         .         .    g.138758
 CAGCATGCCAAGGCTTCCAGGGATATTGCCAGTGAC | TATCTGTACAAAACTGCTTATGAG    c.13500
 Q  H  A  K  A  S  R  D  I  A  S  D   | Y  L  Y  K  T  A  Y  E      p.4500

          .         .         .         .         .         .       g.138818
 AAACAGAAAGGCCATTACATTGGCTGTCGCAGCGCCAAGGAAGACCCTAAACTGGTTTGG       c.13560
 K  Q  K  G  H  Y  I  G  C  R  S  A  K  E  D  P  K  L  V  W         p.4520

          .         .         .         .         .         .       g.138878
 GCAGCAAATGTGTTGAAGATGCAGAATGACAGGCTGTACAAAAAGGCCTACAACGACCAC       c.13620
 A  A  N  V  L  K  M  Q  N  D  R  L  Y  K  K  A  Y  N  D  H         p.4540

          .         .         .         .         .         .       g.138938
 AAAGCCAAGATCTCCATCCCTGTGGACATGGTGTCCATCAGCGCTGCCAAAGAAGGTCAG       c.13680
 K  A  K  I  S  I  P  V  D  M  V  S  I  S  A  A  K  E  G  Q         p.4560

          .         .         .         .         .         .       g.138998
 GCACTGGCAAGTGATGTGGACTATCGCCATTACCTGCACCACTGGTCTTGCTTTCCCGAC       c.13740
 A  L  A  S  D  V  D  Y  R  H  Y  L  H  H  W  S  C  F  P  D         p.4580

          .         .         .         .         | 90         .    g.141369
 CAGAATGATGTGATCCAAGCCAGGAAAGCCTACGACCTACAGAGCGAC | AGTGTGTATAAG    c.13800
 Q  N  D  V  I  Q  A  R  K  A  Y  D  L  Q  S  D   | S  V  Y  K      p.4600
                                                  ^  sequence block 2

          .         .         .         .         .         .       g.141429
 GCAGACCTGGAGTGGCTGCGTGGCATCGGCTGGATGCCAGAAGGCTCAGTGGAAATGAAC       c.13860
 A  D  L  E  W  L  R  G  I  G  W  M  P  E  G  S  V  E  M  N         p.4620

          .         .         .         .         .         .       g.141489
 AGAGTGAAGGTTGCTCAAGACCTCGTGAATGAAAGACTCTATAGGACACGTCCAGAAGCT       c.13920
 R  V  K  V  A  Q  D  L  V  N  E  R  L  Y  R  T  R  P  E  A         p.4640

          .         .         .         .         .         .       g.141549
 TTGTCATTCACCAGCATTGTCGACACTCCAGAAGTTGTCTTGGCAAAAGCCAATTCTCTG       c.13980
 L  S  F  T  S  I  V  D  T  P  E  V  V  L  A  K  A  N  S  L         p.4660

          .   | 91     .         .         .         .         .    g.142473
 CAAATAAGTGAG | AAACTGTATCAGGAAGCCTGGAATAAAGATAAAAGCAACATCACCATT    c.14040
 Q  I  S  E   | K  L  Y  Q  E  A  W  N  K  D  K  S  N  I  T  I      p.4680

          .         .         .         .         .        | 92.    g.143323
 CCTTCTGATACTCCGGAGATGCTGCAGGCCCACATCAATGCCTTGCAAATCAGCAAT | AAA    c.14100
 P  S  D  T  P  E  M  L  Q  A  H  I  N  A  L  Q  I  S  N   | K      p.4700

          .         .         .         .         .         .       g.143383
 CTCTACCAAAAAGACTGGAATGACACCAAGCAGAAAGGCTATGACATAAGGGCAGATGCC       c.14160
 L  Y  Q  K  D  W  N  D  T  K  Q  K  G  Y  D  I  R  A  D  A         p.4720

          .         .         .         .      | 93  .         .    g.145184
 ATTGAAATCAAGCACGCCAAGGCCTCCAGAGAAATTGCCAGTGAG | TACAAATACAAAGAA    c.14220
 I  E  I  K  H  A  K  A  S  R  E  I  A  S  E   | Y  K  Y  K  E      p.4740

          .         .         .         .         .         .       g.145244
 GGTTACCGTAAGCAACTGGGCCACCACATGGGTTTCCGCACCCTACAAGATGACCCCAAG       c.14280
 G  Y  R  K  Q  L  G  H  H  M  G  F  R  T  L  Q  D  D  P  K         p.4760

          .         .         .         .         .         .       g.145304
 TCAGTATGGGCTATACATGCTGCCAAGATCCAGAGTGACAGAGAATATAAGAAAGCTTAT       c.14340
 S  V  W  A  I  H  A  A  K  I  Q  S  D  R  E  Y  K  K  A  Y         p.4780

          .         .         .         .         .         .       g.145364
 GAGAAGTCTAAAGGAATTCACAACACACCGTTGGACATGATGTCAATTGTTCAAGCCAAG       c.14400
 E  K  S  K  G  I  H  N  T  P  L  D  M  M  S  I  V  Q  A  K         p.4800

          .         .         .         .         .         .       g.145424
 AAATGCCAGGTCCTGGTTAGCGACATTGATTATCGCAATTATCTGCACCAGTGGACGTGT       c.14460
 K  C  Q  V  L  V  S  D  I  D  Y  R  N  Y  L  H  Q  W  T  C         p.4820

          .         .         .         .         .        | 94.    g.146271
 CTGCCAGATCAGAACGATGTGATCCAGGCCAAGAAAGCCTACGACCTGCAGAGCGAT | AAC    c.14520
 L  P  D  Q  N  D  V  I  Q  A  K  K  A  Y  D  L  Q  S  D   | N      p.4840

          .         .         .         .         .         .       g.146331
 TTGTACAAGTCAGACCTGGAATGGCTGAAGGGTATTGGATGGTTGCCAGAGGGTTCTGTG       c.14580
 L  Y  K  S  D  L  E  W  L  K  G  I  G  W  L  P  E  G  S  V         p.4860

          .         .         .         .         .         .       g.146391
 GAAGTGATGAGAGTGAAGAATGCCCAGAATCTCTTGAATGAAAGGCTGTATCGTATAAAG       c.14640
 E  V  M  R  V  K  N  A  Q  N  L  L  N  E  R  L  Y  R  I  K         p.4880

          .         .         .         .         .         .       g.146451
 CCTGAGGCCCTCAAATTCACCAGCATTGTTGACACCCCGGAAGTAATCCAGGCAAAGATC       c.14700
 P  E  A  L  K  F  T  S  I  V  D  T  P  E  V  I  Q  A  K  I         p.4900

          .         .  | 95      .         .         .         .    g.147388
 AATGCTGTACAGATCAGTGAG | CCATTGTATCGCAATGCCTGGGAGAAAGAGAAGGCTAAT    c.14760
 N  A  V  Q  I  S  E   | P  L  Y  R  N  A  W  E  K  E  K  A  N      p.4920

          .         .         .         .         .         .       g.147448
 GTGAACGTGCCAGCTGACACTCCCCTGATGCTGCAATCCAAAATCAATGCTCTGCAGATC       c.14820
 V  N  V  P  A  D  T  P  L  M  L  Q  S  K  I  N  A  L  Q  I         p.4940

        | 96 .         .         .         .         .         .    g.148086
 AGCAAT | AAACGCTATCAGCAAGCTTGGGAAGATGTCAAGATGACTGGTTATGACCTGCGA    c.14880
 S  N   | K  R  Y  Q  Q  A  W  E  D  V  K  M  T  G  Y  D  L  R      p.4960

          .         .         .         .         .     | 97   .    g.149293
 GCAGATGCCATTGGGATCCAGCATGCCAAGGCTTCCAGGGATATTGCCAGTGAT | TATCTG    c.14940
 A  D  A  I  G  I  Q  H  A  K  A  S  R  D  I  A  S  D   | Y  L      p.4980

          .         .         .         .         .         .       g.149353
 TACAAAACTGCTTATGAGAAACAGAAAGGCCATTACATTGGCTGTCGCAGCGCCAAGGAA       c.15000
 Y  K  T  A  Y  E  K  Q  K  G  H  Y  I  G  C  R  S  A  K  E         p.5000

          .         .         .         .         .         .       g.149413
 GACCCTAAACTGGTTTGGGCAGCAAATGTGTTGAAGATGCAGAATGACAGGCTGTACAAA       c.15060
 D  P  K  L  V  W  A  A  N  V  L  K  M  Q  N  D  R  L  Y  K         p.5020

          .         .         .         .         .         .       g.149473
 AAGGCCTACAACGACCACAAAGCCAAGATCTCCATCCCTGTGGACATGGTGTCCATCAGC       c.15120
 K  A  Y  N  D  H  K  A  K  I  S  I  P  V  D  M  V  S  I  S         p.5040

          .         .         .         .         .         .       g.149533
 GCTGCCAAAGAAGGTCAGGCACTGGCAAGTGATGTGGACTATCGCCATTACCTGCACCAC       c.15180
 A  A  K  E  G  Q  A  L  A  S  D  V  D  Y  R  H  Y  L  H  H         p.5060

          .         .         .         .         .         .       g.149593
 TGGTCTTGCTTTCCCGACCAGAATGATGTGATCCAAGCCAGGAAAGCCTACGACCTACAG       c.15240
 W  S  C  F  P  D  Q  N  D  V  I  Q  A  R  K  A  Y  D  L  Q         p.5080

        | 98 .         .         .         .         .         .    g.151964
 AGCGAC | AGTGTGTATAAGGCAGACCTGGAGTGGCTGCGTGGCATCGGCTGGATGCCAGAA    c.15300
 S  D   | S  V  Y  K  A  D  L  E  W  L  R  G  I  G  W  M  P  E      p.5100
        ^  sequence block 3

          .         .         .         .         .         .       g.152024
 GGCTCAGTGGAAATGAACAGAGTGAAGGTTGCTCAAGACCTCGTGAATGAAAGACTCTAT       c.15360
 G  S  V  E  M  N  R  V  K  V  A  Q  D  L  V  N  E  R  L  Y         p.5120

          .         .         .         .         .         .       g.152084
 AGGACACGTCCAGAAGCTTTGTCATTCACCAGCATTGTCGACACTCCAGAAGTTGTCTTG       c.15420
 R  T  R  P  E  A  L  S  F  T  S  I  V  D  T  P  E  V  V  L         p.5140

          .         .         . | 99       .         .         .    g.153008
 GCAAAAGCCAATTCTCTGCAAATAAGTGAG | AAACTGTATCAGGAAGCCTGGAATAAAGAT    c.15480
 A  K  A  N  S  L  Q  I  S  E   | K  L  Y  Q  E  A  W  N  K  D      p.5160

          .         .         .         .         .         .       g.153068
 AAAAGCAACATCACCATTCCTTCTGATACTCCGGAGATGCTGCAGGCCCACATCAATGCC       c.15540
 K  S  N  I  T  I  P  S  D  T  P  E  M  L  Q  A  H  I  N  A         p.5180

          .      | 100 .         .         .         .         .    g.153918
 TTGCAAATCAGCAAT | AAACTCTACCAAAAAGACTGGAATGACACCAAGCAGAAAGGCTAT    c.15600
 L  Q  I  S  N   | K  L  Y  Q  K  D  W  N  D  T  K  Q  K  G  Y      p.5200

          .         .         .         .         .         .       g.153978
 GACATAAGGGCAGATGCCATTGAAATCAAGCACGCCAAGGCCTCCAGAGAAATTGCCAGT       c.15660
 D  I  R  A  D  A  I  E  I  K  H  A  K  A  S  R  E  I  A  S         p.5220

     | 101   .         .         .         .         .         .    g.155778
 GAG | TACAAATACAAAGAAGGTTACCGTAAGCAACTGGGCCACCACATGGGTTTCCGCACC    c.15720
 E   | Y  K  Y  K  E  G  Y  R  K  Q  L  G  H  H  M  G  F  R  T      p.5240

          .         .         .         .         .         .       g.155838
 CTACAAGATGACCCCAAGTCAGTATGGGCTATACATGCTGCCAAGATCCAGAGTGACAGA       c.15780
 L  Q  D  D  P  K  S  V  W  A  I  H  A  A  K  I  Q  S  D  R         p.5260

          .         .         .         .         .         .       g.155898
 GAATATAAGAAAGCTTATGAGAAGTCTAAAGGAATTCACAACACACCGTTGGACATGATG       c.15840
 E  Y  K  K  A  Y  E  K  S  K  G  I  H  N  T  P  L  D  M  M         p.5280

          .         .         .         .         .         .       g.155958
 TCAATTGTTCAAGCCAAGAAATGCCAGGTCCTGGTTAGCGACATTGATTATCGCAATTAT       c.15900
 S  I  V  Q  A  K  K  C  Q  V  L  V  S  D  I  D  Y  R  N  Y         p.5300

          .         .         .         .         .         .       g.156018
 CTGCACCAGTGGACGTGTCTGCCAGATCAGAACGATGTGATCCAGGCCAAGAAAGCCTAC       c.15960
 L  H  Q  W  T  C  L  P  D  Q  N  D  V  I  Q  A  K  K  A  Y         p.5320

          .      | 102 .         .         .         .         .    g.156865
 GACCTGCAGAGCGAT | AACTTGTACAAGTCAGACCTGGAATGGCTGAAGGGTATTGGATGG    c.16020
 D  L  Q  S  D   | N  L  Y  K  S  D  L  E  W  L  K  G  I  G  W      p.5340

          .         .         .         .         .         .       g.156925
 TTGCCAGAGGGTTCTGTGGAAGTGATGAGAGTGAAGAATGCCCAGAATCTCTTGAATGAA       c.16080
 L  P  E  G  S  V  E  V  M  R  V  K  N  A  Q  N  L  L  N  E         p.5360

          .         .         .         .         .         .       g.156985
 AGGCTGTATCGTATAAAGCCTGAGGCCCTCAAATTCACCAGCATTGTTGACACCCCGGAA       c.16140
 R  L  Y  R  I  K  P  E  A  L  K  F  T  S  I  V  D  T  P  E         p.5380

          .         .         .          | 103       .         .    g.157921
 GTAATCCAGGCAAAGATCAATGCTGTACAGATCAGTGAG | CCATTGTATCGCGATGCCTGG    c.16200
 V  I  Q  A  K  I  N  A  V  Q  I  S  E   | P  L  Y  R  D  A  W      p.5400

          .         .         .         .         .         .       g.157981
 GAGAAAGAGAAGGCTAATGTGAACGTGCCAGCTGACACTCCCCTGATGCTGCAATCCAAA       c.16260
 E  K  E  K  A  N  V  N  V  P  A  D  T  P  L  M  L  Q  S  K         p.5420

          .         .     | 104  .         .         .         .    g.158619
 ATCAATGCCCTGCAGATCAGCAAT | AAACGCTATCAGCAAGCTTGGGAAGATGTCAAGATG    c.16320
 I  N  A  L  Q  I  S  N   | K  R  Y  Q  Q  A  W  E  D  V  K  M      p.5440

          .         .         .         .         .         .       g.158679
 ACTGGTTATGACCTGCGAGCAGATGCCATTGGGATCCAGCATGCCAAGGCTTCCAGGGAT       c.16380
 T  G  Y  D  L  R  A  D  A  I  G  I  Q  H  A  K  A  S  R  D         p.5460

          .   | 105    .         .         .         .         .    g.159886
 ATTGCCAGTGAC | TATCTGTACAAAACTGCTTATGAGAAACAGAAAGGCCATTACATTGGC    c.16440
 I  A  S  D   | Y  L  Y  K  T  A  Y  E  K  Q  K  G  H  Y  I  G      p.5480

          .         .         .         .         .         .       g.159946
 TGTCGCAGCGCCAAGGAAGACCCTAAACTGGTTTGGGCAGCAAATGTGTTGAAGATGCAG       c.16500
 C  R  S  A  K  E  D  P  K  L  V  W  A  A  N  V  L  K  M  Q         p.5500

          .         .         .         .         .         .       g.160006
 AATGACAGGCTGTACAAAAAGGCCTACAACGACCACAAAGCCAAGATCTCCATCCCTGTG       c.16560
 N  D  R  L  Y  K  K  A  Y  N  D  H  K  A  K  I  S  I  P  V         p.5520

          .         .         .         .         .         .       g.160066
 GACATGGTGTCCATCAGCGCTGCCAAAGAAGGTCAGGCACTGGCAAGTGATGTGGACTAT       c.16620
 D  M  V  S  I  S  A  A  K  E  G  Q  A  L  A  S  D  V  D  Y         p.5540

          .         .         .         .         .         .       g.160126
 CGCCATTACCTGCACCGCTGGTCTTGCTTTCCCGACCAGAATGATGTGATCCAAGCCAGG       c.16680
 R  H  Y  L  H  R  W  S  C  F  P  D  Q  N  D  V  I  Q  A  R         p.5560

          .         .     | 106  .         .         .         .    g.163169
 AAAGCCTACGACCTACAGAGCGAC | GCCCTCTACAAGGCTGACTTGGAGTGGTTGCGTGGC    c.16740
 K  A  Y  D  L  Q  S  D   | A  L  Y  K  A  D  L  E  W  L  R  G      p.5580

          .         .         .         .         .         .       g.163229
 ATTGGCTGGATGCCCCAAGGGTCTCCTGAAGTGTTGAGAGTCAAAAACGCCCAGAATATC       c.16800
 I  G  W  M  P  Q  G  S  P  E  V  L  R  V  K  N  A  Q  N  I         p.5600

          .         .         .         .         .         .       g.163289
 TTTTGTGACAGTGTCTATCGGACGCCTGTGGTGAACCTTAAGTACACAAGCATTGTTGAC       c.16860
 F  C  D  S  V  Y  R  T  P  V  V  N  L  K  Y  T  S  I  V  D         p.5620

          .         .         .         .         | 107        .    g.163700
 ACACCTGAAGTGGTCCTTGCTAAATCAAATGCTGAAAATATTAGTATT | CCAAAGTACAGA    c.16920
 T  P  E  V  V  L  A  K  S  N  A  E  N  I  S  I   | P  K  Y  R      p.5640

          .         .         .         .         .         .       g.163760
 GAGGTTTGGGACAAGGATAAAACTTCAATACACATAATGCCAGATACTCCAGAAATTAAT       c.16980
 E  V  W  D  K  D  K  T  S  I  H  I  M  P  D  T  P  E  I  N         p.5660

          .         .         .    | 108   .         .         .    g.168913
 CTCGCTAGAGCAAATGCTCTTAATGTGAGCAAT | AAACTTTACCGTGAGGGCTGGGATGAA    c.17040
 L  A  R  A  N  A  L  N  V  S  N   | K  L  Y  R  E  G  W  D  E      p.5680

          .         .         .         .         .         .       g.168973
 ATGAAGGCGGGCTGTGATGTCCGGCTGGATGCCATCCCCATCCAGGCTGCCAAGGCCTCC       c.17100
 M  K  A  G  C  D  V  R  L  D  A  I  P  I  Q  A  A  K  A  S         p.5700

          .         | 109        .         .         .         .    g.169137
 AGGGAGATTGCCAGTGAC | TATAAATATAAGCTTGACCATGAGAAGCAGAAGGGACACTAC    c.17160
 R  E  I  A  S  D   | Y  K  Y  K  L  D  H  E  K  Q  K  G  H  Y      p.5720

          .         .         .         .         .         .       g.169197
 GTGGGCACCCTCACAGCCAGGGATGACAACAAGATCCGCTGGGCCCTCATAGCTGACAAG       c.17220
 V  G  T  L  T  A  R  D  D  N  K  I  R  W  A  L  I  A  D  K         p.5740

          .         .         .         .         .         .       g.169257
 CTCCAGAATGAACGAGAGTACCGGCTGGACTGGGCCAAATGGAAGGCCAAGATCCAGAGC       c.17280
 L  Q  N  E  R  E  Y  R  L  D  W  A  K  W  K  A  K  I  Q  S         p.5760

          .         .         .         .         .         .       g.169317
 CCTGTGGACATGCTTTCCATCCTGCACTCTAAAAATTCCCAGGCTCTGGTCAGTGACATG       c.17340
 P  V  D  M  L  S  I  L  H  S  K  N  S  Q  A  L  V  S  D  M         p.5780

          .         .         .         .         .         .       g.169377
 GATTACCGCAATTACCTGCACCAGTGGACCTGCATGCCCGACCAGAACGATGTGATTCAG       c.17400
 D  Y  R  N  Y  L  H  Q  W  T  C  M  P  D  Q  N  D  V  I  Q         p.5800

          .         .         . | 110      .         .         .    g.170145
 GCCAAGAAGGCCTACGAACTGCAGAGCGAT | AATGTTTACAAGGCTGACTTGGAATGGTTG    c.17460
 A  K  K  A  Y  E  L  Q  S  D   | N  V  Y  K  A  D  L  E  W  L      p.5820

          .         .         .         .         .         .       g.170205
 CGTGGAATTGGGTGGATGCCAAATGACTCCGTGTCCGTCAATCATGCCAAACATGCCGCG       c.17520
 R  G  I  G  W  M  P  N  D  S  V  S  V  N  H  A  K  H  A  A         p.5840

          .      | 111 .         .         .         .         .    g.170816
 GACATCTTCAGTGAG | AAAAAATATCGCACAAAAATAGAAACTCTCAACTTTACGCCTGTG    c.17580
 D  I  F  S  E   | K  K  Y  R  T  K  I  E  T  L  N  F  T  P  V      p.5860

          .         .         .         .         .     | 112  .    g.171076
 GATGACAGAGTTGATTATGTGACAGCGAAACAAAGTGGCGAGATCCTCGATGAT | ATTAAA    c.17640
 D  D  R  V  D  Y  V  T  A  K  Q  S  G  E  I  L  D  D   | I  K      p.5880

          .         .         .         .         .         .       g.171136
 TACCGGAAAGACTGGAATGCCACCAAATCAAAGTACACCCTCACAGAAACCCCCCTGCTG       c.17700
 Y  R  K  D  W  N  A  T  K  S  K  Y  T  L  T  E  T  P  L  L         p.5900

          .         .         .       | 113.         .         .    g.171333
 CACACTGCCCAGGAGGCTGCTAGGATACTGGACCAG | TATCTCTACAAGGAAGGCTGGGAG    c.17760
 H  T  A  Q  E  A  A  R  I  L  D  Q   | Y  L  Y  K  E  G  W  E      p.5920

          .         .         .         .         .         .       g.171393
 AGACAAAAAGCCACAGGTTACATTTTGCCTCCAGATGCTGTGCCATTTGTTCATGCCCAT       c.17820
 R  Q  K  A  T  G  Y  I  L  P  P  D  A  V  P  F  V  H  A  H         p.5940

          .         .     | 114  .         .         .         .    g.172044
 CACTGCAATGACGTTCAGAGTGAG | CTGAAATACAAAGCTGAACATGTGAAGCAAAAAGGT    c.17880
 H  C  N  D  V  Q  S  E   | L  K  Y  K  A  E  H  V  K  Q  K  G      p.5960

          .         .         .         .         .         .       g.172104
 CATTATGTTGGTGTCCCGACGATGAGAGATGATCCTAAGCTGGTTTGGTTTGAGCATGCA       c.17940
 H  Y  V  G  V  P  T  M  R  D  D  P  K  L  V  W  F  E  H  A         p.5980

          .         .         .         .         .         .       g.172164
 GGCCAGATTCAGAATGAGAGACTATACAAAGAGGACTATCACAAAACAAAGGCCAAAATC       c.18000
 G  Q  I  Q  N  E  R  L  Y  K  E  D  Y  H  K  T  K  A  K  I         p.6000

          .         .         .         .         .         .       g.172224
 AATATACCTGCTGATATGGTGTCAGTCTTGGCCGCCAAGCAGGGGCAGACCCTTGTCAGT       c.18060
 N  I  P  A  D  M  V  S  V  L  A  A  K  Q  G  Q  T  L  V  S         p.6020

          .         .         .         .         .         .       g.172284
 GATATTGATTATCGTAATTACTTGCACCAATGGATGTGTCATCCTGACCAGAACGATGTT       c.18120
 D  I  D  Y  R  N  Y  L  H  Q  W  M  C  H  P  D  Q  N  D  V         p.6040

          .         .         .       | 115.         .         .    g.173691
 ATTCAGGCAAGAAAGGCCTATGACCTACAGAGTGAT | AATGTCTACAGAGCTGACCTGGAG    c.18180
 I  Q  A  R  K  A  Y  D  L  Q  S  D   | N  V  Y  R  A  D  L  E      p.6060

          .         .         .         .         .         .       g.173751
 TGGCTCCGAGGCATTGGCTGGATCCCACTGGATTCTGTGGACCATGTAAGGGTTACTAAG       c.18240
 W  L  R  G  I  G  W  I  P  L  D  S  V  D  H  V  R  V  T  K         p.6080

          .         .  | 116     .         .         .         .    g.173921
 AACCAGGAAATGATGAGTCAG | ATCAAATATAAGAAAAATGCCCTTGAAAACTATCCTAAC    c.18300
 N  Q  E  M  M  S  Q   | I  K  Y  K  K  N  A  L  E  N  Y  P  N      p.6100

          .         .         .         .         .         .       g.173981
 TTTAGAAGTGTGGTGGATCCTCCAGAGATTGTTTTAGCCAAGATTAATTCTGTCAATCAA       c.18360
 F  R  S  V  V  D  P  P  E  I  V  L  A  K  I  N  S  V  N  Q         p.6120

        | 117.         .         .         .         .         .    g.174393
 AGTGAT | GTAAAATATAAAGAAACATTTAATAAAGCAAAGGGCAAATATACGTTTTCACCA    c.18420
 S  D   | V  K  Y  K  E  T  F  N  K  A  K  G  K  Y  T  F  S  P      p.6140

          .         .         .         .         .  | 118     .    g.175566
 GATACACCACATATCTCCCACTCCAAAGACATGGGAAAACTCTACAGTACT | ATACTGTAT    c.18480
 D  T  P  H  I  S  H  S  K  D  M  G  K  L  Y  S  T   | I  L  Y      p.6160

          .         .         .         .         .         .       g.175626
 AAAGGGGCGTGGGAGGGCACCAAGGCCTATGGCTACACCCTGGATGAGCGCTACATTCCC       c.18540
 K  G  A  W  E  G  T  K  A  Y  G  Y  T  L  D  E  R  Y  I  P         p.6180

          .         .         .          | 119       .         .    g.175789
 ATTGTTGGAGCCAAGCATGCTGATCTGGTGAACAGTGAG | CTTAAATACAAAGAGACATAT    c.18600
 I  V  G  A  K  H  A  D  L  V  N  S  E   | L  K  Y  K  E  T  Y      p.6200

          .         .         .         .         .         .       g.175849
 GAGAAGCAGAAAGGTCACTACCTGGCTGGAAAAGTGATCGGTGAATTCCCTGGTGTGGTT       c.18660
 E  K  Q  K  G  H  Y  L  A  G  K  V  I  G  E  F  P  G  V  V         p.6220

          .         .         .    | 120   .         .         .    g.176706
 CACTGTCTGGATTTCCAAAAGATGAGGAGTGCG | TTGAACTACAGAAAACATTATGAGGAT    c.18720
 H  C  L  D  F  Q  K  M  R  S  A   | L  N  Y  R  K  H  Y  E  D      p.6240

          .         .         .         .         .         .       g.176766
 ACCAAAGCAAATGTTCATATCCCCAATGACATGATGAATCACGTGCTGGCTAAAAGGTGC       c.18780
 T  K  A  N  V  H  I  P  N  D  M  M  N  H  V  L  A  K  R  C         p.6260

          .         .         .         .         .         .       g.176826
 CAGTACATCCTCAGTGACCTGGAGTATCGACACTATTTCCACCAGTGGACGTCTCTTCTG       c.18840
 Q  Y  I  L  S  D  L  E  Y  R  H  Y  F  H  Q  W  T  S  L  L         p.6280

          .         .         .         .         .  | 121     .    g.177282
 GAAGAACCCAATGTTATACGCGTCCGAAACGCCCAGGAGATCTTGAGTGAT | AATGTGTAT    c.18900
 E  E  P  N  V  I  R  V  R  N  A  Q  E  I  L  S  D   | N  V  Y      p.6300

          .         .         .         .         .         .       g.177342
 AAAGATGACCTGAATTGGTTGAAAGGCATTGGTTGCTACGTTTGGGATACACCCCAAATC       c.18960
 K  D  D  L  N  W  L  K  G  I  G  C  Y  V  W  D  T  P  Q  I         p.6320

          .         .         .       | 122.         .         .    g.178199
 CTCCATGCCAAGAAATCATACGACCTTCAGAGTCAG | CTACAATATACAGCAGCAGGTAAA    c.19020
 L  H  A  K  K  S  Y  D  L  Q  S  Q   | L  Q  Y  T  A  A  G  K      p.6340

          .         .         .         .         .         .       g.178259
 GAAAATCTACAAAACTATAATCTGGTCACAGACACGCCCCTCTATGTGACTGCTGTTCAG       c.19080
 E  N  L  Q  N  Y  N  L  V  T  D  T  P  L  Y  V  T  A  V  Q         p.6360

          .         .  | 123     .         .         .         .    g.178418
 AGTGGCATTAATGCCAGTGAG | GTAAAATATAAAGAAAATTATCATCAGATTAAGGACAAA    c.19140
 S  G  I  N  A  S  E   | V  K  Y  K  E  N  Y  H  Q  I  K  D  K      p.6380

          .         .         .         .         .         .       g.178478
 TACACAACAGTTCTAGAAACAGTGGATTATGACAGAACCAGAAACCTGAAGAATCTTTAC       c.19200
 Y  T  T  V  L  E  T  V  D  Y  D  R  T  R  N  L  K  N  L  Y         p.6400

        | 124.         .         .         .         .         .    g.178842
 AGCAGT | AACCTGTACAAGGAGGCCTGGGATAGAGTGAAAGCCACCAGCTACATCCTGCCT    c.19260
 S  S   | N  L  Y  K  E  A  W  D  R  V  K  A  T  S  Y  I  L  P      p.6420

          .         .         .         .         .     | 125  .    g.184449
 TCCAGCACCTTGTCCCTGACACACGCCAAGAACCAGAAGCATCTGGCCAGCCAT | ATCAAA    c.19320
 S  S  T  L  S  L  T  H  A  K  N  Q  K  H  L  A  S  H   | I  K      p.6440

          .         .         .         .         .         .       g.184509
 TATCGGGAAGAATATGAAAAGTTCAAAGCTCTTTATACGTTACCAAGAAGTGTTGACGAT       c.19380
 Y  R  E  E  Y  E  K  F  K  A  L  Y  T  L  P  R  S  V  D  D         p.6460

          .         .         .         .         | 126        .    g.185474
 GATCCGAACACAGCACGGTGCCTCCGAGTTGGCAAGCTTAACATCGAT | CGCCTGTACAGA    c.19440
 D  P  N  T  A  R  C  L  R  V  G  K  L  N  I  D   | R  L  Y  R      p.6480

          .         .         .         .         .         .       g.185534
 TCAGTTTATGAAAAGAACAAGATGAAAATCCACATCGTGCCCGACATGGTAGAGATGGTT       c.19500
 S  V  Y  E  K  N  K  M  K  I  H  I  V  P  D  M  V  E  M  V         p.6500

          .         .         .         .         .         .       g.185594
 ACTGCCAAGGATTCCCAGAAGAAAGTCAGTGAGATTGATTACCGCCTGCGCCTCCACGAA       c.19560
 T  A  K  D  S  Q  K  K  V  S  E  I  D  Y  R  L  R  L  H  E         p.6520

          .         .         .         .         .         .       g.185654
 TGGATTTGCCACCCCGACTTGCAAGTCAATGATCACGTCAGGAAAGTCACAGATCAGATC       c.19620
 W  I  C  H  P  D  L  Q  V  N  D  H  V  R  K  V  T  D  Q  I         p.6540

        | 127.         .         .         .         .         .    g.186039
 AGCGAT | ATTGTATACAAGGATGACCTCAACTGGCTGAAAGGCATTGGTTGCTACGTCTGG    c.19680
 S  D   | I  V  Y  K  D  D  L  N  W  L  K  G  I  G  C  Y  V  W      p.6560

          .         .         .         .         .  | 128     .    g.186720
 GACACTCCTGAAATCCTCCATGCCAAGCATGCTTATGATCTACGTGATGAT | ATCAAGTAT    c.19740
 D  T  P  E  I  L  H  A  K  H  A  Y  D  L  R  D  D   | I  K  Y      p.6580

          .         .         .         .         .         .       g.186780
 AAAGCTCACATGTTGAAAACAAGGAATGACTACAAGCTTGTCACAGATACACCAGTCTAC       c.19800
 K  A  H  M  L  K  T  R  N  D  Y  K  L  V  T  D  T  P  V  Y         p.6600

          .         .         .       | 129.         .         .    g.187666
 GTGCAGGCTGTCAAAAGTGGGAAACAGCTAAGTGAC | GCTGTCTACCACTATGACTATGTG    c.19860
 V  Q  A  V  K  S  G  K  Q  L  S  D   | A  V  Y  H  Y  D  Y  V      p.6620

          .         .         .         .         .         .       g.187726
 CACAGTGTCAGAGGCAAAGTGGCTCCAACTACCAAAACCGTGGATCTGGACCGGGCCCTT       c.19920
 H  S  V  R  G  K  V  A  P  T  T  K  T  V  D  L  D  R  A  L         p.6640

          .         .     | 130  .         .         .         .    g.189783
 CATGCATACAAGCTCCAGAGTTCG | AATCTATACAAAACCAGCCTGCGCACCCTGCCCACT    c.19980
 H  A  Y  K  L  Q  S  S   | N  L  Y  K  T  S  L  R  T  L  P  T      p.6660

          .         .         .         .         .         .       g.189843
 GGATATAGACTTCCAGGTGACACTCCTCACTTCAAACACATCAAGGACACCCGTTACATG       c.20040
 G  Y  R  L  P  G  D  T  P  H  F  K  H  I  K  D  T  R  Y  M         p.6680

           | 131       .         .         .         .         .    g.191123
 AGCAGTTAT | TTCAAGTACAAAGAAGCCTATGAACACACCAAGGCATATGGGTATACACTT    c.20100
 S  S  Y   | F  K  Y  K  E  A  Y  E  H  T  K  A  Y  G  Y  T  L      p.6700

          .         .         .         .         .        | 132    g.191752
 GGCCCCAAAGATGTTCCATTTGTCCACGTCCGGAGAGTCAACAATGTTACCAGCGAG | AGA    c.20160
 G  P  K  D  V  P  F  V  H  V  R  R  V  N  N  V  T  S  E   | R      p.6720

          .         .         .         .         .         .       g.191812
 CTGTATCGGGAATTGTACCACAAACTGAAAGACAAGATCCATACAACTCCCGATACCCCT       c.20220
 L  Y  R  E  L  Y  H  K  L  K  D  K  I  H  T  T  P  D  T  P         p.6740

          .         .         .         .   | 133    .         .    g.191972
 GAGATCCGCCAAGTCAAGAAGACACAAGAGGCTGTCAGTGAG | TTGATCTACAAATCAGAC    c.20280
 E  I  R  Q  V  K  K  T  Q  E  A  V  S  E   | L  I  Y  K  S  D      p.6760

          .         .         .         .         .         .       g.192032
 TTCTTCAAGATGCAGGGCCACATGATCTCTCTGCCATACACACCCCAAGTGATCCATTGC       c.20340
 F  F  K  M  Q  G  H  M  I  S  L  P  Y  T  P  Q  V  I  H  C         p.6780

          .         .        | 134         .         .         .    g.193077
 CGCTATGTGGGAGACATCACCAGTGAT | ATTAAATACAAAGAGGACTTGCAGGTCCTGAAG    c.20400
 R  Y  V  G  D  I  T  S  D   | I  K  Y  K  E  D  L  Q  V  L  K      p.6800

          .         .         .         .         .         .       g.193137
 GGATTTGGCTGCTTCCTGTATGACACTCCTGACATGGTCCGCTCCCGGCACCTGCGGAAG       c.20460
 G  F  G  C  F  L  Y  D  T  P  D  M  V  R  S  R  H  L  R  K         p.6820

        | 135.         .         .         .         .         .    g.193543
 CTCTGG | TCTAATTACCTATACACTGATAAGGCAAGGAAGATGCGAGACAAATACAAAGTG    c.20520
 L  W   | S  N  Y  L  Y  T  D  K  A  R  K  M  R  D  K  Y  K  V      p.6840

          .         .         .         .         .        | 136    g.197939
 GTGCTTGACACTCCAGAATACAGAAAAGTGCAAGAACTGAAGACACATCTGAGTGAG | CTG    c.20580
 V  L  D  T  P  E  Y  R  K  V  Q  E  L  K  T  H  L  S  E   | L      p.6860

          .         .         .         .         .         .       g.197999
 GTCTACAGAGCTGCAGGCAAGAAGCAGAAGTCAATCTTTACTTCAGTTCCTGATACTCCT       c.20640
 V  Y  R  A  A  G  K  K  Q  K  S  I  F  T  S  V  P  D  T  P         p.6880

          .         .         .         .   | 137    .         .    g.198704
 GATCTTTTAAGAGCCAAGCGAGGGCAGAAGCTTCAGAGTCAG | TATCTGTATGTTGAACTT    c.20700
 D  L  L  R  A  K  R  G  Q  K  L  Q  S  Q   | Y  L  Y  V  E  L      p.6900

          .         .         .         .         .         .       g.198764
 GCCACCAAAGAGAGACCCCATCATCACGCTGGAAACCAGACCACAGCCTTGAAGCATGCT       c.20760
 A  T  K  E  R  P  H  H  H  A  G  N  Q  T  T  A  L  K  H  A         p.6920

          .         .        | 138         .         .         .    g.199072
 AAAGACGTGAAGGACATGGTCAGTGAG | AAAAAGTACAAGATTCAATATGAAAAGATGAAA    c.20820
 K  D  V  K  D  M  V  S  E   | K  K  Y  K  I  Q  Y  E  K  M  K      p.6940

          .         .         .         .         .         .       g.199132
 GACAAGTACACTCCGGTTCCAGATACGCCAATCCTCATCAGAGCCAAGAGGGCTTACTGG       c.20880
 D  K  Y  T  P  V  P  D  T  P  I  L  I  R  A  K  R  A  Y  W         p.6960

          .   | 139    .         .         .         .         .    g.201291
 AATGCCAGTGAT | CTACGCTACAAAGAAACATTTCAAAAGACCAAAGGGAAATACCACACG    c.20940
 N  A  S  D   | L  R  Y  K  E  T  F  Q  K  T  K  G  K  Y  H  T      p.6980

          .         .         .         .         .        | 140    g.201514
 GTGAAAGATGCCCTAGACATTGTCTATCATCGCAAAGTCACAGATGACATCAGTAAA | ATA    c.21000
 V  K  D  A  L  D  I  V  Y  H  R  K  V  T  D  D  I  S  K   | I      p.7000

          .         .         .         .         .         .       g.201574
 AAATACAAGGAGAACTACATGAGCCAGTTGGGTATCTGGAGGTCCATTCCTGATCGTCCA       c.21060
 K  Y  K  E  N  Y  M  S  Q  L  G  I  W  R  S  I  P  D  R  P         p.7020

          .         .         .         .   | 141    .         .    g.202269
 GAGCATTTCCACCACCGAGCAGTCACTGACACAGTCAGTGAT | GTAAAATATAAAGAAGAC    c.21120
 E  H  F  H  H  R  A  V  T  D  T  V  S  D   | V  K  Y  K  E  D      p.7040

          .         .         .         .         .         .       g.202329
 TTGACTTGGCTTAAAGGCATTGGTTGCTATGCCTATGATACCCCTGATTTCACTCTGGCT       c.21180
 L  T  W  L  K  G  I  G  C  Y  A  Y  D  T  P  D  F  T  L  A         p.7060

          .         .        | 142         .         .         .    g.203725
 GAAAAGAACAAGACTCTCTACAGCAAG | TATAAGTATAAAGAAGTATTTGAAAGGACAAAG    c.21240
 E  K  N  K  T  L  Y  S  K   | Y  K  Y  K  E  V  F  E  R  T  K      p.7080

          .         .         .         .         .         .       g.203785
 TCAGATTTCAAGTATGTTGCCGACTCTCCGATCAATAGGCATTTCAAGTATGCAACTCAA       c.21300
 S  D  F  K  Y  V  A  D  S  P  I  N  R  H  F  K  Y  A  T  Q         p.7100

          .   | 143    .         .         .         .         .    g.205216
 TTGATGAATGAG | AAAAAATACAGAGCTGATTATGAGCAGCGGAAAGATAAATACCACCTG    c.21360
 L  M  N  E   | K  K  Y  R  A  D  Y  E  Q  R  K  D  K  Y  H  L      p.7120
              ^  alternative splicing; exon 143 or exon 144 is present

          .         .         .         .         .        | 144    g.205944
 GTAGTCGATGAGCCTAGACATCTGCTGGCTAAGACCGCAGGCGACCAGATCAGTCAG | AGA    c.21420
 V  V  D  E  P  R  H  L  L  A  K  T  A  G  D  Q  I  S  Q   | R      p.7140

          .         .         .         .         .         .       g.206004
 AAATATAAATCTAGTGCCAAGATGTTTCTGCAACATGGATGTAATGAAATTCTGCGTCCA       c.21480
 K  Y  K  S  S  A  K  M  F  L  Q  H  G  C  N  E  I  L  R  P         p.7160

          .         .         .         .   | 145    .         .    g.207609
 GATATGTTGACTGCTCTCTACAATTCGCATATGTGGAGCCAG | ATCAAATACAGGAAAAAC    c.21540
 D  M  L  T  A  L  Y  N  S  H  M  W  S  Q   | I  K  Y  R  K  N      p.7180

          .         .         .         .         .         .       g.207669
 TATGAAAAATCAAAGGACAAATTTACCTCAATTGTGGATACTCCAGAACACCTGCGTACT       c.21600
 Y  E  K  S  K  D  K  F  T  S  I  V  D  T  P  E  H  L  R  T         p.7200

          .         .        | 146         .         .         .    g.208419
 ACAAAAGTCAACAAACAAATCAGCGAT | ATCCTTTATAAATTGGAATACAACAAGGCCAAA    c.21660
 T  K  V  N  K  Q  I  S  D   | I  L  Y  K  L  E  Y  N  K  A  K      p.7220

          .         .         .         .         .         .       g.208479
 CCCAGAGGCTACACCACAATCCACGACACACCCATGTTGCTGCATGTCCGCAAGGTTAAA       c.21720
 P  R  G  Y  T  T  I  H  D  T  P  M  L  L  H  V  R  K  V  K         p.7240

          .      | 147 .         .         .         .         .    g.210218
 GATGAAGTCAGTGAT | CTGAAATACAAAGAAGTATACCAAAGAAATAAATCCAACTGCACC    c.21780
 D  E  V  S  D   | L  K  Y  K  E  V  Y  Q  R  N  K  S  N  C  T      p.7260

          .         .         .         .         .         .       g.210278
 ATTGAGCCAGATGCTGTTCATATCAAAGCAGCCAAGGACGCCTACAAAGTCAACACCAAT       c.21840
 I  E  P  D  A  V  H  I  K  A  A  K  D  A  Y  K  V  N  T  N         p.7280

  | 148      .         .         .         .         .         .    g.211962
  | CTGGACTATAAGAAACAGTACGAAGCCAACAAAGCCCACTGGAAGTGGACTCCTGACCGA    c.21900
  | L  D  Y  K  K  Q  Y  E  A  N  K  A  H  W  K  W  T  P  D  R      p.7300

          .         .         .         .      | 149 .         .    g.212480
 CCGGACTTCCTCCAGGCTGCCAAGTCATCCCTGCAGCAAAGCGAT | TTTGAATATAAGCTG    c.21960
 P  D  F  L  Q  A  A  K  S  S  L  Q  Q  S  D   | F  E  Y  K  L      p.7320

          .         .         .         .         .         .       g.212540
 GACCGGGAGTTCCTCAAGGGTTGCAAGCTTTCTGTCACTGATGACAAAAACACGGTGCTC       c.22020
 D  R  E  F  L  K  G  C  K  L  S  V  T  D  D  K  N  T  V  L         p.7340

          .         .         . | 150      .         .         .    g.213255
 GCCCTCAGGAATACTTTAATAGAAAGTGAT | CTGAAATACAAAGAGAAACATGTCAAGGAA    c.22080
 A  L  R  N  T  L  I  E  S  D   | L  K  Y  K  E  K  H  V  K  E      p.7360

          .         .         .         .         .         .       g.213315
 AGAGGAACCTGCCATGCCGTACCTGACACGCCTCAGATCCTGCTGGCGAAGACTGTCAGC       c.22140
 R  G  T  C  H  A  V  P  D  T  P  Q  I  L  L  A  K  T  V  S         p.7380

          .      | 151 .         .         .         .         .    g.213464
 AACCTGGTGTCTGAG | AACAAGTACAAGGACCATGTCAAGAAGCACTTGGCACAGGGCTCA    c.22200
 N  L  V  S  E   | N  K  Y  K  D  H  V  K  K  H  L  A  Q  G  S      p.7400

          .         .         .         .         .         .       g.213524
 TACACAACACTACCAGAGACCCGGGACACTGTTCACGTCAAGGAAGTGACCAAGCATGTC       c.22260
 Y  T  T  L  P  E  T  R  D  T  V  H  V  K  E  V  T  K  H  V         p.7420

        | 152.         .         .         .         .         .    g.214268
 AGTGAT | ACAAATTACAAAAAGAAGTTTGTCAAGGAGAAAGGAAAATCCAACTACTCCATC    c.22320
 S  D   | T  N  Y  K  K  K  F  V  K  E  K  G  K  S  N  Y  S  I      p.7440

          .         .         .         .         .        | 153    g.214874
 ATGCTGGAGCCACCAGAGGTGAAACATGCTATGGAAGTGGCCAAGAAGCAAAGTGAT | GTC    c.22380
 M  L  E  P  P  E  V  K  H  A  M  E  V  A  K  K  Q  S  D   | V      p.7460

          .         .         .         .         .         .       g.214934
 GCTTACAGAAAAGATGCCAAAGAGAACCTGCATTACACCACAGTGGCTGATCGACCAGAC       c.22440
 A  Y  R  K  D  A  K  E  N  L  H  Y  T  T  V  A  D  R  P  D         p.7480

          .         .         .          | 154       .         .    g.215093
 ATCAAGAAGGCCACACAGGCAGCCAAACAGGCCAGTGAG | GTGGAGTACAGAGCCAAGCAC    c.22500
 I  K  K  A  T  Q  A  A  K  Q  A  S  E   | V  E  Y  R  A  K  H      p.7500

          .         .         .         .         .         .       g.215153
 CGCAAGGAAGGCAGCCATGGCTTAAGCATGCTCGGTCGCCCAGACATAGAAATGGCCAAG       c.22560
 R  K  E  G  S  H  G  L  S  M  L  G  R  P  D  I  E  M  A  K         p.7520

          .         .     | 155  .         .         .         .    g.219755
 AAGGCAGCCAAGCTGAGCAGCCAG | GTTAAATACCGAGAAAATTTCGATAAAGAAAAGGGC    c.22620
 K  A  A  K  L  S  S  Q   | V  K  Y  R  E  N  F  D  K  E  K  G      p.7540

          .         .         .         .         .         .       g.219815
 AAGACACCAAAATACAATCCAAAAGACAGCCAGCTCTACAAAGTCATGAAAGATGCTAAT       c.22680
 K  T  P  K  Y  N  P  K  D  S  Q  L  Y  K  V  M  K  D  A  N         p.7560

          .      | 156 .         .         .         .         .    g.220463
 AATCTTGCAAGTGAG | GTTAAATACAAGGCTGACCTGAAGAAACTTCACAAACCCGTGACT    c.22740
 N  L  A  S  E   | V  K  Y  K  A  D  L  K  K  L  H  K  P  V  T      p.7580

          .         .         .         .         .         .       g.220523
 GACATGAAGGAGTCTCTGATCATGAATCATGTCCTGAATACAAGCCAACTTGCCAGTTCT       c.22800
 D  M  K  E  S  L  I  M  N  H  V  L  N  T  S  Q  L  A  S  S         p.7600

  | 157      .         .         .         .         .         .    g.221125
  | TACCAGTACAAGAAGAAGTATGAGAAGAGTAAAGGCCACTACCACACCATACCCGATAAT    c.22860
  | Y  Q  Y  K  K  K  Y  E  K  S  K  G  H  Y  H  T  I  P  D  N      p.7620

          .         .         .         .      | 158 .         .    g.222939
 CTGGAGCAGCTTCACCTAAAAGAGGCCACAGAATTACAGAGTATA | GTGAAATACAAAGAA    c.22920
 L  E  Q  L  H  L  K  E  A  T  E  L  Q  S  I   | V  K  Y  K  E      p.7640

          .         .         .         .         .         .       g.222999
 AAGTATGAAAAGGAACGAGGAAAACCCATGCTGGACTTTGAAACACCAACGTACATCACT       c.22980
 K  Y  E  K  E  R  G  K  P  M  L  D  F  E  T  P  T  Y  I  T         p.7660

          .         .         . | 159      .         .         .    g.224589
 GCCAAAGAGTCTCAGCAGATGCAGAGTGGG | AAAGAATATAGGAAAGATTATGAAGAGTCC    c.23040
 A  K  E  S  Q  Q  M  Q  S  G   | K  E  Y  R  K  D  Y  E  E  S      p.7680

          .         .         .         .         .         .       g.224649
 ATTAAAGGCAGAAACCTGACTGGCCTGGAGGTCACGCCAGCTTTGTTACATGTCAAATAT       c.23100
 I  K  G  R  N  L  T  G  L  E  V  T  P  A  L  L  H  V  K  Y         p.7700

          .         .  | 160     .         .         .         .    g.225098
 GCAACTAAAATAGCAAGCGAG | AAAGAGTACAGGAAAGATCTAGAGGAAAGCATCCGTGGG    c.23160
 A  T  K  I  A  S  E   | K  E  Y  R  K  D  L  E  E  S  I  R  G      p.7720

          .         .         .         .         .         .       g.225158
 AAGGGCCTCACTGAAATGGAAGATACACCTGACATGCTAAGAGCAAAGAATGCCACTCAA       c.23220
 K  G  L  T  E  M  E  D  T  P  D  M  L  R  A  K  N  A  T  Q         p.7740

          .   | 161    .         .         .         .         .    g.225842
 ATCCTCAATGAG | AAAGAATATAAGCGAGACCTGGAACTGGAAGTCAAAGGAAGAGGCCTG    c.23280
 I  L  N  E   | K  E  Y  K  R  D  L  E  L  E  V  K  G  R  G  L      p.7760

          .         .         .         .         .         .       g.225902
 AATGCCATGGCCAATGAAACTCCGGATTTTATGAGGGCCAGGAATGCTACTGATATTGCC       c.23340
 N  A  M  A  N  E  T  P  D  F  M  R  A  R  N  A  T  D  I  A         p.7780

        | 162.         .         .         .         .         .    g.226704
 AGTCAG | ATTAAGTATAAGCAATCAGCAGAAATGGAGAAAGCCAATTTCACTTCTGTGGTT    c.23400
 S  Q   | I  K  Y  K  Q  S  A  E  M  E  K  A  N  F  T  S  V  V      p.7800

          .         .         .         .         .  | 163     .    g.231387
 GATACTCCAGAGATCATTCATGCCCAACAAGTCAAGAATCTTTCAAGCCAG | AAAAAGTAC    c.23460
 D  T  P  E  I  I  H  A  Q  Q  V  K  N  L  S  S  Q   | K  K  Y      p.7820

          .         .         .         .         .         .       g.231447
 AAGGAAGATGCTGAGAAGTCCATGTCGTATTATGAGACTGTTTTGGACACCCCAGAGATA       c.23520
 K  E  D  A  E  K  S  M  S  Y  Y  E  T  V  L  D  T  P  E  I         p.7840

          .         .         .       | 164.         .         .    g.232498
 CAGAGAGTCCGGGAGAACCAAAAGAACTTCAGCCTT | CTCCAATACCAGTGTGACCTTAAA    c.23580
 Q  R  V  R  E  N  Q  K  N  F  S  L   | L  Q  Y  Q  C  D  L  K      p.7860

          .         .         .         .         .         .       g.232558
 AACAGTAAAGGAAAAATTACAGTTGTTCAAGACACGCCAGAAATACTGCGTGTAAAAGAA       c.23640
 N  S  K  G  K  I  T  V  V  Q  D  T  P  E  I  L  R  V  K  E         p.7880

          .         .  | 165     .         .         .         .    g.233268
 AATCAGAAGAATTTCAGCTCG | GTTTTATATAAAGAGGATGTCTCACCAGGAACGGCTATC    c.23700
 N  Q  K  N  F  S  S   | V  L  Y  K  E  D  V  S  P  G  T  A  I      p.7900

          .         .         .         .         .     | 166  .    g.233923
 GGAAAGACACCTGAGATGATGAGAGTGAAACAAACACAGGACCACATTAGCTCG | GTGAAG    c.23760
 G  K  T  P  E  M  M  R  V  K  Q  T  Q  D  H  I  S  S   | V  K      p.7920

          .         .         .         .         .         .       g.233983
 TATAAGGAAGCAATAGGACAAGGAACTCCAATCCCTGACCTGCCTGAAGTGAAACGTGTG       c.23820
 Y  K  E  A  I  G  Q  G  T  P  I  P  D  L  P  E  V  K  R  V         p.7940

          .         .        | 167         .         .         .    g.236079
 AAGGAGACGCAGAAGCACATTAGCTCG | GTTATGTACAAAGAAAACTTGGGAACAGGCATT    c.23880
 K  E  T  Q  K  H  I  S  S   | V  M  Y  K  E  N  L  G  T  G  I      p.7960

          .         .         .         .         .         .       g.236139
 CCAACCACTGTGACTCCAGAGATTGAGAGAGTCAAACGCAATCAAGAGAACTTTAGCTCG       c.23940
 P  T  T  V  T  P  E  I  E  R  V  K  R  N  Q  E  N  F  S  S         p.7980

  | 168      .         .         .         .         .         .    g.236662
  | GTTTTGTACAAAGAAAATTTGGGGAAAGGAATCCCAACACCTATCACTCCAGAGATGGAG    c.24000
  | V  L  Y  K  E  N  L  G  K  G  I  P  T  P  I  T  P  E  M  E      p.8000

          .         .         .    | 169   .         .         .    g.238031
 AGAGTCAAACGCAATCAAGAGAACTTTAGCTCG | ATATTGTACAAAGAGAACTTGAGCAAG    c.24060
 R  V  K  R  N  Q  E  N  F  S  S   | I  L  Y  K  E  N  L  S  K      p.8020

          .         .         .         .         .         .       g.238091
 GGGACTCCCCTACCTGTCACTCCTGAGATGGAGCGAGTCAAACTCAATCAAGAAAACTTT       c.24120
 G  T  P  L  P  V  T  P  E  M  E  R  V  K  L  N  Q  E  N  F         p.8040

        | 170.         .         .         .         .         .    g.240151
 AGCTCG | GTGTTGTATAAAGAAAACGTTGGAAAAGGGATTCCAATCCCCATCACTCCAGAG    c.24180
 S  S   | V  L  Y  K  E  N  V  G  K  G  I  P  I  P  I  T  P  E      p.8060

          .         .         .          | 171       .         .    g.241156
 ATGGAGAGAGTCAAACACAATCAAGAAAACTTTAGTTCG | GTGCTATACAAAGAAAACCTG    c.24240
 M  E  R  V  K  H  N  Q  E  N  F  S  S   | V  L  Y  K  E  N  L      p.8080

          .         .         .         .         .         .       g.241216
 GGGACAGGAATTCCAATCCCCATCACTCCTGAGATGCAGAGAGTCAAACACAATCAAGAA       c.24300
 G  T  G  I  P  I  P  I  T  P  E  M  Q  R  V  K  H  N  Q  E         p.8100

          .   | 172    .         .         .         .         .    g.241817
 AACCTTAGCTCG | GTGTTATACAAAGAAAACATGGGCAAGGGAACCCCTTTACCTGTCACT    c.24360
 N  L  S  S   | V  L  Y  K  E  N  M  G  K  G  T  P  L  P  V  T      p.8120

          .         .         .         .      | 173 .         .    g.242469
 CCCGAGATGGAAAGAGTCAAACACAATCAAGAAAATATTAGCTCG | GTGTTATACAAAGAA    c.24420
 P  E  M  E  R  V  K  H  N  Q  E  N  I  S  S   | V  L  Y  K  E      p.8140

          .         .         .         .         .         .       g.242529
 AACATGGGCAAGGGAACCCCTCTACCTGTCACTCCTGAGATGGAGAGAGTCAAACACAAT       c.24480
 N  M  G  K  G  T  P  L  P  V  T  P  E  M  E  R  V  K  H  N         p.8160

          .         | 174        .         .         .         .    g.243161
 CAAGAAAATATTAGCTCG | GTGTTATACAAAGAAAACATGGGCAAGGGAACTCCTTTAGCT    c.24540
 Q  E  N  I  S  S   | V  L  Y  K  E  N  M  G  K  G  T  P  L  A      p.8180

          .         .         .         .         .  | 175     .    g.245243
 GTCACTCCCGAGATGGAGCGAGTCAAACACAATCAAGAAAATATTAGCTCG | GTTTTGTAC    c.24600
 V  T  P  E  M  E  R  V  K  H  N  Q  E  N  I  S  S   | V  L  Y      p.8200

          .         .         .         .         .         .       g.245303
 AAAGAAAATGTGGGGAAAGCCACCGCAACCCCTGTCACTCCTGAGATGCAGAGAGTCAAA       c.24660
 K  E  N  V  G  K  A  T  A  T  P  V  T  P  E  M  Q  R  V  K         p.8220

          .         .     | 176  .         .         .         .    g.245656
 CGCAATCAAGAAAACATTAGCTCG | GTGTTATACAAAGAGAACCTGGGGAAAGCAACCCCC    c.24720
 R  N  Q  E  N  I  S  S   | V  L  Y  K  E  N  L  G  K  A  T  P      p.8240

          .         .         .         .         .        | 177    g.246045
 ACACCCTTTACTCCTGAGATGGAAAGAGTGAAACGCAATCAAGAAAACTTTAGCTCG | GTA    c.24780
 T  P  F  T  P  E  M  E  R  V  K  R  N  Q  E  N  F  S  S   | V      p.8260

          .         .         .         .         .         .       g.246105
 TTGTACAAAGAGAACATGAGAAAAGCAACTCCGACACCTGTTACTCCAGAGATGGAGAGA       c.24840
 L  Y  K  E  N  M  R  K  A  T  P  T  P  V  T  P  E  M  E  R         p.8280

          .         .         . | 178      .         .         .    g.247023
 GCTAAGCGCAACCAAGAAAACATTAGCTCG | GTTCTTTATTCTGATAGTTTCCGGAAACAA    c.24900
 A  K  R  N  Q  E  N  I  S  S   | V  L  Y  S  D  S  F  R  K  Q      p.8300

          .         .         .         .         .         .       g.247083
 ATACAAGGCAAAGCTGCCTATGTATTGGATACCCCCGAGATGAGACGGGTGAGGGAGACC       c.24960
 I  Q  G  K  A  A  Y  V  L  D  T  P  E  M  R  R  V  R  E  T         p.8320

          .         | 179        .         .         .         .    g.247248
 CAACGGCACATCTCAACG | GTGAAATATCATGAAGACTTTGAGAAACACAAGGGTTGCTTC    c.25020
 Q  R  H  I  S  T   | V  K  Y  H  E  D  F  E  K  H  K  G  C  F      p.8340

          .         .         .         .         .         .       g.247308
 ACACCAGTGGTGACAGATCCTATCACTGAACGAGTAAAGAAGAACATGCAGGACTTCAGT       c.25080
 T  P  V  V  T  D  P  I  T  E  R  V  K  K  N  M  Q  D  F  S         p.8360

          .         .         .         .         .         .       g.247368
 GACATTAACTACCGAGGTATTCAGAGGAAAGTGGTAGAAATGGAACAAAAACGGAATGAC       c.25140
 D  I  N  Y  R  G  I  Q  R  K  V  V  E  M  E  Q  K  R  N  D         p.8380

          .         .   | 180    .         .         .         .    g.247750
 CAAGATCAGGAGACTATTACAG | GTTTACGTGTCTGGCGTACTAATCCTGGTTCGGTTTTT    c.25200
 Q  D  Q  E  T  I  T  G |   L  R  V  W  R  T  N  P  G  S  V  F      p.8400

          .         .         .         .         .      | 181 .    g.248974
 GACTATGATCCAGCAGAAGACAACATCCAGTCCCGAAGCTTACACATGATTAATG | TCCAA    c.25260
 D  Y  D  P  A  E  D  N  I  Q  S  R  S  L  H  M  I  N  V |   Q      p.8420

          .         .         .         .         .         .       g.249034
 GCTCAGCGCCGGAGCCGGGAGCAGTCACGATCTGCCAGTGCACTAAGCATCAGTGGGGGT       c.25320
 A  Q  R  R  S  R  E  Q  S  R  S  A  S  A  L  S  I  S  G  G         p.8440

          .         .         .         .         .         .       g.249094
 GAGGAGAAGTCTGAGCATTCAGAAGCACCAGACCACCACCTTTCGACTTACAGCGACGGG       c.25380
 E  E  K  S  E  H  S  E  A  P  D  H  H  L  S  T  Y  S  D  G         p.8460

          .         .   | 182    .         .         .         .    g.249448
 GGTGTCTTTGCAGTCTCAACAG | CTTACAAACATGCAAAAACCACAGAGCTCCCACAACAA    c.25440
 G  V  F  A  V  S  T  A |   Y  K  H  A  K  T  T  E  L  P  Q  Q      p.8480

          .         .         .         .         .         .       g.249508
 CGATCATCTTCAGTTGCTACCCAACAGACAACGGTATCTTCCATCCCATCTCATCCATCT       c.25500
 R  S  S  S  V  A  T  Q  Q  T  T  V  S  S  I  P  S  H  P  S         p.8500

           | 183       .         .         .         .         .    g.253605
 ACTGCTGGA | AAAATCTTCCGTGCCATGTATGACTATATGGCTGCTGATGCAGATGAGGTG    c.25560
 T  A  G   | K  I  F  R  A  M  Y  D  Y  M  A  A  D  A  D  E  V      p.8520

          .         .         .         .         .         .       g.253665
 TCCTTCAAGGATGGAGATGCCATCATAAATGTTCAAGCAATTGATGAAGGCTGGATGTAT       c.25620
 S  F  K  D  G  D  A  I  I  N  V  Q  A  I  D  E  G  W  M  Y         p.8540

          .         .         .         .         .         .       g.253725
 GGCACTGTGCAGAGGACTGGCAGGACCGGAATGCTCCCAGCCAACTACGTTGAAGCTATT       c.25680
 G  T  V  Q  R  T  G  R  T  G  M  L  P  A  N  Y  V  E  A  I         p.8560

                                                                    g.253728
 TAG                                                                c.25683
 X                                                                  p.8560

          .         .         .         .         .         .       g.253788
 gcatttcaaagcatcacacttgtctgcaggacttacagatcctgcagtcaatgtttcggt       c.*60

          .         .         .         .         .         .       g.253848
 ttagactctccactgttacctaagttctcaagctgcctatggtttttctgtgtcaatgtg       c.*120

          .         .         .         .         .         .       g.253908
 atttatggtagtaccatcctttctcctttgggttttaaaataagttgcagaacagacact       c.*180

          .         .         .         .         .         .       g.253968
 ttaaaagcttctgcaatattatttctgtgcctagagtctttctccattataaacatgttt       c.*240

          .         .         .         .         .         .       g.254028
 taacattatttcttttctaaaacagggattttgaatatgccaaacacattaaaggaaaaa       c.*300

          .         .         .         .         .         .       g.254088
 tagcagagatgttcaccttttccttgctgattgctaatgcttattatttctaattcagtt       c.*360

          .         .         .         .         .         .       g.254148
 ctgaagttataaacttataatcaatacaaaccagcaactaataaaacctctaattctgca       c.*420

                                                                    g.254149
 a                                                                  c.*421

 (downstream sequence)

Legend:
Nucleotide numbering (following the rules of the HGVS for a 'Coding DNA Reference Sequence') is indicated at the right of the sequence, counting the A of the ATG translation initiating Methionine as 1. Every 10th nucleotide is indicated by a "." above the sequence. The Nebulin protein sequence is shown below the coding DNA sequence, with numbering indicated at the right starting with 1 for the translation initiating Methionine. Every 10th amino acid is shown in bold. The position of introns is indicated by a vertical line, splitting the two exons. The start of the first exon (transcription initiation site) is indicated by a '\', the end of the last exon (poly-A addition site) by a '/'. The exon number is indicated above the first nucleotide(s) of the exon. To aid the description of frame shift mutations, all stop codons in the +1 frame are shown in bold while all stop codons in the +2 frame are underlined.


Powered by LOVD v.2.0 Build 29
©2004-2010 Leiden University Medical Center