(used for mutation description)
(last modified February 11, 2011)
This file was created to facilitate the description of sequence variants in the SEPN1
gene based on a coding DNA reference sequence, following the HGVS recommendations. The sequence was taken
from NG_009930.1, covering SEPN1 transcript NM_020451.2.
Exon 3 can be differentially spliced (see transcript NM_206926.1).
NOTE: SEPN1 is a selenoprotein, meaning that selenium is
incorporated as selenocysteine (U or Sec)
at a UGA codon, which normally acts as a termination codon (here *127
and *462, shown as U in the translation). The recognition of UGA
as a selenocysteine codon requires a secondary structure called SECIS (selenocysteine
insertion sequence) that is located in the 3' UTR of the transcript.
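As an illustration of the recoding described in the note, the following Python sketch translates a coding DNA fragment and writes U instead of * for TGA codons at the listed selenocysteine positions. The function name translate and its parameters sec_codons and first_codon are hypothetical names chosen for this example; they are not part of LOVD or of any HGVS tool. The codon table is the standard genetic code, and the test fragment is c.361 to c.387 copied from the listing.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, sec_codons=(127, 462), first_codon=1):
    """Translate coding DNA; TGA at a codon number in sec_codons becomes U.

    first_codon is the protein position of the first codon in cds, so that
    fragments of the reference sequence can be translated in place.
    (Hypothetical helper for illustration only.)
    """
    seq = cds.upper()
    protein = []
    # Ignore any incomplete trailing codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        number = first_codon + i // 3
        if codon == "TGA" and number in sec_codons:
            protein.append("U")  # selenocysteine, translation continues
            continue
        aa = CODON_TABLE[codon]
        protein.append(aa)
        if aa == "*":  # ordinary termination codon: stop translating
            break
    return "".join(protein)


# Fragment c.361_387 (codons 121 to 129), which contains the first UGA.
fragment = "CAACTTCCCTGGCTCAATTGATCCTCC"
print(translate(fragment, first_codon=121))                 # QLPWLNUSS
print(translate(fragment, sec_codons=(), first_codon=121))  # QLPWLN*
```

Without the recoding positions, translation ends at codon 127, which is why a naive translation of this transcript yields a truncated protein.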
(upstream sequence)
. . . . . g.5055
ccccgccccgctctttcgcttcccgggccgccggcagccgccgccagccgcagcc c.-1
. . . . . . g.5115
ATGGGCCGGGCCCGGCCGGGCCAACGCGGGCCGCCCAGCCCCGGCCCCGCCGCGCAGCCT c.60
M G R A R P G Q R G P P S P G P A A Q P p.20
. . . . . . g.5175
CCCGCGCCACCGCGCCGCCGCGCCCGTTCCCTGGCGCTGCTCGGAGCCCTGCTGGCCGCC c.120
P A P P R R R A R S L A L L G A L L A A p.40
. . . . . . g.5235
GCCGCTGCCGCCGCCGTCCGGGTCTGCGCCCGCCACGCCGAGGCCCAGGCGGCCGCGCGG c.180
A A A A A V R V C A R H A E A Q A A A R p.60
| 02 . . . . . . g.5924
CAG | GAACTGGCGCTGAAGACCCTGGGGACAGATGGCCTTTTTCTCTTTTCCTCCTTGGAC c.240
Q | E L A L K T L G T D G L F L F S S L D p.80
. . . . . . g.5984
ACTGACGGGGATATGTACATCAGCCCTGAGGAGTTCAAACCCATTGCTGAGAAGCTAACA c.300
T D G D M Y I S P E E F K P I A E K L T p.100
| 03 . . . . . . g.6899
G | GGTCTTGTTCTGTCACCCAGACTGGAGTGCAGTGGTGCAGTCACAGCTCACTGCAGCCT c.360
G | S C S V T Q T G V Q W C S H S S L Q P p.120
^ differentially spliced exon
. . . . | 04 . . g.9983
CAACTTCCCTGGCTCAATTGATCCTCCTGCCTCAGCCTCCTGA | GGTCAACTCCCGCGGCC c.420
Q L P W L N U S S C L S L L R | S T P A A p.140
. . . . . . g.10043
AGCTGCGAGGAGGAGGAGTTGCCCCCTGACCCTAGCGAGGAGACGCTCACCATAGAAGCC c.480
S C E E E E L P P D P S E E T L T I E A p.160
. . . . . | 05. g.13407
CGATTCCAGCCTCTGCTCCCGGAGACCATGACCAAGAGCAAAGATGGCTTCCTAGGG | GTC c.540
R F Q P L L P E T M T K S K D G F L G | V p.180
. . . . . . g.13467
TCCCGCCTCGCCCTGTCCGGCCTCCGAAACTGGACAGCCGCCGCCTCACCAAGTGCAGTG c.600
S R L A L S G L R N W T A A A S P S A V p.200
. . . . . . g.13527
TTTGCCACCCGCCACTTCCAGCCCTTCCTTCCCCCGCCAGGCCAGGAGCTGGGTGAGCCC c.660
F A T R H F Q P F L P P P G Q E L G E P p.220
. . . . . . g.13587
TGGTGGATCATCCCCAGTGAGCTGAGCATGTTCACTGGCTACCTGTCCAACAACCGCTTC c.720
W W I I P S E L S M F T G Y L S N N R F p.240
. . | 06. . . . g.13883
TATCCACCGCCGCCCAAGGGCAAGGAG | GTCATCATCCACCGGCTCCTGAGCATGTTCCAC c.780
Y P P P P K G K E | V I I H R L L S M F H p.260
. . . . . . g.13943
CCTCGGCCCTTTGTGAAGACCCGCTTTGCCCCTCAGGGAGCTGTGGCCTGCCTGACTGCC c.840
P R P F V K T R F A P Q G A V A C L T A p.280
. . . | 07 . . . g.14535
ATCAGCGACTTCTACTACACTGTGATGTTCCG | GATCCATGCCGAGTTCCAGCTCAGTGAG c.900
I S D F Y Y T V M F R | I H A E F Q L S E p.300
. . . . . . g.14595
CCGCCCGACTTCCCCTTTTGGTTCTCCCCTGCTCAGTTCACCGGCCACATCATCCTCTCC c.960
P P D F P F W F S P A Q F T G H I I L S p.320
. . . . . | 08 . g.16288
AAAGACGCCACCCACGTCCGCGACTTCCGGCTCTTCGTGCCCAACCACAG | GTCTCTGAAT c.1020
K D A T H V R D F R L F V P N H R | S L N p.340
. . . . . . g.16348
GTGGACATGGAGTGGCTTTACGGGGCCAGTGAAAGCAGCAACATGGAGGTGGACATCGGC c.1080
V D M E W L Y G A S E S S N M E V D I G p.360
. | 09 . . . . . g.16563
TACATACCCCAG | ATGGAGCTGGAGGCCACGGGCCCCTCTGTGCCCTCCGTGATCCTGGAT c.1140
Y I P Q | M E L E A T G P S V P S V I L D p.380
. . . . . . g.16623
GAGGATGGCAGCATGATCGACAGCCACCTGCCTTCAGGGGAGCCCCTGCAGTTTGTGTTT c.1200
E D G S M I D S H L P S G E P L Q F V F p.400
. . . . . . g.16683
GAGGAGATCAAGTGGCAGCAGGAGCTGAGCTGGGAGGAGGCTGCCCGGCGCCTGGAGGTG c.1260
E E I K W Q Q E L S W E E A A R R L E V p.420
. . | 10 . . . . g.17550
GCCATGTACCCCTTCAAGAAG | GTCTCCTACTTGCCGTTCACTGAGGCCTTCGACCGAGCC c.1320
A M Y P F K K | V S Y L P F T E A F D R A p.440
. . . . . . g.17610
AAGGCTGAGAACAAGCTGGTGCACTCAATCCTGCTGTGGGGGGCCCTGGATGACCAGTCC c.1380
K A E N K L V H S I L L W G A L D D Q S p.460
| 11. . . . . . g.18758
TGCTGAG | GTTCAGGGCGGACTCTCCGGGAGACTGTCCTGGAAAGTTCGCCCATCCTCACC c.1440
C U G | S G R T L R E T V L E S S P I L T p.480
SRE (Sec redefinition element)
. . . . . . g.18818
CTGCTCAACGAGAGCTTCATCAGCACCTGGTCCCTGGTGAAGGAGCTGGAGGAACTGCAG c.1500
L L N E S F I S T W S L V K E L E E L Q p.500
| 12 . . . . . . g.18961
| AACAACCAGGAGAACTCGTCCCACCAGAAGCTGGCTGGCCTGCACCTGGAGAAGTACAGC c.1560
| N N Q E N S S H Q K L A G L H L E K Y S p.520
. . . . | 13 . . g.20390
TTCCCCGTGGAGATGATGATCTGCCTGCCCAATGGCACCGTG | GTCCATCACATCAATGCC c.1620
F P V E M M I C L P N G T V | V H H I N A p.540
. . . . . . g.20450
AACTACTTCTTGGACATCACCTCCGTGAAGCCCGAGGAAATCGAGAGCAATCTCTTCAGC c.1680
N Y F L D I T S V K P E E I E S N L F S p.560
. . . . . . g.20510
TTCTCATCCACCTTTGAAGACCCGTCCACGGCCACCTACATGCAGTTCCTGAAGGAGGGA c.1740
F S S T F E D P S T A T Y M Q F L K E G p.580
. . . g.20543
CTCCGGCGTGGCCTGCCCCTCCTCCAGCCCTAG c.1773
L R R G L P L L Q P * p.590
. . . . . . g.20603
agtgcctggacgggatctgatgcacaggcccccacgcctcagagccagagtggtcctcag c.*60
. . . . . . g.20663
cccatttcagactgcagatgccgcccactcccaccccactcctaggctgccttggagggt c.*120
. . . . . . g.20723
acaagatccactgagggtggccaccacagccttggctccatggtggcgggtagacaaggg c.*180
. . . . . . g.20783
atgcctgggctgactgggcagaggaacctctagctctgactgtcactcggctctccctac c.*240
. . . . . . g.20843
ccatttggctctggaagctgcttggcccccccagatcagggcctgggtgaactccctgga c.*300
. . . . . . g.20903
cctttcctagccagccgcacagtctaggcccttgtggggtgaagaatggagggaggagca c.*360
. . . . . . g.20963
ggctaggaagacggggccaccaccctctccttgctttcagcccttcccacaggaaacatc c.*420
. . . . . . g.21023
aagaagccccagccaggaggggccaggctgccaaggcggctcccctgtttatctagagcc c.*480
. . . . . . g.21083
ttcgttcctggccataccccggactgccctcctgtgcctgatgtccccagctggggtcag c.*540
. . . . . . g.21143
tctcaacaggagccagtcttctggagcctctgggcagaaccctccatcagagtggaaatc c.*600
. . . . . . g.21203
agacgggaccccctgcagcttccctgaccacgccactgaccagctatctggggaagttta c.*660
. . . . . . g.21263
ctgtgaaggggtttctgcctttagcaatggggttcactaagggggttcccgaggcccagg c.*720
. . . . . . g.21323
gccaaggcactcccaccgcctaccttagcacagggtctctgcaggactgcgggagccagc c.*780
. . . . . . g.21383
gctcctgccgcccctcttgcccctcagaccttgcatccacagaagcacaacccagccaaa c.*840
. . . . . . g.21443
caccacagccttctccagagccggcactgtcccggcaaccaggggtgccccaggctagct c.*900
. . . . . . g.21503
cttctacctctggggcaccacggactccccttggccactcttgggactttggtccacgtc c.*960
. . . . . . g.21563
ctgagccactgaccacggccagtctctctttttatatgtgcagaaaagtgtttttacaca c.*1020
. . . . . . g.21623
aactttctcatggtttgtaggtatttttttataaccccagtgctgaggagaaaggagggg c.*1080
. . . . . . g.21683
cagtggcttccccggcagcagccccatgatggctgaatccgaaatcctcgatgggtccag c.*1140
. . . . . . g.21743
cttgatgtctttgcagctgcacctatgggaagaagtagtcctctcttccttctcctcttc c.*1200
. . . . . . g.21803
agctttttaaaaacagtcctcagaggatccatgatccccagcactgtcccatcctccaca c.*1260
. . . . . . g.21863
aaggcccacaggcatgcctgtactctctttcattaaggtcttgaagtcaggctgccccct c.*1320
. . . . . . g.21923
ccccagcccccagttctctccccaccccctcaccccacccggggctcactcagcctggca c.*1380
. . . . . . g.21983
gaggaagaaggaaggcagacatctccgcagccactcctgggccttttatgtgccgagtta c.*1440
. . . . . . g.22043
ccccacttgccttgggcgtgtccactgagccttccccagccagtcttgttctcaattttg c.*1500
. . . . . . g.22103
ttttgttttgttttgagacggagtcttgctctgtcacccaggctggagtgctatggctcg c.*1560
. . . . . . g.22163
atcttggctcactgcaacctccacctcccaggttcaagcaattctcttgcctcagcctcc c.*1620
. . . . . . g.22223
cgagtagctgggattacaggtgcatgccaccatggctggctaatttttgtatttttagta c.*1680
. . . . . . g.22283
gagatggggtttcaccatattggtcaggctgatctggaacttctgacctcaggtgatcca c.*1740
. . . . . . g.22343
cctgcctcagcctcccaaagtgctgggattacaggcgtgagcaatcgtgcccagccttgt c.*1800
. . . . . . g.22403
tcttaattttgtatcatccagtcatcgctaatattacacgcaccttctcacttaatcctc c.*1860
. . . . . . g.22463
acgacaagcctgtgaggcagatgctcattgttcccatcttgatgaaacttgagtctcagg c.*1920
. . . . . . g.22523
gaagtgaagtgacttgcccagggtcactcaggtagagttgagattcaaacccacatgtgg c.*1980
. . . . . . g.22583
ctccaaagtctgcatctggatttgggggtgttttttggcatggcaccctcacctctctcc c.*2040
. . . . . . g.22643
ctgcctgttttccccaaagtggaaaggaaggcctttcaaaccagagtgtctcactcccct c.*2100
. . . . . . g.22703
ctgacctccagaccagatggggcatgagccagccagctcagccaggctccctgtgtcctg c.*2160
. . . . . . g.22763
ggaggaagtgtccccatcccccatgccccttatggggagggagggcgtctgatgctctct c.*2220
. . . . . . g.22823
ctctgcctccccccccatcctgtcaggcacaggtgacgggggcagcccatgcgagccctt c.*2280
. . . . . . g.22883
ctcctgctgctctgggagggccagttccacattgagccagcctggtcccatggaaaatga c.*2340
. . . . . . g.22943
tggcctgggctttctgaggccttatctgatgcctctgcagttcatgtcccccaccaggcc c.*2400
. . . . . . g.23003
tcgaggctcagggtgggagagggccccgggctgccctgtcactcctctaacacttccctc c.*2460
. . . . g.23047
ccctgtccccaacatgccctgtaataaaattagagaagactaac c.*2504
(downstream sequence)
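The c. and g. labels in the listing above can be cross-checked with a small Python sketch. It assumes only the anchor values printed in the listing (c.-1 = g.5055 and c.1773 = g.20543) and the HGVS conventions that there is no c.0 and that 3' UTR positions are written c.*N. The function name c_to_g and the constants are hypothetical names for this example. The mapping is deliberately restricted to the displayed 5' UTR, the coding part of exon 1 (up to c.183) and exon 13 (from c.1603), because introns shift the offset for every exon in between.

```python
G_AT_C_MINUS_1 = 5055    # g. position of c.-1 (from the listing)
G_AT_CDS_END = 20543     # g. position of c.1773, last base of the stop codon
UTR5_SHOWN = 55          # 5' UTR bases displayed in the listing
EXON1_LAST_C = 183       # last coding base before the exon 1/2 boundary
EXON13_FIRST_C = 1603    # first coding base of exon 13
CDS_LAST_C = 1773
UTR3_LENGTH = 2504       # c.*1 to c.*2504


def c_to_g(c_pos):
    """Convert a label such as 'c.60' or 'c.*60' to its g. number.

    Hypothetical helper: raises ValueError outside the regions whose
    offset can be read directly from the listing.
    """
    label = c_pos[2:] if c_pos.startswith("c.") else c_pos
    if label.startswith("*"):
        n = int(label[1:])
        if 1 <= n <= UTR3_LENGTH:
            return G_AT_CDS_END + n
        raise ValueError("outside the listed 3' UTR: " + c_pos)
    n = int(label)
    if n == 0:
        raise ValueError("HGVS numbering has no position c.0")
    if -UTR5_SHOWN <= n < 0:
        return G_AT_C_MINUS_1 + n + 1      # c.-1 maps to g.5055
    if 1 <= n <= EXON1_LAST_C:
        return G_AT_C_MINUS_1 + n          # c.1 maps to g.5056
    if EXON13_FIRST_C <= n <= CDS_LAST_C:
        return G_AT_CDS_END - (CDS_LAST_C - n)
    raise ValueError("offset not covered by this sketch: " + c_pos)


print(c_to_g("c.60"))      # 5115
print(c_to_g("c.1620"))    # 20390
print(c_to_g("c.*2504"))   # 23047
```

A full converter would need the start and end of every exon; the restricted version above only reproduces values that can be verified against the printed labels.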