|
|
|
| Protein Name | keratin 36, K36 |
|---|
| Former Protein Name | Ha6 |
|---|
| Gene Symbol | KRT36 |
|---|
| Former Gene Symbol | KRTHA6 |
|---|
| Intermediate Filament Type | I |
|---|
| HGNC ID | 6454 |
|---|
| OMIM ID | *604540 |
|---|
| NCBI Gene ID | 8689 |
|---|
INTEGRATED SEQUENCE VIEW |
|---|
Information last updated on 2009-07-21 17:06:48 * variant associated with disease as a risk factor rather than a causal factor. Note: Sequences are numbered according to latest reference sequence information. Discrepancies may exist with some publications. | Chromosome | 17q12-q21 |
|---|
| Chromosome Strand | - |
|---|
| Chromosome GI | 51511734 |
|---|
| Chromosome RefSeq ID | NC_000017.9 |
|---|
36899642 : ATGGCCACCC AGACCTGCAC CCCTACCTTC TCCACTGGGT CTATCAAGGG : 36899593
1 : ATGGCCACCC AGACCTGCAC CCCTACCTTC TCCACTGGGT CTATCAAGGG : 50
1 : M A T Q T C T P T F S T G S I K G : 17
36899592 : CCTCTGTGGC ACAGCAGGCG GCATCTCTCG GGTGTCCTCC ATCCGTTCTG : 36899543
51 : CCTCTGTGGC ACAGCAGGCG GCATCTCTCG GGTGTCCTCC ATCCGTTCTG : 100
18 : L C G T A G G I S R V S S I R S : 33
36899542 : TGGGCTCCTG CAGGGTCCCC AGTCTCGCCG GTGCTGCAGG GTACATCTCT : 36899493
101 : TGGGCTCCTG CAGGGTCCCC AGTCTCGCCG GTGCTGCAGG GTACATCTCT : 150
34 : V G S C R V P S L A G A A G Y I S : 50
36899492 : TCTGCTAGGT CGGGCCTCTC TGGCCTTGGG AGCTGCTTGC CTGGCTCCTA : 36899443
151 : TCTGCTAGGT CGGGCCTCTC TGGCCTTGGG AGCTGCTTGC CTGGCTCCTA : 200
51 : S A R S G L S G L G S C L P G S Y : 67
36899442 : CCTGTCTTCT GAGTGCCACA CCTCTGGCTT TGTGGGGAGC GGGGGCTGGT : 36899393
201 : CCTGTCTTCT GAGTGCCACA CCTCTGGCTT TGTGGGGAGC GGGGGCTGGT : 250
68 : L S S E C H T S G F V G S G G W : 83
36899392 : TCTGCGAGGG CTCCTTCAAC GGCAGCGAGA AGGAGACTAT GCAGTTCCTG : 36899343
251 : TCTGCGAGGG CTCCTTCAAC GGCAGCGAGA AGGAGACTAT GCAGTTCCTG : 300
84 : F C E G S F N G S E K E T M Q F L : 100
36899342 : AACGACCGCC TGGCCAACTA CCTGGAGAAG GTGCGTCAGC TGGAGCGGGA : 36899293
301 : AACGACCGCC TGGCCAACTA CCTGGAGAAG GTGCGTCAGC TGGAGCGGGA : 350
101 : N D R L A N Y L E K V R Q L E R E : 117
36899292 : GAACGCGGAG CTGGAGAGCC GCATCCAGGA GTGGTACGAG TTTCAGATCC : 36899243
351 : GAACGCGGAG CTGGAGAGCC GCATCCAGGA GTGGTACGAG TTTCAGATCC : 400
118 : N A E L E S R I Q E W Y E F Q I : 133
36899242 : CATACATCTG CCCAGACTAC CAGTCCTACT TCAAGACCAT CGAAGATTTC : 36899193
401 : CATACATCTG CCCAGACTAC CAGTCCTACT TCAAGACCAT CGAAGATTTC : 450
134 : P Y I C P D Y Q S Y F K T I E D F : 150
36899192 : CAGCAGAAGG TGAGGGAGAC CTGGCCCCTT TCCAGCTGAG CAGCCCAACT : 36899143
451 : CAGCAGAAG : 459
151 : Q Q K : 153
36899142 : CTTAGGAGGC GTCTGTCTCA TTCCAGGGCC TTTCCAGATG GGACGGTAGC : 36899093
36899092 : TTTTAGGGAA TTCTCCTGGT AGGGCAAACT TTTCTCTCTA GGGCTTTGGT : 36899043
36899042 : TTCCTGGGGG ACCCAGGCTT CTCTGCTGGT CTCTTGCTTG TACCTTGTTG : 36898993
36898992 : CCAGCCGTTC CTCACCCCCC TCCCCTTGCT GTAGGAAGGG AAGGAACATG : 36898943
36898942 : TACCAAGCCC CTACCGGATA CAGCACTGTG CCTGACACTT TCCTATTCCC : 36898893
36898892 : TGGCTTATTT ATTCTTCATC TTGGAGAGAT TGGTATTACT GGACCCATCT : 36898843
36898842 : GACAGATGTG GAAGTTGAAT TTTGAGGGGT TCAGCAACTT GCTCAAGAGC : 36898793
36898792 : ACACAGGCTC AGTGTTAGAT GCAGAACCTG AACTCAGGTC TGCCTCACCC : 36898743
36898742 : TGGTGTATCT TGCTGTGCCA CTCTGAAGGC TACAGGCGCC AGGACTGATG : 36898693
36898692 : GTCACTGACA TCCTCGGGAT GCCCCTGTGG GGTGGATGCT CCCCCCACAC : 36898643
36898642 : CTGGGGTGTG GCTCAGGGCT GGAAATGGTG AAGGGGCTGG TGGTACACAG : 36898593
36898592 : TGGCATGGCC TCTGTGAGCT GAGCTATCCC CCTACCTGCC CCCACACTGC : 36898543
36898542 : TGGGCCTCCT GTAGCTTATG GGAAGCCTCT TGTTCCCCAG ATCCTGCTGA : 36898493
460 : ATCCTGCTGA : 469
154 : I L L : 156
36898492 : CTAAGTCTGA GAATGCCAGG CTGGTCCTGC AGATTGATAA TGCCAAGCTG : 36898443
470 : CTAAGTCTGA GAATGCCAGG CTGGTCCTGC AGATTGATAA TGCCAAGCTG : 519
157 : T K S E N A R L V L Q I D N A K L : 173
36898442 : GCTGCTGACG ACTTCCGGAC CAAGTGAGTG GGCCTGATCA GGGAAGGTTT : 36898393
520 : GCTGCTGACG ACTTCCGGAC CAA : 542
174 : A A D D F R T K : 181
36898392 : GCCATGCCAC TCCCTTGCCT CTGTCCCCTG CCTTTGGGAG CTGGTGATGT : 36898343
36898342 : GGGAATAAGG CTGGATGGGA AAATGTGGCA TGAGCCCTGG ACCTCAGGGG : 36898293
36898292 : AAAGGAGTGT CATGCTCCTG ATCCCGTCAG CCACTGCAGC TGGACAGTAG : 36898243
36898242 : GGCAGGGGTC CCTGCTGGTG TGTGGTTTCC TTTGCTGCAG ACTTGGCAGG : 36898193
36898192 : GTTCTGTGTG TGCAGGTATG AGACAGAGCT GTCTCTGCGG CAGCTAGTGG : 36898143
543 : GTATG AGACAGAGCT GTCTCTGCGG CAGCTAGTGG : 577
182 : Y E T E L S L R Q L V : 192
36898142 : AGGCCGACAT CAACGGCCTG CGTAGGATCC TGGATGAGCT GACCCTGTGC : 36898093
578 : AGGCCGACAT CAACGGCCTG CGTAGGATCC TGGATGAGCT GACCCTGTGC : 627
193 : E A D I N G L R R I L D E L T L C : 209
36898092 : AAGGCTGACC TGGAGGCTCA GGTGGAGTCC CTGAAGGAGG AGCTGATGTG : 36898043
628 : AAGGCTGACC TGGAGGCTCA GGTGGAGTCC CTGAAGGAGG AGCTGATGTG : 677
210 : K A D L E A Q V E S L K E E L M C : 226
36898042 : CCTCAAGAAG AATCACGAGG AGGTGAGGCT GGTGCCATGT GACTTCCCAG : 36897993
678 : CCTCAAGAAG AATCACGAGG AG : 699
227 : L K K N H E E : 233
36897992 : TGTTTCCCAT CCAGCTTAGG AAGCCACTGC TGGGCTTTCA GTTTTCTGTG : 36897943
36897942 : CGGCAGGAAC TATACAAAGG CCTTGCATTT CATTCTCGTT TCATTTCATC : 36897893
36897892 : CTTACAATAA TCCCAAGAAT TATAAACTGT TACAAGCTCC ATTTTACAGA : 36897843
36897842 : TGAGAAAACT TAGGCACAAA GAGGTTAAGT TGCTTGCCTA AGGTCATAGA : 36897793
36897792 : GGGTCTACAC TTTTGCCCAT AACACTACAT GTCTATTTGG GCTCTAGTGC : 36897743
36897742 : CTGATAACAG CAATTTAATT TGCCTAGGGT TTGTATCATC TCACAAATAT : 36897693
36897692 : CCCATAGAAG GAGGTAGGTA TATACGGAGA AGGAGACCAA GGCTCAGAGA : 36897643
36897642 : TATTTAAGTA TCCTGTCTAA TGCTATGCAG CTGGTGACTG AGGGAAGAGG : 36897593
36897592 : GTTTGAATTC AGGTCATTGA AACCTGCAAT CCAGCATCTT TTTCACAACC : 36897543
36897542 : TCATGCCGTC TTGCCTCCTC TCTGCAGGAA GTCAGTGTAC TCCGTTGCCA : 36897493
700 : GAA GTCAGTGTAC TCCGTTGCCA : 722
234 : E V S V L R C Q : 241
36897492 : ACTTGGGGAC CGACTGAATG TGGAGGTGGA CGCTGCTCCC CCAGTGGATC : 36897443
723 : ACTTGGGGAC CGACTGAATG TGGAGGTGGA CGCTGCTCCC CCAGTGGATC : 772
242 : L G D R L N V E V D A A P P V D : 257
36897442 : TCAACAAGAT CCTGGAGGAT ATGAGATGCC AGTACGAGGC CCTGGTGGAG : 36897393
773 : TCAACAAGAT CCTGGAGGAT ATGAGATGCC AGTACGAGGC CCTGGTGGAG : 822
258 : L N K I L E D M R C Q Y E A L V E : 274
36897392 : AATAACCGCA GAGATGTGGA GGCCTGGTTC AACACCCAGG TGGGGCTGGG : 36897343
823 : AATAACCGCA GAGATGTGGA GGCCTGGTTC AACACCCAG : 861
275 : N N R R D V E A W F N T Q : 287
36897342 : GTGCCCTGGG ACCACAGGCT CCTGGGCTGG GGTTACCCTT GGAAGTAGCT : 36897293
36897292 : TGGTTTGACC ATGCTCTGGG CCCTGGCATG TGTTTCAGAC TGAGGAGCTG : 36897243
862 : AC TGAGGAGCTG : 873
288 : T E E L : 291
36897242 : AACCAGCAGG TGGTGTCCAG CTCGGAGCAG CTGCAGTGCT GCCAGACGGA : 36897193
874 : AACCAGCAGG TGGTGTCCAG CTCGGAGCAG CTGCAGTGCT GCCAGACGGA : 923
292 : N Q Q V V S S S E Q L Q C C Q T E : 308
36897192 : GATCATCGAG CTGAGACGTA CGGTCAACGC GCTAGAGATT GAGCTGCAGG : 36897143
924 : GATCATCGAG CTGAGACGTA CGGTCAACGC GCTAGAGATT GAGCTGCAGG : 973
309 : I I E L R R T V N A L E I E L Q : 324
36897142 : CTCAGCACAG CATGGTGAGT GGCCCCTGCC TGCGTCGCTG GCCACGGCCT : 36897093
974 : CTCAGCACAG CATG : 987
325 : A Q H S M : 329
36897092 : GTGGCAGGTC CCCGACGCAC CAGCCTCAGC GTGCAGGCTC TCATGGGGTG : 36897043
36897042 : TGATCACAGG CCGTAGGCAG ATGCCCAGGG CTGTGGGTTT CTGGGGTCAG : 36896993
36896992 : GGGATTCCTC CCCAATAGGC AGCTCTTCCT CTTTCCCATT GCAGCGGAAT : 36896943
988 : CGGAAT : 993
330 : R N : 331
36896942 : TCCTTGGAAT CCACCCTGGC CGAAACCGAG GCCCGCTACA GCTCCCAGCT : 36896893
994 : TCCTTGGAAT CCACCCTGGC CGAAACCGAG GCCCGCTACA GCTCCCAGCT : 1043
332 : S L E S T L A E T E A R Y S S Q L : 348
36896892 : GGCCCAGATG CAGTGCCTGA TCAGCAACGT GGAGGCCCAG CTGTCTGAGA : 36896843
1044 : GGCCCAGATG CAGTGCCTGA TCAGCAACGT GGAGGCCCAG CTGTCTGAGA : 1093
349 : A Q M Q C L I S N V E A Q L S E : 364
36896842 : TCCGCTGCGA CCTGGAGCGG CAGAACCAGG AGTACCAGGT GTTACTGGAC : 36896793
1094 : TCCGCTGCGA CCTGGAGCGG CAGAACCAGG AGTACCAGGT GTTACTGGAC : 1143
365 : I R C D L E R Q N Q E Y Q V L L D : 381
36896792 : GTCAAGGCCC GGCTGGAGGG CGAGATCGCT ACCTACCGCC ACCTGCTGGA : 36896743
1144 : GTCAAGGCCC GGCTGGAGGG CGAGATCGCT ACCTACCGCC ACCTGCTGGA : 1193
382 : V K A R L E G E I A T Y R H L L E : 398
36896742 : GGGAGAGGAC TGCAAGTGAG TGGCCCTTGG GCTGGGGTAG GGCTTGACTG : 36896693
1194 : GGGAGAGGAC TGCAA : 1208
399 : G E D C K : 403
36896692 : AACCCTCAGT GCCATGTGGA GGGCGTCAAG CCCAGAAGTG GTTGTCGCCC : 36896643
36896642 : AGATGAAGGG AACTAAACCA AAGCCCCTTG AGATTCTCCA TTTAGTCCCA : 36896593
36896592 : GGCTTTGGTA ATGCACAGCG GGAGAATCCA ACCCAACACA CGCCGCGTGT : 36896543
36896542 : TTTCCGCCAT CTTTTCTGAT TGGCAGTTTC TGCTCTTCAT TCCTGTAGCT : 36896493
36896492 : CAGTCCTCTC ACCCTTGGGG AATTCAGAGG CACTGAGATG ATCCGGGGCC : 36896443
36896442 : ACCGGTCTCG CTTGATCCTC TAGATCTGTT TAACACGAAT CTCAGCCCAG : 36896393
36896392 : TGCTCCGATG CCAAATGCAC CCTGCATGAT TTTGTTTCCT CAGGCTTCCT : 36896343
1209 : GCTTCCT : 1215
404 : L P : 405
36896342 : CCCCAACCTT GTGCCACGGC ATGCAAGCCT GTTATTAGAG TTCCTTCTGT : 36896293
1216 : CCCCAACCTT GTGCCACGGC ATGCAAGCCT GTTATTAGAG TTCCTTCTGT : 1265
406 : P Q P C A T A C K P V I R V P S V : 422
36896292 : CCCCCCGGTG CCCTGTGTCC CCTCTGTGCC CTGCACCCCG GCTCCCCAGG : 36896243
1266 : CCCCCCGGTG CCCTGTGTCC CCTCTGTGCC CTGCACCCCG GCTCCCCAGG : 1315
423 : P P V P C V P S V P C T P A P Q : 438
36896242 : TTGGCACTCA GATCCGCACC ATCACCGAGG AGATCAGAGA TGGGAAAGTC : 36896193
1316 : TTGGCACTCA GATCCGCACC ATCACCGAGG AGATCAGAGA TGGGAAAGTC : 1365
439 : V G T Q I R T I T E E I R D G K V : 455
36896192 : ATCTCCTCCA GGGAGCACGT GCAGTCCCGC CCGCTGTGAC AGCCCACTTG : 36896143
1366 : ATCTCCTCCA GGGAGCACGT GCAGTCCCGC CCGCTGTGAC AGCCCACTTG : *11
456 : I S S R E H V Q S R P L : 467
36896142 : GTCCACCAGG GCAGGGCCCT GACCACAGGA AGGAGGACAC CCCTGTGGCT : 36896093
*12 : GTCCACCAGG GCAGGGCCCT GACCACAGGA AGGAGGACAC CCCTGTGGCT : *61
36896092 : CCTGGAGGCT TAACGACCCT GCCCTTCTCT AGAGGGGTCC CCCTACGCTT : 36896043
*62 : CCTGGAGGCT TAACGACCCT GCCCTTCTCT AGAGGGGTCC CCCTACGCTT : *111
36896042 : AGCAGGTTTT TCTACCAAAA CACTCCCCGT ATTGTGTTTC CGGACTTAAC : 36895993
*112 : AGCAGGTTTT TCTACCAAAA CACTCCCCGT ATTGTGTTTC CGGACTTAAC : *161
36895992 : TGTGCTTTTA CGCCATGCAA AACCAGGTTT CCTGGAAATT TACCCAATAA : 36895943
*162 : TGTGCTTTTA CGCCATGCAA AACCAGGTTT CCTGGAAATT TACCCAATAA : *211
36895942 : AGTGTGTTCT CCTGGCATAG CAAACTCAA- ---------- ---------- : 36895914
*212 : AGTGTGTTCT CCTGGCATAG CAAACTCAAA AAAAAAAAAA AAAAAAAAAA : *261
36895914 : ---------- ---------- --- : 36895914
*262 : AAAAAAAAAA AAAAAAAAAA AAA : *284
|
Back to top |
mRNA SEQUENCE |
|---|
Reference sequence last updated on 2009-07-21 17:06:48 | RefSeq ID | NM_003771.4 |
|---|
| GI | 94538346 |
|---|
| Length | 1688 nucleotides |
|---|
| Download | File | Text |
|---|
| Sequence | Legend: Blue = CDS, Red = UTR 1 ATGGCCACCC AGACCTGCAC CCCTACCTTC TCCACTGGGT CTATCAAGGG 50
51 CCTCTGTGGC ACAGCAGGCG GCATCTCTCG GGTGTCCTCC ATCCGTTCTG 100
101 TGGGCTCCTG CAGGGTCCCC AGTCTCGCCG GTGCTGCAGG GTACATCTCT 150
151 TCTGCTAGGT CGGGCCTCTC TGGCCTTGGG AGCTGCTTGC CTGGCTCCTA 200
201 CCTGTCTTCT GAGTGCCACA CCTCTGGCTT TGTGGGGAGC GGGGGCTGGT 250
251 TCTGCGAGGG CTCCTTCAAC GGCAGCGAGA AGGAGACTAT GCAGTTCCTG 300
301 AACGACCGCC TGGCCAACTA CCTGGAGAAG GTGCGTCAGC TGGAGCGGGA 350
351 GAACGCGGAG CTGGAGAGCC GCATCCAGGA GTGGTACGAG TTTCAGATCC 400
401 CATACATCTG CCCAGACTAC CAGTCCTACT TCAAGACCAT CGAAGATTTC 450
451 CAGCAGAAGA TCCTGCTGAC TAAGTCTGAG AATGCCAGGC TGGTCCTGCA 500
501 GATTGATAAT GCCAAGCTGG CTGCTGACGA CTTCCGGACC AAGTATGAGA 550
551 CAGAGCTGTC TCTGCGGCAG CTAGTGGAGG CCGACATCAA CGGCCTGCGT 600
601 AGGATCCTGG ATGAGCTGAC CCTGTGCAAG GCTGACCTGG AGGCTCAGGT 650
651 GGAGTCCCTG AAGGAGGAGC TGATGTGCCT CAAGAAGAAT CACGAGGAGG 700
701 AAGTCAGTGT ACTCCGTTGC CAACTTGGGG ACCGACTGAA TGTGGAGGTG 750
751 GACGCTGCTC CCCCAGTGGA TCTCAACAAG ATCCTGGAGG ATATGAGATG 800
801 CCAGTACGAG GCCCTGGTGG AGAATAACCG CAGAGATGTG GAGGCCTGGT 850
851 TCAACACCCA GACTGAGGAG CTGAACCAGC AGGTGGTGTC CAGCTCGGAG 900
901 CAGCTGCAGT GCTGCCAGAC GGAGATCATC GAGCTGAGAC GTACGGTCAA 950
951 CGCGCTAGAG ATTGAGCTGC AGGCTCAGCA CAGCATGCGG AATTCCTTGG 1000
1001 AATCCACCCT GGCCGAAACC GAGGCCCGCT ACAGCTCCCA GCTGGCCCAG 1050
1051 ATGCAGTGCC TGATCAGCAA CGTGGAGGCC CAGCTGTCTG AGATCCGCTG 1100
1101 CGACCTGGAG CGGCAGAACC AGGAGTACCA GGTGTTACTG GACGTCAAGG 1150
1151 CCCGGCTGGA GGGCGAGATC GCTACCTACC GCCACCTGCT GGAGGGAGAG 1200
1201 GACTGCAAGC TTCCTCCCCA ACCTTGTGCC ACGGCATGCA AGCCTGTTAT 1250
1251 TAGAGTTCCT TCTGTCCCCC CGGTGCCCTG TGTCCCCTCT GTGCCCTGCA 1300
1301 CCCCGGCTCC CCAGGTTGGC ACTCAGATCC GCACCATCAC CGAGGAGATC 1350
1351 AGAGATGGGA AAGTCATCTC CTCCAGGGAG CACGTGCAGT CCCGCCCGCT 1400
1401 GTGACAGCCC ACTTGGTCCA CCAGGGCAGG GCCCTGACCA CAGGAAGGAG *46
*47 GACACCCCTG TGGCTCCTGG AGGCTTAACG ACCCTGCCCT TCTCTAGAGG *96
*97 GGTCCCCCTA CGCTTAGCAG GTTTTTCTAC CAAAACACTC CCCGTATTGT *146
*147 GTTTCCGGAC TTAACTGTGC TTTTACGCCA TGCAAAACCA GGTTTCCTGG *196
*197 AAATTTACCC AATAAAGTGT GTTCTCCTGG CATAGCAAAC TCAAAAAAAA *246
*247 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA
|
|---|
Back to top |
RELATED SEQUENCES AND MULTIPLE SEQUENCE ALIGNMENT |
|---|
The following table shows the orthologous sequences in other organisms as indicated by NCBI HomoloGene ID 88459. Information last updated on 2007-08-17 12:29:14 Vertebrate ClustalW multiple sequence alignments Multiple sequence alignments were created using ClustalW for vertebrate sequences and the results available for viewing, download and visualization. Please use the analysis tools to include other sequences. | Type of ClustalW alignment | Download in FASTA format | View file in FASTA format |
|---|
| Nucleotide | file | view | | Protein | file | view |
Back to top | |
 |
|