| Protein Name | keratin 33a, K33a |
|---|---|
| Former Protein Name | Ha3-I |
| Gene Symbol | KRT33A |
| Former Gene Symbol | KRTHA3A |
| Intermediate Filament Type | I |
| HGNC ID | 6450 |
| OMIM ID | *602761 |
| NCBI Gene ID | 3883 |

INTEGRATED SEQUENCE VIEW

Information last updated on 2009-07-21 17:12:53

\* variant associated with disease as a risk factor rather than a causal factor.

Note: Sequences are numbered according to latest reference sequence information. Discrepancies may exist with some publications.

| Chromosome | 17q12-q21 |
|---|---|
| Chromosome Strand | - |
| Chromosome GI | 51511734 |
| Chromosome RefSeq ID | NC_000017.9 |

36760582 : GGACTCTGTC TTCAGCTGGA CACTCCCTCC CTGCACCATG TCTTACAGTT : 36760533
-37 : GGACTCTGTC TTCAGCTGGA CACTCCCTCC CTGCACCATG TCTTACAGTT : 13
1 : M S Y S : 4
36760532 : GTGGCCTGCC CAGCCTGAGC TGCCGCACCA GCTGCTCCTC CCGGCCCTGT : 36760483
14 : GTGGCCTGCC CAGCCTGAGC TGCCGCACCA GCTGCTCCTC CCGGCCCTGT : 63
5 : C G L P S L S C R T S C S S R P C : 21
36760482 : GTGCCCCCCA GCTGCCACGG CTGCACCCTG CCCGGGGCCT GCAACATCCC : 36760433
64 : GTGCCCCCCA GCTGCCACGG CTGCACCCTG CCCGGGGCCT GCAACATCCC : 113
22 : V P P S C H G C T L P G A C N I P : 38
36760432 : CGCCAATGTG AGCAACTGCA ACTGGTTCTG TGAGGGCTCC TTCAATGGCA : 36760383
114 : CGCCAATGTG AGCAACTGCA ACTGGTTCTG TGAGGGCTCC TTCAATGGCA : 163
39 : A N V S N C N W F C E G S F N G : 54
36760382 : GTGAGAAGGA GACCATGCAG TTCCTGAACG ACCGCCTGGC CAGCTACCTG : 36760333
164 : GTGAGAAGGA GACCATGCAG TTCCTGAACG ACCGCCTGGC CAGCTACCTG : 213
55 : S E K E T M Q F L N D R L A S Y L : 71
36760332 : GAGAAGGTGC GTCAGCTGGA GCGGGACAAC GCGGAGCTGG AGAACCTCAT : 36760283
214 : GAGAAGGTGC GTCAGCTGGA GCGGGACAAC GCGGAGCTGG AGAACCTCAT : 263
72 : E K V R Q L E R D N A E L E N L I : 88
36760282 : CCGGGAGCGG TCACAGCAGC AGGAGCCCTT GGTGTGTGCC AGCTACCAGT : 36760233
264 : CCGGGAGCGG TCACAGCAGC AGGAGCCCTT GGTGTGTGCC AGCTACCAGT : 313
89 : R E R S Q Q Q E P L V C A S Y Q : 104
36760232 : CCTACTTCAA GACCATTGAG GAGCTCCAGC AGAAGGTGAG GAGTGGGCGA : 36760183
314 : CCTACTTCAA GACCATTGAG GAGCTCCAGC AGAAG : 348
105 : S Y F K T I E E L Q Q K : 116
36760182 : CACAGTGCCT CCAGTAGGAA GTTGTCGGGA GAGAATCCGA GTTGTGATTG : 36760133
36760132 : AGAAACAACG TAAGCTCTTG ATGTGAATGA AAAGCTGAAA ACTGGCCTTT : 36760083
36760082 : CCATTTTGTT TTTTGAGGGC CCATCTAATA CTAGACTTGG GACTGTTAGG : 36760033
36760032 : TCTAAATATT TGCATAGTGT TAGTCTATGT GGAGGTAGCA GCACTTTGCA : 36759983
36759982 : TAAATGAGCT GTAATTAAAA ACTACCTCTG AATGCTTCCA TGCACCTGCC : 36759933
36759932 : AAGCTCCAGA ATTATTACCC TGAGGTGAGA GAAAAGGACT GAGTCCAAAG : 36759883
36759882 : GAAGCTGGGG AAACGGAATC TTTTGTAAAG ACTGGGCCAT TTTCTTTCTC : 36759833
36759832 : CCTCATCTGA GCTTTTTGCC CCTTGGACTG CTGGCTGCCT CTCTCTTCTA : 36759783
36759782 : GGTTCCTACT TATCCTGATT TCATTGTCCT CACCTTCATG CCTCTGGGCT : 36759733
36759732 : TCTCCTGGTG CACAGTGTCC AGTGAGAGCC TAAAATGTTT GGCCCAAATT : 36759683
36759682 : GGGAAGATTG GAAATTACCT AGGACAAAGT AGTCCCAGTT GTCTGGGACA : 36759633
36759632 : TCTGGGAACA TGTGGCTTCT TCCCTGATCC AGTCCCCTGC AATTGCTAGT : 36759583
36759582 : TTACCTATTT GTCTTTTCTG CTTGATTGAA AGTTTCTTGA GGGCAGGATC : 36759533
36759532 : AAGTGCTTAC CAATTTTTGT ATCCTCAGCT CCTAGTATCA TGCCAGAAAC : 36759483
36759482 : ATAGTAGGTG TTTTTCATAA ATGTCAACTA TTATGATCAG CTTCTCAGTA : 36759433
36759432 : TTGCTCATTA GGATAGAAAA AAGATTATCT ACTTATACTT TTTGCATATT : 36759383
36759382 : TTGGTGTTCT TTTTTTGAAT ATAATTTTGG AAATATTCAT TGAGTACCTG : 36759333
36759332 : CTGTATATAG AAAGCATTTA ATAAGATAGA CATCCTGATC CTTCCAAGGA : 36759283
36759282 : AGTGAATTAG GAAGGACTGC CATTGGTGAA AAATTACCAA GTCCTTGCTC : 36759233
36759232 : ATTCTTGAAT GTTTTGGGCC TTCTAGATCC TGTGCAGCAA GTCTGAGAAT : 36759183
349 : ATCC TGTGCAGCAA GTCTGAGAAT : 372
117 : I L C S K S E N : 124
36759182 : GCCAGGCTTG TGGTGCAGAT CGACAATGCC AAGCTGGCCT CAGATGACTT : 36759133
373 : GCCAGGCTTG TGGTGCAGAT CGACAATGCC AAGCTGGCCT CAGATGACTT : 422
125 : A R L V V Q I D N A K L A S D D F : 141
36759132 : CAGGACCAAG TGAGTGGGCA AGCGGGGTGT TGCTGTTTAT TTCTCTTCTC : 36759083
423 : CAGGACCAA : 431
142 : R T K : 144
36759082 : TGGCAGCCCT CCTTTCCTTT AGCCTGTTGT CAACACAAGT TAAACAAACA : 36759033
36759032 : GTGACTGAGC TATGTTCTAG CTGCTTTCTC CAAGATTCAC AGAGCCACCA : 36758983
36758982 : CACACCGTAG CTCAGAGATC CTTTCCCCAG GCCTTAGACT TCCTTTCCTC : 36758933
36758932 : CTTCTTCTCC TCTGTGACGT GACACCCCTA GTCGTACTGT GAGTTAGTGT : 36758883
36758882 : AGACGTGTTG GCCCCCGATG TTCTCCTATG CACAGAACCC ATTCATGAGA : 36758833
36758832 : ATGGAGAATG TGGAAGCATA TTATTTTGGA AGTCTTACAG TTTGTCAGCT : 36758783
36758782 : TTTATCCTAA ATCCACTGGC TTTCTTCTTT TTTTTAGAGA CAGGGCCTTG : 36758733
36758732 : CTCTGTCACC CAGGCTGGAG GGCAGTGGTG CAATCATAGC TCAGTGTAAC : 36758683
36758682 : TTCAGATCCT CAGACTCCTG GGCTCAAGCA ACCCTCTCCC CTTAGCCTCC : 36758633
36758632 : TGAATAGCTA GGAATATAGA AGTCTATCCA CTGGATCTTT AAACGTAAAA : 36758583
36758582 : TTACCAGTGC ATTTTCTTCC AGCTCGATGC TGAAAACCTA AACAAAAGAG : 36758533
36758532 : ATGGCTATAA AAACTACAAA CCCTAATTCC TAGTTGCCTG ACTAACCAAA : 36758483
36758482 : CCAAAGCAGA GTCAAGGCCA GGCCTATTCC TGCCCAGCCT GTTCCTGACC : 36758433
36758432 : CTCCTGTGAT CACAGATATG AGACCGAGCT GTCCCTGCGG CAGCTGGTGG : 36758383
432 : ATATG AGACCGAGCT GTCCCTGCGG CAGCTGGTGG : 466
145 : Y E T E L S L R Q L V : 155
36758382 : AGTCGGACAT CAATGGCCTG CGCAGGATCC TGGATGAGCT GACCCTGTGC : 36758333
467 : AGTCGGACAT CAATGGCCTG CGCAGGATCC TGGATGAGCT GACCCTGTGC : 516
156 : E S D I N G L R R I L D E L T L C : 172
36758332 : AGGTCTGACC TGGAGGCCCA GGTGGAGTCC CTGAAGGAGG AGCTGCTGTG : 36758283
517 : AGGTCTGACC TGGAGGCCCA GGTGGAGTCC CTGAAGGAGG AGCTGCTGTG : 566
173 : R S D L E A Q V E S L K E E L L C : 189
36758282 : CCTCAAGCAG AACCATGAGC AGGTGAGTTC CCTGAAGAGT GATGGGCTCA : 36758233
567 : CCTCAAGCAG AACCATGAGC AG : 588
190 : L K Q N H E Q : 196
36758232 : GACTAGCCAG GACCTGTTTC ACAACTGCCA CAGGGTCCAG GACAGGCTGT : 36758183
36758182 : CAGTAGAGGC TCAATTAGCT GAGGAAGTAA TTCTGCATAC AAAAAGGGAT : 36758133
36758132 : ATGTAGAGAA TGACTAATCT AAAAACCTGT GATATGAGAT TAGGATTTTT : 36758083
36758082 : CTGCCAGAGG CAGGAATGAG AGTGAGGAAG TTGGCAAGCC CAGTAGTGAG : 36758033
36758032 : GAAAGCTGGG TAATGAGGCA GAAGCCACAG CCTCTCCTGT GCCCAGGTTA : 36757983
36757982 : GAGTGCATGC AGCAGCAATC TTCTATTCCA ATTCTCCATT TGTAGCTGGA : 36757933
36757932 : GAAGCCAGGA TGGCCTAGGC CCATCAGGCA GAGAGGGCTG AGCTAGAACA : 36757883
36757882 : AAATGCAATT CTCTTCGTTC CTGTCCAGAG CTCTTTCTGC TTGCTGTTCA : 36757833
36757832 : GTGTTGCTTG GAAACAGCCT TGAGCAAAAT GCTCACCATT ATGGCTAAGT : 36757783
36757782 : CAGTTGGGCA AAAAAGAAGA CGTGTATTTT ATGCAACCTG AAATCCAAGA : 36757733
36757732 : ACAATGTAAA TGGTGACAGT TAATATTAAT CAAGTGCTTA GTATGTGCCA : 36757683
36757682 : AACATTGTGC TAAGTGCTTT AACTCCGGAG GCAGTTCTCA TGTTAAGATG : 36757633
36757632 : AAGAAATCGA GGCTTCAAGA AGCTTCTAGA AGCAATGTAT TAAGTGGTAG : 36757583
36757582 : AGATGAGGTT GAATACAAAT TTGTCTGAAA TCCGCCTAAG CCCAAGTACC : 36757533
36757532 : AAGCTATGCT TAGCAGGGAA GTAAACAGAT ACTTATTGCC TTCAGCAAAA : 36757483
36757482 : TCTATGCTGG TGGAATCAAC AAATCACCCT TTAAGTCTCT TCTATGCCAG : 36757433
36757432 : GATGGCAGAT TTTGAAGTAA GCATTCTGTA TTTCCTCTTT AGCTAGACTC : 36757383
36757382 : TGTTCTTTTT ATTATCCATG CTTAAGAGTC AAACATGTGA GGCTATATGG : 36757333
36757332 : TATACTAGCA AAAACACTGA ACTGGGCATT TGGGGACCAG AGTTTAAATA : 36757283
36757282 : ATGGCCCACC ATGTATCAGC TTTGAGCCCA AATCACTGAA TCTCTCTAAA : 36757233
36757232 : TGAGGAGTGA TACATTTAAT ACAAAGAGCA GAGGGAAGTC ATGGGCTTGA : 36757183
36757182 : AAGTAGTTTG TAAAAAGTGG AGTATACAAA TTAGTCATTG AATATAGATC : 36757133
36757132 : CTGGTTAGTG ACAGGATCAG AGGGAACAAA GTTTCTTGAA AGCTTAAGAA : 36757083
36757082 : TGATCAGAAG TTGAATTGCA TTCTCTTAAG GAGAGCACCA AGATCCTTTG : 36757033
36757032 : TGAAATTTAA AATTTTGCCC TTTCTCCATC AGGAGGTTAA CACCCTGCGC : 36756983
589 : GAGGTTAA CACCCTGCGC : 606
197 : E V N T L R : 202
36756982 : TGCCAGCTTG GAGACCGCCT CAACGTGGAG GTGGACGCTG CTCCCACTGT : 36756933
607 : TGCCAGCTTG GAGACCGCCT CAACGTGGAG GTGGACGCTG CTCCCACTGT : 656
203 : C Q L G D R L N V E V D A A P T V : 219
36756932 : GGACCTGAAC CAGGTCCTGA ATGAGACCAG GAGTCAGTAT GAGGCCCTGG : 36756883
657 : GGACCTGAAC CAGGTCCTGA ATGAGACCAG GAGTCAGTAT GAGGCCCTGG : 706
220 : D L N Q V L N E T R S Q Y E A L : 235
36756882 : TGGAAACCAA CCGCAGGGAA GTGGAGCAAT GGTTCGCCAC GCAGGTGGGC : 36756833
707 : TGGAAACCAA CCGCAGGGAA GTGGAGCAAT GGTTCGCCAC GCAG : 750
236 : V E T N R R E V E Q W F A T Q : 250
36756832 : ATCTAAGCAC GTGGCCACTC AGGACCCGAG GCCCCCCAGG GCCCCGGAGG : 36756783
36756782 : CAGGGTCTGA TCCTTTCTCC CCTTGGGTGT TTCAGACCGA GGAGCTGAAC : 36756733
751 : ACCGA GGAGCTGAAC : 765
251 : T E E L N : 255
36756732 : AAGCAGGTGG TATCCAGCTC GGAGCAGCTG CAGTCCTACC AGGCGGAGAT : 36756683
766 : AAGCAGGTGG TATCCAGCTC GGAGCAGCTG CAGTCCTACC AGGCGGAGAT : 815
256 : K Q V V S S S E Q L Q S Y Q A E I : 272
36756682 : CATCGAGCTG AGACGCACGG TCAATGCCCT GGAGATCGAG CTGCAGGCCC : 36756633
816 : CATCGAGCTG AGACGCACGG TCAATGCCCT GGAGATCGAG CTGCAGGCCC : 865
273 : I E L R R T V N A L E I E L Q A : 288
36756632 : AGCACAACCT GGTGTGTATT GTTCAGACCT GCTGGTGAGC GATGGGAACT : 36756583
866 : AGCACAACCT G : 876
289 : Q H N L : 292
36756582 : TGGGAGGCAG AGTCTTGGGG ATGCCCTTGG GGCCACACAC TCTCCTTAGC : 36756533
36756532 : TCTTGGAGCT TGTGAGTTCT TTGGAACCCC ATGGAGGAAC CTTATAAGGA : 36756483
36756482 : GCAGCTCTCT GACACTCTCA ATCTTCCCCA CCACAGCGAG ACTCTCTGGA : 36756433
877 : CGAG ACTCTCTGGA : 890
293 : R D S L E : 297
36756432 : AAACACGCTG ACAGAGAGCG AGGCCCGCTA CAGCTCCCAG CTGTCCCAGG : 36756383
891 : AAACACGCTG ACAGAGAGCG AGGCCCGCTA CAGCTCCCAG CTGTCCCAGG : 940
298 : N T L T E S E A R Y S S Q L S Q : 313
36756382 : TGCAGAGACT GATCACCAAC GTGGAGTCCC AGCTGGCGGA GATCCGCAGT : 36756333
941 : TGCAGAGACT GATCACCAAC GTGGAGTCCC AGCTGGCGGA GATCCGCAGT : 990
314 : V Q R L I T N V E S Q L A E I R S : 330
36756332 : GACCTGGAGC GGCAGAACCA GGAGTATCAG GTGCTGCTGG ACGTGCGGGC : 36756283
991 : GACCTGGAGC GGCAGAACCA GGAGTATCAG GTGCTGCTGG ACGTGCGGGC : 1040
331 : D L E R Q N Q E Y Q V L L D V R A : 347
36756282 : GCGGCTGGAG TGTGAGATCA ACACGTACCG GAGCCTGCTG GAGAGCGAGG : 36756233
1041 : GCGGCTGGAG TGTGAGATCA ACACGTACCG GAGCCTGCTG GAGAGCGAGG : 1090
348 : R L E C E I N T Y R S L L E S E : 363
36756232 : ACTGCAAGTC AGTATGGGGG TAGTAATCTT CTCCTTGGGG CATGTTAGGT : 36756183
1091 : ACTGCAA : 1097
364 : D C K : 366
36756182 : TGCTGTGGAG ATGAATAGTC TTCTTGATGG AAATGAATTT ATAATTTCAA : 36756133
36756132 : GCTTTTGTTT GGAGGAAGTC CAAGGAGGAA CAATATTCTT ACACAAAGCC : 36756083
36756082 : CTTTTAGTAT TTTCCAATCT CTTCAGCTCA TTTTACTCTA ACTTGCTCCA : 36756033
36756032 : ATGATTTTCT GCCCACAGGC TCCCCTCCAA CCCCTGCGCC ACAACCAATG : 36755983
1098 : GC TCCCCTCCAA CCCCTGCGCC ACAACCAATG : 1129
367 : L P S N P C A T T N : 376
36755982 : CATGTGACAA GTCCACTGGG CCCTGTATCT CTAATCCCTG TGGCCTACGT : 36755933
1130 : CATGTGACAA GTCCACTGGG CCCTGTATCT CTAATCCCTG TGGCCTACGT : 1179
377 : A C D K S T G P C I S N P C G L R : 393
36755932 : GCTCGGTGTG GGCCTTGCAA CACATTTGGG TACTAG : 36755897
1180 : GCTCGGTGTG GGCCTTGCAA CACATTTGGG TACTAG : 1215
394 : A R C G P C N T F G Y : 404
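
The three tracks above (genomic coordinate on the minus strand, mRNA position, protein residue) can be cross-checked programmatically. The following is a minimal Python sketch, not a tool provided by the database. The names `exon1_genomic_coord` and `translate` are illustrative. The coordinate mapping assumes that mRNA numbering skips position 0, as the rows above imply, and it is only valid inside the first exon (mRNA positions -37 to 348), because later exons are separated by introns.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

EXON1_GENOMIC_START = 36760582  # genomic coordinate paired with mRNA position -37
UTR5_LENGTH = 37                # mRNA positions -37 .. -1
EXON1_LAST_MRNA_POS = 348       # last mRNA position before the first intron


def exon1_genomic_coord(mrna_pos):
    """Map an mRNA position in exon 1 to its minus-strand genomic coordinate."""
    if mrna_pos == 0 or not -UTR5_LENGTH <= mrna_pos <= EXON1_LAST_MRNA_POS:
        raise ValueError("position is outside exon 1 or is the nonexistent position 0")
    # Number of bases downstream of mRNA position -37 (there is no position 0).
    if mrna_pos < 0:
        offset = mrna_pos + UTR5_LENGTH
    else:
        offset = mrna_pos + UTR5_LENGTH - 1
    # The gene lies on the minus strand, so genomic coordinates decrease.
    return EXON1_GENOMIC_START - offset


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First two mRNA rows of the view above (positions -37 to 63), spaces removed.
rows = (
    "GGACTCTGTC TTCAGCTGGA CACTCCCTCC CTGCACCATG TCTTACAGTT "
    "GTGGCCTGCC CAGCCTGAGC TGCCGCACCA GCTGCTCCTC CCGGCCCTGT"
)
mrna = rows.replace(" ", "")
cds_start = mrna[UTR5_LENGTH:]  # coding positions 1 to 63

print(translate(cds_start))       # MSYSCGLPSLSCRTSCSSRPC
print(exon1_genomic_coord(14))    # 36760532
```

The translated string matches residues 1 to 21 in the protein track, and position 14 maps to the genomic coordinate that opens the second row.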
mRNA SEQUENCE

Reference sequence last updated on 2009-07-21 17:12:53

| RefSeq ID | NM_004138.2 |
|---|---|
| GI | 14917116 |
| Length | 1252 nucleotides |

Sequence (Legend: Blue = CDS, Red = UTR)

-37 GGACTCTGTC TTCAGCTGGA CACTCCCTCC CTGCACCATG TCTTACAGTT 13
14 GTGGCCTGCC CAGCCTGAGC TGCCGCACCA GCTGCTCCTC CCGGCCCTGT 63
64 GTGCCCCCCA GCTGCCACGG CTGCACCCTG CCCGGGGCCT GCAACATCCC 113
114 CGCCAATGTG AGCAACTGCA ACTGGTTCTG TGAGGGCTCC TTCAATGGCA 163
164 GTGAGAAGGA GACCATGCAG TTCCTGAACG ACCGCCTGGC CAGCTACCTG 213
214 GAGAAGGTGC GTCAGCTGGA GCGGGACAAC GCGGAGCTGG AGAACCTCAT 263
264 CCGGGAGCGG TCACAGCAGC AGGAGCCCTT GGTGTGTGCC AGCTACCAGT 313
314 CCTACTTCAA GACCATTGAG GAGCTCCAGC AGAAGATCCT GTGCAGCAAG 363
364 TCTGAGAATG CCAGGCTTGT GGTGCAGATC GACAATGCCA AGCTGGCCTC 413
414 AGATGACTTC AGGACCAAAT ATGAGACCGA GCTGTCCCTG CGGCAGCTGG 463
464 TGGAGTCGGA CATCAATGGC CTGCGCAGGA TCCTGGATGA GCTGACCCTG 513
514 TGCAGGTCTG ACCTGGAGGC CCAGGTGGAG TCCCTGAAGG AGGAGCTGCT 563
564 GTGCCTCAAG CAGAACCATG AGCAGGAGGT TAACACCCTG CGCTGCCAGC 613
614 TTGGAGACCG CCTCAACGTG GAGGTGGACG CTGCTCCCAC TGTGGACCTG 663
664 AACCAGGTCC TGAATGAGAC CAGGAGTCAG TATGAGGCCC TGGTGGAAAC 713
714 CAACCGCAGG GAAGTGGAGC AATGGTTCGC CACGCAGACC GAGGAGCTGA 763
764 ACAAGCAGGT GGTATCCAGC TCGGAGCAGC TGCAGTCCTA CCAGGCGGAG 813
814 ATCATCGAGC TGAGACGCAC GGTCAATGCC CTGGAGATCG AGCTGCAGGC 863
864 CCAGCACAAC CTGCGAGACT CTCTGGAAAA CACGCTGACA GAGAGCGAGG 913
914 CCCGCTACAG CTCCCAGCTG TCCCAGGTGC AGAGACTGAT CACCAACGTG 963
964 GAGTCCCAGC TGGCGGAGAT CCGCAGTGAC CTGGAGCGGC AGAACCAGGA 1013
1014 GTATCAGGTG CTGCTGGACG TGCGGGCGCG GCTGGAGTGT GAGATCAACA 1063
1064 CGTACCGGAG CCTGCTGGAG AGCGAGGACT GCAAGCTCCC CTCCAACCCC 1113
1114 TGCGCCACAA CCAATGCATG TGACAAGTCC ACTGGGCCCT GTATCTCTAA 1163
1164 TCCCTGTGGC CTACGTGCTC GGTGTGGGCC TTGCAACACA TTTGGGTACT 1213
1214 AG
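
The stated length of 1252 nucleotides can be reconciled with the position labels by simple arithmetic. The sketch below is illustrative only: the helper `span_length` is hypothetical and not a database tool. It assumes, as the Integrated Sequence View indicates, that numbering runs from -37 to 1215 with no position 0, and that the coding sequence occupies positions 1 to 1215 including the terminal TAG stop codon.

```python
def span_length(first, last):
    """Count positions from first to last inclusive, skipping a nonexistent position 0."""
    n = last - first + 1
    if first < 0 < last:
        n -= 1  # the numbering jumps from -1 straight to 1
    return n


mrna_length = span_length(-37, 1215)       # 5' UTR plus coding sequence
cds_length = span_length(1, 1215)          # coding sequence including the stop codon
codons, remainder = divmod(cds_length, 3)
residues = codons - 1                      # the stop codon encodes no residue

print(mrna_length, cds_length, codons, remainder, residues)  # 1252 1215 405 0 404
```

The result agrees with the listed length and with the protein track, which ends at residue 404.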

RELATED SEQUENCES AND MULTIPLE SEQUENCE ALIGNMENT

The following table shows the orthologous sequences in other organisms as indicated by NCBI HomoloGene ID 68245. Information last updated on 2007-08-17 12:31:29

Vertebrate ClustalW multiple sequence alignments

Multiple sequence alignments were created using ClustalW for vertebrate sequences, and the results are available for viewing, download and visualization. Please use the analysis tools to include other sequences.

| Type of ClustalW alignment | Download in FASTA format | View file in FASTA format |
|---|---|---|
| Nucleotide | file | view |
| Protein | file | view |