ID G5B8K0_HETGA Unreviewed; 4421 AA.
AC G5B8K0;
DT 14-DEC-2011, integrated into UniProtKB/TrEMBL.
DT 14-DEC-2011, sequence version 1.
DT 27-MAR-2024, entry version 44.
DE SubName: Full=Hornerin {ECO:0000313|EMBL:EHB05611.1};
GN ORFNames=GW7_10294 {ECO:0000313|EMBL:EHB05611.1};
OS Heterocephalus glaber (Naked mole rat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Hystricomorpha; Bathyergidae;
OC Heterocephalus.
OX NCBI_TaxID=10181 {ECO:0000313|EMBL:EHB05611.1, ECO:0000313|Proteomes:UP000006813};
RN [1] {ECO:0000313|EMBL:EHB05611.1, ECO:0000313|Proteomes:UP000006813}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21993625; DOI=10.1038/nature10533;
RA Kim E.B., Fang X., Fushan A.A., Huang Z., Lobanov A.V., Han L.,
RA Marino S.M., Sun X., Turanov A.A., Yang P., Yim S.H., Zhao X.,
RA Kasaikina M.V., Stoletzki N., Peng C., Polak P., Xiong Z., Kiezun A.,
RA Zhu Y., Chen Y., Kryukov G.V., Zhang Q., Peshkin L., Yang L., Bronson R.T.,
RA Buffenstein R., Wang B., Han C., Li Q., Chen L., Zhao W., Sunyaev S.R.,
RA Park T.J., Zhang G., Wang J., Gladyshev V.N.;
RT "Genome sequencing reveals insights into physiology and longevity of the
RT naked mole rat.";
RL Nature 479:223-227(2011).
CC -!- SUBCELLULAR LOCATION: Cytoplasmic granule
CC {ECO:0000256|ARBA:ARBA00004463}.
CC -!- SIMILARITY: Belongs to the S100-fused protein family.
CC {ECO:0000256|ARBA:ARBA00038258}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JH169004; EHB05611.1; -; Genomic_DNA.
DR STRING; 10181.G5B8K0; -.
DR eggNOG; ENOG502QQH0; Eukaryota.
DR InParanoid; G5B8K0; -.
DR Proteomes; UP000006813; Unassembled WGS sequence.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0046914; F:transition metal ion binding; IEA:InterPro.
DR CDD; cd00213; S-100; 1.
DR Gene3D; 1.10.238.10; EF-hand; 1.
DR InterPro; IPR011992; EF-hand-dom_pair.
DR InterPro; IPR018247; EF_Hand_1_Ca_BS.
DR InterPro; IPR002048; EF_hand_dom.
DR InterPro; IPR034325; S-100_dom.
DR InterPro; IPR001751; S100/CaBP7/8-like_CS.
DR InterPro; IPR013787; S100_Ca-bd_sub.
DR PANTHER; PTHR22571:SF24; FILAGGRIN-2; 1.
DR PANTHER; PTHR22571; FILAGGRIN-RELATED; 1.
DR Pfam; PF01023; S_100; 1.
DR SMART; SM01394; S_100; 1.
DR SUPFAM; SSF47473; EF-hand; 1.
DR PROSITE; PS00018; EF_HAND_1; 1.
DR PROSITE; PS50222; EF_HAND_2; 1.
DR PROSITE; PS00303; S100_CABP; 1.
PE 3: Inferred from homology;
KW Calcium {ECO:0000256|ARBA:ARBA00022837};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000006813}.
FT DOMAIN 49..84
FT /note="EF-hand"
FT /evidence="ECO:0000259|PROSITE:PS50222"
FT REGION 97..302
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 97..123
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 124..145
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 146..194
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 200..302
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 4421 AA; 461187 MW; A431DC91342280DD CRC64;
MPKLLQGIVT VVDVFYQYAT QHGECDMLNK EELKELLENE FRQILKNPDD PDTVDIIMQN
LDRDHNKQVD FIEYLLMIFK LFQACNKIIG KDYCQASGSK QKDHSHQHQE EQSETEEEEG
KGQKSSSTYS SSSAGENGSY SRGSRGSTEH KPRSTSRKLR HQGDLSSSEH RESSEERTES
SSSHFKNSEK NKHGSHQQYK KKGGQSPNQQ HHGFSSGSCE KQDYQSSSSD LTSQQQKYGS
EPRQSSSYEK YRSESGKFTS NDKNQSSSYQ SSTQRKLSSS SSHQLGNYGR QNHGSGAGKN
TMNPTQVVIQ GVVKIKHIAL DQVNPLIMEN MDLAVANLLV RNTMSPPQGV MNNMGQGQIS
HLAMANRVQA QSSLQDMVNM DLDQANILAT GNINQVQESH LVLVNKSLAQ EDLLVLVNMG
LDQDSHQAVA NMDPTQVSLP AVDRMCLDQV SLLATGNIGL VQVHLLAVVN MANTDPDQAN
LLGTGNIGLV HSNLPAVVDT GLAQVSLLAT VSMDLDPVSP QVMDNMVLHQ DNHHGVVSMD
PAQVSLPAMV SMGLNQISLL AIANIGLDHL PGTGDMGLVK GSILVLVNIG LAQVSLLALV
NMDLAQVSLP ALANMHLAQE SLLDLVSMDQ VQVSLPAMVT IGLAQGSLLA MVNMGLAQVS
LPALANTHLA QESLLALVSM DQDQVSLPAL ASMHLAQDSL LALVSMDQDQ VSLPAMVTMG
LVQGSLLAMV NMGLAQVSLP ALANMHLAQE SLLALVSMDQ DQVSLPAVVT MGLVQVNLLA
TVDISLARVS LLAIADMGLH PVSLQDAADM GLDQISLLPL GNKSLAQVSL PALANMHLAQ
ESLLALVNMV QDQVSLPAVV TMGLVQVNLL AIVDISLAQV SLLAIADMGL HPVSLQDAAN
RGLDQISLLP LGNMSLAQVS LLALVNMVQD QVSLPAVVTM GLVQVNLLAI VDISLAQVSL
LAIADMGLHP VSLQDAADMG LDQISLLPLG NMSLAQVSLL ALVNMVQDQV SLPAVVTMGL
VQVNLLAIVD ISLAQVSLLA IADMGLHPVS LQDAADMGLD QISLLPLGNM SLAQVSLLAL
VNMVQDQVSL PAVVTMGLVQ VNLLAIVDIS LAQVSLLAIA DMGLHPVSLQ DAADMGLDQI
SLLPLGNMSL AQVSLLALVN MVQDQVSLPA VVTMGLVQVN LLAIVDISLA QVSLLAIADM
GLHPVSLQDA ADMGLDQISL LPLGNMSLAQ VSLLALVNMV QDQVSLPAVV TMGLVQVNLL
AIVDISLAQV SLLAIADMGL HPVSLQDAAD MGLDQISLLP LGNMSLAQVS LLALVNMVQD
QVSLPAVVTM GLVQVNLLAI VDISLAQVSL LAIADMGLHP VSLQDAADMG LDQISLLPLG
NMSLAQVSLL ALVNMVQDQV SLPAVVTMGL VQVNLLAIVD ISLAQVSLLA IADMGLHPVS
LQDAADMGLD QISLLPLGNM SLAQVSLLAL VNMVQDQVSL PAVVTMGLVQ VNLLAIVDIS
LAQVSLLAIA DMGLHPVSLQ DAADMGLDQI SLLPLGNMSL AQVSLLALVN MVQDQVSLPA
VVTMGLVQVN LLAIVDISLA QVSLLAIADM GLHPVSLQDA ADMGLDQISL LPLGNMSLAQ
VSLLALVNMV QDQVSLPAVV TMGLVQVNLL AIVDISLAQV SLLAIADMGL HPVSLQDAAD
MGLDQISLLP LGNMSLAQVS LLALVNMVQD QVSLPAVVTM GLVQVNLLAI VDISLAQVSL
LAIADMGLHP VSLQDAADMG LDQISLLPLG NMSLAQVSLL ALVNMVQDQV SLPAVVTMGL
VQVNLLAIVD ISLAQVSLLA IADMGLHPVS LQDAADMGLD QISLLPLGNM SLAQVSLLAL
VNMVQDQVSL PAVVTMGLVQ VNLLAIVDIS LAQVSLLAIA DMGLHPVSLQ DAADMGLDQI
SLLPLGNMSL AQVSLLALVN MVQDQVSLPA VVTMGLVQVN LLAIVDISLA QVSLLAIADM
GLHPVSLQDA ADMGLDQISL LPLGNMSLAQ VSLLALVNMV QDQVSLPAVV TMGLVQVNLL
AIVDISLAQV SLLAIADMGL HPVSLQDAAD MGLDQISLLP LGNMSLAQVS LLALVNMVQD
QVSLPAVVTM GLVQVNLLAI VDISLAQVSL LAIADMGLHP VSLQDAADMG LDQISLLPLG
NMSLAQVSLL ALVNMVQDQV SLPAVVTMGL VQVNLLAIVD ISLAQVSLLA IADMGLHPVS
LQDAADMGLD QISLLPLGNM SLAQVSLLAL VNMVQDQVSL PAVVTMGLVQ VNLLAIVDIS
LAQVSLLAIA DMGLHPVSLQ DAADMGLDQI SLLPLGNMSL AQVSLLALVN MVQDQVSLPA
VVTMGLVQVN LLAIVDISLA QVSLLAIADM GLHPVSLQDA ADMGLDQISL LPLGNMSLAQ
VSLLALVNMV QDQVSLPAVV TMGLVQVNLL AIVDISLAQV SLLAIADMGL HPVSLQDAAD
MGLDQISLLP LGNMSLAQVS LLALVNMVQD QVSLPAVVTM GLVQVNLLAI VDISLAQVSL
LAIADMGLHP VSLQDAADMG LDQISLLPLG NMSLAQVSLL ALVNMVQDQV SLPAVVTMGL
VQVNLLAIVD ISLAQVSLLA IADMGLHPVS LQDAADMGLD QISLLPLGNM SLAQVSLLAL
VNMVQDQVSL PAVVTMGLVQ VNLLAIVDIS LAQVSLLAIA DMGLHPVSLQ DAADMGLDQI
SLLPLGNMSL AQVSLLALVN MVQDQVSLPA VVTMGLVQVN LLAIVDISLA QVSLLAIADM
GLHPVSLQDA ADMGLDQISL LPLGNMSLAQ VSLLALVNMV QDQVSLPAVV TMGLVQVNLL
AIVDISLAQV SLLAIADMGL HPVSLQDAAD MGLDQISLLP LGNMSLAQVS LLALVNMVQD
QVSLPAVVTM GLVQVNLLAI VDISLAQVSL LAIADMGLHP VSLQDAADMG LDQISLLPLG
NMSLAQVSLL ALVNMVQDQV SLPAVVTMGL VQVNLLAIVD ISLAQVSLLA IADMGLHPVS
LQDAADMGLD QISLLPLGNM SLAQVSLLAL VNMVQDQVSL PAVVTMGLVQ VNLLAIVDIS
LAQVSLLAIA DMGLHPVSLQ DAADMGLDQI SLLPLGNMSL AQVSLLALVN MVQDQVSLPA
VVTMGLVQVN LLAIVDISLA QVSLLAIADM GLHPVSLQDA ADMGLDQISL LPLGNMSLAQ
VSLLALVNMV QDQVSLPAVV TMGLVQVNLL AIVDISLAQV SLLAIADMGL HPVSLQDAAD
MGLDQISLLP LGNMSLAQVS LLALVNMVQD QVSLPAVVTM GLVQVNLLAI VDISLAQVSL
LAIADMGLHP VSLQDAADMG LDQISLLPLG NMSLAQVSLL ALVNMVQDQV SLPAVVTMGL
VQVNLLAIVD ISLAQVSLLA IADMGLHPVS LQDAADMGLD QISLLPLGNM SLAQVSLLAL
VNMVQDQVSL PAVVTMGLVQ VNLLAIVDIS LAQVSLLAIA DMGLHPVSLQ DAADMGLDQI
SLLPLGNMSL AQVSLLALVN MVQDQVSLPA VVTMGLVQVN LLAIVDISLA QVSLLAIADM
GLHPVSLQDA ADMGLDQISL LPLGNMSLAQ VSLLALVNMV QDQVSLPAVV TMGLVQVNLL
AIVDISLAQV SLLAIADMGL HPVSLQDAAD MGLDQISLLP LGNMSLAQVS LLALVNMVQD
QVSLPAVVTM GLVQVNLLAI VDISLAQVSL LAIADMGLHP VSLQDAADMG LDQISLLPLG
NMSLAQVSLL ALVNMVQDQV SLPAVVTMGL VQVNLLAIVD ISLAQVSLLA IADMGLHPVS
LQDAADMGLD QISLLPLGNM SLAQVSLLAL VNMVQDQVSL PAVVTMGLVQ VNLLAIVDIS
LAQVSLLAIA DMGLHPVSLQ DAADMGLDQI SLLPLGNMSL AQVSLLALVN MVQDQVSLPA
VVTMGLVQVN LLATVDISLA RVSLLAIADM GLHPVSLQDA ADMGLDQISL LPLGNMSLAQ
VSLLALVNMV QDQVSLPAVV TMGLVQVNLL ATVDISLARV SLLAIADMGL HPVSLQDAAN
RGLDQISLLP LGNMSLAQVS LLALVNMVQD QVSLPAVVTM GLVQVNLLAT VDISLARVSL
LAIADMGLHP VSLQDAANRG LDQISLLPLG NMSLAQVSLL ALVNMVQDQV SLPAVVTMGL
VQVNLLAIVD ISLARVSLLA IADMGLHPVS LQDAANRGLD QISLLPLGNM SLAQVSLLAL
VNMVQDQVSL PAVVTMGLVQ VNLLAIVDIS LAQVSLLAIA DMGLHPVSLQ DAANRGLDQI
SLLPLGNMSL AQVNILAMDN IAQHQDSHHV LVNMDPAQAS LPAMVNMDVV QVSLLMMASM
DLLQVIVLAI ANTSIPQVCL NTLSLVHVLH LPLDNIVLGL VIVLTLNNMG LAHVTHLVLD
HVALDLVSVL VLSKGDVETS GNLAIAIQIV MKGNGEVVID K
//