ID G3WG38_SARHA Unreviewed; 285 AA.
AC G3WG38;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 69.
DE RecName: Full=CXXC-type zinc finger protein 5 {ECO:0000256|ARBA:ARBA00039661};
GN Name=CXXC5 {ECO:0000313|Ensembl:ENSSHAP00000014393.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000014393.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000014393.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000014393.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000256|ARBA:ARBA00004496}.
CC Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3WG38; -.
DR Ensembl; ENSSHAT00000014513.2; ENSSHAP00000014393.2; ENSSHAG00000012293.2.
DR eggNOG; ENOG502QT2M; Eukaryota.
DR GeneTree; ENSGT00940000154108; -.
DR HOGENOM; CLU_074593_0_0_1; -.
DR TreeFam; TF326617; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR InterPro; IPR040388; CXXC4/CXXC5.
DR InterPro; IPR002857; Znf_CXXC.
DR PANTHER; PTHR13419:SF2; CXXC-TYPE ZINC FINGER PROTEIN 5; 1.
DR PANTHER; PTHR13419; ZINC FINGER-CONTAINING; 1.
DR Pfam; PF02008; zf-CXXC; 1.
DR PROSITE; PS51058; ZF_CXXC; 1.
PE 4: Predicted;
KW Cytoplasm {ECO:0000256|ARBA:ARBA00022490};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00509}.
FT DOMAIN 220..261
FT /note="CXXC-type"
FT /evidence="ECO:0000259|PROSITE:PS51058"
FT REGION 1..69
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 82..103
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..23
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 55..69
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 285 AA; 30340 MW; A879B42ED702FF98 CRC64;
MSSLVGSPQP DSGGSSKSGT VEKGSSDEPL PPANPERRNK SGIISEPLNK SLRKSRPLSH
YSSFSSSSSA GELVDKAAAA SGLANGHDPP MDKTNSTSKH KSSAVASLLS KAERAAELSA
EGQLTLQQFA QSTEMLKRVV QEHLPLMSEA GAGLPDMEAV SGTEALNGPS DFPYLGAFPI
NPGLFIMTPA GVFLAESALH MAGLAEYPMQ SELASAISSG KKKRKRCGMC PPCRRRINCE
QCSSCRNRKT GHQICKFRKC EELKKKPSAA LEVMLPTGAA FRWFQ
//