ID A0A1A6GE06_NEOLE Unreviewed; 456 AA.
AC A0A1A6GE06;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE RecName: Full=C2H2-type domain-containing protein {ECO:0000259|PROSITE:PS50157};
GN ORFNames=A6R68_07965 {ECO:0000313|EMBL:OBS63557.1};
OS Neotoma lepida (Desert woodrat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC Cricetidae; Neotominae; Neotoma.
OX NCBI_TaxID=56216 {ECO:0000313|EMBL:OBS63557.1, ECO:0000313|Proteomes:UP000092124};
RN [1] {ECO:0000313|EMBL:OBS63557.1, ECO:0000313|Proteomes:UP000092124}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=417 {ECO:0000313|EMBL:OBS63557.1};
RC TISSUE=Liver {ECO:0000313|EMBL:OBS63557.1};
RA Campbell M., Oakeson K.F., Yandell M., Halpert J.R., Dearing D.;
RT "The Draft Genome Sequence and Annotation of the Desert Woodrat Neotoma
RT lepida.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OBS63557.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LZPO01099512; OBS63557.1; -; Genomic_DNA.
DR STRING; 56216.A0A1A6GE06; -.
DR Proteomes; UP000092124; Unassembled WGS sequence.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 11.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR24390:SF241; EXPRESSED SEQUENCE AW146154-RELATED; 1.
DR PANTHER; PTHR24390; ZINC FINGER PROTEIN; 1.
DR Pfam; PF00096; zf-C2H2; 6.
DR SMART; SM00355; ZnF_C2H2; 11.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 7.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 9.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 11.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000092124};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00042}.
FT DOMAIN 138..165
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 162..189
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 191..218
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 219..246
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 247..274
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 275..301
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 302..329
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 330..357
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 358..385
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 386..413
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 414..441
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT REGION 58..123
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 96..115
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 456 AA; 51088 MW; FE938BA6F8CD78F3 CRC64;
MEEDGYKTGL SQKLILKQLK RPGRLGASLV SAQTWDLQDD TVLGSKAVLW KPPPIVNGDS
IKESHESEGA FPLGCPFSSQ QGTPIEEHSP ARCSKDTQSA ERLTQNHQRT GSPSGEPRKS
PLEHRQHWVG TDAEESMFAC SQCRKVFLQS SALALHLRRH SLQCQEWDKA FPWSTNLVQH
QRSHTGGGKP FFCGECGKAF SCHSSLNVHH RVHTGERPYK CGACEKAFSC SSLLSMHRRV
HTGERPYACN ACGKAFNQRT HLTRHLRIHT GEKPYKCGCG KAFTCHSSLT VHEKIHSGDK
PFKCGECSKA FHSXARLTLH QRTHTGEKPF KCSNCGKAFS CHSYLTVHQR THSGEKPFRC
NECGKAFGSH SYLIVHQRVH TGEKPFDCSR CWKAFSCHSS LIVHQRVHTG EKPYKCHQCG
KAFSQNHCLI KHQKVHSREK ASECSEVTVS IDALHP
//