ID A0A1A6HIX8_NEOLE Unreviewed; 1003 AA.
AC A0A1A6HIX8;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 27-MAR-2024, entry version 24.
DE RecName: Full=C2H2-type domain-containing protein {ECO:0000259|PROSITE:PS00028};
GN ORFNames=A6R68_20014 {ECO:0000313|EMBL:OBS77597.1};
OS Neotoma lepida (Desert woodrat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC Cricetidae; Neotominae; Neotoma.
OX NCBI_TaxID=56216 {ECO:0000313|EMBL:OBS77597.1, ECO:0000313|Proteomes:UP000092124};
RN [1] {ECO:0000313|EMBL:OBS77597.1, ECO:0000313|Proteomes:UP000092124}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=417 {ECO:0000313|EMBL:OBS77597.1};
RC TISSUE=Liver {ECO:0000313|EMBL:OBS77597.1};
RA Campbell M., Oakeson K.F., Yandell M., Halpert J.R., Dearing D.;
RT "The Draft Genome Sequence and Annotation of the Desert Woodrat Neotoma
RT lepida.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OBS77597.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LZPO01028837; OBS77597.1; -; Genomic_DNA.
DR STRING; 56216.A0A1A6HIX8; -.
DR Proteomes; UP000092124; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 2.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR45891; ZINC FINGER HOMEOBOX PROTEIN; 1.
DR PANTHER; PTHR45891:SF2; ZINC FINGER HOMEOBOX PROTEIN 4; 1.
DR Pfam; PF00096; zf-C2H2; 1.
DR Pfam; PF12874; zf-met; 1.
DR SMART; SM00355; ZnF_C2H2; 9.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 2.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000092124};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 577..598
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS00028"
FT DOMAIN 608..629
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS00028"
FT REGION 1..20
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 387..446
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 486..579
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 689..723
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 395..415
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 430..446
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 502..520
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1003 AA; 109734 MW; E47DD92741ED4D1E CRC64;
MEPDRENSST DDNLKTDERK SEVLLGFSIE NAAATQVTSA KEIPCNECAT SFPSLQKYME
HHCPNARLPV LKDDESEISE LEDSDVENLT GEIVYQPDGS AYIIEDSKES GQNAQTGANS
KLFSTAMFLD SLASAGEKSD QSASAPVSFY PQIINTFHIA SSLGKPFTAD QAFPNTSALA
GVGPVLHSFR VYDLRHKREK DYLTSDGSAK NSCVSKDVPN NVDLSKFDGC VSDGKRKPVL
MCFLCKLSFG YIRSFVTHAV HDHRMTLNDE EQRLLGNKCV SAIIQGIGKD KEPLISFLEP
KKSTSVYPHF STTNLIGPDP TFRGLWSAFH VENGDSLPAG FAFLKGSRSP SSSAEQPLGI
TQMPKAEVNL GGLSSLVVNT PITSVSLSHS SSESSKMSES KDQENNCERP KESTILHPNG
ECPVKSEPTE PGDEDEEDAY SNELDDEEVL GELTDSIGNK DFPLLNQSIS PLSSSVLKFI
EKGTSSSSGA IADDTEKKKQ TAAGRNSGNV TNSYSIGSKD FADVSASRDG ATAAHPSETA
RGDEDSSATP HQHGFTPSTP GTPGPGGDGS PGSGIECPKC DTVLGSSRSL GGHMTMMHSR
NSCKTLKCPK CNWHYKYQQT LEAHMKEKHP EPGGSCVYCK TGQPHPRLAR GESYTCGYKP
FRCEVCNYST TTKGNLSIHM QSDKHLNNVQ NLQNGNGEQV FGHSAPAPNT SLSGCGTPSP
SKPKQKPTWR CEVCDYETNV ARNLRIHMTS EKHMHNMMLL QQNMKQIQHN LHLGLAPAEA
ELYQYYLAQN IGLTGMKLEN PADPQLMINP FQLDSATAAA LAPGLGELSP YISDPALKLF
QCAVCNKFTS DSLEALSVHV NSERSLPEEE WRAVIGDIYQ CKLCNYNTQL KANFQLHCKT
DKHMQKYQLV AHIKEGGKSN EWRLKCIAIG NPVHLKCNAC DYYTNSVDKL RLHTTNHRHE
AALKLYKVDG FVGXGILAKD PEKQNVLMMF PYGPKKKKCY VYC
//