ID A0A1A6GIQ3_NEOLE Unreviewed; 959 AA.
AC A0A1A6GIQ3;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 24-JAN-2024, entry version 20.
DE RecName: Full=C2H2-type domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=A6R68_05791 {ECO:0000313|EMBL:OBS65630.1};
OS Neotoma lepida (Desert woodrat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC Cricetidae; Neotominae; Neotoma.
OX NCBI_TaxID=56216 {ECO:0000313|EMBL:OBS65630.1, ECO:0000313|Proteomes:UP000092124};
RN [1] {ECO:0000313|EMBL:OBS65630.1, ECO:0000313|Proteomes:UP000092124}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=417 {ECO:0000313|EMBL:OBS65630.1};
RC TISSUE=Liver {ECO:0000313|EMBL:OBS65630.1};
RA Campbell M., Oakeson K.F., Yandell M., Halpert J.R., Dearing D.;
RT "The Draft Genome Sequence and Annotation of the Desert Woodrat Neotoma
RT lepida.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OBS65630.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LZPO01089559; OBS65630.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1A6GIQ3; -.
DR STRING; 56216.A0A1A6GIQ3; -.
DR Proteomes; UP000092124; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR InterPro; IPR000949; ELM2_dom.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR16089; REST COREPRESSOR COREST PROTEIN-RELATED; 1.
DR PANTHER; PTHR16089:SF19; TRANSCRIPTIONAL-REGULATING FACTOR 1; 1.
DR Pfam; PF01448; ELM2; 1.
DR Pfam; PF13912; zf-C2H2_6; 1.
DR SMART; SM01189; ELM2; 1.
DR SMART; SM00355; ZnF_C2H2; 1.
DR PROSITE; PS51156; ELM2; 1.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 1.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000092124};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00042}.
FT DOMAIN 522..549
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 811..918
FT /note="ELM2"
FT /evidence="ECO:0000259|PROSITE:PS51156"
FT REGION 194..226
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 259..314
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 331..351
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 388..511
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 537..589
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 611..631
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 194..209
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 388..416
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 437..487
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 496..511
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 558..589
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 611..627
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 959 AA; 105003 MW; 7F9B55E512D366F1 CRC64;
MGDQQLYKTN HVAHGGENLF YQQPPLGVHS GLNHSYGNTI SGAGMDAPQA SPISPHFPQD
TRDSLGLPIG SKNLGQMDTS RQGGWGSHAG PGNHVQLRSN LANSNMMWGA PGQVEPTDGY
QYTYSQASEI RTQKLTSGVL HKLDSFTQVF ANQNLRIQVN NMAQVLHTQS AVMDGASDSA
LRQLLSQKPV EPSASAIASR YQQVPQQPHP GFTGGLPKPA LPVGQHAPQG HLYYDYQQPL
AQMSMQGGQP LQAPQVLSSH MQPMQQHQYY PQPPPQQQQA GLQRISMQEM QQQPQQQIRP
SQPQQQQQLQ LQQRQGSLQI PQYYQPQPMM QHLQEQQQPP MHLQPPSYHR DPHQYTPEQA
HAVQLIQLGS MPQYYYQEPQ QPYSHPLYQQ SHLSQHQQRE DSQLKTYSSD RQTPAMLSSH
GDMGPPDTGV ADPASSEMTR VGSTLPHQPL LSPTGIHLNN MGPQHQQPSP SAMWPQVSQS
QSFSPYLHTH LPDGRTQPGS PESSSGQTKG AFGEQFDAKN KLTCSICLKE FKSLPALNGH
MRSHGGMRAS PSLKQEEGEK APPPPPQPQP QPPLPPPPPP PPLPPEAECL TPMVMPVSVP
VKLLPPKPSS QGFTNSVAAT PSARDKPAST MSDDEMPVLV RMNLSPPHSP QGAAPCAPAE
IPRKHHPPIT AKVEEPLKTL PEKKKFRHRP EPLFIPPPPS SYTPNPTSYS GATLYQSQLR
SPRILGDHLL LDPAHELPPY TPPPMLSPVR QGSGLFSNVL ISGHGPGVHP QLPLTPLTPT
PRVLLCRSSS IDGSNVTVTP GPGEQTVDVE PRINIGLRFQ AEIPELQDVS ALAQDTHKAT
LVWKPWPELE NQALQQQGTG PQEEPGGDFL FSTVENLLNL CCSSALPGGG TNSEFALHSL
FEAKGDVMAT LEMLLLRKPV RLKCHPLANY HYAGSSREEL DTRSAQAQNV GPNLRNIVE
//