ID A0A0P7W591_SCLFO Unreviewed; 652 AA.
AC A0A0P7W591;
DT 20-JAN-2016, integrated into UniProtKB/TrEMBL.
DT 20-JAN-2016, sequence version 1.
DT 27-MAR-2024, entry version 31.
DE RecName: Full=C2H2-type domain-containing protein {ECO:0000259|PROSITE:PS50157};
GN ORFNames=Z043_124515 {ECO:0000313|EMBL:KPP57735.1};
OS Scleropages formosus (Asian bonytongue) (Osteoglossum formosum).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Osteoglossocephala;
OC Osteoglossomorpha; Osteoglossiformes; Osteoglossidae; Scleropages.
OX NCBI_TaxID=113540 {ECO:0000313|EMBL:KPP57735.1, ECO:0000313|Proteomes:UP000034805};
RN [1] {ECO:0000313|EMBL:KPP57735.1, ECO:0000313|Proteomes:UP000034805}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Aro1 {ECO:0000313|EMBL:KPP57735.1};
RA Tan M.H., Gan H.M., Croft L.J., Austin C.M.;
RT "The genome of the Asian arowana (Scleropages formosus).";
RL Submitted (AUG-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KPP57735.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JARO02015565; KPP57735.1; -; Genomic_DNA.
DR Proteomes; UP000034805; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 3.
DR InterPro; IPR041697; Znf-C2H2_11.
DR InterPro; IPR039149; ZNF800.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR21020; UNCHARACTERIZED; 1.
DR PANTHER; PTHR21020:SF0; ZINC FINGER PROTEIN 800; 1.
DR Pfam; PF00096; zf-C2H2; 3.
DR Pfam; PF16622; zf-C2H2_11; 1.
DR Pfam; PF13909; zf-H2C2_5; 1.
DR SMART; SM00355; ZnF_C2H2; 7.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 3.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 5.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 4.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000034805};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00042}.
FT DOMAIN 294..322
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 354..382
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 514..541
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 605..627
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT REGION 190..287
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 572..592
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 625..652
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 244..271
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 272..286
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 578..592
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 625..645
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 652 AA; 73930 MW; 0C03F5C633DF54C8 CRC64;
MNSDCCGPSN KLRFGGVVCS SRNERHVRCG FGQERSHKSL SSSAVDERAV PQERWMLCGG
ANGANTAVPF AIEPGDPAVL QRPLQTSKSG IQQIIECFRS GTAQLKHILL KEVDTIFECK
LCRSMFRSLP NLIAHKEFYC FPTRLRNHLR RTARAKPXXX XKDLLAAIYP RKDGPEYVVR
LEPIESNQNA VFQFLTTEEE QLPQEEPEAV PPSEPDVTDE DPAESLEAAR PAEPAVQQDP
PQEQQEDSAH EEPEAVPPKE REEQEVKREQ EEEEQKEEEE EEEGAKSWMD DVTISCCLCG
KDFNSRRGVR RHCRKMHKAK LEELRKFTDT RTVPISLLSM VKDRSSCPPP VPGRSCPVCQ
KTFATKANVR RHFDEVHRGL RRDYITPDIA TKPGQPLSLE PPGAARLRKQ KTKMEYNLSA
CTCLLCKRKY SSQMMLKRHL HIVHKIDTVE NGTAAKRSTT SCAKAKGETW GSAKSEGKGV
PAAAFPPSEE ELKSMSKARL RKLKLSMGFD FKQLYCKLCK RQFTTRQNLT KHMDLHMDGS
EIYIKFYRCP LCSYESRRKR DVLRHMSVVH KKSSRXPTGG PITRQQDAPE TGGSVLKVSN
NFVLHTCDVC GRAFGKKVYL ESHRRTHKTT GTLKPPEESR KKGRSTRSKL FL
//