ID   W6UGV1_ECHGR            Unreviewed;       769 AA.
AC   W6UGV1;
DT   16-APR-2014, integrated into UniProtKB/TrEMBL.
DT   16-APR-2014, sequence version 1.
DT   27-MAR-2024, entry version 26.
DE   SubName: Full=HIV1 enhancer-binding protein 2 {ECO:0000313|EMBL:EUB60760.1};
GN   ORFNames=EGR_04386 {ECO:0000313|EMBL:EUB60760.1};
OS   Echinococcus granulosus (Hydatid tapeworm).
OC   Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Cestoda;
OC   Eucestoda; Cyclophyllidea; Taeniidae; Echinococcus;
OC   Echinococcus granulosus group.
OX   NCBI_TaxID=6210 {ECO:0000313|EMBL:EUB60760.1, ECO:0000313|Proteomes:UP000019149};
RN   [1] {ECO:0000313|EMBL:EUB60760.1, ECO:0000313|Proteomes:UP000019149}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=24013640; DOI=10.1038/ng.2757;
RA   Zheng H., Zhang W., Zhang L., Zhang Z., Li J., Lu G., Zhu Y., Wang Y.,
RA   Huang Y., Liu J., Kang H., Chen J., Wang L., Chen A., Yu S., Gao Z.,
RA   Jin L., Gu W., Wang Z., Zhao L., Shi B., Wen H., Lin R., Jones M.K.,
RA   Brejova B., Vinar T., Zhao G., McManus D.P., Chen Z., Zhou Y., Wang S.;
RT   "The genome of the hydatid tapeworm Echinococcus granulosus.";
RL   Nat. Genet. 45:1168-1175(2013).
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EUB60760.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; APAU02000027; EUB60760.1; -; Genomic_DNA.
DR   AlphaFoldDB; W6UGV1; -.
DR   STRING; 6210.W6UGV1; -.
DR   EnsemblMetazoa; XM_024493635.1; XP_024351956.1; GeneID_36340101.
DR   OrthoDB; 2936270at2759; -.
DR   Proteomes; UP000019149; Unassembled WGS sequence.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR   Gene3D; 3.30.160.60; Classic Zinc Finger; 1.
DR   Gene3D; 2.20.100.10; Thrombospondin type-1 (TSP1) repeat; 1.
DR   InterPro; IPR007110; Ig-like_dom.
DR   InterPro; IPR036383; TSP1_rpt_sf.
DR   InterPro; IPR036236; Znf_C2H2_sf.
DR   InterPro; IPR013087; Znf_C2H2_type.
DR   PANTHER; PTHR24409:SF295; LD43035P; 1.
DR   PANTHER; PTHR24409; ZINC FINGER PROTEIN 142; 1.
DR   SMART; SM00355; ZnF_C2H2; 3.
DR   SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 1.
DR   SUPFAM; SSF82895; TSP-1 type 1 repeat; 1.
DR   PROSITE; PS50835; IG_LIKE; 1.
DR   PROSITE; PS00028; ZINC_FINGER_C2H2_1; 1.
DR   PROSITE; PS50157; ZINC_FINGER_C2H2_2; 1.
PE   4: Predicted;
KW   Membrane {ECO:0000256|SAM:Phobius};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000019149};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP};
KW   Transmembrane {ECO:0000256|SAM:Phobius};
KW   Transmembrane helix {ECO:0000256|SAM:Phobius};
KW   Zinc {ECO:0000256|ARBA:ARBA00022833};
KW   Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW   ProRule:PRU00042}.
FT   SIGNAL          1..19
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           20..769
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5004884800"
FT   TRANSMEM        417..438
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   DOMAIN          254..390
FT                   /note="Ig-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50835"
FT   DOMAIN          641..668
FT                   /note="C2H2-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50157"
SQ   SEQUENCE   769 AA;  87341 MW;  A05F1F8FDD48D4CF CRC64;
     MPKWVGILLF QIFSILVECD VGVPCAVDGS WCPWSGTVVK CSAACGDSGM GLRTRLCACP
     SPAHGGKSCS LPPGAKEAAM LALSHLQDAA IRKGTSGGKG DGDDDFPMPT AADIAAIADG
     SGKWDACNRK FCPYLKRLTD FEVNITIDDL RLQRPEDAWL WSGGIPSSQH DPVGLHCSPQ
     LRSRTEIYDK RYRFPRARAM WTRSVGRSEY QSYDFVGTPL RANRRLQILR DRLIIRSLDE
     DDEGVYRYGY EYEPLQFATI CFFAVYLSDK VVVIESGKPF KLTCNAKGLW PIIQQTPKDN
     WKTFWAYRPD AKAASLGTKP IEKLWLVDLK PPRIIEEEDI FMNTTEGTMV MTLFDTERRQ
     FDAVAYAMSG YYQCIVQNSP KGLGERNFVT NAVQLIVVSP PTLTEKLRKW IVKHWRAIVY
     LLLTLMVVAL FYMLLIRLRA GRVASLRNWE ALEEAKKRAK LITAGEIKKP KPPQWIRPSP
     VATENHLPSV NNVFVEIFRL LTLPHTPPPL PPLLPPPPTS LAICLMNRET PLDLSRKAAT
     VASGSDTIQR RVKAINGTYH HNRNHHHHYG SGKQPIPRCF ECKRLFPTLW DLNIHFLGEH
     QSTLQREFTQ NRSWKTCSLE NISVHQLQTT LRRGAVERTG YPCPHCDYFA KWPTELQKHI
     MVHSKERPHQ CVICGLTYKW KWDLGRHFDK SHHHAVNPYK KTCLSVRAAR QQQQRVAKRS
     SGHRCGRRSM GVSQIPLPPS SVHLQPYVPP SPFPSFISPF PNPQFGQYF
//