GenomeNet

Database: UniProt
Entry: G7YSN2_CLOSI
LinkDB: G7YSN2_CLOSI
Original site: G7YSN2_CLOSI 
ID   G7YSN2_CLOSI            Unreviewed;       473 AA.
AC   G7YSN2;
DT   25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT   25-JAN-2012, sequence version 1.
DT   27-MAR-2024, entry version 45.
DE   SubName: Full=Homeobox protein bagpipe {ECO:0000313|EMBL:GAA55962.1};
GN   ORFNames=CLF_109461 {ECO:0000313|EMBL:GAA55962.1};
OS   Clonorchis sinensis (Chinese liver fluke).
OC   Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC   Digenea; Opisthorchiida; Opisthorchiata; Opisthorchiidae; Clonorchis.
OX   NCBI_TaxID=79923 {ECO:0000313|EMBL:GAA55962.1, ECO:0000313|Proteomes:UP000008909};
RN   [1] {ECO:0000313|EMBL:GAA55962.1}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Henan {ECO:0000313|EMBL:GAA55962.1};
RX   PubMed=22023798; DOI=10.1186/gb-2011-12-10-r107;
RA   Wang X., Chen W., Huang Y., Sun J., Men J., Liu H., Luo F., Guo L., Lv X.,
RA   Deng C., Zhou C., Fan Y., Li X., Huang L., Hu Y., Liang C., Hu X., Xu J.,
RA   Yu X.;
RT   "The draft genome of the carcinogenic human liver fluke Clonorchis
RT   sinensis.";
RL   Genome Biol. 12:R107-R107(2011).
RN   [2]
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=Henan;
RA   Wang X., Huang Y., Chen W., Liu H., Guo L., Chen Y., Luo F., Zhou W.,
RA   Sun J., Mao Q., Liang P., Zhou C., Tian Y., Men J., Lv X., Huang L.,
RA   Zhou J., Hu Y., Li R., Zhang F., Lei H., Li X., Hu X., Liang C., Xu J.,
RA   Wu Z., Yu X.;
RT   "The genome and transcriptome sequence of Clonorchis sinensis provide
RT   insights into the carcinogenic liver fluke.";
RL   Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC       ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; DF144123; GAA55962.1; -; Genomic_DNA.
DR   AlphaFoldDB; G7YSN2; -.
DR   Proteomes; UP000008909; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR   CDD; cd00086; homeodomain; 1.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR017970; Homeobox_CS.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR020479; Homeobox_metazoa.
DR   PANTHER; PTHR24340; HOMEOBOX PROTEIN NKX; 1.
DR   Pfam; PF00046; Homeodomain; 1.
DR   PRINTS; PR00024; HOMEOBOX.
DR   SMART; SM00389; HOX; 1.
DR   SUPFAM; SSF46689; Homeodomain-like; 1.
DR   PROSITE; PS00027; HOMEOBOX_1; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
PE   4: Predicted;
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW   ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000008909}.
FT   DOMAIN          246..306
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DNA_BIND        248..307
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   REGION          306..340
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        324..340
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   473 AA;  53298 MW;  7CD4B8776A9B7875 CRC64;
     MDGSAFSIVN LIDTSQKANQ QYNSDSRHTV KSEQLNHLTV PERTDEVTKP ATLQVTANVE
     PPSFPNLNNV PPIRTDGVSH HDDLLNRLYK EYRDTFWRAL QIKHEALMNQ VRCQLTPKQN
     GLADAYFRPA FPADSTASVE VRDDWSPTHL LYSTDRSDGK NIQNNGFGEK NLSILIENQN
     KPQSKGLVLA SCVPSQLVSP NDLYRSWRDQ ALMQTQPVKA QHPCNDSPLF QFLPGSKCYP
     ALSSLSRKKR TRAAFSHAQV FELERRFTYQ RYLSAPERAE LARSLRLSET QVKIWFQNRR
     YKTKKRQLSS SCESPPPEEL APHSRSTASP NGSNVSNKEE VVDTLPEFNT SIVQEKRDDD
     HFVAAGHPVN KSTLDGTKLP GSEATMAYQL TNCAYLIPPL ANPTYCTGSV PREAAVKLPH
     CSQLNSMKSR IHMLSFDLNA FPDLLQAYLR SQTDPNDFTQ FSPVKKTVST FNA
//
DBGET integrated database retrieval system