ID   H2KVB0_CLOSI            Unreviewed;       707 AA.
AC   H2KVB0;
DT   21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT   21-MAR-2012, sequence version 1.
DT   27-MAR-2024, entry version 57.
DE   RecName: Full=One cut domain family member {ECO:0000256|RuleBase:RU361129};
GN   ORFNames=CLF_111721 {ECO:0000313|EMBL:GAA32120.1};
OS   Clonorchis sinensis (Chinese liver fluke).
OC   Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC   Digenea; Opisthorchiida; Opisthorchiata; Opisthorchiidae; Clonorchis.
OX   NCBI_TaxID=79923 {ECO:0000313|EMBL:GAA32120.1, ECO:0000313|Proteomes:UP000008909};
RN   [1] {ECO:0000313|EMBL:GAA32120.1}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Henan {ECO:0000313|EMBL:GAA32120.1};
RX   PubMed=22023798; DOI=10.1186/gb-2011-12-10-r107;
RA   Wang X., Chen W., Huang Y., Sun J., Men J., Liu H., Luo F., Guo L., Lv X.,
RA   Deng C., Zhou C., Fan Y., Li X., Huang L., Hu Y., Liang C., Hu X., Xu J.,
RA   Yu X.;
RT   "The draft genome of the carcinogenic human liver fluke Clonorchis
RT   sinensis.";
RL   Genome Biol. 12:R107-R107(2011).
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
CC       ECO:0000256|RuleBase:RU000682}.
CC   -!- SIMILARITY: Belongs to the CUT homeobox family.
CC       {ECO:0000256|ARBA:ARBA00008190, ECO:0000256|RuleBase:RU361129}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; DF144420; GAA32120.1; -; Genomic_DNA.
DR   AlphaFoldDB; H2KVB0; -.
DR   Proteomes; UP000008909; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   CDD; cd00086; homeodomain; 1.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR   Gene3D; 1.10.260.40; lambda repressor-like DNA-binding domains; 1.
DR   InterPro; IPR003350; CUT_dom.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR010982; Lambda_DNA-bd_dom_sf.
DR   PANTHER; PTHR14057:SF32; HOMEOBOX PROTEIN ONECUT; 1.
DR   PANTHER; PTHR14057; TRANSCRIPTION FACTOR ONECUT; 1.
DR   Pfam; PF02376; CUT; 1.
DR   Pfam; PF00046; Homeodomain; 1.
DR   SMART; SM01109; CUT; 1.
DR   SMART; SM00389; HOX; 1.
DR   SUPFAM; SSF46689; Homeodomain-like; 1.
DR   SUPFAM; SSF47413; lambda repressor-like DNA-binding domains; 1.
DR   PROSITE; PS51042; CUT; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
PE   3: Inferred from homology;
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Homeobox {ECO:0000256|PROSITE-ProRule:PRU00108,
KW   ECO:0000256|RuleBase:RU000682};
KW   Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
KW   ECO:0000256|RuleBase:RU000682};
KW   Reference proteome {ECO:0000313|Proteomes:UP000008909};
KW   Transcription {ECO:0000256|RuleBase:RU361129};
KW   Transcription regulation {ECO:0000256|RuleBase:RU361129}.
FT   DOMAIN          259..345
FT                   /note="CUT"
FT                   /evidence="ECO:0000259|PROSITE:PS51042"
FT   DOMAIN          360..420
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DNA_BIND        362..421
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   REGION          223..267
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          425..456
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          682..707
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        428..456
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   707 AA;  75879 MW;  93C9B14B8B2D3FCC CRC64;
     MMSSDPSNSL SASSGCGTVA MAAVPISALA PSKINFPLGD LENIPIFNSG DGAALELQAL
     QGNAQALQGI QTVKLTLINC GDQQTILLTT PSTHFTMPDQ NLDGVTLTLP DNLLATDFNL
     TGLQSFQLGD LPVMSDKCGV PDNAKLTYSN MTSLPPISSV SDKLYNQLTE DSDSVQPSHG
     YAIEDLKPVL QRNGEKLHDD NGNYSLSDPT SVALTSPVAS LPQTTFAQNI PPKPGKLMDS
     VDESSLPRQD NTDADANDPS CPDDMTELNT KDLAQRISAE LKRYTIPQAV FAQRVLCRSQ
     GTLSDLLRNP KPWSKLKSGR ETFRRMWNWL NEPEFQRMSA LRLATCKRKT EENQKPIEER
     STKKPRLVFT DIQRRTLHAI FKETKRPSKE MQATIAQQLN LEVSTVANFF MNARRRSLDK
     WVDDKDVQHT VSTSSPATSI ETSPPNHTTH VSSHLGDHCS LQPVHAALPT SQMSEVSDSV
     LHRIPGCHAG VPTSLELPSS PAPPRLTPDP TVSFSHADVD LNTLCAVSEN EHTSLSNGLL
     LSGDSAALQT QLLGHVLVGG QNAMLLSHNL SPTSDKALAQ PMSAPGLTCM ASNLSSIKSA
     LDSLSDPPTL SPAHPHHLTR VSMDSLRDTH SLQHNMTVGS VPDAVCTANH VLPSAATLIG
     HDRLGLDSVG HMSGSTVLTS SLGSSVPIHP KQEDLGSGLL GGTNALN
//