ID W6USH0_ECHGR Unreviewed; 340 AA.
AC W6USH0;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 39.
DE SubName: Full=Iroquois-class homeodomain protein IRX-6 {ECO:0000313|EMBL:EUB64580.1};
GN ORFNames=EGR_00530 {ECO:0000313|EMBL:EUB64580.1};
OS Echinococcus granulosus (Hydatid tapeworm).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Cestoda;
OC Eucestoda; Cyclophyllidea; Taeniidae; Echinococcus;
OC Echinococcus granulosus group.
OX NCBI_TaxID=6210 {ECO:0000313|EMBL:EUB64580.1, ECO:0000313|Proteomes:UP000019149};
RN [1] {ECO:0000313|EMBL:EUB64580.1, ECO:0000313|Proteomes:UP000019149}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24013640; DOI=10.1038/ng.2757;
RA Zheng H., Zhang W., Zhang L., Zhang Z., Li J., Lu G., Zhu Y., Wang Y.,
RA Huang Y., Liu J., Kang H., Chen J., Wang L., Chen A., Yu S., Gao Z.,
RA Jin L., Gu W., Wang Z., Zhao L., Shi B., Wen H., Lin R., Jones M.K.,
RA Brejova B., Vinar T., Zhao G., McManus D.P., Chen Z., Zhou Y., Wang S.;
RT "The genome of the hydatid tapeworm Echinococcus granulosus.";
RL Nat. Genet. 45:1168-1175(2013).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108}.
CC -!- SIMILARITY: Belongs to the TALE/IRO homeobox family.
CC {ECO:0000256|ARBA:ARBA00008446}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EUB64580.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; APAU02000002; EUB64580.1; -; Genomic_DNA.
DR AlphaFoldDB; W6USH0; -.
DR STRING; 6210.W6USH0; -.
DR EnsemblMetazoa; XM_024489779.1; XP_024355776.1; GeneID_36336245.
DR OMA; KENKATW; -.
DR OrthoDB; 2915644at2759; -.
DR Proteomes; UP000019149; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR008422; Homeobox_KN_domain.
DR PANTHER; PTHR11211:SF40; HOMEOBOX PROTEIN ARAUCAN-RELATED; 1.
DR PANTHER; PTHR11211; IROQUOIS-CLASS HOMEODOMAIN PROTEIN IRX; 1.
DR Pfam; PF05920; Homeobox_KN; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000019149}.
FT DOMAIN 183..246
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 185..247
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..20
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 25..54
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 70..96
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 114..134
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 276..340
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 36..54
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 79..95
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 280..294
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 340 AA; 37332 MW; 1B6D87AEE94DC86F CRC64;
MAQPLENPSP SQRYPPCAMV PPLPPPFGLA AALTGPRLPL PPPPPPPPLP PGACAPHPML
MLQWMRSLLT QPQPPPTSST VAAAVPPPPP PTSVQPFCAA SEAVQLAGLL SAAKNGAPQQ
GSVPPPPPPS LNSYQQETVE WQHTNQEESE QLRRFRHALC TGSLDPQTMK MYQSFRYSLK
VNNHGTPRRK NTTRETTGLL KAWLNEHRNN PYPNKSEKIM LAVITKMSLT QVSTWFANAR
RRLKKENKAT WSLALDQQFP DLDEGFHGPA GEEFRECEAY PTPSPPPPPR LPLPPYEEEG
REEDMDAVGV GEGMKGTTAT TPEPKRSGKI WSVVEMTEGR
//