ID A0A095C6T9_SCHHA Unreviewed; 689 AA.
AC A0A095C6T9;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 24-JAN-2024, entry version 39.
DE SubName: Full=Irx-related protein {ECO:0000313|EMBL:KAF1314788.1};
DE SubName: Full=Pre-B-cell leukemia transcription factor 4 {ECO:0000313|EMBL:KGB37743.1};
GN ORFNames=MS3_0014282 {ECO:0000313|EMBL:KAF1314788.1}, MS3_06095
GN {ECO:0000313|EMBL:KGB37743.1};
OS Schistosoma haematobium (Blood fluke).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC Digenea; Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma.
OX NCBI_TaxID=6185 {ECO:0000313|EMBL:KGB37743.1};
RN [1] {ECO:0000313|EMBL:KGB37743.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=22246508; DOI=10.1038/ng.1065;
RA Young N.D., Jex A.R., Li B., Liu S., Yang L., Xiong Z., Li Y.,
RA Cantacessi C., Hall R.S., Xu X., Chen F., Wu X., Zerlotini A., Oliveira G.,
RA Hofmann A., Zhang G., Fang X., Kang Y., Campbell B.E., Loukas A.,
RA Ranganathan S., Rollinson D., Rinaldi G., Brindley P.J., Yang H., Wang J.,
RA Wang J., Gasser R.B.;
RT "Whole-genome sequence of Schistosoma haematobium.";
RL Nat. Genet. 44:221-225(2012).
RN [2] {ECO:0000313|EMBL:KAF1314788.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=31494670;
RA Stroehlein A.J., Korhonen P.K., Chong T.M., Lim Y.L., Chan K.G.,
RA Webster B., Rollinson D., Brindley P.J., Gasser R.B., Young N.D.;
RT "High-quality Schistosoma haematobium genome achieved by single-molecule
RT and long-range sequencing.";
RL Gigascience 8:0-0(2019).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMPZ02000613; KAF1314788.1; -; Genomic_DNA.
DR EMBL; KL250921; KGB37743.1; -; Genomic_DNA.
DR RefSeq; XP_012797505.1; XM_012942051.1.
DR AlphaFoldDB; A0A095C6T9; -.
DR STRING; 6185.A0A095C6T9; -.
DR EnsemblMetazoa; XM_012942051.2; XP_012797505.1; MS3_0014282.
DR GeneID; 24593507; -.
DR KEGG; shx:MS3_00010791; -.
DR CTD; 24593507; -.
DR OrthoDB; 2999675at2759; -.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR008422; Homeobox_KN_domain.
DR Pfam; PF05920; Homeobox_KN; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}.
FT DOMAIN 589..643
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 591..644
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..43
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 458..489
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..39
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 465..480
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 689 AA; 77140 MW; 4E088884B95EAC72 CRC64;
MDLDCTSNSS VTDSQSNEHT VGSSPTELGY ENTIGTPTGV ERYSDLYDNR SNKIVAEDPR
NNLESSVNES GYSSYCNYSY ISTEDAITKL EESEPQFCVN ACHSIYTIPY VPENLTNNTC
RYVEFTSDNN ETVNAVASIV SHGVNSCSYI PVSNMITTVI NHNFTPTVTN EFTKIFNYQC
DTISTPDISR KNDNQRNFKS DPDTFWISQR DDIHKSVSIR TGHLSNDKQN ALDGYVDSGN
IPTSTDQQLL QPQACLNNYR VKEDESKESY NSNFPEFAYN LMNTHSQHYT TSDFINVSSQ
NSVIDCISSY QDSSQNSESV NACVKGPSFY SQTHNVYWPW VDRFYAENCA SKLGSVCNSD
VATKGLINYT NSFSCNPSFG HSYNSHSLFI SGANLPHISE KPNEQSQPSK LSAVNLCTGE
NMSAFHSIAQ ISSDYSGASV VAGNIPNLIL MNAQSSGDFS AQSADNHKES QEDENSFDHS
ETSWQPELNS VQCSSRKRLS DKQRKTICVS ILNGNQTENS HTPQDEITQN SSTSFDISSL
HRENEICLDQ SISGNSYSWT SKSDKLKFNS RLRSKRYSDA INLTRNRPLN QTALSVMESW
YTNHVDNPYP TTAEKEELAA LGGITVIQVS SWFANRRTRT ANTKPKKNRR KLYHQIYQLA
VEIEALTQGS LRACDLQERI GQIIDEYLT
//