ID A0A0B2VRJ7_TOXCA Unreviewed; 351 AA.
AC A0A0B2VRJ7;
DT 04-MAR-2015, integrated into UniProtKB/TrEMBL.
DT 04-MAR-2015, sequence version 1.
DT 27-MAR-2024, entry version 46.
DE SubName: Full=Protein gooseberry {ECO:0000313|EMBL:KHN83949.1};
GN Name=gsb {ECO:0000313|EMBL:KHN83949.1};
GN ORFNames=Tcan_15563 {ECO:0000313|EMBL:KHN83949.1};
OS Toxocara canis (Canine roundworm).
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Spirurina; Ascaridomorpha; Ascaridoidea; Toxocaridae; Toxocara.
OX NCBI_TaxID=6265 {ECO:0000313|EMBL:KHN83949.1, ECO:0000313|Proteomes:UP000031036};
RN [1] {ECO:0000313|EMBL:KHN83949.1, ECO:0000313|Proteomes:UP000031036}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PN_DK_2014 {ECO:0000313|EMBL:KHN83949.1};
RA Zhu X.-Q., Korhonen P.K., Cai H., Young N.D., Nejsum P.,
RA von Samson-Himmelstjerna G., Boag P.R., Tan P., Li Q., Min J., Yang Y.,
RA Wang X., Fang X., Hall R.S., Hofmann A., Sternberg P.W., Jex A.R.,
RA Gasser R.B.;
RT "Genetic blueprint of the zoonotic pathogen Toxocara canis.";
RL Submitted (NOV-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the paired homeobox family.
CC {ECO:0000256|ARBA:ARBA00005733}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KHN83949.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JPKZ01001120; KHN83949.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0B2VRJ7; -.
DR STRING; 6265.A0A0B2VRJ7; -.
DR OMA; RYYRTGI; -.
DR Proteomes; UP000031036; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 2.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR001523; Paired_dom.
DR InterPro; IPR043565; PAX_fam.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR PANTHER; PTHR45636; PAIRED BOX PROTEIN PAX-6-RELATED-RELATED; 1.
DR PANTHER; PTHR45636:SF38; PROTEIN GOOSEBERRY-RELATED; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF00292; PAX; 1.
DR PRINTS; PR00027; PAIREDBOX.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00351; PAX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 2.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS51057; PAIRED_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Homeobox {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682}; Paired box {ECO:0000256|ARBA:ARBA00022724};
KW Reference proteome {ECO:0000313|Proteomes:UP000031036};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 14..141
FT /note="Paired"
FT /evidence="ECO:0000259|PROSITE:PS51057"
FT DOMAIN 228..288
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 230..289
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 137..156
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 195..235
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 351 AA; 39406 MW; FC96A00998D97A92 CRC64;
MNDTKAISFT NIMGQGRVNQ LGGVFINGRP LPQHIRLKII EMASNGIKPC HISRQLRVSH
GAVSKILNRY AETGSISPGQ IGGNPRSRLA IQAVEKHILA LKAEKPTICA SELRARLIEQ
EVCSRENAPT VSSINRHIRA KRLHSPPAKR EKKSKLNHSI ENILGISIGL HGLKRATIMP
LKRALGENKQ TKCIDRNADE RKLESPETSS SDEESSIRAD DESEPMTKKA RRNRTSFTCE
QLDVLENAFR ANTYPDQEER ERIAATTRLS EEKIMTWFSN RRARCRKNLS ISSAHPFIMH
SLTPQTTVPV AQPPLVTMFS QPIGFFPPTV SPQSLKLHEQ PIIYYPQYTA I
//