ID A0A183P303_9TREM Unreviewed; 1590 AA.
AC A0A183P303;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=TOG domain-containing protein {ECO:0000313|WBParaSite:SMTD_0000873801-mRNA-1};
GN ORFNames=SMTD_LOCUS8739 {ECO:0000313|EMBL:VDP46259.1};
OS Schistosoma mattheei.
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC Digenea; Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma.
OX NCBI_TaxID=31246 {ECO:0000313|WBParaSite:SMTD_0000873801-mRNA-1};
RN [1] {ECO:0000313|WBParaSite:SMTD_0000873801-mRNA-1}
RP IDENTIFICATION.
RG WormBaseParasite;
RL Submitted (JUN-2016) to UniProtKB.
RN [2] {ECO:0000313|EMBL:VDP46259.1, ECO:0000313|Proteomes:UP000269396}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Denwood {ECO:0000313|EMBL:VDP46259.1}, and Denwood, Zambia
RC {ECO:0000313|Proteomes:UP000269396};
RG Pathogen Informatics;
RL Submitted (NOV-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the GCN1 family.
CC {ECO:0000256|ARBA:ARBA00007366}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; UZAL01029148; VDP46259.1; -; Genomic_DNA.
DR STRING; 31246.A0A183P303; -.
DR WBParaSite; SMTD_0000873801-mRNA-1; SMTD_0000873801-mRNA-1; SMTD_0000873801.
DR Proteomes; UP000269396; Unassembled WGS sequence.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 5.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR000357; HEAT.
DR InterPro; IPR021133; HEAT_type_2.
DR InterPro; IPR034085; TOG.
DR PANTHER; PTHR23346:SF7; EIF-2-ALPHA KINASE ACTIVATOR GCN1; 1.
DR PANTHER; PTHR23346; TRANSLATIONAL ACTIVATOR GCN1-RELATED; 1.
DR Pfam; PF02985; HEAT; 2.
DR SMART; SM01349; TOG; 1.
DR SUPFAM; SSF48371; ARM repeat; 2.
DR PROSITE; PS50077; HEAT_REPEAT; 3.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000269396};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 200..436
FT /note="TOG"
FT /evidence="ECO:0000259|SMART:SM01349"
FT REPEAT 376..414
FT /note="HEAT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00103"
FT REPEAT 475..513
FT /note="HEAT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00103"
FT REPEAT 596..632
FT /note="HEAT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00103"
FT REGION 1049..1117
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1055..1111
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1590 AA; 174957 MW; B102D602B2233CC6 CRC64;
MGRILCPESP DRWRERIGLA MILTRLSDAP AVSLSTSSSS TSQILFRQSL LIGSNEEKTE
DNINPNGFDT ENFSSTSDDM SNQSPMWLLN MFRFLVSDGL NDRNTAVQSE MLQAGLRAVR
NFGKQYIGQI LPILENYVNK APNVPELDSV RQSILILTGS LSQHLDSTDP RVGTIFNRLL
NTLSFPSDLV QQAVEDSLAS LIGKLSEEQT AKTINKLMTT LLSSNNYAER HGAAHGIAGI
ARGLGIMSLK HHGIIDKIIP ALDDTKVAKR REGALMAVER LSLGMGRLFE PYVVRLITPL
LNTFGDTNPG VREAASNAAR AVMSKLSAHG VKLILPALLK AIDDQQSWRT KAEAVDLLAS
MTHCASKQLS ACLPQIVPRL LEVLVDSQDR VKQAGVRALT QIGKVIRNPE VQALVPLLTN
CLQQPLADKT PCLAALRDTC FVHVLDAPSL ALILPVIQRA FADRSTETPP YVSTILPLLK
TCLLDAVPEV RSAAAAALGA VVRGMGETSF SELLPWLMST LTSETSSVDR SGGAQGLAEV
LGGMGIEKLR VVLPDLIRTV SSESKLQPHI RDGYLMLFIY LPTVFQDDFA EFIGPIIPTI
LKSLSDETEF LRETALRAAQ RIVHMFSETS LELLLPQLEQ GMTDSNWRIR HSSVQLLGEL
LYRISGLSGK GTTKTTNEDD TFGTVEAHER LREIIGDERH NRILARLHLS RSDPIIIVRQ
SAIHIWKIVV PNTPRTLREI MPVLVRLLLD TLGSSSREHQ QIAARALGDV VRKLGERILP
EIIPLLVTGL DSPDADQRRG VCTGLIEIIR SCQSDLLSNY ADSLLDPIRR TLCDPLVEVR
RNGGKTFELL YAAIGIRSLD GILPDLLAQL DDPETSHYAL DGIKQLLAVK GKAVMPYLVP
KLTHPVVNVK AFAYLASVAG EALTKQLGRI LPALLQTVSL MSESDYNQNN ENKDIEEETE
NEDLEHCAAV LVCIYEATGI RQILNELLSG LSTTVTIDET TNNTTTQNKL VPGSSAYRLA
CLRLLRAYLE ASFQDSSDVV TSIVKSNVNN KSSDEDETDE SDVDDDDLED EEASDDGSFN
SDEDYSDEDD MDKDDDDSYD DDDDDEMEEN PDAKEVISRV LNESYPLALR NICRLLASTD
KTTLSEAWKC LETLFKRWNP ETITSQIGDL RQGIRGAISE MNKLAAANKN EGQKYLPGFS
DPTLPLVSLV KLYAECTLRG RPAIKEPSAQ GLSECIIHAN GTALQGCVIK VIGPLIRLLG
ERQTNVVRVA VLQSLTSLVN KCPQSVRPFV TQLQSTFLKC LGDSHKQTRI LGGEGLSSIV
PITPKLDPLL IDLARVSSQT VVSRFHDYLE DTSSEIEDML PKSGVSATAH TLAGVAAFPD
TSLQALRLCL EHSRGRAGLT ALNTILHSLI PLMRLPESCG NEIRIETNPH RNNIVDDNEY
NSEDNDDEQE SVLGSFNQTK QVLISSDQLR IIVSSCIGFI VVAAQATIDN FSQSDNEKIQ
KKLELVDLFE RKLCLSSSTK QDVQWTFNQS QGIALLIPLK HTPEILINSE TIQSGFCERL
GQFIKQLSIH ENVSLISTII SICNHNYCAY
//