GenomeNet

Database: UniProt
Entry: A0A0L0DT91_THETB
LinkDB: A0A0L0DT91_THETB
Original site: A0A0L0DT91_THETB 
ID   A0A0L0DT91_THETB        Unreviewed;      1240 AA.
AC   A0A0L0DT91;
DT   11-NOV-2015, integrated into UniProtKB/TrEMBL.
DT   11-NOV-2015, sequence version 1.
DT   27-MAR-2024, entry version 29.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:KNC55485.1};
GN   ORFNames=AMSG_01749 {ECO:0000313|EMBL:KNC55485.1};
OS   Thecamonas trahens ATCC 50062.
OC   Eukaryota; Apusozoa; Apusomonadida; Apusomonadidae; Thecamonas.
OX   NCBI_TaxID=461836 {ECO:0000313|EMBL:KNC55485.1, ECO:0000313|Proteomes:UP000054408};
RN   [1] {ECO:0000313|EMBL:KNC55485.1, ECO:0000313|Proteomes:UP000054408}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=ATCC 50062 {ECO:0000313|EMBL:KNC55485.1,
RC   ECO:0000313|Proteomes:UP000054408};
RG   The Broad Institute Genome Sequencing Platform;
RA   Russ C., Cuomo C., Shea T., Young S.K., Zeng Q., Koehrsen M., Haas B.,
RA   Borodovsky M., Guigo R., Alvarado L., Berlin A., Bochicchio J.,
RA   Borenstein D., Chapman S., Chen Z., Freedman E., Gellesch M., Goldberg J.,
RA   Griggs A., Gujja S., Heilman E., Heiman D., Hepburn T., Howarth C., Jen D.,
RA   Larson L., Mehta T., Park D., Pearson M., Roberts A., Saif S., Shenoy N.,
RA   Sisk P., Stolte C., Sykes S., Thomson T., Walk T., White J., Yandava C.,
RA   Burger G., Gray M.W., Holland P.W.H., King N., Lang F.B.F., Roger A.J.,
RA   Ruiz-Trillo I., Lander E., Nusbaum C.;
RT   "The Genome Sequence of Thecamonas trahens ATCC 50062.";
RL   Submitted (MAY-2010) to the EMBL/GenBank/DDBJ databases.
CC   -!- SIMILARITY: Belongs to the eukaryotic initiation factor 4G family.
CC       {ECO:0000256|ARBA:ARBA00005775}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; GL349439; KNC55485.1; -; Genomic_DNA.
DR   RefSeq; XP_013761265.1; XM_013905811.1.
DR   AlphaFoldDB; A0A0L0DT91; -.
DR   STRING; 461836.A0A0L0DT91; -.
DR   EnsemblProtists; KNC55485; KNC55485; AMSG_01749.
DR   GeneID; 25561483; -.
DR   eggNOG; KOG0401; Eukaryota.
DR   OMA; CAPLDIN; -.
DR   OrthoDB; 1123866at2759; -.
DR   Proteomes; UP000054408; Unassembled WGS sequence.
DR   GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR   GO; GO:0003743; F:translation initiation factor activity; IEA:UniProtKB-KW.
DR   Gene3D; 1.25.40.180; -; 3.
DR   InterPro; IPR016024; ARM-type_fold.
DR   InterPro; IPR045208; IF4G.
DR   InterPro; IPR003891; Initiation_fac_eIF4g_MI.
DR   InterPro; IPR003890; MIF4G-like_typ-3.
DR   InterPro; IPR003307; W2_domain.
DR   PANTHER; PTHR23253; EUKARYOTIC TRANSLATION INITIATION FACTOR 4 GAMMA; 1.
DR   PANTHER; PTHR23253:SF9; EUKARYOTIC TRANSLATION INITIATION FACTOR 4G1, ISOFORM B-RELATED; 1.
DR   Pfam; PF02847; MA3; 1.
DR   Pfam; PF02854; MIF4G; 1.
DR   Pfam; PF02020; W2; 1.
DR   SMART; SM00515; eIF5C; 1.
DR   SMART; SM00543; MIF4G; 1.
DR   SUPFAM; SSF48371; ARM repeat; 3.
DR   PROSITE; PS51366; MI; 1.
DR   PROSITE; PS51363; W2; 1.
PE   3: Inferred from homology;
KW   Reference proteome {ECO:0000313|Proteomes:UP000054408}.
FT   DOMAIN          870..993
FT                   /note="MI"
FT                   /evidence="ECO:0000259|PROSITE:PS51366"
FT   DOMAIN          1057..1240
FT                   /note="W2"
FT                   /evidence="ECO:0000259|PROSITE:PS51363"
FT   REGION          1..55
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          170..395
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          670..865
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        189..240
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        283..306
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        314..341
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        352..366
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        670..688
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        694..720
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        750..782
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        805..859
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1240 AA;  132803 MW;  E55FD1BA1C4EDB27 CRC64;
     MAGQEGTKTA PPSLGSDAEA GAGGASAAGG QAGGQGQKGQ QGHPGAMGQG QGMPQQMYPM
     QYGYAPYQGY QPYPSYQGFP QYAGAPPAAY TQHSSPGAPA PAAAKRSGLN PAASIFEYKA
     ASKIVVSTPQ GKEVDINELA GKKAKAAAEA KAAAEAEAKA AADAKTEATT EAVADAKTEA
     AADGADKPNA SEAEAETKAE PKPNASEAEA ETKAETKTEA EAEDKPDAAE TEDKPAADAD
     EAGSSAEATA AAAEAEARAA AAKAAAEAAA EAAAAAEAEA RAAARRAVPH EVEPGKIKRY
     TREQLMHFKD LTAPPDSDVF RKFKEHYEEM ERAEPRQQRS RGKANNRGRR SNRNSRKNDY
     QGGGRGRKRG SRKGRGRRGG YNGGGKGRPT IDPADVKPLE LSESRWKPAT ASQADLDETA
     RVIKAAKGIL NKLTLEKFDR LSKKLVEVGI SSPAILRELI GLIFDKAIDE RPFASMYAAL
     CVFLQENVET FTETNDEGEE EDVSFRRLLL NQCQEKFERP TELSEEEKEG MTEEEIREKN
     HILKLRRLGN IKFIGELFKQ RMISARIMHD CVLTLLKDPE NPDEEELESL CELMETVGKM
     LDVPQAAQYM KAYFQRMTAL STSDKVPSRI RFHLQDIIAA RKNGWEFRNA RKKKGPKTIA
     EIHAEKAAEE RAAAAEAARN DKRDRKRGGR RERGGRRERG GRGDRDRGDR DRDRGRYQRP
     AATDRFAVSR NNYERPEQFN ARKSKKAKAS NSDGWSTVNQ GTAKRPSATS EDHLTLGPSR
     GGSSLASGAR GWKSSSDDKP KSSSGTKKST SVGQNLFALL EAGSSSSSSS SEDTQSGSGS
     ADASGSADAS GSADASASGA ESLTEEDIAE AHKAGKSLFK EYTSAKDTAE AVQCIEEVET
     APAWVAIAGG LLAMYESKAD DVAAAQELLA HILEATELLS SAADVQALLA MVMEALPDLS
     IDAPHAPKYA AASAALLIDA HNLALDDVLT PEALEPFGMS SHRVVSSFVS ALKDARGGNV
     ASVVATWRAS SIDLMEVLPG RMADVSGLAD WMSRAGLDGL FKPEHEVHRV KAKKMQPVLY
     AAFVGGTSID DVLTLIAELP VELRESPEFA RAALAANLEF IAKMAPLANG FADSELKTTV
     TELVAKYASI LVVVVNASLE AQAECVFELQ AYVWDLASAA DGAFPPELLP YLFHTYYDND
     VIKEEAFEAW ASDSRSTPGY EECKAEAAAW LKWLAEANPE
//
DBGET integrated database retrieval system