GenomeNet

Database: UniProt
Entry: A0A0L0DIF7_THETB
ID   A0A0L0DIF7_THETB        Unreviewed;       620 AA.
AC   A0A0L0DIF7;
DT   11-NOV-2015, integrated into UniProtKB/TrEMBL.
DT   11-NOV-2015, sequence version 1.
DT   22-NOV-2017, entry version 11.
DE   SubName: Full=Ligatin {ECO:0000313|EMBL:KNC52149.1};
GN   ORFNames=AMSG_00975 {ECO:0000313|EMBL:KNC52149.1};
OS   Thecamonas trahens ATCC 50062.
OC   Eukaryota; Apusozoa; Apusomonadidae; Thecamonas.
OX   NCBI_TaxID=461836 {ECO:0000313|EMBL:KNC52149.1, ECO:0000313|Proteomes:UP000054408};
RN   [1] {ECO:0000313|EMBL:KNC52149.1, ECO:0000313|Proteomes:UP000054408}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=ATCC 50062 {ECO:0000313|EMBL:KNC52149.1,
RC   ECO:0000313|Proteomes:UP000054408};
RG   The Broad Institute Genome Sequencing Platform;
RA   Russ C., Cuomo C., Shea T., Young S.K., Zeng Q., Koehrsen M., Haas B.,
RA   Borodovsky M., Guigo R., Alvarado L., Berlin A., Bochicchio J.,
RA   Borenstein D., Chapman S., Chen Z., Freedman E., Gellesch M.,
RA   Goldberg J., Griggs A., Gujja S., Heilman E., Heiman D., Hepburn T.,
RA   Howarth C., Jen D., Larson L., Mehta T., Park D., Pearson M.,
RA   Roberts A., Saif S., Shenoy N., Sisk P., Stolte C., Sykes S.,
RA   Thomson T., Walk T., White J., Yandava C., Burger G., Gray M.W.,
RA   Holland P.W.H., King N., Lang F.B.F., Roger A.J., Ruiz-Trillo I.,
RA   Lander E., Nusbaum C.;
RT   "The Genome Sequence of Thecamonas trahens ATCC 50062.";
RL   Submitted (MAY-2010) to the EMBL/GenBank/DDBJ databases.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; GL349436; KNC52149.1; -; Genomic_DNA.
DR   RefSeq; XP_013762152.1; XM_013906698.1.
DR   EnsemblProtists; KNC52149; KNC52149; AMSG_00975.
DR   GeneID; 25560749; -.
DR   Proteomes; UP000054408; Unassembled WGS sequence.
DR   GO; GO:0003743; F:translation initiation factor activity; IEA:InterPro.
DR   Gene3D; 2.30.130.10; -; 1.
DR   Gene3D; 3.30.780.10; -; 1.
DR   InterPro; IPR002478; PUA.
DR   InterPro; IPR015947; PUA-like_sf.
DR   InterPro; IPR036974; PUA_sf.
DR   InterPro; IPR001950; SUI1.
DR   InterPro; IPR036877; SUI1_dom_sf.
DR   InterPro; IPR004521; Uncharacterised_CHP00451.
DR   Pfam; PF01472; PUA; 1.
DR   Pfam; PF01253; SUI1; 1.
DR   SUPFAM; SSF55159; SSF55159; 1.
DR   SUPFAM; SSF88697; SSF88697; 1.
DR   TIGRFAMs; TIGR00451; unchar_dom_2; 1.
DR   PROSITE; PS50890; PUA; 1.
DR   PROSITE; PS50296; SUI1; 1.
PE   4: Predicted;
KW   Complete proteome {ECO:0000313|Proteomes:UP000054408};
KW   Reference proteome {ECO:0000313|Proteomes:UP000054408}.
FT   DOMAIN      133    212       PUA. {ECO:0000259|PROSITE:PS50890}.
FT   DOMAIN      525    591       SUI1. {ECO:0000259|PROSITE:PS50296}.
SQ   SEQUENCE   620 AA;  64752 MW;  39912321F52F50CD CRC64;
     MFRKSLETSG RARVKGAVVK KVRAALQAVV PGLPAAVAGL VLHKKTPVTE VKIREPKGVV
     IYEAVVSTAN KCDDEHPLNK VLDALRECPD SDVAAEAVAF SEASIPLVFD RSNGKLADVV
     PSVYLLWMAP TAIPVMRTHR PVLDRLRGGA DLMLPGVVAT QDEIRSWRVG ELVAVTLTEG
     SAAIAVGACL VDADHVLHYG MVGKAVEILH VWLDELSQLW PLGVCSLPHL AGPAAPAAAA
     GPEPESQPDP GAESDTADPV RVDAPDEAGP SAAPTSRDEA DASLRHAFLN ALKTRGADLR
     AALPLETSTF MSAYVVPSRA VGETVSIKAS SFKNLTKFLK SQAADGLISI KDKAGVMFVV
     SVASASHPAL ASHVASRTVY ENDAAAAAKA ERRAAAQAAQ ARASALPFFR ELVAPARRAE
     SLFLALGLDP SKDAPIPQGK LTKALANYVR ERELGSGAAT ALDANLDSAL TAKARAAFVG
     DDGRIARKDL VRALLAANAV KKVTEFNPPY GTQLLLRGEP PKLVVTVANR GGNRRKATTV
     VSNLAALQLD AASLAGDLST ACAASATVRD NDLVVQGDKV DDVIGYLASE WGVPGKAVTL
     VDKRKGKGGG GGRGRGKKRK
//