GenomeNet

Database: UniProt
Entry: T1GUM2_MEGSC
LinkDB: T1GUM2_MEGSC
Original site: T1GUM2_MEGSC 
ID   T1GUM2_MEGSC            Unreviewed;       608 AA.
AC   T1GUM2;
DT   16-OCT-2013, integrated into UniProtKB/TrEMBL.
DT   16-OCT-2013, sequence version 1.
DT   22-FEB-2023, entry version 28.
DE   RecName: Full=THO complex subunitTHOC2 C-terminal domain-containing protein {ECO:0000259|Pfam:PF11262};
OS   Megaselia scalaris (Humpbacked fly) (Phora scalaris).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Platypezoidea;
OC   Phoridae; Megaseliini; Megaselia.
OX   NCBI_TaxID=36166 {ECO:0000313|EnsemblMetazoa:MESCA007439-PA, ECO:0000313|Proteomes:UP000015102};
RN   [1] {ECO:0000313|Proteomes:UP000015102}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=Durham, NC isolate 2 -- Noor lab
RC   {ECO:0000313|Proteomes:UP000015102};
RA   Hughes D.;
RL   Submitted (FEB-2013) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EnsemblMetazoa:MESCA007439-PA}
RP   IDENTIFICATION.
RG   EnsemblMetazoa;
RL   Submitted (JUN-2015) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CAQQ02394034; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   AlphaFoldDB; T1GUM2; -.
DR   STRING; 36166.T1GUM2; -.
DR   EnsemblMetazoa; MESCA007439-RA; MESCA007439-PA; MESCA007439.
DR   HOGENOM; CLU_031324_0_0_1; -.
DR   OMA; RNERTHE; -.
DR   Proteomes; UP000015102; Unassembled WGS sequence.
DR   GO; GO:0000347; C:THO complex; IEA:InterPro.
DR   GO; GO:0006406; P:mRNA export from nucleus; IEA:InterPro.
DR   GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR   InterPro; IPR040007; Tho2.
DR   InterPro; IPR021418; THO_THOC2_C.
DR   PANTHER; PTHR21597:SF0; THO COMPLEX SUBUNIT 2; 1.
DR   PANTHER; PTHR21597; THO2 PROTEIN; 1.
DR   Pfam; PF11262; Tho2; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000015102}.
FT   DOMAIN          2..241
FT                   /note="THO complex subunitTHOC2 C-terminal"
FT                   /evidence="ECO:0000259|Pfam:PF11262"
FT   REGION          333..608
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        333..402
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        431..446
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        461..475
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        491..599
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   608 AA;  72097 MW;  2E26CBEC0FF9FA77 CRC64;
     MEKLQDEQKK QEEHVNKIKE RLFQEKDNWF LTRSARSAKN DTITQFLQLC LFPRCTFTAL
     DAIYCAKFVQ TIHKLKTVNF STILCYDRIF CDITFSVTSF TENEANRYGR FLCAMLETAM
     SWHNDQATFN RECTNYPGFV TKFRVSNQFS EANDHVGYEN YRHVCHKWHF KITKALLSCL
     TSKDYMQIRN ALIILMRILP HFPVLVNLAQ VIERKTEKVR EDEKNQRPDL FVLASSYIAQ
     LKTRQITMIK ESDFHQISEK TKESKTDDAL MKIDSNGNSS NGNNIVIKTE PKEIKNEKKV
     IKSTAAAIII PESEQIIEIK EEPTDIIKIE RSKESRKEER ARERERSERL EKSDRLERNE
     RSDRNNDRAE RERERDRELA EELAARKRKE EKKRDRSPHY NSKHSNADTG PQSPIYPNSP
     YYYRGGGIGN NDYEKESERE RDRDLSSVSN SSNGSLRRSQ EIMQYDKESK DAKWTPPVLR
     LKQQEEIPLE SSSNSRKDRE RSSKTKERNK EKANAADEDK DSRKERKLGR KRERLEENSS
     SDHKRRREVQ NGDDREDREK YHLRSKSPRS ERNERTHEKI RYENQSPPRS ERERSRYNTK
     SQSRINYT
//
DBGET integrated database retrieval system