GenomeNet

Database: UniProt
Entry: A0A1Y1Z9I0_9FUNG
LinkDB: A0A1Y1Z9I0_9FUNG
Original site: A0A1Y1Z9I0_9FUNG 
ID   A0A1Y1Z9I0_9FUNG        Unreviewed;      2365 AA.
AC   A0A1Y1Z9I0;
DT   30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT   30-AUG-2017, sequence version 1.
DT   24-JAN-2024, entry version 20.
DE   RecName: Full=THO complex subunit 2 {ECO:0000256|ARBA:ARBA00019596};
GN   ORFNames=LY90DRAFT_709169 {ECO:0000313|EMBL:ORY06930.1};
OS   Neocallimastix californiae.
OC   Eukaryota; Fungi; Fungi incertae sedis; Chytridiomycota;
OC   Chytridiomycota incertae sedis; Neocallimastigomycetes; Neocallimastigales;
OC   Neocallimastigaceae; Neocallimastix.
OX   NCBI_TaxID=1754190 {ECO:0000313|EMBL:ORY06930.1, ECO:0000313|Proteomes:UP000193920};
RN   [1] {ECO:0000313|EMBL:ORY06930.1, ECO:0000313|Proteomes:UP000193920}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=G1 {ECO:0000313|EMBL:ORY06930.1,
RC   ECO:0000313|Proteomes:UP000193920};
RG   DOE Joint Genome Institute;
RA   Haitjema C.H., Gilmore S.P., Henske J.K., Solomon K.V., De Groot R.,
RA   Kuo A., Mondo S.J., Salamov A.A., Labutti K., Zhao Z., Chiniquy J.,
RA   Barry K., Brewer H.M., Purvine S.O., Wright A.T., Boxma B., Van Alen T.,
RA   Hackstein J.H., Baker S.E., Grigoriev I.V., O'Malley M.A.;
RT   "A Parts List for Fungal Cellulosomes Revealed by Comparative Genomics.";
RL   Submitted (AUG-2016) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   -!- SIMILARITY: Belongs to the THOC2 family.
CC       {ECO:0000256|ARBA:ARBA00007857}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:ORY06930.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; MCOG01000435; ORY06930.1; -; Genomic_DNA.
DR   STRING; 1754190.A0A1Y1Z9I0; -.
DR   OrthoDB; 179356at2759; -.
DR   Proteomes; UP000193920; Unassembled WGS sequence.
DR   GO; GO:0000347; C:THO complex; IEA:InterPro.
DR   GO; GO:0006406; P:mRNA export from nucleus; IEA:InterPro.
DR   GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR   InterPro; IPR040007; Tho2.
DR   InterPro; IPR021418; THO_THOC2_C.
DR   InterPro; IPR021726; THO_THOC2_N.
DR   InterPro; IPR032302; THOC2_N.
DR   PANTHER; PTHR21597:SF0; THO COMPLEX SUBUNIT 2; 1.
DR   PANTHER; PTHR21597; THO2 PROTEIN; 1.
DR   Pfam; PF11262; Tho2; 1.
DR   Pfam; PF11732; Thoc2; 1.
DR   Pfam; PF16134; THOC2_N; 1.
PE   3: Inferred from homology;
KW   Coiled coil {ECO:0000256|SAM:Coils};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000193920}.
FT   DOMAIN          154..711
FT                   /note="THO complex subunit 2 N-terminal"
FT                   /evidence="ECO:0000259|Pfam:PF16134"
FT   DOMAIN          713..788
FT                   /note="THO complex subunitTHOC2 N-terminal"
FT                   /evidence="ECO:0000259|Pfam:PF11732"
FT   DOMAIN          1047..1355
FT                   /note="THO complex subunitTHOC2 C-terminal"
FT                   /evidence="ECO:0000259|Pfam:PF11262"
FT   REGION          409..459
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1368..1396
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1478..1960
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1975..2365
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COILED          1070..1135
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COMPBIAS        415..430
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        431..459
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1380..1396
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1478..1702
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1704..1722
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1723..1880
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1881..1929
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1930..1954
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1983..2009
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2010..2024
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2025..2159
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2160..2209
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2211..2358
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2365 AA;  280197 MW;  C0239D0123BD2944 CRC64;
     MNTNNDNSSL NVDFQKIFKN WKTEQNAYKK LLKNYIIECN GETQSTSKNL IPVKVCIRNI
     IKAYMNGFIS ASEVSVFFHT LYIDLVKALS IELYNIKTSG YAFMNPTNSK DPLFLPLLIV
     DWIWLIDQEI EGINSNFNDQ VENAKFKSEQ KAKIAALAKE LLSKNFIPAE LMKERFEVEM
     LEAVGLINSS KLFTKKSARI NTSLLYKQTK FNLLREESEG YAKLITEIST NMYTFHPFGL
     SKEEIEKNEA ILNERVHLVL TNIKSLIGYF ALDPNRVLDI IFDIFIANVV QYYKFFINLL
     KASPWRSIDC SNEINNKDKM DIDDEDNNQS VKLEDLRGRS SNGQILGYKF AVYQDPESPD
     NASVTLYIVA ALLIKNNLVN LNDLYPHLTP ENDSMDEEYS LYLKEIKKST ERETEPNPLL
     SAGSLTDDSA PSDHKISKES NKEKEEREKE KEKKEEKKKI IKTNQKAGLL AALLSIGDLK
     HSKIILEISP KLGIMYHEIV HLIGRIIHVS IQPLYKKIMK IPDIEVPLEP KPYLTHHYEL
     CNPLYEANLV IPKKKGSRVT RYMFFYQQWK DEIPLCNTYE ELLQNIPTFI KYVGIQLHQE
     ITVLTKLVRL AKVHIKEKNI SEEYNNMWLS IISSYFLPSV SLLYSNPGFF MELGDLIKLY
     PFQIRYGLYS NWKNKSYKIN KELGIIKNSA LQEIRRLMRR LSKDNTKQIG RMFGKLVHSN
     PVIAFDKMLE MIEGYENLIN PLVDGCKYLT SLEFDILSFV LIERLSTTLR NRIKDDGTSL
     AQWLSNLATF VGTIYKKYAN MELPGILRYI IYQLKCNNVT DLVILKELIQ KMSGIETTEN
     LSDSQLDALA GGETLKKEAY NAEHLKNTKK SSLRLIDALI DNNLITPLFI SLCQQRSICV
     YNTNLTHLKL IGNMFDQTND TMMQYAEFLM ANIDKHSYSK YIPTIKELCK NYYIDPETAF
     YIIRPKLNYL MKKAEQKESS NSIVNKTENE KTEKTEKSEI NKSAMDIDEK ISKPMEAESS
     LEVLNIWHKG LLSTIEEVST ILPVNVWEGI SPQFYVTFWQ LSLYDIYIPH SRYQDQIKKE
     KQAIHALEND RDESSNANRR RRERERCLDN ISHLEKELNI QNENNKKVIA RLKKEKDYWF
     ANCTNIYNII SYLIQYCLYP RCLFSNIDSI YCSKFIKLMH NLGTENFSSL TLFDRIMNVT
     DITNIISSCT ENEAHHYGKF LKEILKIIYY WYNDQKLYDK EAKGDNLPGF RKKWSTGSSK
     SNQAKISENE ILQFEDYRKV VYKWHIKCHK AFKTCLESTE YMNIRNSIII LNIIIDYFPI
     INKVGVSIEK NVNNVVKNEE REDLKLVAKQ YAGKLSLKKS SWIPNEMFNI DPHSSKPSRT
     ASTHSHDRDR ERDRDRDRDR DRERERDRER ERERDRERDR DRERDRDRDR DRERDRERER
     DRERERDRER DRERDRDRDR EREKERDRDR DRDRDRERDR DREREKERDR DREKERERER
     DRERERDKDR ERDKDRERDR DRDRERDRDR DRERDRDRDR EKSREKDKDK DKSIKERSDT
     PSSHYKLDNL DSKSYSKERE REREKDRERD RDRERNRDRE REHDKDRERE KDREKEKEKE
     RERDRNHERE KEHEKEKDHD RERERDKERE RDRERERDRE RERDRERDNK DIRENESRLS
     NNDIKETSNR DVRSDRRSSS RNSQRHQRLA SDTHSLNNQK SSSKDRINRD KDLKMADRNN
     NRDNHDQRNE YKDKERIERD RNERNREYRN SRDSRDWEHS NSQRNSEKDK EIEKEKEKEK
     EKEREKDRDR ERERERERER ERERERERER DKERDKERDK DRERERERDK SKDTKDRNNI
     KDKNNNSDKI DKDNKESSSK RLVHNSSVAS INSNNDVHKS RSSSLSSNKP NSDIHHPRSN
     SFSSNKPNND IHRSRSDSTS SNKTDNEMHR SHSDSLSTSK LSNEIQRMHN DSYQHNKNNE
     AHHTHNSSIQ FNKTNTNDNN SNSSKQYSKE KTTNHSDQHS NYKLNSKDSY DNQSLSRTSS
     DNNINSDRNN DNRKTISNNS SNNNSQNTHI SNNNSNRNSN ANTNNINNNN SQFKNTNKQN
     SNIPTNIDNN NNNIINLNKN SSKINKSNNN INNATVITSN SNTNNNNNNN NNNNKTTKIL
     SHHDNNENYK KDMKTNSSSI DKLPIKDERK RERNDKEDNT REAKRRKAEG VSIQGSSSSS
     KKNNNIDDNN NNNNNNTKMI NTHGSNNSNN ISIGGSSNNK INYNKLSSNS SNGSISISNS
     NYNKKSQSEI KSNNEIYGNN SNKRQRFDRP SMNNSSNSTI RSNNSNNDNQ NRIYSHSNNN
     NNNNNNNSSS SNNKNSMNKI YNRRE
//
DBGET integrated database retrieval system