ID M7WDI9_RHOT1 Unreviewed; 1022 AA.
AC M7WDI9;
DT 29-MAY-2013, integrated into UniProtKB/TrEMBL.
DT 29-MAY-2013, sequence version 1.
DT 24-JAN-2024, entry version 40.
DE SubName: Full=Nuclear cap-binding protein subunit 1 {ECO:0000313|EMBL:EMS18477.1};
GN ORFNames=RHTO_05874 {ECO:0000313|EMBL:EMS18477.1};
OS Rhodotorula toruloides (strain NP11) (Yeast) (Rhodosporidium toruloides).
OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina;
OC Microbotryomycetes; Sporidiobolales; Sporidiobolaceae; Rhodotorula.
OX NCBI_TaxID=1130832 {ECO:0000313|EMBL:EMS18477.1, ECO:0000313|Proteomes:UP000016926};
RN [1] {ECO:0000313|EMBL:EMS18477.1, ECO:0000313|Proteomes:UP000016926}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=NP11 {ECO:0000313|EMBL:EMS18477.1,
RC ECO:0000313|Proteomes:UP000016926};
RX PubMed=23047670; DOI=10.1038/ncomms2112;
RA Zhu Z., Zhang S., Liu H., Shen H., Lin X., Yang F., Zhou Y.J., Jin G.,
RA Ye M., Zou H., Zou H., Zhao Z.K.;
RT "A multi-omic map of the lipid-producing yeast Rhodosporidium toruloides.";
RL Nat. Commun. 3:1112-1112(2012).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KB722679; EMS18477.1; -; Genomic_DNA.
DR RefSeq; XP_016269596.1; XM_016419535.1.
DR AlphaFoldDB; M7WDI9; -.
DR GeneID; 27369887; -.
DR eggNOG; KOG1104; Eukaryota.
DR HOGENOM; CLU_004991_0_0_1; -.
DR OrthoDB; 5477544at2759; -.
DR Proteomes; UP000016926; Unassembled WGS sequence.
DR GO; GO:0005846; C:nuclear cap binding complex; IEA:InterPro.
DR GO; GO:0000339; F:RNA cap binding; IEA:InterPro.
DR GO; GO:0006406; P:mRNA export from nucleus; IEA:InterPro.
DR GO; GO:0016070; P:RNA metabolic process; IEA:InterPro.
DR Gene3D; 1.25.40.180; -; 3.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR027159; CBP80.
DR InterPro; IPR015172; MIF4G-like_typ-1.
DR InterPro; IPR015174; MIF4G-like_typ-2.
DR PANTHER; PTHR12412; CAP BINDING PROTEIN; 1.
DR PANTHER; PTHR12412:SF2; NUCLEAR CAP-BINDING PROTEIN SUBUNIT 1; 1.
DR Pfam; PF09088; MIF4G_like; 1.
DR Pfam; PF09090; MIF4G_like_2; 1.
DR SUPFAM; SSF48371; ARM repeat; 3.
PE 4: Predicted;
FT DOMAIN 484..691
FT /note="MIF4G-like type 1"
FT /evidence="ECO:0000259|Pfam:PF09088"
FT DOMAIN 709..981
FT /note="MIF4G-like type 2"
FT /evidence="ECO:0000259|Pfam:PF09090"
FT REGION 12..84
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 188..237
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 532..552
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 888..911
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 43..69
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 188..203
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1022 AA; 112171 MW; F0F478AE3C6CEDCD CRC64;
MSGYGNYQGG GGYNGGGYGG GGGGGYGGDG YGRGGYGRGG YGGGRKRNRD DDQGGGQNYR
RRQNERGDYQ SSPRRGGSGG GYGGGGGGMA ISYDRLPSLE TQKNRFKEDL WKLGDNPNYD
PAIDIPSTAQ SVETWFFRDR SHVFFTFRAA VSEMPHKLPH YAALLARLSL KSMEPPVSLA
ARISANPTPA SWPPPPTDAS PLAPGLPAKP VVDGEDAQMA GEDSTDGAKV ENSEEKKEVE
KVNVGKEIVQ DLMKAFQAFL DERKWKSVRY CVTLFSYLTT MPPASPVISA SSLVNLLASF
ISVLDEPGLR AARGDECVRI IVEALLRFDE NALAEPGVDT LRDGVQSYLS SRRIEKDLFA
DEATKVQWQD PLEQLVTALS SASSSDADGI FPVYSILPDV YASLTLAPAE EDETRAQAGD
DSLTLPLVLV PPESDDSDIT IGAAVGLEHA LPPAPITAGL RGDEGVGYEG TRLTLRLFDD
ESVPSDYDPA GIVVRSLIAD VISLYETNRK EAATILLELP KWFKKGTFRV NKPARRPDDD
QDMPEEEAPE GPNWSLENLI VESILTSVLS LPAPPLTAMY YYSVLTELCR ISPQTVAPSL
GKSIRKLYAA LGTDRDGSEE SVGPVLDAEG VRRLADWFSI HLSNYGFMWG WNDWAPDMDV
SDKHPKRVFV KRTMDLEIRL SYFDRVKNTI PGSMLDAGVF PDDAPGPDYA YEDPEHIHNA
AATSFLRMVR AKAPISEATE ELDSFQKSLE TEHNMTAEAA ENVKRDMAVQ TILNVGSRSF
SHFLNALERY LTLLRNLSSS PSARQHLLNT VAAFWKRHPQ FHLIVLDKLL QYRLVDTRDV
IAWVFAPSEE QEGSKTKTWS DPDLWQMVKI TLRSVTGQID SAKMRVEGLK REEEMKGAEN
DTGKQDGEDV LDAEGDLPVR DAQNPELDSA NSYLAEAEDE QASVLVNVLG HFAKLLPADV
DEEDWETWWI KGWVREFCRS SFSHKALTST VVSEGIDKLD LATTSPATKT ILDAAKAWHS
FA
//