ID Q6C4R2_YARLI Unreviewed; 1093 AA.
AC Q6C4R2;
DT 16-AUG-2004, integrated into UniProtKB/TrEMBL.
DT 16-AUG-2004, sequence version 1.
DT 27-MAR-2024, entry version 87.
DE SubName: Full=YALI0E24387p {ECO:0000313|EMBL:CAG79949.1};
GN ORFNames=YALI0_E24387g {ECO:0000313|EMBL:CAG79949.1};
OS Yarrowia lipolytica (strain CLIB 122 / E 150) (Yeast) (Candida lipolytica).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Dipodascaceae; Yarrowia.
OX NCBI_TaxID=284591 {ECO:0000313|EMBL:CAG79949.1, ECO:0000313|Proteomes:UP000001300};
RN [1] {ECO:0000313|EMBL:CAG79949.1, ECO:0000313|Proteomes:UP000001300}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CLIB 122 / E 150 {ECO:0000313|Proteomes:UP000001300};
RX PubMed=15229592; DOI=10.1038/nature02579;
RG Genolevures;
RA Dujon B., Sherman D., Fischer G., Durrens P., Casaregola S., Lafontaine I.,
RA de Montigny J., Marck C., Neuveglise C., Talla E., Goffard N., Frangeul L.,
RA Aigle M., Anthouard V., Babour A., Barbe V., Barnay S., Blanchin S.,
RA Beckerich J.M., Beyne E., Bleykasten C., Boisrame A., Boyer J.,
RA Cattolico L., Confanioleri F., de Daruvar A., Despons L., Fabre E.,
RA Fairhead C., Ferry-Dumazet H., Groppi A., Hantraye F., Hennequin C.,
RA Jauniaux N., Joyet P., Kachouri R., Kerrest A., Koszul R., Lemaire M.,
RA Lesur I., Ma L., Muller H., Nicaud J.M., Nikolski M., Oztas S.,
RA Ozier-Kalogeropoulos O., Pellenz S., Potier S., Richard G.F., Straub M.L.,
RA Suleau A., Swennene D., Tekaia F., Wesolowski-Louvel M., Westhof E.,
RA Wirth B., Zeniou-Meyer M., Zivanovic I., Bolotin-Fukuhara M., Thierry A.,
RA Bouchier C., Caudron B., Scarpelli C., Gaillardin C., Weissenbach J.,
RA Wincker P., Souciet J.L.;
RT "Genome evolution in yeasts.";
RL Nature 430:35-44(2004).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CR382131; CAG79949.1; -; Genomic_DNA.
DR RefSeq; XP_504350.1; XM_504350.1.
DR AlphaFoldDB; Q6C4R2; -.
DR STRING; 284591.Q6C4R2; -.
DR EnsemblFungi; CAG79949; CAG79949; YALI0_E24387g.
DR GeneID; 2912547; -.
DR KEGG; yli:YALI0E24387g; -.
DR VEuPathDB; FungiDB:YALI0_E24387g; -.
DR HOGENOM; CLU_284296_0_0_1; -.
DR InParanoid; Q6C4R2; -.
DR OrthoDB; 2046299at2759; -.
DR Proteomes; UP000001300; Chromosome E.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0006397; P:mRNA processing; IEA:UniProtKB-KW.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 2.
DR InterPro; IPR004871; Cleavage/polyA-sp_fac_asu_C.
DR InterPro; IPR018846; Cleavage/polyA-sp_fac_asu_N.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR PANTHER; PTHR10644; DNA REPAIR/RNA PROCESSING CPSF FAMILY; 1.
DR PANTHER; PTHR10644:SF24; MMS1_N DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF03178; CPSF_A; 1.
DR Pfam; PF10433; MMS1_N; 1.
DR SUPFAM; SSF50978; WD40 repeat-like; 1.
PE 4: Predicted;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW Reference proteome {ECO:0000313|Proteomes:UP000001300}.
FT DOMAIN 47..476
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10433"
FT DOMAIN 762..999
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit C-terminal"
FT /evidence="ECO:0000259|Pfam:PF03178"
SQ SEQUENCE 1093 AA; 120629 MW; EA0912795DB25D7D CRC64;
MQTMHPQICL HTFTTSLVIA NQTHTLPARL LSACVIEDKT PDKLSRDHLL ICMSSGELLL
VKVYETGATP QVLDATTIFP RNKPNLSGLG GKICVDASST LAAIGSFAEF VLLVEIRALA
DKPKRFFKEK HVVSSAGAPA TFNSPRSTAM DYCFNMIAST EEDHGVYLSA LFVNERQRLT
LETYLWRPKY PVKTVSKLPL PIMDLPVFLF FNDTVSGSVI LACPKNLYLV KFNDMLSGVT
TFPSCTIDSF PVAHYRSEDA SHVYISCEDG SILEVVATNS TISVKTVVKT DVDLGKTFIL
EEADEESYLL GYGGDASDGR LLQITKSDAK SFVMETIASF PNWAPASDIE IIHGNIYVAS
GVENTGALTQ LNPGSRITTL YNEESDQYDT ISSVTFGSGL FPVSYIVVSS PIRSKIRQSM
YKDAISEDPN PRFWDLAEDC GFEFDQKTLH TGFTTNGLFV QITPNTLVLT DLSGHKCTLD
QSIVLGSVGP NNLLATVGNV PGNDRNLSVY RIASSVKPDA FELIKSIPLD APVTLLKVIE
DHIVVGTHDA INFYDIEDLT AVSTGLPFTP CDMLINSAFE LLIGCRTGHL LFCSFTSQGE
VTVNSYRQFG DTPVRITSLQ FQDYILVASD SVYILNLANN SPPLQIHNDD SSNEVIIGEC
PVFIGEANPE TAMSVFACFL TPAQLKLVDL ELAHSVVEFK IELGKTPRRL LYLDDVNLLV
ITLLGEMDPD TGNHLAFVDL ATRAVVSPNV FMDRRKHGGV TVFSRKEVIY CMAEWRLSLD
SQEYRYLVTG SGYTTSTKGR LVILSYKVKN GEVSLGKQAA WTVPEPVFAV CQLSKNTLVY
SFGRMISVAK FSGPSIEYKS GPEYEFPSKV VKLTVTSPGE VIASTMNNGI IVLTYEDDKF
CRVAKDSVYR SCLNHGVSDS LIAVADKERC VSFLARDDGI TLLKPTGKVF LPSFVSKLLL
SDNKPVWRKQ PSQMFQNPSI LCLSMSGEIT KMYIMTKQEK EYWDKELAVM KREKLKDSLL
DLDSEDFMED YNGSTNPIDS HSVSYVSFPT PLSPAPTMWS DFDLNSTNII NGTALFQMYY
GTDNPIIDAL NEM
//