ID A0A1E3HMF5_9TREE Unreviewed; 1421 AA.
AC A0A1E3HMF5;
DT 18-JAN-2017, integrated into UniProtKB/TrEMBL.
DT 18-JAN-2017, sequence version 1.
DT 24-JAN-2024, entry version 15.
DE RecName: Full=Protein CFT1 {ECO:0000256|ARBA:ARBA00039443};
DE AltName: Full=Cleavage factor two protein 1 {ECO:0000256|ARBA:ARBA00041264};
DE AltName: Full=Protein cft1 {ECO:0000256|ARBA:ARBA00039187};
GN ORFNames=L202_05477 {ECO:0000313|EMBL:ODN76896.1};
OS Cryptococcus amylolentus CBS 6039.
OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; Tremellomycetes;
OC Tremellales; Cryptococcaceae; Cryptococcus.
OX NCBI_TaxID=1295533 {ECO:0000313|EMBL:ODN76896.1, ECO:0000313|Proteomes:UP000094065};
RN [1] {ECO:0000313|EMBL:ODN76896.1, ECO:0000313|Proteomes:UP000094065}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CBS 6039 {ECO:0000313|EMBL:ODN76896.1,
RC ECO:0000313|Proteomes:UP000094065};
RA Cuomo C., Litvintseva A., Heitman J., Chen Y., Sun S., Springer D.,
RA Dromer F., Young S., Zeng Q., Chapman S., Gujja S., Saif S., Birren B.;
RT "Evolution of pathogenesis and genome organization in the Tremellales.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the CFT1 family.
CC {ECO:0000256|ARBA:ARBA00038304}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ODN76896.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AWGJ01000008; ODN76896.1; -; Genomic_DNA.
DR RefSeq; XP_018992270.1; XM_019139779.1.
DR STRING; 1295533.A0A1E3HMF5; -.
DR GeneID; 30156786; -.
DR OrthoDB; 149432at2759; -.
DR Proteomes; UP000094065; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:InterPro.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 3.
DR InterPro; IPR004871; Cleavage/polyA-sp_fac_asu_C.
DR InterPro; IPR018846; Cleavage/polyA-sp_fac_asu_N.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR PANTHER; PTHR10644:SF2; CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR SUBUNIT 1; 1.
DR PANTHER; PTHR10644; DNA REPAIR/RNA PROCESSING CPSF FAMILY; 1.
DR Pfam; PF03178; CPSF_A; 1.
DR Pfam; PF10433; MMS1_N; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000094065}.
FT DOMAIN 145..675
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10433"
FT DOMAIN 1063..1389
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit C-terminal"
FT /evidence="ECO:0000259|Pfam:PF03178"
SQ SEQUENCE 1421 AA; 155991 MW; 3E00976B2A4E722A CRC64;
MHALHQTLLP PSSIHHSLFL PHFTPSTIYP LPKPSSALDT QDIRIVGNLI VAGAQVLRVF
EIREENVTIK EESDSLEESG LGGGTEDVQM GDVGDGFFDD GHADRVPIQR ATVKKLHLLT
QHELHGTVTG LAPLRTIESS ADGLDRLLVS FKDAKIALLE WSRGDIATVS LHTYERCPQM
NTGDLQNYVP MLRADPLSRL AVLTLPEDSL AVLPVVQEQS ELEMTNGFAR DAPYSPSFVL
SLSDVSASIK NVHDLLFLAG FHSPTLALLF SPLYTWSGRY QTVKDNFCLQ IRTFDLSSGG
SYPLLTSVSG LPSDALYLVA SPAELGGIVV ITSTGLVHVD QSGRTTCASV NGWWSYVTSL
KPQSSHNHLK MALEGSKSVF VGPYDLIITL QSGDVHQVRF EMEGRAVGSI FIEEKSSIVP
PPSTLTNAGE KTVFVGCAEG DSWLADVIRE EITREKEEEP REDIEVDWDE DLYGDINDPA
LDSASGARHE DGPAKLSLAP SDVLSAVGKI MDIEFGIAAS DQGLRTYPQL VAVAGGSRNS
TFNVFRRGIP ITKRRRFNEL TNSEGVWFLP IDRPSGQKFK DIPESERATM LLSSEGSATR
VFALSTKPAP QQIGRLDGKT LTAAPFFQRS CVLRVSPSEV SLLDNNGKVI QDVSPKSDQA
PIVGASISDP FVVIRRADDS VSFFVGDTVA RTVSEVAIAS EGKTCPPCQA VEVFSDTTGI
YRTFEPSRVG PYEHLQSNVT ARVNGANSSR NTRQAQLTAE QIKHLQEQEP AITIDAPSTE
AAINSSHGTQ WLCLLTRKGE MQILSLPDMT VVMQSEGLSS SAPSFTDDIG ERYTGEEKIE
EGEEEDEVKQ MVFCPIGKNN FRPHLLALHH SGRLNAYEAQ PRFTVDASTQ SRRSLAVRFK
KVHTQLLPIS GGVRTTNTAE TRLPYSIIPF SDIEGLTGAF ITGEKPQWVI STEAHPLRAF
ALKQAAMAFG KTTHLGGKGE YFIRIEDGSF ICYLPPTLNT EFAIPCDRYE MERVYTHITF
DPTSAHYVGA SSIEVPFQAY DEEGEIQLGP EGEALIPPTN QRSTLELFSQ GSQPWRVIDG
HEFDQNEEVM CMESVTLESI GAPGGYRDFI AVGTGFNFGE DRATRGNTYI FEIVETAGAQ
NLPGWKLRLR HKDPARHPVN AVANINGYLL NTNGPKLYVK GLDDDKQLMG LAFLDVQLYA
TTLKVFKNFI LVGDLCKSFW FVSLQENPYK FSTISKDLQG VSVVTTDFLV HDGQVTFISS
DRNGDIRMLE FDPTDPDSLN GERLMLKTEY HAGSIITVSK VIARRKTADE EFAPQTQIIY
ATADGGLTTM VSVKDARFKR LQLVSDQLVR NAQHVAGLNP RAFRTVRNDL LPRPLSKGIL
DGQLLSHFAL QPIGRQQEMM RQIGTDAVTV ASDLAALGGF W
//