ID A0A0G4MI47_9PEZI Unreviewed; 1409 AA.
AC A0A0G4MI47;
DT 16-SEP-2015, integrated into UniProtKB/TrEMBL.
DT 16-SEP-2015, sequence version 1.
DT 24-JAN-2024, entry version 21.
DE RecName: Full=Protein CFT1 {ECO:0000256|ARBA:ARBA00039443};
DE AltName: Full=Cleavage factor two protein 1 {ECO:0000256|ARBA:ARBA00041264};
DE AltName: Full=Protein cft1 {ECO:0000256|ARBA:ARBA00039187};
GN ORFNames=BN1708_006200 {ECO:0000313|EMBL:CRK33931.1};
OS Verticillium longisporum.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Hypocreomycetidae; Glomerellales; Plectosphaerellaceae; Verticillium.
OX NCBI_TaxID=100787 {ECO:0000313|EMBL:CRK33931.1, ECO:0000313|Proteomes:UP000044602};
RN [1] {ECO:0000313|EMBL:CRK33931.1, ECO:0000313|Proteomes:UP000044602}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=VL1 {ECO:0000313|EMBL:CRK33931.1};
RA Wang D.B., Wang M.;
RL Submitted (MAY-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the CFT1 family.
CC {ECO:0000256|ARBA:ARBA00038304}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CVQH01022750; CRK33931.1; -; Genomic_DNA.
DR STRING; 100787.A0A0G4MI47; -.
DR OrthoDB; 149432at2759; -.
DR Proteomes; UP000044602; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:InterPro.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 2.
DR InterPro; IPR004871; Cleavage/polyA-sp_fac_asu_C.
DR InterPro; IPR018846; Cleavage/polyA-sp_fac_asu_N.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR PANTHER; PTHR10644:SF2; CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR SUBUNIT 1; 1.
DR PANTHER; PTHR10644; DNA REPAIR/RNA PROCESSING CPSF FAMILY; 1.
DR Pfam; PF03178; CPSF_A; 1.
DR Pfam; PF10433; MMS1_N; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000044602}.
FT DOMAIN 128..755
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10433"
FT DOMAIN 1021..1371
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit C-terminal"
FT /evidence="ECO:0000259|Pfam:PF03178"
FT REGION 206..239
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 487..516
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1333..1352
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 487..501
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1409 AA; 153593 MW; D7F694EF6F70AD89 CRC64;
MQCYTELVPP TLVTCSVSLP FTSAKTENLI VSKGSLLQIF AVKTVSTEVD TSQIQAKSAS
KAGETYDRRI NDDDGLESAF LGGDGMLMRA DRTTNTRLVL VAEYPVHGVI AGLARVKIQS
SRSGGEALLV HSRTARLSLL QWDPEKNGVE DVSIHFYEKD EWQGSPMDGP LRQHATILQA
DPQSRCAALK FGLRKTAFLP FRQNDGDIDM DDWDEEVDGP RPEEEPPAAA AVNGSSSSSS
LAPVPYTPSF VLALPQLDPE ILHPVHFAFL HEYREPTLGI ISSSNRRLKM EPQKDHFTFK
VFTVDLLQKA STAILTVSNL PQSLNKVVAL PKPMGGALLI GENELIHIDQ AGKAHGVAVN
PYAAKMTKFP LADQSELKLR LEHCEVELMS PENGEMLLVT RHGEMAVVTF KMDGRSVSGV
SVKLVAPENG GDILPFRAAC LSKVSKNSIF YGTIGGDAKV IGWSRQHVQT ARKKARLLDE
SLDYDLDEDE LDDDDDDDLY GEGTVAPQPS AAAGSAKGGD VVFRVHDSLL SLSPIMDMTY
GKTAFFPGSE DATNSEGVRS ELDLVCAVGR HRGGSLALIN QHIQPRVIGR FEFPEARGFW
TTRVQKTIAE SLQGEKGANL AVGNDYGSVT QYDKFMIVAK VDLDGYETSD VYALTGAGFE
ALSGTEFDPA AGLTIEAGTM GNDMRIIQVL RSEVRCYDGD LGLSQILPML DEETGAEPRV
ISASIVDPYL LLLREDSSIL VAQITNHNEL EELDKEDETI VSTKWLSGCL YKDSRGLFAP
VQTDKGTSTS ESVFMFLLNA TGELHVYALP NLSKSIYVAA GLSYIPSLLS ADYAARRGTS
PETLTEILVA DLGDSTSASA HLILRHANDD MTIYEPFRIG GQEEKEDLAK SLFFKKVSNS
HLAKSPVEAA EDEAVQENRL GVNGMSSFHT EGCERGFIYA DSKGCARVTQ FPEAANVAEL
GVSVRKVPID TAVSHVAWHP TMEVYAVASS KLEPFELPKD DDYHKEWAKE ERPMPPMKEH
GSIKLYSPIT WNVIDEFELE QYEVAMCMKT LLLEVSEETK ERRMLFAVGT AILRGEDLPV
RGRILVFDVV HVIPQPDRPE TDRKLKLIAK EEIPRGAVTS LCEVGTQGLM LVAQGQKCMV
RGLKEDGTLL PVAFLDMSTY VVAVHELRNT GYCLMADANM GVWFVGYSEE PYRMTLFGKS
GTQLKCLTAD FLVAGNDLSI VASDEDGVLH ILQFDPEHPR SLQGHLLLNR ASFSVAPNHA
WVTLALPRTT TRPYLPQSEP ATSAAGSQNR TQTLLLASAS GAIASLNPIT EHAYRRLTSL
TTSLANALPH AAGMNPKAHR LPPQDGAARP PAVDVSAGRT IVDGALLARW NELGARQRAE
AAGKGGFASA ADVRGELEDV LGWRGVDYF
//