ID A0A078FSP8_BRANA Unreviewed; 434 AA.
AC A0A078FSP8;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 29-OCT-2014, sequence version 1.
DT 24-JAN-2024, entry version 40.
DE RecName: Full=Protein CLP1 homolog {ECO:0000256|HAMAP-Rule:MF_03035};
GN Name=BnaA04g11670D {ECO:0000313|EMBL:CDY15944.1};
GN ORFNames=DARMORV10_A04P15620.1 {ECO:0000313|EMBL:CAF2275608.1},
GN GSBRNA2T00090320001 {ECO:0000313|EMBL:CDY15944.1};
OS Brassica napus (Rape).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Brassiceae; Brassica.
OX NCBI_TaxID=3708 {ECO:0000313|EMBL:CDY15944.1, ECO:0000313|Proteomes:UP000028999};
RN [1] {ECO:0000313|EMBL:CDY15944.1, ECO:0000313|Proteomes:UP000028999}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Darmor-bzh {ECO:0000313|Proteomes:UP000028999};
RX PubMed=25146293; DOI=10.1126/science.1253435;
RA Chalhoub B., Denoeud F., Liu S., Parkin I.A., Tang H., Wang X., Chiquet J.,
RA Belcram H., Tong C., Samans B., Correa M., Da Silva C., Just J.,
RA Falentin C., Koh C.S., Le Clainche I., Bernard M., Bento P., Noel B.,
RA Labadie K., Alberti A., Charles M., Arnaud D., Guo H., Daviaud C.,
RA Alamery S., Jabbari K., Zhao M., Edger P.P., Chelaifa H., Tack D.,
RA Lassalle G., Mestiri I., Schnel N., Le Paslier M.C., Fan G., Renault V.,
RA Bayer P.E., Golicz A.A., Manoli S., Lee T.H., Thi V.H., Chalabi S., Hu Q.,
RA Fan C., Tollenaere R., Lu Y., Battail C., Shen J., Sidebottom C.H.,
RA Wang X., Canaguier A., Chauveau A., Berard A., Deniot G., Guan M., Liu Z.,
RA Sun F., Lim Y.P., Lyons E., Town C.D., Bancroft I., Wang X., Meng J.,
RA Ma J., Pires J.C., King G.J., Brunel D., Delourme R., Renard M., Aury J.M.,
RA Adams K.L., Batley J., Snowdon R.J., Tost J., Edwards D., Zhou Y., Hua W.,
RA Sharpe A.G., Paterson A.H., Guan C., Wincker P.;
RT "Plant genetics. Early allopolyploid evolution in the post-Neolithic
RT Brassica napus oilseed genome.";
RL Science 345:950-953(2014).
RN [2] {ECO:0000313|EMBL:CDY15944.1}
RP NUCLEOTIDE SEQUENCE.
RA Genoscope - CEA;
RL Submitted (JUN-2014) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:CAF2275608.1}
RP NUCLEOTIDE SEQUENCE.
RG Genoscope - CEA;
RA William W.;
RL Submitted (JAN-2021) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Required for endonucleolytic cleavage during polyadenylation-
CC dependent pre-mRNA 3'-end formation. {ECO:0000256|HAMAP-Rule:MF_03035}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|HAMAP-Rule:MF_03035}.
CC -!- SIMILARITY: Belongs to the Clp1 family. Clp1 subfamily.
CC {ECO:0000256|HAMAP-Rule:MF_03035}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|HAMAP-Rule:MF_03035}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; HG994358; CAF2275608.1; -; Genomic_DNA.
DR EMBL; LK032059; CDY15944.1; -; Genomic_DNA.
DR RefSeq; XP_013751014.1; XM_013895560.1.
DR STRING; 3708.A0A078FSP8; -.
DR PaxDb; 3708-A0A078FSP8; -.
DR EnsemblPlants; CDY15944; CDY15944; GSBRNA2T00090320001.
DR GeneID; 106453294; -.
DR Gramene; CDY15944; CDY15944; GSBRNA2T00090320001.
DR KEGG; bna:106453294; -.
DR OMA; SADANYG; -.
DR OrthoDB; 56092at2759; -.
DR Proteomes; UP000028999; Unassembled WGS sequence.
DR GO; GO:0005849; C:mRNA cleavage factor complex; IEA:InterPro.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-UniRule.
DR GO; GO:0051731; F:polynucleotide 5'-hydroxyl-kinase activity; IBA:GO_Central.
DR GO; GO:0006378; P:mRNA polyadenylation; IBA:GO_Central.
DR GO; GO:0006388; P:tRNA splicing, via endonucleolytic cleavage and ligation; IBA:GO_Central.
DR Gene3D; 2.60.120.1030; Clp1, DNA binding domain; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR Gene3D; 2.40.30.330; Pre-mRNA cleavage complex subunit Clp1, C-terminal domain; 1.
DR HAMAP; MF_03035; Clp1; 1.
DR InterPro; IPR028606; Clp1.
DR InterPro; IPR045116; Clp1/Grc3.
DR InterPro; IPR010655; Clp1_C.
DR InterPro; IPR038238; Clp1_C_sf.
DR InterPro; IPR032324; Clp1_N.
DR InterPro; IPR038239; Clp1_N_sf.
DR InterPro; IPR032319; CLP1_P.
DR InterPro; IPR027417; P-loop_NTPase.
DR PANTHER; PTHR12755; CLEAVAGE/POLYADENYLATION FACTOR IA SUBUNIT CLP1P; 1.
DR PANTHER; PTHR12755:SF19; PROTEIN CLP1 HOMOLOG 5; 1.
DR Pfam; PF06807; Clp1; 1.
DR Pfam; PF16573; CLP1_N; 1.
DR Pfam; PF16575; CLP1_P; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
PE 3: Inferred from homology;
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840, ECO:0000256|HAMAP-
KW Rule:MF_03035};
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664, ECO:0000256|HAMAP-
KW Rule:MF_03035};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741, ECO:0000256|HAMAP-
KW Rule:MF_03035};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|HAMAP-Rule:MF_03035};
KW Reference proteome {ECO:0000313|Proteomes:UP000028999}.
FT DOMAIN 27..117
FT /note="Clp1 N-terminal beta-sandwich"
FT /evidence="ECO:0000259|Pfam:PF16573"
FT DOMAIN 137..326
FT /note="Polyribonucleotide 5'-hydroxyl-kinase Clp1 P-loop"
FT /evidence="ECO:0000259|Pfam:PF16575"
FT DOMAIN 334..432
FT /note="Pre-mRNA cleavage complex subunit Clp1 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF06807"
FT BINDING 33
FT /ligand="ATP"
FT /ligand_id="ChEBI:CHEBI:30616"
FT /evidence="ECO:0000256|HAMAP-Rule:MF_03035"
FT BINDING 140..145
FT /ligand="ATP"
FT /ligand_id="ChEBI:CHEBI:30616"
FT /evidence="ECO:0000256|HAMAP-Rule:MF_03035"
SQ SEQUENCE 434 AA; 48446 MW; 9C6AAA71DAA7F1BF CRC64;
MSYCGLSMNS DAATGATEFS PQVRRVTLEK QSEIRIQVPQ ISPLKLRVLH GKVEIFGSEL
LRNVWLTFPP LQQFAVFTWY GATLEIDGVT ETDNTSVETP MVSYLSIHNS LQVQRHRVTS
STRDYVSSQG PRVIIVGDTD SGKSTLAKML LSWAAKDGCK PTFVDLNIGQ SSITIPGTIA
ATSVEMPVDP VEGLPLHKAL VHYFGHNTAT NNVRLYKYLV EELARELEEE FAINAESRRS
GMVIDTMGWT SGLGYQLLLH AIRIFNASLV IVLGQETELV YDLNKAFKFK KNVQILNLER
SSGVFSRLSD FRKMLRNISI QRYFSGATNN LTAYTKTAKF TDMQVYRIGA LLEKSRSTEP
LRITPVLIDK DLVNTVLAIS YAKQPHHIIS SIVAGFVYIT DVDLGEERIT YLSPSAAELP
AKIFIMGTLT WHKT
//