ID A0A453M5B7_AEGTS Unreviewed; 310 AA.
AC A0A453M5B7;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 24-JAN-2024, entry version 15.
DE RecName: Full=Cleavage and polyadenylation specificity factor subunit 2 {ECO:0000256|RuleBase:RU365006};
DE AltName: Full=Cleavage and polyadenylation specificity factor 100 kDa subunit {ECO:0000256|RuleBase:RU365006};
OS Aegilops tauschii subsp. strangulata (Goatgrass).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Pooideae; Triticodae; Triticeae; Triticinae; Aegilops.
OX NCBI_TaxID=200361 {ECO:0000313|EnsemblPlants:AET5Gv21052200.6, ECO:0000313|Proteomes:UP000015105};
RN [1] {ECO:0000313|Proteomes:UP000015105}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. AL8/78 {ECO:0000313|Proteomes:UP000015105};
RX PubMed=25035499; DOI=10.1126/science.1250092;
RG International Wheat Genome Sequencing Consortium,;
RA Marcussen T., Sandve S.R., Heier L., Spannagl M., Pfeifer M.,
RA Jakobsen K.S., Wulff B.B., Steuernagel B., Mayer K.F., Olsen O.A.;
RT "Ancient hybridizations among the ancestral genomes of bread wheat.";
RL Science 345:1250092-1250092(2014).
RN [2] {ECO:0000313|Proteomes:UP000015105}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. AL8/78 {ECO:0000313|Proteomes:UP000015105};
RX PubMed=29158546; DOI=10.1038/s41477-017-0067-8;
RA Zhao G., Zou C., Li K., Wang K., Li T., Gao L., Zhang X., Wang H., Yang Z.,
RA Liu X., Jiang W., Mao L., Kong X., Jiao Y., Jia J.;
RT "The Aegilops tauschii genome reveals multiple impacts of transposons.";
RL Nat. Plants 3:946-955(2017).
RN [3] {ECO:0000313|EnsemblPlants:AET5Gv21052200.6}
RP IDENTIFICATION.
RG EnsemblPlants;
RL Submitted (MAR-2019) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|RuleBase:RU365006}.
CC -!- SIMILARITY: Belongs to the metallo-beta-lactamase superfamily. RNA-
CC metabolizing metallo-beta-lactamase-like family. CPSF2/YSH1 subfamily.
CC {ECO:0000256|RuleBase:RU365006}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A453M5B7; -.
DR EnsemblPlants; AET5Gv21052200.6; AET5Gv21052200.6; AET5Gv21052200.
DR Gramene; AET5Gv21052200.6; AET5Gv21052200.6; AET5Gv21052200.
DR Proteomes; UP000015105; Chromosome 5D.
DR GO; GO:0005847; C:mRNA cleavage and polyadenylation specificity factor complex; IEA:InterPro.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0006378; P:mRNA polyadenylation; IEA:InterPro.
DR InterPro; IPR027075; CPSF2.
DR InterPro; IPR025069; Cpsf2_C.
DR InterPro; IPR036866; RibonucZ/Hydroxyglut_hydro.
DR InterPro; IPR011108; RMMBL.
DR PANTHER; PTHR45922; CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR SUBUNIT 2; 1.
DR PANTHER; PTHR45922:SF1; CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR SUBUNIT 2; 1.
DR Pfam; PF13299; CPSF100_C; 1.
DR Pfam; PF07521; RMMBL; 1.
DR SUPFAM; SSF56281; Metallo-hydrolase/oxidoreductase; 1.
PE 3: Inferred from homology;
KW mRNA processing {ECO:0000256|RuleBase:RU365006};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|RuleBase:RU365006};
KW Reference proteome {ECO:0000313|Proteomes:UP000015105};
KW RNA-binding {ECO:0000256|RuleBase:RU365006};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..15
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 16..310
FT /note="Cleavage and polyadenylation specificity factor
FT subunit 2"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5019354220"
FT DOMAIN 107..167
FT /note="Zn-dependent metallo-hydrolase RNA specificity"
FT /evidence="ECO:0000259|Pfam:PF07521"
FT DOMAIN 218..307
FT /note="Cleavage and polyadenylation specificity factor 2 C-
FT terminal"
FT /evidence="ECO:0000259|Pfam:PF13299"
SQ SEQUENCE 310 AA; 33891 MW; 0DB412D6B5E30D58 CRC64;
MQLTALLYFL LPSAGSHVGG NVDILIDGFV SPVTSIAPMF PFFENTADWD DFGEVINPDD
YMMKQDEMDN NMMLGAGDGM DGKLDESSAR LLLDSAPSKV ISNEMTVQVK CSLAYMDFEG
RSDGRSVKSV IAHVAPLKLV LVHGSAEATE HLKMHCAKNS DLHVYAPQIE ETIDVTSDLC
AYKVQLSEKL MSNVLSKKLG EHEIAWVDSG VGKVDEKLTL LPPSSTPAAH KSVLVGDLKL
ADFKQFLANK GLQVEFAGGA LRCGEYITVR KIGDSNQKGS TGSQQIVVEG PLCEDYYKIR
ELLYSQFFLL
//