GenomeNet

Database: UniProt
Entry: A0A7K5IW09_TOXRE
LinkDB: A0A7K5IW09_TOXRE
Original site: A0A7K5IW09_TOXRE 
ID   A0A7K5IW09_TOXRE        Unreviewed;       431 AA.
AC   A0A7K5IW09;
DT   07-APR-2021, integrated into UniProtKB/TrEMBL.
DT   07-APR-2021, sequence version 1.
DT   28-JAN-2026, entry version 14.
DE   RecName: Full=Cleavage stimulation factor subunit 1 {ECO:0000256|ARBA:ARBA00074323};
DE   AltName: Full=Cleavage stimulation factor 50 kDa subunit {ECO:0000256|ARBA:ARBA00029851};
DE   Flags: Fragment;
GN   Name=Cstf1 {ECO:0000313|EMBL:NWS85453.1};
GN   ORFNames=TOXRED_R09143 {ECO:0000313|EMBL:NWS85453.1};
OS   Toxostoma redivivum (California thrasher).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Neoaves; Telluraves; Australaves;
OC   Passeriformes; Mimidae; Toxostoma.
OX   NCBI_TaxID=99882 {ECO:0000313|EMBL:NWS85453.1, ECO:0000313|Proteomes:UP000523146};
RN   [1] {ECO:0000313|EMBL:NWS85453.1, ECO:0000313|Proteomes:UP000523146}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=B10K-DU-002-15 {ECO:0000313|EMBL:NWS85453.1};
RC   TISSUE=Muscle {ECO:0000313|EMBL:NWS85453.1};
RA   Zhang G.;
RT   "Bird 10,000 Genomes (B10K) Project - Family phase.";
RL   Submitted (SEP-2019) to the EMBL/GenBank/DDBJ databases.
CC   -!- FUNCTION: One of the multiple factors required for polyadenylation and
CC       3'-end cleavage of mammalian pre-mRNAs. May be responsible for the
CC       interaction of CSTF with other factors to form a stable complex on the
CC       pre-mRNA. {ECO:0000256|ARBA:ARBA00058408}.
CC   -!- SUBUNIT: Homodimer. The CSTF complex is composed of CSTF1 (50 kDa
CC       subunit), CSTF2 (64 kDa subunit) and CSTF3 (77 kDa subunit). Interacts
CC       (via repeats WD) directly with CSTF3. Interacts (via repeat WD6) with
CC       BARD1. Interacts with ERCC6. {ECO:0000256|ARBA:ARBA00066148}.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:NWS85453.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; VXBI01006779; NWS85453.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A7K5IW09; -.
DR   Proteomes; UP000523146; Unassembled WGS sequence.
DR   GO; GO:0005848; C:mRNA cleavage stimulating factor complex; IEA:InterPro.
DR   GO; GO:0003723; F:RNA binding; IEA:TreeGrafter.
DR   GO; GO:0031124; P:mRNA 3'-end processing; IEA:InterPro.
DR   CDD; cd00200; WD40; 1.
DR   FunFam; 1.20.960.50:FF:000001; Cleavage stimulation factor subunit 1; 1.
DR   FunFam; 2.130.10.10:FF:000064; Cleavage stimulation factor subunit 1; 1.
DR   FunFam; 2.130.10.10:FF:000089; Cleavage stimulation factor subunit 1; 1.
DR   Gene3D; 1.20.960.50; Cleavage stimulation factor subunit 1, dimerisation domain; 1.
DR   Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 2.
DR   InterPro; IPR044633; CstF1-like.
DR   InterPro; IPR032028; CSTF1_dimer.
DR   InterPro; IPR038184; CSTF1_dimer_sf.
DR   InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR   InterPro; IPR019775; WD40_repeat_CS.
DR   InterPro; IPR036322; WD40_repeat_dom_sf.
DR   InterPro; IPR001680; WD40_rpt.
DR   PANTHER; PTHR44133; CLEAVAGE STIMULATION FACTOR SUBUNIT 1; 1.
DR   PANTHER; PTHR44133:SF2; CLEAVAGE STIMULATION FACTOR SUBUNIT 1; 1.
DR   Pfam; PF16699; CSTF1_dimer; 1.
DR   Pfam; PF00400; WD40; 6.
DR   SMART; SM00320; WD40; 6.
DR   SUPFAM; SSF50978; WD40 repeat-like; 1.
DR   PROSITE; PS00678; WD_REPEATS_1; 1.
DR   PROSITE; PS50082; WD_REPEATS_2; 4.
DR   PROSITE; PS50294; WD_REPEATS_REGION; 2.
PE   4: Predicted;
KW   mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000523146};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   WD repeat {ECO:0000256|ARBA:ARBA00022574, ECO:0000256|PROSITE-
KW   ProRule:PRU00221}.
FT   DOMAIN          8..60
FT                   /note="Cleavage stimulation factor subunit 1 dimerisation"
FT                   /evidence="ECO:0000259|Pfam:PF16699"
FT   REPEAT          104..138
FT                   /note="WD"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT   REPEAT          169..210
FT                   /note="WD"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT   REPEAT          258..299
FT                   /note="WD"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT   REPEAT          309..343
FT                   /note="WD"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:NWS85453.1"
FT   NON_TER         431
FT                   /evidence="ECO:0000313|EMBL:NWS85453.1"
SQ   SEQUENCE   431 AA;  48346 MW;  9A045AE843789374 CRC64;
     MYRTKVSLKD RQQLYKLIIS QLLYDGYINI ANGLINEIKP QSVCAPSEQL LHLIKLGMEN
     DDSAVQYAIG RSDTVAPGTG IDLEFDADVQ TMSPEASEYE TCYVTSHKGP CRVATYSRDG
     QLIATGSADA SIKILDTERM LAKSAMPIEV MMNETAQQNM ENHPVIRTLY DHVDEVTCLA
     FHPTEQILAS GSRDYTLKLF DYSKPSAKRA FKYIQEAEML RSISFHPSGD FILVGTQHPT
     LRLYDINTFQ CFVSCNPQDQ HTDAICSVNY NASANMYVTG SKDGCIKLWD GVSNRCITTF
     EKAHDGAEVC SAIFSKNSKY ILSSGKDSVA KLWEISTGRT LVKYTGAGLS GRQVHRTQAV
     FNHTEDYVLL PDERTISLCC WDSRTAERRN LLSLGHNSIV RCIVHSPTNP GFMTCSDDYR
     ARFWYRRSTT D
//
DBGET integrated database retrieval system