ID G3WDI8_SARHA Unreviewed; 259 AA.
AC G3WDI8;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 74.
DE RecName: Full=Cleavage and polyadenylation specificity factor subunit 4 {ECO:0000256|RuleBase:RU369008};
DE Short=CPSF 30 kDa subunit {ECO:0000256|RuleBase:RU369008};
DE AltName: Full=Cleavage and polyadenylation specificity factor 30 kDa subunit {ECO:0000256|RuleBase:RU369008};
GN Name=CPSF4 {ECO:0000313|Ensembl:ENSSHAP00000013493.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000013493.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000013493.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000013493.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- FUNCTION: Component of the cleavage and polyadenylation specificity
CC factor (CPSF) complex that play a key role in pre-mRNA 3'-end
CC formation, recognizing the AAUAAA signal sequence and interacting with
CC poly(A) polymerase and other factors to bring about cleavage and
CC poly(A) addition. CPSF4 binds RNA polymers with a preference for
CC poly(U). {ECO:0000256|RuleBase:RU369008}.
CC -!- SUBUNIT: Component of the cleavage and polyadenylation specificity
CC factor (CPSF) complex. {ECO:0000256|RuleBase:RU369008}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|RuleBase:RU369008}.
CC -!- SIMILARITY: Belongs to the CPSF4/YTH1 family.
CC {ECO:0000256|RuleBase:RU369008}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3WDI8; -.
DR STRING; 9305.ENSSHAP00000013493; -.
DR Ensembl; ENSSHAT00000013604.2; ENSSHAP00000013493.2; ENSSHAG00000011533.2.
DR eggNOG; KOG1040; Eukaryota.
DR GeneTree; ENSGT00940000155520; -.
DR HOGENOM; CLU_024513_0_1_1; -.
DR InParanoid; G3WDI8; -.
DR OrthoDB; 33612at2759; -.
DR TreeFam; TF314871; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0005847; C:mRNA cleavage and polyadenylation specificity factor complex; IEA:UniProtKB-UniRule.
DR GO; GO:0005654; C:nucleoplasm; IEA:Ensembl.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:1990837; F:sequence-specific double-stranded DNA binding; IEA:Ensembl.
DR GO; GO:0031124; P:mRNA 3'-end processing; IEA:UniProtKB-UniRule.
DR Gene3D; 4.10.1000.10; Zinc finger, CCCH-type; 2.
DR InterPro; IPR045348; CPSF4/Yth1.
DR InterPro; IPR041686; Znf-CCCH_3.
DR InterPro; IPR000571; Znf_CCCH.
DR InterPro; IPR036855; Znf_CCCH_sf.
DR PANTHER; PTHR23102:SF18; CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR SUBUNIT 4; 1.
DR PANTHER; PTHR23102; CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR SUBUNIT 4-RELATED; 1.
DR Pfam; PF00642; zf-CCCH; 1.
DR Pfam; PF15663; zf-CCCH_3; 1.
DR SMART; SM00356; ZnF_C3H1; 5.
DR SUPFAM; SSF90229; CCCH zinc finger; 2.
DR PROSITE; PS50103; ZF_C3H1; 5.
PE 3: Inferred from homology;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723, ECO:0000256|PROSITE-
KW ProRule:PRU00723}; mRNA processing {ECO:0000256|RuleBase:RU369008};
KW Nucleus {ECO:0000256|RuleBase:RU369008};
KW Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW Repeat {ECO:0000256|ARBA:ARBA00022737, ECO:0000256|RuleBase:RU369008};
KW RNA-binding {ECO:0000256|RuleBase:RU369008};
KW Zinc {ECO:0000256|ARBA:ARBA00022833, ECO:0000256|PROSITE-ProRule:PRU00723};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00723}.
FT DOMAIN 35..59
FT /note="C3H1-type"
FT /evidence="ECO:0000259|PROSITE:PS50103"
FT DOMAIN 62..89
FT /note="C3H1-type"
FT /evidence="ECO:0000259|PROSITE:PS50103"
FT DOMAIN 90..117
FT /note="C3H1-type"
FT /evidence="ECO:0000259|PROSITE:PS50103"
FT DOMAIN 118..142
FT /note="C3H1-type"
FT /evidence="ECO:0000259|PROSITE:PS50103"
FT DOMAIN 143..169
FT /note="C3H1-type"
FT /evidence="ECO:0000259|PROSITE:PS50103"
FT ZN_FING 35..59
FT /note="C3H1-type"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00723"
FT ZN_FING 62..89
FT /note="C3H1-type"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00723"
FT ZN_FING 90..117
FT /note="C3H1-type"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00723"
FT ZN_FING 118..142
FT /note="C3H1-type"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00723"
FT ZN_FING 143..169
FT /note="C3H1-type"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00723"
FT REGION 173..199
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 177..199
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 259 AA; 29449 MW; 04D0C7911A50E823 CRC64;
MQELIASVDH IKFDLEIAVE QQLGAQPLPF PGMDKSGAAV CEFFLKAACG KGGMCPFRHI
SGEKTVVCKH WLRGLCKKGD QCEFLHEYDM TKMPECYFYS KFGECSNKEC PFLHIDPESK
IKDCPWYDRG FCKHGPLCRH RHTRRVICVN YLVGFCPEGP SCKFMHPRFE LPMGTTEQPP
LPQQTQPQAK QNNNPPLQRS SSLIQLTSQN SSPNQQRTPQ VIGVMQSQNN NVGNRGPRPL
EQVTCYKIRQ TDHSHGFSF
//