ID A0A3Q3L766_9LABR Unreviewed; 1468 AA.
AC A0A3Q3L766;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 10-APR-2019, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE SubName: Full=Cleavage and polyadenylation specific factor 1 {ECO:0000313|Ensembl:ENSLBEP00000004789.1};
OS Labrus bergylta (ballan wrasse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Labriformes; Labridae; Labrus.
OX NCBI_TaxID=56723 {ECO:0000313|Ensembl:ENSLBEP00000004789.1, ECO:0000313|Proteomes:UP000261660};
RN [1] {ECO:0000313|Ensembl:ENSLBEP00000004789.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the CPSF1 family.
CC {ECO:0000256|ARBA:ARBA00038446}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 56723.ENSLBEP00000004789; -.
DR Ensembl; ENSLBET00000005057.1; ENSLBEP00000004789.1; ENSLBEG00000003601.1.
DR GeneTree; ENSGT00950000183151; -.
DR InParanoid; A0A3Q3L766; -.
DR OrthoDB; 149432at2759; -.
DR Proteomes; UP000261660; Unplaced.
DR GO; GO:0005634; C:nucleus; IEA:InterPro.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR Gene3D; 1.10.150.910; -; 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 2.
DR InterPro; IPR004871; Cleavage/polyA-sp_fac_asu_C.
DR InterPro; IPR018846; Cleavage/polyA-sp_fac_asu_N.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR PANTHER; PTHR10644:SF2; CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR SUBUNIT 1; 1.
DR PANTHER; PTHR10644; DNA REPAIR/RNA PROCESSING CPSF FAMILY; 1.
DR Pfam; PF03178; CPSF_A; 1.
DR Pfam; PF10433; MMS1_N; 2.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000261660}.
FT DOMAIN 92..540
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10433"
FT DOMAIN 591..692
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10433"
FT DOMAIN 1098..1433
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit C-terminal"
FT /evidence="ECO:0000259|Pfam:PF03178"
FT REGION 403..436
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 545..594
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 781..803
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 927..946
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 564..594
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 781..795
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 927..943
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1468 AA; 164613 MW; E39661BED46B032A CRC64;
MYAVYRQAHT PTAVEFSVYC NFISSKEKNL VVAGTSQLYV YRIIHDVEST SKVDKSSDTK
SRKEKLEQVA SFSLFGNVMS MASVQLVGAS RDALLLSFKD AKLSVVEYDP GTHDLKTLSL
HFFEEPELRD GFVQNVHIPI VRVDPENRCA VMLVYGTQLV VLPFRKDTLT DEQEGGVGEG
PKSSFLPSYI IDVRELDEKL LNIIDMKFLH GYYEPTLLIL FEPNQTWPGR VAVRQDTCSI
VAISLNIMQK VHPVIWSLSN LPFDCTQVMP VPKPIGGVVV FAVNSLLYLN QSVPPYGVSL
NSQTTGTTAF PLRIQDEVKI TLDCSQSDFI AYDKMVISLK GGEIYVLTLI TDGMRSVRAF
HFDKAAASVL TTCMVTMEPG YLFLGSRLGN SLLLKYTEKL QETPSEEGKE TLDKEKDKEK
PEEPPSKKKR MESSTNWTDE VDEIEVYGSE AQSGTQLATY SFEVCDSILN IGPCANASMG
EPAFLSEEFQ GNPEPDLEVV VCSGNGKNGA LSVLQRSIRP QVVTTFELPG CHDMWTVISN
EVKEDKKPVK GSDSEDGTET ETGEDGEERE KEKEEKEEEG EKTEAPLEDD TKKHGFLILS
REDSTMILQT GQEIMELDTS GFATQGPTVF AGNIGNNKYI IQVSPMGIRL LEGVTQLHFI
PVDLGSPIVH CSVADPYVVI MTADGVVTMF VLKTDSYMGK THRLALQKPQ IPTQSRVIAL
CAYRDVSGMF TTENKVSCSV KDDTIITGQS EEETIIHDLS NAVDDEEEML YGDSNASAAL
AREESNNNSG GPGHFGSDGN SGKAEPTHWC MITRESGVME IYQLPDWRLV FLVKNFPVGQ
RVLVDSSAGQ SATQGEGKKE EVTRQGDLPL VREVALVSLG NNQSRPYLLV HVENELLIYE
AFPYDQQQQT NLKVRFKKVP HNINYREKKS KLKRDKKTDS GAPEDSSAVK SRIARFRYFE
DISGYSGVFI CGPSPHWMLV TSRGALRLHP MIIDGAIESF SPFHNINCPK GFLYFNKQGE
LRISVLPTYL SYDAPWPVRK IPLRCTVHYV SYHVESKVYA VCTSIKEPCT RIPRMTGEEK
EFETIDRDER YINPQQEKFS IQLISPVSWE AIPNARIDLE EWEHVTCMKT VALRSQETVS
GLKGYVAAGT CLMQGEEVTC RGRILILDVI EVVPEPGQPL TKNKFKVLYE KEQKGPVTAM
CHCNGYLVSA IGQKIFLWVL KDNDLTGMAF IDTQLYIHQM FSIKNFILAA DLMKSISLLR
YQEESKTLSL VSRDAKPLEV YSIDFVVDNN QLGFLVSDRD KNLYVYMYLP EAKESFGGMR
LLRRADFNAG ANINTFWRMP CRGALDTGSK KSLTWDNKHI TWFATLDGGV GLLLPMQEKT
YRRLLMLQNA LNTMLPHHAG LNPKAFRMLQ SNRRGLQNAV RNILDGELLN KYLYLSTMER
SELAKKIGTT QDIILDDLLE IDRVTAHF
//