ID A0A091K789_EGRGA Unreviewed; 1230 AA.
AC A0A091K789;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 24-JAN-2024, entry version 23.
DE RecName: Full=Splicing factor 3B subunit 3 {ECO:0000256|ARBA:ARBA00040929};
GN ORFNames=Z169_06070 {ECO:0000313|EMBL:KFP19646.1};
OS Egretta garzetta (Little egret).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Pelecaniformes; Ardeidae; Egretta.
OX NCBI_TaxID=188379 {ECO:0000313|EMBL:KFP19646.1, ECO:0000313|Proteomes:UP000053119};
RN [1] {ECO:0000313|EMBL:KFP19646.1, ECO:0000313|Proteomes:UP000053119}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_Z169 {ECO:0000313|EMBL:KFP19646.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the RSE1 family.
CC {ECO:0000256|ARBA:ARBA00038266}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KK502030; KFP19646.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A091K789; -.
DR STRING; 188379.A0A091K789; -.
DR Proteomes; UP000053119; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR Gene3D; 1.10.150.910; -; 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 3.
DR InterPro; IPR004871; Cleavage/polyA-sp_fac_asu_C.
DR InterPro; IPR018846; Cleavage/polyA-sp_fac_asu_N.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR PANTHER; PTHR10644; DNA REPAIR/RNA PROCESSING CPSF FAMILY; 1.
DR PANTHER; PTHR10644:SF1; SPLICING FACTOR 3B SUBUNIT 3; 1.
DR Pfam; PF03178; CPSF_A; 1.
DR Pfam; PF10433; MMS1_N; 1.
DR SUPFAM; SSF50978; WD40 repeat-like; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000053119}.
FT DOMAIN 76..602
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10433"
FT DOMAIN 873..1196
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit C-terminal"
FT /evidence="ECO:0000259|Pfam:PF03178"
SQ SEQUENCE 1230 AA; 137107 MW; 7CF2821A57296130 CRC64;
MFLYNLTLQR ATGISYAIHG NFSGTKQQEI VVSRGKILEL LRPDPNTGKV HTLLTVEVFG
VIRSLMAFRL TGGTKDYIVV GSDSGRIVIL EYQPSKNVFE KIHQETFGKS GCRRIVPGQY
LAVDPKGFCQ VMCFRFCCPF LLGAIEKQKL VYILNRDAAA RLTISSPLEA HKANTLVYHV
VGVDVGFENP MFACLEMDYE EADNDPTGEA AANTQQTLTF YELDLGLNHV VRKYSEPLEE
HGNFLITVPG GSDGPSGVLI CSENYITYKN FGDQPDIRCP IPRRRNDLDD PERGMIFVCS
ATHKTKSMFF FLAQTEQGDI FKITLETDED MVTEIRLKYF DTVPVAAAMC VLKTGFLFVA
SEFGNHYLYQ IAHLGDDDEE PEFSSAMPLE EGDTFFFQPR PLKNLVLVDE LDSLSPILCC
QIADLANEDT PQLYVACGRG PRSSLRVLRH GLEVSEMAVS ELPGNPNAVW TVRRHVEDEF
DAYIIVSFVN ATLVLSIGET VEEVTDSGFL GTTPTLSCSL LGDDALVQVY PDGIRHIRAD
KRVNEWKTPG KKTIVKCAVN QRQVVIALTG GELVYFEMDP SGQLNEYTER KEMSADVVCM
SLANVPPGEQ RSRFLAVGLV DNTVRIISLD PSDCLQPLSM QALPAQPESL CIVEMGGTEK
QDELGERGSI GFLYLNIGLQ NGVLLRTVLD PVTGDLSDTR TRYLGSRPVK LFRVRMQGQE
AVLAMSSRSW LSYSYQSRFH LTPLSYETLE FASGFASEQC PEGIVAISTN TLRILALEKL
GAVFNQVAFP LQYTPRKFVI HPESNNLIII ETDHNAYTEA TKAQRKQQMA EEMVEAAGED
ERELAAEMAA AFLNENLPES IFGAPKAGNG QWASVIRVMN PIQGNTLDLV QLEQNEAAFS
VAVCRFSNTG DEWYVLVGVA KDLILNPRSV AGGFVYTYKL VNSGEKLEFL HKTPVEEVPA
AIAPFQGRVL IGVGKLLRVY DLGKKKLLRK CENKVLSFLS QKHIANYICG IQTIGHRVIV
SDVQESFIWV RYKRNENQLI IFADDTYPRW VTTATLLDYD TVAGADKFGN ICVVRLPPNT
NDEVDEDPTG NKALWDRGLL NGASQKAEVI MNYHVGETKT TLIPGGSESL VYTTLSGGIG
ILVPFTSHED HDFFQHVEMH LRSEHPPLCG RDHLSFRSYY FPVKNVIDGD LCEQFNSMEP
NKQKNVAEEL DRTPPEVSKK LEDIRTRYAF
//