ID L0B0L5_THEEQ Unreviewed; 1175 AA.
AC L0B0L5;
DT 06-MAR-2013, integrated into UniProtKB/TrEMBL.
DT 06-MAR-2013, sequence version 1.
DT 24-JAN-2024, entry version 38.
DE SubName: Full=CPSF A subunit region domain-containing protein {ECO:0000313|EMBL:AFZ81043.1};
GN ORFNames=BEWA_004510 {ECO:0000313|EMBL:AFZ81043.1};
OS Theileria equi strain WA.
OC Eukaryota; Sar; Alveolata; Apicomplexa; Aconoidasida; Piroplasmida;
OC Theileriidae; Theileria.
OX NCBI_TaxID=1537102 {ECO:0000313|EMBL:AFZ81043.1, ECO:0000313|Proteomes:UP000031512};
RN [1] {ECO:0000313|EMBL:AFZ81043.1, ECO:0000313|Proteomes:UP000031512}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=WA {ECO:0000313|EMBL:AFZ81043.1,
RC ECO:0000313|Proteomes:UP000031512};
RX PubMed=23137308; DOI=10.1186/1471-2164-13-603;
RA Kappmeyer L.S., Thiagarajan M., Herndon D.R., Ramsay J.D., Caler E.,
RA Djikeng A., Gillespie J.J., Lau A.O., Roalson E.H., Silva J.C., Silva M.G.,
RA Suarez C.E., Ueti M.W., Nene V.M., Mealey R.H., Knowles D.P., Brayton K.A.;
RT "Comparative genomic analysis and phylogenetic position of Theileria
RT equi.";
RL BMC Genomics 13:603-603(2012).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the RSE1 family.
CC {ECO:0000256|ARBA:ARBA00038266}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP001670; AFZ81043.1; -; Genomic_DNA.
DR RefSeq; XP_004830709.1; XM_004830652.1.
DR AlphaFoldDB; L0B0L5; -.
DR STRING; 1537102.L0B0L5; -.
DR EnsemblProtists; AFZ81043; AFZ81043; BEWA_004510.
DR GeneID; 15805313; -.
DR KEGG; beq:BEWA_004510; -.
DR VEuPathDB; PiroplasmaDB:BEWA_004510; -.
DR eggNOG; KOG1898; Eukaryota.
DR OrthoDB; 101343at2759; -.
DR Proteomes; UP000031512; Chromosome 3.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 3.
DR InterPro; IPR004871; Cleavage/polyA-sp_fac_asu_C.
DR InterPro; IPR018846; Cleavage/polyA-sp_fac_asu_N.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR PANTHER; PTHR10644; DNA REPAIR/RNA PROCESSING CPSF FAMILY; 1.
DR PANTHER; PTHR10644:SF1; SPLICING FACTOR 3B SUBUNIT 3; 1.
DR Pfam; PF03178; CPSF_A; 1.
DR Pfam; PF10433; MMS1_N; 1.
DR SUPFAM; SSF50978; WD40 repeat-like; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000031512}.
FT DOMAIN 76..590
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10433"
FT DOMAIN 842..1141
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit C-terminal"
FT /evidence="ECO:0000259|Pfam:PF03178"
SQ SEQUENCE 1175 AA; 129494 MW; 1E4587C66641D67F CRC64;
MPVLYHLTLK RPTGITQAVQ GSFSAPKAQE IVVARSHILE LLSPDSNGKL QSICVCDVFG
IVRTMSTFCL TGTQRDYLVV GSDSGRLVVL EFCAVSRSFK RIHCETFGKT GIRRIVPGQY
LAVDPKGRAV IIGAVERQKF VYILNRDAKA NLTISSPLEA HKSHSICFDL VGLDVGFENP
VFASIEQSYE AVDALQIDLD EELTDDALRK GVSFWELDLG LNHVVKKVTL PIDLSAHLLI
PVPGGDGPGG VVVCCENFLV YKNVEHAEVA CAYPRRLEMP QEKSLIIVAY AVHRMKDFFF
ILIQSEFGDV YKVELTYEEG VVKEIVCRYF DTVPVSVSLC ILRSGYLFVA SEFGNHHLYQ
FTGLGTDERD PLCTSLHPHG RSAIIAFKPR ALQNLQLVDE LSSLSAITDM KVADIQGLGQ
HQIFLGCGRG SRSSLRVLRY GIAIEGLASS ELPGRPKSVW TVRSSFESAY DGFIIVGFEG
NTLVLSVGEA VEEVTDSCFL TSITTLHVAL MGDGSFIQVH DAGIRHVYDQ RVKEWRAPSS
KRVKVAASND RQVILGLSGG DVIYFEIDDS GNLVEYAKKS LSVEISCLDL QPTPKGRILA
NFMAIGTLDN SVRVLTLDKS LKVVSTQILS NNSTPESVCI SEFAVGDSSL VYLHVGLNTG
VMLRSTVDPI SGALSDQESR FLGGRAVKFR RVSLGSSFAI VALSDKPWLI YTHRGILLVS
PINVGTLESA DSLISPICPD GFVAVSGNTL RIFRCTSLGE TFAESQLPLT YTPRKLVLMP
SEAPSVGSLN YMLAIVESDH ARYNEDQSLE IKNVHAGIEL PSDYCESLDY TDFKAEPGKW
GSCLRIVNPL TLETVAKLLF ETDEAAMCAQ VVVLDGIQCL VVGTAIGMNL KGDPDSVSGY
LRVYAYGANY EIRLLHATPI TGVPRALAGY EGKLICALGS RLRLYALGKR QLLLKAEHRT
CTDHGFIWIS VCGSRIFAGD IREGFQLLRL RFYAEDAAEF EWIGHSTGPR WLSCCEQLDY
HTVIGGDKFD SLFIARVPQE EFTKATQFEN HAQFHLGDLP TAISKVSFNN MSQPIVIYST
ILGSIGAFIP YANKDELDLM QHLEMIMANE HPPLCGREHA FFRSYYYPVQ NIVDGDLCEQ
VKTLPEAVQR KIATQLDTNV HTLLRKVDDV RNRIL
//