ID A7AVH1_BABBO Unreviewed; 443 AA.
AC A7AVH1;
DT 11-SEP-2007, integrated into UniProtKB/TrEMBL.
DT 11-SEP-2007, sequence version 1.
DT 24-JAN-2024, entry version 44.
DE RecName: Full=Splicing factor Cactin {ECO:0000256|ARBA:ARBA00034534};
GN ORFNames=BBOV_IV002000 {ECO:0000313|EMBL:EDO05797.1};
OS Babesia bovis.
OC Eukaryota; Sar; Alveolata; Apicomplexa; Aconoidasida; Piroplasmida;
OC Babesiidae; Babesia.
OX NCBI_TaxID=5865 {ECO:0000313|EMBL:EDO05797.1, ECO:0000313|Proteomes:UP000002173};
RN [1] {ECO:0000313|EMBL:EDO05797.1, ECO:0000313|Proteomes:UP000002173}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=T2Bo {ECO:0000313|EMBL:EDO05797.1};
RX PubMed=17953480; DOI=10.1371/journal.ppat.0030148;
RA Brayton K.A., Lau A.O.T., Herndon D.R., Hannick L., Kappmeyer L.S.,
RA Berens S.J., Bidwell S.L., Brown W.C., Crabtree J., Fadrosh D.,
RA Feldblum T., Forberger H.A., Haas B.J., Howell J.M., Khouri H., Koo H.,
RA Mann D.J., Norimine J., Paulsen I.T., Radune D., Ren Q., Smith R.K. Jr.,
RA Suarez C.E., White O., Wortman J.R., Knowles D.P. Jr., McElwain T.F.,
RA Nene V.M.;
RT "Genome sequence of Babesia bovis and comparative analysis of apicomplexan
RT hemoprotozoa.";
RL PLoS Pathog. 3:1401-1413(2007).
RN [2] {ECO:0000313|Proteomes:UP000002173}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=33294524; DOI=10.1016/j.dib.2020.106533;
RA Ueti M.W., Johnson W.C., Kappmeyer L.S., Herndon D.R., Mousel M.R.,
RA Reif K.E., Taus N.S., Ifeonu O.O., Silva J.C., Suarez C.E., Brayton K.A.;
RT "Transcriptome dataset of Babesia bovis life stages within vertebrate and
RT invertebrate hosts.";
RL Data Brief 33:106533-106533(2020).
RN [3] {ECO:0000313|Proteomes:UP000002173}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=33069745;
RA Ueti M.W., Johnson W.C., Kappmeyer L.S., Herndon D.R., Mousel M.R.,
RA Reif K.E., Taus N.S., Ifeonu O.O., Silva J.C., Suarez C.E., Brayton K.A.;
RT "Comparative analysis of gene expression between Babesia bovis blood stages
RT and kinetes allowed by improved genome annotation.";
RL Int. J. Parasitol. 51:123-136(2021).
CC -!- SIMILARITY: Belongs to the CACTIN family.
CC {ECO:0000256|ARBA:ARBA00006895}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EDO05797.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAXT01000004; EDO05797.1; -; Genomic_DNA.
DR RefSeq; XP_001609365.1; XM_001609315.1.
DR AlphaFoldDB; A7AVH1; -.
DR STRING; 5865.A7AVH1; -.
DR EnsemblProtists; EDO05797; EDO05797; BBOV_IV002000.
DR GeneID; 5477584; -.
DR KEGG; bbo:BBOV_IV002000; -.
DR VEuPathDB; PiroplasmaDB:BBOV_IV002000; -.
DR eggNOG; KOG2370; Eukaryota.
DR InParanoid; A7AVH1; -.
DR Proteomes; UP000002173; Unassembled WGS sequence.
DR InterPro; IPR019134; Cactin_C.
DR InterPro; IPR018816; Cactin_central.
DR PANTHER; PTHR21737:SF4; CACTIN; 1.
DR PANTHER; PTHR21737; POLYGLUTAMINE BINDING PROTEIN 1/MARVEL MEMBRANE-ASSOCIATING DOMAIN CONTAINING 3; 1.
DR Pfam; PF10312; Cactin_mid; 1.
DR Pfam; PF09732; CactinC_cactus; 1.
DR SMART; SM01050; CactinC_cactus; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000002173}.
FT DOMAIN 111..269
FT /note="Splicing factor cactin central"
FT /evidence="ECO:0000259|Pfam:PF10312"
FT DOMAIN 326..443
FT /note="Splicing factor Cactin C-terminal"
FT /evidence="ECO:0000259|Pfam:PF09732"
FT REGION 1..34
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 48..77
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 9..34
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 443 AA; 50219 MW; FDAEEAD698761786 CRC64;
MPGILINSSG RDPDNNVSKT RDGSQPGKQI HTFTNGIDGK TLKFVVSTNT DENGSSSLSR
GPALWKRKAD ETKNQDLKPS SITDATIAEK GQEANIKRIK SSGIQDTNTG DNYELTEDEF
MYKQALEKSK LRIQDGRANE LDVILSSNSA IADASTLFDE ADKELLSQYL ELLEAKVYFS
RDPEKAFFEA LSTIVKTRLD GQILPTEIGE SVSKKIDEIL STKSVQDLDT YENEIKRKLS
SNAIVDTNFW EFALCRIPYF KACAILREYN NDGALTTRNS TKIQVISPRN VQKTDKKYER
FMQALKLDTD EHIMRSTVEY KSSTGPKPLF AARVTMSYEW NKYNLAHFDV DNPPPKSVQG
FKFSIFYSDL EDPKATPQWK LVKDGNSSET CLLVFKGARP YAPLAFRIPA REWDTDPSRG
FKNCFSDGVL HLYFNFRKLV YRR
//