ID A0A182NG25_9DIPT Unreviewed; 933 AA.
AC A0A182NG25;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 13-SEP-2023, entry version 35.
DE RecName: Full=Pre-mRNA-splicing factor CDC5/CEF1 {ECO:0008006|Google:ProtNLM};
OS Anopheles dirus.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=7168 {ECO:0000313|EnsemblMetazoa:ADIR006597-PA, ECO:0000313|Proteomes:UP000075884};
RN [1] {ECO:0000313|Proteomes:UP000075884}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=WRAIR2 {ECO:0000313|Proteomes:UP000075884};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Walton C., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles dirus WRAIR2.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:ADIR006597-PA}
RP IDENTIFICATION.
RC STRAIN=WRAIR2 {ECO:0000313|EnsemblMetazoa:ADIR006597-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SIMILARITY: Belongs to the CEF1 family.
CC {ECO:0000256|ARBA:ARBA00010506}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A182NG25; -.
DR STRING; 7168.A0A182NG25; -.
DR EnsemblMetazoa; ADIR006597-RA; ADIR006597-PA; ADIR006597.
DR VEuPathDB; VectorBase:ADIR006597; -.
DR OrthoDB; 131128at2759; -.
DR Proteomes; UP000075884; Unassembled WGS sequence.
DR GO; GO:0000974; C:Prp19 complex; IEA:InterPro.
DR GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IEA:InterPro.
DR CDD; cd00167; SANT; 1.
DR CDD; cd11659; SANT_CDC5_II; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 2.
DR InterPro; IPR047242; CDC5L/Cef1.
DR InterPro; IPR021786; Cdc5p/Cef1_C.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017930; Myb_dom.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR047240; SANT_CDC5L_II.
DR PANTHER; PTHR45885; CELL DIVISION CYCLE 5-LIKE PROTEIN; 1.
DR PANTHER; PTHR45885:SF1; CELL DIVISION CYCLE 5-LIKE PROTEIN; 1.
DR Pfam; PF11831; Myb_Cef; 1.
DR Pfam; PF13921; Myb_DNA-bind_6; 1.
DR SMART; SM00717; SANT; 2.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS51294; HTH_MYB; 2.
DR PROSITE; PS50090; MYB_LIKE; 2.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Spliceosome {ECO:0000256|ARBA:ARBA00022728}.
FT DOMAIN 1..58
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT DOMAIN 3..54
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 55..104
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 59..108
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT REGION 108..145
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 254..275
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 813..933
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 686..812
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 813..833
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 851..868
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 881..901
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 902..933
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 933 AA; 105346 MW; 19542DD0D555C784 CRC64;
MPRIMIKGGV WRNTEDEILK AAVMKYGKNQ WSRIASLLHR KSAKQCKARW YEWLDPSIKK
TEWSREEDEK LLHLAKLMPT QWRTIAPIIG RTAAQCLERY EYLLDQAQRK EEGEDGMDDP
RKLKPGEIDP NPETKPARPD PKDMDEDELE MLSEARARLA NTQGKKAKRK AREKQLEEAR
RLAALQKRRE LRAAGIGVGN RKRKLKGIDY NSEIPFEKTP ALGFYDTTEE FVVPIAADFS
SLRQQTLDGE LRVEKEARER KKDKEKLKQR KENDIPTALL KNQEPAKKRS KLVLPEPQIS
DQELQQVVKL GRASEIAKEV ASESGVETTD ALLADYSITP QVAATPRTPA PVTDRILQEA
QNMMALTHVE TPLKGGVNTP LHQSDFSGVL PQSQAVATPN TVLATPFRSV RGPDGSATPG
GFLTPASGAM VPVGSGTQPH VPGATPNFLR DKLNINTEDG MSVAETPAAY KSYQKQLKST
LKEGLASLPA PRNDYEIVVP ESETDEAADD GTMDMEQMVP DQADVDEKRV RNKLAQEAKE
LSLRSQVIQR DLPRPLEINT TVLRPSNEMH GLTDLQKAEE LVKQEMVKML NYDALRNPIT
QVPAAALMKR PALTQYQAYL EQHPYENIDE SELSEARKLL AEEMGVVKQG MAHGDLSLES
YTQVWQECLS QVLYLPSQNR YTRANLASKK DRIESAEKRL EINRRHMAKE AKRCGKIEKK
LKILTAGYQA RAQALVKQFQ DTNEQIEQNS LALSTFQFLA AQEDLAIPKR LESLTEDVMR
QTEREKSLQT RYAQMTDELD ELNRLLEDAR VAGVERDERE KDTVPRVNGR LEQHEEDEDD
EAAVNAEEPE NGSQDCVQED SGDSNATEPQ EADGEHAADE NEEPSENQRT ENERTSDSEE
KDDDSTSEYP EAPEQQDEPM EQDNDEENQT NDD
//