GenomeNet

Database: UniProt
Entry: A0A182NG25_9DIPT
LinkDB: A0A182NG25_9DIPT
Original site: A0A182NG25_9DIPT 
ID   A0A182NG25_9DIPT        Unreviewed;       933 AA.
AC   A0A182NG25;
DT   07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT   07-SEP-2016, sequence version 1.
DT   13-SEP-2023, entry version 35.
DE   RecName: Full=Pre-mRNA-splicing factor CDC5/CEF1 {ECO:0008006|Google:ProtNLM};
OS   Anopheles dirus.
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Anophelinae; Anopheles.
OX   NCBI_TaxID=7168 {ECO:0000313|EnsemblMetazoa:ADIR006597-PA, ECO:0000313|Proteomes:UP000075884};
RN   [1] {ECO:0000313|Proteomes:UP000075884}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=WRAIR2 {ECO:0000313|Proteomes:UP000075884};
RG   The Broad Institute Genomics Platform;
RA   Neafsey D.E., Walton C., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA   Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA   Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA   Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA   Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA   Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA   Birren B.;
RT   "The Genome Sequence of Anopheles dirus WRAIR2.";
RL   Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EnsemblMetazoa:ADIR006597-PA}
RP   IDENTIFICATION.
RC   STRAIN=WRAIR2 {ECO:0000313|EnsemblMetazoa:ADIR006597-PA};
RG   EnsemblMetazoa;
RL   Submitted (MAY-2020) to UniProtKB.
CC   -!- SIMILARITY: Belongs to the CEF1 family.
CC       {ECO:0000256|ARBA:ARBA00010506}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; A0A182NG25; -.
DR   STRING; 7168.A0A182NG25; -.
DR   EnsemblMetazoa; ADIR006597-RA; ADIR006597-PA; ADIR006597.
DR   VEuPathDB; VectorBase:ADIR006597; -.
DR   OrthoDB; 131128at2759; -.
DR   Proteomes; UP000075884; Unassembled WGS sequence.
DR   GO; GO:0000974; C:Prp19 complex; IEA:InterPro.
DR   GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR   GO; GO:0000398; P:mRNA splicing, via spliceosome; IEA:InterPro.
DR   CDD; cd00167; SANT; 1.
DR   CDD; cd11659; SANT_CDC5_II; 1.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 2.
DR   InterPro; IPR047242; CDC5L/Cef1.
DR   InterPro; IPR021786; Cdc5p/Cef1_C.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR017930; Myb_dom.
DR   InterPro; IPR001005; SANT/Myb.
DR   InterPro; IPR047240; SANT_CDC5L_II.
DR   PANTHER; PTHR45885; CELL DIVISION CYCLE 5-LIKE PROTEIN; 1.
DR   PANTHER; PTHR45885:SF1; CELL DIVISION CYCLE 5-LIKE PROTEIN; 1.
DR   Pfam; PF11831; Myb_Cef; 1.
DR   Pfam; PF13921; Myb_DNA-bind_6; 1.
DR   SMART; SM00717; SANT; 2.
DR   SUPFAM; SSF46689; Homeodomain-like; 1.
DR   PROSITE; PS51294; HTH_MYB; 2.
DR   PROSITE; PS50090; MYB_LIKE; 2.
PE   3: Inferred from homology;
KW   Coiled coil {ECO:0000256|SAM:Coils};
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW   mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW   mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Spliceosome {ECO:0000256|ARBA:ARBA00022728}.
FT   DOMAIN          1..58
FT                   /note="HTH myb-type"
FT                   /evidence="ECO:0000259|PROSITE:PS51294"
FT   DOMAIN          3..54
FT                   /note="Myb-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50090"
FT   DOMAIN          55..104
FT                   /note="Myb-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50090"
FT   DOMAIN          59..108
FT                   /note="HTH myb-type"
FT                   /evidence="ECO:0000259|PROSITE:PS51294"
FT   REGION          108..145
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          254..275
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          813..933
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COILED          686..812
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COMPBIAS        813..833
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        851..868
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        881..901
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        902..933
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   933 AA;  105346 MW;  19542DD0D555C784 CRC64;
     MPRIMIKGGV WRNTEDEILK AAVMKYGKNQ WSRIASLLHR KSAKQCKARW YEWLDPSIKK
     TEWSREEDEK LLHLAKLMPT QWRTIAPIIG RTAAQCLERY EYLLDQAQRK EEGEDGMDDP
     RKLKPGEIDP NPETKPARPD PKDMDEDELE MLSEARARLA NTQGKKAKRK AREKQLEEAR
     RLAALQKRRE LRAAGIGVGN RKRKLKGIDY NSEIPFEKTP ALGFYDTTEE FVVPIAADFS
     SLRQQTLDGE LRVEKEARER KKDKEKLKQR KENDIPTALL KNQEPAKKRS KLVLPEPQIS
     DQELQQVVKL GRASEIAKEV ASESGVETTD ALLADYSITP QVAATPRTPA PVTDRILQEA
     QNMMALTHVE TPLKGGVNTP LHQSDFSGVL PQSQAVATPN TVLATPFRSV RGPDGSATPG
     GFLTPASGAM VPVGSGTQPH VPGATPNFLR DKLNINTEDG MSVAETPAAY KSYQKQLKST
     LKEGLASLPA PRNDYEIVVP ESETDEAADD GTMDMEQMVP DQADVDEKRV RNKLAQEAKE
     LSLRSQVIQR DLPRPLEINT TVLRPSNEMH GLTDLQKAEE LVKQEMVKML NYDALRNPIT
     QVPAAALMKR PALTQYQAYL EQHPYENIDE SELSEARKLL AEEMGVVKQG MAHGDLSLES
     YTQVWQECLS QVLYLPSQNR YTRANLASKK DRIESAEKRL EINRRHMAKE AKRCGKIEKK
     LKILTAGYQA RAQALVKQFQ DTNEQIEQNS LALSTFQFLA AQEDLAIPKR LESLTEDVMR
     QTEREKSLQT RYAQMTDELD ELNRLLEDAR VAGVERDERE KDTVPRVNGR LEQHEEDEDD
     EAAVNAEEPE NGSQDCVQED SGDSNATEPQ EADGEHAADE NEEPSENQRT ENERTSDSEE
     KDDDSTSEYP EAPEQQDEPM EQDNDEENQT NDD
//
DBGET integrated database retrieval system