ID A0A182P087_9DIPT Unreviewed; 1317 AA.
AC A0A182P087;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 46.
DE RecName: Full=CUB domain-containing protein {ECO:0008006|Google:ProtNLM};
OS Anopheles epiroticus.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=199890 {ECO:0000313|EnsemblMetazoa:AEPI000319-PA, ECO:0000313|Proteomes:UP000075885};
RN [1] {ECO:0000313|Proteomes:UP000075885}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Epiroticus2 {ECO:0000313|Proteomes:UP000075885};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Howell P., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles epiroticus epiroticus2.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:AEPI000319-PA}
RP IDENTIFICATION.
RC STRAIN=Epiroticus2 {ECO:0000313|EnsemblMetazoa:AEPI000319-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 199890.A0A182P087; -.
DR EnsemblMetazoa; AEPI000319-RA; AEPI000319-PA; AEPI000319.
DR VEuPathDB; VectorBase:AEPI000319; -.
DR OrthoDB; 5471913at2759; -.
DR Proteomes; UP000075885; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0048513; P:animal organ development; IEA:UniProt.
DR GO; GO:0016043; P:cellular component organization; IEA:UniProt.
DR GO; GO:0048731; P:system development; IEA:UniProt.
DR CDD; cd00041; CUB; 1.
DR CDD; cd00054; EGF_CA; 1.
DR CDD; cd00055; EGF_Lam; 2.
DR Gene3D; 2.120.10.80; Kelch-type beta propeller; 2.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 1.
DR InterPro; IPR000859; CUB_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR015915; Kelch-typ_b-propeller.
DR InterPro; IPR002049; LE_dom.
DR InterPro; IPR002165; Plexin_repeat.
DR InterPro; IPR016201; PSI.
DR InterPro; IPR035914; Sperma_CUB_dom_sf.
DR PANTHER; PTHR46376:SF2; DISTRACTED, ISOFORM B; 1.
DR PANTHER; PTHR46376; LEUCINE-ZIPPER-LIKE TRANSCRIPTIONAL REGULATOR 1; 1.
DR Pfam; PF00431; CUB; 1.
DR Pfam; PF13854; Kelch_5; 1.
DR Pfam; PF01437; PSI; 1.
DR SMART; SM00042; CUB; 1.
DR SMART; SM00181; EGF; 3.
DR SMART; SM00180; EGF_Lam; 2.
DR SMART; SM00423; PSI; 5.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF117281; Kelch motif; 2.
DR SUPFAM; SSF49854; Spermadhesin, CUB domain; 1.
DR PROSITE; PS01180; CUB; 1.
DR PROSITE; PS00022; EGF_1; 2.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 2.
DR PROSITE; PS50027; EGF_LAM_2; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|PROSITE-ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Kelch repeat {ECO:0000256|ARBA:ARBA00022441};
KW Laminin EGF-like domain {ECO:0000256|ARBA:ARBA00023292,
KW ECO:0000256|PROSITE-ProRule:PRU00460};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius}.
FT TRANSMEM 1162..1183
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 75..107
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 109..240
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 274..310
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 946..992
FT /note="Laminin EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50027"
FT REGION 1291..1317
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1295..1317
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 97..106
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 300..309
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 964..973
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 976..990
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
SQ SEQUENCE 1317 AA; 146324 MW; 0A0DE3DE4FB108D8 CRC64;
MPLEYLQMFV YLLHKSKYRR KRDNHHHHQH HPRWSMVATA NLWRLSLVLL ALSCHLGPSV
LVASHGLTGV TDSAGGGRCS EVRCMNGGVC KNGTCHCPDG WQGSECQFCG GKVRLTDPSG
SIHDGLGNYS IGVKCSWLID AREHNSITDK VSIGPQHRPS VIRLHLEEFA TECGWDHLYV
YDGDSVESPL LAVFSGLMYR KNFTIRRIPE VFAHSGSALL HFFSDDAYNM SGFNISYQVN
ACPTNDSSLN CSGNGDCWNG ECRCHSGFTG AACNIPRCPN YCSAHLGRGV CDKKQQRCIC
STGYTGNDCS QSIAHGYWTA IDAGETEGFT PPGSASHGAA VYHDTLYVIA GESYGKAEAL
LYMYDFNGKV WETAHTESRP VPELRYGAST VIFGDKIFMY GGVIEGKGVC GELWAFDVSA
KIWENITVKA EQCNDTYEMC GPLRSAGHTA TIVTSYDQLS GGKKESSPQK MVVIFGHSPQ
FGYLNTVQEF HFGTREWKIV QTRGYPVKGG YGHSAAYDPL RERIYVYGGI VSESDSSQLL
SNKLFSYEPH ERLWTLLETA PTARFLHTAN FLTPGLMMVF GGNTHNDTSH SFGAKCYSRD
LMVYDVLCNS WHTQPMPDDL YADLARFGHS AVVFEPSLYI YGGFDGQMIN DMLKFTPGVC
QAFNRTEQCL NARPGVKCVW DIQKSRCLPA ATVPRERLFD RDQEGLEVCP KKSRLVMTQH
ELMDYELCSQ LTTCQGCVST AYGCMFCGIG NGKGICVKEK CPDVSYTFRA DFYPTKALKD
CPDNDEHVCA QLHGCHACTA VSVCHWDYEH SRCQYSRNKS GDALNDACPP ACSGLTSCGN
CTQEECIWCQ NEQRCVDKNA YTASFPYGQC REWTTGSSKC RAASSGKSQC GFYRTCAQCR
DDPACGWCDD GSMTGLGKCL PGGDSGAYDE SECPAQRWHF THCPKCQCNG HSTCPDSNTC
KQPCNDLTVG PNCDKCKPGF WGNPVNGGVC QKCECNGQAQ YCHSDTGKCF CSTKGLAGDH
CEKCDATNHY HGDPSRGSCY YDLTIDYQFT FNLSKKEDRH FTQINFRNSP VKPDIDADFT
ITCSVAARMN ITIRTAGGIE KPLFSAVNCS TFRYRFSKAE HQFGIEDNVT LTTFYVYVYD
FQPPLWIQIA FSQYPKLNLQ QFFITFSTCF LLLLVMAAIL WKIKQHYDMF RRRQRLFVEM
EQMASRPFSQ VLVEIESREY NELSPAVENI TAAPRKRKKD SPSPIALEPC EGNRAAVLSL
LVRLPTGGLQ HSPPGQSAGL AVASALVTLG NPRRASIEHP KEPKTKRKQS QHPDSCI
//