ID A0A182T3U1_9DIPT Unreviewed; 862 AA.
AC A0A182T3U1;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE RecName: Full=EGF-like domain-containing protein {ECO:0000259|PROSITE:PS50026};
OS Anopheles maculatus.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles; Anopheles maculatus group.
OX NCBI_TaxID=74869 {ECO:0000313|EnsemblMetazoa:AMAM019088-PA, ECO:0000313|Proteomes:UP000075901};
RN [1] {ECO:0000313|Proteomes:UP000075901}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=maculatus3 {ECO:0000313|Proteomes:UP000075901};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Besansky N., Howell P., Walton C., Young S.K., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles maculatus species B.";
RL Submitted (SEP-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:AMAM019088-PA}
RP IDENTIFICATION.
RC STRAIN=maculatus3 {ECO:0000313|EnsemblMetazoa:AMAM019088-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A182T3U1; -.
DR EnsemblMetazoa; AMAM019088-RA; AMAM019088-PA; AMAM019088.
DR VEuPathDB; VectorBase:AMAM019088; -.
DR OrthoDB; 5391644at2759; -.
DR Proteomes; UP000075901; Unassembled WGS sequence.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR003645; Fol_N.
DR PANTHER; PTHR22963:SF39; EGF-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR22963; ENDOGLIN-RELATED; 1.
DR Pfam; PF00008; EGF; 1.
DR SMART; SM00181; EGF; 10.
DR SMART; SM00179; EGF_CA; 3.
DR SMART; SM00274; FOLN; 4.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 1.
DR PROSITE; PS01186; EGF_2; 2.
DR PROSITE; PS50026; EGF_3; 4.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}.
FT DOMAIN 8..44
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 161..197
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 724..763
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 766..805
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
SQ SEQUENCE 862 AA; 91433 MW; 1B6645ED109ADCCB CRC64;
VIAIVRKLPS VCKERPCASN AKCRNSRGSY RCSCPTGLVG DPYQGGCKRA AECERDSDCP
ETAECVQENG ESKCRDVCAN VTCGPNAECA ARKHNADCKC RSRFEGDPKD LTNGCSPKPL
SCKRNTDCPE NSYCHGQICK PACSETDECN QDEVCFNGQC INPCHEVNAC GMNAECLMGA
HTKQCSCPAG FTGEAAVECV RVPISCASSA DCVDGSICKE SMCLPRCRND QECALNEKCL
QGSCMLTCRL DNDCFLGHIC LTGRCVYGCH ADSDCSASET CRDNRCVNPC SDNPCGPNAA
CTVVNHRASC SCFNGMVPSP TAKVGCVRAP ALQCTENRDC VDGTSCIDRL CRPVCGNDQG
CLNNERCDGG SCKPICRKDD DCRTGEICQG QTCMIGCRSD SGCSDHLACI AQQCTDPCQE
PTACGTNAEC VVISHKKQCS CPAGLVGDPF NLGCRQETHL CQARTDCPKG QACYGGTCMQ
TCRNDQNCLA DERCVRGTCR TVCNSDGACG NGLICEGRIC QTGCRSDNQC ANNQACINKK
CTDPCATLGQ CGSCSECTVI DHGVQCSCPH GYLGNPLLSC SPPAEKCHAQ CICDDDGMYC
VKSCRQAKDC GCGQTCHRGK CRTKCNPGNC PAGLLCQNGA CVAGCRTNAD CPSDRSCTNG
KCVDPCAGGK ACGRDAICQV SDHRSLCLCP DGFQGDPSVG CVQYECQTND DCELDKKCAS
GKCINPCLIP GACGLNAQCR VVNRQAQCSC TPGFFGNARQ ECQPVQKNSC AQNPCGDNTV
CREDENGYEC SCQPGCVGDP RQGCLCEGKL KKDDCEQYAC GTNAVCRMTE WGAPSCVCLP
THPHGDPYMS CECGEFIPSS NG
//