ID A0A3P9CF27_9CICH Unreviewed; 886 AA.
AC A0A3P9CF27;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE SubName: Full=von Willebrand factor D and EGF domains {ECO:0000313|Ensembl:ENSMZEP00005020639.1};
GN Name=VWDE {ECO:0000313|Ensembl:ENSMZEP00005020639.1};
OS Maylandia zebra (zebra mbuna).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC Pseudocrenilabrinae; Haplochromini; Maylandia; Maylandia zebra complex.
OX NCBI_TaxID=106582 {ECO:0000313|Ensembl:ENSMZEP00005020639.1, ECO:0000313|Proteomes:UP000265160};
RN [1] {ECO:0000313|Ensembl:ENSMZEP00005020639.1, ECO:0000313|Proteomes:UP000265160}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=25186727; DOI=10.1038/nature13726;
RA Brawand D., Wagner C.E., Li Y.I., Malinsky M., Keller I., Fan S.,
RA Simakov O., Ng A.Y., Lim Z.W., Bezault E., Turner-Maier J., Johnson J.,
RA Alcazar R., Noh H.J., Russell P., Aken B., Alfoldi J., Amemiya C.,
RA Azzouzi N., Baroiller J.F., Barloy-Hubler F., Berlin A., Bloomquist R.,
RA Carleton K.L., Conte M.A., D'Cotta H., Eshel O., Gaffney L., Galibert F.,
RA Gante H.F., Gnerre S., Greuter L., Guyon R., Haddad N.S., Haerty W.,
RA Harris R.M., Hofmann H.A., Hourlier T., Hulata G., Jaffe D.B., Lara M.,
RA Lee A.P., MacCallum I., Mwaiko S., Nikaido M., Nishihara H.,
RA Ozouf-Costaz C., Penman D.J., Przybylski D., Rakotomanga M., Renn S.C.P.,
RA Ribeiro F.J., Ron M., Salzburger W., Sanchez-Pulido L., Santos M.E.,
RA Searle S., Sharpe T., Swofford R., Tan F.J., Williams L., Young S., Yin S.,
RA Okada N., Kocher T.D., Miska E.A., Lander E.S., Venkatesh B., Fernald R.D.,
RA Meyer A., Ponting C.P., Streelman J.T., Lindblad-Toh K., Seehausen O.,
RA Di Palma F.;
RT "The genomic substrate for adaptive radiation in African cichlid fish.";
RL Nature 513:375-381(2014).
RN [2] {ECO:0000313|Ensembl:ENSMZEP00005020639.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A3P9CF27; -.
DR STRING; 106582.ENSMZEP00005020639; -.
DR Ensembl; ENSMZET00005021319.1; ENSMZEP00005020639.1; ENSMZEG00005015481.1.
DR GeneTree; ENSGT00940000160835; -.
DR Proteomes; UP000265160; LG11.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProt.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR CDD; cd00054; EGF_CA; 2.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 2.10.25.10; Laminin; 6.
DR InterPro; IPR015919; Cadherin-like_sf.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR013032; EGF-like_CS.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR013111; EGF_extracell.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR14949:SF50; 3-PHYTASE; 1.
DR PANTHER; PTHR14949; EGF-LIKE-DOMAIN, MULTIPLE 7, 8; 1.
DR Pfam; PF00008; EGF; 1.
DR Pfam; PF07974; EGF_2; 2.
DR Pfam; PF12661; hEGF; 2.
DR SMART; SM00181; EGF; 9.
DR SMART; SM00179; EGF_CA; 2.
DR SUPFAM; SSF49313; Cadherin-like; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 5.
DR PROSITE; PS00010; ASX_HYDROXYL; 1.
DR PROSITE; PS00022; EGF_1; 3.
DR PROSITE; PS01186; EGF_2; 3.
DR PROSITE; PS50026; EGF_3; 5.
DR PROSITE; PS01187; EGF_CA; 1.
DR PROSITE; PS51233; VWFD; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Reference proteome {ECO:0000313|Proteomes:UP000265160}.
FT DOMAIN 1..56
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 579..618
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 620..655
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 755..787
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 788..819
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 851..883
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REGION 76..104
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 130..171
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 83..104
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 137..171
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 608..617
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 624..634
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 759..769
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 777..786
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 791..801
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 809..818
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 855..865
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 873..882
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 886 AA; 96843 MW; 06D65A693EB20AC1 CRC64;
MSLTLRAPIS DWRHTEGLCG TYDGQSENDF HLAGGAKLED LHAFISEWRL PPGNSLFDTV
PSHISTLNSR RHCTCQEEMP RFSSPRAPSQ PTSSSDPSCS NHGNMHFPSV VPTLDVTAEY
ITSVELLSEN EREPLPSRAS RVSTSAYQQP RGRRSAQRFI SNSPHQSLSQ SDLEGVTYFF
PEDHEPAVQP ESPLTWPTPS GLTEQQAQAQ CRQTVANSSI VMGCRHLLKE LFVNHAVTMC
VTDLQLKDEQ SWLNATIPLL ENECERRLLE EGEGEEKYQD AVAILKCPNL CNGNGQCSNW
GCVCFPGFSS YDCSALSDQI PEITTLQKDG LCDVRQGNCT TVNIHGQGFK DSYELKCEFV
KEKFVDGEWV LDEPLFVLAT FLNDTALECQ LPLEDSWVSA GVDLETVTNR PLARWQIKVS
NDGYSYSNAK ILTLYDGACQ MCSLRTEVVC TLREKTCNID GLCYNEGESL PSSPCLACRP
DSSKHTWSIA ENNEPPLLQS LTLPLQSFEG ENFIYQLNAR DPEGSTVLFT LLSGPEGASL
SPAGLLMWKA TAEPTNRHTF QFTVTDDCNA ETRASVEVFV RSCECLNAAS CVASVNLPAG
SGEYVCVCPD GFTGRRCEVD IDNCKPNPCR LGFCIDGLNS FSCVCPPGMT GQLGQSPVEQ
TVIFTSPDQI PLPPQTDDLL TGERDQKQQI SGDFLLYPCG QNMECTLPNT CTCKDGYTGY
SCHIAICRPD CKNRGKCVKP NVCKCPAGYG GPTCEEASCE PPCQHGGTCL ARNLCTCSYG
YVGPRCEIMV CNRHCENGGV CVSPDVCKCQ PGWYGPTCNS ALCKPVCLNG GTCVKPNVCA
CPSGFFGSQC QIAVCNPPCK NGGQCMRNNV CSCPEGYTGK RCNIRE
//