ID A0A087XEY0_POEFO Unreviewed; 732 AA.
AC A0A087XEY0;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 2.
DT 27-MAR-2024, entry version 50.
DE SubName: Full=Intestinal mucin-like protein {ECO:0000313|Ensembl:ENSPFOP00000004333.2};
OS Poecilia formosa (Amazon molly) (Limia formosa).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC Poecilia.
OX NCBI_TaxID=48698 {ECO:0000313|Ensembl:ENSPFOP00000004333.2, ECO:0000313|Proteomes:UP000028760};
RN [1] {ECO:0000313|Proteomes:UP000028760}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=female {ECO:0000313|Proteomes:UP000028760};
RA Schartl M., Warren W.;
RL Submitted (OCT-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSPFOP00000004333.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00039}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AYCK01026843; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AYCK01026844; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; A0A087XEY0; -.
DR Ensembl; ENSPFOT00000004341.2; ENSPFOP00000004333.2; ENSPFOG00000004329.2.
DR GeneTree; ENSGT00940000163235; -.
DR OMA; DFDPSSC; -.
DR Proteomes; UP000028760; Unassembled WGS sequence.
DR CDD; cd19941; TIL; 1.
DR Gene3D; 2.10.25.10; Laminin; 1.
DR InterPro; IPR006207; Cys_knot_C.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001007; VWF_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF398; MUCIN-2-LIKE-RELATED; 1.
DR Pfam; PF08742; C8; 1.
DR Pfam; PF00094; VWD; 1.
DR SMART; SM00832; C8; 1.
DR SMART; SM00041; CT; 1.
DR SMART; SM00214; VWC; 3.
DR SMART; SM00216; VWD; 1.
DR SUPFAM; SSF57567; Serine protease inhibitors; 1.
DR PROSITE; PS01185; CTCK_1; 1.
DR PROSITE; PS01225; CTCK_2; 1.
DR PROSITE; PS01208; VWFC_1; 1.
DR PROSITE; PS50184; VWFC_2; 2.
DR PROSITE; PS51233; VWFD; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00039}; Reference proteome {ECO:0000313|Proteomes:UP000028760};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 48..234
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 377..446
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 484..551
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 631..716
FT /note="CTCK"
FT /evidence="ECO:0000259|PROSITE:PS01225"
FT DISULFID 654..708
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00039"
FT DISULFID 658..710
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00039"
SQ SEQUENCE 732 AA; 81048 MW; C9805BAF80315043 CRC64;
MARCIGENQI EIVPYECPTI KPITCANGKD PVLVYDYYHC CQQYECDCEC EGWGDPHYIT
FDGKYYSYQG NCTYYLMKEI RPTHNLEILI ENVYCGPTED VSCPRALIVN YGAQSIKLIN
FNLGGRPDLK AFKNEDEDNL RLPYFKDGVK VVSTGINLVL EILRLNVVVK FGRTGFSINL
PYEYFGGNTQ GHCGTCSNNQ DDDCRLRNGT VVENCGVMAD DWLLEKDKGK TGCLPKRTSP
QKTCKHNPDS VCELLKDSSG VFAACHSQIS PDNFYTGCIF DGCYVHNRAV ECTSLETYAA
ACAEIGICID WRNHTKICAS SCPSGKIYRS CGPADQPSCE DNPNDPVVNY TTEGCFCPEG
QKLFSKESNI CVKSCGCLDP TGTSREFNET FEYNCQTCVC DESTKTVTCK PKTCPPTALP
RCMGPGYVLV NKTDPSDQCC NVHVCQCQSH ACPDINMNCD VGFMPNISVP EGKCCPERTC
EPKRVCVLNS VEYPPGSSVP GQKCENCFCS SNSSSGGLME IKCEKQQCEK TCRKGFEYKK
TNSDDCCGTC VQTQCVFVVN GTETLLKEGE TWSPPENKCE SKTCVKNGET FTVTNKHIIC
PAFQESNCKN DTIQTAANGC CKICVEKENA CRLFNRTTPI NHNGCQTELN MPSCEGSCDT
FTKYSEAAAA MEHSCSCCKE RRASNRTVTL ACEDGAHVQF TYVHVEECGC GHTECTTPAA
LHVRRKRRFT LQ
//