ID E3MWG7_CAERE Unreviewed; 2161 AA.
AC E3MWG7;
DT 11-JAN-2011, integrated into UniProtKB/TrEMBL.
DT 11-JAN-2011, sequence version 1.
DT 27-MAR-2024, entry version 72.
DE SubName: Full=CRE-MUP-4 protein {ECO:0000313|EMBL:EFP10615.1};
GN Name=Cre-mup-4 {ECO:0000313|EMBL:EFP10615.1};
GN ORFNames=CRE_01151 {ECO:0000313|EMBL:EFP10615.1};
OS Caenorhabditis remanei (Caenorhabditis vulgaris).
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=31234 {ECO:0000313|Proteomes:UP000008281};
RN [1] {ECO:0000313|Proteomes:UP000008281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PB4641 {ECO:0000313|Proteomes:UP000008281};
RG Caenorhabditis remanei Sequencing Consortium;
RA Wilson R.K.;
RT "PCAP assembly of the Caenorhabditis remanei genome.";
RL Submitted (JUL-2007) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DS268487; EFP10615.1; -; Genomic_DNA.
DR RefSeq; XP_003099473.1; XM_003099425.1.
DR STRING; 31234.E3MWG7; -.
DR EnsemblMetazoa; CRE01151.1; CRE01151.1; WBGene00077349.
DR eggNOG; KOG1217; Eukaryota.
DR HOGENOM; CLU_000420_0_0_1; -.
DR InParanoid; E3MWG7; -.
DR OMA; HGDCIHD; -.
DR OrthoDB; 2872525at2759; -.
DR Proteomes; UP000008281; Unassembled WGS sequence.
DR GO; GO:0030056; C:hemidesmosome; IEA:EnsemblMetazoa.
DR GO; GO:0098733; C:hemidesmosome associated protein complex; IEA:EnsemblMetazoa.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR CDD; cd00054; EGF_CA; 5.
DR Gene3D; 2.10.25.10; Laminin; 16.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR013032; EGF-like_CS.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR000082; SEA_dom.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24039; FIBRILLIN-RELATED; 1.
DR PANTHER; PTHR24039:SF28; FIBULIN-1; 1.
DR Pfam; PF00008; EGF; 2.
DR Pfam; PF07645; EGF_CA; 7.
DR Pfam; PF12661; hEGF; 5.
DR Pfam; PF00092; VWA; 1.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00181; EGF; 26.
DR SMART; SM00179; EGF_CA; 19.
DR SMART; SM00200; SEA; 2.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 5.
DR SUPFAM; SSF57184; Growth factor receptor domain; 1.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 12.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 16.
DR PROSITE; PS01187; EGF_CA; 2.
DR PROSITE; PS50024; SEA; 2.
DR PROSITE; PS50234; VWFA; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000008281};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..15
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 16..2161
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012226487"
FT TRANSMEM 1918..1941
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 71..110
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 122..163
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 175..213
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 278..315
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 377..416
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 437..612
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 844..882
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 894..932
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1011..1050
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1112..1151
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1162..1201
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1210..1249
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1256..1295
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1365..1487
FT /note="SEA"
FT /evidence="ECO:0000259|PROSITE:PS50024"
FT DOMAIN 1538..1663
FT /note="SEA"
FT /evidence="ECO:0000259|PROSITE:PS50024"
FT DOMAIN 1674..1710
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1769..1811
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1829..1867
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1874..1910
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REGION 746..771
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2035..2054
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2123..2161
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2136..2153
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 1881..1898
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1900..1909
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 2161 AA; 236197 MW; 0A03AAA3A45332B7 CRC64;
MRWVLLVLLP LIASAATTYQ HRQTYSSLQC RVNDPLSCNQ AKSEVCVFVN GQYRCECPVG
VSRLPDGRCM VVNECARPSL NACHKDAQCI DLAEGYTCRC NSGFADTSPD KVNKPGRQCQ
KTMNECGAKS TYGVDCDENA ACVDTPEGFQ CVCQPGFVDV STSISKLPGR KCVESVNECT
NGEADCSNNA DCFDRADGYE CKCRPGFVDA SPNVDKYPGR VCNKPKAPEY YGQQSRQPQC
SEGSGCGPNE ECRFNTAGER VCQCRRGSVQ QSNGVCKVFS QCEQANECDR NAFCSNTYDG
PKCQCKDGFL DVSPDPIRLP GRKCQQVKNE CADGSHDCSH QADCQDTPTG YICTCKSNCI
DVSSRYNLPP GRKCSTAANQ CSDKSLNSCD ENADCVQLPD GYTCKCFAGY VDVSSNANLP
PGRVCTLSTA CPAQPTDLVF LIDGSGSIGS YVFQTEVLRF LAEFTELFDI APQKTRVSVV
QYSDQIRHEF GLDNYSDRKS LQNAIRNIEY LTGLTRTGAA IEHVANEAFS ERRGARPVGQ
VSRVAIVITD GRSQDNVTRP SDNARKQEIQ LFAVGVTNHV LDAELEEISG AKDRTFHVSG
FEDLNTRLRS AIQRVACPHQ NNEDTYNKGP CDPSNHNGCD RSLNQVCQQK DGKFVCVCPA
GFDIHPVTKV CGGDICNPEI ATSCPDPEIC EKTPFGNWRC TCPADLGWRD KFTGICSKLG
KCELLKSARN SLKPLTVPTS AHQTTCTAAQ PMRSARKEPA ESSSANAMPD SNVTAVPTNA
RLQEPVIQEC QILVTPERRR SVFQMDVERS RACVIDIIRD IQSLIFALRE EQRIGCHLTS
RFSVIDECAA GVADCDPNAK CTDTDESYIC TCNEGFLDKS PEQNKKPGRV CSKQRNECLD
GTHNCSMNAE CIDLPDGFLC RCKEDFVDIS PNPNAFGGID CRALVNECLI PGGHNCHEHA
ICIGELGGTG GGSKSEKLII SDTRDSYKCQ CKEGYVDHDE LRNPGRTCKK LNQICESGKH
ECDKNARCVE KGANDYECVC NAGFIDKSPL AHRPGRKCVE PICSDDSKHD CHSAAICEEN
DSVPEKYTCK CRDGYLDVGA NGGKSGRECK ELVNECLSAS LNSCDAAATC IDLDDGYTCK
CPLGSKDESP DPKLPGRSCK GLVNECNIPH LNNCSHFATC IDLEEGYECK CKAEYYDQKP
EQPGTQCKFI INECLAENLN DCSPNAMCID KIDGYECKCK APFEDQMPAT PGRICRFDEC
ANPKDNDCDK NALCIDTDDS YTCQCKEGFF DEISDPKKPG RVCIEVGLVI ETPNQSEDPT
TPDPNTIKCG NGFCHLNLGE VCVGGATCAC RPGESRDNEK EKCVPTTSIP LVVRVMEYDG
EPIQYRTDYS KPDTPAHVEI VDAVRKSVGK IIGKTEFAPR FVTTDVNYIT NPKVQNSDWD
KGLLGNVTIH LAGKEEVDKC RVYEQFSEIV REMGGRVDRI KLSDDADLDP CRKEDVKKGI
PCGNTFCSIE LGEECIAGRI CGCPKGQKRK DASSPCRAVE SWNLPLYVVR DGHEKITYSP
SLSNPLNDEH KSLVSRFESG IGQSYDKTPL KSAFVTAEVN EIENPESRKK SWDTGILYNF
TSHFVKGSVA EPASVFNDLI DYIQKRNNFE VSGGTFWLSV GTSKLFISPE QLNPFSACYH
SDCHPDAICK EVGKGYECSC PDGFRDLNPS RPGRNCLSYR GVNECEKPEL NECSPHARCI
DLDYLYKCEC IRPYVNSAVG DALPGSVCSI DYCQDVNYCP LNSTCVNVDE QTYDGQARCD
CKPGFVDLRK SGHLSEAGLG DAICLKQFDI DECALGLHNC SAAATCIDKK IGYDCKCQEG
YEDGNPSLPG RICAAALCGL CNGHGDCIHD ALSTNITCAC VDGYTGEFCE TAPSTLPLLL
MLLLALIFLI LTLCCCLYFC LSWRCFGARG RSEGSASGQE ILGSDYYTIP RAKLARPLYG
EDMGDDHAGA LAAYLDDGAS ISSDGSIEEI ERRVTTDVTT REVRTTTVRD ESGNVISQSQ
TVSHGNPHET DTEQYGMISS DHYKTSASEA MDAAMSASAS GGAYHHSSGG AAAMSSASRS
AYNQGYASDS EDSDAGHAVY DRTTRTNQSH DFEPGADPRT GTERSKREFV TTTKAEEVNY
F
//