ID H2YIS8_CIOSA Unreviewed; 2560 AA.
AC H2YIS8;
DT 18-APR-2012, integrated into UniProtKB/TrEMBL.
DT 18-APR-2012, sequence version 1.
DT 27-MAR-2024, entry version 71.
DE RecName: Full=Multiple EGF-like-domains 8 {ECO:0008006|Google:ProtNLM};
OS Ciona savignyi (Pacific transparent sea squirt).
OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Phlebobranchia;
OC Cionidae; Ciona.
OX NCBI_TaxID=51511 {ECO:0000313|Ensembl:ENSCSAVP00000005227.1, ECO:0000313|Proteomes:UP000007875};
RN [1] {ECO:0000313|Proteomes:UP000007875}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Birren B., Nusbaum C., Abebe A., Abouelleil A., Adekoya E., Ait-zahra M.,
RA Allen N., Allen T., An P., Anderson M., Anderson S., Arachchi H.,
RA Armbruster J., Bachantsang P., Baldwin J., Barry A., Bayul T.,
RA Blitshsteyn B., Bloom T., Blye J., Boguslavskiy L., Borowsky M.,
RA Boukhgalter B., Brunache A., Butler J., Calixte N., Calvo S., Camarata J.,
RA Campo K., Chang J., Cheshatsang Y., Citroen M., Collymore A., Considine T.,
RA Cook A., Cooke P., Corum B., Cuomo C., David R., Dawoe T., Degray S.,
RA Dodge S., Dooley K., Dorje P., Dorjee K., Dorris L., Duffey N., Dupes A.,
RA Elkins T., Engels R., Erickson J., Farina A., Faro S., Ferreira P.,
RA Fischer H., Fitzgerald M., Foley K., Gage D., Galagan J., Gearin G.,
RA Gnerre S., Gnirke A., Goyette A., Graham J., Grandbois E., Gyaltsen K.,
RA Hafez N., Hagopian D., Hagos B., Hall J., Hatcher B., Heller A.,
RA Higgins H., Honan T., Horn A., Houde N., Hughes L., Hulme W., Husby E.,
RA Iliev I., Jaffe D., Jones C., Kamal M., Kamat A., Kamvysselis M.,
RA Karlsson E., Kells C., Kieu A., Kisner P., Kodira C., Kulbokas E.,
RA Labutti K., Lama D., Landers T., Leger J., Levine S., Lewis D., Lewis T.,
RA Lindblad-toh K., Liu X., Lokyitsang T., Lokyitsang Y., Lucien O., Lui A.,
RA Ma L.J., Mabbitt R., Macdonald J., Maclean C., Major J., Manning J.,
RA Marabella R., Maru K., Matthews C., Mauceli E., Mccarthy M., Mcdonough S.,
RA Mcghee T., Meldrim J., Meneus L., Mesirov J., Mihalev A., Mihova T.,
RA Mikkelsen T., Mlenga V., Moru K., Mozes J., Mulrain L., Munson G.,
RA Naylor J., Newes C., Nguyen C., Nguyen N., Nguyen T., Nicol R., Nielsen C.,
RA Nizzari M., Norbu C., Norbu N., O'donnell P., Okoawo O., O'leary S.,
RA Omotosho B., O'neill K., Osman S., Parker S., Perrin D., Phunkhang P.,
RA Piqani B., Purcell S., Rachupka T., Ramasamy U., Rameau R., Ray V.,
RA Raymond C., Retta R., Richardson S., Rise C., Rodriguez J., Rogers J.,
RA Rogov P., Rutman M., Schupbach R., Seaman C., Settipalli S., Sharpe T.,
RA Sheridan J., Sherpa N., Shi J., Smirnov S., Smith C., Sougnez C.,
RA Spencer B., Stalker J., Stange-thomann N., Stavropoulos S., Stetson K.,
RA Stone C., Stone S., Stubbs M., Talamas J., Tchuinga P., Tenzing P.,
RA Tesfaye S., Theodore J., Thoulutsang Y., Topham K., Towey S., Tsamla T.,
RA Tsomo N., Vallee D., Vassiliev H., Venkataraman V., Vinson J., Vo A.,
RA Wade C., Wang S., Wangchuk T., Wangdi T., Whittaker C., Wilkinson J.,
RA Wu Y., Wyman D., Yadav S., Yang S., Yang X., Yeager S., Yee E., Young G.,
RA Zainoun J., Zembeck L., Zimmer A., Zody M., Lander E.;
RL Submitted (AUG-2003) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSCSAVP00000005227.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004479}; Single-
CC pass type I membrane protein {ECO:0000256|ARBA:ARBA00004479}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 51511.ENSCSAVP00000005227; -.
DR Ensembl; ENSCSAVT00000005298.1; ENSCSAVP00000005227.1; ENSCSAVG00000003112.1.
DR eggNOG; KOG1388; Eukaryota.
DR GeneTree; ENSGT00940000174104; -.
DR HOGENOM; CLU_000612_0_0_1; -.
DR InParanoid; H2YIS8; -.
DR TreeFam; TF321873; -.
DR Proteomes; UP000007875; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR CDD; cd00041; CUB; 1.
DR CDD; cd00054; EGF_CA; 1.
DR CDD; cd00055; EGF_Lam; 3.
DR Gene3D; 2.120.10.80; Kelch-type beta propeller; 3.
DR Gene3D; 2.10.25.10; Laminin; 6.
DR Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 2.
DR InterPro; IPR000859; CUB_dom.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR024731; EGF_dom.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR015915; Kelch-typ_b-propeller.
DR InterPro; IPR006652; Kelch_1.
DR InterPro; IPR002049; LE_dom.
DR InterPro; IPR016201; PSI.
DR InterPro; IPR035914; Sperma_CUB_dom_sf.
DR PANTHER; PTHR46093; ACYL-COA-BINDING DOMAIN-CONTAINING PROTEIN 5; 1.
DR PANTHER; PTHR46093:SF17; MULTIPLE EGF-LIKE-DOMAINS 8; 1.
DR Pfam; PF00431; CUB; 1.
DR Pfam; PF12947; EGF_3; 1.
DR Pfam; PF13415; Kelch_3; 2.
DR Pfam; PF00053; Laminin_EGF; 3.
DR SMART; SM00042; CUB; 2.
DR SMART; SM00181; EGF; 9.
DR SMART; SM00179; EGF_CA; 2.
DR SMART; SM00180; EGF_Lam; 4.
DR SMART; SM00612; Kelch; 4.
DR SMART; SM00423; PSI; 8.
DR SUPFAM; SSF57196; EGF/Laminin; 3.
DR SUPFAM; SSF57184; Growth factor receptor domain; 1.
DR SUPFAM; SSF117281; Kelch motif; 3.
DR SUPFAM; SSF49854; Spermadhesin, CUB domain; 2.
DR PROSITE; PS00010; ASX_HYDROXYL; 2.
DR PROSITE; PS01180; CUB; 2.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS01186; EGF_2; 2.
DR PROSITE; PS50026; EGF_3; 2.
DR PROSITE; PS01187; EGF_CA; 1.
DR PROSITE; PS01248; EGF_LAM_1; 4.
DR PROSITE; PS50027; EGF_LAM_2; 2.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Kelch repeat {ECO:0000256|ARBA:ARBA00022441};
KW Laminin EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00460};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000007875};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius}.
FT TRANSMEM 2290..2312
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2324..2347
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2421..2443
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 1..89
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 118..153
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1005..1044
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1085..1131
FT /note="Laminin EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50027"
FT DOMAIN 1132..1181
FT /note="Laminin EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50027"
FT DOMAIN 1183..1292
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DISULFID 143..152
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1103..1112
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 1115..1129
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 1132..1144
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 1134..1151
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 1153..1162
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 1165..1179
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
SQ SEQUENCE 2560 AA; 284900 MW; F7B66517F0B3FC9F CRC64;
CEWLIISPDP VNFPFVSLRI LSRDTECGYD FVSVYDGASQ NDKVLAKLSG DSSTTRIDTL
YARSARMLIY FYSDRNYVRQ GFTAEFSILS CLNNCSGNGD CLHNKCSCYP GWFGESCDQL
ECSNRCGGSL QGFCNFTSSK CMCNVGYIGK DCSMATRTSS NAGGWNVVDR SGDNVAFHTA
AYAHPYLYTF GGYDLNIMKN TLHRYDFQQN SWEKMIAGSI SPSARAFHTV VTLKTLTTSL
IVYILPALFT WFHKILKVLL TLVHLLQMEH IVDDGLDYLI VFGGDLGDNQ YLNDLWLYNI
SGNTWLRQNN LSSPPPVSRH ASCVLQHSHH LYIFGGMVHG EGGLDFSSEL YKFDWKSWAW
SKVNSSGLKP SSKKVAGHSM VYDSVDNLFI VFGGFHASKP IAVRSKKLLL FKIDQSYWYE
VELKGSTPMA FHSANIIGDY MVVHGGSVHQ HTSDESCYSD TTYLYNMRCG KWSSVHTNYS
GMYGHVSSIL PQNILLVHGG YDGTVLNALH AFKPPLYVLP NLLVSNVCTN YDKEDQCLLD
QRCVWCSQFS FCSENTAQCS EDQLTYPVCD GICKHLNTCT SCSLFNRYQF VGGSWVPEKC
GWCVADQKCT GLNSTSGNMF SEISWIRDYD SPIISPPSCL VRKLVKGFFV QYYKDNQLNP
YQIGTIANKP GLNINIETLK KHCDYHILKR FTIILQLQVV ISNGQLSLIT TSDESNRTLV
VQGFLHPLNP AYFPNGILNS LGWPTLRSLQ SLNRISRYLS MVLSSRNNAG ASQSEFTLIE
SKYLEAYNTE GGSCAGYSSC LLCLSDTSCM WCTDGNACVA ASPSACHNAG SALTVVGGTG
QVVTDPHLCP VCSDSITCWE CSKHRYCRWT STRGCLQNSQ GGNKLKSVCS TSCSERKSCR
TCLEEEGGGG GCSWCESSRT CYSFQHHFPI YLLGQCGEWM EDAHQCRNCS THNACASCID
EYHCGWCFNT GNSLNGKCMD GEISDDRCGD PYLPSQSWSY ASCPDVDECN AGSLHSCHEN
ATCSNTPHSY RCSCNRGFHG DGFSCSPTCY NHCGNGYCSE SLRCECDLGW RSSPDGGLYD
CSDDCLCNFH SSCNTSVGIC DECQHNTHGS HCELCVAGSY GNASSPHGCQ TCQCNNHGEL
ANICNQSTGE CVCTDHTTGR NCQLCANGYY GNASNGGMCY QECDGRYLVE SSLVGGFGTR
YNFNKIKHCF HVIKARSASD IVVLEIENNL DIDCQVTRLY VYNGIPGVSA PLGAVCGSNS
AVFIILMFAD TLSVVYNSAL GSTGFTAQYQ VVSCTNHPSI TDCLSTNMSL LHCSSEEQLM
CFNVRFKRDP LLLTPQGMCV CTHPLWGQLC GLQSGIDSNN SVPRSNIRPT SIIGHSMVIY
NQSLLVYGGF RFNNSDPTIY RFNTTSRMWD FILPSMFPDT PYFHSSVALP GMNSMLVLGG
ILHNGSYSNR LWKLMLAENF SHEWIEILSN ATLPAVAGHT ITLCSKHVIL LGGYTHSGII
NKQVWSFDIY THTCKEVKTK GEIVGMFGHT AVCDKTTHRI FVFGGLHFES SQSANLFVYH
TRYKRWSKLV SSPQPLNMMF PAISLFHTGN FLTTDSTSGL LLFGQPYSSN VTETWMFNLH
SYTWLQITNL ESWKFQGILS WSAVIRPLNK EDSDIYLYRG GSVMQVGSII AKLLFCKETK
ILFSDHFNIY NTIKFYFSQT IKVVSKCFQP SLKCRGGINI LCTGIQVGIH NCKNQFGKII
VGWNTSVKPH LLSRSCKECN LNFPYNNFFC GWDGRTCVSG NFTTPKCSEC AHLDCSSCVQ
SSQCIWLPQG GCENIESIGP APTNSDLLIC RKSCNEHKSC DTCLSQNSDC SWSTRLRRCL
SVGENILYCS TGVCGRVIKH VHECPVVCAH STYCHSCLMM DGCGWYGTDD GSGCGDCRSG
SYQNPTDHST PTCPQSHWWF VQCPPENECV NGHNTCQQFE TCADTLDYFQ CSCVDGYFRN
GVNLQCVPRC QPECLHGVCT SPNVCSCSFG FTGDTCNISC PCNNHSDCVM LANGTFVCTG
CLHHTSGEYC NHCEVGFVGD PSRNNSCLPC NQICNGRSDT CISKSIGVNL WREPLYGVSS
VNDAVCIGCS GNSFGDHCQD CYDGYFMMHR VCRRCECNGH GNRCNKLTGN SCQCWNRTTT
SCGVEAQCWR QQCNLCMDGF KGNPVNGGHC YRLVNVNQEW CFDPTLQGGC VDYGFDRELE
LSVGYSLLFA LLPRFVNVDM RLILDVTHGA VDVFISDLQD AFVVTVDNAT MQHKVSINSS
LQTPKKLRDY LFYCCPTQCI NIISYLQLLL WISKYVKSFS LKIITFMLKF SIWTTMCLYH
LSHYFIITWN LNATNLNNFI TIAPNFVLHI KQLQNRLIVT FPRTYYNFQK ATFYIIVLST
EELIDSPGRL ILRQDQTRID LFVFFSVFFS CFFLFLSICI VAWKLKHHVD ARRNILAHHI
QLQHMASRPF AYVYLLFHHS RSLGSCPLSL DGQNHAKTSL VLKENSKTMV ISMANKHLTG
WHSRSVGPVA FEPSKDGRAG VATVFINLPA NTSVCFASAL
//