ID A0A2G8LP12_STIJA Unreviewed; 1939 AA.
AC A0A2G8LP12;
DT 31-JAN-2018, integrated into UniProtKB/TrEMBL.
DT 31-JAN-2018, sequence version 1.
DT 24-JAN-2024, entry version 11.
DE RecName: Full=Hyalin {ECO:0008006|Google:ProtNLM};
GN ORFNames=BSL78_01119 {ECO:0000313|EMBL:PIK61975.1};
OS Stichopus japonicus (Sea cucumber).
OC Eukaryota; Metazoa; Echinodermata; Eleutherozoa; Echinozoa; Holothuroidea;
OC Aspidochirotacea; Aspidochirotida; Stichopodidae; Apostichopus.
OX NCBI_TaxID=307972 {ECO:0000313|EMBL:PIK61975.1, ECO:0000313|Proteomes:UP000230750};
RN [1] {ECO:0000313|EMBL:PIK61975.1, ECO:0000313|Proteomes:UP000230750}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Shaxun {ECO:0000313|EMBL:PIK61975.1};
RC TISSUE=Muscle {ECO:0000313|EMBL:PIK61975.1};
RX PubMed=29023486;
RA Zhang X., Sun L., Yuan J., Sun Y., Gao Y., Zhang L., Li S., Dai H.,
RA Hamel J.F., Liu C., Yu Y., Liu S., Lin W., Guo K., Jin S., Xu P.,
RA Storey K.B., Huan P., Zhang T., Zhou Y., Zhang J., Lin C., Li X., Xing L.,
RA Huo D., Sun M., Wang L., Mercier A., Li F., Yang H., Xiang J.;
RT "The sea cucumber genome provides insights into morphological evolution and
RT visceral regeneration.";
RL PLoS Biol. 15:E2003790-E2003790(2017).
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PIK61975.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MRZV01000021; PIK61975.1; -; Genomic_DNA.
DR STRING; 307972.A0A2G8LP12; -.
DR Proteomes; UP000230750; Unassembled WGS sequence.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR CDD; cd00054; EGF_CA; 5.
DR Gene3D; 2.10.25.10; Laminin; 8.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR003410; HYR_dom.
DR PANTHER; PTHR24273; FI04643P-RELATED; 1.
DR PANTHER; PTHR24273:SF32; HYALIN; 1.
DR Pfam; PF00008; EGF; 4.
DR Pfam; PF02494; HYR; 15.
DR SMART; SM00181; EGF; 9.
DR SMART; SM00179; EGF_CA; 6.
DR SUPFAM; SSF57196; EGF/Laminin; 6.
DR PROSITE; PS00022; EGF_1; 9.
DR PROSITE; PS01186; EGF_2; 5.
DR PROSITE; PS50026; EGF_3; 7.
DR PROSITE; PS50825; HYR; 13.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Reference proteome {ECO:0000313|Proteomes:UP000230750};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..19
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 20..1939
FT /note="Hyalin"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5013567126"
FT DOMAIN 22..104
FT /note="HYR"
FT /evidence="ECO:0000259|PROSITE:PS50825"
FT DOMAIN 106..189
FT /note="HYR"
FT /evidence="ECO:0000259|PROSITE:PS50825"
FT DOMAIN 222..305
FT /note="HYR"
FT /evidence="ECO:0000259|PROSITE:PS50825"
FT DOMAIN 306..388
FT /note="HYR"
FT /evidence="ECO:0000259|PROSITE:PS50825"
FT DOMAIN 502..586
FT /note="HYR"
FT /evidence="ECO:0000259|PROSITE:PS50825"
FT DOMAIN 666..749
FT /note="HYR"
FT /evidence="ECO:0000259|PROSITE:PS50825"
FT DOMAIN 829..912
FT /note="HYR"
FT /evidence="ECO:0000259|PROSITE:PS50825"
FT DOMAIN 985..1068
FT /note="HYR"
FT /evidence="ECO:0000259|PROSITE:PS50825"
FT DOMAIN 1148..1231
FT /note="HYR"
FT /evidence="ECO:0000259|PROSITE:PS50825"
FT DOMAIN 1232..1314
FT /note="HYR"
FT /evidence="ECO:0000259|PROSITE:PS50825"
FT DOMAIN 1315..1395
FT /note="HYR"
FT /evidence="ECO:0000259|PROSITE:PS50825"
FT DOMAIN 1396..1480
FT /note="HYR"
FT /evidence="ECO:0000259|PROSITE:PS50825"
FT DOMAIN 1480..1517
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1602..1641
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1647..1683
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1691..1737
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1745..1788
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1791..1827
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1830..1919
FT /note="HYR"
FT /evidence="ECO:0000259|PROSITE:PS50825"
FT DOMAIN 1896..1932
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DISULFID 1507..1516
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1631..1640
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1673..1682
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1727..1736
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1778..1787
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1817..1826
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1922..1931
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 1939 AA; 200183 MW; ECD257858F97E09C CRC64;
MHYEFIFIVP ILFLIHVIAV NRPTLPPVVD CTGRDIVETA MFGVSSVFVT LPTCSATDDS
GQAIFVSQTP PSGFFRLGST PVINIFADAD GNQGFGTFNV TVIPEADNEP PVVDCSGRTV
RRNAPPGASG VQVVLPSCTA TDNSGTANEV SQSPTSGSFF PIGTTPVTNT FVDPSGNRGV
GTFEVIISGS FFSLGTTVVE TCYVDPDGNR GCDSFTVTVT AADNEPPVVD CSGRTVRRNA
PPGASGVSVV LPSCTATDNS GTANEVSQSP TSGSFFLIGT TPVTNTFVDP SGNRGVGTFE
VIISGDNEPP VVDCSGITVN RNAPPGSSGV QVVLPSCTAI DNSGTVFEVS QSPPSGSFFP
IGTTRVTNTF VDPSGNRGVG TFEVIISGTV FESITVTPSG SFFPIGTTRV TNTFVDPSGN
RGVGTFEVII SGDNEPPVVD CSGITVNRNA PPGSSGVQVV LPSCTATDNS GTVFEVSQSP
PSGSFFLIGT TLVTNIFQDA NVDNEPPVVN CDGLSFSRTV PAGTQSLVVP NLPDCTATDN
SGTVLPLPQN PPPGTPFSVG PTMVGNTFED PSGNRATGFY TVTIIEDRQR LQIFCPANAV
AQCQTNSNAI IGLWSDPDCT GGVQPRTTSC SPSSGSDISS GQTLASCTCE DNAGETETCF
FNLPAAENIP PTVDCSGLDM TRMAPSGASG VTVLLPSCRA SDNSGTVLPM NQFPRSGSFF
TIGSTTVINT FEDACGNTAT DTFIVTVNPG VENLVMNCPT NAVAQCQTNS NAIIGLWSDP
DCTGGVQPIT TRCNPSSGSD ISSGQTFATC TCEDNAGETE TCFFNLPAAE NIPPVVDCSG
LDITRMAPSG ASGVTVLLPS CRASDNSGTV LPMNQSPRSG SFFTIGSTTV INTFEDACGN
TATDTFIVTV NPGVENLVMN CPTNAVAQCQ TNSNAIIGLW SDPDCTGGVQ PITTRCNPSS
GSDISSGQTF ATCTCEDNAG ETETSENIPP VVDCSGLDMT RMAPFGASGV TVLLPSCRAS
DNSGTVLPMN QSPRSGSFFT IGSTTVINTF EDACGNTATD TFIVTVNPEI RLVDITCPSD
GTGGNEVAPG IFIATWSNPI CTGSSGTLST SCDPSSGSPV GLGTTVVRCS CTDVRGQRDE
CTFAIFNADV MPPSVDCSGR DEVVMVLPPA RGATVMLPSC EVSDNSGSSS LVRQSPTSGS
FFPLGTTPVT NEYADDSGNR GSDVFTVTVV VVDIEGPDVT CADILRTVPF GQSGAIVQLN
TCTAVDNSGR PPSLLSYQPT SGSFFPIGST EVRVIFLDDA GNSGTDTFNV IVQGLDGIDP
SVNCAGRDIN ADTTGSCAVV SFAPCTATDN SGVPPVLDFQ SHQSGDCFPV GTTAVVFTFR
DGAGNTGSGS FDIVVTRDTA PVLLRCPDDI TDQVLITMGG GIVQYDAPTA FDESGSVQII
NDVIFVPGSF FPLGTTPVTY VFSDPSGNTV ECSFSVTLVG VNPCSSRICQ NGGVCQAMSL
TDAACVCSGC FTGSTCQIST GACNLNSCSN GGVCIPFADS CTASSCDCPR CFSGLSCQTR
VSACRNHECL NGASCIPDPV ECDQYTCECL NCFRGEFCAI AIPDPCSSTP CLNGGQCIRR
SDSCYGFFCA CQTGFSGERC ESSVNILENP CNNFPCENDG SCVSSGSVYK CLCRDGYTGI
NCRQQTGSNS FFDQCVSSPC ANGGSCFNSY STSSGSLTYT PQYTCVCPNS YTGERCTVLT
SLVPQLNRCQ SSNICQNGGT CLNSYCSFDD RVDFFCDCPI GFIGEVCTIP YGNPCSTIQC
SNGGTCVPFN QYFVCECRPG FSGSTCGLLG DVIPPTISGC PQQTIVVQAS PGASSAQVSW
PTPQVSDNSG GPVQLVSVNA VSGAFYGVGT TRRSSTSTGC QLNPCLNGGT CRQVAGVDSC
ICPPGFTGQT CSQSEYPGK
//