ID A0A7M7N489_STRPU Unreviewed; 1483 AA.
AC A0A7M7N489;
DT 07-APR-2021, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 1.
DT 28-JAN-2026, entry version 19.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblMetazoa:XP_030830245};
OS Strongylocentrotus purpuratus (Purple sea urchin).
OC Eukaryota; Metazoa; Echinodermata; Eleutherozoa; Echinozoa; Echinoidea;
OC Euechinoidea; Echinacea; Camarodonta; Echinidea; Strongylocentrotidae;
OC Strongylocentrotus.
OX NCBI_TaxID=7668 {ECO:0000313|EnsemblMetazoa:XP_030830245, ECO:0000313|Proteomes:UP000007110};
RN [1] {ECO:0000313|Proteomes:UP000007110}
RP NUCLEOTIDE SEQUENCE.
RA Murali S., Liu Y., Vee V., English A., Wang M., Skinner E., Han Y.,
RA Muzny D.M., Worley K.C., Gibbs R.A.;
RT "Genome sequencing for Strongylocentrotus purpuratus.";
RL Submitted (FEB-2015) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:XP_030830245}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (JAN-2021) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_030830245.1; XM_030974385.1.
DR EnsemblMetazoa; XM_030974385; XP_030830245; LOC576214.
DR GeneID; 576214; -.
DR InParanoid; A0A7M7N489; -.
DR OMA; VQDQHQN; -.
DR OrthoDB; 5983381at2759; -.
DR Proteomes; UP000007110; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IBA:GO_Central.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IBA:GO_Central.
DR FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR Pfam; PF01391; Collagen; 8.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000007110}.
FT DOMAIN 1189..1231
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 1310..1476
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 222..351
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 393..505
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 527..775
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 834..880
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 892..1179
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1234..1302
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 227..237
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 270..279
FT /note="Gly residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 577..586
FT /note="Gly residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 587..597
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 611..628
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 670..697
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 752..762
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 922..932
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1114..1128
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1135..1150
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1168..1179
FT /note="Gly residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1238..1252
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1253..1270
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1483 AA; 150621 MW; AC8229B52CF3AA70 CRC64;
MLLRPYSGSR GTVFGVTDPY HQTIILSVNI TSMSIEQMRI SLILTNIDYT SESTEVAAFL
VPSFTNHWTQ LALSIRDNQV TLFLDCQELG MQFFDKNQGW HIRIPQMAPL FIGHLGTVGE
RVQYRGDIQQ IIITNSADSA EEHCPIGEIS GSGSMALRES AGEDWAFVFS GSGDEPTSMP
EMPITYFTQY DAETPSTPAN NVRLPASVQP PLEQVATILP KAPPTTPTLQ ATPSSQPAVV
VPPGDSMKGQ KGEEGWPGVN GIHGNPGVPG PRGVGGETGP EGPEGPMGPK GEKGDLGDAG
PVGPLGEEGF PGVAGPPGDP GVKGEEGDQG MTGATGAKGE PGVGVPGLPG VAPTLSSEYM
MRLGGEKGDP GRDGYPGVPG VQGPMGIPGV DGIGVPGPKG DKGEVGLGLP GSQGKPGKEG
PKGETGQRGL PGPSSLESFI EGSGLEMFVG PPGQPGSPGR PGVMGPHGPR GESGLDGIPG
LDGVPGQPGP AGRDGEPGPK GEPGALTEEE ILDEIANLTA EFEMEGLSGM DGFNGSRGPP
GVPGAVGPKG DSGMDGNPGM KGERGSSGLP GLEGPEGMKG GPGQNGIPGV PGIAGIPGES
GQNGARGAPG PAGPPGPPGP SPLLPPGFPW TNMALNQDQE GSADGSGVWS GTVPPYLMGS
PGVQGPPGPE GMQGEPGLVG EPGIPGEPAI PGMPGLTGER GERGLPGPRG MEGETGMDGA
AGQKGDTGDP GVPGQPGADG FPGPRGVGVQ GPPGDRGLPG PRGIQGLKGD VGEQGPACEI
PPEILALSEH FAGVGSGDIG GGMMTRAPQE LTSTKGEKGD IGFDGVPGMD GQQGAKGYPG
PRGPGGVVGP KGNNGTQGEK GDVGAVGYAG PRGVKGDKGD MFLPTEEEML MLKGDGGDTG
RRGKKGLKGQ FGIPGRPGRV GKTGPEGQTG PEGPVGPQGP PGPSGGGEGT CECDPAAFPP
AFFPPGEKGS MGAPGSPGPM GPDGVPGAPG LPGVQGLRGL QGYAGEPGIE GLEGRPGAQG
VPGVPGIDGA RGEKGDPGDF FVNDEVFIGP PGEQGLPGRD GQPGQSIKGD TGEPGHGAEG
MPGRDGRDGS QGPPGPPGMP GHPGEPAYYT GDGNGERGPK GEPGEPGREG QSGAPGFDGR
PGSRGPRGPQ GPTGYGEQGP PGEPGTPGEP GRGSIGHLGG GDYVGMARFA TIQDMLDGMR
DLEVGTLAFI IRVQELYIRV ETGMQQVLLG ATFTLPPTSE PRPQPRPQPR PQPSITSRTT
TTTLIPTTTL APFVDPTEPP QIPGPGLTDR TEEPILPGPE IPVSDQPMLY MYALNTPQEG
YMIGGISGAD FKCWKQAKAA GLKGTYRAFL SSPLQDIRTI VRKQDKGLPI ANGKQEVLFN
SWDDIFSTEN EAPFHDSFHI YSFDGKDVMT DETWPDKYMW HGSYANGERM AESFCEAWRA
QNHFARGQAS PLSDHKLLGQ DEFPCNNAFA VLCVQTARPN ARK
//