ID A0A8S4ADC6_9TELE Unreviewed; 1186 AA.
AC A0A8S4ADC6;
DT 12-OCT-2022, integrated into UniProtKB/TrEMBL.
DT 12-OCT-2022, sequence version 1.
DT 28-JAN-2026, entry version 11.
DE SubName: Full=(Atlantic silverside) hypothetical protein {ECO:0000313|EMBL:CAG5865944.1};
GN ORFNames=MMEN_LOCUS2585 {ECO:0000313|EMBL:CAG5865944.1};
OS Menidia menidia (Atlantic silverside).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Atheriniformes; Atherinopsidae; Menidiinae;
OC Menidia.
OX NCBI_TaxID=238744 {ECO:0000313|EMBL:CAG5865944.1, ECO:0000313|Proteomes:UP000677803};
RN [1] {ECO:0000313|EMBL:CAG5865944.1}
RP NUCLEOTIDE SEQUENCE.
RA Tigano A.;
RL Submitted (MAY-2021) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CAG5865944.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAJRST010001113; CAG5865944.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A8S4ADC6; -.
DR OrthoDB; 5983381at2759; -.
DR Proteomes; UP000677803; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR CDD; cd00247; Endostatin-like; 1.
DR FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050938; Collagen_Structural_Proteins.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR37456:SF6; COLLAGEN ALPHA-1(XXIII) CHAIN-LIKE ISOFORM X2; 1.
DR PANTHER; PTHR37456; SI:CH211-266K2.1; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000677803}.
FT DOMAIN 853..899
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 1012..1180
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 74..124
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 139..851
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 920..976
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 987..1006
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 96..111
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 142..157
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 164..176
FT /note="Gly residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 179..188
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 205..217
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 281..290
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 295..305
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 333..345
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 366..386
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 434..443
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 489..499
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 516..528
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 562..571
FT /note="Gly residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 677..691
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 739..753
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 770..779
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 789..798
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 807..821
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 834..844
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 922..941
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 946..960
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 993..1002
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1186 AA; 120250 MW; 6FFA72C6812C79FD CRC64;
MRDTWTRFAV AVRDDKVMLY LNCDTDPQMM RIERSPDDME LEMGAGVFVG QAGGADPEKF
LGVIAELRVV GDPGAAERQC EEDEDDSDMA SGEGSGFRET RPAKPNTEIH RSTTTPRPIQ
QPPLVKKVLE ATAVEAGAFG ARGEKGDRGD KGEKGDRGLA GPKGEAGSGS SSGGGVRVAK
GEPGEKGSKG SAGFGYQGTK GEPGVPGPPG PPGPPGPSTE YSVEGDGSVV SRVPGPRGPP
GQPGPKGAPG DDGEPGDPGE DGKTGPQGPP GFPGTPGDPG SKGDKGDKGE GQPGPRGPPG
PPGPPGSGFR STFMDMEASG FPDIESIRGL PGLPGPPGPP GPPGPSTAGT ALSSGAFGPA
GKDGANGQPG LPGLPGNDGL QGLPGPKGEK GDSGALGLPG AIGQKGTQGE PGLPGPPGET
GLAGLPGPMG PVGRPGPPGP PGPGYRVGFD DMEGSTGGLP GIRGPEGVQG PPGLPGHPGK
PGLPGLPGPK GSEGSSGKDG QPGLDGFTGP PGLKGVKGDR GDRGEPGRDG TGLPGPPGPP
GEPGQIIYRN SNDFDAVSGR AGPQGGAGLP GRAGFPGPIG PKGDRGDPGP PGYAEKGEKG
EPGLIVGPDG NPLYLGGLTG PKGDQGYPGP VGPPGPYGPA GIKGEIGMPG RPGRPGVNGY
KGEKGEAGGG SGYSVVPGPP GPPGPPGPPG PGSTLDRFNR YDDASRNYPV IKGEKGDHGD
PGLPGIPGQS FNIDISTFKG DRGDTGVKGD KGEPGGGYYD PRFGGLQGPQ GPPGIPGLPG
PKGDSIMGPP GPQGPPGEPG VGYDGRPGPP GQPGPPGPPG SFPGVYRPNY PVSVPGPPGP
PGAPGSPGVS SGVTVLRSYE TMIATARRQE EGSLIFVLDR ADLYLRVRDG LRQVMLGEYS
PFYRDLENEV AEAQPPPVIL YPQSQDQSQN NGAGHYSQGS PVIQPIEPPP AVDPGYPPQY
EPRFPEQTYP GQTDGRLTSQ QAENRYPVTP QQRPNPPVPG PFGPVETSGS GLHLIALNAP
QTGNMRGIRG ADFLCFQQAR AAGLKGTYRA LLSSKLQDLY TIVRKSDRDS ISIVNLKNQV
LYSSWESIFG DNASKMRENV PIYSFDGRDI IRDSAWPEKM VWHGTSSKGH RKTDHYCETW
RAGDRAVTGL ASSLQSGQLL QQSPRSCSGS YVVLCIENAF TLPSKK
//