GenomeNet

Database: UniProt
Entry: A0A498P4N0_LABRO
LinkDB: A0A498P4N0_LABRO
Original site: A0A498P4N0_LABRO 
ID   A0A498P4N0_LABRO        Unreviewed;      1134 AA.
AC   A0A498P4N0;
DT   05-JUN-2019, integrated into UniProtKB/TrEMBL.
DT   05-JUN-2019, sequence version 1.
DT   08-OCT-2025, entry version 19.
DE   SubName: Full=Collagen alpha-1(XVIII) chain-like isoform X2 {ECO:0000313|EMBL:RXN38998.1};
GN   ORFNames=ROHU_000648 {ECO:0000313|EMBL:RXN38998.1};
OS   Labeo rohita (Indian major carp) (Cyprinus rohita).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;
OC   Cyprinidae; Labeoninae; Labeonini; Labeo.
OX   NCBI_TaxID=84645 {ECO:0000313|EMBL:RXN38998.1, ECO:0000313|Proteomes:UP000290572};
RN   [1] {ECO:0000313|EMBL:RXN38998.1, ECO:0000313|Proteomes:UP000290572}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=DASCIFA01 {ECO:0000313|EMBL:RXN38998.1};
RC   TISSUE=Testis {ECO:0000313|EMBL:RXN38998.1};
RA   Das P., Kushwaha B., Joshi C.G., Kumar D., Nagpure N.S., Sahoo L.,
RA   Das S.P., Bit A., Patnaik S., Meher P.K., Jayasankar P., Koringa P.G.,
RA   Patel N.V., Hinsu A.T., Kumar R., Pandey M., Agarwal S., Srivastava S.,
RA   Singh M., Iquebal M.A., Jaiswal S., Angadi U.B., Kumar N., Raza M.,
RA   Shah T.M., Rai A., Jena J.K.;
RT   "Draft genome sequence of Rohu Carp (Labeo rohita).";
RL   Submitted (MAR-2018) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:RXN38998.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; QBIY01003913; RXN38998.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A498P4N0; -.
DR   STRING; 84645.A0A498P4N0; -.
DR   Proteomes; UP000290572; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   1: Evidence at protein level;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:RXN38998.1};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Proteomics identification {ECO:0007829|PeptideAtlas:A0A498P4N0};
KW   Reference proteome {ECO:0000313|Proteomes:UP000290572};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..22
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           23..1134
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5019730887"
FT   DOMAIN          91..282
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          40..78
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          315..571
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          637..866
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          911..934
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        40..58
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        346..359
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        388..403
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        404..418
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        477..492
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        528..538
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        540..556
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        654..672
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        699..712
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        717..731
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        762..779
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        805..814
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        822..834
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        843..854
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        914..925
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1134 AA;  116250 MW;  DCE79BE33C839820 CRC64;
     MVAPVLPLTC LVLLSLVSVS SQIQWLKFLW VPETTSPAAA VTTPAPTTTA PEESTAPVFP
     AFPPQPTEGS SKARTKRPLK MWKSERGSKG HLVLTELVGV PLPPSVSFIT GYEGFPAYHF
     GSKANVGRLT QTFVPEPFFL DFAIIVTVKP SSSRGGVLFA ITDPSQKIIH LGLALTPVED
     KTQKIVFYYS EPGLADSMEV ASFKVPDMTQ QWNRFTLTVE AEEVRLYMDC EKYHSTPLKR
     SQQPLSFKPG SGIFVANAGG TDLERFVGLT KQMKQGRVQG MVLARGKKGS VENLDLLAPQ
     DPLALLDRLS LQHIQGQRGY PGPTGLPGES GVKGEKGDPG VGQPGLPGPP GPPGLPGPSK
     PIKVPYGFDA LGSGFGDVDV DTELLRGPPG PPGPPGKPGP PGPNSSSGGL LPGPAGAPGK
     DGKNGQPGLP GVPGQDGLTG QQGLKGAKGE QGIRGPPGPK GENGDPGPIG PMGPRGVPGP
     PGIPGPPGPP GPSSNNFMKD TLKGLPGADG APGLSVKGEP GSPGKDGIPG LAGLPGARGP
     KGDKGSPGQK GERGRDGLSI TGPPGPPGPP GTIINLEDLL LNDTAAELNL TKIRGPPGPM
     GPRGPKGEIG FPGIQGPPGL KGDRGEPGVS IAADGSLITG LRGPKGPKGM KGDIGPSGQP
     GIVGPIGPPG QKGEYGLPGR PGRSGIAGRK GDKGDSSGPP GPPGPPGPPG PPGRVIGLNG
     VNNNNSQPGN GRVSYGVKGD KGDVGNPGLP GTSVPMFPDG SVGEKGDNGY KGQKGEKGDP
     GLSGLPGRTG LVGPKGDSIV GPPGDAGPPG PPGRPGYGSP GPQGPPGPPG PPGTPSGSAV
     SLPGPPGPPG PPGAPGHGNS VRSYENSQTL IRETSQTAEG TLAYVIDKSE LFIRVRGGWK
     KVELGELIPV PQDSSSSALS QGLSRPSDRS VPRVHSQELN SFLPDYHVFP QNGHSVPALH
     LVALNAPFSG DMHGIRGADY QCYQQARARG LTSTYRAFLS SHLQDISSIV KKGDRFSLPV
     VNLKGDVLFS SWMAMFSGNG AVFDPLTPIY SFDGRNVMTD QAWSQKLVWH GSSTAGIRMT
     TSYCEAWRTG DMAVTGQASL LQTGRLLGQH TRSCSNHFIV LCIENSYIQN PGRN
//
DBGET integrated database retrieval system