ID A0A3Q3S7N0_9TELE Unreviewed; 1468 AA.
AC A0A3Q3S7N0;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 02-JUN-2021, sequence version 2.
DT 27-MAR-2024, entry version 23.
DE SubName: Full=Collagen, type V, alpha 3a {ECO:0000313|Ensembl:ENSMAMP00000019025.2};
OS Mastacembelus armatus (zig-zag eel).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Anabantaria; Synbranchiformes; Mastacembelidae; Mastacembelus.
OX NCBI_TaxID=205130 {ECO:0000313|Ensembl:ENSMAMP00000019025.2, ECO:0000313|Proteomes:UP000261640};
RN [1] {ECO:0000313|Ensembl:ENSMAMP00000019025.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 205130.ENSMAMP00000019025; -.
DR Ensembl; ENSMAMT00000019521.2; ENSMAMP00000019025.2; ENSMAMG00000012693.2.
DR GeneTree; ENSGT00940000154535; -.
DR InParanoid; A0A3Q3S7N0; -.
DR Proteomes; UP000261640; Unplaced.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.60.120.1000; -; 2.
DR Gene3D; 2.60.120.200; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000885; Fib_collagen_C.
DR InterPro; IPR048287; TSPN-like_N.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1088; COLLAGEN ALPHA-1(XXVII) CHAIN A; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 10.
DR SMART; SM00038; COLFI; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
PE 4: Predicted;
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000261640};
KW Secreted {ECO:0000256|ARBA:ARBA00022530};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 1229..1467
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 255..290
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 470..513
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 558..609
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 633..681
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 718..939
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 951..1200
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1235..1257
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 268..283
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1468 AA; 152345 MW; BCC00AA57FB8616E CRC64;
SGVQVSAVRC AAVGLWVRLS DPIDVLRVME LSEKMEGVSL EAGFCISRRG MKESDLAYKI
DKKIQLSAPT KQLLPDSKFP ENFSLMATMR AKKGSQFFLL SVYDDQGVQQ LGLEVGRSPV
FLYEDQHGQP TPELYPIFKK INLADGKWHR IAYSVQDKSV TLYLDCQRVD TLDLLRGDNA
VVSTEGITVF GTRLLDEDVF EGDIQQLLIV EDPQIAANYC VDYIPECDSA LPYNSQALNL
QENSELLSDA LMQNSEPDEL KEKRDKGSKG KGKGKRKGKG KKGSRKKKDE EEILEDGFLR
VSATTPAYLS QESFQATNLE TANPTEVFEK TLMEMPTHEP DSTPEVPALV PTFGVSWFIF
AVTHLGLCVG LQGDKPVKEP VVEEYGHDLY GHLYDDDISV STVTVGPNIT EYEVLEYEDN
KNDTDTEYEE YETYDDGFDF AERERAQVWD GEVNFIKGQK GEPAIIEPGL AGSAGPMGPQ
GPRGDPGEIG PQGRPGLAGA DGIPGPPGTL LMLPFQYGGD SQKGPAVSPQ EAQAQAILQQ
TQVQFTGTCT FILQGVPGST GLKGDRGETG PPGPRGLPGL PGINGKPGKR GHAGVDGGRG
TPGETGSKVS SLRPHCIAVV ISFHFTSPVL NATQGEDGFP GSKGDMGIKG DRGDNGSPGA
RGEDGTEGPK GQAGPLGDPG APGIAGEKLM LKERLFILDC IFLNMGLNVQ FGCFQGKSGE
TGPTGERGHP GPPGPPGEHG LPGAAGKEGT KGDPGPPGAT GKSGPAGLQG FRGSRGTPGA
TGAPGERGPP GAAGAIGQPG RPGAVGPAGP MGEKGEPGEK GPVGPAGHDG ELGPVGLPGL
AGPAGPPGDD GDKGETGGPG QKGSKGDKGE AGPPGPVGSQ GPVGQPGLPG ADGEPGPRGQ
QGMNGAKGDE GLRGFKGASG PSGLQGMPGP PGEKGESGHV GSLVSTVQQF QHIPGPHGTP
GGVGQPGPVG EKGEDGEAGD PGTVGEPGIA GEKGDVGEKG DSGPPGAAGP PGPRGTPGED
GPKGNPGPIG FPGDSGSPGE PGVNGVDGVS GPKGDNGEPG KAGPPGASGE PGSQGPPGRR
GHVGTAGKEG KQGMKGVKGT PGTPGLVGKT GPVGPQGQPG RLGPEGLRGI PGPAGEQGLT
GPPGQIGPPG PIGPPGLPGL KGDTGNKGDK GHGGLIGLIG PPGEHGEKGD RGLPGNEGTQ
GTKVPPLCSF FPCQLLLTFW CYLQGPPATM IQPLPIREGR RKRRRHSNQA QVEGGDEDVH
LDIEELLQGD QPLEDAEGME EVFATLSSMK TEVELMRRPL GTFESPARTC KELMMIRPDY
KDAAWNKEKP RSWYSKYRKG KQFSYNDRDG NPVHVVQLTF LKLLSATAKQ SFTYTCQNSA
GWFDSTSHSY QHALRFRGSN DEEMTQAKSS FVNVVHDGCQ FRKGQERTVL EIDSPSSELL
PIMDVAPSDF GNSNQKFGFQ VGRVCFNG
//