ID A0A498NB91_LABRO Unreviewed; 4941 AA.
AC A0A498NB91;
DT 05-JUN-2019, integrated into UniProtKB/TrEMBL.
DT 05-JUN-2019, sequence version 1.
DT 27-MAR-2024, entry version 18.
DE SubName: Full=Collagen alpha-3(VI) chain-like protein {ECO:0000313|EMBL:RXN29074.1};
GN ORFNames=ROHU_018803 {ECO:0000313|EMBL:RXN29074.1};
OS Labeo rohita (Indian major carp) (Cyprinus rohita).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;
OC Cyprinidae; Labeoninae; Labeonini; Labeo.
OX NCBI_TaxID=84645 {ECO:0000313|EMBL:RXN29074.1, ECO:0000313|Proteomes:UP000290572};
RN [1] {ECO:0000313|EMBL:RXN29074.1, ECO:0000313|Proteomes:UP000290572}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DASCIFA01 {ECO:0000313|EMBL:RXN29074.1};
RC TISSUE=Testis {ECO:0000313|EMBL:RXN29074.1};
RA Das P., Kushwaha B., Joshi C.G., Kumar D., Nagpure N.S., Sahoo L.,
RA Das S.P., Bit A., Patnaik S., Meher P.K., Jayasankar P., Koringa P.G.,
RA Patel N.V., Hinsu A.T., Kumar R., Pandey M., Agarwal S., Srivastava S.,
RA Singh M., Iquebal M.A., Jaiswal S., Angadi U.B., Kumar N., Raza M.,
RA Shah T.M., Rai A., Jena J.K.;
RT "Draft genome sequence of Rohu Carp (Labeo rohita).";
RL Submitted (MAR-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Collagen VI acts as a cell-binding protein.
CC {ECO:0000256|ARBA:ARBA00043858}.
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC matrix {ECO:0000256|ARBA:ARBA00004498}.
CC -!- SIMILARITY: Belongs to the type VI collagen family.
CC {ECO:0000256|ARBA:ARBA00044000}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RXN29074.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QBIY01011822; RXN29074.1; -; Genomic_DNA.
DR STRING; 84645.A0A498NB91; -.
DR Proteomes; UP000290572; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0004867; F:serine-type endopeptidase inhibitor activity; IEA:InterPro.
DR GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR CDD; cd22629; Kunitz_collagen_alpha3_VI; 1.
DR CDD; cd22635; Kunitz_papilin; 1.
DR CDD; cd01472; vWA_collagen; 1.
DR CDD; cd00198; vWFA; 1.
DR CDD; cd01450; vWFA_subfamily_ECM; 2.
DR Gene3D; 4.10.410.10; Pancreatic trypsin inhibitor Kunitz domain; 2.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 21.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR002223; Kunitz_BPTI.
DR InterPro; IPR036880; Kunitz_BPTI_sf.
DR InterPro; IPR020901; Prtase_inh_Kunz-CS.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR22588:SF6; COLLAGEN ALPHA-4(VI) CHAIN; 1.
DR PANTHER; PTHR22588; UNCHARACTERIZED; 1.
DR Pfam; PF01391; Collagen; 1.
DR Pfam; PF00014; Kunitz_BPTI; 2.
DR Pfam; PF00092; VWA; 21.
DR PRINTS; PR00759; BASICPTASE.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00131; KU; 2.
DR SMART; SM00327; VWA; 20.
DR SUPFAM; SSF57362; BPTI-like; 2.
DR SUPFAM; SSF53300; vWA-like; 22.
DR PROSITE; PS00280; BPTI_KUNITZ_1; 1.
DR PROSITE; PS50279; BPTI_KUNITZ_2; 2.
DR PROSITE; PS50234; VWFA; 21.
PE 1: Evidence at protein level;
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:RXN29074.1};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Proteomics identification {ECO:0007829|PeptideAtlas:A0A498NB91};
KW Reference proteome {ECO:0000313|Proteomes:UP000290572};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..25
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 26..4941
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5019824489"
FT DOMAIN 33..206
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 234..409
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 439..611
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 639..811
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 837..906
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 945..1121
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1142..1314
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1339..1517
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1539..1711
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1739..1914
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1937..2109
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 2137..2309
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 2337..2509
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 2537..2712
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 2735..2907
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 2935..3111
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 3143..3315
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 3343..3388
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 3453..3629
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 4174..4355
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 4392..4590
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 4795..4846
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT DOMAIN 4875..4925
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT REGION 3860..4141
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4075..4089
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 4941 AA; 539316 MW; FE3254DE1162E8E7 CRC64;
MGKSRLLLYA LLGVMVFGLF PKLEAQEASA PADLVLLIDG SESVGAANFP LFSDLAVQVI
EGLAVGRDAI RVALVLYGAD PEIQFYLNSY DNKESVLSAI RGLKYPGGYD ANLGSALEEV
ADSLLSQEAG GRAEEGVPQV LVVISAGEST DDVSQGERAL KQASVYTFGI AVGDSATAQL
EAIATDKSFV LSAPDVRTVA SMGDQILPYI NGVAQRTILI ETQFTEALAV GKRDIIFLID
SSMGTIVINA VREFIKRFID TMPIGPDQVQ VGVAMFSTTP KMEINLNSFN SKESLISALA
RIKPKPSADV NIGSALDFVR TSMLTAESGS RFQDQVPQLV LLLTSKKSKD SVQQPADALR
QTGVLTLAAG SKAADEAELK QIAFDDSLVF MLKDFRALLR NPREIISPLT TLSGIVVTEG
PTEPVVDVTT VHTQRVVRDI VFMVDGSSYV GNNVQPVLNF ITEVVNRLDV RPERVRIGLM
QFAERQHTEF FLNTYNTKQD VLSAIARLGL IGGRALNTGA ALQFALANHF QPAAGSRRKE
GTQQVLVLIT GGPSQDEVKR VADRVALAGV LTFAVGAGQV EDRFLKTVAF VEDLAYYRRN
FADLSSVVDE IMTPLITVVG ETNTTIDFPT PGPSGGERDV AFLIDGSDDV RGDFPYIRDF
ISKVIEPLDI ASNKVRVSVV QHSERPSPSF FLNTYQTKDE VLRAVRGLAL AGGRSLNTGA
ALTFMKNTIL SPANGGRASK NVPQFLIVLT GGRSRDSVKE PAVALKTEGV VPFGVGVKNA
DPKQIEAISH NPSFAFNVKE FSQLSTVQER LKNYVNLPDQ ELKEFLTIVD AEDIAKDVVF
LLDGSDGTKN GFGAVCDFVQ RVVEQLNVEE SKYRISVVQY SDNPVVDFYL KTYSTKSQFT
HYIDRFDDLP LIEPQIVLTL KNIQKDPGEI PKTEVPTSSG INKRDIVFLL DGSDDSKNGL
LVIREFIRRM AEDLDIDQDI VRVAVIQYSE DALTHFLLNT YTSKKAVIYA INGLRAKGGR
NLNTGAALQY VRDHVFTAAS GSRHQLGVPQ LLIVMTGGGS SDDVAGPAED LKNTGVLSIA
IGIKNAVEAE LQCIAFSPRF LFNLSAFGEL LHIQPDILSF IKSKMGIEPP TIIVELDAAQ
RDIVFILDGS DDTRDAFKQM CQFVQRVVDK LNIGPNRDRV SVVQYSREPQ VHFYLNTHAT
KQDVLNSIES LQHQGGSPLN TGRALDYTKK NMFIASSGSR ILEGVPQVLV LVTGGRSQDD
VRAPAAALKK DQIVTFGIGN QNADVIQLQA ISYTPGHTLT VSQFDDLQTI EQNLLSYVKR
VPRQPRRLPP TTTDAGNRDV VFLIDGSDET KGIFSGMQNF VQTLVQKLNV ASNKDRVSVV
QYSDDAAFDF LLNTYSSSDD VISNVKRLIH KRGRLRNTGA ALQYLKDNVF TAAAGSRLLE
GVPQVLILLN SGRSKDDIRG AVKALKDIGV ISFSIGTTNA DTLELQTISH QPNYFFISDF
ESILDVQENI LALINRVSYQ QLPTVTPQVS AESDRQRRDV VFLMDGSDGN RNGFPAMKEF
VQRMVEKLDI AENRDHVSVV QYSKDTKVHF YLNTYMTKKD ILESVRGLRH KGGRPLYMGA
SLQYVRDNVF TASSGSRWLE GVPQILILLS GGKSFDSVDA AASALKELGV LTFGIGSRGS
DSREMQRISY DPNYALTVSD FSELPNVQEQ LLASLQAVAI PITPTLPTVT ADYTTPRKDV
VFLLDGSDGT RNTFPAMRDF VQRLVEQFNI EANRDRVSVV QYGRDAEVNF YLNSYTTKGD
ILTSVRGLRH RGGRPLNTGA ALQYVRDNVF TASAGSRRQE GVPQILILLS GGRSSDNVDV
PASALKESGI SIFGIGTRNS SREVQRIAND PTYAQSINEF SDLPSVQQQF ISSLNNVLGE
VKPMTPTGPA TAERRRDVVF LLDGSDGTRS SFPAMRDFVE RMVERLNVSE NRDRVSVVQY
SRDPEAHFYL NTYSRKEDVL DEVRGLRHKG GRPLNTGAAL QYVRDNVFTA SSGSRRVEGV
PQLLILLSGG RSFDNVDTPA SSLKELGVLI FGIGSRSSDS RELQKISHEP SYALSVSDFA
DLPSVQQQIF SNVDKVFVEG TSITTTTIAE GRRQRRDVVF LLDGSDATRN GFPAMKEFVQ
RMVERLDIAE DRDRVSVVQY SRDAEVNFYL NTYTTKEDIL DGVRGLRHKG GRPLYTGAGL
QYVRDNVFSA SSGSRRLEGV PQILVLLSGG RSSDNVDAAA SSLKGLGVLT FGIGSRGSDS
RELQRISYDP SYAVTVSDFS ELPNVQEKLL ASLQAVAIPV TPTSPTATDE YTAPRKDVVF
LLDGSDGTRN SFPAMRDFVQ RVVEQFDIEA NKDRVSVVQY SRDAEVHFYL NSYTTKEEIL
DRVRGLKHKG GRPLNTGAAL QYVRDNVFTA SSGSRRLEAV PQILILLSGG RSFDDVDAAA
SSLKELGVLT FGIGSRCSDS RELQRISYDP NYAVTVSDFS ELPNVQEKLL ATVQTVAMPV
TPTSPTPTAD DGVPRKDVVF LLDGSDGTRN TFPAMRDFVQ RLVEQFNIEA NRDRVSVVQY
GKDAEVNFYL NSYTTKGDIL TSVRGLRHRG GRPLNTGAAL QYVRDNVFTA SAGSRGQEGV
PQILILLSGG RSSDNIDVPA SALKESGISI FGIGTRNSSR EVQRIANDPT YAQSINEFSD
LPSVQQQFIS SLNNVLGEVK PMTPTGPATA ERRRDVVFLL DGSDGTRSSF PAMRDFVERM
VERLNVSENR DRVSVVQYSR DPEAHFYLNT YSRKEDVLDE VRGLRHKGGR PLNTGAALQY
VRDNVFTASS GSRRVEGVPQ LLILLSGGRS FDNVDTPASS LKELGVLIFG IGSRSSDSRE
LQKISHEPSY ALSVSDFADL PSIQQQLFTN INTVLVERPP ITTTVIVESI GPKKDIVFVI
DGSEGVGREF PIIQEFVRRV VENLNVGEKK IRVGVVQYGD LPHADIYLNS HKTKEGVLNG
IKELRQRGGR QRNLGRAINF VSRDVLTSGH GGRKQEGVPQ FVVVVSGGKA TDSIRQSATE
LKQSGVVPLS IGTRDVDTQE LQVTAYVPRF AYTVDDLPGL YTIQDTLINT LTELSSEELA
KLRPEYPIDT RPVIPVPRGD KRDVVFLIDG TTKMRTEFPA IRDMVQRVVD KLDVGLDNVR
VSVVQYSDDP KLEFLLNEHS TKEEVRQAVR RMRSKGGNRL NTGQALEYVS KNIYQRSAGS
RVEDGVPQFL ILVTGGKSND DVSGPATQLK LSRVAPLAVG AHNADEEELK LISFSPELTY
TIKDFQQLSA VEQELLTKVS TMTRDEISAP PKVTDLLNLG KKDIIFLIDG SDSVGQSGVA
HIRDFILKVV DQLDVRPDQV RVALVQYGIG ADNADAGQLA QITTTADDVL KVATFPSLPT
IQSKFISRLN GTIIVEAPTE EPASGLPQAK TADIVFLVDG SMNLGRDNFK EVMEFILNLI
DLFFTERDNL RIGLAHYATD VTDVFYLNTY NNKDDIIDAI TRAEYKGGRE IKTGNAIRYV
QKTHFVKERG SRKDEGIPQI LMVVTGGRSR DDSKSAALAL KASGVRIYAV GVGDIEDELN
NLGSETTTVA RARTFQELSE LNEQILETLD DEVKGIGLCT GVKDATRECK LDVLVGFDVA
SQGIFAAQKS LELKMTAILK RITQMQSISC TSGQVPSVQV GMLAMDSASE PVQLDFTNKY
TELLEPFKAL RNRGPFLLNA ATMKAYADRL KSQPSDSVKV VIHLTDGLDA PYNILKERVE
LLRSAGVSSL ILVALERVPR FEDAVLLEYG RGFRYTRPLR VNLMDLDYEL LEELDNIAER
ECCSVPCKCN GQRGDRGAVG VSGLKGQPGG QGYSGHPGDE GGPGERGPPG VNGTQGFQGC
PGQRGVKGSR GYNGEKGEIG EIGLDGINGE EGTSGVAGPP GDRGNPGQRG PKGAKGQKGD
VGQTGIRGDP GIPGSDNNRR GPKGDPGDAG PPGAAGRPGS DGLPGETGIG GSRGPAGPIG
APGVRGEDGN PGPRGPGGLP GSAGEKGRRG AVGRKGEPGE PGPKGVVGPL GPRGEPGEDG
RDGFGIPGPK GRKGDEGFPG FPGPKGEAGD PGSNGGPGPR GNNGQRGTSG EPGGPGQKGD
TGYPGPYQCN LVKKIRDNCP CCYGAQECPL YPTELAFALD ASDGVSRSAF NNMRDTVLRL
VGDITIAESN CPRGARVALA LYNNEVTTEI RFADTQKKRA LIERVQGLQT QQTRKQRSLE
TAMNFVAQNT FKRVRSGFLV RKVAVFFVNG PNTVTQEFSA AALRLYDAGI SSVFLLNRED
RQLTRALQLN NTALAQVIVL PSPGSAEYNN VIKKIMTCHI CLDFCAPDQM CDYIPPRGVR
DRRDPTTDLD IDMAFVVDSS ESTWPSVFTE TKQYVAQMIE QLEMSPDPAI STHHARVALV
QHAPYEYLHN GSGLPISITF GLTDHKSANS VRSFLQNKMH QLEGGRALAD ALEGTVEHVF
EKAPHPRHLK VLVLLVTGPV ELHEERIIRA ATEVKCKGYF IVVMAVGKLF SAGDVRVLSQ
VASEPSDVFF KRVDRPSGFY DDHIQTFARL LPKYLGLENA FYMSPEVSKK CQWYQSDQPR
KFPFNLPKTE EKHEKHHGHQ QVHDRKHKET GMEKMQLVNV TSSGFSLQWL SGDSKATHEV
TVTRLRDHRL VLRKNVTGSH LSISELEAAE TYHVVVNTHS VGGHVASTYK GIVPTKSARM
KVLTVNDVMG IAPTAPLSKP ETKTVELKVP SEAMGIVSTA PLSKPEIINN LMDPCSLDFD
TGLPCKDYQA KWYFDRKNGF CTQFWYGGCG GNDNRFETEA DCLKRCMKKA IEDHVKSVHP
PAPAPAALRS SVDICKLPKD EGSCAKFVLK WHYDPLSGNC ARFWYGGCGG NQNRFETQDE
CEKTCGKAVP VKQGIIAAVK T
//