GenomeNet

Database: UniProt
Entry: A0A498NB91_LABRO
LinkDB: A0A498NB91_LABRO
Original site: A0A498NB91_LABRO 
ID   A0A498NB91_LABRO        Unreviewed;      4941 AA.
AC   A0A498NB91;
DT   05-JUN-2019, integrated into UniProtKB/TrEMBL.
DT   05-JUN-2019, sequence version 1.
DT   27-MAR-2024, entry version 18.
DE   SubName: Full=Collagen alpha-3(VI) chain-like protein {ECO:0000313|EMBL:RXN29074.1};
GN   ORFNames=ROHU_018803 {ECO:0000313|EMBL:RXN29074.1};
OS   Labeo rohita (Indian major carp) (Cyprinus rohita).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;
OC   Cyprinidae; Labeoninae; Labeonini; Labeo.
OX   NCBI_TaxID=84645 {ECO:0000313|EMBL:RXN29074.1, ECO:0000313|Proteomes:UP000290572};
RN   [1] {ECO:0000313|EMBL:RXN29074.1, ECO:0000313|Proteomes:UP000290572}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=DASCIFA01 {ECO:0000313|EMBL:RXN29074.1};
RC   TISSUE=Testis {ECO:0000313|EMBL:RXN29074.1};
RA   Das P., Kushwaha B., Joshi C.G., Kumar D., Nagpure N.S., Sahoo L.,
RA   Das S.P., Bit A., Patnaik S., Meher P.K., Jayasankar P., Koringa P.G.,
RA   Patel N.V., Hinsu A.T., Kumar R., Pandey M., Agarwal S., Srivastava S.,
RA   Singh M., Iquebal M.A., Jaiswal S., Angadi U.B., Kumar N., Raza M.,
RA   Shah T.M., Rai A., Jena J.K.;
RT   "Draft genome sequence of Rohu Carp (Labeo rohita).";
RL   Submitted (MAR-2018) to the EMBL/GenBank/DDBJ databases.
CC   -!- FUNCTION: Collagen VI acts as a cell-binding protein.
CC       {ECO:0000256|ARBA:ARBA00043858}.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the type VI collagen family.
CC       {ECO:0000256|ARBA:ARBA00044000}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:RXN29074.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; QBIY01011822; RXN29074.1; -; Genomic_DNA.
DR   STRING; 84645.A0A498NB91; -.
DR   Proteomes; UP000290572; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0004867; F:serine-type endopeptidase inhibitor activity; IEA:InterPro.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   CDD; cd22629; Kunitz_collagen_alpha3_VI; 1.
DR   CDD; cd22635; Kunitz_papilin; 1.
DR   CDD; cd01472; vWA_collagen; 1.
DR   CDD; cd00198; vWFA; 1.
DR   CDD; cd01450; vWFA_subfamily_ECM; 2.
DR   Gene3D; 4.10.410.10; Pancreatic trypsin inhibitor Kunitz domain; 2.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 21.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR002223; Kunitz_BPTI.
DR   InterPro; IPR036880; Kunitz_BPTI_sf.
DR   InterPro; IPR020901; Prtase_inh_Kunz-CS.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR22588:SF6; COLLAGEN ALPHA-4(VI) CHAIN; 1.
DR   PANTHER; PTHR22588; UNCHARACTERIZED; 1.
DR   Pfam; PF01391; Collagen; 1.
DR   Pfam; PF00014; Kunitz_BPTI; 2.
DR   Pfam; PF00092; VWA; 21.
DR   PRINTS; PR00759; BASICPTASE.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00131; KU; 2.
DR   SMART; SM00327; VWA; 20.
DR   SUPFAM; SSF57362; BPTI-like; 2.
DR   SUPFAM; SSF53300; vWA-like; 22.
DR   PROSITE; PS00280; BPTI_KUNITZ_1; 1.
DR   PROSITE; PS50279; BPTI_KUNITZ_2; 2.
DR   PROSITE; PS50234; VWFA; 21.
PE   1: Evidence at protein level;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:RXN29074.1};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Proteomics identification {ECO:0007829|PeptideAtlas:A0A498NB91};
KW   Reference proteome {ECO:0000313|Proteomes:UP000290572};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..25
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           26..4941
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5019824489"
FT   DOMAIN          33..206
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          234..409
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          439..611
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          639..811
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          837..906
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          945..1121
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          1142..1314
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          1339..1517
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          1539..1711
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          1739..1914
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          1937..2109
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          2137..2309
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          2337..2509
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          2537..2712
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          2735..2907
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          2935..3111
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          3143..3315
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          3343..3388
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          3453..3629
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          4174..4355
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          4392..4590
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          4795..4846
FT                   /note="BPTI/Kunitz inhibitor"
FT                   /evidence="ECO:0000259|PROSITE:PS50279"
FT   DOMAIN          4875..4925
FT                   /note="BPTI/Kunitz inhibitor"
FT                   /evidence="ECO:0000259|PROSITE:PS50279"
FT   REGION          3860..4141
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        4075..4089
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   4941 AA;  539316 MW;  FE3254DE1162E8E7 CRC64;
     MGKSRLLLYA LLGVMVFGLF PKLEAQEASA PADLVLLIDG SESVGAANFP LFSDLAVQVI
     EGLAVGRDAI RVALVLYGAD PEIQFYLNSY DNKESVLSAI RGLKYPGGYD ANLGSALEEV
     ADSLLSQEAG GRAEEGVPQV LVVISAGEST DDVSQGERAL KQASVYTFGI AVGDSATAQL
     EAIATDKSFV LSAPDVRTVA SMGDQILPYI NGVAQRTILI ETQFTEALAV GKRDIIFLID
     SSMGTIVINA VREFIKRFID TMPIGPDQVQ VGVAMFSTTP KMEINLNSFN SKESLISALA
     RIKPKPSADV NIGSALDFVR TSMLTAESGS RFQDQVPQLV LLLTSKKSKD SVQQPADALR
     QTGVLTLAAG SKAADEAELK QIAFDDSLVF MLKDFRALLR NPREIISPLT TLSGIVVTEG
     PTEPVVDVTT VHTQRVVRDI VFMVDGSSYV GNNVQPVLNF ITEVVNRLDV RPERVRIGLM
     QFAERQHTEF FLNTYNTKQD VLSAIARLGL IGGRALNTGA ALQFALANHF QPAAGSRRKE
     GTQQVLVLIT GGPSQDEVKR VADRVALAGV LTFAVGAGQV EDRFLKTVAF VEDLAYYRRN
     FADLSSVVDE IMTPLITVVG ETNTTIDFPT PGPSGGERDV AFLIDGSDDV RGDFPYIRDF
     ISKVIEPLDI ASNKVRVSVV QHSERPSPSF FLNTYQTKDE VLRAVRGLAL AGGRSLNTGA
     ALTFMKNTIL SPANGGRASK NVPQFLIVLT GGRSRDSVKE PAVALKTEGV VPFGVGVKNA
     DPKQIEAISH NPSFAFNVKE FSQLSTVQER LKNYVNLPDQ ELKEFLTIVD AEDIAKDVVF
     LLDGSDGTKN GFGAVCDFVQ RVVEQLNVEE SKYRISVVQY SDNPVVDFYL KTYSTKSQFT
     HYIDRFDDLP LIEPQIVLTL KNIQKDPGEI PKTEVPTSSG INKRDIVFLL DGSDDSKNGL
     LVIREFIRRM AEDLDIDQDI VRVAVIQYSE DALTHFLLNT YTSKKAVIYA INGLRAKGGR
     NLNTGAALQY VRDHVFTAAS GSRHQLGVPQ LLIVMTGGGS SDDVAGPAED LKNTGVLSIA
     IGIKNAVEAE LQCIAFSPRF LFNLSAFGEL LHIQPDILSF IKSKMGIEPP TIIVELDAAQ
     RDIVFILDGS DDTRDAFKQM CQFVQRVVDK LNIGPNRDRV SVVQYSREPQ VHFYLNTHAT
     KQDVLNSIES LQHQGGSPLN TGRALDYTKK NMFIASSGSR ILEGVPQVLV LVTGGRSQDD
     VRAPAAALKK DQIVTFGIGN QNADVIQLQA ISYTPGHTLT VSQFDDLQTI EQNLLSYVKR
     VPRQPRRLPP TTTDAGNRDV VFLIDGSDET KGIFSGMQNF VQTLVQKLNV ASNKDRVSVV
     QYSDDAAFDF LLNTYSSSDD VISNVKRLIH KRGRLRNTGA ALQYLKDNVF TAAAGSRLLE
     GVPQVLILLN SGRSKDDIRG AVKALKDIGV ISFSIGTTNA DTLELQTISH QPNYFFISDF
     ESILDVQENI LALINRVSYQ QLPTVTPQVS AESDRQRRDV VFLMDGSDGN RNGFPAMKEF
     VQRMVEKLDI AENRDHVSVV QYSKDTKVHF YLNTYMTKKD ILESVRGLRH KGGRPLYMGA
     SLQYVRDNVF TASSGSRWLE GVPQILILLS GGKSFDSVDA AASALKELGV LTFGIGSRGS
     DSREMQRISY DPNYALTVSD FSELPNVQEQ LLASLQAVAI PITPTLPTVT ADYTTPRKDV
     VFLLDGSDGT RNTFPAMRDF VQRLVEQFNI EANRDRVSVV QYGRDAEVNF YLNSYTTKGD
     ILTSVRGLRH RGGRPLNTGA ALQYVRDNVF TASAGSRRQE GVPQILILLS GGRSSDNVDV
     PASALKESGI SIFGIGTRNS SREVQRIAND PTYAQSINEF SDLPSVQQQF ISSLNNVLGE
     VKPMTPTGPA TAERRRDVVF LLDGSDGTRS SFPAMRDFVE RMVERLNVSE NRDRVSVVQY
     SRDPEAHFYL NTYSRKEDVL DEVRGLRHKG GRPLNTGAAL QYVRDNVFTA SSGSRRVEGV
     PQLLILLSGG RSFDNVDTPA SSLKELGVLI FGIGSRSSDS RELQKISHEP SYALSVSDFA
     DLPSVQQQIF SNVDKVFVEG TSITTTTIAE GRRQRRDVVF LLDGSDATRN GFPAMKEFVQ
     RMVERLDIAE DRDRVSVVQY SRDAEVNFYL NTYTTKEDIL DGVRGLRHKG GRPLYTGAGL
     QYVRDNVFSA SSGSRRLEGV PQILVLLSGG RSSDNVDAAA SSLKGLGVLT FGIGSRGSDS
     RELQRISYDP SYAVTVSDFS ELPNVQEKLL ASLQAVAIPV TPTSPTATDE YTAPRKDVVF
     LLDGSDGTRN SFPAMRDFVQ RVVEQFDIEA NKDRVSVVQY SRDAEVHFYL NSYTTKEEIL
     DRVRGLKHKG GRPLNTGAAL QYVRDNVFTA SSGSRRLEAV PQILILLSGG RSFDDVDAAA
     SSLKELGVLT FGIGSRCSDS RELQRISYDP NYAVTVSDFS ELPNVQEKLL ATVQTVAMPV
     TPTSPTPTAD DGVPRKDVVF LLDGSDGTRN TFPAMRDFVQ RLVEQFNIEA NRDRVSVVQY
     GKDAEVNFYL NSYTTKGDIL TSVRGLRHRG GRPLNTGAAL QYVRDNVFTA SAGSRGQEGV
     PQILILLSGG RSSDNIDVPA SALKESGISI FGIGTRNSSR EVQRIANDPT YAQSINEFSD
     LPSVQQQFIS SLNNVLGEVK PMTPTGPATA ERRRDVVFLL DGSDGTRSSF PAMRDFVERM
     VERLNVSENR DRVSVVQYSR DPEAHFYLNT YSRKEDVLDE VRGLRHKGGR PLNTGAALQY
     VRDNVFTASS GSRRVEGVPQ LLILLSGGRS FDNVDTPASS LKELGVLIFG IGSRSSDSRE
     LQKISHEPSY ALSVSDFADL PSIQQQLFTN INTVLVERPP ITTTVIVESI GPKKDIVFVI
     DGSEGVGREF PIIQEFVRRV VENLNVGEKK IRVGVVQYGD LPHADIYLNS HKTKEGVLNG
     IKELRQRGGR QRNLGRAINF VSRDVLTSGH GGRKQEGVPQ FVVVVSGGKA TDSIRQSATE
     LKQSGVVPLS IGTRDVDTQE LQVTAYVPRF AYTVDDLPGL YTIQDTLINT LTELSSEELA
     KLRPEYPIDT RPVIPVPRGD KRDVVFLIDG TTKMRTEFPA IRDMVQRVVD KLDVGLDNVR
     VSVVQYSDDP KLEFLLNEHS TKEEVRQAVR RMRSKGGNRL NTGQALEYVS KNIYQRSAGS
     RVEDGVPQFL ILVTGGKSND DVSGPATQLK LSRVAPLAVG AHNADEEELK LISFSPELTY
     TIKDFQQLSA VEQELLTKVS TMTRDEISAP PKVTDLLNLG KKDIIFLIDG SDSVGQSGVA
     HIRDFILKVV DQLDVRPDQV RVALVQYGIG ADNADAGQLA QITTTADDVL KVATFPSLPT
     IQSKFISRLN GTIIVEAPTE EPASGLPQAK TADIVFLVDG SMNLGRDNFK EVMEFILNLI
     DLFFTERDNL RIGLAHYATD VTDVFYLNTY NNKDDIIDAI TRAEYKGGRE IKTGNAIRYV
     QKTHFVKERG SRKDEGIPQI LMVVTGGRSR DDSKSAALAL KASGVRIYAV GVGDIEDELN
     NLGSETTTVA RARTFQELSE LNEQILETLD DEVKGIGLCT GVKDATRECK LDVLVGFDVA
     SQGIFAAQKS LELKMTAILK RITQMQSISC TSGQVPSVQV GMLAMDSASE PVQLDFTNKY
     TELLEPFKAL RNRGPFLLNA ATMKAYADRL KSQPSDSVKV VIHLTDGLDA PYNILKERVE
     LLRSAGVSSL ILVALERVPR FEDAVLLEYG RGFRYTRPLR VNLMDLDYEL LEELDNIAER
     ECCSVPCKCN GQRGDRGAVG VSGLKGQPGG QGYSGHPGDE GGPGERGPPG VNGTQGFQGC
     PGQRGVKGSR GYNGEKGEIG EIGLDGINGE EGTSGVAGPP GDRGNPGQRG PKGAKGQKGD
     VGQTGIRGDP GIPGSDNNRR GPKGDPGDAG PPGAAGRPGS DGLPGETGIG GSRGPAGPIG
     APGVRGEDGN PGPRGPGGLP GSAGEKGRRG AVGRKGEPGE PGPKGVVGPL GPRGEPGEDG
     RDGFGIPGPK GRKGDEGFPG FPGPKGEAGD PGSNGGPGPR GNNGQRGTSG EPGGPGQKGD
     TGYPGPYQCN LVKKIRDNCP CCYGAQECPL YPTELAFALD ASDGVSRSAF NNMRDTVLRL
     VGDITIAESN CPRGARVALA LYNNEVTTEI RFADTQKKRA LIERVQGLQT QQTRKQRSLE
     TAMNFVAQNT FKRVRSGFLV RKVAVFFVNG PNTVTQEFSA AALRLYDAGI SSVFLLNRED
     RQLTRALQLN NTALAQVIVL PSPGSAEYNN VIKKIMTCHI CLDFCAPDQM CDYIPPRGVR
     DRRDPTTDLD IDMAFVVDSS ESTWPSVFTE TKQYVAQMIE QLEMSPDPAI STHHARVALV
     QHAPYEYLHN GSGLPISITF GLTDHKSANS VRSFLQNKMH QLEGGRALAD ALEGTVEHVF
     EKAPHPRHLK VLVLLVTGPV ELHEERIIRA ATEVKCKGYF IVVMAVGKLF SAGDVRVLSQ
     VASEPSDVFF KRVDRPSGFY DDHIQTFARL LPKYLGLENA FYMSPEVSKK CQWYQSDQPR
     KFPFNLPKTE EKHEKHHGHQ QVHDRKHKET GMEKMQLVNV TSSGFSLQWL SGDSKATHEV
     TVTRLRDHRL VLRKNVTGSH LSISELEAAE TYHVVVNTHS VGGHVASTYK GIVPTKSARM
     KVLTVNDVMG IAPTAPLSKP ETKTVELKVP SEAMGIVSTA PLSKPEIINN LMDPCSLDFD
     TGLPCKDYQA KWYFDRKNGF CTQFWYGGCG GNDNRFETEA DCLKRCMKKA IEDHVKSVHP
     PAPAPAALRS SVDICKLPKD EGSCAKFVLK WHYDPLSGNC ARFWYGGCGG NQNRFETQDE
     CEKTCGKAVP VKQGIIAAVK T
//
DBGET integrated database retrieval system