ID M3WHG4_FELCA Unreviewed; 3178 AA.
AC M3WHG4;
DT 01-MAY-2013, integrated into UniProtKB/TrEMBL.
DT 10-OCT-2018, sequence version 3.
DT 27-MAR-2024, entry version 68.
DE SubName: Full=Collagen type VI alpha 3 chain {ECO:0000313|Ensembl:ENSFCAP00000011794.5};
GN Name=COL6A3 {ECO:0000313|Ensembl:ENSFCAP00000011794.5,
GN ECO:0000313|VGNC:VGNC:61068};
OS Felis catus (Cat) (Felis silvestris catus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Carnivora; Feliformia; Felidae; Felinae; Felis.
OX NCBI_TaxID=9685 {ECO:0000313|Ensembl:ENSFCAP00000011794.5, ECO:0000313|Proteomes:UP000011712};
RN [1] {ECO:0000313|Ensembl:ENSFCAP00000011794.5, ECO:0000313|Proteomes:UP000011712}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Abyssinian {ECO:0000313|Ensembl:ENSFCAP00000011794.5,
RC ECO:0000313|Proteomes:UP000011712};
RX PubMed=17975172; DOI=10.1101/gr.6380007;
RA Pontius J.U., Mullikin J.C., Smith D.R., Lindblad-Toh K., Gnerre S.,
RA Clamp M., Chang J., Stephens R., Neelam B., Volfovsky N., Schaffer A.A.,
RA Agarwala R., Narfstrom K., Murphy W.J., Giger U., Roca A.L., Antunes A.,
RA Menotti-Raymond M., Yuhki N., Pecon-Slattery J., Johnson W.E., Bourque G.,
RA Tesler G., O'Brien S.J.;
RT "Initial sequence and comparative analysis of the cat genome.";
RL Genome Res. 17:1675-1689(2007).
RN [2] {ECO:0000313|Ensembl:ENSFCAP00000011794.5, ECO:0000313|Proteomes:UP000011712}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Abyssinian {ECO:0000313|Ensembl:ENSFCAP00000011794.5,
RC ECO:0000313|Proteomes:UP000011712};
RA Hillier L.W., Warren W., Obrien S., Wilson R.K.;
RT "Sequence assembly of the Felis catus genome version 6.2.";
RL Submitted (SEP-2011) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|Ensembl:ENSFCAP00000011794.5}
RP IDENTIFICATION.
RC STRAIN=breed Abyssinian {ECO:0000313|Ensembl:ENSFCAP00000011794.5};
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AANG04002752; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 9685.ENSFCAP00000011794; -.
DR PaxDb; 9685-ENSFCAP00000011794; -.
DR Ensembl; ENSFCAT00000012724.6; ENSFCAP00000011794.5; ENSFCAG00000012720.6.
DR VGNC; VGNC:61068; COL6A3.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000156462; -.
DR HOGENOM; CLU_000182_1_0_1; -.
DR InParanoid; M3WHG4; -.
DR OMA; KGGRQAN; -.
DR Proteomes; UP000011712; Chromosome C1.
DR Bgee; ENSFCAG00000012720; Expressed in uterus and 10 other cell types or tissues.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0062023; C:collagen-containing extracellular matrix; IBA:GO_Central.
DR GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR GO; GO:0042383; C:sarcolemma; IEA:Ensembl.
DR GO; GO:0004867; F:serine-type endopeptidase inhibitor activity; IEA:InterPro.
DR GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR GO; GO:0051402; P:neuron apoptotic process; IEA:Ensembl.
DR GO; GO:0043491; P:phosphatidylinositol 3-kinase/protein kinase B signal transduction; IEA:Ensembl.
DR GO; GO:0009411; P:response to UV; IEA:Ensembl.
DR CDD; cd00063; FN3; 1.
DR CDD; cd22629; Kunitz_collagen_alpha3_VI; 1.
DR CDD; cd01481; vWA_collagen_alpha3-VI-like; 4.
DR CDD; cd01450; vWFA_subfamily_ECM; 2.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 4.10.410.10; Pancreatic trypsin inhibitor Kunitz domain; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 12.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR002223; Kunitz_BPTI.
DR InterPro; IPR036880; Kunitz_BPTI_sf.
DR InterPro; IPR020901; Prtase_inh_Kunz-CS.
DR InterPro; IPR041900; vWA_collagen_alpha3-VI-like.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF70; PH DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01391; Collagen; 2.
DR Pfam; PF00014; Kunitz_BPTI; 1.
DR Pfam; PF00092; VWA; 12.
DR PRINTS; PR00759; BASICPTASE.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00131; KU; 1.
DR SMART; SM00327; VWA; 12.
DR SUPFAM; SSF57362; BPTI-like; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 1.
DR SUPFAM; SSF53300; vWA-like; 12.
DR PROSITE; PS00280; BPTI_KUNITZ_1; 1.
DR PROSITE; PS50279; BPTI_KUNITZ_2; 1.
DR PROSITE; PS50853; FN3; 1.
DR PROSITE; PS50234; VWFA; 12.
PE 4: Predicted;
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000011712};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..25
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 26..3178
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5016438570"
FT DOMAIN 39..213
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 242..415
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 445..620
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 639..816
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 837..1009
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1029..1205
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1233..1404
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1436..1609
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1639..1812
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1838..2024
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 2402..2581
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 2619..2815
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 2991..3082
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 3113..3163
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT REGION 1612..1634
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2041..2373
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2901..2992
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3074..3094
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2132..2169
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2291..2313
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3178 AA; 342999 MW; 6C650A1F82DCAEC5 CRC64;
MRKHRHLPLV ATFCLFLSGF SLTRAQQQQA DVKNGAAADI IFLVDSSWSI GKEHFQLVRE
FLYDVIKSLA VGENDFHFAL VQFNGNPHTE FRLNTYRTNQ EVLSHISNMS YIGGSNQTGK
GLEYVMQNHL TEAAGSRASD GVPQVIVVLT GGHSDDGLAL PSAELKSADV NVFAIGVEDS
DEGALREIAS EPLNMHVFNL ENFTSLHDIV GSLVSCVQSS VAPEGAGGTE TFKDITAQDS
ADIIFLIDGS NNTGSVHFAV IRDFLVNLLE RLSVGTQQIR VGVVQYSDEP RTMFSLDTYS
TKVQVLDAVK ALAFTGGELA NIGLALDFVV ENHFTRAGGS RVEEGVPQVL VLISAGPSSD
EIRDGVIALK QASVFSFGLG TQAASRAELQ HIATSDNLVF TVPEFRSFGD LQEQLLLYIV
GVAQRRIVLQ PPTIVTEVIE VNKRDIVFLV DGSSALGLAN FNAIRDFIAK VIQRLEIGQD
LIQVAVAQYA DTVRPEFYFN SYPSKREVIA AVRRMKPMEG SALYTGSALD FVRNNLFTSS
AGYRAAEGVP KLLVLVTGGK SLDEISQPAQ ELKRSSIMSF AIGNKVADQA ELEEIAFDSS
LVFIPAEFRA APLQGVLPGL LAPLRTLTGT TEVHVNKRDI IFLLDGSFNV GKTDFPYVRD
FVMNVVNSLD VGSDNIRVGL VQFSDTPVTE FSLNTYQTKS DLLAHLRQLQ PKGGSGLNTG
AALSYVHANH FTEAGGSRIR EHVPQLLLLV TAGQSEDSYG QAANALARAG VLTFCVGASQ
ANKAELEQIA FNPSLVYLMD DFSSLPALPQ QLIQPLTTYV SGGVEEVPFA QPESKRDILF
LFDGSANLVG QFPAVRDFLY KVIDELDVKP DGTRIAVAQY SDDVRVESRF DEHQNKPEIL
NLVKRMKIKT GKALNLGYAL EYAQRYIFVK SAGSRIEDGV VQFLVLLVAG RSSDRLDTPA
LSLKQSQVVT FILQAKNADP AELELMVPSP VFILAAESLP KIGDLQPQIV NLLKSVQDGA
PTPVSGEKDV VFLIDGSEGV RSGFPLLKEF VQRVVESLDV GPDRVRVAVV QYSDRTRPEF
YLNSYMDQQS VVSAIRKLAL LGGPAPNTGA ALDFVLRNIL ISSAGSRIAE GVPQLLIVLT
ADRSGDDVRG PSVVLKRGGA VPIGIGIGNA DITEMQTISF IPDFAVAIPT FRQLGTVQQV
ISDRVIQLSR EELSRLQPVL LPPTIPGVGN KKDVVFLIDG SQNAGPEFQY IRTLIERLVD
YLDVGFDTTR VAVIQFSEDP RVEFLLNAHS SKDEVQNAVR RLRPKGGRQI NIGGALEYVS
RNIFKRPLGS RIEEGVPQFL VLISSGKSDD EVDDSAAELK QFGVAPFAVA RNADQEQLVK
ISLSPEYVFS VSTFRELPSL EQKLLTPITT LTSEQIQRIL ASTRYPSPDV ESDAADIVFL
IDSSDNVRPD GIAHIRDFVS RIVRRLNIGP NKVRIGVVQF SNEVFPEFYL KTHRSQAPVL
DAIRRLRFKG GSPLNTGKAL EFVARNLFVK SAGSRIEDGV PQHLVLFLGG KSQDDISRFS
QVISSSGIVS LGVGDRNVDR TELQTITNDP RLVFTVREFR DLPNIEEKIM NSFGPSGITP
APPGVDTPSP SRPEKKKADV VFLLDGSINF RRDSFQEVLR FVSEIVDTLY EGGDSIQVGL
VQYNSDPTDE FFLRDFTTKQ QIIDAINKVV YKGGRHANTK VGLQHLRLNH FVPEAGSRLD
QRVPQIAFVI TGGKSVEDAQ EASLALTQRG VKVFAVGVKN IDSEEIGKIA SNSATAFRVG
NVQELSELSE QVLETLHDAM HETLCPGVTD VSKVCNLDVI LGFDGSRDQN VFVTQKGLES
KVDTILNRIS QMQRISCTGG QMPTVRVSVV ANTPSGPVEA FDFAEYQPEL FEKFQNMRSQ
HPYVLTADTL KVYQNKFRQS SADNVKVVIH FTDGVDGDLA DLQRASEDLR QEGVQALILV
GLERVANLEQ LMQLEFGRGF MYNRPLRLNL LDLDYELAEQ LDNIAEKACC GVPCKCSGQR
GDRGPIGSIG PKGIPGEDGY RGYPGDEGGP GERGPPGVNG TQGFQGCPGQ RGIKGSRGFP
GEKGELGEIG LDGLNGEDGD KGLPGSPGEK GSPGRRGDKG PKGDKGERGD VGIRGDPGDS
GRDSQQRGPK GETGDIGPMG LPGRDGVSGR PGETGKDGGF GRRGPAGAKG NKGGPGQPGS
VGEQGTRGGQ GPPGPTGPPG LIGEQGISGP RGSGGTAGAP GERGRVGPLG RKGEPGEPGP
KGGVGSRGPR GETGDDGRDG IGGEGRKGRK GERGFPGYPG PKGAPGEPGT DGGPGPKGIR
GRRGNSGPPG AIGQKGDPGY PGPSGPKGNR GDSMDQCALV QSIKDKCPCC YGPLECPVFP
TELAFALDTS EGVTQDTFSR MRDVLLSIVG DLTIAESNCP RGARVAVVTY NNEVTTEIRF
ADSKKKSVLL DKIKNLQVAL TSKQQSLETA MSFVARNTFK RVRNGFLMRK VAVFFSNKPT
RASPQLREAV LKLSDAGITP LFLTSQEDRQ LINALQINNT AVGHALVLPP RRDLTNFLKT
VLTCHVCLDI CNIDPSCGFG SWRPSFRDRR AAGSDADIDL AFVLDSSEST SLFQFNEMRK
YIGYLVRQLD LSPDPKAAQH FARVAVVQHA PYESAGNASV PPVKVEVSLT DYGSKEKLVD
FLSSRMTQLQ GTRDLGRAIQ YTIENVFESA PNPRDMKVVV LMLTGEVDKQ QLEEAQGVIL
QAKCKGYFFV ILGIGRKVNV KELYNFASEP SDVFFKLLDK STELNEEPLM RFGRLLPSFV
SSENAFYLSP DIRKQCDWFQ GDQLTKNPVK FGHKQLNIPN NVTSSPASKL VTTAKPVAST
EAVTTTTKPV TVVNLPATKP VAAKPAAPKP AASRPVSGRP VAAKPEATKT ATVRPAVAAK
PAAAKPAAAK PAAVRPPAAA KPAAAKPEAA KPQATKLATS KTDAAKPVVK ASREVQVSDV
TENSARIHWE RPEPPSPYFY DLTVTSAHDQ SLVLRQNLSV TERAIGGLLA GHTYHVVVLC
YLKSQVRAAY QGSFTTKKAQ PPPSPPQARS ASSSTINLMV STEPLAGGEA DICKLPKEEG
TCRKFILKWY YDADTKSCAR FWYGGCGGNE NRFNSQKECE KVCAPALVNP GVMAAMGT
//