ID M3WAG8_FELCA Unreviewed; 1186 AA.
AC M3WAG8;
DT 01-MAY-2013, integrated into UniProtKB/TrEMBL.
DT 11-DEC-2019, sequence version 4.
DT 24-JAN-2024, entry version 60.
DE SubName: Full=Collagen type XX alpha 1 chain {ECO:0000313|Ensembl:ENSFCAP00000008267.6};
GN Name=COL20A1 {ECO:0000313|Ensembl:ENSFCAP00000008267.6,
GN ECO:0000313|VGNC:VGNC:80073};
OS Felis catus (Cat) (Felis silvestris catus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Carnivora; Feliformia; Felidae; Felinae; Felis.
OX NCBI_TaxID=9685 {ECO:0000313|Ensembl:ENSFCAP00000008267.6, ECO:0000313|Proteomes:UP000011712};
RN [1] {ECO:0000313|Ensembl:ENSFCAP00000008267.6, ECO:0000313|Proteomes:UP000011712}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Abyssinian {ECO:0000313|Ensembl:ENSFCAP00000008267.6,
RC ECO:0000313|Proteomes:UP000011712};
RX PubMed=17975172; DOI=10.1101/gr.6380007;
RA Pontius J.U., Mullikin J.C., Smith D.R., Lindblad-Toh K., Gnerre S.,
RA Clamp M., Chang J., Stephens R., Neelam B., Volfovsky N., Schaffer A.A.,
RA Agarwala R., Narfstrom K., Murphy W.J., Giger U., Roca A.L., Antunes A.,
RA Menotti-Raymond M., Yuhki N., Pecon-Slattery J., Johnson W.E., Bourque G.,
RA Tesler G., O'Brien S.J.;
RT "Initial sequence and comparative analysis of the cat genome.";
RL Genome Res. 17:1675-1689(2007).
RN [2] {ECO:0000313|Ensembl:ENSFCAP00000008267.6, ECO:0000313|Proteomes:UP000011712}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Abyssinian {ECO:0000313|Ensembl:ENSFCAP00000008267.6,
RC ECO:0000313|Proteomes:UP000011712};
RA Hillier L.W., Warren W., Obrien S., Wilson R.K.;
RT "Sequence assembly of the Felis catus genome version 6.2.";
RL Submitted (SEP-2011) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|Ensembl:ENSFCAP00000008267.6}
RP IDENTIFICATION.
RC STRAIN=breed Abyssinian {ECO:0000313|Ensembl:ENSFCAP00000008267.6};
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AANG04000928; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; M3WAG8; -.
DR Ensembl; ENSFCAT00000008922.6; ENSFCAP00000008267.6; ENSFCAG00000008920.6.
DR VGNC; VGNC:80073; COL20A1.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000153769; -.
DR HOGENOM; CLU_002527_0_0_1; -.
DR Proteomes; UP000011712; Chromosome A3.
DR Bgee; ENSFCAG00000008920; Expressed in adult mammalian kidney and 7 other cell types or tissues.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 6.
DR CDD; cd01482; vWA_collagen_alphaI-XII-like; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 6.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF39; COLLAGEN ALPHA-1(XX) CHAIN; 1.
DR Pfam; PF00041; fn3; 5.
DR Pfam; PF00092; VWA; 1.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 6.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 4.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS50853; FN3; 6.
DR PROSITE; PS50234; VWFA; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000011712};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}.
FT DOMAIN 36..130
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 176..351
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 375..466
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 467..554
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 555..642
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 654..743
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 744..837
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT REGION 1..21
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 132..165
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 591..610
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1059..1186
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 140..156
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1186 AA; 125800 MW; 6C19C7816D39510C CRC64;
MPPPGCRRPS KTQKPFHEQE LGGGGVPAHK WLFMAAPSDS GRLRLAVLPE DRLQMKWSES
EGSSLGYLVQ VKPMAGDAEQ DVMLTTKAPK ATVGGLSPSK GYTVQIFRLT GSGTTLLAQR
EFVIEELKRP LRAAPEPTPS HEGSPAPEPP VASFPSQDPP TLASPQFRCT PPTPVDMIFL
VDGSWSIGHS HFQQVKDFLA SVIEPFDIGP DKVQVGLTQY SGDPQTEWDL STFGAKEDVL
AAVRSLRYKG GNTFTGLALS HVLEQNLRPG AGPRPEAATV VILVTDGKSQ DDARAAGRAL
KDLGVDIFAV GVKNADEAEL QLLASRPLDI TVHNVQDFPQ LGTLAGLLSR LVCQKVQGRS
RAPAGSPSSP DTLFTPTSLV VTQVTSSSVH LSWTPAPQLP LKYRIVWQPS RGGAPREVVV
EGPASSAELH NLTAGTQYLV SVLPVLRAGF GEGLRRLVTT APLPPPQALA LAAVTPRTIR
LTWQPSAGAT RYLVRCLSAS AKGREEGREV RVGQPEVLLD GLEPGRDYDV WVQSLRGAEA
SEARGIRART STLAPPGHLS FSDVSHDSAR VSWEGVSKPV RLFRISFASS DGSHSGEVEA
PGNATSATVG PLSSSTAYSV RVTCLYPGGS SSVLTGRLTT PSVSPLCLPG KVPSPSQLSV
TELPGDEVRL EWTAAAASGV LVYQIKWTPL GDGKAHEISV PGHRDVAVLP GLRSHLEYEI
TILAYYRDGA RSDPVSLRYT PRSPPSDLAL ASKSPDSLQV SWTPPSGHVL HYRLTYAPAS
GSGPEKSISV PGLSNHVTLP NLLAASKYRV LVSAVYGAGE SRTVSAIGHT GEWAPRPPGF
DLMAAFGLVE KEYASIRGVA MEPSAFGRAR TFTLFKDAQL TRRASDVHLA ALPPEHTVVF
LLRLLPETPR ETFALWQMTA EHSQPVLGVL LDAGRKSLTY FNHDPRATLQ EVTFDLPEVR
RIFFGSFHKV HVAVGRSKVR LYVDCRKVAE RPMGEAGSPP VTGFVMLGRL AKARGPRSSS
ATFQLQTLQI VCNDSWAEED MCCELPASKD GETCPAFSSS CACSSQTPGP PGPQGPPVSP
SSAPRVTWAP GKGPPGVKGE KGDHGAPGLQ GHPGHQGPPG KVGLQGPKGG GAAATPSAPL
QGFQGTAGAR GSGGQRGLPG PVGLTVSPLP TYRPAAQQRW EAGEQI
//