ID K9J850_XENTR Unreviewed; 1714 AA.
AC K9J850;
DT 06-FEB-2013, integrated into UniProtKB/TrEMBL.
DT 02-JUN-2021, sequence version 4.
DT 27-MAR-2024, entry version 70.
DE SubName: Full=Collagen alpha-2(XI) chain isoform X7 {ECO:0000313|RefSeq:XP_031747082.1};
DE SubName: Full=Collagen type XI alpha 2 pseudogene 1 {ECO:0000313|Ensembl:ENSXETP00000027837};
GN Name=col11a2 {ECO:0000313|RefSeq:XP_031747082.1,
GN ECO:0000313|Xenbase:XB-GENE-876669};
GN Synonyms=col11a2p1 {ECO:0000313|Ensembl:ENSXETP00000027837,
GN ECO:0000313|Xenbase:XB-GENE-876669};
OS Xenopus tropicalis (Western clawed frog) (Silurana tropicalis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Amphibia;
OC Batrachia; Anura; Pipoidea; Pipidae; Xenopodinae; Xenopus; Silurana.
OX NCBI_TaxID=8364 {ECO:0000313|Ensembl:ENSXETP00000027837};
RN [1] {ECO:0000313|Ensembl:ENSXETP00000027837}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Nigerian {ECO:0000313|Ensembl:ENSXETP00000027837};
RX PubMed=20431018; DOI=10.1126/science.1183670;
RA Hellsten U., Harland R.M., Gilchrist M.J., Hendrix D., Jurka J.,
RA Kapitonov V., Ovcharenko I., Putnam N.H., Shu S., Taher L., Blitz I.L.,
RA Blumberg B., Dichmann D.S., Dubchak I., Amaya E., Detter J.C., Fletcher R.,
RA Gerhard D.S., Goodstein D., Graves T., Grigoriev I.V., Grimwood J.,
RA Kawashima T., Lindquist E., Lucas S.M., Mead P.E., Mitros T., Ogino H.,
RA Ohta Y., Poliakov A.V., Pollet N., Robert J., Salamov A., Sater A.K.,
RA Schmutz J., Terry A., Vize P.D., Warren W.C., Wells D., Wills A.,
RA Wilson R.K., Zimmerman L.B., Zorn A.M., Grainger R., Grammer T.,
RA Khokha M.K., Richardson P.M., Rokhsar D.S.;
RT "The genome of the Western clawed frog Xenopus tropicalis.";
RL Science 328:633-636(2010).
RN [2] {ECO:0000313|Ensembl:ENSXETP00000027837}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (DEC-2012) to UniProtKB.
RN [3] {ECO:0000313|RefSeq:XP_031747082.1}
RP IDENTIFICATION.
RC STRAIN=Nigerian {ECO:0000313|RefSeq:XP_031747082.1};
RC TISSUE=Liver and blood {ECO:0000313|RefSeq:XP_031747082.1};
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_031747082.1; XM_031891222.1.
DR STRING; 8364.ENSXETP00000027837; -.
DR Ensembl; ENSXETT00000027837; ENSXETP00000027837; ENSXETG00000012724.
DR AGR; Xenbase:XB-GENE-876669; -.
DR Xenbase; XB-GENE-876669; col11a2.
DR eggNOG; KOG3544; Eukaryota.
DR HOGENOM; CLU_001074_2_1_1; -.
DR TreeFam; TF323987; -.
DR Proteomes; UP000008143; Chromosome 8.
DR Bgee; ENSXETG00000012724; Expressed in testis and 2 other cell types or tissues.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR CDD; cd00110; LamG; 1.
DR Gene3D; 2.60.120.1000; -; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000885; Fib_collagen_C.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR048287; TSPN-like_N.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF509; COLLAGEN ALPHA-2(XI) CHAIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 6.
DR Pfam; PF02210; Laminin_G_2; 1.
DR SMART; SM00038; COLFI; 1.
DR SMART; SM00282; LamG; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
PE 4: Predicted;
KW Collagen {ECO:0000313|RefSeq:XP_031747082.1};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000008143};
KW Secreted {ECO:0000256|ARBA:ARBA00022530};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..34
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 35..1714
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5041198328"
FT DOMAIN 1486..1713
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 245..292
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 309..403
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 439..1455
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 356..370
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 678..695
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 837..852
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1035..1050
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1126..1140
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1438..1454
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1714 AA; 170881 MW; 93B25CE0F956C92B CRC64;
MPRSQRQPRR FRVPGMPPLC SLALLLLLTQ LVKTQVAADP IDVLKGLQLH LQPEGVKKTT
GLCPVRRSGT GADVAFRISK QAQISAPTRQ LFPGKFPEDF SIMALVRAKN GLQSFLLSIY
SEQGTQQIGV ELGHSPVFLF EDQNGKPAPE DYPIFRGINL ADGKWHRIAL SVQKKSVTLI
LDCKKKITQQ LPRGNKPIVD TKGITVFGTR ILDEESFEGD IQQLLIASNP QAAYDYCEHY
SPDCDSLPNA PQAQDPNPRV DAKEQEPSMA AKGSDSFTEE YMTGEEDNPT GGLDYEYVYK
DYTEGSETTH LGPVLSAETP ESGGAFSGPR GIKGDKGEPA VMEPGMLVEG PPGAEGPAGL
PGPPGRHGHP GPVGDPGERG PPGRAGLPGA NGLPGQPGSS VMLPFRLGNS AGAKGPIVAA
QEAQAQAILQ QARLALRGPS GPMGYTGRSG PLGPPGSPGL KGESGDVGPQ GQRGPQGLAG
PPGKAGRRGR SGADGARGLP GEFGLKGDRG FDGLPGLPGE KGNRGDSGVQ GPLGGPGEDG
ERGADGEVGA RGLPGEPGVR GLLGPKGPPG IPGPPGVRGM DGHVGPKGNL GPQGEPGPPG
QQGTPGTQGL SGPQGPVGPP GEKGPHGKPG LPGIPGSDGP PGHPGKEGPA GTKGNLGPTG
PQGPIGYPGP RGGKGIDGIR GLKGHKGEKG EDGFPGFKGD VGIKGDKGEG GVPGSRGEDG
PEGPKGRSGP NGDPGPIGVI GEKGKLGVPG LPGYPGRQGA KGSLGFPGFP GANGEKGARG
LSGKSGPRGE RGPSGPRGQR GPRGATGKSG PKGTSGGDGP SGPLGERGLP GPQGSNGFPG
PKGPPGPPGK DGIPGHPGQR GEVGFQGKMG TTGPPGVVGS QGTQGESGPM GERGHPGPPG
PPGEQGLSGP SGKEGTKGDP GPTGVGGKDG PAGLRGFPGE RGLPGTPGGP GLKGNEGPAG
PPGPAGSPGE RGGPGQGGPI GPPGRPGPQG PAGAAGEKGV PGEKGPIGPA GRDGVQGPVG
LPGPAGPPGI SGEDGDKGEV GEHGQKGAKG NKGEHGPPGP SGPIGSVGQP GAAGADGEPG
PRGQQGVFGA KGDEGTRGFP GAPGPIGLQG LPGPYGEKGE TGDGGPMGPP GPPGPRGPAG
PSGADGPQGP PGGIGNLGPQ GEKGEPGETG VPGIKGEFGP KGPRGDRGEK GESGQPGDPG
PQGVKGPPGD DGPKGNPGPV GFPGDPGPPG EMGPRGQDGA KGDRGEDGEQ GEAGSPGPTG
ENGPPGPVGK RGPAGPSGPE GRQGEKGSKG EPGALGPPGK TGAVGPQGTA GKQGPEGLRG
LPGAVGEQGR PGATGQAGPP GPMGPSGLPG LKGDTGAKGE KGHPGLIGLI GPTGEQGEKG
DRGLPGPHGS GGQKGETGIP GVTGPIGPGG PAGSPGPQGP KGAKGATGQA GPKGQKGVQG
PPGPPGPPGE VIQPLPFQMP KKSKRSIDAS QIMADEPAAD YTDGMEEIYG SLNAIKKDIE
LMRTPMGTKE HPARTCQDLK LCHPELPDGE YWIDPNQGCT RDSFKVFCNF TAGESCIFPS
KDIEKVKMSS WSSEKRETWY SQYRSGRKFS YIDSEGNAIG VVQLTFLRLL STYVHQNFTY
HCHRSAAWHL TSSDSYQKAL HFRGSNEEDL SFHTTPYIKA LRDGCAMRKG TEKTVLEIYT
PRVEQLPLTD AMFMDFGEPN QKFGFEVGPV CFLG
//