ID H9ERW3_MACMU Unreviewed; 1629 AA.
AC H9ERW3;
DT 16-MAY-2012, integrated into UniProtKB/TrEMBL.
DT 16-MAY-2012, sequence version 1.
DT 27-MAR-2024, entry version 41.
DE SubName: Full=Collagen alpha-2(XI) chain isoform 3 preproprotein {ECO:0000313|EMBL:AFE65122.1};
GN Name=COL11A2 {ECO:0000313|EMBL:AFE65122.1};
OS Macaca mulatta (Rhesus macaque).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9544 {ECO:0000313|EMBL:AFE65122.1};
RN [1] {ECO:0000313|EMBL:AFE65122.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Caudate {ECO:0000313|EMBL:AFE65122.1};
RX PubMed=25319552; DOI=10.1186/1745-6150-9-20;
RA Zimin A.V., Cornish A.S., Maudhoo M.D., Gibbs R.M., Zhang X., Pandey S.,
RA Meehan D.T., Wipfler K., Bosinger S.E., Johnson Z.P., Tharp G.K.,
RA Marcais G., Roberts M., Ferguson B., Fox H.S., Treangen T., Salzberg S.L.,
RA Yorke J.A., Norgren R.B.Jr.;
RT "A new rhesus macaque assembly and annotation for next-generation
RT sequencing analyses.";
RL Biol. Direct 9:20-20(2014).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JU321366; AFE65122.1; -; mRNA.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProt.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR CDD; cd00110; LamG; 1.
DR Gene3D; 2.60.120.1000; -; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000885; Fib_collagen_C.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR048287; TSPN-like_N.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF509; COLLAGEN ALPHA-2(XI) CHAIN; 1.
DR Pfam; PF01410; COLFI; 2.
DR Pfam; PF01391; Collagen; 7.
DR Pfam; PF02210; Laminin_G_2; 1.
DR SMART; SM00038; COLFI; 1.
DR SMART; SM00282; LamG; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
PE 2: Evidence at transcript level;
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:AFE65122.1};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022530};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..1629
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003619046"
FT DOMAIN 1434..1628
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 228..358
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 378..1434
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 233..266
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 620..637
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 767..794
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1068..1082
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1629 AA; 159796 MW; EFEF7AB941236A87 CRC64;
MERCSRCHRL LLLLPLVLGL SAAPGWAGAP PVDVLRALRF PSLPDGVQRT KGICPADVAY
RVARPAQLSA PTRQLFPGGF PKDFSLLTVV RTRPGLQAPL LTLYSAQGVQ QLGLELGRPV
HFLYEDQTGR PQPPAQPVFR GLSLADGKWH RVAVAVKGQS VTLIVDCKKR VTRPLPRSAR
PVLDTHGVII FGARILDEEV FEGDVQELAI VPGVQAAYES CEQKELECEG GQRARPQNQQ
PHRAQRSPEQ QPSRLHRPQN QEPQRQAAHG PRGLKGEKGE PAVLEPGMLV EGPPGPEGPA
GLIGPPGIQG NPGPVGDPGE RGPPGRAGLP GSDGAPGPPG TSLMLPFRFG SGGGDKGPVV
AAQEAQAQAI LQQARLALRG PPGPMGYTGR PGPLGQPGSP GLKGESGDLG PQGPRGPQGL
TGPPGKAGRR GRAGADGARG MPGEPGVKGD RGFDGLPGLP GEKGHRGDTG AQGLPGPPGE
DGERGDDGEI GPRGLPGESG PRGLLGPKGP PGIPGPPGVR GMDGPQGPKG SLGPQGEPGP
PGQQGTPGTQ GLPGPQGAIG PHGEKGPQGK PGLPGMPGSD GPPGHPGKEG PPGTKGNQGP
SGPQGPLGYP GPRGVKGVDG IRGLKGHKGE KGEDGFPGFK GDIGVKGDRG EVGVPGSRGE
DGPEGPKGRS GPTGDPGPPG LMGEKGKLGV PGLPGYPGRQ GPKGSLGFPG FPGASGEKGA
RGLSGKSGPR GERGPTGPRG QRGPRGATGK SGAKGTSGGD GPHGPPGERG LPGPQGPNGF
PGPKGPPGPP GKDGLPGHPG QRGEVGFQGK TGPPGPPGVV GPQGAAGETG PMGERGHPGP
PGPPGEQGLP GTAGKEGTKG DPGPPGAPGK DGPAGLRGFP GERGLPGTAG GPGLKGNEGP
SGPPGPAGSP GERGAAGSGG PIGPPGRPGP QGPPGAAGEK GVPGEKGPTG PTGRDGVQGP
VGLPGPAGPP GVAGEDGDKG EVGDPGQKGT KGNKGEHGPP GPPGPIGPVG QPGAAGADGE
PGARGPQGHF GAKGDEGTRG FNGPPGPIGL QGLPGPSGEK GETGDVGPMG PPGPPGPRGP
AGPNGADGPQ GPPGGVGNLG PPGEKGEPGE SGSPGIQGEP GVKGPRGERG EKGESGQPGE
PGPPGPKGPT GDDGPKGNPG PVGFPGDPGP PGEGGPRGQD GAKGDRGEDG EPGQPGSPGP
TGENGPPGPL GKRGPAGSPG PEGRQGGKGA KGDPGAVGAP GKTGPVGPAG PAGKPGPDGL
RGLPGSVGQQ GRPGATGQAG PPGPVGPPGL PGLRGDAGAK GEKGHPGLIG LIGPPGEQGE
KGDRGLPGPQ GSPGQKGETG IPGASGPIGP GGPPGLPGPA GPKGAKGATG PAGPKGEKGV
QGPPGHPGPP GEVIQPLPIQ MPKKTRRSVD GSRLMQEDEA IPTGGAPGSP GGLEEIFGSL
DSLRGEIEQM RRPTGTQDSP ARTCQDLKLC HPELPDGEYW VDPNQGCARD AFRVFCNFTA
GGETCVTPRD DVTQFSYVDS EGSPVGVVQL TFLRLLNVSA HQDISYPCSG AARNGPLRLR
GANEDELSPE TSPYVKEFRD GCQTQQGRTV LEVRTPVLEQ LPVLDASFSD LGTPPRRGGV
LLGPVCFMG
//