ID A0A2K6D172_MACNE Unreviewed; 1506 AA.
AC A0A2K6D172;
DT 28-MAR-2018, integrated into UniProtKB/TrEMBL.
DT 28-MAR-2018, sequence version 1.
DT 27-MAR-2024, entry version 24.
DE SubName: Full=Collagen type XXIV alpha 1 chain {ECO:0000313|Ensembl:ENSMNEP00000029658.1};
GN Name=COL24A1 {ECO:0000313|Ensembl:ENSMNEP00000029658.1};
OS Macaca nemestrina (Pig-tailed macaque).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9545 {ECO:0000313|Ensembl:ENSMNEP00000029658.1, ECO:0000313|Proteomes:UP000233120};
RN [1] {ECO:0000313|Ensembl:ENSMNEP00000029658.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSMNET00000054070.1; ENSMNEP00000029658.1; ENSMNEG00000038307.1.
DR GeneTree; ENSGT00940000162448; -.
DR Proteomes; UP000233120; Unplaced.
DR Bgee; ENSMNEG00000038307; Expressed in temporal lobe.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProt.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR Gene3D; 2.60.120.200; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR048287; TSPN-like_N.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR Pfam; PF01391; Collagen; 13.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Reference proteome {ECO:0000313|Proteomes:UP000233120};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..37
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 38..1506
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5014356357"
FT DOMAIN 68..228
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT REGION 487..1436
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 493..507
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 520..537
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1235..1249
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1506 AA; 153521 MW; F7AD209A803B4B6A CRC64;
MHLGAHRTRR GKVSPTAKTK PLLHFIVLCV AGVVVHAQEQ GIDILHQLGL GGKDVRHSSS
ATAVPASSTP LPQGVHLIES GVILKNDAYI ETPFMKILPV NLGRPFTILT GLQSHRVNNA
FLFSIRNKNR LQLGVQLLPK KLVVHIRGKQ AVVFNYSVHD EQWHLFAITI RNQSVSMFVE
CGKKYFNTET ISEVQTFDSN SVFTLGSMNN NSVHFEGIVC QLDIIPSAEA SADYCRYVKQ
QCRQTDKYQP ETSLPHTTLI PTKILEHSPP PKRFAEKVLS EDTFTEGRSI PNIINNDSET
VYKRREHQIS RSQLSSLQSG NVSAVDLTNH GIQAKEMITE EDTQTNLSLS VTPHRISEAG
MNTKEKFSSL LNVSDNITQH DDRLTGLSLF KKMPSILPQI TQDAITNLKK AITANLHTNE
LMEMQPILNT SLYRVSNEPS VDNHLDLRKE GEFYPDATYP IENSYETELY DYYYYEDLNT
MLEMEYLRGP KGDTGPPGPP GPAGIPGPSG KRGPRGIPGP HGNPGLPGLP GPKGPKGDPG
FSPGQAVPGE KGDQGLSGLM GPPGMQGDKG LKGHPGLPGL RGEQGIPGFA GNIGSPGYPG
RQGLAGPEGH PGPKGARGFI GSPGEAGQLG PEGERGIPGI RGKKGFKGRQ GFPGDFGDRG
PAGLDGSPGL VGGTGPPGFP GLRGSVGPVG PIGPAGIPGP VGLSGNKGLP GIKGDKGEQG
TAGELGEPGY PGDKGAVGLP GPQGMRGKPG PSGFPGDIGI PGQNGPEGPK GLLGNRGPPG
PPGLKGTQGE EGPIGPFGEL GPRGKPGQKG YAGEPGPEGL KGEVGDQGNI GKIGETGPVG
LPGEVGTTGS IGEKGERGSP GPLGPQGEKG VMGYPGPPGV PGPIGPLGLP GHVGARGPPG
SQGPKGQRGP RGPDGLLGEQ GIQGAKGEKG DQGKRGPHGL IGKTGNPGER GVQGKPGLQG
LPGSTGDRGL PGEPGLRGLQ GDVGPPGEMG IEGPPGIEGE SGLQGEPGAK GDVGPAGSVG
EPGEPGLRGE PGAPGEEGLQ GKDGLKGASG GRGLPGEDGE KGDMGLPGII GPLGRSGQMG
LPGPEGIVGI PGQRGRLGKK GDKGQIGPTG EVGSRGSPGK IGKSGPKGAR GTRGAVGHLG
LMGPDGEPGI PGYRGHQGQP GLSGLPGPKG EKGYPGEDST VLGPPGPRGE PGPVGEQGER
GEPGAEGYKG HVGVPGLRGA TGQQGPPGEP GDQGEQGLKG ERGCEGHKGK KGAPGPSGKA
GIPGLQGLPG PKGIQGYHGA DGILGNPGKV GPPGKQGLPG IRGSPGRTGL AGAPGPPGVK
GSSGLPGSPG IQGPKGEQGL PGQPGIQGTR GHRGAQGDQG PCGEPGLKGQ PGEYGVQGLT
GFQGFPGPKG PEGDAGIVGI SGPKGPIGQR GNTGPLGREG IIGPTGRTGP RGEKGFRGET
VSLTKFVILV TLTLDFKLHE SKDYFTVIHY YISDTCRIKI KLQLLGRRFK LCGKRDLDSH
KNNGRL
//