ID K7D4M4_PANTR Unreviewed; 1339 AA.
AC K7D4M4;
DT 09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT 09-JAN-2013, sequence version 1.
DT 27-MAR-2024, entry version 44.
DE SubName: Full=Collagen, type XVIII, alpha 1 {ECO:0000313|EMBL:JAA40833.1};
GN Name=COL18A1 {ECO:0000313|EMBL:JAA40833.1};
OS Pan troglodytes (Chimpanzee).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Pan.
OX NCBI_TaxID=9598 {ECO:0000313|EMBL:JAA40833.1};
RN [1] {ECO:0000313|EMBL:JAA40833.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Skeletal muscle {ECO:0000313|EMBL:JAA40833.1};
RA Maudhoo M.D., Meehan D.T., Norgren R.B.Jr.;
RT "De novo assembly of the reference chimpanzee transcriptome from NextGen
RT mRNA sequences.";
RL Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GABE01003906; JAA40833.1; -; mRNA.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProt.
DR CDD; cd00247; Endostatin-like; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1034; COLLAGEN ALPHA-1(XVIII) CHAIN; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 2: Evidence at transcript level;
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:JAA40833.1};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..33
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 34..1339
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003899901"
FT DOMAIN 41..229
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT REGION 230..1028
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1096..1140
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 301..325
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 349..398
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 446..460
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 488..502
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 523..547
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 589..607
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 838..854
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 905..922
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 937..951
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 960..1022
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1101..1115
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1339 AA; 135943 MW; E99BFC558B66DBB7 CRC64;
MAPRCPWPWP RRRRLLDVLA PLVLLLGVRA ASAEPERVSE EVGLLQLLGD PPPQQVTQTD
DPDVGLAYVF GPDANSGQVA RYHFPSLFFR DFSLLFHIRP ATEGPGVLFA ITDSAQAVVL
LGVKLSGVQD GHQDISLLYT EPGAGQTYTA ASFRLPAFVG QWTHLALSVA GGFVALYVDC
EEFQRMPLAR SSRGLELEPG AGLFVAQAGG ADPDKFQGVI AELKVRRDPQ VSPMHCLDEE
GDDSDGASGD SGSGLGDTRE LLREETGVAL KPRLPTPPPV TTPPLAGGSS TEDSRSEEIE
EQTTVASLGA QTLPGSDSVS TWDGSVRTPG GRVKEGGLKG QKGEPGVPGP PGRAGPPGSP
CLPGPPGLPC PVSPLGPAGP ALQPVPGPQG PPGPPGRDGT PGRDGEPGDP GEDGKPGDTG
PQGFPGTPGD VGPKGDKGDP GVGERGPPGP QGPPGPPGPS FRHDKLTFID MEGSGFGGDL
EALRGPRGFP GPPGPPGVPG LPGEPGRFGV NSSDVPGPAG LPGVPGREGP PGFPGLPGPP
GPPGREGPPG RTGQKGSLGE AGAPGHKGSK GDPGPAGARG ESGLAGAPGP AGPPGPPGPP
GPPGPGLPAG FDDMEGSGGP FWSTARSAAG PQGPPGLPGL KGDPGVPGLP GAKGEVGADG
VPGFPGLPGR EGIAGPQGPK GDRGSQGEKG DPGKDGVGQP GLPGPPGPPG PVVYVSEQDG
SVLSVPGPEG RPGFAGFPGP AGPKGNLGSK GEQGSPGPKG EKGEPGSIFS PDGGALGPAQ
KGAKGEPGFR GPPGPYGRPG YKGEIGFPGR PGRPGMNGLK GEKGEPGDAS LGFGMRGMPG
PPGPPGPPGP PGTPVYDSNV FAESSRPGPP GLPGNQGPPG PKGAKGEVGP PGPPGQFPFD
FLQLEAEMKG EKGDRGDAGQ KGERGEPGGG GFFSSSLPGP PGPPGPRGYP GIPGPKGESI
RGQPGPPGPQ GPPGIGYEGR QGPPGPPGPP GPPGPPSFPG PHRQTISVPG PPGPPGPPGP
PGTMGASSGV RLWATRQAML GQVHEVPEGW LIFVAEQEEL YVRVRNGFRK VQLEARTPLP
RGTDNEVAAL QPPVVQLHDS NPYPRREHPH PTARPWRADD ILASPPRLPE PQPYPGAPHH
SSYVHLRPAR PTSPPAHTHR DFQPVLHLVA LNSPLSGSMR GIRGADFQCF QQARAVGLAG
TFRAFLSSRL QDLYSIVRRA DRTAVPIVNL KDELLFPSWE ALFSGSEGPL KPGARIFSFD
GKDVLRHPTW PQKSVWHGSD PNGRRLTESY CETWRTEAPS ATGQASSLLG GRLLGQSAVS
CHHAYIVLCI ENSFMTASK
//