ID A0AAG2UW65_HUMAN Unreviewed; 1336 AA.
AC A0AAG2UW65;
DT 24-JUL-2024, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2024, sequence version 1.
DT 28-JAN-2026, entry version 8.
DE SubName: Full=Collagen type XVIII alpha 1 chain {ECO:0000313|Ensembl:ENSP00000518184.1};
DE SubName: Full=Collagen, type XVIII, alpha 1, isoform CRA_d {ECO:0000313|EMBL:EAX09341.1};
GN Name=COL18A1 {ECO:0000313|EMBL:EAX09341.1,
GN ECO:0000313|Ensembl:ENSP00000518184.1};
GN ORFNames=hCG_1647196 {ECO:0000313|EMBL:EAX09341.1};
OS Homo sapiens (Human).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Homo.
OX NCBI_TaxID=9606 {ECO:0000313|Ensembl:ENSP00000518184.1, ECO:0000313|Proteomes:UP000005640};
RN [1] {ECO:0000313|EMBL:EAX09341.1}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=11181995; DOI=10.1126/science.1058040;
RA Venter J.C., Adams M.D., Myers E.W., Li P.W., Mural R.J., Sutton G.G.,
RA Smith H.O., Yandell M., Evans C.A., Holt R.A., Gocayne J.D., Amanatides P.,
RA Ballew R.M., Huson D.H., Wortman J.R., Zhang Q., Kodira C.D., Zheng X.H.,
RA Chen L., Skupski M., Subramanian G., Thomas P.D., Zhang J.,
RA Gabor Miklos G.L., Nelson C., Broder S., Clark A.G., Nadeau J.,
RA McKusick V.A., Zinder N., Levine A.J., Roberts R.J., Simon M., Slayman C.,
RA Hunkapiller M., Bolanos R., Delcher A., Dew I., Fasulo D., Flanigan M.,
RA Florea L., Halpern A., Hannenhalli S., Kravitz S., Levy S., Mobarry C.,
RA Reinert K., Remington K., Abu-Threideh J., Beasley E., Biddick K.,
RA Bonazzi V., Brandon R., Cargill M., Chandramouliswaran I., Charlab R.,
RA Chaturvedi K., Deng Z., Di Francesco V., Dunn P., Eilbeck K.,
RA Evangelista C., Gabrielian A.E., Gan W., Ge W., Gong F., Gu Z., Guan P.,
RA Heiman T.J., Higgins M.E., Ji R.R., Ke Z., Ketchum K.A., Lai Z., Lei Y.,
RA Li Z., Li J., Liang Y., Lin X., Lu F., Merkulov G.V., Milshina N.,
RA Moore H.M., Naik A.K., Narayan V.A., Neelam B., Nusskern D., Rusch D.B.,
RA Salzberg S., Shao W., Shue B., Sun J., Wang Z., Wang A., Wang X., Wang J.,
RA Wei M., Wides R., Xiao C., Yan C., Yao A., Ye J., Zhan M., Zhang W.,
RA Zhang H., Zhao Q., Zheng L., Zhong F., Zhong W., Zhu S., Zhao S.,
RA Gilbert D., Baumhueter S., Spier G., Carter C., Cravchik A., Woodage T.,
RA Ali F., An H., Awe A., Baldwin D., Baden H., Barnstead M., Barrow I.,
RA Beeson K., Busam D., Carver A., Center A., Cheng M.L., Curry L.,
RA Danaher S., Davenport L., Desilets R., Dietz S., Dodson K., Doup L.,
RA Ferriera S., Garg N., Gluecksmann A., Hart B., Haynes J., Haynes C.,
RA Heiner C., Hladun S., Hostin D., Houck J., Howland T., Ibegwam C.,
RA Johnson J., Kalush F., Kline L., Koduru S., Love A., Mann F., May D.,
RA McCawley S., McIntosh T., McMullen I., Moy M., Moy L., Murphy B.,
RA Nelson K., Pfannkoch C., Pratts E., Puri V., Qureshi H., Reardon M.,
RA Rodriguez R., Rogers Y.H., Romblad D., Ruhfel B., Scott R., Sitter C.,
RA Smallwood M., Stewart E., Strong R., Suh E., Thomas R., Tint N.N., Tse S.,
RA Vech C., Wang G., Wetter J., Williams S., Williams M., Windsor S.,
RA Winn-Deen E., Wolfe K., Zaveri J., Zaveri K., Abril J.F., Guigo R.,
RA Campbell M.J., Sjolander K.V., Karlak B., Kejariwal A., Mi H., Lazareva B.,
RA Hatton T., Narechania A., Diemer K., Muruganujan A., Guo N., Sato S.,
RA Bafna V., Istrail S., Lippert R., Schwartz R., Walenz B., Yooseph S.,
RA Allen D., Basu A., Baxendale J., Blick L., Caminha M., Carnes-Stine J.,
RA Caulk P., Chiang Y.H., Coyne M., Dahlke C., Mays A., Dombroski M.,
RA Donnelly M., Ely D., Esparham S., Fosler C., Gire H., Glanowski S.,
RA Glasser K., Glodek A., Gorokhov M., Graham K., Gropman B., Harris M.,
RA Heil J., Henderson S., Hoover J., Jennings D., Jordan C., Jordan J.,
RA Kasha J., Kagan L., Kraft C., Levitsky A., Lewis M., Liu X., Lopez J.,
RA Ma D., Majoros W., McDaniel J., Murphy S., Newman M., Nguyen T., Nguyen N.,
RA Nodell M., Pan S., Peck J., Peterson M., Rowe W., Sanders R., Scott J.,
RA Simpson M., Smith T., Sprague A., Stockwell T., Turner R., Venter E.,
RA Wang M., Wen M., Wu D., Wu M., Xia A., Zandieh A., Zhu X.;
RT "The sequence of the human genome.";
RL Science 291:1304-1351(2001).
RN [2] {ECO:0000313|EMBL:EAX09341.1}
RP NUCLEOTIDE SEQUENCE.
RA Mural R.J., Istrail S., Sutton G., Florea L., Halpern A.L., Mobarry C.M.,
RA Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., Flanigan M.J.,
RA Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., Hannenhalli S.,
RA Turner R., Yooseph S., Lu F., Nusskern D.R., Shue B.C., Zheng X.H.,
RA Zhong F., Delcher A.L., Huson D.H., Kravitz S.A., Mouchard L., Reinert K.,
RA Remington K.A., Clark A.G., Waterman M.S., Eichler E.E., Adams M.D.,
RA Hunkapiller M.W., Myers E.W., Venter J.C.;
RL Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|Ensembl:ENSP00000518184.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (MAY-2025) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC matrix {ECO:0000256|ARBA:ARBA00004498}.
CC -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC {ECO:0000256|ARBA:ARBA00061275}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CH471079; EAX09341.1; -; Genomic_DNA.
DR SMR; A0AAG2UW65; -.
DR Ensembl; ENST00000710294.1; ENSP00000518184.1; ENSG00000292234.1.
DR GeneID; 80781; -.
DR CTD; 80781; -.
DR Proteomes; UP000005640; Unplaced.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProtKB-ARBA.
DR GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR CDD; cd00247; Endostatin-like; 1.
DR FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 1: Evidence at protein level;
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:EAX09341.1};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW Proteomics identification {ECO:0007829|PeptideAtlas:A0AAG2UW65};
KW Reference proteome {ECO:0000313|Proteomes:UP000005640};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..33
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 34..1336
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5044707639"
FT DOMAIN 41..229
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT REGION 230..1025
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1093..1138
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 257..266
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 302..323
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 347..374
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 400..416
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 418..431
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 447..459
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 489..499
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 515..527
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 531..546
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 561..588
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 590..606
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 638..650
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 680..694
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 702..711
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 726..738
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 839..853
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 867..881
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 906..926
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 938..947
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 983..996
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1006..1018
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1101..1117
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1123..1133
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1336 AA; 135510 MW; 16B7FB340F2B3CB6 CRC64;
MAPRCPWPWP RRRRLLDVLA PLVLLLGVRA ASAEPERISE EVGLLQLLGD PPPQQVTQTD
DPDVGLAYVF GPDANSGQVA RYHFPSLFFR DFSLLFHIRP ATEGPGVLFA ITDSAQAMVL
LGVKLSGVQD GHQDISLLYT EPGAGQTHTA ASFRLPAFVG QWTHLALSVA GGFVALYVDC
EEFQRMPLAR SSRGLELEPG AGLFVAQAGG ADPDKFQGVI AELKVRRDPQ VSPMHCLDEE
GDDSDGASGD SGSGLGDARE LLREETGAAL KPRLPAPPPV TTPPLAGGSS TEDSRSEEVE
EQTTVASLGA QTLPGSDSVS TWDGSVRTPG GRVKEGGLKG QKGEPGVPGP PGRAGPPGSP
CLPGPPGLPC PVSPLGPAGP ALQTVPGPQG PPGPPGRDGT PGRDGEPGDP GEDGKPGDTG
PQGFPGTPGD VGPKGDKGDP GVGERGPPGP QGPPGPPGPS FRHDKLTFID MEGSGFGGDL
EALRGPRGFP GPPGPPGVPG LPGEPGRFGV NSSDVPGPAG LPGVPGREGP PGFPGLPGPP
GPPGREGPPG RTGQKGSLGE AGAPGHKGSK GAPGPAGARG ESGLAGAPGP AGPPGPPGPP
GPPGPGLPAG FDDMEGSGGP FWSTARSADG PQGPPGLPGL KGDPGVPGLP GAKGEVGADG
VPGFPGLPGR EGIAGPQGPK GDRGSRGEKG DPGKDGVGQP GLPGPPGPPG PVVYVSEQDG
SVLSVPGPEG RPGFAGFPGP AGPKGNLGSK GERGSPGPKG EKGEPGSIFS PDGGALGPAQ
KGAKGEPGFR GPPGPYGRPG YKGEIGFPGR PGRPGMNGLK GEKGEPGDAS LGFGMRGMPG
PPGPPGPPGP PGTPVYDSNV FAESSRPGPP GLPGNQGPPG PKGAKGEVGP PGPPGQFPFD
FLQLEAEMKG EKGDRGDAGQ KGERGEPGGG GFFGSSLPGP PGPPGPRGYP GIPGPKGESI
RGQPGPPGPQ GPPGIGYEGR QGPPGPPGPP GPPSFPGPHR QTISVPGPPG PPGPPGPPGT
MGASSGVRLW ATRQAMLGQV HEVPEGWLIF VAEQEELYVR VQNGFRKVQL EARTPLPRGT
DNEVAALQPP VVQLHDSNPY PRREHPHPTA RPWRADDILA SPPRLPEPQP YPGAPHHSSY
VHLRPARPTS PPAHSHRDFQ PVLHLVALNS PLSGGMRGIR GADFQCFQQA RAVGLAGTFR
AFLSSRLQDL YSIVRRADRA AVPIVNLKDE LLFPSWEALF SGSEGPLKPG ARIFSFDGKD
VLRHPTWPQK SVWHGSDPNG RRLTESYCET WRTEAPSATG QASSLLGGRL LGQSAASCHH
AYIVLCIENS FMTASK
//