GenomeNet

Database: UniProt
Entry: A0AAG2UW65_HUMAN
LinkDB: A0AAG2UW65_HUMAN
Original site: A0AAG2UW65_HUMAN 
ID   A0AAG2UW65_HUMAN        Unreviewed;      1336 AA.
AC   A0AAG2UW65;
DT   24-JUL-2024, integrated into UniProtKB/TrEMBL.
DT   24-JUL-2024, sequence version 1.
DT   28-JAN-2026, entry version 8.
DE   SubName: Full=Collagen type XVIII alpha 1 chain {ECO:0000313|Ensembl:ENSP00000518184.1};
DE   SubName: Full=Collagen, type XVIII, alpha 1, isoform CRA_d {ECO:0000313|EMBL:EAX09341.1};
GN   Name=COL18A1 {ECO:0000313|EMBL:EAX09341.1,
GN   ECO:0000313|Ensembl:ENSP00000518184.1};
GN   ORFNames=hCG_1647196 {ECO:0000313|EMBL:EAX09341.1};
OS   Homo sapiens (Human).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Homo.
OX   NCBI_TaxID=9606 {ECO:0000313|Ensembl:ENSP00000518184.1, ECO:0000313|Proteomes:UP000005640};
RN   [1] {ECO:0000313|EMBL:EAX09341.1}
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=11181995; DOI=10.1126/science.1058040;
RA   Venter J.C., Adams M.D., Myers E.W., Li P.W., Mural R.J., Sutton G.G.,
RA   Smith H.O., Yandell M., Evans C.A., Holt R.A., Gocayne J.D., Amanatides P.,
RA   Ballew R.M., Huson D.H., Wortman J.R., Zhang Q., Kodira C.D., Zheng X.H.,
RA   Chen L., Skupski M., Subramanian G., Thomas P.D., Zhang J.,
RA   Gabor Miklos G.L., Nelson C., Broder S., Clark A.G., Nadeau J.,
RA   McKusick V.A., Zinder N., Levine A.J., Roberts R.J., Simon M., Slayman C.,
RA   Hunkapiller M., Bolanos R., Delcher A., Dew I., Fasulo D., Flanigan M.,
RA   Florea L., Halpern A., Hannenhalli S., Kravitz S., Levy S., Mobarry C.,
RA   Reinert K., Remington K., Abu-Threideh J., Beasley E., Biddick K.,
RA   Bonazzi V., Brandon R., Cargill M., Chandramouliswaran I., Charlab R.,
RA   Chaturvedi K., Deng Z., Di Francesco V., Dunn P., Eilbeck K.,
RA   Evangelista C., Gabrielian A.E., Gan W., Ge W., Gong F., Gu Z., Guan P.,
RA   Heiman T.J., Higgins M.E., Ji R.R., Ke Z., Ketchum K.A., Lai Z., Lei Y.,
RA   Li Z., Li J., Liang Y., Lin X., Lu F., Merkulov G.V., Milshina N.,
RA   Moore H.M., Naik A.K., Narayan V.A., Neelam B., Nusskern D., Rusch D.B.,
RA   Salzberg S., Shao W., Shue B., Sun J., Wang Z., Wang A., Wang X., Wang J.,
RA   Wei M., Wides R., Xiao C., Yan C., Yao A., Ye J., Zhan M., Zhang W.,
RA   Zhang H., Zhao Q., Zheng L., Zhong F., Zhong W., Zhu S., Zhao S.,
RA   Gilbert D., Baumhueter S., Spier G., Carter C., Cravchik A., Woodage T.,
RA   Ali F., An H., Awe A., Baldwin D., Baden H., Barnstead M., Barrow I.,
RA   Beeson K., Busam D., Carver A., Center A., Cheng M.L., Curry L.,
RA   Danaher S., Davenport L., Desilets R., Dietz S., Dodson K., Doup L.,
RA   Ferriera S., Garg N., Gluecksmann A., Hart B., Haynes J., Haynes C.,
RA   Heiner C., Hladun S., Hostin D., Houck J., Howland T., Ibegwam C.,
RA   Johnson J., Kalush F., Kline L., Koduru S., Love A., Mann F., May D.,
RA   McCawley S., McIntosh T., McMullen I., Moy M., Moy L., Murphy B.,
RA   Nelson K., Pfannkoch C., Pratts E., Puri V., Qureshi H., Reardon M.,
RA   Rodriguez R., Rogers Y.H., Romblad D., Ruhfel B., Scott R., Sitter C.,
RA   Smallwood M., Stewart E., Strong R., Suh E., Thomas R., Tint N.N., Tse S.,
RA   Vech C., Wang G., Wetter J., Williams S., Williams M., Windsor S.,
RA   Winn-Deen E., Wolfe K., Zaveri J., Zaveri K., Abril J.F., Guigo R.,
RA   Campbell M.J., Sjolander K.V., Karlak B., Kejariwal A., Mi H., Lazareva B.,
RA   Hatton T., Narechania A., Diemer K., Muruganujan A., Guo N., Sato S.,
RA   Bafna V., Istrail S., Lippert R., Schwartz R., Walenz B., Yooseph S.,
RA   Allen D., Basu A., Baxendale J., Blick L., Caminha M., Carnes-Stine J.,
RA   Caulk P., Chiang Y.H., Coyne M., Dahlke C., Mays A., Dombroski M.,
RA   Donnelly M., Ely D., Esparham S., Fosler C., Gire H., Glanowski S.,
RA   Glasser K., Glodek A., Gorokhov M., Graham K., Gropman B., Harris M.,
RA   Heil J., Henderson S., Hoover J., Jennings D., Jordan C., Jordan J.,
RA   Kasha J., Kagan L., Kraft C., Levitsky A., Lewis M., Liu X., Lopez J.,
RA   Ma D., Majoros W., McDaniel J., Murphy S., Newman M., Nguyen T., Nguyen N.,
RA   Nodell M., Pan S., Peck J., Peterson M., Rowe W., Sanders R., Scott J.,
RA   Simpson M., Smith T., Sprague A., Stockwell T., Turner R., Venter E.,
RA   Wang M., Wen M., Wu D., Wu M., Xia A., Zandieh A., Zhu X.;
RT   "The sequence of the human genome.";
RL   Science 291:1304-1351(2001).
RN   [2] {ECO:0000313|EMBL:EAX09341.1}
RP   NUCLEOTIDE SEQUENCE.
RA   Mural R.J., Istrail S., Sutton G., Florea L., Halpern A.L., Mobarry C.M.,
RA   Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R., Flanigan M.J.,
RA   Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V., Hannenhalli S.,
RA   Turner R., Yooseph S., Lu F., Nusskern D.R., Shue B.C., Zheng X.H.,
RA   Zhong F., Delcher A.L., Huson D.H., Kravitz S.A., Mouchard L., Reinert K.,
RA   Remington K.A., Clark A.G., Waterman M.S., Eichler E.E., Adams M.D.,
RA   Hunkapiller M.W., Myers E.W., Venter J.C.;
RL   Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|Ensembl:ENSP00000518184.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (MAY-2025) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CH471079; EAX09341.1; -; Genomic_DNA.
DR   SMR; A0AAG2UW65; -.
DR   Ensembl; ENST00000710294.1; ENSP00000518184.1; ENSG00000292234.1.
DR   GeneID; 80781; -.
DR   CTD; 80781; -.
DR   Proteomes; UP000005640; Unplaced.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProtKB-ARBA.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   1: Evidence at protein level;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:EAX09341.1};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Proteomics identification {ECO:0007829|PeptideAtlas:A0AAG2UW65};
KW   Reference proteome {ECO:0000313|Proteomes:UP000005640};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..33
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           34..1336
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5044707639"
FT   DOMAIN          41..229
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          230..1025
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1093..1138
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        257..266
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        302..323
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        347..374
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        400..416
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        418..431
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        447..459
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        489..499
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        515..527
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        531..546
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        561..588
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        590..606
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        638..650
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        680..694
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        702..711
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        726..738
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        839..853
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        867..881
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        906..926
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        938..947
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        983..996
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1006..1018
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1101..1117
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1123..1133
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1336 AA;  135510 MW;  16B7FB340F2B3CB6 CRC64;
     MAPRCPWPWP RRRRLLDVLA PLVLLLGVRA ASAEPERISE EVGLLQLLGD PPPQQVTQTD
     DPDVGLAYVF GPDANSGQVA RYHFPSLFFR DFSLLFHIRP ATEGPGVLFA ITDSAQAMVL
     LGVKLSGVQD GHQDISLLYT EPGAGQTHTA ASFRLPAFVG QWTHLALSVA GGFVALYVDC
     EEFQRMPLAR SSRGLELEPG AGLFVAQAGG ADPDKFQGVI AELKVRRDPQ VSPMHCLDEE
     GDDSDGASGD SGSGLGDARE LLREETGAAL KPRLPAPPPV TTPPLAGGSS TEDSRSEEVE
     EQTTVASLGA QTLPGSDSVS TWDGSVRTPG GRVKEGGLKG QKGEPGVPGP PGRAGPPGSP
     CLPGPPGLPC PVSPLGPAGP ALQTVPGPQG PPGPPGRDGT PGRDGEPGDP GEDGKPGDTG
     PQGFPGTPGD VGPKGDKGDP GVGERGPPGP QGPPGPPGPS FRHDKLTFID MEGSGFGGDL
     EALRGPRGFP GPPGPPGVPG LPGEPGRFGV NSSDVPGPAG LPGVPGREGP PGFPGLPGPP
     GPPGREGPPG RTGQKGSLGE AGAPGHKGSK GAPGPAGARG ESGLAGAPGP AGPPGPPGPP
     GPPGPGLPAG FDDMEGSGGP FWSTARSADG PQGPPGLPGL KGDPGVPGLP GAKGEVGADG
     VPGFPGLPGR EGIAGPQGPK GDRGSRGEKG DPGKDGVGQP GLPGPPGPPG PVVYVSEQDG
     SVLSVPGPEG RPGFAGFPGP AGPKGNLGSK GERGSPGPKG EKGEPGSIFS PDGGALGPAQ
     KGAKGEPGFR GPPGPYGRPG YKGEIGFPGR PGRPGMNGLK GEKGEPGDAS LGFGMRGMPG
     PPGPPGPPGP PGTPVYDSNV FAESSRPGPP GLPGNQGPPG PKGAKGEVGP PGPPGQFPFD
     FLQLEAEMKG EKGDRGDAGQ KGERGEPGGG GFFGSSLPGP PGPPGPRGYP GIPGPKGESI
     RGQPGPPGPQ GPPGIGYEGR QGPPGPPGPP GPPSFPGPHR QTISVPGPPG PPGPPGPPGT
     MGASSGVRLW ATRQAMLGQV HEVPEGWLIF VAEQEELYVR VQNGFRKVQL EARTPLPRGT
     DNEVAALQPP VVQLHDSNPY PRREHPHPTA RPWRADDILA SPPRLPEPQP YPGAPHHSSY
     VHLRPARPTS PPAHSHRDFQ PVLHLVALNS PLSGGMRGIR GADFQCFQQA RAVGLAGTFR
     AFLSSRLQDL YSIVRRADRA AVPIVNLKDE LLFPSWEALF SGSEGPLKPG ARIFSFDGKD
     VLRHPTWPQK SVWHGSDPNG RRLTESYCET WRTEAPSATG QASSLLGGRL LGQSAASCHH
     AYIVLCIENS FMTASK
//
DBGET integrated database retrieval system