GenomeNet

Database: UniProt
Entry: H2YUS3_CIOSA
LinkDB: H2YUS3_CIOSA
Original site: H2YUS3_CIOSA 
ID   H2YUS3_CIOSA            Unreviewed;      1150 AA.
AC   H2YUS3;
DT   18-APR-2012, integrated into UniProtKB/TrEMBL.
DT   18-APR-2012, sequence version 1.
DT   27-MAR-2024, entry version 59.
DE   RecName: Full=Collagen IV NC1 domain-containing protein {ECO:0000259|PROSITE:PS51403};
OS   Ciona savignyi (Pacific transparent sea squirt).
OC   Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Phlebobranchia;
OC   Cionidae; Ciona.
OX   NCBI_TaxID=51511 {ECO:0000313|Ensembl:ENSCSAVP00000009083.1, ECO:0000313|Proteomes:UP000007875};
RN   [1] {ECO:0000313|Proteomes:UP000007875}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Birren B., Nusbaum C., Abebe A., Abouelleil A., Adekoya E., Ait-zahra M.,
RA   Allen N., Allen T., An P., Anderson M., Anderson S., Arachchi H.,
RA   Armbruster J., Bachantsang P., Baldwin J., Barry A., Bayul T.,
RA   Blitshsteyn B., Bloom T., Blye J., Boguslavskiy L., Borowsky M.,
RA   Boukhgalter B., Brunache A., Butler J., Calixte N., Calvo S., Camarata J.,
RA   Campo K., Chang J., Cheshatsang Y., Citroen M., Collymore A., Considine T.,
RA   Cook A., Cooke P., Corum B., Cuomo C., David R., Dawoe T., Degray S.,
RA   Dodge S., Dooley K., Dorje P., Dorjee K., Dorris L., Duffey N., Dupes A.,
RA   Elkins T., Engels R., Erickson J., Farina A., Faro S., Ferreira P.,
RA   Fischer H., Fitzgerald M., Foley K., Gage D., Galagan J., Gearin G.,
RA   Gnerre S., Gnirke A., Goyette A., Graham J., Grandbois E., Gyaltsen K.,
RA   Hafez N., Hagopian D., Hagos B., Hall J., Hatcher B., Heller A.,
RA   Higgins H., Honan T., Horn A., Houde N., Hughes L., Hulme W., Husby E.,
RA   Iliev I., Jaffe D., Jones C., Kamal M., Kamat A., Kamvysselis M.,
RA   Karlsson E., Kells C., Kieu A., Kisner P., Kodira C., Kulbokas E.,
RA   Labutti K., Lama D., Landers T., Leger J., Levine S., Lewis D., Lewis T.,
RA   Lindblad-toh K., Liu X., Lokyitsang T., Lokyitsang Y., Lucien O., Lui A.,
RA   Ma L.J., Mabbitt R., Macdonald J., Maclean C., Major J., Manning J.,
RA   Marabella R., Maru K., Matthews C., Mauceli E., Mccarthy M., Mcdonough S.,
RA   Mcghee T., Meldrim J., Meneus L., Mesirov J., Mihalev A., Mihova T.,
RA   Mikkelsen T., Mlenga V., Moru K., Mozes J., Mulrain L., Munson G.,
RA   Naylor J., Newes C., Nguyen C., Nguyen N., Nguyen T., Nicol R., Nielsen C.,
RA   Nizzari M., Norbu C., Norbu N., O'donnell P., Okoawo O., O'leary S.,
RA   Omotosho B., O'neill K., Osman S., Parker S., Perrin D., Phunkhang P.,
RA   Piqani B., Purcell S., Rachupka T., Ramasamy U., Rameau R., Ray V.,
RA   Raymond C., Retta R., Richardson S., Rise C., Rodriguez J., Rogers J.,
RA   Rogov P., Rutman M., Schupbach R., Seaman C., Settipalli S., Sharpe T.,
RA   Sheridan J., Sherpa N., Shi J., Smirnov S., Smith C., Sougnez C.,
RA   Spencer B., Stalker J., Stange-thomann N., Stavropoulos S., Stetson K.,
RA   Stone C., Stone S., Stubbs M., Talamas J., Tchuinga P., Tenzing P.,
RA   Tesfaye S., Theodore J., Thoulutsang Y., Topham K., Towey S., Tsamla T.,
RA   Tsomo N., Vallee D., Vassiliev H., Venkataraman V., Vinson J., Vo A.,
RA   Wade C., Wang S., Wangchuk T., Wangdi T., Whittaker C., Wilkinson J.,
RA   Wu Y., Wyman D., Yadav S., Yang S., Yang X., Yeager S., Yee E., Young G.,
RA   Zainoun J., Zembeck L., Zimmer A., Zody M., Lander E.;
RL   Submitted (AUG-2003) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSCSAVP00000009083.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- FUNCTION: Type IV collagen is the major structural component of
CC       glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC       together with laminins, proteoglycans and entactin/nidogen.
CC       {ECO:0000256|ARBA:ARBA00003696}.
CC   -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC       Secreted, extracellular space, extracellular matrix, basement membrane
CC       {ECO:0000256|ARBA:ARBA00004302}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; H2YUS3; -.
DR   Ensembl; ENSCSAVT00000009197.1; ENSCSAVP00000009083.1; ENSCSAVG00000005368.1.
DR   GeneTree; ENSGT00940000169543; -.
DR   Proteomes; UP000007875; Unassembled WGS sequence.
DR   GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR001442; Collagen_IV_NC.
DR   InterPro; IPR036954; Collagen_IV_NC_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1104; COLLAGEN ALPHA-1(IV) CHAIN; 1.
DR   Pfam; PF01413; C4; 2.
DR   Pfam; PF01391; Collagen; 9.
DR   SMART; SM00111; C4; 2.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   PROSITE; PS51403; NC1_IV; 1.
PE   4: Predicted;
KW   Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007875};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..29
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           30..1150
FT                   /note="Collagen IV NC1 domain-containing protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5003578870"
FT   DOMAIN          923..1149
FT                   /note="Collagen IV NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51403"
FT   REGION          56..146
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          159..247
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          260..524
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          544..733
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          752..781
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          796..911
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        61..80
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        264..279
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1150 AA;  113261 MW;  643218D984C4ABBF CRC64;
     MAKTVVGWSP MAGFIFVGLL AHFASFSHGK AVYASVGCGR GAVTYCNATK GTRGMPGLEG
     MPGPPGLPGY PGPEGPAGPR GLSGPPGQDG LQGRKGEPGF QGPPGRSGIK GYKGEDGESG
     LPGPPGIAVS VGRPGPTGAP GAKGAMGFPG MSGLNGLKGA AGLPGIPGEQ GPRGLPGFGL
     QGPPGEPGPS GQPGLPGRPG SKGSIGEPGA PGERGLTVEG PPGQDGRPGR NGRPGLPGAV
     GLKGQPGSDA VITHEIMEEM VTPGPPGQPG PPGPPGRVGA PGYPDRGNIG GLGLPGENGL
     QGFPGFAGKK GAQGSPGAPG RSVQGEPGPS GYPGESGAPG KKGDQGPAGF DGMPGLDGMP
     GKKVCSEGIL DAVGFPGSRG MKGSPGSPGS PGYPGEQGPP GPSGRRGPAG PDGEDGKDGK
     PSNLDNQIYV GRPGERGLPG TDIVGMKGER GNPGLEGLPG FVGEVGSPGR AGLPGLDGPA
     GPPGKDGLPG TKGLPGRSGP MGFPGAKGEP GKPGTSGADG LPGSVGLPGE DGPPGFRIYG
     YPGKKGSTGF PGAKGDQGPR AKGDLGLPGP AGEQGLQGPE GRPGVAGETG LPGYPGVKGA
     KSTLPGLPGA DGLPGQDGLD GTAGLPGKDG EVGFPGRDGT PGPKGTQGRP GLMGFPGENG
     EKGNIGPVGE RGLPGQQGIP GDKGEQGNPG LPGRNGLDGL PGEDGLPGDS VGGPAGLPGR
     QGLPGEKGMV GLPGLTVTRG LPGDKGEMGF LGPSGLPGRQ GQPGPRGASV KGETGLPGLP
     GRPIGPAGIP GLPGMIGEKG EAGPVGEAGS TGFDGRPGQK GQIGLPSGQP GVAGLQGRPG
     QPGSPGLPGL EGPTGGEKGL PGSPGPVGPR GFPGSAGRTG TPGGRGHKGE RGQEGAPGLP
     GSNGLPGVDG LPGAAGPSAL FHGYFVTRHS QTQYVPECPL NMRKLWEGYS LLYIQGNERS
     HGQDLGTAGS CLRRFNTMPF MFCAVDNSCR VASRNDYSFW LSTPEPFPMS MEAVTGKTTI
     EPYISRCAVC ETPSLTIAVH SQSDLIPDCP ENWVSLWIGY SFVMVSSRSG AEGSGQSLQS
     PGSCLEDFRA GPFIECHGRG TCNQYANGYS FWLATIMQQN QFSQPISETL KAGTLRQRVS
     RCQVCMRGDF
//
DBGET integrated database retrieval system