GenomeNet

Database: UniProt
Entry: H2YUS6_CIOSA
LinkDB: H2YUS6_CIOSA
Original site: H2YUS6_CIOSA 
ID   H2YUS6_CIOSA            Unreviewed;      1673 AA.
AC   H2YUS6;
DT   18-APR-2012, integrated into UniProtKB/TrEMBL.
DT   18-APR-2012, sequence version 1.
DT   27-MAR-2024, entry version 57.
DE   RecName: Full=Collagen IV NC1 domain-containing protein {ECO:0000259|PROSITE:PS51403};
OS   Ciona savignyi (Pacific transparent sea squirt).
OC   Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Phlebobranchia;
OC   Cionidae; Ciona.
OX   NCBI_TaxID=51511 {ECO:0000313|Ensembl:ENSCSAVP00000009086.1, ECO:0000313|Proteomes:UP000007875};
RN   [1] {ECO:0000313|Proteomes:UP000007875}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Birren B., Nusbaum C., Abebe A., Abouelleil A., Adekoya E., Ait-zahra M.,
RA   Allen N., Allen T., An P., Anderson M., Anderson S., Arachchi H.,
RA   Armbruster J., Bachantsang P., Baldwin J., Barry A., Bayul T.,
RA   Blitshsteyn B., Bloom T., Blye J., Boguslavskiy L., Borowsky M.,
RA   Boukhgalter B., Brunache A., Butler J., Calixte N., Calvo S., Camarata J.,
RA   Campo K., Chang J., Cheshatsang Y., Citroen M., Collymore A., Considine T.,
RA   Cook A., Cooke P., Corum B., Cuomo C., David R., Dawoe T., Degray S.,
RA   Dodge S., Dooley K., Dorje P., Dorjee K., Dorris L., Duffey N., Dupes A.,
RA   Elkins T., Engels R., Erickson J., Farina A., Faro S., Ferreira P.,
RA   Fischer H., Fitzgerald M., Foley K., Gage D., Galagan J., Gearin G.,
RA   Gnerre S., Gnirke A., Goyette A., Graham J., Grandbois E., Gyaltsen K.,
RA   Hafez N., Hagopian D., Hagos B., Hall J., Hatcher B., Heller A.,
RA   Higgins H., Honan T., Horn A., Houde N., Hughes L., Hulme W., Husby E.,
RA   Iliev I., Jaffe D., Jones C., Kamal M., Kamat A., Kamvysselis M.,
RA   Karlsson E., Kells C., Kieu A., Kisner P., Kodira C., Kulbokas E.,
RA   Labutti K., Lama D., Landers T., Leger J., Levine S., Lewis D., Lewis T.,
RA   Lindblad-toh K., Liu X., Lokyitsang T., Lokyitsang Y., Lucien O., Lui A.,
RA   Ma L.J., Mabbitt R., Macdonald J., Maclean C., Major J., Manning J.,
RA   Marabella R., Maru K., Matthews C., Mauceli E., Mccarthy M., Mcdonough S.,
RA   Mcghee T., Meldrim J., Meneus L., Mesirov J., Mihalev A., Mihova T.,
RA   Mikkelsen T., Mlenga V., Moru K., Mozes J., Mulrain L., Munson G.,
RA   Naylor J., Newes C., Nguyen C., Nguyen N., Nguyen T., Nicol R., Nielsen C.,
RA   Nizzari M., Norbu C., Norbu N., O'donnell P., Okoawo O., O'leary S.,
RA   Omotosho B., O'neill K., Osman S., Parker S., Perrin D., Phunkhang P.,
RA   Piqani B., Purcell S., Rachupka T., Ramasamy U., Rameau R., Ray V.,
RA   Raymond C., Retta R., Richardson S., Rise C., Rodriguez J., Rogers J.,
RA   Rogov P., Rutman M., Schupbach R., Seaman C., Settipalli S., Sharpe T.,
RA   Sheridan J., Sherpa N., Shi J., Smirnov S., Smith C., Sougnez C.,
RA   Spencer B., Stalker J., Stange-thomann N., Stavropoulos S., Stetson K.,
RA   Stone C., Stone S., Stubbs M., Talamas J., Tchuinga P., Tenzing P.,
RA   Tesfaye S., Theodore J., Thoulutsang Y., Topham K., Towey S., Tsamla T.,
RA   Tsomo N., Vallee D., Vassiliev H., Venkataraman V., Vinson J., Vo A.,
RA   Wade C., Wang S., Wangchuk T., Wangdi T., Whittaker C., Wilkinson J.,
RA   Wu Y., Wyman D., Yadav S., Yang S., Yang X., Yeager S., Yee E., Young G.,
RA   Zainoun J., Zembeck L., Zimmer A., Zody M., Lander E.;
RL   Submitted (AUG-2003) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSCSAVP00000009086.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- FUNCTION: Type IV collagen is the major structural component of
CC       glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC       together with laminins, proteoglycans and entactin/nidogen.
CC       {ECO:0000256|ARBA:ARBA00003696}.
CC   -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC       Secreted, extracellular space, extracellular matrix, basement membrane
CC       {ECO:0000256|ARBA:ARBA00004302}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   Ensembl; ENSCSAVT00000009200.1; ENSCSAVP00000009086.1; ENSCSAVG00000005368.1.
DR   GeneTree; ENSGT00940000169543; -.
DR   Proteomes; UP000007875; Unassembled WGS sequence.
DR   GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR001442; Collagen_IV_NC.
DR   InterPro; IPR036954; Collagen_IV_NC_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1104; COLLAGEN ALPHA-1(IV) CHAIN; 1.
DR   Pfam; PF01413; C4; 2.
DR   Pfam; PF01391; Collagen; 20.
DR   SMART; SM00111; C4; 2.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   PROSITE; PS51403; NC1_IV; 1.
PE   4: Predicted;
KW   Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007875};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT   DOMAIN          1447..1673
FT                   /note="Collagen IV NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51403"
FT   REGION          14..318
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          331..419
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          432..465
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          546..948
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          961..1334
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1357..1426
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        25..39
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1673 AA;  162650 MW;  F7E698588C02374E CRC64;
     GCGRGAVTYC NATKGTRGMP GLEGMPGPPG LPGYPGPEGP AGPRGLSGKA GRDGMQGPKG
     ARGSQGVHGY PGNPGIPGIP GQSGAPGPEG IPGCNGTKGA LGPVGPPGLD GFVGRPGRPG
     QVGVKGEPGE VINMVAGTKG EPGFDGAPGP RGDNGEDGPI GPRGYIGNPG PPGPEGTPGT
     PGERGEMGLG FQGEKGEKGE PGLPGTSGSR GDLDAQTGGA DFFIGITGMK GDKGGKGEMG
     PHGPQVGKPG TPGVNGFDGR KGEPGFQGPP GRSGIKGYKG EDGESGLPGP PGIAGESGAA
     GRVGRPGPTG APGAKGAMGF PGMSGLNGLK GAAGLPGIPG EQGPRGLPGF GLQGPPGEPG
     PSGQPGLPGR PGSKGSIGEP GAPGERGLTV EGPPGQDGRP GRNGRPGLPG AVGLKGQPGS
     DAVITHEIME EMVTPGPPGQ PGPPGPPGRF ICNTGRSGTD GPKGEECGVC PAGPKGATGE
     SGTTGIPGIP GDMGSHGAKG AKGQKGFAGF SGLTGQPGRD GAEGEIGLPG SKGEVGELTV
     IRIKGEKGSP GYVGPAGLPG RSGAPGRDGF NGERGAKGGK GEVGPPGPLG RSGETGNPGI
     PGSPGRDGLD LPGETGETGE KGDRGNIGGL GLPGTPGQKG EPARARVVAG EKGDAGNIGL
     PGTPGRAGAP GEKGENGLQG FPGFAGKKGA QGSPGAPGRS VQGEPGPSGY PGESGAPGKK
     GDQGPAGFDG MPGLDGMPGK KGEDAVGFPG SRGMKGSPGS PGSPGYPGEQ GPPGPSGRRG
     PAGPDGEDGK DGLPGLDGNP GRTGPKGEPG SARLGPIGPP GPEGKSGEPG LPGFQGRPGE
     RGLPGTDIVG MKGERGNPGL EGLPGFVGEV GSPGRAGLPG LDGPAGPPGK DGLPGTKGLP
     GRSGPMGFPG AKGEPGKPGT SGADGLPGSV GLPGEDGPPG FRGLNGQKGE PAEIAAGLLE
     PGEKGEQAGL PGIYGYPGKK GSTGFPGAKG DQGPRGLTGL DGLQGPQGEQ GAKGDLGLPG
     PAGEQGLQGP EGRPGVAGET GLPGYPGVKG AKSTLPGLPG ADGLPGQDGL DGTAGLPGKD
     GEVGFPGRDG TPGPKGTQGR PGLMGFPGEN GEKGNIGPVG ERGLPGQQGF PGIGGLEGIP
     GDKGEQGNPG LPGRNGLDGL PGEDGLPGDS VGGPAGLPGR QGLPGEKGMV GLPGLTGARG
     LPGDKGEMGF LGPSGLPGRQ GQPGPRGASV KGETGLPGLP GRPIGPAGIP GLPGMIGEKG
     EAGPVGEAGS TGFDGRPGQK GQIGLPGTPG LPGRSGFPGE SGQPGVAGLQ GRPGQPGSPG
     LPGLEGPTGV KGERANDLFS FKGVTGLRGE DAEPIFREGE KGLPGSPGPV GPRGFPGSAG
     RTGTPGGRGH KGERGQEGIQ GLPGLTGDRG EPGAQGLTGA PGLPGSNGLP GVDGLPGAAG
     PSALFHGYFV TRHSQTQYVP ECPLNMRKLW EGYSLLYIQG NERSHGQDLG TAGSCLRRFN
     TMPFMFCAVD NSCRVASRND YSFWLSTPEP FPMSMEAVTG KTTIEPYISR CAVCETPSLT
     IAVHSQSDLI PDCPENWVSL WIGYSFVMVS SRSGAEGSGQ SLQSPGSCLE DFRAGPFIEC
     HGRGTCNQYA NGYSFWLATI MQQNQFSQPI SETLKAGTLR QRVSRCQVCM RGD
//
DBGET integrated database retrieval system