ID H2YUS6_CIOSA Unreviewed; 1673 AA.
AC H2YUS6;
DT 18-APR-2012, integrated into UniProtKB/TrEMBL.
DT 18-APR-2012, sequence version 1.
DT 27-MAR-2024, entry version 57.
DE RecName: Full=Collagen IV NC1 domain-containing protein {ECO:0000259|PROSITE:PS51403};
OS Ciona savignyi (Pacific transparent sea squirt).
OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Phlebobranchia;
OC Cionidae; Ciona.
OX NCBI_TaxID=51511 {ECO:0000313|Ensembl:ENSCSAVP00000009086.1, ECO:0000313|Proteomes:UP000007875};
RN [1] {ECO:0000313|Proteomes:UP000007875}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Birren B., Nusbaum C., Abebe A., Abouelleil A., Adekoya E., Ait-zahra M.,
RA Allen N., Allen T., An P., Anderson M., Anderson S., Arachchi H.,
RA Armbruster J., Bachantsang P., Baldwin J., Barry A., Bayul T.,
RA Blitshsteyn B., Bloom T., Blye J., Boguslavskiy L., Borowsky M.,
RA Boukhgalter B., Brunache A., Butler J., Calixte N., Calvo S., Camarata J.,
RA Campo K., Chang J., Cheshatsang Y., Citroen M., Collymore A., Considine T.,
RA Cook A., Cooke P., Corum B., Cuomo C., David R., Dawoe T., Degray S.,
RA Dodge S., Dooley K., Dorje P., Dorjee K., Dorris L., Duffey N., Dupes A.,
RA Elkins T., Engels R., Erickson J., Farina A., Faro S., Ferreira P.,
RA Fischer H., Fitzgerald M., Foley K., Gage D., Galagan J., Gearin G.,
RA Gnerre S., Gnirke A., Goyette A., Graham J., Grandbois E., Gyaltsen K.,
RA Hafez N., Hagopian D., Hagos B., Hall J., Hatcher B., Heller A.,
RA Higgins H., Honan T., Horn A., Houde N., Hughes L., Hulme W., Husby E.,
RA Iliev I., Jaffe D., Jones C., Kamal M., Kamat A., Kamvysselis M.,
RA Karlsson E., Kells C., Kieu A., Kisner P., Kodira C., Kulbokas E.,
RA Labutti K., Lama D., Landers T., Leger J., Levine S., Lewis D., Lewis T.,
RA Lindblad-toh K., Liu X., Lokyitsang T., Lokyitsang Y., Lucien O., Lui A.,
RA Ma L.J., Mabbitt R., Macdonald J., Maclean C., Major J., Manning J.,
RA Marabella R., Maru K., Matthews C., Mauceli E., Mccarthy M., Mcdonough S.,
RA Mcghee T., Meldrim J., Meneus L., Mesirov J., Mihalev A., Mihova T.,
RA Mikkelsen T., Mlenga V., Moru K., Mozes J., Mulrain L., Munson G.,
RA Naylor J., Newes C., Nguyen C., Nguyen N., Nguyen T., Nicol R., Nielsen C.,
RA Nizzari M., Norbu C., Norbu N., O'donnell P., Okoawo O., O'leary S.,
RA Omotosho B., O'neill K., Osman S., Parker S., Perrin D., Phunkhang P.,
RA Piqani B., Purcell S., Rachupka T., Ramasamy U., Rameau R., Ray V.,
RA Raymond C., Retta R., Richardson S., Rise C., Rodriguez J., Rogers J.,
RA Rogov P., Rutman M., Schupbach R., Seaman C., Settipalli S., Sharpe T.,
RA Sheridan J., Sherpa N., Shi J., Smirnov S., Smith C., Sougnez C.,
RA Spencer B., Stalker J., Stange-thomann N., Stavropoulos S., Stetson K.,
RA Stone C., Stone S., Stubbs M., Talamas J., Tchuinga P., Tenzing P.,
RA Tesfaye S., Theodore J., Thoulutsang Y., Topham K., Towey S., Tsamla T.,
RA Tsomo N., Vallee D., Vassiliev H., Venkataraman V., Vinson J., Vo A.,
RA Wade C., Wang S., Wangchuk T., Wangdi T., Whittaker C., Wilkinson J.,
RA Wu Y., Wyman D., Yadav S., Yang S., Yang X., Yeager S., Yee E., Young G.,
RA Zainoun J., Zembeck L., Zimmer A., Zody M., Lander E.;
RL Submitted (AUG-2003) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSCSAVP00000009086.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- FUNCTION: Type IV collagen is the major structural component of
CC glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC together with laminins, proteoglycans and entactin/nidogen.
CC {ECO:0000256|ARBA:ARBA00003696}.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSCSAVT00000009200.1; ENSCSAVP00000009086.1; ENSCSAVG00000005368.1.
DR GeneTree; ENSGT00940000169543; -.
DR Proteomes; UP000007875; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1104; COLLAGEN ALPHA-1(IV) CHAIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 20.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000007875};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 1447..1673
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 14..318
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 331..419
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 432..465
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 546..948
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 961..1334
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1357..1426
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 25..39
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1673 AA; 162650 MW; F7E698588C02374E CRC64;
GCGRGAVTYC NATKGTRGMP GLEGMPGPPG LPGYPGPEGP AGPRGLSGKA GRDGMQGPKG
ARGSQGVHGY PGNPGIPGIP GQSGAPGPEG IPGCNGTKGA LGPVGPPGLD GFVGRPGRPG
QVGVKGEPGE VINMVAGTKG EPGFDGAPGP RGDNGEDGPI GPRGYIGNPG PPGPEGTPGT
PGERGEMGLG FQGEKGEKGE PGLPGTSGSR GDLDAQTGGA DFFIGITGMK GDKGGKGEMG
PHGPQVGKPG TPGVNGFDGR KGEPGFQGPP GRSGIKGYKG EDGESGLPGP PGIAGESGAA
GRVGRPGPTG APGAKGAMGF PGMSGLNGLK GAAGLPGIPG EQGPRGLPGF GLQGPPGEPG
PSGQPGLPGR PGSKGSIGEP GAPGERGLTV EGPPGQDGRP GRNGRPGLPG AVGLKGQPGS
DAVITHEIME EMVTPGPPGQ PGPPGPPGRF ICNTGRSGTD GPKGEECGVC PAGPKGATGE
SGTTGIPGIP GDMGSHGAKG AKGQKGFAGF SGLTGQPGRD GAEGEIGLPG SKGEVGELTV
IRIKGEKGSP GYVGPAGLPG RSGAPGRDGF NGERGAKGGK GEVGPPGPLG RSGETGNPGI
PGSPGRDGLD LPGETGETGE KGDRGNIGGL GLPGTPGQKG EPARARVVAG EKGDAGNIGL
PGTPGRAGAP GEKGENGLQG FPGFAGKKGA QGSPGAPGRS VQGEPGPSGY PGESGAPGKK
GDQGPAGFDG MPGLDGMPGK KGEDAVGFPG SRGMKGSPGS PGSPGYPGEQ GPPGPSGRRG
PAGPDGEDGK DGLPGLDGNP GRTGPKGEPG SARLGPIGPP GPEGKSGEPG LPGFQGRPGE
RGLPGTDIVG MKGERGNPGL EGLPGFVGEV GSPGRAGLPG LDGPAGPPGK DGLPGTKGLP
GRSGPMGFPG AKGEPGKPGT SGADGLPGSV GLPGEDGPPG FRGLNGQKGE PAEIAAGLLE
PGEKGEQAGL PGIYGYPGKK GSTGFPGAKG DQGPRGLTGL DGLQGPQGEQ GAKGDLGLPG
PAGEQGLQGP EGRPGVAGET GLPGYPGVKG AKSTLPGLPG ADGLPGQDGL DGTAGLPGKD
GEVGFPGRDG TPGPKGTQGR PGLMGFPGEN GEKGNIGPVG ERGLPGQQGF PGIGGLEGIP
GDKGEQGNPG LPGRNGLDGL PGEDGLPGDS VGGPAGLPGR QGLPGEKGMV GLPGLTGARG
LPGDKGEMGF LGPSGLPGRQ GQPGPRGASV KGETGLPGLP GRPIGPAGIP GLPGMIGEKG
EAGPVGEAGS TGFDGRPGQK GQIGLPGTPG LPGRSGFPGE SGQPGVAGLQ GRPGQPGSPG
LPGLEGPTGV KGERANDLFS FKGVTGLRGE DAEPIFREGE KGLPGSPGPV GPRGFPGSAG
RTGTPGGRGH KGERGQEGIQ GLPGLTGDRG EPGAQGLTGA PGLPGSNGLP GVDGLPGAAG
PSALFHGYFV TRHSQTQYVP ECPLNMRKLW EGYSLLYIQG NERSHGQDLG TAGSCLRRFN
TMPFMFCAVD NSCRVASRND YSFWLSTPEP FPMSMEAVTG KTTIEPYISR CAVCETPSLT
IAVHSQSDLI PDCPENWVSL WIGYSFVMVS SRSGAEGSGQ SLQSPGSCLE DFRAGPFIEC
HGRGTCNQYA NGYSFWLATI MQQNQFSQPI SETLKAGTLR QRVSRCQVCM RGD
//