ID H2Z8Z8_CIOSA Unreviewed; 876 AA.
AC H2Z8Z8;
DT 18-APR-2012, integrated into UniProtKB/TrEMBL.
DT 18-APR-2012, sequence version 1.
DT 27-MAR-2024, entry version 51.
DE RecName: Full=Fibrillar collagen NC1 domain-containing protein {ECO:0000259|PROSITE:PS51461};
OS Ciona savignyi (Pacific transparent sea squirt).
OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Phlebobranchia;
OC Cionidae; Ciona.
OX NCBI_TaxID=51511 {ECO:0000313|Ensembl:ENSCSAVP00000014063.1, ECO:0000313|Proteomes:UP000007875};
RN [1] {ECO:0000313|Proteomes:UP000007875}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Birren B., Nusbaum C., Abebe A., Abouelleil A., Adekoya E., Ait-zahra M.,
RA Allen N., Allen T., An P., Anderson M., Anderson S., Arachchi H.,
RA Armbruster J., Bachantsang P., Baldwin J., Barry A., Bayul T.,
RA Blitshsteyn B., Bloom T., Blye J., Boguslavskiy L., Borowsky M.,
RA Boukhgalter B., Brunache A., Butler J., Calixte N., Calvo S., Camarata J.,
RA Campo K., Chang J., Cheshatsang Y., Citroen M., Collymore A., Considine T.,
RA Cook A., Cooke P., Corum B., Cuomo C., David R., Dawoe T., Degray S.,
RA Dodge S., Dooley K., Dorje P., Dorjee K., Dorris L., Duffey N., Dupes A.,
RA Elkins T., Engels R., Erickson J., Farina A., Faro S., Ferreira P.,
RA Fischer H., Fitzgerald M., Foley K., Gage D., Galagan J., Gearin G.,
RA Gnerre S., Gnirke A., Goyette A., Graham J., Grandbois E., Gyaltsen K.,
RA Hafez N., Hagopian D., Hagos B., Hall J., Hatcher B., Heller A.,
RA Higgins H., Honan T., Horn A., Houde N., Hughes L., Hulme W., Husby E.,
RA Iliev I., Jaffe D., Jones C., Kamal M., Kamat A., Kamvysselis M.,
RA Karlsson E., Kells C., Kieu A., Kisner P., Kodira C., Kulbokas E.,
RA Labutti K., Lama D., Landers T., Leger J., Levine S., Lewis D., Lewis T.,
RA Lindblad-toh K., Liu X., Lokyitsang T., Lokyitsang Y., Lucien O., Lui A.,
RA Ma L.J., Mabbitt R., Macdonald J., Maclean C., Major J., Manning J.,
RA Marabella R., Maru K., Matthews C., Mauceli E., Mccarthy M., Mcdonough S.,
RA Mcghee T., Meldrim J., Meneus L., Mesirov J., Mihalev A., Mihova T.,
RA Mikkelsen T., Mlenga V., Moru K., Mozes J., Mulrain L., Munson G.,
RA Naylor J., Newes C., Nguyen C., Nguyen N., Nguyen T., Nicol R., Nielsen C.,
RA Nizzari M., Norbu C., Norbu N., O'donnell P., Okoawo O., O'leary S.,
RA Omotosho B., O'neill K., Osman S., Parker S., Perrin D., Phunkhang P.,
RA Piqani B., Purcell S., Rachupka T., Ramasamy U., Rameau R., Ray V.,
RA Raymond C., Retta R., Richardson S., Rise C., Rodriguez J., Rogers J.,
RA Rogov P., Rutman M., Schupbach R., Seaman C., Settipalli S., Sharpe T.,
RA Sheridan J., Sherpa N., Shi J., Smirnov S., Smith C., Sougnez C.,
RA Spencer B., Stalker J., Stange-thomann N., Stavropoulos S., Stetson K.,
RA Stone C., Stone S., Stubbs M., Talamas J., Tchuinga P., Tenzing P.,
RA Tesfaye S., Theodore J., Thoulutsang Y., Topham K., Towey S., Tsamla T.,
RA Tsomo N., Vallee D., Vassiliev H., Venkataraman V., Vinson J., Vo A.,
RA Wade C., Wang S., Wangchuk T., Wangdi T., Whittaker C., Wilkinson J.,
RA Wu Y., Wyman D., Yadav S., Yang S., Yang X., Yeager S., Yee E., Young G.,
RA Zainoun J., Zembeck L., Zimmer A., Zody M., Lander E.;
RL Submitted (AUG-2003) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSCSAVP00000014063.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; H2Z8Z8; -.
DR Ensembl; ENSCSAVT00000014225.1; ENSCSAVP00000014063.1; ENSCSAVG00000008250.1.
DR GeneTree; ENSGT00940000168023; -.
DR Proteomes; UP000007875; Unassembled WGS sequence.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.60.120.1000; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR000885; Fib_collagen_C.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 4.
DR PROSITE; PS51461; NC1_FIB; 1.
PE 4: Predicted;
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000007875};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 796..876
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 1..449
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 505..527
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 550..611
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 726..790
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 343..366
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 761..790
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 876 AA; 86301 MW; 9A58AD4A83CBC06E CRC64;
ANLQGRRGHA GRTGRPGNRG PRGQKGSYGR PGLPGPPGPK GEKGIQGPTG PDGIMGERGR
RGRKGVQGLP GRQGVKGLQG EPGVPGHPGE GGSQGIQGPP GLSGATGSRG NVGSRGLLGT
QGKSVRGSVG PKGPIGEPGR PGRSGPQGHH GPPGIPGTPG KKGLPGEKGA SKLPGFKGEM
GKPGKEGLYG KQGPMGRKGH KGIEGVTGPV GPKGHNGLTG PIGPPGRPGL PGKPGLPGQI
GTMGSVGLPG TTGPTGPDGP LGAKGNIGDI GPQGEDGKIG NRGNQGPSAT KGRKGEPGPV
GPEGNPGSPG PMGKRGPMGP EGTEGREGIP GPVGPVGDDG KSGLDGVKGE PGDDGAKGEK
GNKGDNGDHG LQGPIGLQGL QGIGGTPGNV GSSGPKGSVG KKGEMGADGL TGSPGAKGVM
GKQGQSGVKG KRGDRGDMGF PGESGPLGPI GPKGYIGLPG PIGIEGPKGT PGIEVNYNYL
GIFIYDNLGL LKGISGEKGK RGNRGARGLV GKPGVRGFSG KSGDIGRIGN LGFQGPKGKP
GSIGYPGHVG DQGQPGVRGP IGEKGLSGKP GSPGKRGKSG IPGPIGPDGV TGRDGTGGIR
GEQGEPGPIG MRGPQGPTGF PGLPGNMGDR GSIGGPGPDG IQGSTGLAGK MGRIGYPGLR
GNKGNLGEKG KNGEMGPMGI NGLPGPYGRK GDPGQPGPAG PVGLRGSDVS GIVYGPAGIM
GPIGLTGAKG TSGPNGFRGD QGIPGPPGPP GPDVDFSRLR ENMGVTSTMQ SELRSVSVTE
NTPQNDETSP VTIASYPIQS QFDTLDFILQ AKAMKLTHKD GSREYPALTC LDLMKMNEKV
GYQSRDGAYW IDPNEGSILD ALKVRCNFQR GGYTCI
//