GenomeNet

Database: UniProt
Entry: H2Z8Z8_CIOSA
ID   H2Z8Z8_CIOSA            Unreviewed;       876 AA.
AC   H2Z8Z8;
DT   18-APR-2012, integrated into UniProtKB/TrEMBL.
DT   18-APR-2012, sequence version 1.
DT   27-MAR-2024, entry version 51.
DE   RecName: Full=Fibrillar collagen NC1 domain-containing protein {ECO:0000259|PROSITE:PS51461};
OS   Ciona savignyi (Pacific transparent sea squirt).
OC   Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Phlebobranchia;
OC   Cionidae; Ciona.
OX   NCBI_TaxID=51511 {ECO:0000313|Ensembl:ENSCSAVP00000014063.1, ECO:0000313|Proteomes:UP000007875};
RN   [1] {ECO:0000313|Proteomes:UP000007875}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Birren B., Nusbaum C., Abebe A., Abouelleil A., Adekoya E., Ait-zahra M.,
RA   Allen N., Allen T., An P., Anderson M., Anderson S., Arachchi H.,
RA   Armbruster J., Bachantsang P., Baldwin J., Barry A., Bayul T.,
RA   Blitshsteyn B., Bloom T., Blye J., Boguslavskiy L., Borowsky M.,
RA   Boukhgalter B., Brunache A., Butler J., Calixte N., Calvo S., Camarata J.,
RA   Campo K., Chang J., Cheshatsang Y., Citroen M., Collymore A., Considine T.,
RA   Cook A., Cooke P., Corum B., Cuomo C., David R., Dawoe T., Degray S.,
RA   Dodge S., Dooley K., Dorje P., Dorjee K., Dorris L., Duffey N., Dupes A.,
RA   Elkins T., Engels R., Erickson J., Farina A., Faro S., Ferreira P.,
RA   Fischer H., Fitzgerald M., Foley K., Gage D., Galagan J., Gearin G.,
RA   Gnerre S., Gnirke A., Goyette A., Graham J., Grandbois E., Gyaltsen K.,
RA   Hafez N., Hagopian D., Hagos B., Hall J., Hatcher B., Heller A.,
RA   Higgins H., Honan T., Horn A., Houde N., Hughes L., Hulme W., Husby E.,
RA   Iliev I., Jaffe D., Jones C., Kamal M., Kamat A., Kamvysselis M.,
RA   Karlsson E., Kells C., Kieu A., Kisner P., Kodira C., Kulbokas E.,
RA   Labutti K., Lama D., Landers T., Leger J., Levine S., Lewis D., Lewis T.,
RA   Lindblad-toh K., Liu X., Lokyitsang T., Lokyitsang Y., Lucien O., Lui A.,
RA   Ma L.J., Mabbitt R., Macdonald J., Maclean C., Major J., Manning J.,
RA   Marabella R., Maru K., Matthews C., Mauceli E., Mccarthy M., Mcdonough S.,
RA   Mcghee T., Meldrim J., Meneus L., Mesirov J., Mihalev A., Mihova T.,
RA   Mikkelsen T., Mlenga V., Moru K., Mozes J., Mulrain L., Munson G.,
RA   Naylor J., Newes C., Nguyen C., Nguyen N., Nguyen T., Nicol R., Nielsen C.,
RA   Nizzari M., Norbu C., Norbu N., O'donnell P., Okoawo O., O'leary S.,
RA   Omotosho B., O'neill K., Osman S., Parker S., Perrin D., Phunkhang P.,
RA   Piqani B., Purcell S., Rachupka T., Ramasamy U., Rameau R., Ray V.,
RA   Raymond C., Retta R., Richardson S., Rise C., Rodriguez J., Rogers J.,
RA   Rogov P., Rutman M., Schupbach R., Seaman C., Settipalli S., Sharpe T.,
RA   Sheridan J., Sherpa N., Shi J., Smirnov S., Smith C., Sougnez C.,
RA   Spencer B., Stalker J., Stange-thomann N., Stavropoulos S., Stetson K.,
RA   Stone C., Stone S., Stubbs M., Talamas J., Tchuinga P., Tenzing P.,
RA   Tesfaye S., Theodore J., Thoulutsang Y., Topham K., Towey S., Tsamla T.,
RA   Tsomo N., Vallee D., Vassiliev H., Venkataraman V., Vinson J., Vo A.,
RA   Wade C., Wang S., Wangchuk T., Wangdi T., Whittaker C., Wilkinson J.,
RA   Wu Y., Wyman D., Yadav S., Yang S., Yang X., Yeager S., Yee E., Young G.,
RA   Zainoun J., Zembeck L., Zimmer A., Zody M., Lander E.;
RL   Submitted (AUG-2003) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSCSAVP00000014063.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; H2Z8Z8; -.
DR   Ensembl; ENSCSAVT00000014225.1; ENSCSAVP00000014063.1; ENSCSAVG00000008250.1.
DR   GeneTree; ENSGT00940000168023; -.
DR   Proteomes; UP000007875; Unassembled WGS sequence.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   Gene3D; 2.60.120.1000; -; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR000885; Fib_collagen_C.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR   Pfam; PF01410; COLFI; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   PROSITE; PS51461; NC1_FIB; 1.
PE   4: Predicted;
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007875};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT   DOMAIN          796..876
FT                   /note="Fibrillar collagen NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51461"
FT   REGION          1..449
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          505..527
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          550..611
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          726..790
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        343..366
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        761..790
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   876 AA;  86301 MW;  9A58AD4A83CBC06E CRC64;
     ANLQGRRGHA GRTGRPGNRG PRGQKGSYGR PGLPGPPGPK GEKGIQGPTG PDGIMGERGR
     RGRKGVQGLP GRQGVKGLQG EPGVPGHPGE GGSQGIQGPP GLSGATGSRG NVGSRGLLGT
     QGKSVRGSVG PKGPIGEPGR PGRSGPQGHH GPPGIPGTPG KKGLPGEKGA SKLPGFKGEM
     GKPGKEGLYG KQGPMGRKGH KGIEGVTGPV GPKGHNGLTG PIGPPGRPGL PGKPGLPGQI
     GTMGSVGLPG TTGPTGPDGP LGAKGNIGDI GPQGEDGKIG NRGNQGPSAT KGRKGEPGPV
     GPEGNPGSPG PMGKRGPMGP EGTEGREGIP GPVGPVGDDG KSGLDGVKGE PGDDGAKGEK
     GNKGDNGDHG LQGPIGLQGL QGIGGTPGNV GSSGPKGSVG KKGEMGADGL TGSPGAKGVM
     GKQGQSGVKG KRGDRGDMGF PGESGPLGPI GPKGYIGLPG PIGIEGPKGT PGIEVNYNYL
     GIFIYDNLGL LKGISGEKGK RGNRGARGLV GKPGVRGFSG KSGDIGRIGN LGFQGPKGKP
     GSIGYPGHVG DQGQPGVRGP IGEKGLSGKP GSPGKRGKSG IPGPIGPDGV TGRDGTGGIR
     GEQGEPGPIG MRGPQGPTGF PGLPGNMGDR GSIGGPGPDG IQGSTGLAGK MGRIGYPGLR
     GNKGNLGEKG KNGEMGPMGI NGLPGPYGRK GDPGQPGPAG PVGLRGSDVS GIVYGPAGIM
     GPIGLTGAKG TSGPNGFRGD QGIPGPPGPP GPDVDFSRLR ENMGVTSTMQ SELRSVSVTE
     NTPQNDETSP VTIASYPIQS QFDTLDFILQ AKAMKLTHKD GSREYPALTC LDLMKMNEKV
     GYQSRDGAYW IDPNEGSILD ALKVRCNFQR GGYTCI
//
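
The record above can be checked for internal consistency: the length declared on the ID line should equal the number of residues in the SQ block, and every FT range should fall inside that length. The following is a minimal sketch, not an official UniProt parser. The function name `parse_flatfile` and the `SAMPLE` entry are hypothetical, and the sketch assumes only simple `start..end` feature ranges like the ones in this entry (no fuzzy or single-position locations).

```python
import re

def parse_flatfile(text):
    """Return (declared_length, features, sequence) for one flat-file entry.

    Hypothetical helper: handles only the line types needed for a
    length check (ID, FT with start..end ranges, SQ, and the // terminator).
    """
    declared_len = None
    features = []   # list of (feature_key, start, end), 1-based inclusive
    seq_parts = []
    in_seq = False
    for line in text.splitlines():
        if line.startswith("//"):      # end-of-entry terminator
            break
        if in_seq:
            # Sequence lines are indented and grouped in blocks of 10 residues.
            seq_parts.append("".join(line.split()))
            continue
        code = line[:2]
        if code == "ID":
            m = re.search(r"(\d+) AA\.", line)
            if m:
                declared_len = int(m.group(1))
        elif code == "FT":
            # Only the first line of a feature carries key and range;
            # qualifier lines (/note, /evidence) do not match and are skipped.
            m = re.match(r"FT\s{3}(\S+)\s+(\d+)\.\.(\d+)", line)
            if m:
                features.append((m.group(1), int(m.group(2)), int(m.group(3))))
        elif code == "SQ":
            in_seq = True
    return declared_len, features, "".join(seq_parts)

# Made-up miniature entry, used only to exercise the parser.
SAMPLE = """\
ID   TEST_ENTRY              Unreviewed;        12 AA.
FT   DOMAIN          3..8
FT                   /note="Example"
SQ   SEQUENCE   12 AA;  0 MW;  0 CRC64;
     ACDEFGHIKL MN
//
"""

declared, feats, seq = parse_flatfile(SAMPLE)
in_bounds = all(1 <= start <= end <= len(seq) for _, start, end in feats)
print(declared == len(seq), in_bounds)
```

Applied to this entry, the same check would compare the 876 AA declared on the ID line with the residues in the SQ block and confirm that the DOMAIN feature at 796..876 ends on the last residue.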