GenomeNet

Database: UniProt
Entry: H2YZ20_CIOSA
LinkDB: H2YZ20_CIOSA
Original site: H2YZ20_CIOSA 
ID   H2YZ20_CIOSA            Unreviewed;       929 AA.
AC   H2YZ20;
DT   18-APR-2012, integrated into UniProtKB/TrEMBL.
DT   08-OCT-2025, sequence version 2.
DT   28-JAN-2026, entry version 77.
DE   RecName: Full=Collagenase NC10/endostatin domain-containing protein {ECO:0008006|Google:ProtNLM};
OS   Ciona savignyi (Pacific transparent sea squirt).
OC   Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Phlebobranchia;
OC   Cionidae; Ciona.
OX   NCBI_TaxID=51511 {ECO:0000313|Ensembl:ENSCSAVP00000010581.1, ECO:0000313|Proteomes:UP000007875};
RN   [1] {ECO:0000313|Proteomes:UP000007875}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Birren B., Nusbaum C., Abebe A., Abouelleil A., Adekoya E., Ait-zahra M.,
RA   Allen N., Allen T., An P., Anderson M., Anderson S., Arachchi H.,
RA   Armbruster J., Bachantsang P., Baldwin J., Barry A., Bayul T.,
RA   Blitshsteyn B., Bloom T., Blye J., Boguslavskiy L., Borowsky M.,
RA   Boukhgalter B., Brunache A., Butler J., Calixte N., Calvo S., Camarata J.,
RA   Campo K., Chang J., Cheshatsang Y., Citroen M., Collymore A., Considine T.,
RA   Cook A., Cooke P., Corum B., Cuomo C., David R., Dawoe T., Degray S.,
RA   Dodge S., Dooley K., Dorje P., Dorjee K., Dorris L., Duffey N., Dupes A.,
RA   Elkins T., Engels R., Erickson J., Farina A., Faro S., Ferreira P.,
RA   Fischer H., Fitzgerald M., Foley K., Gage D., Galagan J., Gearin G.,
RA   Gnerre S., Gnirke A., Goyette A., Graham J., Grandbois E., Gyaltsen K.,
RA   Hafez N., Hagopian D., Hagos B., Hall J., Hatcher B., Heller A.,
RA   Higgins H., Honan T., Horn A., Houde N., Hughes L., Hulme W., Husby E.,
RA   Iliev I., Jaffe D., Jones C., Kamal M., Kamat A., Kamvysselis M.,
RA   Karlsson E., Kells C., Kieu A., Kisner P., Kodira C., Kulbokas E.,
RA   Labutti K., Lama D., Landers T., Leger J., Levine S., Lewis D., Lewis T.,
RA   Lindblad-toh K., Liu X., Lokyitsang T., Lokyitsang Y., Lucien O., Lui A.,
RA   Ma L.J., Mabbitt R., Macdonald J., Maclean C., Major J., Manning J.,
RA   Marabella R., Maru K., Matthews C., Mauceli E., Mccarthy M., Mcdonough S.,
RA   Mcghee T., Meldrim J., Meneus L., Mesirov J., Mihalev A., Mihova T.,
RA   Mikkelsen T., Mlenga V., Moru K., Mozes J., Mulrain L., Munson G.,
RA   Naylor J., Newes C., Nguyen C., Nguyen N., Nguyen T., Nicol R., Nielsen C.,
RA   Nizzari M., Norbu C., Norbu N., O'donnell P., Okoawo O., O'leary S.,
RA   Omotosho B., O'neill K., Osman S., Parker S., Perrin D., Phunkhang P.,
RA   Piqani B., Purcell S., Rachupka T., Ramasamy U., Rameau R., Ray V.,
RA   Raymond C., Retta R., Richardson S., Rise C., Rodriguez J., Rogers J.,
RA   Rogov P., Rutman M., Schupbach R., Seaman C., Settipalli S., Sharpe T.,
RA   Sheridan J., Sherpa N., Shi J., Smirnov S., Smith C., Sougnez C.,
RA   Spencer B., Stalker J., Stange-thomann N., Stavropoulos S., Stetson K.,
RA   Stone C., Stone S., Stubbs M., Talamas J., Tchuinga P., Tenzing P.,
RA   Tesfaye S., Theodore J., Thoulutsang Y., Topham K., Towey S., Tsamla T.,
RA   Tsomo N., Vallee D., Vassiliev H., Venkataraman V., Vinson J., Vo A.,
RA   Wade C., Wang S., Wangchuk T., Wangdi T., Whittaker C., Wilkinson J.,
RA   Wu Y., Wyman D., Yadav S., Yang S., Yang X., Yeager S., Yee E., Young G.,
RA   Zainoun J., Zembeck L., Zimmer A., Zody M., Lander E.;
RL   Submitted (AUG-2003) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSCSAVP00000010581.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (AUG-2025) to UniProtKB.
RN   [3] {ECO:0000313|Ensembl:ENSCSAVP00000010581.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2025) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; H2YZ20; -.
DR   STRING; 51511.ENSCSAVP00000010581; -.
DR   Ensembl; ENSCSAVT00000010710.1; ENSCSAVP00000010581.1; ENSCSAVG00000006221.1.
DR   eggNOG; KOG3544; Eukaryota.
DR   eggNOG; KOG3546; Eukaryota.
DR   GeneTree; ENSGT00940000158302; -.
DR   HOGENOM; CLU_014222_0_0_1; -.
DR   InParanoid; H2YZ20; -.
DR   OMA; YSHERPY; -.
DR   Proteomes; UP000007875; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   CDD; cd00247; Endostatin-like; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007875}.
FT   DOMAIN          659..701
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          754..921
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   REGION          1..656
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          719..746
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1..12
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        15..29
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        49..63
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        99..112
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        156..174
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        225..236
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        287..296
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        395..408
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        410..419
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        420..435
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        446..462
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        557..569
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        617..632
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   929 AA;  92385 MW;  9618DB7097967722 CRC64;
     MPSISPLTTQ PSPKEVTTTP TITRVTEPTT TKDHLVVSEM VPDSNFRTAS DRGEKGEAGR
     DGENGVGVPG MLGPQGEKGE PGDMSDVAGP PGERGPKGEP GIPGKDGVSV GAAGAGEKGE
     QGEPGPQGER GESVTGSKGE PGEIGKDGIP GEVGSAGEQG PAGPAGPQGE TGKPGVDGKD
     GVGLPGANGA KGQKGDAGDV SSVTGAPGVA GPPGIKGEKG SPGMDGEDGE DGEDGEDGSK
     GDRGPPGKGI RGLPGPPGTY SGFTDMEGSG FGGVPGEPGP AGPIGPAGPA GAMGPIGPAG
     PPGGDQGDAG PEGPAGKDGI SGAEGQIGHP GPVGPIGAKG EPGDDGSAGL PGPPGPPGTC
     QASDKMPTDD EDFMEGSGEE RPKIKSGCGA GAAGAPGPEG KPGLQGPHGP AGPPGPPGPQ
     GEMGPQGPHG LQGQQGDAGV SGVKGSQGPA GVPGAVGPQG SPGNDGRDGA PGTPGLDGKP
     GLSGSKGDTG AQGPPGPPGD VISRIQEASA EKGEKGEPGA VIGPDGNVLS AGQKGEPGVP
     GINGKDGYPG IDGAKGDMGM PGRRGSPGRN GRDGRPGPPG PSTSSGGLFG GSDGAKRGDT
     GAEGARGPRG SPGIDGQQGI PGRTGQPGRT GPAGPPGPAG RDGQNGGNVA SLRDSNPVRS
     FDSIHHMQAS YQTTADGTMA FLNDREEMFV KTSIGWRQVM LSRPLPPPRQ PVVTLEPEIP
     ETTQPPTTTL STTTLPPTTT TISTEAPRTP GKVLHLIALN FPLPGNTRGI VGADGRCFEQ
     ARRAGLTGTY RAFLSSRDQN VRSIVGREDR RNVPIVNALG EQLFPSWESI FRSNEGSIPN
     AKMIYSFKKQ QVLTDARWPL KYIWHGSYPD GRLNPMHYCA SWYTQHPAVT GQASPLASSK
     LLAQKPYSCQ SSFVVLCIEN STRTLRYKR
//
DBGET integrated database retrieval system