ID H2YYS9_CIOSA Unreviewed; 1980 AA.
AC H2YYS9;
DT 18-APR-2012, integrated into UniProtKB/TrEMBL.
DT 18-APR-2012, sequence version 1.
DT 27-MAR-2024, entry version 52.
DE RecName: Full=Kielin/chordin-like protein {ECO:0008006|Google:ProtNLM};
OS Ciona savignyi (Pacific transparent sea squirt).
OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Phlebobranchia;
OC Cionidae; Ciona.
OX NCBI_TaxID=51511 {ECO:0000313|Ensembl:ENSCSAVP00000010490.1, ECO:0000313|Proteomes:UP000007875};
RN [1] {ECO:0000313|Proteomes:UP000007875}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Birren B., Nusbaum C., Abebe A., Abouelleil A., Adekoya E., Ait-zahra M.,
RA Allen N., Allen T., An P., Anderson M., Anderson S., Arachchi H.,
RA Armbruster J., Bachantsang P., Baldwin J., Barry A., Bayul T.,
RA Blitshsteyn B., Bloom T., Blye J., Boguslavskiy L., Borowsky M.,
RA Boukhgalter B., Brunache A., Butler J., Calixte N., Calvo S., Camarata J.,
RA Campo K., Chang J., Cheshatsang Y., Citroen M., Collymore A., Considine T.,
RA Cook A., Cooke P., Corum B., Cuomo C., David R., Dawoe T., Degray S.,
RA Dodge S., Dooley K., Dorje P., Dorjee K., Dorris L., Duffey N., Dupes A.,
RA Elkins T., Engels R., Erickson J., Farina A., Faro S., Ferreira P.,
RA Fischer H., Fitzgerald M., Foley K., Gage D., Galagan J., Gearin G.,
RA Gnerre S., Gnirke A., Goyette A., Graham J., Grandbois E., Gyaltsen K.,
RA Hafez N., Hagopian D., Hagos B., Hall J., Hatcher B., Heller A.,
RA Higgins H., Honan T., Horn A., Houde N., Hughes L., Hulme W., Husby E.,
RA Iliev I., Jaffe D., Jones C., Kamal M., Kamat A., Kamvysselis M.,
RA Karlsson E., Kells C., Kieu A., Kisner P., Kodira C., Kulbokas E.,
RA Labutti K., Lama D., Landers T., Leger J., Levine S., Lewis D., Lewis T.,
RA Lindblad-toh K., Liu X., Lokyitsang T., Lokyitsang Y., Lucien O., Lui A.,
RA Ma L.J., Mabbitt R., Macdonald J., Maclean C., Major J., Manning J.,
RA Marabella R., Maru K., Matthews C., Mauceli E., Mccarthy M., Mcdonough S.,
RA Mcghee T., Meldrim J., Meneus L., Mesirov J., Mihalev A., Mihova T.,
RA Mikkelsen T., Mlenga V., Moru K., Mozes J., Mulrain L., Munson G.,
RA Naylor J., Newes C., Nguyen C., Nguyen N., Nguyen T., Nicol R., Nielsen C.,
RA Nizzari M., Norbu C., Norbu N., O'donnell P., Okoawo O., O'leary S.,
RA Omotosho B., O'neill K., Osman S., Parker S., Perrin D., Phunkhang P.,
RA Piqani B., Purcell S., Rachupka T., Ramasamy U., Rameau R., Ray V.,
RA Raymond C., Retta R., Richardson S., Rise C., Rodriguez J., Rogers J.,
RA Rogov P., Rutman M., Schupbach R., Seaman C., Settipalli S., Sharpe T.,
RA Sheridan J., Sherpa N., Shi J., Smirnov S., Smith C., Sougnez C.,
RA Spencer B., Stalker J., Stange-thomann N., Stavropoulos S., Stetson K.,
RA Stone C., Stone S., Stubbs M., Talamas J., Tchuinga P., Tenzing P.,
RA Tesfaye S., Theodore J., Thoulutsang Y., Topham K., Towey S., Tsamla T.,
RA Tsomo N., Vallee D., Vassiliev H., Venkataraman V., Vinson J., Vo A.,
RA Wade C., Wang S., Wangchuk T., Wangdi T., Whittaker C., Wilkinson J.,
RA Wu Y., Wyman D., Yadav S., Yang S., Yang X., Yeager S., Yee E., Young G.,
RA Zainoun J., Zembeck L., Zimmer A., Zody M., Lander E.;
RL Submitted (AUG-2003) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSCSAVP00000010490.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 51511.ENSCSAVP00000010490; -.
DR Ensembl; ENSCSAVT00000010617.1; ENSCSAVP00000010490.1; ENSCSAVG00000006179.1.
DR eggNOG; KOG1216; Eukaryota.
DR GeneTree; ENSGT00940000160243; -.
DR HOGENOM; CLU_000367_1_0_1; -.
DR InParanoid; H2YYS9; -.
DR OMA; CQECVVE; -.
DR TreeFam; TF106451; -.
DR Proteomes; UP000007875; Unassembled WGS sequence.
DR GO; GO:0006952; P:defense response; IEA:InterPro.
DR CDD; cd19941; TIL; 1.
DR Gene3D; 6.20.200.20; -; 15.
DR Gene3D; 2.10.70.10; Complement Module, domain 1; 7.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR001010; Thionin.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001007; VWF_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR46698; CROSSVEINLESS 2; 1.
DR PANTHER; PTHR46698:SF4; CROSSVEINLESS 2; 1.
DR Pfam; PF00093; VWC; 15.
DR Pfam; PF00094; VWD; 1.
DR SMART; SM00832; C8; 1.
DR SMART; SM00214; VWC; 27.
DR SMART; SM00215; VWC_out; 9.
DR SMART; SM00216; VWD; 1.
DR SUPFAM; SSF57603; FnI-like domain; 25.
DR SUPFAM; SSF57567; Serine protease inhibitors; 1.
DR PROSITE; PS00271; THIONIN; 1.
DR PROSITE; PS01208; VWFC_1; 7.
DR PROSITE; PS50184; VWFC_2; 23.
DR PROSITE; PS51233; VWFD; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Reference proteome {ECO:0000313|Proteomes:UP000007875};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 32..91
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 88..145
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 146..205
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 205..262
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 262..321
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 321..380
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 380..438
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 438..497
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 497..557
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 557..614
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 614..673
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 732..791
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 847..905
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 905..964
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 1081..1139
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 1140..1197
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 1197..1256
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 1267..1326
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 1326..1384
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 1384..1453
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 1450..1514
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 1514..1576
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 1580..1639
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 1643..1820
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
SQ SEQUENCE 1980 AA; 215112 MW; 2CBF7F434673F90D CRC64;
HMEASTWEEG CKVCTCQSGM IQCTPMMCPS TKYCDYGGEI HRDGVQFQPN SCERCMCDSG
SVSCTSVVGD CPTPTCINPM MLPGDCCPSC PTSCGRHAEG AEWSMGPCHT CRCQFGNIEC
VIQQCPVLSC VNQHRLAGSC CPVCDVGCTF EGRLHRNGET FNSARNQCLN CTCQNNDVKC
SEPRCPEVNC PNPIQHSAQC CPQCEDCSHG NLVYRNLQVW TTANGCQRCL CQRGNVQCQE
IIPCRTCSHG VKVEGQCCKE CMRCSYHGTI YRDRETFTSS RDPCQQCVCQ RGSVTCTRVT
CPPVSCLDQH RPPGQCCPQC PGCTDGLEQW TTGSSWQQRG NPCMTCTCKD GDIRCSRRVC
PDVTCDNPAT MADQCCPTCA RCAYHGVVYG DGETVVAQDA CQQCTCSRGN VECSEQVCEA
VSCPSPVTRN GECCPRCVGC VHEGSSYEDG GSFTSQSNPC LTCTCQAGEV SCRRMECPSV
QCTHAGRRAG ECCATCDGCD YERRNYRNGE RFTPVGSSAC ISCICQDGGV QCTSIDCPQI
TCHNPTNLPG QCCPVCQVCS HDGEEYEYGE IWYADSCTTC ACDGGKVDCT SPTCPTAPCS
HPAKLSGSCC RTCEMCELDG HTYSSTTTFP HPTEPCRICQ CQDGNVACRS RPCPALSCTS
PIHEDGTCCP SCPTSCTVGT SQVEEGDSAP HPTNPCLTCS CQDGNLNCTE RCNSSPECSH
PTDGRCCRDN CDGCHYRGRS HDNGEVFAHH RDKCRSCSCI NGNVRCRLGQ CRALSCSNPV
HPAGDCCPRC PDTQCIHNDR EYADQERFVD GCRQCRCASG SVTCKPISCD PTTCTHPVKD
GCCRSCTGCR INHVDYQNGD VVPDSSSNAC EVCRCLNGNL VCRARTCPVP SCSHPTMKGC
CPACEGCLYN SISYHDGSSF ESFENPCELC TCSGGNVGCE RVVCSEPACT HPDTPVGGCC
PTCDGCQFGS STYDDGAIFA SHDNTCLTCV CSKGTVSCVN KPCAEVACSN PSIGSCDCPV
CSGCSYNGVT RRNRETFTNP EDERCSQCYC QDGNIQCESR PCEEVACHNP TFNKCGCPLC
ESCNYMGVTY SDNERFIDSM NKCNLCVCSQ GTVQCIHTPC PAAECNPVVP EGECCSKCVG
SCDVDGVEHE DGSIFPMASD PCSSCSCSQG VVRCIKLGCE RSCSHPREEG ACCPDCSGCR
YQDTLYEDGQ FFNPPDNPCR RCSCYEGNVL CRDTRCPVPD CAKPETPSGE CCPKCTEAIH
KCCGSCENCF YLNQTWSNGH RFQPNACQDC VCINGNIQCV TKACRPLSCP LNQQVHSPGS
CCPRCANCSA VGYLFAEDET WISPMDSCLK CQCHNGVVTC GRTRCIVPCQ TTVAVPGQCC
PVCRGCTFNN MPHEIGSTFE ANPMDPCEVC ECSAIISDSP SMTCHRVMCP SLADCPRTCI
VQPDPGTCCP TCASRCTYGT CGSHNIAIPH DNRCMSCECN ANHTWLCSPT SCPPLDCPQS
DRYTPKGTCC PICDRCYLDV ENRNVASGFS WRVSECQSCR CDLGSIVCST QQCPMLDCPH
GMLTYKQPGD CCEECIDPSE GCVYDGHTVA PQHRWLVDKC TTCQCFAGEN CVTQRCRMLM
CNSDEAPSVT PGECCPHCIP RPASCVAFGD PHYQTFDGRM IHFQGTCRYL LTADCSRAAF
RVEVENRNIG GDARVSWTDR VHMTVAHHRI TVDGFYNVML NGTQVPHLPL LIKPYIFIDK
SANTLLINTH LGVRLSWNAL QHHLQVEVPS SFKKKLCGLC GNFNNLPQDD LRLRNSRIAR
SDIQFGNNWK VLGREMQSCP DATPYNPCDG ISYRKRKRAN NACKVINSDL FAQCHQVVSP
AMYFSACVHD VCACGGNEDY CLCDVLETYA AQCRRAGVVV RWRSSTLCAL NCPSELGYVF
DECGSPCKRT CANTNLPPSV IEEQCYMPCV SGCQCPAGMV EYDGRCINSM DCPDTISVTD
//