GenomeNet

Database: UniProt
Entry: H2YA41_CIOSA
LinkDB: H2YA41_CIOSA
Original site: H2YA41_CIOSA 
ID   H2YA41_CIOSA            Unreviewed;      2107 AA.
AC   H2YA41;
DT   18-APR-2012, integrated into UniProtKB/TrEMBL.
DT   18-APR-2012, sequence version 1.
DT   27-MAR-2024, entry version 51.
DE   RecName: Full=C-type lectin domain-containing protein {ECO:0000259|PROSITE:PS50041};
OS   Ciona savignyi (Pacific transparent sea squirt).
OC   Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Phlebobranchia;
OC   Cionidae; Ciona.
OX   NCBI_TaxID=51511 {ECO:0000313|Ensembl:ENSCSAVP00000002189.1, ECO:0000313|Proteomes:UP000007875};
RN   [1] {ECO:0000313|Proteomes:UP000007875}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Birren B., Nusbaum C., Abebe A., Abouelleil A., Adekoya E., Ait-zahra M.,
RA   Allen N., Allen T., An P., Anderson M., Anderson S., Arachchi H.,
RA   Armbruster J., Bachantsang P., Baldwin J., Barry A., Bayul T.,
RA   Blitshsteyn B., Bloom T., Blye J., Boguslavskiy L., Borowsky M.,
RA   Boukhgalter B., Brunache A., Butler J., Calixte N., Calvo S., Camarata J.,
RA   Campo K., Chang J., Cheshatsang Y., Citroen M., Collymore A., Considine T.,
RA   Cook A., Cooke P., Corum B., Cuomo C., David R., Dawoe T., Degray S.,
RA   Dodge S., Dooley K., Dorje P., Dorjee K., Dorris L., Duffey N., Dupes A.,
RA   Elkins T., Engels R., Erickson J., Farina A., Faro S., Ferreira P.,
RA   Fischer H., Fitzgerald M., Foley K., Gage D., Galagan J., Gearin G.,
RA   Gnerre S., Gnirke A., Goyette A., Graham J., Grandbois E., Gyaltsen K.,
RA   Hafez N., Hagopian D., Hagos B., Hall J., Hatcher B., Heller A.,
RA   Higgins H., Honan T., Horn A., Houde N., Hughes L., Hulme W., Husby E.,
RA   Iliev I., Jaffe D., Jones C., Kamal M., Kamat A., Kamvysselis M.,
RA   Karlsson E., Kells C., Kieu A., Kisner P., Kodira C., Kulbokas E.,
RA   Labutti K., Lama D., Landers T., Leger J., Levine S., Lewis D., Lewis T.,
RA   Lindblad-toh K., Liu X., Lokyitsang T., Lokyitsang Y., Lucien O., Lui A.,
RA   Ma L.J., Mabbitt R., Macdonald J., Maclean C., Major J., Manning J.,
RA   Marabella R., Maru K., Matthews C., Mauceli E., Mccarthy M., Mcdonough S.,
RA   Mcghee T., Meldrim J., Meneus L., Mesirov J., Mihalev A., Mihova T.,
RA   Mikkelsen T., Mlenga V., Moru K., Mozes J., Mulrain L., Munson G.,
RA   Naylor J., Newes C., Nguyen C., Nguyen N., Nguyen T., Nicol R., Nielsen C.,
RA   Nizzari M., Norbu C., Norbu N., O'donnell P., Okoawo O., O'leary S.,
RA   Omotosho B., O'neill K., Osman S., Parker S., Perrin D., Phunkhang P.,
RA   Piqani B., Purcell S., Rachupka T., Ramasamy U., Rameau R., Ray V.,
RA   Raymond C., Retta R., Richardson S., Rise C., Rodriguez J., Rogers J.,
RA   Rogov P., Rutman M., Schupbach R., Seaman C., Settipalli S., Sharpe T.,
RA   Sheridan J., Sherpa N., Shi J., Smirnov S., Smith C., Sougnez C.,
RA   Spencer B., Stalker J., Stange-thomann N., Stavropoulos S., Stetson K.,
RA   Stone C., Stone S., Stubbs M., Talamas J., Tchuinga P., Tenzing P.,
RA   Tesfaye S., Theodore J., Thoulutsang Y., Topham K., Towey S., Tsamla T.,
RA   Tsomo N., Vallee D., Vassiliev H., Venkataraman V., Vinson J., Vo A.,
RA   Wade C., Wang S., Wangchuk T., Wangdi T., Whittaker C., Wilkinson J.,
RA   Wu Y., Wyman D., Yadav S., Yang S., Yang X., Yeager S., Yee E., Young G.,
RA   Zainoun J., Zembeck L., Zimmer A., Zody M., Lander E.;
RL   Submitted (AUG-2003) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSCSAVP00000002189.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- SIMILARITY: Belongs to the FRAS1 family.
CC       {ECO:0000256|ARBA:ARBA00005529}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   Ensembl; ENSCSAVT00000002227.1; ENSCSAVP00000002189.1; ENSCSAVG00000001283.1.
DR   GeneTree; ENSGT00940000174298; -.
DR   HOGENOM; CLU_001041_0_0_1; -.
DR   Proteomes; UP000007875; Unassembled WGS sequence.
DR   GO; GO:0016020; C:membrane; IEA:InterPro.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   GO; GO:0007154; P:cell communication; IEA:InterPro.
DR   CDD; cd00037; CLECT; 1.
DR   Gene3D; 2.60.40.2030; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR001304; C-type_lectin-like.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR038081; CalX-like_sf.
DR   InterPro; IPR003644; Calx_beta.
DR   InterPro; IPR039005; CSPG_rpt.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045658; FRAS1-rel_N.
DR   PANTHER; PTHR45739:SF11; C-TYPE LECTIN DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR45739; MATRIX PROTEIN, PUTATIVE-RELATED; 1.
DR   Pfam; PF16184; Cadherin_3; 12.
DR   Pfam; PF03160; Calx-beta; 1.
DR   Pfam; PF19309; Frem_N; 1.
DR   Pfam; PF00059; Lectin_C; 1.
DR   SMART; SM00237; Calx_beta; 1.
DR   SMART; SM00034; CLECT; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF141072; CalX-like; 1.
DR   PROSITE; PS50041; C_TYPE_LECTIN_2; 1.
DR   PROSITE; PS51854; CSPG; 10.
PE   3: Inferred from homology;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007875};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Signal {ECO:0000256|ARBA:ARBA00022729}.
FT   REPEAT          218..310
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          335..422
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          443..537
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          700..791
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          810..906
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          949..1051
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          1067..1176
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          1197..1296
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          1317..1409
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          1437..1525
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   DOMAIN          1989..2092
FT                   /note="C-type lectin"
FT                   /evidence="ECO:0000259|PROSITE:PS50041"
FT   REGION          1785..1848
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1801..1829
FT                   /note="Basic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1833..1848
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2107 AA;  238741 MW;  82EE5BBE3C9D7646 CRC64;
     CKVEVVQNEP MYQKVGRFTP ETFDCDYLPN SVRYEHNGSP FLLQDDVKLR SYVMLSNATV
     QETVIIHIRI LNSRCDIIQI NDVPLRVERN TKFSNPINRD VMEFDYDRDF DTECEVRVIY
     EGLYEFPAQG QLVRSDNSTE TANTEQPHYI TTGTIRTSCD DFLVMGIRYE HRSVPTPDVD
     YIPIHVVVTD RRSGSVLTQE RHFLPVTIAS AFPNQPPKPS FTNMYILEVD QFVLTNILPM
     TVSAMDGETR NERLIFNISK QPEAGFITHL NDESKSIDSF WQEDLESLQI AYQPPNISYP
     ERQSFVVEFT IFDAYFEHSM PIRALFSVRV AQTNAPRVSW NMGLSMLEGQ SRPITHSSLQ
     VVDKDNLNRV RFMLAGGLQH GRLYVNNRPG FTFQWRDIEK GVVTYQHDDT DTRKDKIVFR
     VTDGIHSTRF KMPIKILPKD DAPPFLINNI VFTVHEGQTI LIHRFMLEAH DADSSADYIK
     FNITKLPVAG EVQKRRNWER AGWAVNKFQQ SDLYKGLIYY KHLGNEVFDD SFEFTLIDSC
     TPVPNISPIH RVVIKITPVN DLPPQPADGN SLALTLNETD VIHLTRRELH YVDLEEANSD
     VTFNIDAECR VIGGQSGVNA GRLIFTDDMV MLMKDPSVPT LRTFTQSAVD HEKVAYMPPM
     EDVGLHPLDV QFTFSVADGQ GREIRDLTFA ITVLPVDNQV APTINVTRLS LHEGATQVIS
     ADLMQIHDED TRKADLVVYL STPPLVGAIY KNDIEMREND SFSVMDVEFF RIIYHHDGGE
     IHTDRFQLTV SDGKQNTSTW MHVKITPVDD MPPTINGKIV MTINVREGSW GAISSDHLTA
     TDVDTNDNEL IFEILVPPHL GEITINGEPV TSFQQQDILD GKVHYEHDGT EVGKYVVEDV
     ATFMVVDRRG NIHAVKDGFY EWIIAGVCQD TVVTTKDVHF DIHPVNTHPP RVALGSQVFR
     CNEGGFEPLT EAFLMADDFD SPTLNLSFII TEEPTYGFIE DVTPRPGYEK VIGKRTNSFT
     YDQLMSGYIR YVQSQHQGME PTSDQFSIRA SDGDHLSAQV PFLISITPMN DEAPVLIVQN
     ITCNEGEMAP LTLVVDDMDS PHDHVMVVVT EAPTHGMVMD EMDMLSRYRR SGSHRSVHMH
     AMEEFSMQQL NEHRVIPAYF HDGSESVQDQ LELKVTDGMH VYKTHLMVNI IQVNDETPEI
     VHNEGITLEL GSDKVISSVA LQARDGDTIS SELLYELHSI PRRGLLQIKQ QSDIVWNDLE
     LGETFTEKDV EMNRIRYHHS SVLGSKGQDS FRFSVTDGEF TTPRVNFAIT IEHTKKSAIH
     VTTHPLRVNE RGQGYISGDV LLARDDAQRP EEITFDVIRS PAYGRIEYIN FPGMEIEMFT
     QLDIMARNVI YIHTSKADTA VDTFRILASN GIKTKEAEIN ILITSVDEEL PVLTLPQPSS
     MGHANLMLYS GSSVTITNMI INITDEDTSK NNVRLIIMER PQHGSMRLDG VETTVVTLGD
     LERGGFAYHH NGDGASIDRF TFTVSDGVHA GYFYQGGIRR QEATAFNLKI ESLDETPPYV
     ALNIEPTMIQ PLGNHQTNGI YIDSNFLRTQ DSGTLNNEDL IISIITPPSY GQLRLVHGNI
     GEESVMQFSQ SDLDHRNIIY VVAARMRVDN DSFTFSVQDA WSNTLSESRF SMSWSHIKIS
     QRKIKVCEDV GFIEVQVSRS GNLTRSAFVG VMVQPRNAKP GVDYVPSSAT QIQFDPGVAH
     QTWRIEIMND QLEERSEKFL VKLHTPVNSV LNPKKQKMLV VIKDKSHPSC SGGSLAKQSK
     KLNSKKLKSK KRKNKTKKKK SKAKKRKKGQ SRNIAADQSS TSSGRGGQLL TQIRYGKHGP
     GVTVSPASFF RNETHRIWRY HGLLPVTVDE TEEDIFSSSS NVLDVVPTAQ KRKLKIIDRL
     EVNDDPTASR VSLVTTISQP CNLSARGRLH YDAGRSTLFQ CDGSQWLAWR ARERPVIQPE
     PTTPANHCED GWCYYVSSAE EISWNSAQRS CREIHGAHLV SLGSRKQMNW LWKLSRKTPF
     FIGLNSKLNH QEWEWMDGSE VSFMNWKRRF PRPDGGRCVV VVRRQWQDHS CSDLPTRQTK
     YICARDP
//
DBGET integrated database retrieval system