ID H2YA41_CIOSA Unreviewed; 2107 AA.
AC H2YA41;
DT 18-APR-2012, integrated into UniProtKB/TrEMBL.
DT 18-APR-2012, sequence version 1.
DT 27-MAR-2024, entry version 51.
DE RecName: Full=C-type lectin domain-containing protein {ECO:0000259|PROSITE:PS50041};
OS Ciona savignyi (Pacific transparent sea squirt).
OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Phlebobranchia;
OC Cionidae; Ciona.
OX NCBI_TaxID=51511 {ECO:0000313|Ensembl:ENSCSAVP00000002189.1, ECO:0000313|Proteomes:UP000007875};
RN [1] {ECO:0000313|Proteomes:UP000007875}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Birren B., Nusbaum C., Abebe A., Abouelleil A., Adekoya E., Ait-zahra M.,
RA Allen N., Allen T., An P., Anderson M., Anderson S., Arachchi H.,
RA Armbruster J., Bachantsang P., Baldwin J., Barry A., Bayul T.,
RA Blitshsteyn B., Bloom T., Blye J., Boguslavskiy L., Borowsky M.,
RA Boukhgalter B., Brunache A., Butler J., Calixte N., Calvo S., Camarata J.,
RA Campo K., Chang J., Cheshatsang Y., Citroen M., Collymore A., Considine T.,
RA Cook A., Cooke P., Corum B., Cuomo C., David R., Dawoe T., Degray S.,
RA Dodge S., Dooley K., Dorje P., Dorjee K., Dorris L., Duffey N., Dupes A.,
RA Elkins T., Engels R., Erickson J., Farina A., Faro S., Ferreira P.,
RA Fischer H., Fitzgerald M., Foley K., Gage D., Galagan J., Gearin G.,
RA Gnerre S., Gnirke A., Goyette A., Graham J., Grandbois E., Gyaltsen K.,
RA Hafez N., Hagopian D., Hagos B., Hall J., Hatcher B., Heller A.,
RA Higgins H., Honan T., Horn A., Houde N., Hughes L., Hulme W., Husby E.,
RA Iliev I., Jaffe D., Jones C., Kamal M., Kamat A., Kamvysselis M.,
RA Karlsson E., Kells C., Kieu A., Kisner P., Kodira C., Kulbokas E.,
RA Labutti K., Lama D., Landers T., Leger J., Levine S., Lewis D., Lewis T.,
RA Lindblad-toh K., Liu X., Lokyitsang T., Lokyitsang Y., Lucien O., Lui A.,
RA Ma L.J., Mabbitt R., Macdonald J., Maclean C., Major J., Manning J.,
RA Marabella R., Maru K., Matthews C., Mauceli E., Mccarthy M., Mcdonough S.,
RA Mcghee T., Meldrim J., Meneus L., Mesirov J., Mihalev A., Mihova T.,
RA Mikkelsen T., Mlenga V., Moru K., Mozes J., Mulrain L., Munson G.,
RA Naylor J., Newes C., Nguyen C., Nguyen N., Nguyen T., Nicol R., Nielsen C.,
RA Nizzari M., Norbu C., Norbu N., O'donnell P., Okoawo O., O'leary S.,
RA Omotosho B., O'neill K., Osman S., Parker S., Perrin D., Phunkhang P.,
RA Piqani B., Purcell S., Rachupka T., Ramasamy U., Rameau R., Ray V.,
RA Raymond C., Retta R., Richardson S., Rise C., Rodriguez J., Rogers J.,
RA Rogov P., Rutman M., Schupbach R., Seaman C., Settipalli S., Sharpe T.,
RA Sheridan J., Sherpa N., Shi J., Smirnov S., Smith C., Sougnez C.,
RA Spencer B., Stalker J., Stange-thomann N., Stavropoulos S., Stetson K.,
RA Stone C., Stone S., Stubbs M., Talamas J., Tchuinga P., Tenzing P.,
RA Tesfaye S., Theodore J., Thoulutsang Y., Topham K., Towey S., Tsamla T.,
RA Tsomo N., Vallee D., Vassiliev H., Venkataraman V., Vinson J., Vo A.,
RA Wade C., Wang S., Wangchuk T., Wangdi T., Whittaker C., Wilkinson J.,
RA Wu Y., Wyman D., Yadav S., Yang S., Yang X., Yeager S., Yee E., Young G.,
RA Zainoun J., Zembeck L., Zimmer A., Zody M., Lander E.;
RL Submitted (AUG-2003) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSCSAVP00000002189.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the FRAS1 family.
CC {ECO:0000256|ARBA:ARBA00005529}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSCSAVT00000002227.1; ENSCSAVP00000002189.1; ENSCSAVG00000001283.1.
DR GeneTree; ENSGT00940000174298; -.
DR HOGENOM; CLU_001041_0_0_1; -.
DR Proteomes; UP000007875; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:InterPro.
DR GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR GO; GO:0007154; P:cell communication; IEA:InterPro.
DR CDD; cd00037; CLECT; 1.
DR Gene3D; 2.60.40.2030; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR001304; C-type_lectin-like.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR038081; CalX-like_sf.
DR InterPro; IPR003644; Calx_beta.
DR InterPro; IPR039005; CSPG_rpt.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045658; FRAS1-rel_N.
DR PANTHER; PTHR45739:SF11; C-TYPE LECTIN DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR45739; MATRIX PROTEIN, PUTATIVE-RELATED; 1.
DR Pfam; PF16184; Cadherin_3; 12.
DR Pfam; PF03160; Calx-beta; 1.
DR Pfam; PF19309; Frem_N; 1.
DR Pfam; PF00059; Lectin_C; 1.
DR SMART; SM00237; Calx_beta; 1.
DR SMART; SM00034; CLECT; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF141072; CalX-like; 1.
DR PROSITE; PS50041; C_TYPE_LECTIN_2; 1.
DR PROSITE; PS51854; CSPG; 10.
PE 3: Inferred from homology;
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Reference proteome {ECO:0000313|Proteomes:UP000007875};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT REPEAT 218..310
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 335..422
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 443..537
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 700..791
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 810..906
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 949..1051
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1067..1176
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1197..1296
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1317..1409
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1437..1525
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT DOMAIN 1989..2092
FT /note="C-type lectin"
FT /evidence="ECO:0000259|PROSITE:PS50041"
FT REGION 1785..1848
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1801..1829
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1833..1848
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2107 AA; 238741 MW; 82EE5BBE3C9D7646 CRC64;
CKVEVVQNEP MYQKVGRFTP ETFDCDYLPN SVRYEHNGSP FLLQDDVKLR SYVMLSNATV
QETVIIHIRI LNSRCDIIQI NDVPLRVERN TKFSNPINRD VMEFDYDRDF DTECEVRVIY
EGLYEFPAQG QLVRSDNSTE TANTEQPHYI TTGTIRTSCD DFLVMGIRYE HRSVPTPDVD
YIPIHVVVTD RRSGSVLTQE RHFLPVTIAS AFPNQPPKPS FTNMYILEVD QFVLTNILPM
TVSAMDGETR NERLIFNISK QPEAGFITHL NDESKSIDSF WQEDLESLQI AYQPPNISYP
ERQSFVVEFT IFDAYFEHSM PIRALFSVRV AQTNAPRVSW NMGLSMLEGQ SRPITHSSLQ
VVDKDNLNRV RFMLAGGLQH GRLYVNNRPG FTFQWRDIEK GVVTYQHDDT DTRKDKIVFR
VTDGIHSTRF KMPIKILPKD DAPPFLINNI VFTVHEGQTI LIHRFMLEAH DADSSADYIK
FNITKLPVAG EVQKRRNWER AGWAVNKFQQ SDLYKGLIYY KHLGNEVFDD SFEFTLIDSC
TPVPNISPIH RVVIKITPVN DLPPQPADGN SLALTLNETD VIHLTRRELH YVDLEEANSD
VTFNIDAECR VIGGQSGVNA GRLIFTDDMV MLMKDPSVPT LRTFTQSAVD HEKVAYMPPM
EDVGLHPLDV QFTFSVADGQ GREIRDLTFA ITVLPVDNQV APTINVTRLS LHEGATQVIS
ADLMQIHDED TRKADLVVYL STPPLVGAIY KNDIEMREND SFSVMDVEFF RIIYHHDGGE
IHTDRFQLTV SDGKQNTSTW MHVKITPVDD MPPTINGKIV MTINVREGSW GAISSDHLTA
TDVDTNDNEL IFEILVPPHL GEITINGEPV TSFQQQDILD GKVHYEHDGT EVGKYVVEDV
ATFMVVDRRG NIHAVKDGFY EWIIAGVCQD TVVTTKDVHF DIHPVNTHPP RVALGSQVFR
CNEGGFEPLT EAFLMADDFD SPTLNLSFII TEEPTYGFIE DVTPRPGYEK VIGKRTNSFT
YDQLMSGYIR YVQSQHQGME PTSDQFSIRA SDGDHLSAQV PFLISITPMN DEAPVLIVQN
ITCNEGEMAP LTLVVDDMDS PHDHVMVVVT EAPTHGMVMD EMDMLSRYRR SGSHRSVHMH
AMEEFSMQQL NEHRVIPAYF HDGSESVQDQ LELKVTDGMH VYKTHLMVNI IQVNDETPEI
VHNEGITLEL GSDKVISSVA LQARDGDTIS SELLYELHSI PRRGLLQIKQ QSDIVWNDLE
LGETFTEKDV EMNRIRYHHS SVLGSKGQDS FRFSVTDGEF TTPRVNFAIT IEHTKKSAIH
VTTHPLRVNE RGQGYISGDV LLARDDAQRP EEITFDVIRS PAYGRIEYIN FPGMEIEMFT
QLDIMARNVI YIHTSKADTA VDTFRILASN GIKTKEAEIN ILITSVDEEL PVLTLPQPSS
MGHANLMLYS GSSVTITNMI INITDEDTSK NNVRLIIMER PQHGSMRLDG VETTVVTLGD
LERGGFAYHH NGDGASIDRF TFTVSDGVHA GYFYQGGIRR QEATAFNLKI ESLDETPPYV
ALNIEPTMIQ PLGNHQTNGI YIDSNFLRTQ DSGTLNNEDL IISIITPPSY GQLRLVHGNI
GEESVMQFSQ SDLDHRNIIY VVAARMRVDN DSFTFSVQDA WSNTLSESRF SMSWSHIKIS
QRKIKVCEDV GFIEVQVSRS GNLTRSAFVG VMVQPRNAKP GVDYVPSSAT QIQFDPGVAH
QTWRIEIMND QLEERSEKFL VKLHTPVNSV LNPKKQKMLV VIKDKSHPSC SGGSLAKQSK
KLNSKKLKSK KRKNKTKKKK SKAKKRKKGQ SRNIAADQSS TSSGRGGQLL TQIRYGKHGP
GVTVSPASFF RNETHRIWRY HGLLPVTVDE TEEDIFSSSS NVLDVVPTAQ KRKLKIIDRL
EVNDDPTASR VSLVTTISQP CNLSARGRLH YDAGRSTLFQ CDGSQWLAWR ARERPVIQPE
PTTPANHCED GWCYYVSSAE EISWNSAQRS CREIHGAHLV SLGSRKQMNW LWKLSRKTPF
FIGLNSKLNH QEWEWMDGSE VSFMNWKRRF PRPDGGRCVV VVRRQWQDHS CSDLPTRQTK
YICARDP
//