ID C3Y9H6_BRAFL Unreviewed; 1897 AA.
AC C3Y9H6;
DT 28-JUL-2009, integrated into UniProtKB/TrEMBL.
DT 28-JUL-2009, sequence version 1.
DT 27-MAR-2024, entry version 71.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EEN63232.1};
GN ORFNames=BRAFLDRAFT_67980 {ECO:0000313|EMBL:EEN63232.1};
OS Branchiostoma floridae (Florida lancelet) (Amphioxus).
OC Eukaryota; Metazoa; Chordata; Cephalochordata; Leptocardii; Amphioxiformes;
OC Branchiostomatidae; Branchiostoma.
OX NCBI_TaxID=7739;
RN [1] {ECO:0000313|EMBL:EEN63232.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=S238N-H82 {ECO:0000313|EMBL:EEN63232.1};
RC TISSUE=Testes {ECO:0000313|EMBL:EEN63232.1};
RX PubMed=18563158; DOI=10.1038/nature06967;
RG US DOE Joint Genome Institute (JGI-PGF);
RA Putnam N.H., Butts T., Ferrier D.E.K., Furlong R.F., Hellsten U.,
RA Kawashima T., Robinson-Rechavi M., Shoguchi E., Terry A., Yu J.-K.,
RA Benito-Gutierrez E.L., Dubchak I., Garcia-Fernandez J., Gibson-Brown J.J.,
RA Grigoriev I.V., Horton A.C., de Jong P.J., Jurka J., Kapitonov V.V.,
RA Kohara Y., Kuroki Y., Lindquist E., Lucas S., Osoegawa K., Pennacchio L.A.,
RA Salamov A.A., Satou Y., Sauka-Spengler T., Schmutz J., Shin-I T.,
RA Toyoda A., Bronner-Fraser M., Fujiyama A., Holland L.Z., Holland P.W.H.,
RA Satoh N., Rokhsar D.S.;
RT "The amphioxus genome and the evolution of the chordate karyotype.";
RL Nature 453:1064-1071(2008).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GG666492; EEN63232.1; -; Genomic_DNA.
DR RefSeq; XP_002607222.1; XM_002607176.1.
DR eggNOG; KOG1216; Eukaryota.
DR eggNOG; KOG3544; Eukaryota.
DR eggNOG; KOG3669; Eukaryota.
DR eggNOG; KOG4157; Eukaryota.
DR InParanoid; C3Y9H6; -.
DR GO; GO:0031012; C:extracellular matrix; IBA:GO_Central.
DR GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR CDD; cd00037; CLECT; 1.
DR CDD; cd19941; TIL; 2.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR006624; Beta-propeller_rpt_TECPR.
DR InterPro; IPR001304; C-type_lectin-like.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR018378; C-type_lectin_CS.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR002919; TIL_dom.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR001846; VWF_type-D.
DR InterPro; IPR036465; vWFA_dom_sf.
DR InterPro; IPR002889; WSC_carb-bd.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF373; HEMOLECTIN, ISOFORM A; 1.
DR Pfam; PF08742; C8; 2.
DR Pfam; PF06462; Hyd_WA; 1.
DR Pfam; PF00059; Lectin_C; 1.
DR Pfam; PF19193; Tectonin; 2.
DR Pfam; PF01826; TIL; 2.
DR Pfam; PF00092; VWA; 1.
DR Pfam; PF00094; VWD; 2.
DR Pfam; PF01822; WSC; 1.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00832; C8; 2.
DR SMART; SM00034; CLECT; 1.
DR SMART; SM00706; TECPR; 10.
DR SMART; SM00327; VWA; 1.
DR SMART; SM00216; VWD; 2.
DR SMART; SM00321; WSC; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF57567; Serine protease inhibitors; 2.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS00615; C_TYPE_LECTIN_1; 1.
DR PROSITE; PS50041; C_TYPE_LECTIN_2; 1.
DR PROSITE; PS50234; VWFA; 1.
DR PROSITE; PS51233; VWFD; 2.
DR PROSITE; PS51212; WSC; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 1767..1786
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 76..170
FT /note="WSC"
FT /evidence="ECO:0000259|PROSITE:PS51212"
FT DOMAIN 183..296
FT /note="C-type lectin"
FT /evidence="ECO:0000259|PROSITE:PS50041"
FT DOMAIN 411..584
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 1033..1206
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 1585..1721
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
SQ SEQUENCE 1897 AA; 211459 MW; B6CEC559D07E16F9 CRC64;
MVLVDIAVVV IVILVVTAMV ATTLAVDIVA MEAIAMEAIV MGIATHRLKR TDKKSVTIFN
NTNKASDLAF FFYISLGGYK GCYVDNRNRV FPHSPTSSNS MTTAICKAHC KRNGYAYAGT
EYAKECFCGT AAQFARLPAP RRASECNKKC KGNNKEICGG TWRISIYWIG GGGGGGTTTR
THYRVYNEAK TYSEAQRRCQ QDGGHLADLK TPAITAVVAR LVDSRSDYWI GLNDINHEGG
WHWSDGVPLS SCSYKNWYPG EPNNLFNEDC GHLWGGNKGL KWNDLSCNAR KYFICQTGDR
QTAGCTGKPP GNRGYTSLGC WKDTWNRAIP TLERTDARLD GFYKARANAI EKCYQVARSR
GFTVFAVQDG GWCAGSANAL NTYRKYGPSR TCAADGEGGP WGNAVYKITA GTCLAWGDPH
YITFDNRRHD FQGTCKYVLV RHADFTVAVR NVHRPGRSQR VAFCDRVEVT VYGYKIQIRS
GGGRDVLVNG YRRSLPVCLN RKVSISISGL NVMIQTDRCF SLTYDGNHRV EIKAPASYKG
KLSGMCGNYN GQPNDDNLMP GGQVASTSLL YGNSWIAPDD DTCPDTRPQD NFDSNDIRPA
DRQRYLHPSK CGLLKAANGP FRSCNSIVSP TEFVETCVFD MAAYRGDQLV LCQNLQAYAD
ACVSAGGKPQ QWRRRGFCAV PCPPHSHYSQ CATPCPRTCA DSGPRPCTKN CVESCVCDNG
YVLSGAHCVP LSSCGCSKDG NLYEKNEVWK SGNEICTCLP TRRIQCERQT GTGWNTVGGS
LSFVSIGFCG VWGVNPTGVV YYRVGTYGNE RVPGTGWLTV TGVGLVQISS GQGIVWGVTA
RYRVYVRIGI TAQRPQGTRW TEIRGRALKS ICVSGYYVWG ATTAGTVYYR TGVTAARQSG
TGWAQVSGPP IRGLSYVSIG HCGVWAVTSS GTIWYRSGTY GGTGSVGTQW VQVTGCSLVS
ISVGYNVVWG VSAIGQVFIR IGITAQRPQG TAWRLVGGSL TQVYVGATSN RVWGCDGGHH
VYIRVGITGG ETGQCKAYGD PHYITFDNRR HDFQGTCKYV LVRHADFTVE ARNVHRSGKS
QRVAFCDHVE VNVHNFEIQL RSGSGKEVLV NGYRRSLPVC LSRKVAISII GKNVQIQTDQ
CLSVLYDGRH SVIVRLPTSY KGKVSGMCGN YNGRPNDDNL MPGGQVAATS LLYGNSWIAP
DDDTCPDTRP QDNFDTTDIS AGDRRLYQRP DKCGLLRLPT GPFRACISVL NPATYFESCV
FDMAAYSGDE DMLCENLEAY SDDCRAAGGN PGRWRTANRC PMPCPAHSQY NPCGSACPLT
CAEPDPRPCI RLCVESCVCD QGYVLSGSTC IPRSSCGCFR DGNYYQKNEM WRSGNQICKC
TNRIVCEAEP SSGWDTCPGS LSFVTVGWSG VWGVNSRGVV FYRVGTYGKE GHFGREWKQI
DGNLVQISSG KGIVWGVDRR NQVYVRIGIT VQKVYGSSWR VVTGRPLKSV CVSSSSNSVW
GICPCGTIWR RTGITTRNLI GITAKVPQGT AWRLVGGSLT QIYVSSSSNR VWGCGLSHHI
YLRVGITWSQ GPVDIPAPTC RSKADIHVLV DGSKSVKTRN FPAVRQFILK LAAGFEIGPN
KARFGVYQFA KDMQTEFKMN QYNNREALLD AIKKIEYMNQ YQTKTGQSLK AVYEEFTKAN
GARDGVEKII ILITDGKATD QVRQPAQYVK NKGAHVFTVG VAKYKISELK HDNSTLGLQL
GSRFRNAPLY YYYYYYYYYY YYYYYYYYYY YCYYFSCCCC CFYYYYCYYY HYCYHYYYHY
NYYFYYYCCY YCYNYHCYYY HHYYYYYYRC CYYYYYYYYI HGKRNIGAEV DNGLQYLKQT
AEALMDDLEG RKNLGLETAH TETGEDAAMQ LRENGLV
//