ID C3YSI6_BRAFL Unreviewed; 4551 AA.
AC C3YSI6;
DT 28-JUL-2009, integrated into UniProtKB/TrEMBL.
DT 28-JUL-2009, sequence version 1.
DT 27-MAR-2024, entry version 84.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EEN56687.1};
GN ORFNames=BRAFLDRAFT_67738 {ECO:0000313|EMBL:EEN56687.1};
OS Branchiostoma floridae (Florida lancelet) (Amphioxus).
OC Eukaryota; Metazoa; Chordata; Cephalochordata; Leptocardii; Amphioxiformes;
OC Branchiostomatidae; Branchiostoma.
OX NCBI_TaxID=7739;
RN [1] {ECO:0000313|EMBL:EEN56687.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=S238N-H82 {ECO:0000313|EMBL:EEN56687.1};
RC TISSUE=Testes {ECO:0000313|EMBL:EEN56687.1};
RX PubMed=18563158; DOI=10.1038/nature06967;
RG US DOE Joint Genome Institute (JGI-PGF);
RA Putnam N.H., Butts T., Ferrier D.E.K., Furlong R.F., Hellsten U.,
RA Kawashima T., Robinson-Rechavi M., Shoguchi E., Terry A., Yu J.-K.,
RA Benito-Gutierrez E.L., Dubchak I., Garcia-Fernandez J., Gibson-Brown J.J.,
RA Grigoriev I.V., Horton A.C., de Jong P.J., Jurka J., Kapitonov V.V.,
RA Kohara Y., Kuroki Y., Lindquist E., Lucas S., Osoegawa K., Pennacchio L.A.,
RA Salamov A.A., Satou Y., Sauka-Spengler T., Schmutz J., Shin-I T.,
RA Toyoda A., Bronner-Fraser M., Fujiyama A., Holland L.Z., Holland P.W.H.,
RA Satoh N., Rokhsar D.S.;
RT "The amphioxus genome and the evolution of the chordate karyotype.";
RL Nature 453:1064-1071(2008).
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC matrix {ECO:0000256|ARBA:ARBA00004498}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GG666549; EEN56687.1; -; Genomic_DNA.
DR RefSeq; XP_002600675.1; XM_002600629.1.
DR eggNOG; KOG1217; Eukaryota.
DR InParanoid; C3YSI6; -.
DR GO; GO:0071944; C:cell periphery; IEA:UniProt.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0007160; P:cell-matrix adhesion; IEA:InterPro.
DR CDD; cd00053; EGF; 1.
DR CDD; cd00054; EGF_CA; 13.
DR Gene3D; 2.10.25.10; Laminin; 31.
DR Gene3D; 3.30.70.960; SEA domain; 1.
DR Gene3D; 2.20.100.10; Thrombospondin type-1 (TSP1) repeat; 1.
DR InterPro; IPR005533; AMOP_dom.
DR InterPro; IPR026823; cEGF.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR013032; EGF-like_CS.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR024731; EGF_dom.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR003886; NIDO_dom.
DR InterPro; IPR000082; SEA_dom.
DR InterPro; IPR036364; SEA_dom_sf.
DR InterPro; IPR000884; TSP1_rpt.
DR InterPro; IPR036383; TSP1_rpt_sf.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR24034; EGF-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24034:SF198; RE68558P; 1.
DR Pfam; PF12662; cEGF; 3.
DR Pfam; PF12947; EGF_3; 5.
DR Pfam; PF07645; EGF_CA; 9.
DR Pfam; PF14670; FXa_inhibition; 1.
DR Pfam; PF12661; hEGF; 2.
DR Pfam; PF06119; NIDO; 1.
DR Pfam; PF01390; SEA; 1.
DR Pfam; PF00090; TSP_1; 1.
DR Pfam; PF00094; VWD; 1.
DR SMART; SM00723; AMOP; 1.
DR SMART; SM00181; EGF; 35.
DR SMART; SM00179; EGF_CA; 31.
DR SMART; SM00539; NIDO; 1.
DR SMART; SM00209; TSP1; 1.
DR SMART; SM00216; VWD; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 4.
DR SUPFAM; SSF57184; Growth factor receptor domain; 8.
DR SUPFAM; SSF82671; SEA domain; 1.
DR SUPFAM; SSF82895; TSP-1 type 1 repeat; 1.
DR PROSITE; PS50856; AMOP; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 13.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS01186; EGF_2; 25.
DR PROSITE; PS50026; EGF_3; 28.
DR PROSITE; PS01187; EGF_CA; 9.
DR PROSITE; PS50024; SEA; 2.
DR PROSITE; PS50092; TSP1; 1.
DR PROSITE; PS51233; VWFD; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Membrane {ECO:0000256|SAM:Phobius}; Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022530};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 4425..4446
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 183..223
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 224..328
FT /note="SEA"
FT /evidence="ECO:0000259|PROSITE:PS50024"
FT DOMAIN 449..489
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 597..637
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 745..785
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1037..1077
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1185..1225
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1333..1373
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1481..1521
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1629..1669
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1880..1921
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2029..2070
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2071..2111
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2215..2255
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2359..2400
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2401..2441
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2557..2598
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2599..2639
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2698..2739
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2749..2790
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2791..2830
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 3107..3244
FT /note="AMOP"
FT /evidence="ECO:0000259|PROSITE:PS50856"
FT DOMAIN 3256..3466
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 3643..3684
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 3780..3824
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 3912..3950
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 3957..3997
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 3998..4038
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 4081..4121
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 4178..4215
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 4259..4299
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 4302..4422
FT /note="SEA"
FT /evidence="ECO:0000259|PROSITE:PS50024"
FT REGION 2662..2684
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2667..2684
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 3674..3683
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 4551 AA; 498446 MW; FA25449579F5A7F4 CRC64;
MALTPVPAFQ GSGETVSIVQ VDIDECAETL DDCHRTRGYC DNFEGGYDCY CISRYPRGNG
RKYCVRDIIV YVSVRAIGYP FLPAYRDKTS NAFIGLRNWL LPMISSILQP IMYQPNIYYG
VHSVDLNEVR DGSFVAIFEV NVTEPALPEL EDLVVNVTKG GRLGNLTIQA NRTTIGDVSL
MVAIEQCFLG THDCHDNATC TDTYFSWDCT CNPGFTGNGT WCDVVESEYI SFKVLGLNLT
LDYENSSSAA YQDLQNQLET LVADEVDGVV AVRVIDIRTP DTGVVLQVDF AESSRDEVRS
NVFNAANDDR LGHLTVDGNA TTWGDISVRS HYISFKILDL DPTKDYENKT SADYLELTEL
LEELVRNITG DILSAELFDM RRPEVIRVFF LIPFRLPDAG VIFQINTSVS DDVEDIKQAI
FMEASDDELG PFTLEGNVTT FGPISLLVAL PECTDGSNNC STNADCVEEY LYFSCVCSNG
FVFNGTDCEA VESNYISFKI LDLDPTKDYE NKTSADYLEL TELLEELVRN ITGDILSVEL
FDVRLPDAGV IFQINVTVSE VEDVQRSVFN EAADDQLGPF TLEGNATTFG PISLLVALPE
CTDGTNNCST NAVCVEEYLY FSCVCSDGFV FNGTDCEAVE SNYISFKILD LDPTNDYENK
TSADYLELTE LLEELVRNIT GDILSVELFD VRLPDAGVIF KINHTASDTE DIKTSIFNEA
EDNLLGPFTV EGNATTFGPI SLLVALPECS DGTNNCSTNA VCVEEYLYFS CVCSEGFAFN
GTDCEAVVPN YISFKVLDLD PIKDYRNKTS ADYLELTALL EELVRNITGD ILSVQLVDVR
LPDAGVIFQI NTTVSDVGEV RESIFNESAD DVLGPFTVDG NATTFGPISL LVALPECTDG
TNNCSTNAVC VEEYLYFSCV CALNESDCEA VESSYISFKI LDLDPTKDYE NKTSADYLEL
TELLEELVRN ITGDILSVRL FDVRLPDAGV IFKINHTAPD TEDIKTSIFN EAEDNLFGPF
TVEGNATTFG PISLLVALPE CSDGTNNCST NAVCVEEYLY FSCVCSDGFA FNGTDCEAVV
PNYISFKVLD LDPTKDYSSK TSADYLELTA LLEDLVRNIT GDILSVQLVD VRIPDTGVIF
QINTTASDVE DVKTSIFDEA TDDELGPFTL DGNATTFGPI SLLVALPECT DGSNNCSTNT
VCVEEYLYFS CLCSDGFAFN GTECEAVESS HISFKILDLD PTKDYENKTS ADYLELTALL
EELVRNITGD ILSVQLVDVR LPDSGVIFQI NSTVSDVDNV KEYIFDEASD DELGPFTLDG
NATTFGPISL LVALPECTDG TNNCSTNADC VEEYLYFSCV CSDGFAFNGT DCEPVESNYI
SFKILDLDPT KDYENKTSAD YLELTELLEE LVRNITGDIL SVELFDVRLP DAGVIFQINT
TISDTESVKE AIFEEATDDK LGGFTVEGNA TTFGPISLLV ALPECSDGSN NCSTNAVCVE
KYLYFSCVCS DGFAYNGTDC EAVESSYISF KILNLDPTKD YENKTSLDYL ELTDILEELV
RNITGDILAV DLIDVRLPDA GVIFQLNTTR SDTDSVESAI FDEAADDRLG EFVLEGNATT
FGPVSLLVAL PECSDGTNNC STNADCVEEY LYFSCVCSDG FVFNGTDCEA VESNYISFKV
LDLDPNKDYE NKTSPDYLDL QDILEELVAN ITGDILSVEL FDVRLPDAGV IFQVNTTVSD
VDDVKQAIFD ESSDDRLGQF TLEGNDTTFG PISVESNYIS FKILDLDPTK DYENKTSADY
LELTELLEEL VRNITGDILS VELFDVRLPD AGVIFQVNTT VSDAAEVEQK IFEEASDDRL
GQFTLEGNAT TSGDISLLVA LPDCASNSTH NCSSDANCME EYLYFSCECK DGYIGNGTYC
EAVVSNYISF KILDLDPTKH YENKTSADYL ELTELLEELV RNITGDILSV ELFDVRLPDA
GVIFQINTTA ADTDSVKQDI FEETAEDTLG DFTVQGNATT FGPISLLVAL PECDDASTNN
CSSGADCQEE YLYFSCACKE GFTGNGTTCE DNDECSNGAA DCDSNAVCTN IPGSYTCRCD
SGFHGNGTFC KEVTNQYIAF RITDLDPTID YNNTSSPDYQ VLKNLLEELV GNISANIIGV
ELMDIRLPDA GVIFEINITD SYVNQVKSAV FAEGDDGTLG MFAVADNDTT FGDIYVDECA
LELDNCDDNA ICDNTPGSFQ CQCEDGYLGN GTVCKRIVTT YASFKMPNLQ PVQDYKDENS
TAYQALEDFL VSLVGNMSDE VTGVKLLDVR FPEGGVVLEL RVTDNELADV YDAIVAEGLD
GMVGPYTVEG NATTVGNISL PECNNTITND CDPEATCVEE YLFYTCVCNH GYTGDGQQCT
DIDECQLNLD NCHADADCTN LPGSFRCDCK DGFYGNGTHC EAIVANYASF KLLDLDPNIK
YSDPTSPGYQ QLKEQIQDLV RNISDAVVSA EVLEVRLPDG AVVLQVNMTV TQVDSVRRDI
LDTTTDGTLG DLPVDGNLTT FGDICNHSIF NFPLLVALPD CSSDTTNNCS SNATCKEEYF
YFSCQCKDGF TGDGVSCQDI DECSLGLDDC HSNADCTNTV GSYTCTCSSG FTGNGTYCAD
GGWSLWSGWS NCSASCGVGT QERTRRCDSP PPQHGGRDCR GPDRQERPCF SGQCPPGFID
WCSDAEDRCG QVSHGGVCTL AGAGYTCSCQ DGWQEVRSTY RTMFLRCEDV DECTTGQHDC
NTTFSRCVNN LGNYTCACLP GFTQVEGTCE DINECTARGP RRCDRNAACT NLYGSFTCVC
NDGYISRVQD GTGFPGQCKE KRLFPYGEAE GDLLLYNDSA ATGEMVSPII GVQHGIPLRN
GKLCNSIYVT ENGVLALNDR IFEHETGDKE TYRNPEPLND IFESNRTETC AVLAAFWANN
RFTELEPGRN PKVWYHVYQH SNEPMFDKVD RAILSNFTTL PNYHSEFILV ATWQDMAPPW
RADASQFNTF QAVLATDHHH TFVLYRYEDA EMTWVPVYDE EHVQHHRYPA RIGYVIRLPL
YDVEDPNSGE WSRHAGEANA YRMERKVSPT TGRLGRVIYQ LDDNDNSYVN PRRACQDWYE
AEPDPQSFAA DVIGTCPGWG GQAREERGRW EKITSDQETS SQCYQKIFPL RSGGNQECCY
GDKSQLIDTL NRVHRGTTGF LHRYPKGNPN DQTTNHYKFD ILPRRWCCEE SGSDEYCKRY
AAKRPVGTSA RYRAIRIANA FGDPHFTTLD GKGFTFNGYG EYVLLMSTGD ASHEFMLQAR
TAVAVSEGAQ VSATVFSAVA VKQPTNDVQV YLSGDDTVRV IVDSTAVPLA SIADSSYNQG
VFPDGRLLAV YDDTDPTRVT GVLVTYTSGI SVRVTAVNGL LTFVVGMIAE MEGKLTGLLG
NSNGDTNDDF IQPDGSVITS ATPSNPSERE LFTYGKTWSL KNVAQHLNQD INSFSLFTSY
PSDSNNNSPD SYGDENFVPM FFDAGQIFTD ETLRQEAEDT CGAGSTECLY DIAVTGRLAV
GNDTLQSDNT FLVSKEVLNN LPPVLVVPSE LRVEAHMTYT LDISATDEGS EVTLSLEEGS
EGSLVRTGPK TARFTWTPEH IHEVFIEFVA EDEQGARTEF IPEVTVCDCH NGGTCDYNNT
IERVNGFAAA VCQCLPGFGG EHCEIDIDAC EGNPCFPGVQ CYDAAAPLQP GQRAYTCEQC
PDGMVGNGET CEDINECLLA SSDPGIHSCV NADCVNTPGS FSCVCHPGYQ RDGDGHHCVD
INECRNKDHN RCDPHHGVCV NEDGGYRCMC QPGHTTNDNG TTCTELDECA AGNSCEQICT
DRVGFYECGC NNLFTLNADG RTCRPLVNCS SPSPCHPSPV GTCAGNITNG TRQEVCGCVT
GYALQPDNSC QDIDECAMGI DQCDEHGDCV NTVGDYNCSC HAGYQLTNNH GKRHCDDVNE
CAVDDGGCSD ICENTDGSYN CACPDGYNLG ADRKTCEDLN ECVLGTDDCS TQAECNNTIG
GFTCHCRDGF EGDGTACGDI DECVQSDTVA CQVGCSNTLG GYLCTCGTGF FLTSDQVSCQ
DLDECARNTD NCQQRCTNTV GSFVCACNPG YVLLSDRRTC QLQPTMSTVT VSSAVTTSLS
TDRASTITTS ATQPGVTSVF ISPAVSRSTG VPPVVLPDEP ECGTDCHVQA RFLTIGNREQ
CVCNPGWKGN GTYCELDIDE CVQSDTVACQ VGCSNTLGGY LCTCGTGFFL TSDQVSCQDL
DECARNTDNC QQRCTNTVGS FMCACDPGYV LQSDRRTCQL QPTTANGFNL RLRMKNISFV
DELNNCSSQE FQDLWPIVFR RLDSYYRYHQ NSRISDNYLY CNLQLFLNGS VIAYHTATFK
EDSLLTRDDV AAALSQAISA DTANTLGFDP ASPGVQDDTG LLTRIFGVLG AFLLLLLMVC
LCRYWFLVQK RSSKTRYYVP RDSVLGVDSD TSSTTDDMVQ RVGPTIFVGR YLPRRGFTNE
LRRSPSATNP DAHDERLYNE IFGKNDTFKI QRPKWSYLPI SGPASTMGSL M
//