ID E4XGU3_OIKDI Unreviewed; 1259 AA.
AC E4XGU3;
DT 08-FEB-2011, integrated into UniProtKB/TrEMBL.
DT 08-FEB-2011, sequence version 1.
DT 24-JAN-2024, entry version 58.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBY09891.1};
GN ORFNames=GSOID_T00010708001 {ECO:0000313|EMBL:CBY09891.1};
OS Oikopleura dioica (Tunicate).
OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; Copelata;
OC Oikopleuridae; Oikopleura.
OX NCBI_TaxID=34765 {ECO:0000313|EMBL:CBY09891.1};
RN [1] {ECO:0000313|EMBL:CBY09891.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21097902; DOI=10.1126/science.1194167;
RA Denoeud F., Henriet S., Mungpakdee S., Aury J.M., Da Silva C.,
RA Brinkmann H., Mikhaleva J., Olsen L.C., Jubin C., Canestro C.,
RA Bouquet J.M., Danks G., Poulain J., Campsteijn C., Adamski M., Cross I.,
RA Yadetie F., Muffato M., Louis A., Butcher S., Tsagkogeorga G., Konrad A.,
RA Singh S., Jensen M.F., Cong E.H., Eikeseth-Otteraa H., Noel B.,
RA Anthouard V., Porcel B.M., Kachouri-Lafond R., Nishino A., Ugolini M.,
RA Chourrout P., Nishida H., Aasland R., Huzurbazar S., Westhof E., Delsuc F.,
RA Lehrach H., Reinhardt R., Weissenbach J., Roy S.W., Artiguenave F.,
RA Postlethwait J.H., Manak J.R., Thompson E.M., Jaillon O., Du Pasquier L.,
RA Boudinot P., Liberles D.A., Volff J.N., Philippe H., Lenhard B.,
RA Roest Crollius H., Wincker P., Chourrout D.;
RT "Plasticity of animal genome architecture unmasked by rapid evolution of a
RT pelagic tunicate.";
RL Science 330:1381-1385(2010).
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FN653049; CBY09891.1; -; Genomic_DNA.
DR AlphaFoldDB; E4XGU3; -.
DR InParanoid; E4XGU3; -.
DR Proteomes; UP000001307; Unassembled WGS sequence.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR CDD; cd00054; EGF_CA; 3.
DR CDD; cd01450; vWFA_subfamily_ECM; 1.
DR Gene3D; 2.10.25.10; Laminin; 15.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR026823; cEGF.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24039; FIBRILLIN-RELATED; 1.
DR PANTHER; PTHR24039:SF48; FIBULIN-1; 1.
DR Pfam; PF12662; cEGF; 3.
DR Pfam; PF00008; EGF; 1.
DR Pfam; PF07645; EGF_CA; 10.
DR Pfam; PF00092; VWA; 1.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00181; EGF; 13.
DR SMART; SM00179; EGF_CA; 18.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 7.
DR SUPFAM; SSF57184; Growth factor receptor domain; 1.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 2.
DR PROSITE; PS00022; EGF_1; 3.
DR PROSITE; PS01186; EGF_2; 4.
DR PROSITE; PS50026; EGF_3; 7.
DR PROSITE; PS01187; EGF_CA; 7.
DR PROSITE; PS50234; VWFA; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Reference proteome {ECO:0000313|Proteomes:UP000001307};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 136..171
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 173..212
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 213..253
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 425..463
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 464..489
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 966..1002
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1022..1058
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1073..1246
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DISULFID 140..150
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 161..170
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 992..1001
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1026..1036
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1048..1057
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 1259 AA; 140276 MW; 70AA33D24F015F48 CRC64;
MSSETPTYTS IELATKLRSF HDINIFAVAV GERAGMTDEF TGSAERILRV FKENRLVDEL
AQLQQLVCRD IERCSTSSFQ CLKKYNTGGY AGSYITNCHR INCDNGKCLT KPCHPSQFCE
EKGAYDFECS TSSFKNQLTC DDLSCANGLC KMLGQTAHCQ CHAGWTGDTC KEDVDECLIS
SHNCQHECIN TSGGYKCECD EGYLLTVAGQ CQDIDECSKS KHGCHQICRN YPGEYRCECY
PGFSLHSDGY TCSEIDTCKL AGCSDKCVNL PGLAYRCQCS EGFILDNDDH TCIDVNECAL
EPCEYGYECI NTSGSYKCVD IDECTQAHCS DGFSCKSFKG DFKCYDINEC ESSPCNKNEN
CENTVGGFVC SVVDPCAENP CEEGFTCRTN ESGFECLDID ECEKNNICPV GYRCENYRGD
YECVDIDECQ EYSPCAANHK CENTNGAYTC LYVDPFDGSY KCIDIDECAN EPCKNNQECT
NLHGGYRCDD MDECKRFYFS CSDCIDIDEC TEGLSTCTER QACQNTEGSF KCVDLDPCED
IQCDTGFECI DYMHSFECLD INECAIDSPC ELGYSCKNTP GMGFTCVNNV GNFSCVDVDE
CLEIGSCPVG FECFNRHGSY DCLDKNECFD EPCEDGYACL NNEGSYTCVD VNECNVESCP
PGTNCVNTLG SYECTDVDEC EDSPCDKGFY CSNSFGSFDC HDVDECSSLT SPCDEFSKCV
NTPGSFECEK IKEARVGETA LEEVSTCEES KHKCSHTCIL LSTNNITIAP STPIHIAAPN
VYYRCDCPNG FELAQNGHTC VDIDECTIRK GGCSDYCFNS QGGSYSCVCG EGKKLSSDMH
TCTNLFRNSN DSPCMVQCAN GNCIGDVCKC ADGWTGSHCD QEQGPFTCYP CNHGRCLNTE
GGFMCECDKG FQFDSQRKNC VDVNECLSRK RNRCEQNCIN EEGTYSCSCN EDFELSSDGF
RCTPSPKGIC AHNPCHNSGK CLIDEHSFHC ECQKGFTGRL CHQNVQLELP NFVNINTEDN
KEITACKGGC LNGGKCKESK EERNECQCTD RYYGRSCSKL RSVPNVCRDQ AFDLVFLIDG
SIRIGRKNYN MVIDWVKSIV AQLNISQERS RVGLVQFSTK TEIEFDLARF NDKKNLLQGL
EQSKSRFKNM GYNAYAGLRN SIDLFHGENP RFLIMITAGR LYIHPRRQKE VLNAAEHSKV
EIFAVKLGKM SNEAELMGLT KNDRNIYGIR HKSDFRKEGA AILTSVCFKD DWRSKIIED
//