GenomeNet

Database: UniProt
Entry: E4Y1L8_OIKDI
LinkDB: E4Y1L8_OIKDI
Original site: E4Y1L8_OIKDI 
ID   E4Y1L8_OIKDI            Unreviewed;      2319 AA.
AC   E4Y1L8;
DT   08-FEB-2011, integrated into UniProtKB/TrEMBL.
DT   08-FEB-2011, sequence version 1.
DT   27-MAR-2024, entry version 44.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBY15762.1};
DE   Flags: Fragment;
GN   ORFNames=GSOID_T00014076001 {ECO:0000313|EMBL:CBY15762.1};
OS   Oikopleura dioica (Tunicate).
OC   Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; Copelata;
OC   Oikopleuridae; Oikopleura.
OX   NCBI_TaxID=34765 {ECO:0000313|EMBL:CBY15762.1};
RN   [1] {ECO:0000313|EMBL:CBY15762.1}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=21097902; DOI=10.1126/science.1194167;
RA   Denoeud F., Henriet S., Mungpakdee S., Aury J.M., Da Silva C.,
RA   Brinkmann H., Mikhaleva J., Olsen L.C., Jubin C., Canestro C.,
RA   Bouquet J.M., Danks G., Poulain J., Campsteijn C., Adamski M., Cross I.,
RA   Yadetie F., Muffato M., Louis A., Butcher S., Tsagkogeorga G., Konrad A.,
RA   Singh S., Jensen M.F., Cong E.H., Eikeseth-Otteraa H., Noel B.,
RA   Anthouard V., Porcel B.M., Kachouri-Lafond R., Nishino A., Ugolini M.,
RA   Chourrout P., Nishida H., Aasland R., Huzurbazar S., Westhof E., Delsuc F.,
RA   Lehrach H., Reinhardt R., Weissenbach J., Roy S.W., Artiguenave F.,
RA   Postlethwait J.H., Manak J.R., Thompson E.M., Jaillon O., Du Pasquier L.,
RA   Boudinot P., Liberles D.A., Volff J.N., Philippe H., Lenhard B.,
RA   Roest Crollius H., Wincker P., Chourrout D.;
RT   "Plasticity of animal genome architecture unmasked by rapid evolution of a
RT   pelagic tunicate.";
RL   Science 330:1381-1385(2010).
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00500}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; FN653652; CBY15762.1; -; Genomic_DNA.
DR   InParanoid; E4Y1L8; -.
DR   Proteomes; UP000001307; Unassembled WGS sequence.
DR   CDD; cd19941; TIL; 4.
DR   CDD; cd00191; TY; 1.
DR   Gene3D; 2.10.25.10; Laminin; 4.
DR   Gene3D; 2.60.40.780; von Hippel-Lindau disease tumour suppressor, beta domain; 1.
DR   InterPro; IPR036084; Ser_inhib-like_sf.
DR   InterPro; IPR000436; Sushi_SCR_CCP_dom.
DR   InterPro; IPR000716; Thyroglobulin_1.
DR   InterPro; IPR036857; Thyroglobulin_1_sf.
DR   InterPro; IPR002919; TIL_dom.
DR   InterPro; IPR037140; VHL_beta_dom_sf.
DR   InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR   InterPro; IPR001007; VWF_dom.
DR   InterPro; IPR001846; VWF_type-D.
DR   PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR   PANTHER; PTHR11339:SF398; MUCIN-2-LIKE-RELATED; 1.
DR   Pfam; PF08742; C8; 3.
DR   Pfam; PF01826; TIL; 4.
DR   Pfam; PF00094; VWD; 4.
DR   SMART; SM00832; C8; 3.
DR   SMART; SM00032; CCP; 3.
DR   SMART; SM00211; TY; 1.
DR   SMART; SM00214; VWC; 4.
DR   SMART; SM00215; VWC_out; 3.
DR   SMART; SM00216; VWD; 4.
DR   SUPFAM; SSF57603; FnI-like domain; 1.
DR   SUPFAM; SSF57567; Serine protease inhibitors; 4.
DR   SUPFAM; SSF57610; Thyroglobulin type-1 domain; 1.
DR   PROSITE; PS51162; THYROGLOBULIN_1_2; 1.
DR   PROSITE; PS50184; VWFC_2; 2.
DR   PROSITE; PS51233; VWFD; 4.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Reference proteome {ECO:0000313|Proteomes:UP000001307};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..22
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           23..2319
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5003191894"
FT   DOMAIN          107..169
FT                   /note="Thyroglobulin type-1"
FT                   /evidence="ECO:0000259|PROSITE:PS51162"
FT   DOMAIN          189..368
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          631..806
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          989..1158
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          1440..1609
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          1926..1994
FT                   /note="VWFC"
FT                   /evidence="ECO:0000259|PROSITE:PS50184"
FT   DOMAIN          2129..2196
FT                   /note="VWFC"
FT                   /evidence="ECO:0000259|PROSITE:PS50184"
FT   REGION          2278..2319
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2291..2305
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         2319
FT                   /evidence="ECO:0000313|EMBL:CBY15762.1"
SQ   SEQUENCE   2319 AA;  257309 MW;  40E9EE1DF0345FAB CRC64;
     MGMLNPAKVL SLWTILAPGV TPAEKSVYSE GFDSAELFTE KNLCRNLLAP INGEINCYVS
     DYDTVCTAHC NRGFIFSDAL EIEKTFSCAN DLGIWDPVSE VPQCVRITQC LRAKMAAGEG
     EQSPQCDAQG DYESKQVDAR SGESWCVDTQ GKEIANTRRD SLDAEEADCS QFAKIPSRPM
     IPTHMDLDGA CVHWDDVWYK TFDGQIYGYG VEGPLILLKS RDGQFSITML PLQNFGSDSH
     AKRWIEFRKG NKLFTLRFLD GKFVIASEGQ ILSHFDEPLE LKGGFIVTFD GEWVTVTNPE
     SYGKSVFMFC PLSGSHFIKT TKSFSDTEGL CGSFDNDVEN DFLAKNGNIC SESSEFAETW
     QMTCPVRPYW KTYETTGATK RSVEFINNFE EAIKLYVVTM DGSWEKMAEI DQFASYEAKK
     TYSTVAWIAT TKSNELLKLN NYCVFYALET EDLQHVSITT AKPEEGFAVH YEKKCNAIAP
     VNGHVECKTF GGLKMCAVNC DNGYQHGHTS LYICNLETGT WAPELPYNQL IFPACHDGAR
     RCTDLEAPIG GTITCSINDE GLQICVPSCP EGFEIKPSPP MSGYYTCNDG RWGTGSVPPA
     CIARANAVDP REIGQVSGVF TKGDVPADQT GYCMTWGQHH YRTFDGKIYR FRGQCSYVLA
     QDKLTGSFSI HIKNDGECDG TGTCRRSLTI FMNDASVDIK LDEDGQPTVY TASGEAVPLP
     TMIYDLSIQR ISDYIVVREA INGWYLKWDG AESVFLRMDE GMMGQTSGLC GVFDRDQSND
     FETRSGTLVK TVSLFANSWK IDQNCLDATE EGYCSIKDEV SARKATEAIM LCSYLLDTEC
     KDTVDPRPYF ESCKEDVCFS SGNNTEEWIC NSMSAYFREC ARHDVHVDWR CPARCEVQCP
     VGQIFKTCGS SCPQTCWAQQ YNCDDDHCID GCHCPAGTYL HDGQCLQRGE CPCKFGHDEY
     APRSRIQRDC NQCVCVNGGW KCTTNECDGV CTAMGSHYTT FDGIHYDFEG DCTYVLSRGF
     GDFTWEVLIQ NHDCSGSVCG RSVILRVNGE DVKLIAGGAL HATDSVTSLP YHGEYFVIEK
     VTTMFHKVTL DNGLTIFWDK TDRIYLRAPA SIKGSVSGLC GNFNDAQSDE FRTPENDIEA
     NAVEFATKWQ ATTCDNAPQR VPMNACAQFA NRRGHAQKIC ASILSGQFVQ CHEIVDYEAY
     YGLCMQDYCK TGEEKSACTI LSDYSSACAK NGIIVDGWRQ NCTECNVECP HGMVFAECAK
     SCGRSCGSLS YADDCTEGCV QGCTCPEGSV MDFDGDCIPV GECPCYYEQK KYLAGESRMQ
     DCNICECHKG QWFCTDNECE IEETCGVNAE LNSCVPIDPL TCSNMHLNRI PLKVEHCRTG
     CVCMHGFVLD ESTGDCVRPT ECPCHHGGKS YKEGEEMKKD CNHCQCVSGS WECTDLVCPG
     HCRAYGDSHI TTFDSRNYQF QGACEYTYVE TVPSAHASFR VTARNEQCGS QGTVCTKSIT
     ITLNPENTME RSSLRLVRGK PITLEKGSGF DVRYAGLWVF ITTDVGLTVQ WDKGTQMIIK
     LDPMFKGKVQ GLCGDYDDRV IDDLVSRGGI STANVLSFGD SWRVDSTCPP SKHVNDICEA
     NPHRKIWAVK KCSIINSDTF AACHAHVDPS YYYDNCVFDS CACDSGGDCE CLCTAIGAYA
     QECNRHGVHI YWRNQQLCPV QCDGCRSYDP CVSACPKSCD NYHNWAQIET TCPDTCVEGC
     TCDDEHVMSD DGKHCLPVDE CICMEINGVV YGHGEKIDVM SDDCRSCFCM EGHAQCLGLP
     CGSAWTAPTV TTICPPDVHY EGMETKPFTT KTVEECPEGK KQECSFTCNM VCHAWRSVLV
     QCVHDTNTCV DICENDKPMQ CASGYVLNDL NTCVKMQDCP CMLPCGQTLA PGSLYDDEDE
     CKRYFCWQND LKDGNERQFG EYWNSENCHY CMCYQDGSVR CRQLECDSLG ECEPGAIRDI
     KMSTDGCCEV AECRAESCKG TECEFEVPCC EYYEDIITVQ VSECCCRFEC ECNESKCETF
     NDCYCEEGFT LQMQKQADSC CSVPTCIPDY ASTTTTTSTT TTSTTTTTST TTTDNTHTWT
     IPDIKIDTTT TTTTKVMPTY THVTPTCPPL CFAVDGTEYE YGEQWTVSAC ETCCCRSDGT
     ISCTKSSCQD IMVTCDANTH TKQTADLGCC KQEKCIPREC TDEQPTPPTC DACQECVATP
     TKNKCIPYIY TCVCKECPPV GELTCKAPYV AVEYDMDQCC KATRCCHGTT EPCTTETVTV
     TPPTHTPPTP GTKTTHPTMT WTHPVRHGDT TKPIFDHTT
//
DBGET integrated database retrieval system