ID E4Y1L8_OIKDI Unreviewed; 2319 AA.
AC E4Y1L8;
DT 08-FEB-2011, integrated into UniProtKB/TrEMBL.
DT 08-FEB-2011, sequence version 1.
DT 27-MAR-2024, entry version 44.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBY15762.1};
DE Flags: Fragment;
GN ORFNames=GSOID_T00014076001 {ECO:0000313|EMBL:CBY15762.1};
OS Oikopleura dioica (Tunicate).
OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; Copelata;
OC Oikopleuridae; Oikopleura.
OX NCBI_TaxID=34765 {ECO:0000313|EMBL:CBY15762.1};
RN [1] {ECO:0000313|EMBL:CBY15762.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21097902; DOI=10.1126/science.1194167;
RA Denoeud F., Henriet S., Mungpakdee S., Aury J.M., Da Silva C.,
RA Brinkmann H., Mikhaleva J., Olsen L.C., Jubin C., Canestro C.,
RA Bouquet J.M., Danks G., Poulain J., Campsteijn C., Adamski M., Cross I.,
RA Yadetie F., Muffato M., Louis A., Butcher S., Tsagkogeorga G., Konrad A.,
RA Singh S., Jensen M.F., Cong E.H., Eikeseth-Otteraa H., Noel B.,
RA Anthouard V., Porcel B.M., Kachouri-Lafond R., Nishino A., Ugolini M.,
RA Chourrout P., Nishida H., Aasland R., Huzurbazar S., Westhof E., Delsuc F.,
RA Lehrach H., Reinhardt R., Weissenbach J., Roy S.W., Artiguenave F.,
RA Postlethwait J.H., Manak J.R., Thompson E.M., Jaillon O., Du Pasquier L.,
RA Boudinot P., Liberles D.A., Volff J.N., Philippe H., Lenhard B.,
RA Roest Crollius H., Wincker P., Chourrout D.;
RT "Plasticity of animal genome architecture unmasked by rapid evolution of a
RT pelagic tunicate.";
RL Science 330:1381-1385(2010).
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00500}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FN653652; CBY15762.1; -; Genomic_DNA.
DR InParanoid; E4Y1L8; -.
DR Proteomes; UP000001307; Unassembled WGS sequence.
DR CDD; cd19941; TIL; 4.
DR CDD; cd00191; TY; 1.
DR Gene3D; 2.10.25.10; Laminin; 4.
DR Gene3D; 2.60.40.780; von Hippel-Lindau disease tumour suppressor, beta domain; 1.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR000436; Sushi_SCR_CCP_dom.
DR InterPro; IPR000716; Thyroglobulin_1.
DR InterPro; IPR036857; Thyroglobulin_1_sf.
DR InterPro; IPR002919; TIL_dom.
DR InterPro; IPR037140; VHL_beta_dom_sf.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001007; VWF_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF398; MUCIN-2-LIKE-RELATED; 1.
DR Pfam; PF08742; C8; 3.
DR Pfam; PF01826; TIL; 4.
DR Pfam; PF00094; VWD; 4.
DR SMART; SM00832; C8; 3.
DR SMART; SM00032; CCP; 3.
DR SMART; SM00211; TY; 1.
DR SMART; SM00214; VWC; 4.
DR SMART; SM00215; VWC_out; 3.
DR SMART; SM00216; VWD; 4.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR SUPFAM; SSF57567; Serine protease inhibitors; 4.
DR SUPFAM; SSF57610; Thyroglobulin type-1 domain; 1.
DR PROSITE; PS51162; THYROGLOBULIN_1_2; 1.
DR PROSITE; PS50184; VWFC_2; 2.
DR PROSITE; PS51233; VWFD; 4.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Reference proteome {ECO:0000313|Proteomes:UP000001307};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..2319
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003191894"
FT DOMAIN 107..169
FT /note="Thyroglobulin type-1"
FT /evidence="ECO:0000259|PROSITE:PS51162"
FT DOMAIN 189..368
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 631..806
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 989..1158
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 1440..1609
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 1926..1994
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 2129..2196
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT REGION 2278..2319
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2291..2305
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 2319
FT /evidence="ECO:0000313|EMBL:CBY15762.1"
SQ SEQUENCE 2319 AA; 257309 MW; 40E9EE1DF0345FAB CRC64;
MGMLNPAKVL SLWTILAPGV TPAEKSVYSE GFDSAELFTE KNLCRNLLAP INGEINCYVS
DYDTVCTAHC NRGFIFSDAL EIEKTFSCAN DLGIWDPVSE VPQCVRITQC LRAKMAAGEG
EQSPQCDAQG DYESKQVDAR SGESWCVDTQ GKEIANTRRD SLDAEEADCS QFAKIPSRPM
IPTHMDLDGA CVHWDDVWYK TFDGQIYGYG VEGPLILLKS RDGQFSITML PLQNFGSDSH
AKRWIEFRKG NKLFTLRFLD GKFVIASEGQ ILSHFDEPLE LKGGFIVTFD GEWVTVTNPE
SYGKSVFMFC PLSGSHFIKT TKSFSDTEGL CGSFDNDVEN DFLAKNGNIC SESSEFAETW
QMTCPVRPYW KTYETTGATK RSVEFINNFE EAIKLYVVTM DGSWEKMAEI DQFASYEAKK
TYSTVAWIAT TKSNELLKLN NYCVFYALET EDLQHVSITT AKPEEGFAVH YEKKCNAIAP
VNGHVECKTF GGLKMCAVNC DNGYQHGHTS LYICNLETGT WAPELPYNQL IFPACHDGAR
RCTDLEAPIG GTITCSINDE GLQICVPSCP EGFEIKPSPP MSGYYTCNDG RWGTGSVPPA
CIARANAVDP REIGQVSGVF TKGDVPADQT GYCMTWGQHH YRTFDGKIYR FRGQCSYVLA
QDKLTGSFSI HIKNDGECDG TGTCRRSLTI FMNDASVDIK LDEDGQPTVY TASGEAVPLP
TMIYDLSIQR ISDYIVVREA INGWYLKWDG AESVFLRMDE GMMGQTSGLC GVFDRDQSND
FETRSGTLVK TVSLFANSWK IDQNCLDATE EGYCSIKDEV SARKATEAIM LCSYLLDTEC
KDTVDPRPYF ESCKEDVCFS SGNNTEEWIC NSMSAYFREC ARHDVHVDWR CPARCEVQCP
VGQIFKTCGS SCPQTCWAQQ YNCDDDHCID GCHCPAGTYL HDGQCLQRGE CPCKFGHDEY
APRSRIQRDC NQCVCVNGGW KCTTNECDGV CTAMGSHYTT FDGIHYDFEG DCTYVLSRGF
GDFTWEVLIQ NHDCSGSVCG RSVILRVNGE DVKLIAGGAL HATDSVTSLP YHGEYFVIEK
VTTMFHKVTL DNGLTIFWDK TDRIYLRAPA SIKGSVSGLC GNFNDAQSDE FRTPENDIEA
NAVEFATKWQ ATTCDNAPQR VPMNACAQFA NRRGHAQKIC ASILSGQFVQ CHEIVDYEAY
YGLCMQDYCK TGEEKSACTI LSDYSSACAK NGIIVDGWRQ NCTECNVECP HGMVFAECAK
SCGRSCGSLS YADDCTEGCV QGCTCPEGSV MDFDGDCIPV GECPCYYEQK KYLAGESRMQ
DCNICECHKG QWFCTDNECE IEETCGVNAE LNSCVPIDPL TCSNMHLNRI PLKVEHCRTG
CVCMHGFVLD ESTGDCVRPT ECPCHHGGKS YKEGEEMKKD CNHCQCVSGS WECTDLVCPG
HCRAYGDSHI TTFDSRNYQF QGACEYTYVE TVPSAHASFR VTARNEQCGS QGTVCTKSIT
ITLNPENTME RSSLRLVRGK PITLEKGSGF DVRYAGLWVF ITTDVGLTVQ WDKGTQMIIK
LDPMFKGKVQ GLCGDYDDRV IDDLVSRGGI STANVLSFGD SWRVDSTCPP SKHVNDICEA
NPHRKIWAVK KCSIINSDTF AACHAHVDPS YYYDNCVFDS CACDSGGDCE CLCTAIGAYA
QECNRHGVHI YWRNQQLCPV QCDGCRSYDP CVSACPKSCD NYHNWAQIET TCPDTCVEGC
TCDDEHVMSD DGKHCLPVDE CICMEINGVV YGHGEKIDVM SDDCRSCFCM EGHAQCLGLP
CGSAWTAPTV TTICPPDVHY EGMETKPFTT KTVEECPEGK KQECSFTCNM VCHAWRSVLV
QCVHDTNTCV DICENDKPMQ CASGYVLNDL NTCVKMQDCP CMLPCGQTLA PGSLYDDEDE
CKRYFCWQND LKDGNERQFG EYWNSENCHY CMCYQDGSVR CRQLECDSLG ECEPGAIRDI
KMSTDGCCEV AECRAESCKG TECEFEVPCC EYYEDIITVQ VSECCCRFEC ECNESKCETF
NDCYCEEGFT LQMQKQADSC CSVPTCIPDY ASTTTTTSTT TTSTTTTTST TTTDNTHTWT
IPDIKIDTTT TTTTKVMPTY THVTPTCPPL CFAVDGTEYE YGEQWTVSAC ETCCCRSDGT
ISCTKSSCQD IMVTCDANTH TKQTADLGCC KQEKCIPREC TDEQPTPPTC DACQECVATP
TKNKCIPYIY TCVCKECPPV GELTCKAPYV AVEYDMDQCC KATRCCHGTT EPCTTETVTV
TPPTHTPPTP GTKTTHPTMT WTHPVRHGDT TKPIFDHTT
//