GenomeNet

Database: UniProt
Entry: E4XZ52_OIKDI
ID   E4XZ52_OIKDI            Unreviewed;       721 AA.
AC   E4XZ52;
DT   08-FEB-2011, integrated into UniProtKB/TrEMBL.
DT   08-FEB-2011, sequence version 1.
DT   24-JAN-2024, entry version 26.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBY14914.1};
GN   ORFNames=GSOID_T00010008001 {ECO:0000313|EMBL:CBY14914.1};
OS   Oikopleura dioica (Tunicate).
OC   Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; Copelata;
OC   Oikopleuridae; Oikopleura.
OX   NCBI_TaxID=34765 {ECO:0000313|EMBL:CBY14914.1};
RN   [1] {ECO:0000313|EMBL:CBY14914.1}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=21097902; DOI=10.1126/science.1194167;
RA   Denoeud F., Henriet S., Mungpakdee S., Aury J.M., Da Silva C.,
RA   Brinkmann H., Mikhaleva J., Olsen L.C., Jubin C., Canestro C.,
RA   Bouquet J.M., Danks G., Poulain J., Campsteijn C., Adamski M., Cross I.,
RA   Yadetie F., Muffato M., Louis A., Butcher S., Tsagkogeorga G., Konrad A.,
RA   Singh S., Jensen M.F., Cong E.H., Eikeseth-Otteraa H., Noel B.,
RA   Anthouard V., Porcel B.M., Kachouri-Lafond R., Nishino A., Ugolini M.,
RA   Chourrout P., Nishida H., Aasland R., Huzurbazar S., Westhof E., Delsuc F.,
RA   Lehrach H., Reinhardt R., Weissenbach J., Roy S.W., Artiguenave F.,
RA   Postlethwait J.H., Manak J.R., Thompson E.M., Jaillon O., Du Pasquier L.,
RA   Boudinot P., Liberles D.A., Volff J.N., Philippe H., Lenhard B.,
RA   Roest Crollius H., Wincker P., Chourrout D.;
RT   "Plasticity of animal genome architecture unmasked by rapid evolution of a
RT   pelagic tunicate.";
RL   Science 330:1381-1385(2010).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; FN653375; CBY14914.1; -; Genomic_DNA.
DR   AlphaFoldDB; E4XZ52; -.
DR   InParanoid; E4XZ52; -.
DR   Proteomes; UP000001307; Unassembled WGS sequence.
DR   InterPro; IPR008160; Collagen.
DR   PANTHER; PTHR37456:SF3; COLLAGEN ALPHA-1(XXV) CHAIN; 1.
DR   PANTHER; PTHR37456; SI:CH211-266K2.1; 1.
DR   Pfam; PF01391; Collagen; 5.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000001307}.
FT   REGION          72..476
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          574..596
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          701..721
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        335..356
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        442..456
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        574..595
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        702..721
FT                   /note="Basic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   721 AA;  73109 MW;  78E7BA8AA1DB41E0 CRC64;
     MDLFGLSNPE SLKITRKVLF NRLEILSEAQ AEDSGICCTP TNRESVATCI IEGNFALQAS
     CSQPNLPPPD VKCERGAPGA AGEKGAPGIR GNAGSIGPKG ETGPAGKNGP KGEHGPPGID
     GKPGFAGDPG NAGQTGEKGE KGNSGQKGNV GEKGDVGEIG PQGLLGSKGQ AGIPGLNGTH
     GLDGTKGEPG QKGTDGLPGI LGSIGVKGEP GEPGQKGDDG ARGAPGARGI PGLSVKGDKG
     ESGEQGLPGI IGDNGPMGFK GEQGPKGPAG EGKPGNPGPN GNPGGRGEPG MTGPQGFRGE
     GGKSGKPGIA GIKGPQGDNG PRGLAGLPGE RGLPGPTGPI GPPGPLGPPG PPGERGGSGP
     DGPRGATGPQ GPERSDRPAG RDGEPGEPGP PGPRGHQGTS GMFGQPGKDG LPGEKGDRGS
     SGEDIIGAPG APGRQGPIGP PANCAPGEPG EPGPPGRPGQ IGRDGPTGIR GQQGVCKADE
     CKAPDEWAAN VNNYAKLFQT TPAPWNKGPP PSRNDNAPAF KPDTVMRLPA KPEDSLLGPG
     AGPTGERHTV NINKVPRPAI NPSGEQVWIP TTASTTTTTT TTRRTTRQTT PRARNRPCYS
     CEARKKAQER ITTLAPKTVK PRPATVEPAL IAPVVSKCKL NERDENSPCK PSPCKSTWTV
     CKARVACKNA AAKKKQKCVV RRIQIIRNKR HQELIRAKRQ KIRDSIKSKR SAKRNSRIIK
     S
//
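
The FT REGION lines above give 1-based, inclusive residue ranges, so the extent of predicted disorder can be tallied directly from the record. The following is a minimal sketch, not part of the UniProt entry: the helper names `parse_regions` and `covered_residues` are hypothetical, the embedded text is an excerpt of the three REGION features only (evidence lines omitted), and the sum assumes the ranges do not overlap, which holds for this entry.

```python
import re

# Excerpt of the FT REGION features copied from the entry above.
# Coordinates are 1-based and inclusive, as in the UniProtKB flat-file format.
FT_LINES = """\
FT   REGION          72..476
FT                   /note="Disordered"
FT   REGION          574..596
FT                   /note="Disordered"
FT   REGION          701..721
FT                   /note="Disordered"
"""

SEQ_LENGTH = 721  # taken from the ID line ("721 AA")


def parse_regions(text):
    """Return a list of (start, end) tuples, one per FT REGION line."""
    pattern = re.compile(
        r"^FT[ \t]+REGION[ \t]+(\d+)\.\.(\d+)[ \t]*$", re.MULTILINE
    )
    return [(int(start), int(end)) for start, end in pattern.findall(text)]


def covered_residues(regions):
    """Count residues covered by the ranges (assumes no overlaps)."""
    return sum(end - start + 1 for start, end in regions)


regions = parse_regions(FT_LINES)
total = covered_residues(regions)
print(regions)  # [(72, 476), (574, 596), (701, 721)]
print(total)    # 449
print(f"{total / SEQ_LENGTH:.1%} of the sequence is annotated as disordered")
```

Under these assumptions, 449 of the 721 residues fall inside a MobiDB-lite "Disordered" region. For production use, an established parser such as Biopython's `Bio.SwissProt` module is likely a better choice than a hand-written regular expression.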