ID E4XZ52_OIKDI Unreviewed; 721 AA.
AC E4XZ52;
DT 08-FEB-2011, integrated into UniProtKB/TrEMBL.
DT 08-FEB-2011, sequence version 1.
DT 24-JAN-2024, entry version 26.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBY14914.1};
GN ORFNames=GSOID_T00010008001 {ECO:0000313|EMBL:CBY14914.1};
OS Oikopleura dioica (Tunicate).
OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; Copelata;
OC Oikopleuridae; Oikopleura.
OX NCBI_TaxID=34765 {ECO:0000313|EMBL:CBY14914.1};
RN [1] {ECO:0000313|EMBL:CBY14914.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21097902; DOI=10.1126/science.1194167;
RA Denoeud F., Henriet S., Mungpakdee S., Aury J.M., Da Silva C.,
RA Brinkmann H., Mikhaleva J., Olsen L.C., Jubin C., Canestro C.,
RA Bouquet J.M., Danks G., Poulain J., Campsteijn C., Adamski M., Cross I.,
RA Yadetie F., Muffato M., Louis A., Butcher S., Tsagkogeorga G., Konrad A.,
RA Singh S., Jensen M.F., Cong E.H., Eikeseth-Otteraa H., Noel B.,
RA Anthouard V., Porcel B.M., Kachouri-Lafond R., Nishino A., Ugolini M.,
RA Chourrout P., Nishida H., Aasland R., Huzurbazar S., Westhof E., Delsuc F.,
RA Lehrach H., Reinhardt R., Weissenbach J., Roy S.W., Artiguenave F.,
RA Postlethwait J.H., Manak J.R., Thompson E.M., Jaillon O., Du Pasquier L.,
RA Boudinot P., Liberles D.A., Volff J.N., Philippe H., Lenhard B.,
RA Roest Crollius H., Wincker P., Chourrout D.;
RT "Plasticity of animal genome architecture unmasked by rapid evolution of a
RT pelagic tunicate.";
RL Science 330:1381-1385(2010).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FN653375; CBY14914.1; -; Genomic_DNA.
DR AlphaFoldDB; E4XZ52; -.
DR InParanoid; E4XZ52; -.
DR Proteomes; UP000001307; Unassembled WGS sequence.
DR InterPro; IPR008160; Collagen.
DR PANTHER; PTHR37456:SF3; COLLAGEN ALPHA-1(XXV) CHAIN; 1.
DR PANTHER; PTHR37456; SI:CH211-266K2.1; 1.
DR Pfam; PF01391; Collagen; 5.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000001307}.
FT REGION 72..476
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 574..596
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 701..721
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 335..356
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 442..456
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 574..595
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 702..721
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 721 AA; 73109 MW; 78E7BA8AA1DB41E0 CRC64;
MDLFGLSNPE SLKITRKVLF NRLEILSEAQ AEDSGICCTP TNRESVATCI IEGNFALQAS
CSQPNLPPPD VKCERGAPGA AGEKGAPGIR GNAGSIGPKG ETGPAGKNGP KGEHGPPGID
GKPGFAGDPG NAGQTGEKGE KGNSGQKGNV GEKGDVGEIG PQGLLGSKGQ AGIPGLNGTH
GLDGTKGEPG QKGTDGLPGI LGSIGVKGEP GEPGQKGDDG ARGAPGARGI PGLSVKGDKG
ESGEQGLPGI IGDNGPMGFK GEQGPKGPAG EGKPGNPGPN GNPGGRGEPG MTGPQGFRGE
GGKSGKPGIA GIKGPQGDNG PRGLAGLPGE RGLPGPTGPI GPPGPLGPPG PPGERGGSGP
DGPRGATGPQ GPERSDRPAG RDGEPGEPGP PGPRGHQGTS GMFGQPGKDG LPGEKGDRGS
SGEDIIGAPG APGRQGPIGP PANCAPGEPG EPGPPGRPGQ IGRDGPTGIR GQQGVCKADE
CKAPDEWAAN VNNYAKLFQT TPAPWNKGPP PSRNDNAPAF KPDTVMRLPA KPEDSLLGPG
AGPTGERHTV NINKVPRPAI NPSGEQVWIP TTASTTTTTT TTRRTTRQTT PRARNRPCYS
CEARKKAQER ITTLAPKTVK PRPATVEPAL IAPVVSKCKL NERDENSPCK PSPCKSTWTV
CKARVACKNA AAKKKQKCVV RRIQIIRNKR HQELIRAKRQ KIRDSIKSKR SAKRNSRIIK
S
//