GenomeNet

Database: UniProt
Entry: E4X546_OIKDI
LinkDB: E4X546_OIKDI
Original site: E4X546_OIKDI 
ID   E4X546_OIKDI            Unreviewed;       892 AA.
AC   E4X546;
DT   08-FEB-2011, integrated into UniProtKB/TrEMBL.
DT   08-FEB-2011, sequence version 1.
DT   28-JAN-2026, entry version 53.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBY18415.1};
GN   ORFNames=GSOID_T00002274001 {ECO:0000313|EMBL:CBY18415.1};
OS   Oikopleura dioica (Tunicate).
OC   Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; Copelata;
OC   Oikopleuridae; Oikopleura.
OX   NCBI_TaxID=34765 {ECO:0000313|EMBL:CBY18415.1};
RN   [1] {ECO:0000313|EMBL:CBY18415.1}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=21097902; DOI=10.1126/science.1194167;
RA   Denoeud F., Henriet S., Mungpakdee S., Aury J.M., Da Silva C.,
RA   Brinkmann H., Mikhaleva J., Olsen L.C., Jubin C., Canestro C.,
RA   Bouquet J.M., Danks G., Poulain J., Campsteijn C., Adamski M., Cross I.,
RA   Yadetie F., Muffato M., Louis A., Butcher S., Tsagkogeorga G., Konrad A.,
RA   Singh S., Jensen M.F., Cong E.H., Eikeseth-Otteraa H., Noel B.,
RA   Anthouard V., Porcel B.M., Kachouri-Lafond R., Nishino A., Ugolini M.,
RA   Chourrout P., Nishida H., Aasland R., Huzurbazar S., Westhof E., Delsuc F.,
RA   Lehrach H., Reinhardt R., Weissenbach J., Roy S.W., Artiguenave F.,
RA   Postlethwait J.H., Manak J.R., Thompson E.M., Jaillon O., Du Pasquier L.,
RA   Boudinot P., Liberles D.A., Volff J.N., Philippe H., Lenhard B.,
RA   Roest Crollius H., Wincker P., Chourrout D.;
RT   "Plasticity of animal genome architecture unmasked by rapid evolution of a
RT   pelagic tunicate.";
RL   Science 330:1381-1385(2010).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; FN653025; CBY18415.1; -; Genomic_DNA.
DR   AlphaFoldDB; E4X546; -.
DR   InParanoid; E4X546; -.
DR   OrthoDB; 5983381at2759; -.
DR   Proteomes; UP000001307; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR   Pfam; PF01391; Collagen; 3.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP000001307}.
FT   DOMAIN          575..620
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          720..886
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   REGION          1..20
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          114..242
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          290..317
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          345..569
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          625..707
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        135..153
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        165..177
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        194..213
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        215..224
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        290..308
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        348..363
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        372..381
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        402..413
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        470..484
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        503..514
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        524..534
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        641..668
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        689..703
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   892 AA;  94367 MW;  31360E4B9898E8D7 CRC64;
     MLDSNSNYSR DHSTAAKRPA RTDLNAVAYV DGNENTVQDR RMIDEAQIEI EKSDKEFFDT
     YRFDDYETGD YEVLTIPDIS AFTGLDYDDS LYANETDEYD YYDDYDSGEI IPPTPLTVPE
     FFDADNDAPG LPGKQGERGP EGPRGADGKD GERGPAGADG LPGKDGADGK NGLDGKNGEP
     GQQGPAGERG LDGLDGPAGP AGKDGAQGAQ GIQGERGEKG DQGEPGKNGM HPNEFWDSWR
     DLTDDFPDYP DSFQDRFERG RQRGFSTRPC RTLSAMTILP QALSGLLDLL DPKGNKGDKG
     DMGERGHDGP VGPMGPAGDI GAPGFCTVIP LDANGVDIAA SYRGTREPGV AGPQGPQGPA
     GQRGETGTAG LKGEKGEDGL DGRNGIPGLQ GPPGADGRSG EDGLPGIPGI DGISVKGATG
     PKGERGEAGP PGIAAEIGEL VSIPGPKGEK GASGEPGESI IGPAGRNGDQ GEQGPAGQMG
     MPGRPGRPGE DGQPGSPGPP GPAASVVAGE PVVGLPGAPG KDGLPGRHGM DGKDGQQGPP
     GERGPRGYEG PPGQHGMPGP EGSCTTTECS CSAKYDTVPE MNDKYFDHED GDMAFVHREQ
     ETFIKTSNGW RPILLGPAIP LPCDTDKKSQ NSYKATKPSV ASTRQATTQQ STTRRTTTTT
     KTTTPRTTTT ERIEIKQTQK PTYRPVPQPT TSTTRQTPAR TSAEYAPIQA RRDCSGKRIR
     LVALDVPMSG LTHGVRGLDH RCYNAARSSR LRGTFRGFVS AVVQDINTIV RRSERYDYPI
     CNSKDELLFN SWNELFSDES NKGALYGKSI YSFSGRDILT DRRWPNKSIW TGSTPDGRRN
     PSKYCDGWRS ANSNNVGLSS NLSAGLLNQQ VSKCSERKVI LCVENAASDS GH
//
DBGET integrated database retrieval system