ID E4X546_OIKDI Unreviewed; 892 AA.
AC E4X546;
DT 08-FEB-2011, integrated into UniProtKB/TrEMBL.
DT 08-FEB-2011, sequence version 1.
DT 28-JAN-2026, entry version 53.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:CBY18415.1};
GN ORFNames=GSOID_T00002274001 {ECO:0000313|EMBL:CBY18415.1};
OS Oikopleura dioica (Tunicate).
OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; Copelata;
OC Oikopleuridae; Oikopleura.
OX NCBI_TaxID=34765 {ECO:0000313|EMBL:CBY18415.1};
RN [1] {ECO:0000313|EMBL:CBY18415.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21097902; DOI=10.1126/science.1194167;
RA Denoeud F., Henriet S., Mungpakdee S., Aury J.M., Da Silva C.,
RA Brinkmann H., Mikhaleva J., Olsen L.C., Jubin C., Canestro C.,
RA Bouquet J.M., Danks G., Poulain J., Campsteijn C., Adamski M., Cross I.,
RA Yadetie F., Muffato M., Louis A., Butcher S., Tsagkogeorga G., Konrad A.,
RA Singh S., Jensen M.F., Cong E.H., Eikeseth-Otteraa H., Noel B.,
RA Anthouard V., Porcel B.M., Kachouri-Lafond R., Nishino A., Ugolini M.,
RA Chourrout P., Nishida H., Aasland R., Huzurbazar S., Westhof E., Delsuc F.,
RA Lehrach H., Reinhardt R., Weissenbach J., Roy S.W., Artiguenave F.,
RA Postlethwait J.H., Manak J.R., Thompson E.M., Jaillon O., Du Pasquier L.,
RA Boudinot P., Liberles D.A., Volff J.N., Philippe H., Lenhard B.,
RA Roest Crollius H., Wincker P., Chourrout D.;
RT "Plasticity of animal genome architecture unmasked by rapid evolution of a
RT pelagic tunicate.";
RL Science 330:1381-1385(2010).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FN653025; CBY18415.1; -; Genomic_DNA.
DR AlphaFoldDB; E4X546; -.
DR InParanoid; E4X546; -.
DR OrthoDB; 5983381at2759; -.
DR Proteomes; UP000001307; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR Pfam; PF01391; Collagen; 3.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000001307}.
FT DOMAIN 575..620
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 720..886
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 1..20
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 114..242
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 290..317
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 345..569
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 625..707
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 135..153
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 165..177
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 194..213
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 215..224
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 290..308
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 348..363
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 372..381
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 402..413
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 470..484
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 503..514
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 524..534
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 641..668
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 689..703
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 892 AA; 94367 MW; 31360E4B9898E8D7 CRC64;
MLDSNSNYSR DHSTAAKRPA RTDLNAVAYV DGNENTVQDR RMIDEAQIEI EKSDKEFFDT
YRFDDYETGD YEVLTIPDIS AFTGLDYDDS LYANETDEYD YYDDYDSGEI IPPTPLTVPE
FFDADNDAPG LPGKQGERGP EGPRGADGKD GERGPAGADG LPGKDGADGK NGLDGKNGEP
GQQGPAGERG LDGLDGPAGP AGKDGAQGAQ GIQGERGEKG DQGEPGKNGM HPNEFWDSWR
DLTDDFPDYP DSFQDRFERG RQRGFSTRPC RTLSAMTILP QALSGLLDLL DPKGNKGDKG
DMGERGHDGP VGPMGPAGDI GAPGFCTVIP LDANGVDIAA SYRGTREPGV AGPQGPQGPA
GQRGETGTAG LKGEKGEDGL DGRNGIPGLQ GPPGADGRSG EDGLPGIPGI DGISVKGATG
PKGERGEAGP PGIAAEIGEL VSIPGPKGEK GASGEPGESI IGPAGRNGDQ GEQGPAGQMG
MPGRPGRPGE DGQPGSPGPP GPAASVVAGE PVVGLPGAPG KDGLPGRHGM DGKDGQQGPP
GERGPRGYEG PPGQHGMPGP EGSCTTTECS CSAKYDTVPE MNDKYFDHED GDMAFVHREQ
ETFIKTSNGW RPILLGPAIP LPCDTDKKSQ NSYKATKPSV ASTRQATTQQ STTRRTTTTT
KTTTPRTTTT ERIEIKQTQK PTYRPVPQPT TSTTRQTPAR TSAEYAPIQA RRDCSGKRIR
LVALDVPMSG LTHGVRGLDH RCYNAARSSR LRGTFRGFVS AVVQDINTIV RRSERYDYPI
CNSKDELLFN SWNELFSDES NKGALYGKSI YSFSGRDILT DRRWPNKSIW TGSTPDGRRN
PSKYCDGWRS ANSNNVGLSS NLSAGLLNQQ VSKCSERKVI LCVENAASDS GH
//