ID E4X861_OIKDI Unreviewed; 788 AA.
AC E4X861;
DT 08-FEB-2011, integrated into UniProtKB/TrEMBL.
DT 08-FEB-2011, sequence version 1.
DT 27-MAR-2024, entry version 37.
DE RecName: Full=SMB domain-containing protein {ECO:0000259|PROSITE:PS50958};
GN ORFNames=GSOID_T00003754001 {ECO:0000313|EMBL:CBY18884.1};
OS Oikopleura dioica (Tunicate).
OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; Copelata;
OC Oikopleuridae; Oikopleura.
OX NCBI_TaxID=34765 {ECO:0000313|EMBL:CBY18884.1};
RN [1] {ECO:0000313|EMBL:CBY18884.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21097902; DOI=10.1126/science.1194167;
RA Denoeud F., Henriet S., Mungpakdee S., Aury J.M., Da Silva C.,
RA Brinkmann H., Mikhaleva J., Olsen L.C., Jubin C., Canestro C.,
RA Bouquet J.M., Danks G., Poulain J., Campsteijn C., Adamski M., Cross I.,
RA Yadetie F., Muffato M., Louis A., Butcher S., Tsagkogeorga G., Konrad A.,
RA Singh S., Jensen M.F., Cong E.H., Eikeseth-Otteraa H., Noel B.,
RA Anthouard V., Porcel B.M., Kachouri-Lafond R., Nishino A., Ugolini M.,
RA Chourrout P., Nishida H., Aasland R., Huzurbazar S., Westhof E., Delsuc F.,
RA Lehrach H., Reinhardt R., Weissenbach J., Roy S.W., Artiguenave F.,
RA Postlethwait J.H., Manak J.R., Thompson E.M., Jaillon O., Du Pasquier L.,
RA Boudinot P., Liberles D.A., Volff J.N., Philippe H., Lenhard B.,
RA Roest Crollius H., Wincker P., Chourrout D.;
RT "Plasticity of animal genome architecture unmasked by rapid evolution of a
RT pelagic tunicate.";
RL Science 330:1381-1385(2010).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FN653028; CBY18884.1; -; Genomic_DNA.
DR AlphaFoldDB; E4X861; -.
DR InParanoid; E4X861; -.
DR Proteomes; UP000001307; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0008146; F:sulfotransferase activity; IEA:InterPro.
DR Gene3D; 4.10.410.20; -; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR InterPro; IPR037359; NST/OST.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR036024; Somatomedin_B-like_dom_sf.
DR InterPro; IPR001212; Somatomedin_B_dom.
DR InterPro; IPR000863; Sulfotransferase_dom.
DR PANTHER; PTHR10605:SF65; HEPARAN SULFATE GLUCOSAMINE 3-O-SULFOTRANSFERASE 6; 1.
DR PANTHER; PTHR10605; HEPARAN SULFATE SULFOTRANSFERASE; 1.
DR Pfam; PF01033; Somatomedin_B; 1.
DR Pfam; PF00685; Sulfotransfer_1; 1.
DR SMART; SM00201; SO; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR SUPFAM; SSF90188; Somatomedin B domain; 1.
DR PROSITE; PS50958; SMB_2; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157,
KW ECO:0000256|PIRSR:PIRSR637359-3}; Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000001307};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 397..415
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 47..90
FT /note="SMB"
FT /evidence="ECO:0000259|PROSITE:PS50958"
FT REGION 274..314
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 274..291
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 708..722
FT /evidence="ECO:0000256|PIRSR:PIRSR637359-3"
SQ SEQUENCE 788 AA; 89324 MW; 9AECFBD08733A7A2 CRC64;
MERKDCCESD SVTQDCLGLC DGAQTLFWSK QYSKCSSVNR QIYSCWSRTT CVGRCNDKPR
TDHKCQCDLD CMMRGDCCID MPFACKVDLP IKISYWPSQS LSCAKTINFS DYQRAKGVHQ
MENLLIDPKK PSPYVALGLD GFVGKLKTFF NVKIDGAYRF YFMLGSKDFG QVKIDGIKIA
DLICSANPKP MQVDLLLPFG GHKLEIVFAH TGGAHQLKVA YEGPDTAMRP SKIQVLKEAD
VFEMGLPAEL ASTTTTTAVP LTRPGENIVK STTTKASIGW KNPNVPSVQP KTRRPAPTRA
PQTERPLSKS SEVQTETERF DIRIIVGLSL ESGLYSNKNV LIQERNYLLT HRMSSFSQYV
TSTADYIDPP ERRLIEIEDE DLNDTKIERK LDKCRKYLMI GLALIGILYY IAFEFDGMES
QTSMKDAAIS WVQATGSGAN RPAIVAPDPY KLPPPNDYDE PVHGFPKAYI IGAPRCGCHV
LRQYMEHHPQ MFFTDEQNLH FWDDKYSQGV EWYKQKIPAV KPWQISVDYT SEYLIKPEVP
QRMIDELPNN QTRIIMMVCD PIERAELEFF EIYNNLDTNH ADWKAKIDAV GDIITDFPSF
ADLYTKFLST IRDNPSQVSK YGVRDFEDLI ANIKTIRPEA SILTTSLFDI HIRRWLRVFD
RGNILIMDGR QLQTAPIVAV RRVQNFLRIE DAIPSKAFSF NATAGLFCVN NGLFSDFNEV
VCPARDSFVL PDAAPRISAK AKEKLKDWFA PHMASFAFMT NDTFPWAYLP QVAHIDMPTG
NGWADGWT
//