ID G3WEL1_SARHA Unreviewed; 630 AA.
AC G3WEL1;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 55.
DE RecName: Full=Galactose-3-O-sulfotransferase 2 {ECO:0008006|Google:ProtNLM};
GN Name=LOC111719705 {ECO:0000313|Ensembl:ENSSHAP00000013866.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000013866.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000013866.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000013866.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the galactose-3-O-sulfotransferase family.
CC {ECO:0000256|ARBA:ARBA00008124}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3WEL1; -.
DR Ensembl; ENSSHAT00000013983.2; ENSSHAP00000013866.2; ENSSHAG00000011858.2.
DR eggNOG; ENOG502QTXT; Eukaryota.
DR GeneTree; ENSGT00950000182923; -.
DR HOGENOM; CLU_040616_1_1_1; -.
DR TreeFam; TF314802; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0001733; F:galactosylceramide sulfotransferase activity; IEA:InterPro.
DR GO; GO:0009247; P:glycolipid biosynthetic process; IEA:InterPro.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR InterPro; IPR009729; Gal-3-0_sulfotransfrase.
DR InterPro; IPR027417; P-loop_NTPase.
DR PANTHER; PTHR14647; GALACTOSE-3-O-SULFOTRANSFERASE; 1.
DR PANTHER; PTHR14647:SF62; GALACTOSE-3-O-SULFOTRANSFERASE 2-LIKE ISOFORM X1; 1.
DR Pfam; PF06990; Gal-3-0_sulfotr; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
PE 3: Inferred from homology;
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 18..36
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT REGION 152..171
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 186..208
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 607..630
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 186..207
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 630 AA; 72903 MW; A0CC2BE46955C9CC CRC64;
MYVLKKAQLV PKTQLGRFWI LITFISILCI SLQMLGNLQQ SCKQEIKHLI LQQNINLPQT
SQVQKNCNST LSPHSLNRVP IGHTEAIPTQ VSLQRMKQNK ALNPTLQMQN DLNYSEENAL
VDLWWLSYSS KHPKPSLWGS KKVAKDKDWT LPQIKSSKPP PKAAQMNKKR DLGKKMEMVV
SSASRSFDSV GGSQQNHPQS TIPKYGRAQS QGKIMKDSLF RQKISLKDPI TRSRTPLSPS
LTPRLRDGLF SSHEAACTSK THIFFLKVHK SASSTIMNIL FRFGEQRNLT FALPINQNSQ
LFYPSYFVAE VVEGFTGETS PSFDIMCHHL RFLQTEVQRV MPNDTFYFSI MRNPIHLMES
SFSYYKGSSS FFNAKSLDDF LNNTSKFYDP LKTDSQYSRN LMAFDFGFNH NGKASAQHTR
LLARVIESQF DLVLIAEYFD ESMVLLKDAL CWSLDDVMSF PINQQDASYR SSLSPSTIQK
IKSWNKLDWE LYLHFNRTFW ERINQSMSRE YLQQEVAALR KRRKQLAKIC LQKEQSVHSW
NIEDKLLKPF QYGRAKILGY NLRHGLDEAT KQACQSFVMP ELQYSKRLYE RQFPQKALWL
SQAKKYSKAH SKKKERPPKS YSQTSEASFF
//