ID G3VUP1_SARHA Unreviewed; 1063 AA.
AC G3VUP1;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 77.
DE SubName: Full=Tripartite motif containing 33 {ECO:0000313|Ensembl:ENSSHAP00000006896.2};
GN Name=TRIM33 {ECO:0000313|Ensembl:ENSSHAP00000006896.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000006896.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000006896.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000006896.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3VUP1; -.
DR STRING; 9305.ENSSHAP00000006896; -.
DR Ensembl; ENSSHAT00000006955.2; ENSSHAP00000006896.2; ENSSHAG00000005991.2.
DR eggNOG; KOG2177; Eukaryota.
DR GeneTree; ENSGT00940000156361; -.
DR HOGENOM; CLU_005817_0_0_1; -.
DR TreeFam; TF106455; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd19847; Bbox1_TIF1g_C-VI; 1.
DR CDD; cd19830; Bbox2_TIF1g_C-VI; 1.
DR CDD; cd05502; Bromo_tif1_like; 1.
DR CDD; cd15624; PHD_TIF1gamma; 1.
DR CDD; cd16766; RING-HC_TIF1gamma; 1.
DR Gene3D; 1.20.920.10; Bromodomain-like; 1.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 2.
DR InterPro; IPR003649; Bbox_C.
DR InterPro; IPR001487; Bromodomain.
DR InterPro; IPR036427; Bromodomain-like_sf.
DR InterPro; IPR019786; Zinc_finger_PHD-type_CS.
DR InterPro; IPR000315; Znf_B-box.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR019787; Znf_PHD-finger.
DR InterPro; IPR001841; Znf_RING.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR InterPro; IPR017907; Znf_RING_CS.
DR PANTHER; PTHR45915:SF3; E3 UBIQUITIN-PROTEIN LIGASE TRIM33; 1.
DR PANTHER; PTHR45915; TRANSCRIPTION INTERMEDIARY FACTOR; 1.
DR Pfam; PF00439; Bromodomain; 1.
DR Pfam; PF00628; PHD; 1.
DR Pfam; PF00643; zf-B_box; 1.
DR PRINTS; PR00503; BROMODOMAIN.
DR SMART; SM00502; BBC; 1.
DR SMART; SM00336; BBOX; 2.
DR SMART; SM00297; BROMO; 1.
DR SMART; SM00249; PHD; 2.
DR SMART; SM00184; RING; 2.
DR SUPFAM; SSF57845; B-box zinc-binding domain; 1.
DR SUPFAM; SSF47370; Bromodomain; 1.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 1.
DR SUPFAM; SSF57850; RING/U-box; 1.
DR PROSITE; PS50014; BROMODOMAIN_2; 1.
DR PROSITE; PS50119; ZF_BBOX; 2.
DR PROSITE; PS01359; ZF_PHD_1; 1.
DR PROSITE; PS50016; ZF_PHD_2; 1.
DR PROSITE; PS00518; ZF_RING_1; 1.
DR PROSITE; PS50089; ZF_RING_2; 1.
PE 4: Predicted;
KW Bromodomain {ECO:0000256|ARBA:ARBA00023117, ECO:0000256|PROSITE-
KW ProRule:PRU00035}; Coiled coil {ECO:0000256|SAM:Coils};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00024}.
FT DOMAIN 141..201
FT /note="RING-type"
FT /evidence="ECO:0000259|PROSITE:PS50089"
FT DOMAIN 228..275
FT /note="B box-type"
FT /evidence="ECO:0000259|PROSITE:PS50119"
FT DOMAIN 287..328
FT /note="B box-type"
FT /evidence="ECO:0000259|PROSITE:PS50119"
FT DOMAIN 902..949
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 989..1061
FT /note="Bromo"
FT /evidence="ECO:0000259|PROSITE:PS50014"
FT REGION 1..84
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 550..579
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 689..708
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 719..834
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 876..900
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 325..406
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 719..790
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1063 AA; 114595 MW; 6A21719EBABEE291 CRC64;
MAENKGGGGG GEPESGGGGV GGGAAATTGS AAGPGAAGPA GLEAEPPLAA VLVEEEEEEE
GGRAGPEGGA AAGPEDGGVS AASSGSAPAA AAVASAGPAG PGTAIAAPVP AAPAAPAALA
APTPASAGPP LGPPASLLDT CAVCQQSLQS RREAEPKLLP CLHSFCLRCL PEPERQLSVP
IPGGSNGDIQ QVGVIRCPIC RQECRQIDLV DNYFVKDTSE APSSSDEKSE QVCTSCEDNA
SAVGFCVECG EWLCKTCIEA HQRVKFTKDH LIRKKEDVSE AVGASGQRPV FCPVHKQEQL
KLFCETCDRL TCRDCQLLEH KEHRYQFLEE AFQNQKGAIE NLLAKLLEKK NYVHFAATQV
QNRIKEVNET NKRVEQEIKV AIFTLINEIN KKGKSLLQQL ENVTKERQMK LIQQQNDITG
LSRQVKHVMN FTNWAIASGS STALLYSKRL ITFQLRHILK ARCDPVPAAN GAIRFHCDPT
FWAKNVVNLG NLVIESKPTP GYTPNVVVGQ VPPGTNHISK TPGQINLAQL RLQHMQQQVY
AQKHQQLQQM RMQQPPAGAA TTTTPPQQHP RQAAPQMLQQ QPPRLISVQT MQRGNMNCGA
FQAHQMRMAQ NAARIPGIPR HSGPQYSMMQ PHLQRQHSNP GHAGPFPVVS VHNTTINPTS
PTTATMANAN RGPTSPSVTA IELIPSVTNP ENLPSLPDIP PIQLEDAGSS SLDNLLSRYI
SGSHLPPQPT STMNPSPGPS ALSPGSSGLS NSHTPVRPPS TSSTGSRGSC GSSGRTTEKA
SLSFKSDQVK VKQEPGTEEE ICSYSGTVKQ EKTEDGRRSA CMLSSPESSL TPPLSTNLHL
ESELDALGSL ENHVKTEPMD ISESCKQPGL SSLVNGKSPM RNLMHRSTRT GEGSNKEEDP
NEDWCAVCQN GGDLLCCEKC PKVFHLTCHV PTLLSFPSGE WICTFCRDLG KPEVEYDCDN
LQHSKKGKTA QGLSPMDQRK CERLLLYLYC HELSIEFQEP VPVSIPNYYK IIKKPMDLST
VKKKLQKKHS QHYQIPDDFV ADVRLIFKNC ERFNESPKVV CVF
//