ID G3WKA9_SARHA Unreviewed; 519 AA.
AC G3WKA9;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 51.
DE RecName: Full=Ubiquilin 1 {ECO:0008006|Google:ProtNLM};
GN Name=LOC100933726 {ECO:0000313|Ensembl:ENSSHAP00000015864.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000015864.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000015864.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000015864.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_003764838.1; XM_003764790.2.
DR AlphaFoldDB; G3WKA9; -.
DR STRING; 9305.ENSSHAP00000015864; -.
DR Ensembl; ENSSHAT00000015993.2; ENSSHAP00000015864.2; ENSSHAG00000013516.2.
DR GeneID; 100933726; -.
DR KEGG; shr:100933726; -.
DR eggNOG; KOG0010; Eukaryota.
DR GeneTree; ENSGT00940000163907; -.
DR HOGENOM; CLU_024293_4_0_1; -.
DR InParanoid; G3WKA9; -.
DR OMA; VKTPQDC; -.
DR TreeFam; TF314412; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR CDD; cd14399; UBA_PLICs; 1.
DR CDD; cd01808; Ubl_PLICs; 1.
DR Gene3D; 1.10.8.10; DNA helicase RuvA subunit, C-terminal domain; 1.
DR InterPro; IPR015940; UBA.
DR InterPro; IPR009060; UBA-like_sf.
DR InterPro; IPR015496; Ubiquilin.
DR InterPro; IPR000626; Ubiquitin-like_dom.
DR InterPro; IPR029071; Ubiquitin-like_domsf.
DR PANTHER; PTHR10677; UBIQUILIN; 1.
DR PANTHER; PTHR10677:SF10; UBIQUILIN 5; 1.
DR Pfam; PF00627; UBA; 1.
DR Pfam; PF00240; ubiquitin; 1.
DR SMART; SM00165; UBA; 1.
DR SMART; SM00213; UBQ; 1.
DR SUPFAM; SSF46934; UBA-like; 1.
DR SUPFAM; SSF54236; Ubiquitin-like; 1.
DR PROSITE; PS50030; UBA; 1.
DR PROSITE; PS50053; UBIQUITIN_2; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000007648}.
FT DOMAIN 24..98
FT /note="Ubiquitin-like"
FT /evidence="ECO:0000259|PROSITE:PS50053"
FT DOMAIN 476..516
FT /note="UBA"
FT /evidence="ECO:0000259|PROSITE:PS50030"
FT REGION 106..131
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 178..212
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 267..290
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 413..439
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 456..476
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 178..196
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 274..288
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 458..474
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 519 AA; 53936 MW; 16409C9BC94EA47A CRC64;
MAGAREEAGD SRLVAGREPP PRIITVTAKT PQERQEFTLA ENCSVREFKE QISMRLNCDV
NRLVLIFTGK ILRDQDTLSQ RGVLDGTTVH LVVRNRFPGF TPSCHLTATP ATSNQPIPGS
NSTGSSPSTG AAGLLTRLGR IARGSPDLAD FLGHLAQLLM AVPEAVVQFL DDPSIQGLLG
ETPSSTNPSG TGPGRLMAQP QTAPPAQAAE TVPEALRSPA LLRELLMLRA DERGLGALKA
VPGGDNALRQ VYADIQQLML TVPASAPRAK GPASLSGPSN SSSSAGPLRL GNVWPGAQGR
VLGASTQATS PYSSSIPNLY MGLGPGSQDM SPTIGKGALP GPSAPSASAL RNALNVLHQN
PSLLHQLTAG SPLRPRLPLL PILTNPRALQ AWLQIEQGLQ TLSMEIPGLG PCLRGSGRPH
GGHGGGLAGG SNSRVSSQQP TLAVLQLLQA LAHASPNALQ TPPPPPPLPP PPSEGRFQQE
LDQLKAMGFS NRDANLQALI ATGGDIHAAI ERLLGAPQA
//