ID A0A0L0DF57_THETB Unreviewed; 4084 AA.
AC A0A0L0DF57;
DT 11-NOV-2015, integrated into UniProtKB/TrEMBL.
DT 11-NOV-2015, sequence version 1.
DT 24-JAN-2024, entry version 38.
DE RecName: Full=UBR-type domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=AMSG_12005 {ECO:0000313|EMBL:KNC50776.1};
OS Thecamonas trahens ATCC 50062.
OC Eukaryota; Apusozoa; Apusomonadida; Apusomonadidae; Thecamonas.
OX NCBI_TaxID=461836 {ECO:0000313|EMBL:KNC50776.1, ECO:0000313|Proteomes:UP000054408};
RN [1] {ECO:0000313|EMBL:KNC50776.1, ECO:0000313|Proteomes:UP000054408}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 50062 {ECO:0000313|EMBL:KNC50776.1,
RC ECO:0000313|Proteomes:UP000054408};
RG The Broad Institute Genome Sequencing Platform;
RA Russ C., Cuomo C., Shea T., Young S.K., Zeng Q., Koehrsen M., Haas B.,
RA Borodovsky M., Guigo R., Alvarado L., Berlin A., Bochicchio J.,
RA Borenstein D., Chapman S., Chen Z., Freedman E., Gellesch M., Goldberg J.,
RA Griggs A., Gujja S., Heilman E., Heiman D., Hepburn T., Howarth C., Jen D.,
RA Larson L., Mehta T., Park D., Pearson M., Roberts A., Saif S., Shenoy N.,
RA Sisk P., Stolte C., Sykes S., Thomson T., Walk T., White J., Yandava C.,
RA Burger G., Gray M.W., Holland P.W.H., King N., Lang F.B.F., Roger A.J.,
RA Ruiz-Trillo I., Lander E., Nusbaum C.;
RT "The Genome Sequence of Thecamonas trahens ATCC 50062.";
RL Submitted (MAY-2010) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL349462; KNC50776.1; -; Genomic_DNA.
DR RefSeq; XP_013756827.1; XM_013901373.1.
DR STRING; 461836.A0A0L0DF57; -.
DR EnsemblProtists; KNC50776; KNC50776; AMSG_12005.
DR GeneID; 25569920; -.
DR eggNOG; KOG1426; Eukaryota.
DR eggNOG; KOG1477; Eukaryota.
DR OrthoDB; 5491782at2759; -.
DR Proteomes; UP000054408; Unassembled WGS sequence.
DR GO; GO:0004842; F:ubiquitin-protein transferase activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd12885; SPRY_RanBP_like; 1.
DR CDD; cd19671; UBR-box_UBR4_5_6_7; 1.
DR CDD; cd02340; ZZ_NBR1_like; 1.
DR Gene3D; 2.60.120.920; -; 1.
DR Gene3D; 3.30.60.90; -; 2.
DR Gene3D; 3.30.2160.10; Hect, E3 ligase catalytic domain; 1.
DR Gene3D; 3.30.2410.10; Hect, E3 ligase catalytic domain; 1.
DR Gene3D; 3.90.1750.10; Hect, E3 ligase catalytic domains; 1.
DR InterPro; IPR001870; B30.2/SPRY.
DR InterPro; IPR043136; B30.2/SPRY_sf.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000569; HECT_dom.
DR InterPro; IPR035983; Hect_E3_ubiquitin_ligase.
DR InterPro; IPR042469; HECTD3.
DR InterPro; IPR003877; SPRY_dom.
DR InterPro; IPR044736; Vid30/RanBPM/SPLA_SPRY.
DR InterPro; IPR000433; Znf_ZZ.
DR InterPro; IPR043145; Znf_ZZ_sf.
DR PANTHER; PTHR46654; E3 UBIQUITIN-PROTEIN LIGASE HECTD3; 1.
DR PANTHER; PTHR46654:SF1; E3 UBIQUITIN-PROTEIN LIGASE HECTD3; 1.
DR Pfam; PF00632; HECT; 1.
DR Pfam; PF00622; SPRY; 1.
DR Pfam; PF00569; ZZ; 1.
DR SMART; SM00119; HECTc; 1.
DR SMART; SM00449; SPRY; 1.
DR SMART; SM00291; ZnF_ZZ; 2.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF56204; Hect, E3 ligase catalytic domain; 1.
DR SUPFAM; SSF57850; RING/U-box; 2.
DR PROSITE; PS50188; B302_SPRY; 1.
DR PROSITE; PS50237; HECT; 1.
DR PROSITE; PS50135; ZF_ZZ_2; 2.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000054408};
KW Ubl conjugation pathway {ECO:0000256|ARBA:ARBA00022786,
KW ECO:0000256|PROSITE-ProRule:PRU00104};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00228}.
FT DOMAIN 2391..2583
FT /note="B30.2/SPRY"
FT /evidence="ECO:0000259|PROSITE:PS50188"
FT DOMAIN 3391..3447
FT /note="ZZ-type"
FT /evidence="ECO:0000259|PROSITE:PS50135"
FT DOMAIN 3468..3523
FT /note="ZZ-type"
FT /evidence="ECO:0000259|PROSITE:PS50135"
FT DOMAIN 3735..4082
FT /note="HECT"
FT /evidence="ECO:0000259|PROSITE:PS50237"
FT REGION 153..182
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 618..646
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1182..1211
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2672..2724
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2751..2770
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2778..2803
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3028..3055
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2754..2770
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3030..3044
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT ACT_SITE 4048
FT /note="Glycyl thioester intermediate"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00104"
SQ SEQUENCE 4084 AA; 433181 MW; 5C93AAB9622781C1 CRC64;
MGTTVSKTLT TAVGTTAAEK RSSNATWEKR LAACRDDDLK WLSEDLLHRI LESENILREH
VPEAESSTST SVDELEFERV RAVSTSDGSD GAGLPNLVGL VASSGVTNII DAQLARLVAH
AATWSSAFQE VTQARYAIVL RVLTSVTNQS RRLELANDSD GSGEGGEGSP EDAASKLNPL
PDANRPQANL GMQIAIKVFL SVVANTDPSG AAPAVEQLLA LVSGSKVLDL ANEDDDFLDA
LVAWLVAVLD ADAVPEALAT GAATALLTLA AFTGFQGRML DAMARLVAHP AASDVAPAYV
DALTKISAFK TSLELGTLAN SSPLAHLAFN ADLASELKGK ALSADARASV VSDGAYLYIH
DARGLLKVGT GLQGSAPGHV YYADRKFYPD AHLWLSVIAG SLYVRSAALS DNAVFEVYDT
ADLASPHRIV TPSDLLITTS DPQMATLASS AFVPEYDWKT AALAAKNTTC SSSTGGFGMH
FNCQSCNYAR VCAACAAECH KGHNLVMDAN ESYSYCNCAA RALDADPRPC RTRPLPHWGT
LSAASPSWSD GRYLYILTLR PKPDDLVARE ERRLAKWRAK RAKIFAKADA KLAASRNAAA
GTDAGASSSA AAAAAADSDT ADLPADTDAA DDIDDTASDG SLDRFEIDGS RARPRRDITL
ARKAGNDLSA DQQELELWES GTRYVIVVVD PLDSFHLVDV KRLDIERAGS NLPLESLGRF
SFYSNGSILV VQQVADFNPP KPPTGAFSNS LGGFGGGSSS QPNLTITSFD LSCKGVTLGA
PDEVNKFSNA LLTTANVPTD AGLSASAYDY VNNMIWAYSS MHLRCVPYKN ATPPPLFRFP
APPAEGHSHD FVHSSDDALA HPAFALPSDQ QSSIRTGTAI VRTLALLDQH AADHAQRLAL
TGSTALSAAA LSAPFALHIA PATFVALLNL ARALHSSILE AAPGPRSALI QPVYGLGATL
RLLAANLSRL LDVGRRCPAS TPNNILVDLH GLLLAMLETT LSDPELHADA YVPIASAVWE
DQQLFLLLAV LDPASVEDRE LPVNLPTDNA HATGSALRAF AGAVIASDST AVLNALLGRL
ALNSNASLLI CPAGASSGAA IDRTIIINLV EQALASFDDS LAAAIAAGPH ASPAAAPQPA
PAGRALLAFL KGLIQRVSSF TTQLADAAAK LKTAKEAAAS DKSAKAKADS GAESGADAAA
SSETADDDAP PSLAERAAAL EDAEYEKNVL VDACVGVVAS VLDAGTTALH KLRTSLDSIS
LDAAEAVLAA SLVGRVFPVL TALLGTLSST IGVQAPLMAP LVRVLREFSP VAAAFEANTA
AEAEYNAGRA YRQSGGAVRV VESAHPYTAN TDEDFSVRIP GAHALEITFS SQSNLQYNDM
LTLYLKRGHK SRFRRFESYG AFQETRVIVP GDAVYFHFKS QSQSYGDYGF KATVTPQLKA
EAPLELPWML DTFKSLGYVA GKYAAMLVAA KPQMEVPPSL AHWRASPLLS GGLSSIMLRA
NPDAARGDAR YDLAPLRAVL EASTAQLPAI PPAFAETPQA ETDVFTSFLG AFVAQGELPL
GANATPDALP AKLNAWILNN SSVQVFLARE PRAAAYNRAT RAIIAGILRH SGLVKEVHAL
ATAVEAGADT FTATKPPQHL VMVWKIASKA RRFLMGLARP QAELPSAMVA AAASSPADDA
APADGDAAAD KAKADAAAKS NARVSFDSYV EAMAAVEDRM RFLLDAAPAA THLLYALSSR
PETPKSTAAK SRWDALRTSV TSRVGAVGTE SDSVVAALKD LAKLSESVVE FAKTAQLAAL
PLSIFVDLLA DARQCGKARV FGISAYTALL ADGGEFGSVS QDVLSHLSAA LNTTAMPQSG
AAHGKTYPHY LNSVEIAGTA IATQIGTAFS ALLATLVAML RNVNTPPATK LLILDALALE
YKRADYPMLV KAQVFPALGE VMKATRKYRG YTPPPDAARG ASSLPAVLRR VNASAWAVFQ
LISLLVFAPN DRAAASGAAS APAGGLGLGS KQLRSAAFDA VLAELSSSAD EIEAAATTGS
HDPELNSSVY DLLTLLFTWA DTASHQLIHP RLQRLLARIM AVGAPALQRM TLRLFRKVLA
RHSPDDVVEN LFAADEHLLD SLLTTAGQSF VLAPAGPPAP GVADVAPAGD GVAADFGWRS
GFVHFVVGLE AMYALRTLLA ARPWAAALVP RLLAALDADS ETAPLAALAV LGGSLDVPRI
GARIAVAIGA GIREFGTIVT CAPESPTLTA VLDSDVEAGK LSKYSINDVE LVPETELSQA
ALGFVDANVT VDAIARLALA PPASDATPAA ALTQALALKA LGRLLSDSRF LTAFLSQPSR
LAALLDVMRT ADGDPRFSSV ESEERLLTSV LQAVQSADAR VVGSATQDPS SDPPPTFQPR
YAFVPHSKFP QGIDMGAANA DLVQISADGV VEYLGSGRGQ TDIGLIKGDA VVPHAVDMYY
FEVSILHEGA DGRVSIGLVP DGKTVPGTLG ALYGSYALQG ESGKKMSYKC KGYDGTGKEY
ANAASSGDVL GLLYDVQNGC IHFTISGNDF GVAFKQLKGR FRPAVGIASP GARVLVNFGQ
LPFRYAADGR TMPTKNSLGT ARTREDKAAI LLAVGFEPWL VDAALSQSAL FDEPSDALYC
WCAERFGRSR EAVEAEAAAA AAAATAAAAA ADSDGADGDV GDGVDADADG KATPPTAVKT
ASDVDTLSEY SDEDAEPAVN EGDNRANNYL VKEVKPQLLM ADVVPGMYLS VDPDYEVDGE
DDDDKDSDGD AELVAADDAM LGQSGGSEPG KGKDEVGDTD AGAGNALAQL PFESTRGQTG
TVLAVDSASR GVLLAVYQVE TGRMLLGWYP VRSLVYPSPL VDDPLTWVLA GEGAALAAGA
ARRLQGLYAR SAVVAAMVAW EPQVSPLTMD AGVGSDENLL ALLSATAYAD LAAMPDQSSS
DPGAYLYADA ESSDAIALRE ATAKLVKNCV AAMDGPSARK LWGWLVVQAH QLTADRLDLE
IGAWLYLYLF DALKVFEAKL AGADNLAEGD EATGKGKADV ETPADDADAG AADDAAELSE
EDARKAATTL AEFAALDEAY TLPPVEKTLA QLGLDSVELE DKASRRSAEA LRLLRLLVSR
RRAMRLLAQV DQIALARTRA LMYRTLTLMF EADEVFGRLP RVSVLQKAIA SLTQSFYNSS
QMTLEQQAKS GMSATLADLL RRVREKNDAV SRAEAWLANV KSMKERAKAG AKVEVDECVA
DQQVRDLARV LGVNELVSSR YASTGTSLHL LGELVDVMYE RRKVPVKFAL RAWPAAQLVE
RKVVSSSHPY EAGRVKDMVV IADASTVQVK FHPKCATMQS DVLQFSETAH FGPYGSTYSG
KFGQYASSGV AFSANGDRMY YQFNAAAGSQ LHGQSCQVCS LGQMRGTRYS CISCRSCELC
EECEANLEET QAHPTSHMFL KIQRPTTNWY ARESLPCPPI DTEAPRTHTG IACDGCGASP
IEGDLYKCAQ CGAYYLCSAC EERATELDHD VDHVFIKIRM ARTSPETYSF FRALPNMYED
GMTWGYEFVV SGILKQRALL NSLGTAMPDI LAWLETDAQE WKPALDHELV RYAVAVAEKA
DTNVSMLPLD ALEPDEEALQ SCPGLRGVTL DVLRNRFSLL LMFNELVKGA LASVDLSLVN
AEWSTAHRLC QLKGLIFPEH KEQIFNDMVS SRGSRSRQAS VTVDRTAAAV TSDDQVVPLE
STMFMQIYTG LASATNDMLR QSQQAWQVQF QGEGSIDAGG PYRESITQMC AELTAGEPVG
LFLPCANAKF AVGQNREKIV PNPGAATPAK LEQFEFVGKL MGIALRTKNP LALDLPSLVW
KRLVFTVPSR SDIDLIDQMC CQVLDSVINI EAEGVTADTF GDVITEMFTT VSADGRTVEL
VEGGADVPVT WDNRAEWADA VLNYRLHEFD RQIAAIGRGL AALVPAHLFS LFTWQELELM
VCGRAEIDIK FLKQNTVYQG CAATDAHIVM FWEVLESFSQ EERRLFIRFV WGRSRLPTSS
ANWKQKFTIT AFAGGHAGDD HLPNSHVCFN SIELPRYSTK DIMAAKIRYA MSTCMSIDLD
FVVR
//