ID V5GGW0_KALBG Unreviewed; 1297 AA.
AC V5GGW0;
DT 22-JAN-2014, integrated into UniProtKB/TrEMBL.
DT 22-JAN-2014, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE RecName: Full=Arrestin C-terminal-like domain-containing protein {ECO:0000259|SMART:SM01017};
GN ORFNames=PSEUBRA_SCAF6g00829 {ECO:0000313|EMBL:EST05242.1};
OS Kalmanozyma brasiliensis (strain GHG001) (Yeast) (Pseudozyma brasiliensis).
OC Eukaryota; Fungi; Dikarya; Basidiomycota; Ustilaginomycotina;
OC Ustilaginomycetes; Ustilaginales; Ustilaginaceae; Kalmanozyma.
OX NCBI_TaxID=1365824 {ECO:0000313|EMBL:EST05242.1, ECO:0000313|Proteomes:UP000019377};
RN [1] {ECO:0000313|Proteomes:UP000019377}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=GHG001 {ECO:0000313|Proteomes:UP000019377};
RX PubMed=24356824; DOI=10.1128/genomea.00920-13;
RA Oliveira J.V.D.C., dos Santos R.A.C., Borges T.A., Riano-Pachon D.M.,
RA Goldman G.H.;
RT "Draft genome sequence of Pseudozyma brasiliensis sp. nov. strain GHG001, a
RT high producer of endo-1,4-xylanase isolated from an insect pest of
RT sugarcane.";
RL Genome Announc. 1:E0092013-E0092013(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KI545892; EST05242.1; -; Genomic_DNA.
DR RefSeq; XP_016290231.1; XM_016438959.1.
DR STRING; 1365824.V5GGW0; -.
DR eggNOG; ENOG502RKAN; Eukaryota.
DR HOGENOM; CLU_003237_0_0_1; -.
DR OMA; MIAMGMQ; -.
DR OrthoDB; 1408949at2759; -.
DR Proteomes; UP000019377; Unassembled WGS sequence.
DR Gene3D; 2.60.40.640; -; 2.
DR InterPro; IPR014752; Arrestin-like_C.
DR InterPro; IPR011021; Arrestin-like_N.
DR InterPro; IPR011022; Arrestin_C-like.
DR InterPro; IPR014756; Ig_E-set.
DR PANTHER; PTHR11188; ARRESTIN DOMAIN CONTAINING PROTEIN; 1.
DR PANTHER; PTHR11188:SF17; FI24305P1-RELATED; 1.
DR Pfam; PF00339; Arrestin_N; 1.
DR SMART; SM01017; Arrestin_C; 1.
DR SUPFAM; SSF81296; E set domains; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000019377}.
FT DOMAIN 179..323
FT /note="Arrestin C-terminal-like"
FT /evidence="ECO:0000259|SMART:SM01017"
FT REGION 325..360
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 431..556
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 614..728
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 794..875
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 940..1050
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1063..1127
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1169..1192
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 432..447
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 448..475
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 510..556
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 614..635
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 644..728
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 794..817
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 940..964
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 982..997
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 998..1014
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1029..1043
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1086..1127
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1297 AA; 139959 MW; 1EA1A61860DA908E CRC64;
MVTKHPKVKI DVILSSNVFE AGGAITGKVE LTCTTGQRLR LGHIAVELEA VEQLTSRDHA
ATQLFLYNRT LFQGEKLPPS NAVLPAAPVN GYWTARKGRT SFPFSFRLPS SAPSCVKFAG
NASLRYGLKA TVQTWYNEDK MVVTARREAF VLEKWADQLH PRFREPAEAV ADTRLFMGGN
GAVWLEAGVS EQLFWGGGQM LIRCGIKNST KRHLSGIKVS LARRLIFPVG SAEGFHDSPD
KLSLEPRITE IVHERTFKGR EYEFDPNAES VCTVAVDVPK DLRTIRKTRL FEVRIFAFVS
LLLGSFAKDL TIEIPVYVAH TASGQPPAQQ GLDSLHAGPP SHSPPMPPFA GGGGAPSITG
LATIEEDSES QAGTIKTLAK LPTIGKTGKG GKGNSVSRNN IEQFEAMAEA EEDEEEVKRQ
MIAMGMQPDE DAFASKEQDR SSDARAEASD STPKASTSRR ADGTQASSSG TYRLKASDIF
QHAGSADVPS SQAAEPARDL AEPTTAELGH SHDSYDSMQS NQARERRSST ASSVVSLRRS
SSSQGIGLQA LESNLVRTTT PKICTNSRPL VTMANVARAE SVPSPLAAAP AVFDDSTRSR
KNSALRAAAL AREEAERKSA EEQARAAEER ARQVQAEQAR QAAAAEEEAA RKQAARIRAE
EQQRLERERI AEHARRQREL REQEEEQRLK AEAERVAREE EAKRQEREEM ERQERQAAEA
ARARAVADDT KELRRLAYSA SDRMERIKAA IANAPKSVAP SVPSKMTAPV LAAPSRTSDQ
DRVVLKQQAV QRVDGWLSNP TSPNTSHMNT PGTPSASVLS VMRKEEPSAS AAGAGEGRSS
YAYSSPSQVI DPMWEAKPPR SEKVEVAMPK SHTMADLASS KASAAVAPAR NSIDEPIPQL
SAELRALVDG SDIRPARKSG TALKEHPISR RLSGLGSTNK ASAVVVPGPS FTHVSSQASR
ERHTSMPSWP RPGLPAGSNA SAVPFVTATP SESSSSRGTE RRASRNHETL EVGIPTRVEK
TLMAPNFDSQ RNKDEHSYDI RSARGGRGGR VTSVAQLWSK IAGDDDDAAS DVSRNSLAAP
SADGEPRSAS PSNTSTFRPK PRRSSSASNS APALDFSKKA MSSPTTSTGV TAIAATFAGE
VDSNKPESLK SFTAPQFLNT SVPKAVFSAS TDAASKPEKR PLPKPVLQPG AVPAPQLVRP
TAMRAQKPER TSSRRISTQL LSTFEKDKVA QREMSVEEMM SKVSVDSTIL GALNYDGSHD
VSTWRCNNGT EIKMSGRGTT KAIGENKLKS LRAVWGS
//