ID V5F327_KALBG Unreviewed; 1246 AA.
AC V5F327;
DT 22-JAN-2014, integrated into UniProtKB/TrEMBL.
DT 22-JAN-2014, sequence version 1.
DT 27-MAR-2024, entry version 34.
DE RecName: Full=DNA polymerase V {ECO:0008006|Google:ProtNLM};
GN ORFNames=PSEUBRA_SCAF1g00351 {ECO:0000313|EMBL:EST09894.1};
OS Kalmanozyma brasiliensis (strain GHG001) (Yeast) (Pseudozyma brasiliensis).
OC Eukaryota; Fungi; Dikarya; Basidiomycota; Ustilaginomycotina;
OC Ustilaginomycetes; Ustilaginales; Ustilaginaceae; Kalmanozyma.
OX NCBI_TaxID=1365824 {ECO:0000313|EMBL:EST09894.1, ECO:0000313|Proteomes:UP000019377};
RN [1] {ECO:0000313|Proteomes:UP000019377}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=GHG001 {ECO:0000313|Proteomes:UP000019377};
RX PubMed=24356824; DOI=10.1128/genomea.00920-13;
RA Oliveira J.V.D.C., dos Santos R.A.C., Borges T.A., Riano-Pachon D.M.,
RA Goldman G.H.;
RT "Draft genome sequence of Pseudozyma brasiliensis sp. nov. strain GHG001, a
RT high producer of endo-1,4-xylanase isolated from an insect pest of
RT sugarcane.";
RL Genome Announc. 1:E0092013-E0092013(2013).
CC -!- SIMILARITY: Belongs to the MYBBP1A family.
CC {ECO:0000256|ARBA:ARBA00006809}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KI545851; EST09894.1; -; Genomic_DNA.
DR RefSeq; XP_016294883.1; XM_016435913.1.
DR AlphaFoldDB; V5F327; -.
DR STRING; 1365824.V5F327; -.
DR GeneID; 27418542; -.
DR eggNOG; KOG1926; Eukaryota.
DR HOGENOM; CLU_005212_0_0_1; -.
DR OMA; VWKHDDP; -.
DR OrthoDB; 2050954at2759; -.
DR Proteomes; UP000019377; Unassembled WGS sequence.
DR GO; GO:0005730; C:nucleolus; IEA:InterPro.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR007015; DNA_pol_V/MYBBP1A.
DR PANTHER; PTHR13213:SF2; MYB-BINDING PROTEIN 1A; 1.
DR PANTHER; PTHR13213; MYB-BINDING PROTEIN 1A FAMILY MEMBER; 1.
DR Pfam; PF04931; DNA_pol_phi; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000019377}.
FT REGION 45..65
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 753..864
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1198..1246
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 755..775
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 790..813
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 814..832
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 833..851
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1200..1226
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1246 AA; 138148 MW; 2861D557D86F4E01 CRC64;
MNGNNLQLFW EIASSNQDAR LTSANQLVEE LLSQQDVLAS SSKITLDLPS TTESTDDDDQ
DMDDPLTVDE QEAESVEEAL GNRTVADLMY ALRRLLRGLA SPRESSRLGF AVVLTELLSR
IRYAVSAKEI LTLVLKYSNP QVAASRQEQK DFMFAKLFGI MSLVQSELLV QPSATISDFK
RSMSILIALS SDKPWMAESC AWVMVQAVAQ LQRADVEIEW SQAAQSWMTE QISSTKELSP
EKLAVMLQLS HGAGADFFQT VALPSMRKEH PLSTANLASL ARVLKEAIPS ENEATATAGA
RSRWQAKIHF VWDLILDVFF DSTSSAETRI ASFPDFYRVV VDETLFAAAS SHDRKSWGFQ
VFQRALPRAN DTDKPMLFTP NFMRTLINQL GNRDRLLHRA AVKATETIQD VVKQQPHLGF
VLVTQLIGKN GHQNFDSITK TKTVEQVLSS LDIDRVREYV KHLGEIICQG IDTEDEDELT
EINNRRKWAL DQLLFLVRTP TVPKDDQLLV DVLTFLAAHG YYTMQKPLKK RFLLERLPTP
AFTDSLKQVT RSRFLSSVSE LAKQAPPTGA SVAGKKTPGV AQDGQLWLAK AHDVLLQLEK
DKSFKSIFDH DDEIAAKLAD GVRCLDKVRK FEAKSTDADV KERARAFQSL LLSALLVSAD
AADGASDLVE PLCACADKLF PAPSAASSRK GKQVEDQSEV TGVELLCDCL LSFLELSSAF
LRTSATNAFE AFTQDMTKGS IGLLLSHLRT ADEVAQDADE DEDEDMDDEE ASDEVEEDGK
PRASDTSATS DDDDESDADA DQEDEDDEDQ EVDEELRARV RAVLKESGMA DPDDAAQDGE
DDEDEAEDAQ SDAGSESSVE FSDLGDDEMM LLDDKLAEVF RQRVSASRGQ KEAKQEAIAL
RFKVLELVEV FARKQPQSPL MLDLIEPLYD LWSNRDEDTE QLATKARTLL LSRISKAKTV
PDDFDADHVA TLLQRMHTEA RESQSNDVSN VSQAINLYLT KVALSKSGDD VKAALGGKLV
EIYRESLLDY LMRKSSRLRH EFLIDAFRRF PMLGWELRSE LLDNCRPGAA TRAFRQVQVM
SMLQAVLTQI AQRPEKGEVL GFVPEISSCF VQIITDSCDA ASTSALSSNH MKDTIKFVLQ
AARISSRIEP GANDKLQGAW KVVKLQDAAD KLQSSERFKA STSIHDLMKQ LIKVLQPDAT
GADGKKKKNN KDGGKNDKKR AAEESTPIKV NGKSPSPAKK VKATSK
//