ID Q4N3R4_THEPA Unreviewed; 592 AA.
AC Q4N3R4;
DT 02-AUG-2005, integrated into UniProtKB/TrEMBL.
DT 02-AUG-2005, sequence version 1.
DT 24-JAN-2024, entry version 62.
DE RecName: Full=UmuC domain-containing protein {ECO:0000259|PROSITE:PS50173};
GN OrderedLocusNames=TP02_0924 {ECO:0000313|EMBL:EAN33209.1};
OS Theileria parva (East coast fever infection agent).
OC Eukaryota; Sar; Alveolata; Apicomplexa; Aconoidasida; Piroplasmida;
OC Theileriidae; Theileria.
OX NCBI_TaxID=5875 {ECO:0000313|EMBL:EAN33209.1, ECO:0000313|Proteomes:UP000001949};
RN [1] {ECO:0000313|EMBL:EAN33209.1, ECO:0000313|Proteomes:UP000001949}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Muguga {ECO:0000313|EMBL:EAN33209.1,
RC ECO:0000313|Proteomes:UP000001949};
RX PubMed=15994558; DOI=10.1126/science.1110439;
RA Gardner M.J., Bishop R., Shah T., de Villiers E.P., Carlton J.M., Hall N.,
RA Ren Q., Paulsen I.T., Pain A., Berriman M., Wilson R.J.M., Sato S.,
RA Ralph S.A., Mann D.J., Xiong Z., Shallom S.J., Weidman J., Jiang L.,
RA Lynn J., Weaver B., Shoaibi A., Domingo A.R., Wasawo D., Crabtree J.,
RA Wortman J.R., Haas B., Angiuoli S.V., Creasy T.H., Lu C., Suh B.,
RA Silva J.C., Utterback T.R., Feldblyum T.V., Pertea M., Allen J.,
RA Nierman W.C., Taracha E.L.N., Salzberg S.L., White O.R., Fitzhugh H.A.,
RA Morzaria S., Venter J.C., Fraser C.M., Nene V.;
RT "Genome sequence of Theileria parva, a bovine pathogen that transforms
RT lymphocytes.";
RL Science 309:134-137(2005).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EAN33209.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAGK01000002; EAN33209.1; -; Genomic_DNA.
DR RefSeq; XP_765492.1; XM_760399.1.
DR AlphaFoldDB; Q4N3R4; -.
DR STRING; 5875.Q4N3R4; -.
DR EnsemblProtists; EAN33209; EAN33209; TP02_0924.
DR GeneID; 3501854; -.
DR KEGG; tpv:TP02_0924; -.
DR VEuPathDB; PiroplasmaDB:TpMuguga_02g00924; -.
DR eggNOG; KOG2093; Eukaryota.
DR InParanoid; Q4N3R4; -.
DR OMA; DENICNL; -.
DR Proteomes; UP000001949; Unassembled WGS sequence.
DR GO; GO:0006281; P:DNA repair; IEA:InterPro.
DR CDD; cd03468; PolY_like; 1.
DR Gene3D; 3.30.70.270; -; 1.
DR Gene3D; 3.40.1170.60; -; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR001126; UmuC.
DR PANTHER; PTHR45990; DNA REPAIR PROTEIN REV1; 1.
DR PANTHER; PTHR45990:SF1; DNA REPAIR PROTEIN REV1; 1.
DR Pfam; PF00817; IMS; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR PROSITE; PS50173; UMUC; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000001949}.
FT DOMAIN 32..220
FT /note="UmuC"
FT /evidence="ECO:0000259|PROSITE:PS50173"
FT REGION 292..345
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 460..494
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 545..574
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 292..312
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 313..329
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 330..344
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 461..494
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 545..560
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 592 AA; 66800 MW; 71D961B65C7AA52A CRC64;
MDVGIYMNGN IMYLFFENFL LDAILGENFE GPAAIYNHGR ILSVNNECFN QGVKRGMYVE
NALDICKDMK PVKYDWDKVN ERGLRIIRIL KKYTDRILSP QYNEFYLQIS YPECRNLLEN
PSTNTDENIC NLAIQISKEV PGEPVIGIGK NMLVSKLASK KCRNLKINSQ ESDYDYELSP
IVCITSIYGG CYMVSDKICI VTDGFKFLNN VYLSEIPGVG YLGPVLKSKG LVTCNDVRKI
GTPLCLQSVL GGRIGKLVYN FCFGYDYRCA NLPKKQQDFF LNKSFTSEVT IGSHESNSST
SDNCTQNSSK QGDLGKESEK EHDESKVSSI ETSQESGCNN EGNNLNRESE ERVLRRLLSE
VLKDLCSVYS LFDKCIINYS IKYKCNITLH VHSDSQYTKS ITSILDKQSL RHTINSLYNK
IKSEFGIEFP KIKKIQLEVL DVVRVDEDIT FIDKFLKERP STGQESQSPF RMSFLSPSSA
TATSNDSVYS PSEMCTPTRR LNGWKLDSIS IDSGTPQSPA KSASSSVASR TLITPNILRY
VDFKGQISQK SNPDSSVDSK TRTPTRSRRS LKLSRGQKYI TEYLSPSKFE FS
//