ID G5GNE3_9FIRM Unreviewed; 1038 AA.
AC G5GNE3;
DT 25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT 25-JAN-2012, sequence version 1.
DT 27-MAR-2024, entry version 40.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EHG21480.1};
GN ORFNames=HMPREF9334_00897 {ECO:0000313|EMBL:EHG21480.1};
OS Selenomonas infelix ATCC 43532.
OC Bacteria; Bacillota; Negativicutes; Selenomonadales; Selenomonadaceae;
OC Selenomonas.
OX NCBI_TaxID=679201 {ECO:0000313|EMBL:EHG21480.1, ECO:0000313|Proteomes:UP000004129};
RN [1] {ECO:0000313|EMBL:EHG21480.1, ECO:0000313|Proteomes:UP000004129}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 43532 {ECO:0000313|EMBL:EHG21480.1,
RC ECO:0000313|Proteomes:UP000004129};
RG The Broad Institute Genome Sequencing Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Izard J., Blanton J.M.,
RA Baranova O.V., Dewhirst F.E., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Alvarado L., Arachchi H.M.,
RA Berlin A., Brown A., Chapman S.B., Chen Z., Dunbar C., Freedman E.,
RA Gearin G., Gellesch M., Goldberg J., Griggs A., Gujja S., Heiman D.,
RA Howarth C., Larson L., Lui A., MacDonald P.J.P., Montmayeur A., Murphy C.,
RA Neiman D., Pearson M., Priest M., Roberts A., Saif S., Shea T., Shenoy N.,
RA Sisk P., Stolte C., Sykes S., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Selenomonas infelix ATCC 43532.";
RL Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EHG21480.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ACZM01000007; EHG21480.1; -; Genomic_DNA.
DR RefSeq; WP_006692347.1; NZ_JH376798.1.
DR AlphaFoldDB; G5GNE3; -.
DR STRING; 679201.HMPREF9334_00897; -.
DR PATRIC; fig|679201.3.peg.906; -.
DR eggNOG; COG3587; Bacteria.
DR HOGENOM; CLU_011799_0_0_9; -.
DR OrthoDB; 9804145at2; -.
DR Proteomes; UP000004129; Unassembled WGS sequence.
DR GO; GO:0005524; F:ATP binding; IEA:InterPro.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0015668; F:type III site-specific deoxyribonuclease activity; IEA:InterPro.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 2.
DR InterPro; IPR006935; Helicase/UvrB_N.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR045572; RE_endonuc_C.
DR PANTHER; PTHR47396:SF1; ATP-DEPENDENT HELICASE IRC3-RELATED; 1.
DR PANTHER; PTHR47396; TYPE I RESTRICTION ENZYME ECOKI R PROTEIN; 1.
DR Pfam; PF19778; RE_endonuc; 1.
DR Pfam; PF04851; ResIII; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 2.
PE 4: Predicted;
FT DOMAIN 109..280
FT /note="Helicase/UvrB N-terminal"
FT /evidence="ECO:0000259|Pfam:PF04851"
FT DOMAIN 922..1028
FT /note="Type III restriction enzyme C-terminal endonuclease"
FT /evidence="ECO:0000259|Pfam:PF19778"
SQ SEQUENCE 1038 AA; 116736 MW; 79E8A92C72FAB821 CRC64;
MKFHFTVQPY QTDAVEAVVR VFEGQPYAAR TTYLRDVGTN AAQGNLLPVQ ESYMNAPLLT
AEADDGFRNE ELHLTQGELL AHIRRVQSAR NLHLSDTLSA PLGACALDVE METGTGKTYV
YIKTMFELNR RYGWSKFIVV VPSIAIREGV KKSFEMTAEH FMEHYGKKAR FFVYSSANLN
QLDAFSADAG LSVMIINTQA FAASLNEEKS IEGRGGNKEA RIIYSKRDAF GSRRPIDVIA
ANRPILILDE PQKMGKKGSV TQKALAQFAP LFTLNYSATH AERHNLVYVL DAVDAYNARL
VKKIEVKGFE VKNLPGTGRY LYLAEIVLSP KVPPRARIEF EVAQKGGVRR TLRIVSKGDN
LCHLSGGMAQ YEGYVVEHID AAAGTVLFLN GTTLHTGDVQ GDVSEADLRR VQIRETIVSH
FEKEQRLFAR GIKTLSLFFI DEVAKYRQYD ADGNAVLGEY GQIFEEEYRA VMNENRDIFD
PAYMKYLDDV PVEKVHTGYF SIDKKTGRAV DSSLRRGSDL SDDISAYDLI LKDKERLLSF
DEPTRVIFSH SALREGWDNP NVFQICTLKH AASVTAKRQE VGRGLRLAVD QTGQRMDRTV
LGDAVHEVNV LTVIASESYA GFVGDLQKQM EAELYDRPKA ATEAYFKGKT VQSADGTAME
LDEKAARVIY KYLLQHDYID DDGHVTEDYR TALAAGTLAP LPKALMPMTE GVHRLVQAIY
DESVLRTMIS DAGATKTPEN KLNERFYRKE FQELWKRINH KYAYTVSFDT RQLVRAAIDH
INAELRVSRL QYTITYGSQQ TDLDVGMVAD GTSFAYGKSR TSTLKRAQGS AVTYDLIGRI
AAGTTLTRRT VVEILQGIKA EVFAGYRCNP EEFIRTVTRL ILEQKATRIV DHIEYNTIEG
SYDASIFATE KRPYTEAYEA EKAIQNYVFT DGNVERRFAQ DLDGGNEVLI YAKLPRGFAI
PTPVGSYAPD WAIVFREGKV RHIYFVAETK GTLSSLQLRP IEEAKIACAK KLFEKLHSDC
VIYEHVNDFQ TLLNRVMK
//