ID G5F2N9_9ACTN Unreviewed; 945 AA.
AC G5F2N9;
DT 25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT 25-JAN-2012, sequence version 1.
DT 03-MAY-2023, entry version 39.
DE RecName: Full=Transglutaminase-like domain-containing protein {ECO:0000259|SMART:SM00460};
GN ORFNames=HMPREF1008_00633 {ECO:0000313|EMBL:EHF02228.1};
OS Olsenella sp. oral taxon 809 str. F0356.
OC Bacteria; Actinomycetota; Coriobacteriia; Coriobacteriales; Atopobiaceae;
OC Olsenella.
OX NCBI_TaxID=661087 {ECO:0000313|EMBL:EHF02228.1, ECO:0000313|Proteomes:UP000003446};
RN [1] {ECO:0000313|EMBL:EHF02228.1, ECO:0000313|Proteomes:UP000003446}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=F0356 {ECO:0000313|EMBL:EHF02228.1,
RC ECO:0000313|Proteomes:UP000003446};
RG The Broad Institute Genome Sequencing Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Izard J., Blanton J.M.,
RA Baranova O.V., Tanner A.C., Dewhirst F.E., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Alvarado L., Arachchi H.M.,
RA Berlin A., Brown A., Chapman S.B., Chen Z., Dunbar C., Freedman E.,
RA Gearin G., Gellesch M., Goldberg J., Griggs A., Gujja S., Heiman D.,
RA Howarth C., Larson L., Lui A., MacDonald P.J.P., Montmayeur A., Murphy C.,
RA Neiman D., Pearson M., Priest M., Roberts A., Saif S., Shea T., Shenoy N.,
RA Sisk P., Stolte C., Sykes S., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Olsenella sp. oral taxon 809 strain F0356.";
RL Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EHF02228.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ACVE01000003; EHF02228.1; -; Genomic_DNA.
DR RefSeq; WP_009278423.1; NZ_JH376563.1.
DR AlphaFoldDB; G5F2N9; -.
DR STRING; 661087.HMPREF1008_00633; -.
DR PATRIC; fig|661087.3.peg.634; -.
DR eggNOG; COG1305; Bacteria.
DR HOGENOM; CLU_325905_0_0_11; -.
DR OrthoDB; 5438043at2; -.
DR Proteomes; UP000003446; Unassembled WGS sequence.
DR Gene3D; 3.10.620.30; -; 1.
DR Gene3D; 2.60.40.1120; Carboxypeptidase-like, regulatory domain; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR002931; Transglutaminase-like.
DR PANTHER; PTHR35532:SF5; CARB-BD_DOM_FAM9 DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR35532; SIMILAR TO POLYHYDROXYALKANOATE DEPOLYMERASE; 1.
DR Pfam; PF01841; Transglut_core; 2.
DR SMART; SM00460; TGc; 2.
DR SUPFAM; SSF54001; Cysteine proteinases; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000003446}.
FT DOMAIN 154..213
FT /note="Transglutaminase-like"
FT /evidence="ECO:0000259|SMART:SM00460"
FT DOMAIN 618..693
FT /note="Transglutaminase-like"
FT /evidence="ECO:0000259|SMART:SM00460"
FT REGION 353..379
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 397..422
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 945 AA; 102312 MW; FA8485169AFC15DB CRC64;
MLGKELQDYA RDCFEGRLAL LDDVTRARAT ATVDASADED EACLLRYLFG TLPLSDVLDV
EPGVLVSYVR HSLMLRRGLG WTRELPEPLF VHFVLCPRVN NEPLTDCRPP LWSELHERVT
GLNEREAVLE INYWCAQMAT YQASDGRTLG PLAMLASGDG RCGEESTFLV SALRSVGIPA
RQIYTPWWAH CDDNHAWVEA YADGGWHYLG ACEPEEALDR GWFTNAAGRA LMMGTTVYSD
YALDQLGGED AGRNGCTHLI GVTPSYTRTV RLAVHVADEG GNPVEGATVC LEILNSAQWA
PATCLVTDAS GEAGVSVGLG GLRVRVTSGG RMAWRTIDTA ELHELDFILG ERGAGDEPVG
AGQAGADPTP GREAGSWQWT SPVGSAEVLT WHDVDVRAPE DHPAPSCRPT PEQLERGRGR
KASVDELRRR RVASFLDEGA QLADGLVARA NREDHARIER FMRLALGNAG EVARFLSGTS
ADDVAAAAQG PLEADRLGLL STLSDKDFRD LRADVLEEQL HGARAVCGRT LGLLAAQGIE
PDEAQRVYER YVLCPRVGLE HLTAWRGPLR GWLQSELDAG ELEAMHGDPR AIWDWLERNV
GFDERQDLAK LAGGPVGALR GRHASPVTRA TLFVALCRCL GHPARVNPES LAPELFEGGR
FVPAQEPPRP KSRRVRLRAT AGGTKSCFVD WTLGRLQAYT ERGGQGSLGF PSLELWGTSA
DEGGLTLELP LGTWRLVSTT RLPNGSQQAS EAIFRLVEGE GELELPLRTR VPEASDMLQD
IPLPELALHA ADGSPACASD ALRRSGAGKA GIVAFLGEAE EPTEHLLNEL REQADQVREA
GLGLLLVCRS PKAYDDPTLM RALDALGSAL VLFDDFDELP ERLARRMYAN PEKLPLMLLV
QPCDKAGEGE AAGATDGDVV FRGLYAVGGY NVGSVALALR LAQLA
//