Database: UniProt
Entry: A0A1Y3XFN9_9ACTN
ID   A0A1Y3XFN9_9ACTN        Unreviewed;       472 AA.
AC   A0A1Y3XFN9;
DT   30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT   30-AUG-2017, sequence version 1.
DT   13-SEP-2023, entry version 18.
DE   RecName: Full=Transglutaminase-like domain-containing protein {ECO:0000259|SMART:SM00460};
GN   ORFNames=B5G02_09695 {ECO:0000313|EMBL:OUN84372.1};
OS   [Collinsella] massiliensis.
OC   Bacteria; Actinomycetota; Coriobacteriia; Coriobacteriales;
OC   Coriobacteriaceae; Enorma.
OX   NCBI_TaxID=1232426 {ECO:0000313|EMBL:OUN84372.1, ECO:0000313|Proteomes:UP000195781};
RN   [1] {ECO:0000313|Proteomes:UP000195781}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=An5 {ECO:0000313|Proteomes:UP000195781};
RA   Medvecky M., Cejkova D., Polansky O., Karasova D., Kubasova T., Cizek A.,
RA   Rychlik I.;
RT   "Function of individual gut microbiota members based on whole genome
RT   sequencing of pure cultures obtained from chicken caecum.";
RL   Submitted (APR-2017) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:OUN84372.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; NFIE01000031; OUN84372.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A1Y3XFN9; -.
DR   Proteomes; UP000195781; Unassembled WGS sequence.
DR   Gene3D; 3.10.620.30; -; 1.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR002931; Transglutaminase-like.
DR   Pfam; PF01841; Transglut_core; 1.
DR   SMART; SM00460; TGc; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000195781}.
FT   DOMAIN          276..339
FT                   /note="Transglutaminase-like"
FT                   /evidence="ECO:0000259|SMART:SM00460"
FT   REGION          38..125
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        100..117
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   472 AA;  51434 MW;  BB328E0D9B4F50B7 CRC64;
     MATYEISADS IKRAFGRLAR TTDAAVRGAV DAVRKLEGEA TAGQAASDAQ AMGMGQAPGE
     ASAQVADAQV PSAPVSAVPD PEPTTREAPV ATATPVEPAA SPKPDTPAIP VEPTASPKPD
     ASPATRPLAE YAHPGCCTFE RDRLGTSARR AYQAIRDGVL AFRPHIRLFG VTEAEIDDAY
     EAMRRSTPEV FWLDGYSLQC ISQTGRIWEL EPSYRIDRDE AARLLAEMEE RSRPLIDALS
     QLPAPEQRVQ AAHNALILNA VYSDTGDPFE YTAVGALVRG KAVCSGLAYA FKYLMDRLEV
     PCLIVRGTAA STPSWDDPER HSWNLVELDG RWTHVDVTYD LGFSPAKRYP HLAYLGVSDE
     EIAPTHKWER DALPAATLSL GCYRRRGRCV SGWDELAGLF DLTLARDRRC AFQLDERLTR
     SGIEPGAPLD PELIAQKIYD IAGASIRGSI TEPVTLALIR DDVMRVYEVT VE
//