GenomeNet

Database: UniProt
Entry: E7GMM3_CLOSY
LinkDB: E7GMM3_CLOSY
Original site: E7GMM3_CLOSY 
ID   E7GMM3_CLOSY            Unreviewed;       465 AA.
AC   E7GMM3;
DT   05-APR-2011, integrated into UniProtKB/TrEMBL.
DT   05-APR-2011, sequence version 1.
DT   24-JAN-2024, entry version 35.
DE   RecName: Full=BclA C-terminal domain-containing protein {ECO:0000259|Pfam:PF18573};
GN   ORFNames=HMPREF9474_02168 {ECO:0000313|EMBL:EGA93965.1};
OS   [Clostridium] symbiosum WAL-14163.
OC   Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae.
OX   NCBI_TaxID=742740 {ECO:0000313|EMBL:EGA93965.1, ECO:0000313|Proteomes:UP000002970};
RN   [1] {ECO:0000313|EMBL:EGA93965.1, ECO:0000313|Proteomes:UP000002970}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=WAL-14163 {ECO:0000313|EMBL:EGA93965.1,
RC   ECO:0000313|Proteomes:UP000002970};
RA   Earl A., Ward D., Feldgarden M., Gevers D., Finegold S.M., Summanen P.H.,
RA   Molitoris D.R., Vaisanen M.L., Daigneault M., Young S.K., Zeng Q.,
RA   Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Alvarado L.,
RA   Arachchi H.M., Berlin A., Brown A., Chapman S.B., Chen Z., Dunbar C.,
RA   Freedman E., Gearin G., Gellesch M., Goldberg J., Griggs A., Gujja S.,
RA   Heilman E., Heiman D., Howarth C., Larson L., Lui A., MacDonald P.J.P.,
RA   Mehta T., Montmayeur A., Murphy C., Neiman D., Pearson M., Priest M.,
RA   Roberts A., Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S.,
RA   White J., Yandava C., Nusbaum C., Birren B.;
RT   "The Genome Sequence of Clostridium symbiosum strain WAL-14163.";
RL   Submitted (DEC-2010) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EGA93965.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ADLQ01000049; EGA93965.1; -; Genomic_DNA.
DR   RefSeq; WP_003500294.1; NZ_GL834309.1.
DR   AlphaFoldDB; E7GMM3; -.
DR   STRING; 1512.GCA_900049235_02342; -.
DR   eggNOG; COG2931; Bacteria.
DR   HOGENOM; CLU_001074_1_7_9; -.
DR   Proteomes; UP000002970; Unassembled WGS sequence.
DR   Gene3D; 2.60.120.40; -; 1.
DR   InterPro; IPR041415; BclA_C.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR008983; Tumour_necrosis_fac-like_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1086; CUTICLE COLLAGEN 99-RELATED; 1.
DR   Pfam; PF18573; BclA_C; 1.
DR   Pfam; PF01391; Collagen; 4.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000002970}.
FT   DOMAIN          338..464
FT                   /note="BclA C-terminal"
FT                   /evidence="ECO:0000259|Pfam:PF18573"
FT   REGION          1..40
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          74..141
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          172..273
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          294..323
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        11..38
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        126..140
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   465 AA;  42737 MW;  6A05D8C64C8D995D CRC64;
     MNYDDMNNLR CGNPDMSSSS GQEENQSLRC QGNSTGDYSE RRCCPGPGRV CCQGPTGPRG
     CPGPMGPRGC PGPMGPRGCP GEPGARGPQG PRGATGARGP QGIPGATGPQ GPQGIQGLRG
     ATGATGATGR TPTVTVAGTI TGEPGTPAAV YENVTPSGVE LLFTIPAGAT GPTGPTGATG
     AAGTAGATGP TGPTGATGAA GTAGATGPTG PTGATGAAGT AGATGATGPT GATGAAGAAG
     PTGPTGATGA AGTAGATGPT GPTGATGAAG TAGATGPTGA TGAAGTAGAT GPTGATGATG
     ATGPTGAAGP TGATGAAGAT GATGPTGATG TSVTANSMNA TNSTGDTITV ILGGTAVPLP
     DFQVLDGFTT TAQNTTFTVP ATGTYMVSYR VSTTAALLLS TRVLRNGTPL GGSTFTPALS
     VSSFAATTFA ALTAGDTLTL QLFGLVGAAV LQSGNGATLT VIRLV
//
DBGET integrated database retrieval system