ID E2SHK8_9FIRM Unreviewed; 477 AA.
AC E2SHK8;
DT 11-JAN-2011, integrated into UniProtKB/TrEMBL.
DT 11-JAN-2011, sequence version 1.
DT 27-MAR-2024, entry version 48.
DE SubName: Full=Collagen triple helix repeat protein {ECO:0000313|EMBL:EFP62981.1};
GN ORFNames=HMPREF0983_00458 {ECO:0000313|EMBL:EFP62981.1};
OS Erysipelotrichaceae bacterium 3_1_53.
OC Bacteria; Bacillota; Erysipelotrichia; Erysipelotrichales;
OC Erysipelotrichaceae.
OX NCBI_TaxID=658659 {ECO:0000313|EMBL:EFP62981.1, ECO:0000313|Proteomes:UP000006223};
RN [1] {ECO:0000313|EMBL:EFP62981.1, ECO:0000313|Proteomes:UP000006223}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=3_1_53 {ECO:0000313|EMBL:EFP62981.1,
RC ECO:0000313|Proteomes:UP000006223};
RG The Broad Institute Genome Sequencing Platform;
RA Ward D., Earl A., Feldgarden M., Young S.K., Pearson M., Zeng Q.,
RA Alvarado L., Berlin A., Bochicchio J., Chapman S.B., Chen Z., Freedman E.,
RA Gellesch M., Goldberg J., Griggs A., Gujja S., Heilman E., Heiman D.,
RA Howarth C., Jen D., Larson L., Mehta T., Neiman D., Park D., Roberts A.,
RA Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S., Thomson T.,
RA Walk T., White J., Yandava C., Allen-Vercoe E., Strauss J., Sibley C.,
RA Daigneault M., Haas B., Nusbaum C., Birren B.;
RT "The Genome Sequence of Erysipelotrichaceae bacterium strain 3_1_53.";
RL Submitted (JUL-2010) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EFP62981.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ACTJ01000010; EFP62981.1; -; Genomic_DNA.
DR AlphaFoldDB; E2SHK8; -.
DR STRING; 658659.HMPREF0983_00458; -.
DR eggNOG; COG5164; Bacteria.
DR HOGENOM; CLU_559898_0_0_9; -.
DR BioCyc; EBAC658659-HMP:GMFE-467-MONOMER; -.
DR Proteomes; UP000006223; Unassembled WGS sequence.
DR Gene3D; 2.60.120.40; -; 1.
DR InterPro; IPR041415; BclA_C.
DR InterPro; IPR008983; Tumour_necrosis_fac-like_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR Pfam; PF18573; BclA_C; 1.
PE 4: Predicted;
KW Collagen {ECO:0000313|EMBL:EFP62981.1}.
FT DOMAIN 340..471
FT /note="BclA C-terminal"
FT /evidence="ECO:0000259|Pfam:PF18573"
FT REGION 40..226
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 256..289
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 55..93
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 130..150
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 166..180
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 477 AA; 46402 MW; F7B8DCB679F9C9F7 CRC64;
MHYEDYDYET SCCDDCCSLS CEAEENPCCC PPGPRGPRGF RGPQGPAGAD GATGATGNTG
ATGATGATGT GDTITIRSTT TAEPGTPAQV HDSGGSNHIL DFVIPRGDTG ATGPTGPTGP
GVGDTGPTGP TGETGATGMT GATGPTGETG ATGPTGAKGD TGETGPTGPT GETGLRGNTG
ATGPTGEPGP TGPTGVTGAT GLRGDTGPTG AMGATGETGP TGATPVVTIG PTVTSEPGTD
ADVASTETAD GVELTFTIPR GDTGPQGEIG PTGETGETGP TGPTGATPVV TIGPTVTSEP
GTDADVASTE TADGIELTFT IPRGDTGPGG GGGLLAYGGK YNDTAQTLNL LIGSQEQLPL
AVDMPASNVD LSPVNALQIQ ESGVYEINYM FNASASVGAS VTLSVRRNGT IIPSTEEQHL
LAVATESIYS GSVIENLSAG DLLDMAVSAL AALTLSLSTG VTVTLSVKRL DDIPANT
//