ID H1PL80_9FIRM Unreviewed; 1202 AA.
AC H1PL80;
DT 21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT 21-MAR-2012, sequence version 1.
DT 24-JAN-2024, entry version 42.
DE RecName: Full=DUF11 domain-containing protein {ECO:0008006|Google:ProtNLM};
DE Flags: Fragment;
GN ORFNames=HMPREF0380_00938 {ECO:0000313|EMBL:EHO85546.1};
OS Eubacterium infirmum F0142.
OC Bacteria; Bacillota; Clostridia; Eubacteriales;
OC Eubacteriales Family XIII. Incertae Sedis.
OX NCBI_TaxID=883109 {ECO:0000313|EMBL:EHO85546.1, ECO:0000313|Proteomes:UP000004504};
RN [1] {ECO:0000313|EMBL:EHO85546.1, ECO:0000313|Proteomes:UP000004504}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=F0142 {ECO:0000313|EMBL:EHO85546.1,
RC ECO:0000313|Proteomes:UP000004504};
RG The Broad Institute Genome Sequencing Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Izard J., Ganesan A.,
RA Baranova O.V., Blanton J.M., Tanner A.C., Dewhirst F.E., Young S.K.,
RA Zeng Q., Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Alvarado L.,
RA Arachchi H.M., Berlin A., Chapman S.B., Gearin G., Goldberg J., Griggs A.,
RA Gujja S., Hansen M., Heiman D., Howarth C., Larimer J., Lui A.,
RA MacDonald P.J.P., McCowen C., Montmayeur A., Murphy C., Neiman D.,
RA Pearson M., Priest M., Roberts A., Saif S., Shea T., Sisk P., Stolte C.,
RA Sykes S., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Eubacterium infirmum F0142.";
RL Submitted (DEC-2011) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGWI01000013; EHO85546.1; -; Genomic_DNA.
DR AlphaFoldDB; H1PL80; -.
DR STRING; 883109.HMPREF0380_00938; -.
DR eggNOG; COG1361; Bacteria.
DR eggNOG; COG3087; Bacteria.
DR HOGENOM; CLU_270699_0_0_9; -.
DR Proteomes; UP000004504; Unassembled WGS sequence.
DR Gene3D; 2.60.40.740; -; 5.
DR Gene3D; 2.60.530.10; Major cell-surface adhesin PAc; 1.
DR InterPro; IPR041324; AgI/II_N.
DR InterPro; IPR047589; DUF11_rpt.
DR InterPro; IPR013574; Glucan-bd_C/Surface_Ag-I/II_V.
DR InterPro; IPR036234; SA_I/II_PAC_V_sf.
DR NCBIfam; TIGR01451; B_ant_repeat; 5.
DR PANTHER; PTHR34819:SF3; CELL SURFACE PROTEIN-RELATED; 1.
DR PANTHER; PTHR34819; LARGE CYSTEINE-RICH PERIPLASMIC PROTEIN OMCB; 1.
DR Pfam; PF18652; Adhesin_P1_N; 1.
DR Pfam; PF08363; GbpC; 1.
DR SUPFAM; SSF74914; V-region of surface antigen I/II (SA I/II, PAC); 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000004504};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..34
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 35..1202
FT /note="DUF11 domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003553167"
FT DOMAIN 34..144
FT /note="Antigen I/II N-terminal"
FT /evidence="ECO:0000259|Pfam:PF18652"
FT DOMAIN 232..424
FT /note="Glucan-binding protein C/Surface antigen I/II V-
FT domain"
FT /evidence="ECO:0000259|Pfam:PF08363"
FT REGION 826..864
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 89..154
FT /evidence="ECO:0000256|SAM:Coils"
FT NON_TER 1202
FT /evidence="ECO:0000313|EMBL:EHO85546.1"
SQ SEQUENCE 1202 AA; 131992 MW; 2EA29BF08D1DDD50 CRC64;
MGPNIKVASK RLFTIVLSFL VLFSTMAPTL QAFAADSTKG IETKVDRTEL DNVVQSAKDV
GVPIQKDSDL DMGVATTKSE VDAKVAEIKQ DYENQIKAIK AEIEKKKECD KKQKEYEEKL
AKYNQELEKY NKDMEKYKKE VLAYNQAIAE LENHLHEDGY LSQPIGQSLV FKSEPNATVS
ISNGTVYSEK QLDSLVRSWG FGPGDWGYAY FEQLNKGLPG TPQHLVSGDL RTVFELGKTT
TVTYTNLQNS TMNGKKIAKV VFKYTVKNTT RLPGKVPVFI KQDPTATIWY TAFFGDTTIG
VDVQFFDEDD KPMALDGALL SFASLNRGDL PKYFNYNNSI EKVQNFNGQF MEINGSTIKN
HSGSAYSDTN NAYLEDGSRF ERDVWDTETS EYSWYGAIVG KVSGKRISYD MTGVYKGNVW
FSLNSNIRAK NIPVKPIKPV PPVAPTPPQC PNIQANYHYD ILYYQPAVEK KVTDDNNSDI
NNNTVLKDSV VKFILNVADL PAGHEKIDSL VFTDKLPTGY KADLVTTKQS SPDYDVNYDE
GTNVITFSAK SDYLNTINAN LNTVAKIAAP VVVGKVTKAG VTYKNDFTLT INNDYSVKSK
PVKVHTPSEP KKDVFKGDET SSINGKVVNP GDVLRYEITY KNTTGTKQNV IITDKVPKYT
KYLSSDNSGR ESGGVVRWEN EVENGKTWTV SFKVKVNDDI NGKPVDNISH VKDDFNESDT
NETHNPTPTG PKKEVLKFGT TTNIDGKRVE PEQKLTYAIT YENTTGKAVK ATITDKIPAH
TKFVNAENGG TESGGVVKWT VDVAKDQKVT VKFTVKVDKD VNGEPIDNIA RVNDGVNDYD
TNETHNPTPT EPKKEVFKGG TTTNIDGKRV EPGQDLTYAI TYKNTTGNDV NATITDKLPK
HTSFVSAENG GAELGGVVTW NVAVAKDQSV TVKFTVKVDV NVNGAPIDNV AKVNDGVNDY
KTNETHNPTS TEPKKEVFKG STTTNIDGKR VEPGQKLTYA ITYKNTTGND VNATITDKIP
AHTKFVSADN GGAETGGIVK WNVAVAKDQS ITVKFTVKVD VNVNGAPIDN VAKVNDGVND
YKTNETHNPT PTEPKKEVFK GGTTTKIDGK LVQPEEELTY AITYKNTTGK DVNATITDKI
PAHTKFVSAD NGGAETGGVV KWTVAVAKDQ SVTVKFIVKV DVNVNGAPID NIARVNDGTN
EF
//