ID F4X940_9FIRM Unreviewed; 593 AA.
AC F4X940;
DT 28-JUN-2011, integrated into UniProtKB/TrEMBL.
DT 28-JUN-2011, sequence version 1.
DT 27-MAR-2024, entry version 45.
DE RecName: Full=Sel1 repeat family protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=HMPREF0866_00965 {ECO:0000313|EMBL:EGJ48009.1};
OS Ruminococcaceae bacterium D16.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Oscillospiraceae.
OX NCBI_TaxID=552398 {ECO:0000313|EMBL:EGJ48009.1, ECO:0000313|Proteomes:UP000002801};
RN [1] {ECO:0000313|Proteomes:UP000002801}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=D16 {ECO:0000313|Proteomes:UP000002801};
RG The Broad Institute Genome Sequencing Platform;
RA Ward D., Earl A., Feldgarden M., Gevers D., Young S., Zeng Q., Koehrsen M.,
RA Alvarado L., Berlin A.M., Borenstein D., Chapman S.B., Chen Z., Engels R.,
RA Freedman E., Gellesch M., Goldberg J., Griggs A., Gujja S., Heilman E.R.,
RA Heiman D.I., Hepburn T.A., Howarth C., Jen D., Larson L., Mehta T.,
RA Park D., Pearson M., Richards J., Roberts A., Saif S., Shea T.D.,
RA Shenoy N., Sisk P., Stolte C., Sykes S.N., Walk T., White J., Yandava C.,
RA Sibley C.D., White A.P., Crowley S., Surette M.G., Strauss J.C.,
RA Ambrose C.E., Allen-Vercoe E., Haas B., Nusbaum C., Birren B.;
RT "The Genome Sequence of Clostridium sp. D5.";
RL Submitted (MAR-2010) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:EGJ48009.1, ECO:0000313|Proteomes:UP000002801}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=D16 {ECO:0000313|EMBL:EGJ48009.1,
RC ECO:0000313|Proteomes:UP000002801};
RG The Broad Institute Genomics Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Sibley C.D., White A.P.,
RA Crowley S., Surette M.G., Strauss J.C., Ambrose C.E., Allen-Vercoe E.,
RA Walker B., Young S., Zeng Q., Gargeya S., Fitzgerald M., Haas B.,
RA Abouelleil A., Allen A.W., Alvarado L., Arachchi H.M., Berlin A.M.,
RA Chapman S.B., Gainer-Dewar J., Goldberg J., Griggs A., Gujja S., Hansen M.,
RA Howarth C., Imamovic A., Ireland A., Larimer J., McCowan C., Murphy C.,
RA Pearson M., Poon T.W., Priest M., Roberts A., Saif S., Shea T., Sisk P.,
RA Sykes S., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Ruminococcaceae bacterium D16.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EGJ48009.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADDX02000002; EGJ48009.1; -; Genomic_DNA.
DR AlphaFoldDB; F4X940; -.
DR STRING; 552398.HMPREF0866_00965; -.
DR eggNOG; COG0790; Bacteria.
DR HOGENOM; CLU_011901_1_0_9; -.
DR OrthoDB; 1775746at2; -.
DR Proteomes; UP000002801; Unassembled WGS sequence.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 1.
DR InterPro; IPR041073; MobL.
DR InterPro; IPR048102; MobP3.
DR InterPro; IPR006597; Sel1-like.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR NCBIfam; NF041499; MobP3; 1.
DR PANTHER; PTHR11102:SF147; FIBRONECTIN TYPE-II DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR11102; SEL-1-LIKE PROTEIN; 1.
DR Pfam; PF18555; MobL; 1.
DR Pfam; PF08238; Sel1; 4.
DR SMART; SM00671; SEL1; 4.
DR SUPFAM; SSF81901; HCP-like; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000002801}.
FT REGION 571..593
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 593 AA; 67946 MW; 0EE7BC10F038B570 CRC64;
MRGLIQKSGY IKPGSGGGHY AEYIATRDGV EVIEPAAGGY LEYMAERPRS HGLFSADGAA
DLEQTMEEIN AHAGPVWTFV YSLKREDAAR LGYENGESWR RLLLAHQTEL ATAMKIPPSS
FRWCAAFHDE KHHPHIHMMA WSNNPKQGYL TERGIEQMRS QLSNDIFQDE LLSLYQEKDF
SYQEVRDAAM EAMGRLIREM KSNLCDSPVL AGQMETLSEM LSETKGKKVY GYLKKPVKAQ
VDAIVDELAK LPEVAECYDR WNQLRDELER YYKDTPREHK PLSQQQEFKA IKNMVIREAG
NLQLGVFTFE DAQMKDEVDE DQDAVLHAWS SRWEMAEAYQ NAKEILSVYE NTEEEKAEQV
RVLERLWDAG FTVAAHQLGK CWRDGLGVLP DDEKAELWLQ RSAEAGHDFS QYALGKLLQR
QKRIDEAISW YEKAAEQDNP YSAYQLGKLY LQGEQVPKDV AKALEYLTQA AEQGSQYAQY
TLGKLYLMGE DMSQDREQAY SWLWESASQG NEYAQFLLDH LSDSHRPNVL LAVTNLLHHI
GRIFQDNSIP PSLPASQQAD RKYRRQIQQK RIALGHKPND HEETQNQGSM TMG
//