ID Q4C0S0_CROWT Unreviewed; 470 AA.
AC Q4C0S0;
DT 13-SEP-2005, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2005, sequence version 1.
DT 27-MAR-2024, entry version 72.
DE SubName: Full=TPR repeat:TPR repeat {ECO:0000313|EMBL:EAM49741.1};
GN ORFNames=CwatDRAFT_2894 {ECO:0000313|EMBL:EAM49741.1};
OS Crocosphaera watsonii WH 8501.
OC Bacteria; Cyanobacteriota; Cyanophyceae; Oscillatoriophycideae;
OC Chroococcales; Aphanothecaceae; Crocosphaera.
OX NCBI_TaxID=165597 {ECO:0000313|EMBL:EAM49741.1, ECO:0000313|Proteomes:UP000003922};
RN [1] {ECO:0000313|EMBL:EAM49741.1, ECO:0000313|Proteomes:UP000003922}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=WH 8501 {ECO:0000313|EMBL:EAM49741.1,
RC ECO:0000313|Proteomes:UP000003922};
RG DOE Joint Genome Institute;
RL Submitted (FEB-2004) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:EAM49741.1, ECO:0000313|Proteomes:UP000003922}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=WH 8501 {ECO:0000313|EMBL:EAM49741.1,
RC ECO:0000313|Proteomes:UP000003922};
RG US DOE Joint Genome Institute (JGI-ORNL);
RA Larimer F., Land M.;
RT "Annotation of the draft genome assembly of Crocosphaera watsonii WH
RT 8501.";
RL Submitted (JUN-2005) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:EAM49741.1, ECO:0000313|Proteomes:UP000003922}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=WH 8501 {ECO:0000313|EMBL:EAM49741.1,
RC ECO:0000313|Proteomes:UP000003922};
RG US DOE Joint Genome Institute (JGI-PGF);
RA Copeland A., Lucas S., Lapidus A., Barry K., Detter C., Glavina T.,
RA Hammon N., Israni S., Pitluck S., Richardson P.;
RT "Sequencing of the draft genome and assembly of Crocosphaera watsonii WH
RT 8501.";
RL Submitted (JUN-2005) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EAM49741.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AADV02000054; EAM49741.1; -; Genomic_DNA.
DR AlphaFoldDB; Q4C0S0; -.
DR KEGG; cwa:CwatDRAFT_2894; -.
DR Proteomes; UP000003922; Unassembled WGS sequence.
DR CDD; cd05483; retropepsin_like_bacteria; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 3.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR034122; Retropepsin-like_bacterial.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR InterPro; IPR019734; TPR_repeat.
DR PANTHER; PTHR44858; TETRATRICOPEPTIDE REPEAT PROTEIN 6; 1.
DR PANTHER; PTHR44858:SF1; UDP-N-ACETYLGLUCOSAMINE--PEPTIDE N-ACETYLGLUCOSAMINYLTRANSFERASE SPINDLY-RELATED; 1.
DR Pfam; PF13975; gag-asp_proteas; 1.
DR Pfam; PF13432; TPR_16; 2.
DR Pfam; PF13181; TPR_8; 3.
DR SMART; SM00028; TPR; 6.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF48452; TPR-like; 1.
DR PROSITE; PS50005; TPR; 3.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000003922};
KW TPR repeat {ECO:0000256|PROSITE-ProRule:PRU00339}.
FT REPEAT 185..218
FT /note="TPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
FT REPEAT 254..287
FT /note="TPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
FT REPEAT 288..321
FT /note="TPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
FT COILED 327..354
FT /evidence="ECO:0000256|SAM:Coils"
SQ SEQUENCE 470 AA; 52919 MW; 3438482EE7EA6E1D CRC64;
MITLINILLS RSFGNLRALK IISDWWIIFA KLHYYEAAIA KLDPLLHADP SQAKLWYAKA
LALVNLQRFG EAVTSAQLSV QLNPTFAPGY QLLGNACTQL EDKRGAISAY KQAAHCYLDQ
GDKKQAQTCL DKLKALGPQF IPNEQKSFQE SQEFFQKISV GAESGDHEAA LNNLNWLLNF
DPKNAEALAK RGLVQAKRRN YSAALADINL ALQLCPNDLN LRLQRGKIRL WLNDAEGAMA
DFSALLETEW GDTSEIYCLR SQAYQQLNDL DSAFQDLANA LYINPENSEC YRVRGDIYRG
LKDWEEAISN YRRAISLSLE QGNQFNYKKL QAKIAEIERK IEREKQEASR IIRVPIKSRH
GGTPIIEVLF NDCVTCDMVL DTGAGIVCVP EEIARSLNIV FIGNQPLRVA DGSVTNAPIG
YVRSVAIQQA KAENLEVAIL PRYSTGLLGQ NYLWQYDVRI LQTEVELYLR
//