ID K8GI01_9CYAN Unreviewed; 2741 AA.
AC K8GI01;
DT 06-FEB-2013, integrated into UniProtKB/TrEMBL.
DT 06-FEB-2013, sequence version 1.
DT 24-JAN-2024, entry version 47.
DE SubName: Full=Pre-peptidase C family protein {ECO:0000313|EMBL:EKQ68210.1};
GN ORFNames=OsccyDRAFT_2733 {ECO:0000313|EMBL:EKQ68210.1};
OS Leptolyngbyaceae cyanobacterium JSC-12.
OC Bacteria; Cyanobacteriota; Cyanophyceae; Leptolyngbyales; Leptolyngbyaceae.
OX NCBI_TaxID=864702 {ECO:0000313|EMBL:EKQ68210.1, ECO:0000313|Proteomes:UP000001332};
RN [1] {ECO:0000313|EMBL:EKQ68210.1, ECO:0000313|Proteomes:UP000001332}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JSC-12 {ECO:0000313|EMBL:EKQ68210.1,
RC ECO:0000313|Proteomes:UP000001332};
RG DOE Joint Genome Institute;
RA Brown I., Huntemann M., Wei C.-L., Han J., Detter J.C., Han C., Tapia R.,
RA Chen A., Kyrpides N., Mavromatis K., Markowitz V., Szeto E., Ivanova N.,
RA Mikhailova N., Ovchinnikova G., Pagani I., Pati A., Goodwin L.,
RA Nordberg H.P., Cantor M.N., Hua S.X., Woyke T.;
RT "Improved high quality draft of Oscillatoriales sp. JSC-12.";
RL Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EKQ68210.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AJUB01000011; EKQ68210.1; -; Genomic_DNA.
DR STRING; 864702.OsccyDRAFT_2733; -.
DR PATRIC; fig|864702.5.peg.2938; -.
DR eggNOG; COG1404; Bacteria.
DR eggNOG; COG1520; Bacteria.
DR eggNOG; COG3898; Bacteria.
DR HOGENOM; CLU_228040_0_0_3; -.
DR OrthoDB; 465541at2; -.
DR Proteomes; UP000001332; Chromosome.
DR GO; GO:0016020; C:membrane; IEA:InterPro.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0007156; P:homophilic cell adhesion via plasma membrane adhesion molecules; IEA:InterPro.
DR Gene3D; 2.60.120.380; -; 2.
DR Gene3D; 2.60.40.3440; -; 1.
DR InterPro; IPR002126; Cadherin-like_dom.
DR InterPro; IPR039005; CSPG_rpt.
DR InterPro; IPR028994; Integrin_alpha_N.
DR InterPro; IPR007280; Peptidase_C_arc/bac.
DR InterPro; IPR040853; RapA2_cadherin-like.
DR NCBIfam; NF012211; tand_rpt_95; 2.
DR PANTHER; PTHR45739; MATRIX PROTEIN, PUTATIVE-RELATED; 1.
DR PANTHER; PTHR45739:SF8; TNFR-CYS DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF17963; Big_9; 2.
DR Pfam; PF16184; Cadherin_3; 12.
DR Pfam; PF17803; Cadherin_4; 2.
DR Pfam; PF04151; PPC; 2.
DR SUPFAM; SSF89260; Collagen-binding domain; 2.
DR SUPFAM; SSF69318; Integrin alpha N-terminal domain; 1.
DR PROSITE; PS50268; CADHERIN_2; 2.
DR PROSITE; PS51854; CSPG; 12.
PE 4: Predicted;
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Reference proteome {ECO:0000313|Proteomes:UP000001332};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 1118..1220
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 1569..1697
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
SQ SEQUENCE 2741 AA; 284934 MW; 5C9657FBBBCF2E86 CRC64;
MPFHGVIAAM PDSIGNTLND AQSIIIGTAT KRFSDSVEFG DNDYFRFTLN SSSGFSLTLF
GLSANADVEI LNSSGNLVTS NTVPLRSTNE GTLTESINAI LDAGTYFIRV FPGPPADPAN
PAGTTPSTNY TLDVRADNGI TNEIVWRYYA ANVATNGIWR FDGTTFLSGE ALNPSTPDAL
WAIVGTGDFN NDNSYDLLWR YYGTLPELQG VNGIWLLDNG TLTTFFALNP DPDLSWQIRG
VGNFDGGIGK PDIIWHNPTT GAIRVWFLDD AYQTTAVTFL DRGLTGWELQ AVGDFNGDGN
TDLVWRSGAL NGIWYLNGTN FVSAELIIQE FDTNKQIQGA GDFNSDGSPD LLWRNFATGE
NEIWLMEGTS RTSIVPLPTV FDPAWRAITP FERRDPVALA DLAGNQIPTA FRIGPLNGSG
VYRDAIAVGD ADDFYQFSLG SQTRLNLTLD GYGTNSLLGN LNVQILSGTG VVISQSVNGG
TSPETIANLD LNPGTYFIRV FAGDAGAASP YDLNLSVNNL PVLVSSGPLT VSEGQTQTLS
NTLLLVTDGN DPANRLTYTY VSTFQNGNLL SNGQALIANS TFTQADINAG RIAYQQNGGE
GALDTFVFAV SDGRGGTIPN TTFTVNVTPV NDPPVLVSLS PITVSENSLV TLSNTSLLVT
DVEQTPSQLV YSLNSLPVNG TLSFLTGPVL GPLTLGSQFT QADINSGRIG YRQNGSETTA
DRFTFTVTDG AGGFLNPQIQ TLSINITPVN DPPVLVTNVP LTVSQAGPNI ISTAFLSATD
AELTTPAQQD QIIFSVTQLP TQGTLFLGGT AITTPFTFSQ ADLNAGVLTY AQSGTPVNSD
RFNFRLSDGT ATVPATGDFT YEIFVQRVAG PPVLATIAPL TASEGVATVI NSTLLQVTDP
DSAPPFITYS LASLPSVGSI LKAGTALTVG QTFTQADIDQ GRIAYLQNGS EQPTFNDAFT
FTFTDERGQG PATTRTFSIS ILPVNDLPTI LTPTPQATVT EGFGIDITAG LLNATDPDNL
PSQLTYQIIS APTNGSLVRS GTVVTSFTQA DINGGQIKYL QDGTESTADA FTFTVTDLSG
TPVGPNTFNI NVIPFNDPPG LAVFNPITLD EGETYTFSSI TDLQITDIDG PGPLTYTVGT
LPANGVLRVG NLTLTSGGVF SQADINNGQL FYIHNGSETT SDRFTFTASD GATTGLGAPG
LLGTRTLSIN VLPVNDSPLL TSNNILTLSE GATGSIRNTL LSAFDPDNLP AQLTYTLSAP
PAYGTLLSAG TAVTSFTQAA LNANQISYQH NGSETTLDSF IFDVSDGSAS LPGGSATFTI
AITPVNDSPT LLSNAGLTLD EGGTGAIPDT VLLVTDPDGP SPSVIFTLGA APAQGVLLNN
TVTLSAGQTF TQADISNGLL SYIHNGSETT SDRFTFTASD GSTGVLSLRT FSITVNPVND
SPIITIPTTS ASTVSVDEDV TFTFAGANRV SITDVDGGPT FNASFSTSNG GTLNLSATPG
LTNNNSSNVT YTGPLSGLNT ALNNFRYTGV QNFNGVELLT ISVSDGNGGV DTKTITINVV
PVNDAPTLTL PSSSITINED TPTTALGLLV NDVDADVNPL RVTLSATNGA LTVNDNGALT
FLSGTANGGS SVIFTGTLPD IQTALSGLVY QGRQDYFGSD RIIVTVDDQG ATGRPGPLSV
TRTISVNVLS VNDKPTFTGG ADQFVVEDSG QQIIPNWATN ISRGAANEAS QSIGFAITSS
NPTLLSELFT STPSISPTTG NLTFTPRADA NGTIQLTAVL QDNGGTANGG NDTSDPFIFT
IAVGQRNDAP TFVRGSNVTI NEDPDPGNGN TVVISNWATN IRPGPLTPAA NEISQGLNFL
IDTNNPALFA APPTITISGP AGNQVGALTF APNPNANGTA VVTVRLQDDG GTNFGGQDTS
PPQTFTIVVR PVNDAPTFTP LVTTSIDILE DAPQQAIQFA TDILAGPPNE SSQTVSFLLS
NSNPSLFAAT NGGIAPTIDP STGILTFRPA TNAFGSAVLT ATLRDNGGTT NGGIDSSVPY
VFTINVLPVN DAPSFTRGAN QTVNEDAPAQ TIVNWATGIS PGPNEAGQVV AFQVSNDNNA
LFSVQPTISS TGTLTYTPAP NAFGTATVTV SLVDNGGTAN GGVDTSAPQI FTIIVNPVND
TPTLTLPSAQ VTAEDTPINF TGPIGIFLTD IDSGSNPIDV TFRVNNGTIN LPSTNGLTVS
SGANGSSSVT FRGTVSDLIA AIASVVYTPN SNFSGSDTLT VTVNDRGFFG SGGARTAVGT
VGITITAVND PPELVTLNSL LLEEGGNRTI SNSLLRTTDT DNTAAQLIYT LVSLPSSGVL
RLQSGATFTT IGLNGTFTQA DIDANRLNYL HNGSETTSDS FSFRVSDGQI TLSDSIFNIN
IIPVNDAPGL SVNAPLTLSE GEISTISSSL LQFTDNDNTS AQLTYTITSA PINGTLRLGG
TDLTQGSTFT QEQLNSSLLT YEHNGNETTA DGFNFSLGDG TTSVPGSFNI SVIPVNDIPS
VISSGPLTVN EGASVPVLST VLNTTDPDNP PTQVVYTLAG GPGFGTLLLN STTTLTNGST
FTQAQINSGA ITYRHNGSEN LSDAFFFNVS DGQAAPTSGI VNINVNPVND APVLVRNTGL
TLPVGVPSSR AISNSQLFAT DVDNTVGQIL YRLTVLPSAG TLRLGTGALA VGQTFSQQDI
NQGRVVYQYS GTGTSDGFQF ALVDTNGGAG GTGFFQISFT S
//