GenomeNet

Database: UniProt
Entry: A0A1G1C0G6_9BACT
ID   A0A1G1C0G6_9BACT        Unreviewed;       685 AA.
AC   A0A1G1C0G6;
DT   15-FEB-2017, integrated into UniProtKB/TrEMBL.
DT   15-FEB-2017, sequence version 1.
DT   31-JUL-2019, entry version 8.
DE   RecName: Full=Hepar_II_III_N domain-containing protein {ECO:0000259|Pfam:PF16889};
GN   ORFNames=A3K19_15255 {ECO:0000313|EMBL:OGV88449.1};
OS   Lentisphaerae bacterium RIFOXYB12_FULL_65_16.
OC   Bacteria; Lentisphaerae; unclassified Lentisphaerae (miscellaneous).
OX   NCBI_TaxID=1798581 {ECO:0000313|EMBL:OGV88449.1, ECO:0000313|Proteomes:UP000179250};
RN   [1] {ECO:0000313|EMBL:OGV88449.1, ECO:0000313|Proteomes:UP000179250}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=27774985; DOI=10.1038/ncomms13219;
RA   Anantharaman K., Brown C.T., Hug L.A., Sharon I., Castelle C.J.,
RA   Probst A.J., Thomas B.C., Singh A., Wilkins M.J., Karaoz U.,
RA   Brodie E.L., Williams K.H., Hubbard S.S., Banfield J.F.;
RT   "Thousands of microbial genomes shed light on interconnected
RT   biogeochemical processes in an aquifer system.";
RL   Nat. Commun. 7:13219-13219(2016).
CC   -!- CAUTION: The sequence shown here is derived from an
CC       EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is
CC       preliminary data. {ECO:0000313|EMBL:OGV88449.1}.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   -----------------------------------------------------------------------
DR   EMBL; MHBP01000208; OGV88449.1; -; Genomic_DNA.
DR   Proteomes; UP000179250; Unassembled WGS sequence.
DR   GO; GO:0016829; F:lyase activity; IEA:InterPro.
DR   Gene3D; 1.50.10.100; -; 1.
DR   InterPro; IPR008929; Chondroitin_lyas.
DR   InterPro; IPR012480; Hepar_II_III.
DR   InterPro; IPR031680; Hepar_II_III_N.
DR   Pfam; PF07940; Hepar_II_III; 1.
DR   Pfam; PF16889; Hepar_II_III_N; 1.
DR   SUPFAM; SSF48230; SSF48230; 1.
PE   4: Predicted;
KW   Complete proteome {ECO:0000313|Proteomes:UP000179250};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL        1     20       {ECO:0000256|SAM:SignalP}.
FT   CHAIN        21    685       Hepar_II_III_N domain-containing protein.
FT                                {ECO:0000256|SAM:SignalP}.
FT                                /FTId=PRO_5009572899.
FT   DOMAIN      117    380       Hepar_II_III_N. {ECO:0000259|Pfam:
FT                                PF16889}.
SQ   SEQUENCE   685 AA;  75407 MW;  F1795D8DF8F53CED CRC64;
     MQTFSLCLVA GLASVWVAVA AADDAAVLAP DPAARPLLSR ADLPELKKAD RARIFAVLGS
     FDDRGLPEEA FQDAAFRLLA PQRVPGLADA LAAGDRAAAL DAVLKACRGT RPVPPKAKIG
     ATTLAAADDA IENRFSFYGE KHQLPADINW DFNPGTAHWG HDLNRFSYLN ALTQAYVATG
     DSRYSRKAVG LILDWIAKCD MGKCFTGTPY IWGSYLNNAI HCQGWSNCLV TLLACEPAGQ
     VTPAELLRVL KSLHDQIAYL EIVTNGHSGN WPTIGCQGML DTLVTLPVLR DMDRFVDYCS
     RTVKSQVADQ VLPDGVQDEL TPHYHRVVVS NLLTTLRSLR AVGRDLDTDT LQTLRKMLHY
     VQQTTVPDGS KEAGFNDSDP GCPGNYRRTL AGLGLEEFLS PPEQLGPEVF PYAGVAFLRQ
     RQDLGDLYLA FDAGPYGRGH QHEDRLGFWL FAYGRNLLVD PGRHLYDSSE RSYYSYLRST
     TAHSTIRIDG QDQHSAGCRD TWIAKQPLDL DWRVQDGEVR AAGVYDLGYG KDNKIAVVHH
     REIVFVKERF WVVFDTVTGE GEHRIESRFQ FAPGTVQLEG TTARTAFPDA NLLLIAAPMQ
     PFADTHIEQG QEKPRGGWYS DSYGKIEPAP ALSQSLTTVL PWHAATLLFP YRGSEPPAVT
     FTFDGHTARI QHPDVGDVSV VCSLP
//