ID A0A1R3I1S7_COCAP Unreviewed; 1384 AA.
AC A0A1R3I1S7;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE RecName: Full=allantoinase {ECO:0000256|ARBA:ARBA00012863};
DE EC=3.5.2.5 {ECO:0000256|ARBA:ARBA00012863};
GN ORFNames=CCACVL1_15655 {ECO:0000313|EMBL:OMO76451.1};
OS Corchorus capsularis (Jute).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Grewioideae; Apeibeae; Corchorus.
OX NCBI_TaxID=210143 {ECO:0000313|EMBL:OMO76451.1, ECO:0000313|Proteomes:UP000188268};
RN [1] {ECO:0000313|EMBL:OMO76451.1, ECO:0000313|Proteomes:UP000188268}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. CVL-1 {ECO:0000313|Proteomes:UP000188268};
RC TISSUE=Whole seedling {ECO:0000313|EMBL:OMO76451.1};
RA Alam M., Haque M.S., Islam M.S., Emdad E.M., Islam M.M., Ahmed B.,
RA Halim A., Hossen Q.M.M., Hossain M.Z., Ahmed R., Khan M.M., Islam R.,
RA Rashid M.M., Khan S.A., Rahman M.S., Alam M.;
RT "Corchorus capsularis genome sequencing.";
RL Submitted (SEP-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- PATHWAY: Nitrogen metabolism; (S)-allantoin degradation; allantoate
CC from (S)-allantoin: step 1/1. {ECO:0000256|ARBA:ARBA00004968}.
CC -!- SUBUNIT: Homotetramer. {ECO:0000256|ARBA:ARBA00011881}.
CC -!- SIMILARITY: Belongs to the metallo-dependent hydrolases superfamily.
CC Allantoinase family. {ECO:0000256|ARBA:ARBA00010368}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OMO76451.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AWWV01010882; OMO76451.1; -; Genomic_DNA.
DR STRING; 210143.A0A1R3I1S7; -.
DR EnsemblPlants; OMO76451; OMO76451; CCACVL1_15655.
DR Gramene; OMO76451; OMO76451; CCACVL1_15655.
DR OMA; IPERSEY; -.
DR OrthoDB; 5490617at2759; -.
DR UniPathway; UPA00395; UER00653.
DR Proteomes; UP000188268; Unassembled WGS sequence.
DR GO; GO:0004038; F:allantoinase activity; IEA:UniProtKB-EC.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
DR GO; GO:0050897; F:cobalt ion binding; IEA:InterPro.
DR GO; GO:0030983; F:mismatched DNA binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0000256; P:allantoin catabolic process; IEA:UniProtKB-UniPathway.
DR GO; GO:0006298; P:mismatch repair; IEA:InterPro.
DR Gene3D; 3.30.1370.110; -; 1.
DR Gene3D; 3.20.20.140; Metal-dependent hydrolases; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR InterPro; IPR017593; Allantoinase.
DR InterPro; IPR006680; Amidohydro-rel.
DR InterPro; IPR000432; DNA_mismatch_repair_MutS_C.
DR InterPro; IPR007696; DNA_mismatch_repair_MutS_core.
DR InterPro; IPR036187; DNA_mismatch_repair_MutS_sf.
DR InterPro; IPR011059; Metal-dep_hydrolase_composite.
DR InterPro; IPR032466; Metal_Hydrolase.
DR InterPro; IPR046893; MSSS.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR002625; Smr_dom.
DR InterPro; IPR036063; Smr_dom_sf.
DR NCBIfam; TIGR03178; allantoinase; 1.
DR PANTHER; PTHR48378; DNA MISMATCH REPAIR PROTEINS MUTS FAMILY DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR PANTHER; PTHR48378:SF2; SMR DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01979; Amidohydro_1; 1.
DR Pfam; PF20297; MSSS; 1.
DR Pfam; PF00488; MutS_V; 1.
DR Pfam; PF01713; Smr; 1.
DR SMART; SM00534; MUTSac; 1.
DR SMART; SM00533; MUTSd; 1.
DR SMART; SM00463; SMR; 1.
DR SUPFAM; SSF51338; Composite domain of metallo-dependent hydrolases; 1.
DR SUPFAM; SSF48334; DNA repair protein MutS, domain III; 1.
DR SUPFAM; SSF51556; Metallo-dependent hydrolases; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR SUPFAM; SSF160443; SMR domain-like; 1.
DR PROSITE; PS00486; DNA_MISMATCH_REPAIR_2; 1.
DR PROSITE; PS50828; SMR; 1.
PE 3: Inferred from homology;
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840};
KW Coiled coil {ECO:0000256|SAM:Coils};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741};
KW Reference proteome {ECO:0000313|Proteomes:UP000188268};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..1384
FT /note="allantoinase"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5013136777"
FT DOMAIN 1313..1384
FT /note="Smr"
FT /evidence="ECO:0000259|PROSITE:PS50828"
FT REGION 1201..1220
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1282..1302
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 678..705
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1096..1166
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 1201..1218
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1384 AA; 152406 MW; 7242CF11256AFA96 CRC64;
MDWQWRLLPL LALLASFLVL FYFQDSFKAS QGECSLLPYS HYWIASKRIV TQQGIISGAV
EVKGGKIISI VKDADWNGKS KQVVDYGDAV VMPGLIDVHV HLDDPGRDEW EGFPSGTKAA
AAGGVTTVVD MPLNNFPSTV STETLNLKIK AAEKRIYVDV GFWGGLVPEN AFNATTLEAL
LDAGVLGLKS FMCPSGINDF PMTDIHHIKA GLSALAKYRR PLLVHSEIQD DKSNVQVDNG
GDDPRSYSTY LKTRPPSWEE AAIRELLTAT KDTRSGGPAE GAHLHVVHLS DASSSFDLIK
DAKKRGDSIT VETCPHYLAF SAEEIPDGDT RFKCAPPIRD AANKEKLWNA LMEGDIDMLS
SDHSPTVPKL KLLNEGNFLK AWGGISSIQF VLPVTWSSGQ KFGITLEKIA LWWSERPAKL
IGQHSKGAIA IGNHADIVVW EPEVEFDLNA DHPMYVKNPS ISAYLGKRLS GNVLATFVRG
NLVYKQGNHA PAACVSVRPL QFKPKLISSV TNSLESRTSE LATTLQSETL KTLEWPAICN
YLSTFTSTSM GLYLTKTAAI PVGQSRDESQ RLLDQTTAAL HAMEAFKSEP LDLSSIEDVS
GIVHSAASGQ MLTVRELCRV RRMLAAARAV SEKLGAVAHG GSSERYTPLL EILQSSNFQM
ELEKKIGFCI DCNLSTVLDR ASEELELIRA ERKRNMENLD SLLKEVSVSI YQAGGIDRPL
VTKRRSRMCV GIRASHKYLL PDGVVLNVSS SGATYFMEPR EAVELNNMEV KLSNSEKAEE
MAILSMLTCD IAESEAEIRY LLDRLLEVDL AFARAAYARW VNGVCPILTS EEPEVLISEA
DNALSVDIEG IQHPLLLGSS LGNFSDIIAP NSIDPSSSKV VSNFPVPIDI KVQSGTRVVV
VSGPNTGGKT ASMKTLGLSS IMSKAGMYLP AKRQPRVPWF DLVLADIGDS QSLEQSLSTF
SGHISRICEI LEVASKESLV LIDEIGSGTD PSEGVALSTS ILQYLKNRVN LAVVTTHYAD
LSRLKDNDSQ YENAAMEFSL ETLQPTYQII WGSTGDSYAL TIANSIGFDR NIIERAKNWV
ERLNPEKQQE RKGVLYQSLM EERNRLEAQF KRAESLHAEI MGLYNEVRGE ADNLEEREIA
LRAKEMQKVQ QELDTAKSQI NTVVQEFENQ LRIANSDEFN SLIRKSESAI NSIVKAHFPG
DSSSFTESET SSYEPQSGEQ VHVKKLGNKL ATVVEASEDD NTVLVQYGKI RVRVEKSNIR
PISRSKSNAA ISSRQSLKRL VRNKRRDFPL DSDATNSDAT SYSPLIQTSK NTVDLRGMRV
EEATLHLEMA IAARESKSVL FVVHGMGTGV IKELALDILG KNPRVVKYEQ DNPMNYGCTV
AYIK
//