ID A0A445DV44_ARAHY Unreviewed; 849 AA.
AC A0A445DV44;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 27-MAR-2024, entry version 13.
DE RecName: Full=Peptidase A2 domain-containing protein {ECO:0000259|PROSITE:PS50175};
GN ORFNames=Ahy_A03g013281 {ECO:0000313|EMBL:RYR67043.1};
OS Arachis hypogaea (Peanut).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC dalbergioids sensu lato; Dalbergieae; Pterocarpus clade; Arachis.
OX NCBI_TaxID=3818 {ECO:0000313|EMBL:RYR67043.1, ECO:0000313|Proteomes:UP000289738};
RN [1] {ECO:0000313|EMBL:RYR67043.1, ECO:0000313|Proteomes:UP000289738}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Fuhuasheng {ECO:0000313|Proteomes:UP000289738};
RC TISSUE=Leaves {ECO:0000313|EMBL:RYR67043.1};
RA Chen X.;
RT "Sequencing of cultivated peanut Arachis hypogaea provides insights into
RT genome evolution and oil improvement.";
RL Submitted (JAN-2019) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RYR67043.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; SDMP01000003; RYR67043.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A445DV44; -.
DR Proteomes; UP000289738; Chromosome a03.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001995; Peptidase_A2_cat.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR PANTHER; PTHR33240:SF8; H0502G05.11 PROTEIN; 1.
DR PANTHER; PTHR33240; OS08G0508500 PROTEIN; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR PROSITE; PS50175; ASP_PROT_RETROV; 1.
PE 4: Predicted;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Reference proteome {ECO:0000313|Proteomes:UP000289738}.
FT DOMAIN 573..612
FT /note="Peptidase A2"
FT /evidence="ECO:0000259|PROSITE:PS50175"
FT REGION 1..39
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 78..163
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 349..393
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 466..501
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 22..39
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 103..120
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 121..137
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 138..160
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 369..383
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 466..497
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 849 AA; 96410 MW; E27D6AD4449FA717 CRC64;
MEGILEENPS SQDDSRPRHP ASEQQEVTNQ ARIASTIHRT NEYMVTDPQH PEKARDKAAQ
IIQDLCLRIQ ELEGKLTDRG KYANEHGSQA TSRPRSHRGR SPTRQHDRRD GRSTSRNHRH
EKSPERRHNK KHHRSASHDL SRQHDSDEDP RRRNTKRTRN DHIIMGATPF MERILRAKLP
KGFDKPTDMK YDGTKDPQEH LTAFEARMNL EGASDAVRCR AFPVTLAGPA IKWFNALPNG
SIASFHDIAR KFMAQFTTRI TKAKHPISLL GVTQKQEEST RKYLDCFNDE CLMVDGLTDS
VASLCLTNGL MNEDFRKHLT TKPVWTMHEI QNVTKDYIND EEVSQVVAAN KRQHAATQHG
NPPPRHNPPP KENQRDHLRP TNRPPRIGKF SNYTPLTAPI TEIYHQIADR GVIPRARPLK
ERTGGNKALY CDYHRGYGHK TQDCFDLKDA LEQAIRDGKL PEFVKIIREP RRADRDKSPE
REGRNPRTQK LPPRENPEED PTIIVNVITG KDVSNKSKLT MKKDLKIMAV RHHDPVATAD
STITFLPEDC QHGTSAEDAP FVISARIGTG LVRRILVDTG ADSNILFRGA LDKLGLRNDN
LQTHRHGVTG LGDNFLKPDG SITLPITIGT SNQRKTILSE FVVLKDSTAY NVILGRKTIN
DFSAVIFTKY LLMKFRTDDG TIGTIHGDRE VAAECDNNSL ALRKKSRDAA EIFLADLDAR
LDGQPRPEPK GDMEKLQIGP TKEEYTFINR NLPYDLKEEL SQLLKQNRDL FAFTPADMPG
INPDLMSHHL AVDPLAKPVA QRRRKMSPDR AAEVRNQVKA LLEANFIREL PYTTWLANVV
LVRKSNGKW
//