ID A0A016WA68_9BILA Unreviewed; 807 AA.
AC A0A016WA68;
DT 11-JUN-2014, integrated into UniProtKB/TrEMBL.
DT 11-JUN-2014, sequence version 1.
DT 27-MAR-2024, entry version 43.
DE RecName: Full=Peptidase A1 domain-containing protein {ECO:0000259|PROSITE:PS51767};
DE Flags: Fragment;
GN Name=Acey_s0928.g3081 {ECO:0000313|EMBL:EYC36162.1};
GN ORFNames=Y032_0928g3081 {ECO:0000313|EMBL:EYC36162.1};
OS Ancylostoma ceylanicum.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Strongyloidea; Ancylostomatidae;
OC Ancylostomatinae; Ancylostoma.
OX NCBI_TaxID=53326 {ECO:0000313|EMBL:EYC36162.1, ECO:0000313|Proteomes:UP000024635};
RN [1] {ECO:0000313|Proteomes:UP000024635}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=HY135 {ECO:0000313|Proteomes:UP000024635};
RX PubMed=25730766; DOI=10.1038/ng.3237;
RA Schwarz E.M., Hu Y., Antoshechkin I., Miller M.M., Sternberg P.W.,
RA Aroian R.V.;
RT "The genome and transcriptome of the zoonotic hookworm Ancylostoma
RT ceylanicum identify infection-specific gene families.";
RL Nat. Genet. 47:416-422(2015).
CC -!- SIMILARITY: Belongs to the peptidase A1 family.
CC {ECO:0000256|ARBA:ARBA00007447, ECO:0000256|RuleBase:RU000454}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EYC36162.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JARK01000528; EYC36162.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A016WA68; -.
DR Proteomes; UP000024635; Unassembled WGS sequence.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd05471; pepsin_like; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR InterPro; IPR001461; Aspartic_peptidase_A1.
DR InterPro; IPR001969; Aspartic_peptidase_AS.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR034164; Pepsin-like_dom.
DR InterPro; IPR033121; PEPTIDASE_A1.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF00026; Asp; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR PRINTS; PR00792; PEPSIN.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR PROSITE; PS00141; ASP_PROTEASE; 1.
DR PROSITE; PS51767; PEPTIDASE_A1; 1.
PE 3: Inferred from homology;
KW Aspartyl protease {ECO:0000256|RuleBase:RU000454};
KW Hydrolase {ECO:0000256|RuleBase:RU000454};
KW Protease {ECO:0000256|RuleBase:RU000454};
KW Reference proteome {ECO:0000313|Proteomes:UP000024635};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..16
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 17..807
FT /note="Peptidase A1 domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5009738594"
FT DOMAIN 75..495
FT /note="Peptidase A1"
FT /evidence="ECO:0000259|PROSITE:PS51767"
FT REGION 597..621
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 597..615
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 807
FT /evidence="ECO:0000313|EMBL:EYC36162.1"
SQ SEQUENCE 807 AA; 92333 MW; 083EF8B3F250C10B CRC64;
MLLWLVLLVA ANHAMGERYQ IPLSKIQSKM VQMLRAGSWA THVKEMRAKN HNIMDAPVKK
SNSNQNINTY HDMEYIANIT IGNPEQTFTV VLDTGSPDTW VVDYTCSADK PLVCDDSICD
QGLICKVFCP HRVCCEKKSP RKRNPCRGKH YFESAKSSTY VSMNGTWTMG YRPWGKAKGF
FGNDTLRFGD KGTKQLVVPA TKIGFANGID EYLGERRLDG VVGLAFSSLS YNDAVSPFER
AWELGLVEPK FTVYMERVGW DAENVFGGMV TYGGLDTQHC GDVIAYEPVS VATYWKFKMD
PQAMMTMSRE MMREERKEMM EMFFKHLAVQ GQGSATSEVA SVPGVMSALS NRIEKFVFDP
DMDMGFTKWY TRYKEVFIED AKQLTESARV RLLCEKLDSE TFERYQRHVL PKEVTSIGFE
ETVATLKQLF DVKTSEFTLR YQGLNLEKSD AEDYLVYTGR VNEFCERARI RELDSDGIKC
LLWIFGLKSQ REAEIRQRLI AVLDREYKAG RKLSLQELYR ECENFLSLKK DSETIAGNVK
TVEAAVKEDR RKRECWNCCG DHFAQQCKSK PWFCKQCKKT GHKERFCEVA NRRKAAENGS
EGRRSRQNSD NRRKKKIMQS SRKHVRGVKI ANATAERIAE YGFKVRLEKC SFAKPEIRYL
GFIVDKNGRR PNPEKIEAIK SMVEPKNVGQ LRAFLGMITY YAAFMPTMKD LRGPLDALLK
KDVKWEWTSK QQLAFEKLKK ALSSELNLAH YDPRQKIVVA ADACDYGIGC VISHRYKDGS
EKPIAHASRS LTAAEKNYSQ IEKEALG
//