ID W9X162_9EURO Unreviewed; 399 AA.
AC W9X162;
DT 14-MAY-2014, integrated into UniProtKB/TrEMBL.
DT 14-MAY-2014, sequence version 1.
DT 22-FEB-2023, entry version 34.
DE SubName: Full=Aspergillopepsin I {ECO:0000313|EMBL:EXJ71055.1};
GN ORFNames=A1O5_06048 {ECO:0000313|EMBL:EXJ71055.1};
OS Cladophialophora psammophila CBS 110553.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Chaetothyriomycetidae; Chaetothyriales; Herpotrichiellaceae;
OC Cladophialophora.
OX NCBI_TaxID=1182543 {ECO:0000313|EMBL:EXJ71055.1, ECO:0000313|Proteomes:UP000019471};
RN [1] {ECO:0000313|EMBL:EXJ71055.1, ECO:0000313|Proteomes:UP000019471}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CBS 110553 {ECO:0000313|EMBL:EXJ71055.1,
RC ECO:0000313|Proteomes:UP000019471};
RG The Broad Institute Genomics Platform;
RA Cuomo C., de Hoog S., Gorbushina A., Walker B., Young S.K., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Cladophialophora psammophila CBS 110553.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the peptidase A1 family.
CC {ECO:0000256|ARBA:ARBA00007447, ECO:0000256|RuleBase:RU000454}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EXJ71055.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMGX01000008; EXJ71055.1; -; Genomic_DNA.
DR RefSeq; XP_007744834.1; XM_007746644.1.
DR AlphaFoldDB; W9X162; -.
DR STRING; 1182543.W9X162; -.
DR GeneID; 19190761; -.
DR eggNOG; KOG1339; Eukaryota.
DR HOGENOM; CLU_013253_0_1_1; -.
DR OrthoDB; 2900143at2759; -.
DR Proteomes; UP000019471; Unassembled WGS sequence.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd06097; Aspergillopepsin_like; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 2.
DR InterPro; IPR001461; Aspartic_peptidase_A1.
DR InterPro; IPR001969; Aspartic_peptidase_AS.
DR InterPro; IPR034163; Aspergillopepsin-like_cat_dom.
DR InterPro; IPR033121; PEPTIDASE_A1.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR PANTHER; PTHR47966:SF2; ASPERGILLOPEPSIN-1-RELATED; 1.
DR PANTHER; PTHR47966; BETA-SITE APP-CLEAVING ENZYME, ISOFORM A-RELATED; 1.
DR Pfam; PF00026; Asp; 1.
DR PRINTS; PR00792; PEPSIN.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR PROSITE; PS00141; ASP_PROTEASE; 2.
DR PROSITE; PS51767; PEPTIDASE_A1; 1.
PE 3: Inferred from homology;
KW Aspartyl protease {ECO:0000256|ARBA:ARBA00022750,
KW ECO:0000256|RuleBase:RU000454};
KW Disulfide bond {ECO:0000256|PIRSR:PIRSR601461-2};
KW Hydrolase {ECO:0000256|RuleBase:RU000454};
KW Protease {ECO:0000256|RuleBase:RU000454};
KW Reference proteome {ECO:0000313|Proteomes:UP000019471};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..19
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 20..399
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004934588"
FT DOMAIN 85..395
FT /note="Peptidase A1"
FT /evidence="ECO:0000259|PROSITE:PS51767"
FT ACT_SITE 101
FT /evidence="ECO:0000256|PIRSR:PIRSR601461-1"
FT ACT_SITE 287
FT /evidence="ECO:0000256|PIRSR:PIRSR601461-1"
FT DISULFID 323..358
FT /evidence="ECO:0000256|PIRSR:PIRSR601461-2"
SQ SEQUENCE 399 AA; 42029 MW; 81E384263796940D CRC64;
MLSFTVLLAA SAWTSVSFAK PIPVPQVEIK TGFTLVQSVA KPFQAGPVVL QRAYLKYNSS
VPADVKAAAA GGTVTATPEQ YDAEYLCPVN IGGQTLNLDF DTGSSDLWVF SSELSSSEQS
GHSIYDPSKS STAKELSGAT WKISYGDGSS ASGNVYTDTV NVGGTTVTGQ AVELAQQISS
QFQQDANNDG LLGLAFSSIN TVQPQQQTTF FDTAISQGVL QQNVFTADLK KGAPGSYDFG
FIDSSKYTGD ITYTPVDNSQ GFWEFTGTGY QVGNGRFQSA SIDAIADTGT TLLLMDDNIV
SAYYNQVDGA QYDNSQGGYT FPCSANLPDF AVGVGSYHAV IPGSFMNFAP TSAASSSCYG
GLQSNQGIGI SIYGDIFLKA VFAVFDNDNM QFGVAAKNL
//