ID A0A1D2VKT5_9ASCO Unreviewed; 1292 AA.
AC A0A1D2VKT5;
DT 30-NOV-2016, integrated into UniProtKB/TrEMBL.
DT 30-NOV-2016, sequence version 1.
DT 24-JAN-2024, entry version 23.
DE SubName: Full=Cysteine proteinase {ECO:0000313|EMBL:ODV62224.1};
GN ORFNames=ASCRUDRAFT_133106 {ECO:0000313|EMBL:ODV62224.1};
OS Ascoidea rubescens DSM 1968.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; Saccharomycetes;
OC Saccharomycetales; Ascoideaceae; Ascoidea.
OX NCBI_TaxID=1344418 {ECO:0000313|EMBL:ODV62224.1, ECO:0000313|Proteomes:UP000095038};
RN [1] {ECO:0000313|Proteomes:UP000095038}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 1968 {ECO:0000313|Proteomes:UP000095038};
RG DOE Joint Genome Institute;
RA Riley R., Haridas S., Wolfe K.H., Lopes M.R., Hittinger C.T., Goker M.,
RA Salamov A., Wisecaver J., Long T.M., Aerts A.L., Barry K., Choi C.,
RA Clum A., Coughlan A.Y., Deshpande S., Douglass A.P., Hanson S.J.,
RA Klenk H.-P., Labutti K., Lapidus A., Lindquist E., Lipzen A.,
RA Meier-Kolthoff J.P., Ohm R.A., Otillar R.P., Pangilinan J., Peng Y.,
RA Rokas A., Rosa C.A., Scheuner C., Sibirny A.A., Slot J.C., Stielow J.B.,
RA Sun H., Kurtzman C.P., Blackwell M., Grigoriev I.V., Jeffries T.W.;
RT "Comparative genomics of biotechnologically important yeasts.";
RL Submitted (MAY-2016) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KV454477; ODV62224.1; -; Genomic_DNA.
DR RefSeq; XP_020048531.1; XM_020189054.1.
DR STRING; 1344418.A0A1D2VKT5; -.
DR GeneID; 30962690; -.
DR InParanoid; A0A1D2VKT5; -.
DR OrthoDB; 1353379at2759; -.
DR Proteomes; UP000095038; Unassembled WGS sequence.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 2.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000095038}.
FT REGION 1..25
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 58..107
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 354..375
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 388..458
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 847..875
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 58..99
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 847..865
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1292 AA; 148294 MW; E04FD1688D222B6A CRC64;
MSKKFGLPFP KLKDASDAND DEQPRPCYLE AYYVLDDNKY WLRVIEDIKR DQKDLKERLK
ERKEKEKEKE KEKKKEEKEK EKEKKTDKPT DKPTDDPPVE TPNDPDDVLA YDKEIIELVK
TSTSSFPTSF SSTSSIASMK IAIMKSFLQN NPVSNQLIIF SLISKFSKKD LTKSFELLRF
IKLSSDGFLI SNSYIYKNYM GNSINKKIIL KGAENWGYVM CYLDSLLFSM FARLQDFEPI
LLIDNESQST PESKKRTKLK FLLRLYINLL RSGNLITNDL TRLLCQSLAK NGYTEAISGS
QEDSSSLFQF LTDYFNMPLL TLKLDIHHGG KAVQNDDHKF TKERVLYVSV PEDDYDKSHT
RNNNIKNTGS INIINKSNNN SKIIEEEEFS TQSNENLNIL KKTSPDNRNP MNESLINLSQ
SHQPNPSNSS NQSNKSNESN ESNESSKSKK SASSNSHIDP EPILLEECLE QFFNTSIQVR
RQLERRMTLE NARNNNCTKN GGIFNSNSNN YNNYYNNYSD SNRKFYSDNV NIDNNIDNID
MSQKGIVTFE SYDDLDVESS LNESHNSNRN IFNDDAKSIV SSFTNNDKKN CSKEDLDFLK
KLNKKSPIND SNSLFLLNTK ITNQKDSLSN DPDQTLNDDE YYMFDKNNSN LKDFKLNDTN
CCSSTSLSIN HKTNDDDKEN QIPFPSLNEM DESVSPTTSI QRSPVTPFQP ENVLLKNRKL
RSSSLSKYPD VNITIRPRNS VSSRTRSSTL SIWSNKTSNE VTLPAWMFLQ LLPFYTDLDS
NSSSSREFMN KRPILPICLK RYYYDNENGK KKGEHGSKRN LRRVIIPPTI DLPKFIADDI
EVDDNMINSR SNDNINNNNS TNNTKEESTE KTDKSEFPEK LYGDFKLILE SAVCHRGKSV
NSGHYISLVR ENPSDPYTSE EEELNSIWLL FDDLLRGENK VRRRVFKDVF EKEWPYILFY
RMVPFDKNKS EEIQKEKLEK LEKANSTIVA SSLVSKMESL KIKISDNFND SFLSTNSENK
IKSSDNLNLK SEATSITSNL NSSLNSDKLL FNSQSNLFIE VEKSSNTNDS TINLDNSAPT
QVQQKPTLSA SLSNTHLGRS KSTRIGIHRL HHNFHDQSMF KYFKDSTQPY PNDPGYIDIS
NKFFWYVKVP EGNYINENSG LSDVSLLEKS SIISKDGESF VKIQKSDAAS DSEIKSTEYI
FEDSNSNLLK PPNEIRGKEK RQSADISIAS YPSLEKVLSD SNNNEPTPNS HTIVDHKAKT
INIFGDLHDR KSKKLYNHKK RLNYKKEKCI IT
//