ID A0A383WEI9_TETOB Unreviewed; 2215 AA.
AC A0A383WEI9;
DT 07-NOV-2018, integrated into UniProtKB/TrEMBL.
DT 07-NOV-2018, sequence version 1.
DT 27-MAR-2024, entry version 17.
DE RecName: Full=Peptidase C1A papain C-terminal domain-containing protein {ECO:0000259|SMART:SM00645};
GN ORFNames=BQ4739_LOCUS15861 {ECO:0000313|EMBL:SZX75582.1};
OS Tetradesmus obliquus (Green alga) (Acutodesmus obliquus).
OC Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Chlorophyceae;
OC CS clade; Sphaeropleales; Scenedesmaceae; Tetradesmus.
OX NCBI_TaxID=3088 {ECO:0000313|EMBL:SZX75582.1, ECO:0000313|Proteomes:UP000256970};
RN [1] {ECO:0000313|EMBL:SZX75582.1, ECO:0000313|Proteomes:UP000256970}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Cai Z.;
RL Submitted (OCT-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FNXT01001236; SZX75582.1; -; Genomic_DNA.
DR STRING; 3088.A0A383WEI9; -.
DR Proteomes; UP000256970; Unassembled WGS sequence.
DR GO; GO:0005524; F:ATP binding; IEA:InterPro.
DR GO; GO:0140658; F:ATP-dependent chromatin remodeler activity; IEA:InterPro.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025661; Pept_asp_AS.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR000330; SNF2_N.
DR PANTHER; PTHR13484; FIP1-LIKE 1 PROTEIN; 1.
DR PANTHER; PTHR13484:SF0; PRE-MRNA 3'-END-PROCESSING FACTOR FIP1; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR Pfam; PF00176; SNF2-rel_dom; 1.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000256970}.
FT DOMAIN 1631..1833
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
FT REGION 100..142
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 207..303
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 417..505
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 660..970
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1102..1123
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1268..1311
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1443..1524
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1539..1616
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1676..1726
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 122..142
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 226..287
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 427..441
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 719..737
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 745..760
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 779..813
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 844..861
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 933..969
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1268..1287
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1676..1697
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2215 AA; 229428 MW; 44F8D42FEC398C40 CRC64;
MAHDSLDSRL VLLQQQLLGF VTVFQDTIEG VRAELRAAGP QLALQQLQQQ QASNPFDSLQ
AGAFQPRSAA GPWTSPGQLG GVAGASSTAP LLLHGASTAL HQQGASGQHA APAAGGLAQQ
QRFPADVQQA TGAQHTSSPA AAAAGTSCDE VLLETQLVGG NAADSTLLQT ELLDDGLTVD
QPAAAAAVAE LTPGTLLAGS TAAGVSGELQ QRQNHGHERL PQQQEELEQE QQQQQQQTGI
QERTQPELNA TDSSDSRPMQ QSPAAASDSD VFSTATGSST SLASMETAPS AHEDGAAAAR
RMEADSEAAA AAAAAAAAAG GDAAEAGEVL VEVAGDDDTH TSMLRVAYPG SPAGLRTLGV
FINDAPALEG EDGSLLPNTA IPDMLAHPSG LHLAEHLQHS MAWVLANLSH SRRQLGWDHT
EDDEQQQQQE VADEEDEELW QQLGQQSDGD AAKEQQAAAA AAHVSPGSGR RKQAKPVRRP
NAADRALELS SDSEDERHPA APRSGNSALA SLLQGLNAAA AAGAAGAGSA HAAAAAAAAE
PAGFASAVMS AGLAEVEALV AEDVERLRQQ WRVAELPKLT AGAAEIWHEQ RSQLAYWQRV
AEERQQQLAK GMQAMALEHA GNVAGYSRKR TRLSEALHGW VNLAEGAAAV LAIVELPEAP
QPAAESEEDA AADGGRSKRR AATQAAGAGG SPSAAAAAAG EGGGEDLGSS SSQGDEESAD
AGGDLDGFID DNDDDAEQPA AGATAAAVQK QQQAGSGSEQ EAGGEGSGDE GAAATWNVPH
LTPGSTQVLQ PGRRQRQQQR ALLLATQQQQ QQQHEEEAQE EQQQKVKEQQ QQQEVHEEHQ
QVEEQLSGSA GDAQAASQHS TPLAEAAEGE PTEQPFVDAE VADDQAAAAA DDMPLCASLQ
TPPEPSGAAL AAGELVDELP ALQLPPMPPP ATSSDSSQQQ QQQQQQQQQQ QDGQDQQQQA
ASSSSDASEL AADMEAITQE QYEAPSRQQA ALPAAAYAYP GLVEQGAEQV PGELRLLQAC
ELLAGPPLEG QVMRWADALL VRQAVRLPRW HSWTGAAGVV GLDGAVGSVR VRPANGPMFT
AQLGNVRPCP PPGLASIIPP SALQQQQQQG TGGDALPPAA SAPTWASSPQ GLWHLCKLLR
PGLLLEYRRG DGTWLPALLL HSSLLRPPVN SELLRLAKQY NTQDEGQQQQ GDSGARVTAA
GLVASFRALH GSKPCAEGVA YCQPPQPGDR VLTLQLLGGS RDVRHIRWQQ LYEVSPCVRR
MLHWTADDDA DTDDDADTDD DEQRGSQDCD NTDEPAAAAD ASMAGNDDTV DAAADPAAAP
AAAANGSPSS SAVLAAGRWR VLGRYLSWPL ARLVRSQLAA EHQAWLRARG WPLHRRAPRS
QQQQQLLPSD VVASIARKLA RRPVAVGVTP ALVAAAGLGP CSKAAVRHHI APRKLRVQLG
AEPELSGGDE AEAAAGGADR RGGRQQRKRR VLLDEFSGSS EQDQEEQQQR LAKPGSRRPS
KRRRITASAD GAGGSDCSGS GTDDELQAMD LDAAAAAADR GAAGSSRSRR HLRRRSSVAA
MAATAVTAAA EAKAAAGDDG DEEDEELLPS NEPSASVSGE SDSKEDGGED DGVAAAAAAA
AEGCLLNDSE PPGSLNPWLP RCWFSPGLPP SQQVIKPHQL QGLRFMWDCL VREHADDSGS
EQPADGHKSR GWSDNSDDDF VEEPAAAGGG RAGSKGSSKK QKKRGVVAAG DDDAGGCILA
HSMGLGKSFQ AVALLWLFFQ QLEGIYRNKH CTLKPSQLDH AVILVGWGED SSSGQRFWLV
KNSWSKLWGN DGYIKIHRGG NDCGIASDAA FADVSPEYVV PGAAKRALQL VLSKSRGQTQ
APSAALVLLA CAALLGAAEG VCLLKLNIST PQSAFTGSGK MTAPIPGTIN STPVKATGQL
FLSVPTTSCP PAGLTAEDAV ALLEKSSLLV PVDEESISFT PEDLKGNATD NTGEKIEGIT
VDFKGMQLAF TGNEPLAPGA VKLGTGSAAE AKAAAGELCS QLIMLGGELS MQSLLGPRTA
AVKHVAMNRT FSVGASVKKG DAKVPVTLFL GKAGLALNIS KGAMPGVEFT RATYLLSGDI
VATADLANKA SWQEVAAADV DWSGVPRLDS PIVIDPESSS PPAKGRIVGV NSTLTQQCTP
TRTASAAAVT AADGAPAAAA AASPAKSAAG GSRVGVLQVL LAAAAPAVLA GVAML
//