ID C5KCE2_PERM5 Unreviewed; 1976 AA.
AC C5KCE2;
DT 28-JUL-2009, integrated into UniProtKB/TrEMBL.
DT 28-JUL-2009, sequence version 1.
DT 13-SEP-2023, entry version 54.
DE RecName: Full=Chromo domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=Pmar_PMAR023734 {ECO:0000313|EMBL:EER17804.1};
OS Perkinsus marinus (strain ATCC 50983 / TXsc).
OC Eukaryota; Sar; Alveolata; Perkinsozoa; Perkinsea; Perkinsida; Perkinsidae;
OC Perkinsus.
OX NCBI_TaxID=423536 {ECO:0000313|Proteomes:UP000007800};
RN [1] {ECO:0000313|EMBL:EER17804.1, ECO:0000313|Proteomes:UP000007800}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 50983 / TXsc {ECO:0000313|Proteomes:UP000007800};
RA El-Sayed N., Caler E., Inman J., Amedeo P., Hass B., Wortman J.;
RL Submitted (JUL-2008) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GG671995; EER17804.1; -; Genomic_DNA.
DR RefSeq; XP_002786008.1; XM_002785962.1.
DR EnsemblProtists; EER17804; EER17804; Pmar_PMAR023734.
DR GeneID; 9087130; -.
DR InParanoid; C5KCE2; -.
DR OrthoDB; 217750at2759; -.
DR Proteomes; UP000007800; Unassembled WGS sequence.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR CDD; cd00024; CD_CSD; 1.
DR Gene3D; 2.40.50.40; -; 1.
DR Gene3D; 3.30.1370.10; K Homology domain, type 1; 1.
DR Gene3D; 1.10.720.30; SAP domain; 1.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR000953; Chromo/chromo_shadow_dom.
DR InterPro; IPR036612; KH_dom_type_1_sf.
DR InterPro; IPR003034; SAP_dom.
DR InterPro; IPR036361; SAP_dom_sf.
DR Pfam; PF02037; SAP; 1.
DR SMART; SM00298; CHROMO; 1.
DR SMART; SM00513; SAP; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 1.
DR SUPFAM; SSF54791; Eukaryotic type KH-domain (KH-domain type I); 1.
DR SUPFAM; SSF68906; SAP domain; 1.
DR PROSITE; PS50013; CHROMO_2; 1.
DR PROSITE; PS50800; SAP; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000007800}.
FT DOMAIN 7..72
FT /note="Chromo"
FT /evidence="ECO:0000259|PROSITE:PS50013"
FT DOMAIN 1738..1772
FT /note="SAP"
FT /evidence="ECO:0000259|PROSITE:PS50800"
FT REGION 92..169
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1160..1196
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1608..1628
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1667..1711
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1906..1931
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1951..1976
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 113..128
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1167..1189
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1976 AA; 219684 MW; 2B70697CBCAAD67F CRC64;
MASPEYYEVQ ALLDVRKISG RKEEFLVKWK NSRELTWEPS GNIGSGLNDV KEVWYQLNDY
YKHQHAQTLR QKKERFRGRK LTESVASPAG VVRKVYRKRS PSRSSSSGDD YTSHSESSDD
DDLKEEPLNG AAGDYQTGKL NEDEKMKGYQ ASPSDTTMGG TNRAPAAPRP IPVARFRGGL
FGMVDDLKQN AVLRVANTAG VRYLPETKLN KLFIEYLYGT DVESSVMSDL QYEQAVVDLM
WSMAVNLQGL DEATLSDLKE CLLRLARQLR LQPATTSKML QDWFNRHHHV HATGSPSYNL
MNVSTLRAIC SWMSTLTDEG ATKMEDFSDT GSVADVDVDA LLKGSDDVEM EEPIETVEDP
IVAEFTQAQN EMFGETEDTD EGDICTLIHL VYAAKNPSKL SEMPGLIAKY SATLKALYES
ICKKYGVDCA EFAIQSNIKN YRTMEDFAAY SLASARSLYA TYNPAKLDEI EDIMAKNANQ
EMQVVAAIRQ KYDEGRGYKP GMYYPPSSPE YGIMELIREE LAVYNPDLLR SIEHTVFTQY
RSRELYVYLA ICDKYGVTID EDRLAAVVET YREKQNTQRQ AQVFNVVKEI YKTYNRGKVK
DIHRLVNKFS KSTSSELELI QAVCNKFRVP LHVKVDIISR MLDPGALDLT WHSKFQEAVE
KSDLSMHVWG LIQMEMSSNM DSHLVSQPPM RLSGEDASML TMDRLSGEEQ WTTVEMLIRG
RGGMISNEVI MRDLLDQKVH ELGINNYPSQ LRYRITQQEP ESVQWDSFWV AETPRLSCTL
SMNSVSPPEP SAAGYRPQRL WLSMQDLLQW IHHIRQQYQV AADELAQSVL QKGGQEDGPV
VQPPLTAKRE QPTQPLLSCN VALVSPKLTV FAGDYRTLII DEYPECSLNR QQNHVWFLPK
GGRRSDIVDK LRAACAAWPR WLRHGGAVII DYDKVHFIGG LRRRAMYREG IKQISRLLGD
EAGSVRIPRY LLSLPLGPLQ PQLQELVSDY DDHESQDDYG ESDPEVPTEL IRPLRLSRQQ
MRDAFIQGAL CLNCDDITHK PADCGIRRKV CWNCHGAHQG HTCTTRCRFC QCNHSAFVLL
ECVKRGGRRA QEWAKNRHYQ EQFQLNRTYL DVEQAIASAE AAGKQWSDEI VEMVRILYNA
GQWLPNSWKE HFEKALADAN AVKSEEGGGQ DQASTADNEA SVSTGSIKPV LPSTPPPILP
DQKYKWIERI FLDELLGAGM LGRDAVRAIV GSRGQNHKEM ESATQSRIYF RGMGMKDGSV
GGGAEEMPDL RHFDTEARIH VLVKCDQPPQ ARAVRSMLIR IIRGLESERQ RGHGALPPLC
DAFHWLESSS KYQSGMGKRE DMDRYSGGAD DGGARALPGG GIQGLEITLG DNRDKEVNAL
QHWLHAQGIE AKTDSNLTRI LIPSTRKIEA SVAIKEREQP GEFDDKMEER KRIINSPAVR
HSVYSSLFQL FALWPAQPPA VSGVYWFEPW QLEPIGLIGR IRATAPEGQP IEDDRFWMDV
GQRCRLSMKG AEEVASLLEA MEELPDKVDK DMCVEILRTF VGSVQCAARD HRLLLYLRYP
WAMVHTVGDD ELTALNLFPR NLSVHLINKE IRETGRLGTN TGDVTALKSS VKQQSVKSGD
RPGDHLLERL SPDRVPEMEP PYIGFAVEWL LPDDVLLHTA AAAAAAGHAG GPAGGEVRAQ
YGDEGEPPQR VPIAAPVAVP SPSPTEARPT TIKGEMGQDA VIVDDASDHA PQNQREEYMA
MSVAQLREKL RAQGLPVTGR KTDLVDRLLK KPLGASQKTA QGPALYKCRV ELPAQLMGWH
ELRNNLCGDR NAHFEHVHSQ CPGTKLSLAG NQSAALQGSA RLHVLVTTNG LEADYKRAKD
LVVDLAKAVA EHGAEVMLGD LSPTQLESVL AEIKIVELTT SVPAAVPANT AGAGGDSEGS
GGASQGAAGA GTSSAFNRLF GALQAAAAKG SQAPGSAAAP NAGTPTGRQA ASYEDL
//