ID A0A267DSJ9_9PLAT Unreviewed; 614 AA.
AC A0A267DSJ9;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 13-SEP-2023, entry version 20.
DE RecName: Full=CSD domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=BOX15_Mlig002129g2 {ECO:0000313|EMBL:PAA52263.1};
OS Macrostomum lignano.
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes;
OC Rhabditophora; Macrostomorpha; Macrostomida; Macrostomidae; Macrostomum.
OX NCBI_TaxID=282301 {ECO:0000313|EMBL:PAA52263.1, ECO:0000313|Proteomes:UP000215902};
RN [1] {ECO:0000313|EMBL:PAA52263.1, ECO:0000313|Proteomes:UP000215902}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DV1 {ECO:0000313|EMBL:PAA52263.1};
RC TISSUE=Whole organism {ECO:0000313|EMBL:PAA52263.1};
RA Berezikov E.;
RT "A platform for efficient transgenesis in Macrostomum lignano, a flatworm
RT model organism for stem cell research.";
RL Submitted (JUN-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PAA52263.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NIVC01003273; PAA52263.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A267DSJ9; -.
DR STRING; 282301.A0A267DSJ9; -.
DR Proteomes; UP000215902; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR Gene3D; 2.40.50.140; Nucleic acid-binding proteins; 2.
DR InterPro; IPR011129; CSD.
DR InterPro; IPR002059; CSP_DNA-bd.
DR InterPro; IPR012340; NA-bd_OB-fold.
DR InterPro; IPR024642; SUZ-C.
DR PANTHER; PTHR12913:SF1; COLD SHOCK DOMAIN-CONTAINING PROTEIN E1; 1.
DR PANTHER; PTHR12913; UNR PROTEIN N-RAS UPSTREAM GENE PROTEIN; 1.
DR Pfam; PF00313; CSD; 2.
DR Pfam; PF12901; SUZ-C; 1.
DR SMART; SM00357; CSP; 2.
DR SUPFAM; SSF50249; Nucleic acid-binding proteins; 2.
DR PROSITE; PS51857; CSD_2; 2.
DR PROSITE; PS51938; SUZ_C; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000215902}.
FT DOMAIN 11..80
FT /note="CSD"
FT /evidence="ECO:0000259|PROSITE:PS51857"
FT DOMAIN 178..240
FT /note="CSD"
FT /evidence="ECO:0000259|PROSITE:PS51857"
FT DOMAIN 546..586
FT /note="SUZ-C"
FT /evidence="ECO:0000259|PROSITE:PS51938"
FT REGION 337..365
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 507..550
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 571..595
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 507..538
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 614 AA; 66400 MW; 98158206C90C368F CRC64;
MSTESQKQQS VSQGRIDQLQ GTFGFIKSYS PSEGRLFFHF KDLPPEQPPD SLKIGDSVEF
RIRRDPRGGN KRVQAYCVSK TGQNTANQQP MQSQSSVSST VLEGTVVSPC KPGEPGKIAH
VFNGEYFFLP YTKESVRNGE SLNAGELVFF QVASGPGDPD SHASASWAVN VHRPAASRCR
GIVSSLKDNY GFIEREDRVQ DVHFHFSGCP EGRQPPLGAP VEFSLQLSRS GRETATDIVL
LPAGSVTFAD IDQRRLIGTL LRLPTHQRKE PLTGQVSFQC PDTGAFVQLP FGSADLSSRC
SLRPGDRVAF NAATDRRDRL RRATAIELDP VETFGQSYDS AAGVNSPQRE PRRQGVVHVS
PSPSNKGLLV SDSQSFQFDA YELVGDDLVG NVGDLVQFSV SDGAPDVAVR LCRLPAEAAA
ASADRLTLMR PQLLTGVLHQ ADTEPGIIMY NCAGREKPVC CPLTQISGGA RNDRVQFQLL
EFPVTGASLA CNVRLIDKAA DQVAAVASNS QQPQQQPQAS KPASKSNSSG SSATQQQPTQ
PLPPQPRPDR LRHRLKTLVD AQDGPRLVLI RQPARPDGTK GFVNRPRRQT PSGLQTDCSA
GAALTVETAA LTVQ
//