ID B8LEG8_THAPS Unreviewed; 497 AA.
AC B8LEG8;
DT 03-MAR-2009, integrated into UniProtKB/TrEMBL.
DT 03-MAR-2009, sequence version 1.
DT 27-MAR-2024, entry version 49.
DE RecName: Full=DNA polymerase alpha catalytic subunit N-terminal domain-containing protein {ECO:0000259|Pfam:PF12254};
DE Flags: Fragment;
GN ORFNames=THAPSDRAFT_bd843 {ECO:0000313|EMBL:EED86278.1};
OS Thalassiosira pseudonana (Marine diatom) (Cyclotella nana).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC Thalassiosiraceae; Thalassiosira.
OX NCBI_TaxID=35128 {ECO:0000313|EMBL:EED86278.1, ECO:0000313|Proteomes:UP000001449};
RN [1] {ECO:0000313|EMBL:EED86278.1, ECO:0000313|Proteomes:UP000001449}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1335 {ECO:0000313|EMBL:EED86278.1};
RX PubMed=15459382; DOI=10.1126/science.1101156;
RA Armbrust E.V., Berges J.A., Bowler C., Green B.R., Martinez D.,
RA Putnam N.H., Zhou S., Allen A.E., Apt K.E., Bechner M., Brzezinski M.A.,
RA Chaal B.K., Chiovitti A., Davis A.K., Demarest M.S., Detter J.C.,
RA Glavina T., Goodstein D., Hadi M.Z., Hellsten U., Hildebrand M.,
RA Jenkins B.D., Jurka J., Kapitonov V.V., Kroger N., Lau W.W., Lane T.W.,
RA Larimer F.W., Lippmeier J.C., Lucas S., Medina M., Montsant A., Obornik M.,
RA Parker M.S., Palenik B., Pazour G.J., Richardson P.M., Rynearson T.A.,
RA Saito M.A., Schwartz D.C., Thamatrakoln K., Valentin K., Vardi A.,
RA Wilkerson F.P., Rokhsar D.S.;
RT "The genome of the diatom Thalassiosira pseudonana: ecology, evolution, and
RT metabolism.";
RL Science 306:79-86(2004).
RN [2] {ECO:0000313|EMBL:EED86278.1, ECO:0000313|Proteomes:UP000001449}
RP GENOME REANNOTATION.
RC STRAIN=CCMP1335 {ECO:0000313|EMBL:EED86278.1};
RG Diatom Consortium;
RA Grigoriev I., Grimwood J., Kuo A., Otillar R.P., Salamov A., Detter J.C.,
RA Schmutz J., Lindquist E., Shapiro H., Lucas S., Glavina del Rio T.,
RA Bruce D., Pitluck S., Rokhsar D., Armbrust V.;
RL Submitted (SEP-2008) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DS999440; EED86278.1; -; Genomic_DNA.
DR RefSeq; XP_002297408.1; XM_002297372.1.
DR AlphaFoldDB; B8LEG8; -.
DR STRING; 35128.B8LEG8; -.
DR PaxDb; 35128-Thapsdraft843; -.
DR EnsemblProtists; EED86278; EED86278; THAPSDRAFT_bd843.
DR GeneID; 7447741; -.
DR KEGG; tps:THAPSDRAFT_bd843; -.
DR eggNOG; ENOG502T1PW; Eukaryota.
DR HOGENOM; CLU_549363_0_0_1; -.
DR InParanoid; B8LEG8; -.
DR Proteomes; UP000001449; Unassembled WGS sequence.
DR InterPro; IPR024647; DNA_pol_a_cat_su_N.
DR Pfam; PF12254; DNA_pol_alpha_N; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000001449}.
FT DOMAIN 49..89
FT /note="DNA polymerase alpha catalytic subunit N-terminal"
FT /evidence="ECO:0000259|Pfam:PF12254"
FT REGION 27..52
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 120..144
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 187..448
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 187..203
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 262..300
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 309..330
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 343..364
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 390..405
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 432..448
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 497
FT /evidence="ECO:0000313|EMBL:EED86278.1"
SQ SEQUENCE 497 AA; 53521 MW; A5F146D951DE305C CRC64;
MAPSRKATKK AALQKLRTAR HEGFARFNSD HVNSDDDAFD DDDGARARSH LDSVKFDEEK
PVYEEMDEEA YRNYVSEKLD REDFVVDDGL LLAFYGAHRF QTTSTADGLG YHDDGEYDIR
NLGSNDDHHN NNHPNKKKRG NGTAALTKEA LRKARKTKAL IGDGEEAPKD AKNATMWDFV
NKGVAGSSGS GTIANNSSGG KNKVLIGDNG EGNAGGRNRG GVGADMDNGL DDLLSGLDDV
TSARPRNSGG IGGGVRGRSL HNRHQYGSSG RSRSSAASSS SRHTPTSSRQ HTTPASSARK
RRAYGSSGDY GLVNTSGRRK LHSNATPGSS RRGGGGGVQR NDTRAGDSDE EPIDFNNRGY
DEGEDDADFG NDGGDVDFGG EDNGNDTFEE DADMNGKKAE SNEEVEGEQR EEDSSAEAAA
TEATSSRPRK IGRLAARKEA QEKAAAQKKL LEEQQKQTKK MNDSKPKEKV VTFEEDIKVD
MTSTSFRPES IAAASAE
//