ID K0S4M5_THAOC Unreviewed; 1391 AA.
AC K0S4M5;
DT 28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT 28-NOV-2012, sequence version 1.
DT 22-FEB-2023, entry version 22.
DE RecName: Full=BRCT domain-containing protein {ECO:0000259|SMART:SM00292};
GN ORFNames=THAOC_19955 {ECO:0000313|EMBL:EJK59784.1};
OS Thalassiosira oceanica (Marine diatom).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC Thalassiosiraceae; Thalassiosira.
OX NCBI_TaxID=159749 {ECO:0000313|EMBL:EJK59784.1, ECO:0000313|Proteomes:UP000266841};
RN [1] {ECO:0000313|EMBL:EJK59784.1, ECO:0000313|Proteomes:UP000266841}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1005 {ECO:0000313|EMBL:EJK59784.1,
RC ECO:0000313|Proteomes:UP000266841};
RX PubMed=22835381; DOI=10.1186/gb-2012-13-7-r66;
RA Lommer M., Specht M., Roy A.S., Kraemer L., Andreson R., Gutowska M.A.,
RA Wolf J., Bergner S.V., Schilhabel M.B., Klostermeier U.C., Beiko R.G.,
RA Rosenstiel P., Hippler M., Laroche J.;
RT "Genome and low-iron response of an oceanic diatom adapted to chronic iron
RT limitation.";
RL Genome Biol. 13:R66-R66(2012).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EJK59784.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGNL01022316; EJK59784.1; -; Genomic_DNA.
DR EnsemblProtists; EJK59784; EJK59784; THAOC_19955.
DR Proteomes; UP000266841; Unassembled WGS sequence.
DR Gene3D; 3.40.50.10190; BRCT domain; 1.
DR InterPro; IPR001357; BRCT_dom.
DR InterPro; IPR036420; BRCT_dom_sf.
DR Pfam; PF00533; BRCT; 1.
DR SMART; SM00292; BRCT; 1.
DR SUPFAM; SSF52113; BRCT domain; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000266841}.
FT DOMAIN 1138..1222
FT /note="BRCT"
FT /evidence="ECO:0000259|SMART:SM00292"
FT REGION 716..740
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1298..1391
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1344..1358
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1391 AA; 155016 MW; 9620FA8C5BEAB252 CRC64;
MVEAANDLVD ACEKQTIPER LQALIDDIVE ESPHENNEPI SQHDAHRFVQ SLAGHLKNWV
KDQQGHSSAV RFDDDAFALA FNHFLRCPAS YKQMRSDDCL VRPSESLFKK LKKDIGVEGG
QSFNTAILQP FVRGDNGREE DSAEASAFMV DGAGLGVNWS VDSKWRSGCG SEWGEVACDE
ATISQGMVTN SKNNVSRGIT NDFFELRRVM SNITDSDTIE SMDRPATKVN QWWFRATNGR
MFCVMSFFNA GSLSGDVILR QLIAVIMACE AVGSRVFLFA CDAGGANQGT FKSLRGNEAI
PDDMVWLDDE YIRFINPYDP SRWIYLTHCS THNLKNMRNR VKESKADGSK CFLDVNGNII
TWDFVNQIYN DEQLLIDGTG TPTSGLTEKA VYPDRWSTMN AQHAKAPFAP KALFVMLAVI
HRELNVSKDE AYRVVDHYKV IKELEVIKNS GMNEKAARMI GYFTTVAKRL WDLFNARCPG
NLNLAGKIAT FEFFAHTSEI YNLRLLKMDE LITWANIKVY KEQAKKNLTY FSDLHLAMIK
RRKDKKYSAD WRMTGLPGCT FDILRITVSG FFGYAEYLFQ LAKTSPSLPE SVNRDFAVTP
AQSNTSFGES FFSWVRARGL DNACSFLVAV LNTAMIDMLK NSLDHNPMYD DEDVGKVTKD
ANRLGIRKIV TILKHLRKVV ARLIGEYNEK SEPYASPAAA FSPGAVAFAQ DEGILPSDDE
AEGDEPGDEP PPPARPTSAQ MGMIIIERIQ EKRKLRNGFA SYLLERKSFR EMITVTMGQE
IWPFFVSVLH DTRELDVSKK FDTACRLIMD KLLGMSLNSV LRNSRGESYE YQLFHFLSSK
EFAEICFANL PTDAMKYISA GWVFLGLELS DILQEWLVIE AKGVRMKLDP DLFNASRTLQ
LGPEELKSET NRFGGWAGKS CMARRSKYLG GDAEKCSKDP QYRLLDLIVC YEKDLSEDYV
ATKVEFALYL SNYAGEGGLS FVAEPFFDAL ISIMRVVSTA VTLEDFASNN SGDVWAKGKN
LVSLSSNLLS DFSLACKRQV DIAQEADPTF PDINHAVIAK VYTDVVTKIC NARFGAVEAA
YKLRASLKGD AHLSFRGMLF ASVEGSKKKS DTDDVNEKSA SPGTITLPFP GDRTAVGKDF
LKGKRIVICG CYEEIAEIAL IAKAELRSCL ESFGAKVGVS LTDSTDYFLA GKGTPPAKIK
KAQEKNIRIV NLNRMLRLLR GNLKDFNAMH ALHPLDKTSF TDKNYERAVD EAADSAEDAG
VEPDADDEED FATVELDADK VRAKAARVAK IGAKPATYAE ALKQTPPKRN HRRRRALAPL
TTNAQAGDRG AAATEVRTQT PPQQQQKKKK RRGGGRKKRG AGPSPATSPS NSDKKAKAPR
KESKKKKKEK K
//