GenomeNet

Database: UniProt
Entry: K0S4M5_THAOC
LinkDB: K0S4M5_THAOC
Original site: K0S4M5_THAOC 
ID   K0S4M5_THAOC            Unreviewed;      1391 AA.
AC   K0S4M5;
DT   28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT   28-NOV-2012, sequence version 1.
DT   22-FEB-2023, entry version 22.
DE   RecName: Full=BRCT domain-containing protein {ECO:0000259|SMART:SM00292};
GN   ORFNames=THAOC_19955 {ECO:0000313|EMBL:EJK59784.1};
OS   Thalassiosira oceanica (Marine diatom).
OC   Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC   Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC   Thalassiosiraceae; Thalassiosira.
OX   NCBI_TaxID=159749 {ECO:0000313|EMBL:EJK59784.1, ECO:0000313|Proteomes:UP000266841};
RN   [1] {ECO:0000313|EMBL:EJK59784.1, ECO:0000313|Proteomes:UP000266841}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=CCMP1005 {ECO:0000313|EMBL:EJK59784.1,
RC   ECO:0000313|Proteomes:UP000266841};
RX   PubMed=22835381; DOI=10.1186/gb-2012-13-7-r66;
RA   Lommer M., Specht M., Roy A.S., Kraemer L., Andreson R., Gutowska M.A.,
RA   Wolf J., Bergner S.V., Schilhabel M.B., Klostermeier U.C., Beiko R.G.,
RA   Rosenstiel P., Hippler M., Laroche J.;
RT   "Genome and low-iron response of an oceanic diatom adapted to chronic iron
RT   limitation.";
RL   Genome Biol. 13:R66-R66(2012).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EJK59784.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AGNL01022316; EJK59784.1; -; Genomic_DNA.
DR   EnsemblProtists; EJK59784; EJK59784; THAOC_19955.
DR   Proteomes; UP000266841; Unassembled WGS sequence.
DR   Gene3D; 3.40.50.10190; BRCT domain; 1.
DR   InterPro; IPR001357; BRCT_dom.
DR   InterPro; IPR036420; BRCT_dom_sf.
DR   Pfam; PF00533; BRCT; 1.
DR   SMART; SM00292; BRCT; 1.
DR   SUPFAM; SSF52113; BRCT domain; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000266841}.
FT   DOMAIN          1138..1222
FT                   /note="BRCT"
FT                   /evidence="ECO:0000259|SMART:SM00292"
FT   REGION          716..740
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1298..1391
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1344..1358
FT                   /note="Basic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1391 AA;  155016 MW;  9620FA8C5BEAB252 CRC64;
     MVEAANDLVD ACEKQTIPER LQALIDDIVE ESPHENNEPI SQHDAHRFVQ SLAGHLKNWV
     KDQQGHSSAV RFDDDAFALA FNHFLRCPAS YKQMRSDDCL VRPSESLFKK LKKDIGVEGG
     QSFNTAILQP FVRGDNGREE DSAEASAFMV DGAGLGVNWS VDSKWRSGCG SEWGEVACDE
     ATISQGMVTN SKNNVSRGIT NDFFELRRVM SNITDSDTIE SMDRPATKVN QWWFRATNGR
     MFCVMSFFNA GSLSGDVILR QLIAVIMACE AVGSRVFLFA CDAGGANQGT FKSLRGNEAI
     PDDMVWLDDE YIRFINPYDP SRWIYLTHCS THNLKNMRNR VKESKADGSK CFLDVNGNII
     TWDFVNQIYN DEQLLIDGTG TPTSGLTEKA VYPDRWSTMN AQHAKAPFAP KALFVMLAVI
     HRELNVSKDE AYRVVDHYKV IKELEVIKNS GMNEKAARMI GYFTTVAKRL WDLFNARCPG
     NLNLAGKIAT FEFFAHTSEI YNLRLLKMDE LITWANIKVY KEQAKKNLTY FSDLHLAMIK
     RRKDKKYSAD WRMTGLPGCT FDILRITVSG FFGYAEYLFQ LAKTSPSLPE SVNRDFAVTP
     AQSNTSFGES FFSWVRARGL DNACSFLVAV LNTAMIDMLK NSLDHNPMYD DEDVGKVTKD
     ANRLGIRKIV TILKHLRKVV ARLIGEYNEK SEPYASPAAA FSPGAVAFAQ DEGILPSDDE
     AEGDEPGDEP PPPARPTSAQ MGMIIIERIQ EKRKLRNGFA SYLLERKSFR EMITVTMGQE
     IWPFFVSVLH DTRELDVSKK FDTACRLIMD KLLGMSLNSV LRNSRGESYE YQLFHFLSSK
     EFAEICFANL PTDAMKYISA GWVFLGLELS DILQEWLVIE AKGVRMKLDP DLFNASRTLQ
     LGPEELKSET NRFGGWAGKS CMARRSKYLG GDAEKCSKDP QYRLLDLIVC YEKDLSEDYV
     ATKVEFALYL SNYAGEGGLS FVAEPFFDAL ISIMRVVSTA VTLEDFASNN SGDVWAKGKN
     LVSLSSNLLS DFSLACKRQV DIAQEADPTF PDINHAVIAK VYTDVVTKIC NARFGAVEAA
     YKLRASLKGD AHLSFRGMLF ASVEGSKKKS DTDDVNEKSA SPGTITLPFP GDRTAVGKDF
     LKGKRIVICG CYEEIAEIAL IAKAELRSCL ESFGAKVGVS LTDSTDYFLA GKGTPPAKIK
     KAQEKNIRIV NLNRMLRLLR GNLKDFNAMH ALHPLDKTSF TDKNYERAVD EAADSAEDAG
     VEPDADDEED FATVELDADK VRAKAARVAK IGAKPATYAE ALKQTPPKRN HRRRRALAPL
     TTNAQAGDRG AAATEVRTQT PPQQQQKKKK RRGGGRKKRG AGPSPATSPS NSDKKAKAPR
     KESKKKKKEK K
//
DBGET integrated database retrieval system