ID K0RWL8_THAOC Unreviewed; 2257 AA.
AC K0RWL8;
DT 28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT 28-NOV-2012, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EJK56854.1};
GN ORFNames=THAOC_23174 {ECO:0000313|EMBL:EJK56854.1};
OS Thalassiosira oceanica (Marine diatom).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC Thalassiosiraceae; Thalassiosira.
OX NCBI_TaxID=159749 {ECO:0000313|EMBL:EJK56854.1, ECO:0000313|Proteomes:UP000266841};
RN [1] {ECO:0000313|EMBL:EJK56854.1, ECO:0000313|Proteomes:UP000266841}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1005 {ECO:0000313|EMBL:EJK56854.1,
RC ECO:0000313|Proteomes:UP000266841};
RX PubMed=22835381; DOI=10.1186/gb-2012-13-7-r66;
RA Lommer M., Specht M., Roy A.S., Kraemer L., Andreson R., Gutowska M.A.,
RA Wolf J., Bergner S.V., Schilhabel M.B., Klostermeier U.C., Beiko R.G.,
RA Rosenstiel P., Hippler M., Laroche J.;
RT "Genome and low-iron response of an oceanic diatom adapted to chronic iron
RT limitation.";
RL Genome Biol. 13:R66-R66(2012).
CC -!- SUBCELLULAR LOCATION: Cell projection, stereocilium
CC {ECO:0000256|ARBA:ARBA00004645}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EJK56854.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGNL01030309; EJK56854.1; -; Genomic_DNA.
DR EnsemblProtists; EJK56854; EJK56854; THAOC_23174.
DR eggNOG; ENOG502TAG5; Eukaryota.
DR Proteomes; UP000266841; Unassembled WGS sequence.
DR Gene3D; 1.25.40.20; Ankyrin repeat-containing domain; 2.
DR InterPro; IPR002110; Ankyrin_rpt.
DR InterPro; IPR036770; Ankyrin_rpt-contain_sf.
DR PANTHER; PTHR24153; ESPIN; 1.
DR PANTHER; PTHR24153:SF8; FORKED, ISOFORM F; 1.
DR SMART; SM00248; ANK; 5.
DR SUPFAM; SSF48403; Ankyrin repeat; 2.
PE 4: Predicted;
KW Actin-binding {ECO:0000256|ARBA:ARBA00023203};
KW Cell projection {ECO:0000256|ARBA:ARBA00023273};
KW Hearing {ECO:0000256|ARBA:ARBA00022740};
KW Reference proteome {ECO:0000313|Proteomes:UP000266841};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REGION 37..90
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 126..170
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2199..2250
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 70..90
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 126..155
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2232..2250
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2257 AA; 250023 MW; 9A947855BA213FE3 CRC64;
MEEFDSELHT LCEQARWAEV AALVALYIES IAARDAAVVD SPSKKEDEDG SSLSPSYLDT
KAVKHHGASS FAPRSNTSAT ENQTSDICSS TVSGSYVDEL RYSYGGIQKH EGSGSMGTDT
IQIPTAQMSD QNGSEESISS SNRAGEEDSV SSIHNTRPPT PPPDKFDDDV MSQGRIDLLK
SQLISRVGPR RWTPLMIACV DAPVQVISLL LRACPEACGI PDRSGNLPLH IASKWRSSKL
VVDESLTEDV DDSPYELPLV LYMLMVAHPE SVATMNRWKQ TPLHSLFESK HPVLPSHIPT
CKGSGTQLAA VETMLGRWDN EVYRIFDEHT CVEDVTEVKQ LVEYATNCGL RVHDGNGRLP
LHCAALCQWV DLSIIRVLIQ AYPPSTWTPV LPTSECDEDS ASSSYSWDVN RDSVATDGNI
YSTEGGYHGR DLAVHLFHKR CMLPAELSGH SASEDELWSK SHVTNASAFL SADFHCEAIS
LLLTPMVDAA LHAINEARSA CSASGVPLED ESSSSSIVLP IHVACIHGVS FDLLEKLCRV
YPVSIQIPLT SMVHPDRMGM LPIELFEEGR AGHEVNNASD ASFPQLSTAY FKRSDLLFSH
FTEARALNGI FYYQDAARMS RFVDQIQREI NESPHHLISD TAGAVWMMFC RNYTKTRRKG
FPNFGTLVGR VLEGLDESVT PRLNVIKTES APGGISLCKL ANGRTVKEEA MARAPSGSLE
HILDGGEIRI FHRHVLSFLS GRDALSYSAT CLKAWAYGAR SLRKIKENGI IDCHGSFDCD
DFKAEDGQTL SMKWQDFTIP CVQKCTHSVM VSADISYRGW TEMDSSACSG GGIRIVGSDG
KLCGRTPPLK SKGGPGAQKA FPVAVIFNHV PNREYSFHCY GSTKHTLTVS NIKVSAKLEL
GAMNVIYPQR CEGFGQDLPI HYALNAGVEE RVLRCLIETN PAALLDTDSE GRTALHSAFD
TRKIPNLSCI QALMMHSGLH ALHLKDSRGR LPIHIASASG APSSVLALLV ESYADSCYRR
TDLPLHLLVR SGSANQVAVE ILLTPIMHSS SVCTFEGSVG VNLPLHIAAE FRVKYAILEA
LVTAYSEGCK TRRQLMKENK EIKEKCRPEY PLDIFETGRR AEGFTKDPDV ESDFDRRSDL
LFVYNPDVAK ASTANRYVQS YYRDDKMRID RLSSKIKTEA VQCKLSVVST LAWCWMCSND
SNLDAVSSIL SSLPIEAVRY LTQIENPNSQ PTRGMPMKDC STQRVNVIFK TSLSFLGRYA
FVDASPLYQS DSTLVLRAKD LGAVDTFLTI TKLLDDTEVD IDDYSHDCGS VYAIKSNSIE
ISTFELFVDR LGLNKGIAIS EIESLILDPQ EKDVERAPDL KELGVKSSAF KDFCRLHNVQ
DDGTRDVAIK FMKNVYSFEV ERQARDVLTE VSSTSGFVPI LHDFSLDEES VRLADELHTG
SMSLLDYKCG FIMPCADASL VECLARGDLD SSQIRGISKH MAETLRGIHE QGLHALGGTR
CRLSPSILPP EMIARISLTE GNSMKRVLDY WAHVKRDADA LCVLTPSERE AVSDYVRRSS
AGNADWRGEI STLFETIMFH DLPPVISSIA TLPDFCLIWA RLQENLSLWE VIRPRVDKKN
KCAYMVKFYE NRDDSPALDV SVLPYQVKPP SESVDTCNSR GTLWHTSFNG NLLGVRTFSA
LHNWDTSSAK DIINEHVQDP LAKDLLYSIL APSNQRAPSM AAILDHPYFS PESVDAERYL
ERHEELQILE ENTCHINRVS TSMATLFEES MEHYCKFAFG VEPVFPTCFV ALPYALKWNK
SSQRMEAPPY ASILLQAEKM GVALLEINKA TARLSFWARM NKRMSGPNNN AFKMQLQGWL
KRARNESCSL IAAEIIEELG IERNYVMIVE EVLSLDGSQS KARTYMRDPL RAAKKAVRQN
TSELIKLFDD QCYVYLVDEA TMLPMCPPQQ LSAYPFILEP NSKLIMNVLM PFINIAVMKA
LAKDKFVGLL KLLGIEGMRS VPSAWPKTEP VLLHNTGTRA MIEEIVSLQQ VLRKEDLSAY
QDDVSVSSFS VMSDAISVSN RSTFSVSALG LAHIDLAPLD PRIASLPVTQ MELIFREYDP
DRQFASLCRV TAGTKEQQEG TGSGMWTTYV TIQDMISMEE FSQIEDSLND LRQGKNDQAA
AEKTFTHLMS RRQQVLKTLA TVVSPAGLEL FGVDVPTTTA DSQLDGEAAG SGPTDAADPG
QSAAQSSRGR FKLLGRGKKT GKVRKSKKKF RPWFTAC
//