ID A0A090M906_OSTTA Unreviewed; 1098 AA.
AC A0A090M906;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 28-JUN-2023, entry version 24.
DE SubName: Full=Armadillo-type fold {ECO:0000313|EMBL:CEF99192.1};
GN ORFNames=OT_ostta09g03830 {ECO:0000313|EMBL:CEF99192.1};
OS Ostreococcus tauri.
OC Eukaryota; Viridiplantae; Chlorophyta; Mamiellophyceae; Mamiellales;
OC Bathycoccaceae; Ostreococcus.
OX NCBI_TaxID=70448 {ECO:0000313|EMBL:CEF99192.1, ECO:0000313|Proteomes:UP000009170};
RN [1] {ECO:0000313|Proteomes:UP000009170}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=OTTH0595 {ECO:0000313|Proteomes:UP000009170};
RX PubMed=16868079; DOI=10.1073/pnas.0604795103;
RA Derelle E., Ferraz C., Rombauts S., Rouze P., Worden A.Z., Robbens S.,
RA Partensky F., Degroeve S., Echeynie S., Cooke R., Saeys Y., Wuyts J.,
RA Jabbari K., Bowler C., Panaud O., Piegu B., Ball S.G., Ral J.-P.,
RA Bouget F.-Y., Piganeau G., De Baets B., Picard A., Delseny M., Demaille J.,
RA Van de Peer Y., Moreau H.;
RT "Genome analysis of the smallest free-living eukaryote Ostreococcus tauri
RT unveils many unique features.";
RL Proc. Natl. Acad. Sci. U.S.A. 103:11647-11652(2006).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CEF99192.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAID01000009; CEF99192.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A090M906; -.
DR STRING; 70448.A0A090M906; -.
DR InParanoid; A0A090M906; -.
DR OrthoDB; 5478662at2759; -.
DR Proteomes; UP000009170; Chromosome 9.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 4.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR024395; CLASP_N_dom.
DR InterPro; IPR034085; TOG.
DR PANTHER; PTHR21567; CLASP; 1.
DR PANTHER; PTHR21567:SF9; CLIP-ASSOCIATING PROTEIN; 1.
DR Pfam; PF12348; CLASP_N; 1.
DR SMART; SM01349; TOG; 2.
DR SUPFAM; SSF48371; ARM repeat; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000009170}.
FT DOMAIN 307..529
FT /note="TOG"
FT /evidence="ECO:0000259|SMART:SM01349"
FT DOMAIN 618..860
FT /note="TOG"
FT /evidence="ECO:0000259|SMART:SM01349"
FT REGION 1..52
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 526..561
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 576..603
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 11..25
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 583..597
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1098 AA; 119244 MW; C13D96C87C8C5B06 CRC64;
MARAGRWARH RRSPLRDGSS STSRRRRARA RDRRARVVGP ARAGGGRGRG RGWRGGDRVG
FEICPNKRQL STMIPSDVDV NAALRALEPA SRDANGFKAR SALVAMRELA MGAMRDGKDC
GAGASACWSG IRNALGSQKR EIRSDGVKTM CAFLEARAMT EKDAEERLGF AWMDNNWRFR
VSAIEILSGA CASEGGARRC FDRFAGALGD REAEVRERAM DGILMVHERF PSVVRAALEA
ARDDMRPQHV KELERRMNET ETGVRTIASS GGNTSASASE DGAVVAAPVL GDALVPPPPE
KISSERELAR AMDRIGRDLN PSQDWLQRIA AMVRLEAITL GGGADVYEET YTESLGKLVE
MLNAQIGDKR SAVVKQVSHL IVVLARNAST AFEKYVDQFL RALLKTTIVT IGVIAESGNA
CIRGVIAHCE APRIVNILAE TVVNERSPKM RRYIVEYMTL ILKSWSLNER QIESIGGALQ
KTLSDADAMV RSNSKACFEV LSVTAPAASE VLLTKVHSKV ARTLSGGAME SESETRGSRG
ASSKGGAAPK PWQKPPQGKR PQNDIVFEVV VAEKPPPAGA QAQPKQSIVN DRPGNAATRA
QPTPKPAAIV PDIAEVVMFA ERVERAAERA SRADACSKLR DALDDADVRT HGASAVQFEA
QVTLHASRIA ELILGYISDT NALVIDPALE AVSILVYVAT DELKPLMPDL CLGVFECLTD
YRESTRALAS EALTAIGDAH KPDALLPSLL RSLSLSETPR AKTGVLEFAL YVLSGRGGGA
NEVSYPPAKV SPDLESWIDL VFELACDVDE AMAKAAGSNL AAIYSHVDDS VVTNRLMGSS
EFKRVRFMEA LERRVPKLAR VLQPLLEAAQ PPPPKAKTPK FKAPNDAGCD SLDDDYADQN
VTLLNRMYET MKIEASPAKH RTFGERVVEA LEGLRDENVD VVVRSLRDIT ALVMEDFEQL
KTYLKLIVPA MCSTMDDRNE LVAAHAFGAL NAIFQNPRID AGDAFTALSP LVSASASSDS
PMYCVQTVID HAMTDAEDMD IILPSLARAC ESKTLAVRQR AFHALGTVQR VFGAEFVSPF
VNSMASEHRE LIEYYARK
//