ID D7FJ23_ECTSI Unreviewed; 2779 AA.
AC D7FJ23;
DT 10-AUG-2010, integrated into UniProtKB/TrEMBL.
DT 10-AUG-2010, sequence version 1.
DT 24-JAN-2024, entry version 46.
DE RecName: Full=HEAT repeat-containing protein 1 {ECO:0000256|RuleBase:RU367065};
GN ORFNames=Esi_0125_0072 {ECO:0000313|EMBL:CBJ49062.1};
OS Ectocarpus siliculosus (Brown alga) (Conferva siliculosa).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; PX clade; Phaeophyceae;
OC Ectocarpales; Ectocarpaceae; Ectocarpus.
OX NCBI_TaxID=2880 {ECO:0000313|EMBL:CBJ49062.1, ECO:0000313|Proteomes:UP000002630};
RN [1] {ECO:0000313|EMBL:CBJ49062.1, ECO:0000313|Proteomes:UP000002630}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Ec32 / CCAP1310/4 {ECO:0000313|Proteomes:UP000002630};
RX PubMed=20520714; DOI=10.1038/nature09016;
RA Cock J.M., Sterck L., Rouze P., Scornet D., Allen A.E., Amoutzias G.,
RA Anthouard V., Artiguenave F., Aury J.M., Badger J.H., Beszteri B.,
RA Billiau K., Bonnet E., Bothwell J.H., Bowler C., Boyen C., Brownlee C.,
RA Carrano C.J., Charrier B., Cho G.Y., Coelho S.M., Collen J., Corre E.,
RA Da Silva C., Delage L., Delaroque N., Dittami S.M., Doulbeau S., Elias M.,
RA Farnham G., Gachon C.M., Gschloessl B., Heesch S., Jabbari K., Jubin C.,
RA Kawai H., Kimura K., Kloareg B., Kupper F.C., Lang D., Le Bail A.,
RA Leblanc C., Lerouge P., Lohr M., Lopez P.J., Martens C., Maumus F.,
RA Michel G., Miranda-Saavedra D., Morales J., Moreau H., Motomura T.,
RA Nagasato C., Napoli C.A., Nelson D.R., Nyvall-Collen P., Peters A.F.,
RA Pommier C., Potin P., Poulain J., Quesneville H., Read B., Rensing S.A.,
RA Ritter A., Rousvoal S., Samanta M., Samson G., Schroeder D.C., Segurens B.,
RA Strittmatter M., Tonon T., Tregear J.W., Valentin K., von Dassow P.,
RA Yamagishi T., Van de Peer Y., Wincker P.;
RT "The Ectocarpus genome and the independent evolution of multicellularity in
RT brown algae.";
RL Nature 465:617-621(2010).
CC -!- FUNCTION: Involved in nucleolar processing of pre-18S ribosomal RNA.
CC {ECO:0000256|RuleBase:RU367065}.
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC {ECO:0000256|ARBA:ARBA00004604, ECO:0000256|RuleBase:RU367065}.
CC -!- SIMILARITY: Belongs to the HEATR1/UTP10 family.
CC {ECO:0000256|ARBA:ARBA00010559, ECO:0000256|RuleBase:RU367065}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FN647904; CBJ49062.1; -; Genomic_DNA.
DR STRING; 2880.D7FJ23; -.
DR EnsemblProtists; CBJ49062; CBJ49062; Esi_0125_0072.
DR eggNOG; KOG1837; Eukaryota.
DR InParanoid; D7FJ23; -.
DR OMA; PEVESEC; -.
DR OrthoDB; 5480100at2759; -.
DR Proteomes; UP000002630; Chromosome LG18.
DR GO; GO:0005730; C:nucleolus; IEA:UniProtKB-SubCell.
DR GO; GO:1990904; C:ribonucleoprotein complex; IEA:UniProtKB-KW.
DR GO; GO:0006364; P:rRNA processing; IEA:UniProtKB-UniRule.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 2.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR012954; BP28_C_dom.
DR InterPro; IPR022125; U3snoRNP10_N.
DR InterPro; IPR040191; UTP10.
DR PANTHER; PTHR13457; BAP28; 1.
DR PANTHER; PTHR13457:SF1; HEAT REPEAT-CONTAINING PROTEIN 1; 1.
DR Pfam; PF08146; BP28CT; 1.
DR Pfam; PF12397; U3snoRNP10; 1.
DR SMART; SM01036; BP28CT; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
PE 3: Inferred from homology;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|RuleBase:RU367065};
KW Reference proteome {ECO:0000313|Proteomes:UP000002630};
KW Ribonucleoprotein {ECO:0000256|ARBA:ARBA00023274,
KW ECO:0000256|RuleBase:RU367065};
KW Ribosome biogenesis {ECO:0000256|ARBA:ARBA00022517,
KW ECO:0000256|RuleBase:RU367065};
KW rRNA processing {ECO:0000256|ARBA:ARBA00022552,
KW ECO:0000256|RuleBase:RU367065}.
FT DOMAIN 2417..2632
FT /note="BP28 C-terminal"
FT /evidence="ECO:0000259|SMART:SM01036"
FT REGION 1..25
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 760..781
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 833..865
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1201..1248
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1384..1411
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1520..1539
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1567..1595
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2138..2182
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2556..2606
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1204..1218
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1386..1407
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1567..1591
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2167..2182
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2557..2571
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2779 AA; 288046 MW; 355DAC070C106528 CRC64;
MASSLRRQVD ALRHASSGTS SGYIQRKKGK PSLLLTPDQA AEVDLSSVRE TAVMGLQRLE
AVDQDFSPFQ DTLFSKEAEG TLRDLQTKES NDRIDKSISA FLALLSPHFA CREAHFCLEY
LIRHYKVYQY NVDAVVECCL PWHDTVAFAR MVQLLSLKGS RWEFLTPVQK TGSPLPREAI
ARRCARDFGL LRFLCVSARR AAARAASCSV ATAQSTSTSS VSAATAAGRA KLFSFYAATV
VEALASLGTT SDPLLRALVP TVVHGLSATK SPEYQMASYM ILSALSGKGP LSLDVTGQAL
TALVCKPCQG GLEPSLLCAL AVAQTQVGFY SKKDPSNSSA AAPLLLPTAA FEKLSAMESL
AGVLAGLAQK FDATAFLWLL LQSYVQHLPD QDSRHGESLT RLCTYPYWPS HSLPRLIRRL
VDRLIRSYLA LTLEGAEDEA AMEVDDAPEK SRRSTAMKTE SQTVLRVLSQ RYPEEVDAAV
AKISRSVTKA RTATAKAALQ DDDKRNDAGV MTAKREAHLK KEALSSLLVS TFAGAQLAPH
LPLQQQEEDA ATVGQEGRED GAVHGSLMVA LEHSSAVVRA RAVEQLSRAI LSAAENDGAA
AGESASTSAG GVEELSPVLV RRLHDEDPDV VLAITESDTL VQRVLLRPRS FSPMADDSTG
GDGVGGADDS DVVDRAAAVA SAAMTAAGPW LSAISEVRPK HPVASAGRVL CGLVRLAAAA
SSACGEGGSA GVKQARDPAL SLLLECLPGP HATARVRSAQ RQAVGADGRG DSATDEGATD
PAAAAAATGK ACKKASRAVG RAAIEAVSRL GVGVEGDALA GLFAVVGKVL NKSDDEGSGK
GSAAKSPGKN NSKKKKSGDA DDLEAEGVKS KGLKVMGEEV CDALAAAIVG PGDLDSKNHQ
LQTLRDTCGA GGRWLVLECT YRALNLASAS SSGGGNAKAR AAATAAASSL SSLLASLATS
ELRLRPHLLS ASPDGDDAKV VGAVGAADLL RYLRACAAHL PRLERSSSLF AEDHLAASAA
TPSVGGNCDT SVGGDGFAVT PAPLLGDVLV SVLTATAEGG SKGGATAVWE AVSAVVLHGY
ERRPLPALAA VAIEGVTVAG RRGRGVAARQ QPRRGGGVDD AKLVAAARAL CVAATFVRAG
AEAVASGDTV EDGESSKAAE RDIVAVFPAA VLAVASDDEG LRDSGLLFIT TVSAHGDALS
AELSGSSGGA STSGAKKSKP APSPGPRVED SEFLPLGQMG DGAAEPSSLN APSLSSLVEL
AAALAGKNRE SALDAGELTR HLATVLGEAA GGGWQEDVLR YLVAWAARLG WRDPTASALL
LECLVWAKTP MRECCLPLLR CCLDSEQAQN NQNNERLLNI LLGVTLNLAV GDATVAAVAV
AVAEREESSR NTKSPKKTTR GSKDKASAAE DYPASASSTI FASVEAEQLM HRALRSPGPA
QSHALHCLCA TSLPSCGVGA QPRGEDESTR QRRDLVAVLV EAGLAGADGS LVAAAARALP
CLPGDMAALL VGLPPLSGAG AGASTPSKGR KKKSSAAFKA DTATTETGWV LTLGGLVELA
QALCGPSSSS SSVPDQQQQS QSESHSTKGV RNGRGAVTAG ATTSAEGWGW AGPLAIARPL
FDVLPSLIAV VTKGAAEEEG SSSTAVGGGD VVDAAEYGLW LTMDVLGGVL RRWGKGGVGG
GGGSAAAEKL YEGCRAGEDA ETVLACVREN PSPQTRNSAL ALLSRMATLF PAQMASRLRS
LLDAVCAAAA GGEGRSSGGG GGGGARAALR QTQKAVQSVV PALKAHGSEA GVGAHFVVQV
FVEALDDAPA HARRALFSTL VDSLGEESLA IMSALLLRRA VSSGRPDVGG KSADEETTSA
LIEFVHQTIH RSRARGQVRT LVGLVQACHR LSVSAAERSG SLSGQALDEA VECFVEDSAA
SEKGEDQLKT DYLQLPLVQQ EPARDKKASS PAFMTMDLRA LSEEDSKYMA GNFLELVLSF
VRDHLVSRPL VMAVAAAAER GADGKGLSED EGIQEGFLLL CEELLLFLRT LSVCGRVSSS
GSGDEPAWSS LQGLAYAVFE TLQSLLSVPS FVAVTQELLL HQDPHLRRKA LRMFTRRLDP
AEGGGSGGAP RLSPGEESLF VEMVPSLRLV AMGKRSVGRA GEDDDDIGEG IEEVMGPSSP
EGGEGGPDAM DQDRDEDGMK ESAVNRQTAL LSLDVLARVL GRRHQGAFEG VLGDVTEMVA
GVGPGALPAV GQPGSDGILP LRASAFLLVA TLCAVLGVRA FPRLPRFFPA MLEALEFQTP
FTTAVTDGAA GAGGRGGGRS LLWTSALSAV ATVAASLPSF LSPYLGRILA VALRPASAGA
GSSGSPAAAG SKQAADRVLS LLSTGVEARL LVPAVCGAYS GCVESVKAGE EEGVGAAGRS
IARLLAYVQE IVAGLEKAAA SAALPQLTRL LTQALDFRRQ HASGSTPQVR ESAALVETEA
SSALVGLVMR LSEVELRPLF LHLCEWKAGV SSGEGDLKAT LGALDRRLSF YRVLDGLAGA
LKSIFTPYFA HVLTDCCDDM EAASLLTSVA ANGSPAAAKK KKRKRASEEA SSKRKRRKTL
SSGDASDSDE EIGEGNDGEG EDTPELRWRR SAASRLVLSA LRRCFQSDRS GFVNKTRFEL
VLPAVVAQLE CGSDFSAAAA GGGAEDDVSS CRLHAEELVG PCLAQLASAS GKDALWKALA
NAVLMKTRSR RAGVRVAALV SLRQCFEVVG EEFLALLPEC LPFLSELLED GHPEVESECR
ALVKYIEGVL GESIESYLV
//