ID J9I5J9_9SPIT Unreviewed; 1976 AA.
AC J9I5J9;
DT 31-OCT-2012, integrated into UniProtKB/TrEMBL.
DT 31-OCT-2012, sequence version 1.
DT 24-JAN-2024, entry version 33.
DE RecName: Full=HEAT repeat-containing protein 1 {ECO:0000256|RuleBase:RU367065};
GN ORFNames=OXYTRI_13997 {ECO:0000313|EMBL:EJY65845.1};
OS Oxytricha trifallax.
OC Eukaryota; Sar; Alveolata; Ciliophora; Intramacronucleata; Spirotrichea;
OC Stichotrichia; Sporadotrichida; Oxytrichidae; Oxytrichinae; Oxytricha.
OX NCBI_TaxID=1172189 {ECO:0000313|EMBL:EJY65845.1};
RN [1] {ECO:0000313|EMBL:EJY65845.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JRB310 {ECO:0000313|EMBL:EJY65845.1};
RX PubMed=23382650; DOI=10.1371/journal.pbio.1001473;
RA Swart E.C., Bracht J.R., Magrini V., Minx P., Chen X., Zhou Y.,
RA Khurana J.S., Goldman A.D., Nowacki M., Schotanus K., Jung S., Fulton R.S.,
RA Ly A., McGrath S., Haub K., Wiggins J.L., Storton D., Matese J.C.,
RA Parsons L., Chang W.J., Bowen M.S., Stover N.A., Jones T.A., Eddy S.R.,
RA Herrick G.A., Doak T.G., Wilson R.K., Mardis E.R., Landweber L.F.;
RT "The Oxytricha trifallax Macronuclear Genome: A Complex Eukaryotic Genome
RT with 16,000 Tiny Chromosomes.";
RL PLoS Biol. 11:E1001473-E1001473(2013).
CC -!- FUNCTION: Involved in nucleolar processing of pre-18S ribosomal RNA.
CC {ECO:0000256|RuleBase:RU367065}.
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC {ECO:0000256|ARBA:ARBA00004604, ECO:0000256|RuleBase:RU367065}.
CC -!- SIMILARITY: Belongs to the HEATR1/UTP10 family.
CC {ECO:0000256|ARBA:ARBA00010559, ECO:0000256|RuleBase:RU367065}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EJY65845.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMCR01021143; EJY65845.1; -; Genomic_DNA.
DR EnsemblProtists; EJY65845; EJY65845; OXYTRI_13997.
DR OrthoDB; 5480100at2759; -.
DR GO; GO:0005730; C:nucleolus; IEA:UniProtKB-SubCell.
DR GO; GO:1990904; C:ribonucleoprotein complex; IEA:UniProtKB-KW.
DR GO; GO:0006364; P:rRNA processing; IEA:UniProtKB-UniRule.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR012954; BP28_C_dom.
DR InterPro; IPR022125; U3snoRNP10_N.
DR InterPro; IPR040191; UTP10.
DR PANTHER; PTHR13457; BAP28; 1.
DR PANTHER; PTHR13457:SF1; HEAT REPEAT-CONTAINING PROTEIN 1; 1.
DR Pfam; PF08146; BP28CT; 1.
DR Pfam; PF12397; U3snoRNP10; 1.
DR SUPFAM; SSF48371; ARM repeat; 2.
PE 3: Inferred from homology;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|RuleBase:RU367065};
KW Ribonucleoprotein {ECO:0000256|ARBA:ARBA00023274,
KW ECO:0000256|RuleBase:RU367065};
KW Ribosome biogenesis {ECO:0000256|ARBA:ARBA00022517,
KW ECO:0000256|RuleBase:RU367065};
KW rRNA processing {ECO:0000256|ARBA:ARBA00022552,
KW ECO:0000256|RuleBase:RU367065}.
FT DOMAIN 97..210
FT /note="U3 small nucleolar RNA-associated protein 10 N-
FT terminal"
FT /evidence="ECO:0000259|Pfam:PF12397"
FT DOMAIN 1671..1842
FT /note="BP28 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF08146"
SQ SEQUENCE 1976 AA; 229832 MW; 45C039DED9262CC3 CRC64;
MIQVVNLKQD DYFQFLEQFA FKGQALDKNI LIKSLARNDA IVFSKYSQFC FDLADFHQEI
SASLISVESC LHWKFFGTML SEVLKSQAGQ SQTSLFNLLP FISKAMKSEI KDFKISSYMA
ISQISCRRSL STEYASAFFK QVLLSLKDQQ NEEYIEKGIL VISLIIQYQN IKMIDEKDVN
QFLQIFRVQK LIKRMADQQD LSSLLKTIMT STLVQTTIDL GKVYQLLISG SLNLKLAYLL
IENAVSLAMQ NDNLLPRCHE LLNLIKTHHE NDYNQALLFI LKQSKESKDK IIRVISGESG
NSNLIAQFGE KEYSLLSALG SQHKITKVNA LQSVVENINK LTQNEKDLII LQLVTIIQSA
SDEENILEQV LSVYDKVVHQ NHNKDKQAFD FEFETLHTFL DFNLQNKRYS IKINEKLTTM
LLKLNPQAII NQQLKMILLL NQDKITNANI VPQMFRDILK SQKQQITREE LIQSLVNANK
NTEVKDLVLF ALSNHKQFKQ FHHIEKFVSL YLQKVEYSVE LIKSLLQTLK TSLSSLSVNF
LTQFLQTILT KLPNDVIQTA PKIFSKILGL ALNLKQKDLT YNMNQIISHL ITNFLNNQIQ
RYVEYLIQLC LETKNDLKKM FCLTNLSQLS KLRQIEPAFM VITALVCMND QSQLVRSGAF
NLASILISQG ESLELLEIFK NTPKPRSAKK SFSDKQAFTP MKQKPMKSIL KDLIQMKTEI
LTDRSQLSVG LNNSPHVKLI DQVVRSMIDL PSIKVKSKFV EVLSLLNNFS VIFSMGLQEQ
LNKEFLMINQ NKLSQDSQDY LYQVGQLIQQ VCRRQTHETI EEHISSLVFP YISLIFTLSE
KKVQYNYKHE ELLQNLFTKI IIKEDLLQVK PQNISELLRY YIKSQIYIYS KLIRNQIKES
FVALREEQFS DKEINPILNE IIEKITSKKD VESSLANYEI LLELIAQIKV ENDYQNLPQI
LNIVKTLSNE MSQQNVFYII ELSNQVALTI LENKKNSKKN LGKNLEIIVG VLDIHIDKLK
NEIIQKVENQ VSQTSDPFKD FIMEVDTSSK QDQKKEELQT EEFTGSFQII SSILNLLSQY
EIIIRTNLTI TFQIFDKLSS LVQFLTRHEK SSSAVATSLI FQLFRFYHNQ VVRTCKNSQK
KQQDLKLLLA QYIDTLLMLQ DFVSDTTQLF GYLEKVSFTL ENQYSTFIIT ILLCHAKDQI
QVKDNIQIDS LEIQLIQRLI LQTLTPLQSL QVLEELLFTV SVFKISETSN DFIKTKLKLV
FNQDLVTKRM DEASWSQNRY LITSDQEVRK FKYLSVFLIN SWISQNQQYQ KHLSLTLHQD
NKELKQTYQV FYQIYLTSSI FNEQINHLLQ TKDYASDKKA KKSLKRLSSR VQEFQKNLNT
LLSYEVQIDI VLRILEFKDK QTTYLKTQSL QLLISKTPQF SQGLSKVQER DLIHSFTPLI
LKSIKILQEF EAGSQKFNNE KQNFIHALFC LITRTFTSQL SIEMKQQILD LSFQYISLSE
QFSMILTSQL ILTLMTVFEV QQLDILEHLP RFITHLIRSF QRLHIEGTDQ EFFEESYSKT
LLKATLSLIT FFSKFLTATQ YEELIINVLV VASTEHNQEV IQIFDLIAKE SSKNIQFKLI
FQAMINSYET VIIQGGVIEN KYKNNLNIII IRFFNDLMKP IVMRMKKDFC QENHNKIYLF
FKDAFELTLN YYRNNNKQEL SGFGNIEQAI AESFEQFVVK LNEDQLRPII VKLSKWAFKT
INDADQAVPF NIFKTTVFYR CLNTVLNTIK EFFVPLLPLY FERTLELLIS LASQQQATGK
KRGRIQVDFE VELGHQQHTL FDLMKLACEN IRLNFLYDNL AFIQNDSFEK LSDPLSNLVA
LEQLGKHYLP FIEDTLKPTI FEAVERINND DMWKKINNEL LMHTRNTNPT VRLGAFRVIE
NLYTKIGERY LVLLNDTIQF LSEGMEDENP DVEATARSIV QRIESITGDS IHEYLK
//