ID A0A0V0SEY2_9BILA Unreviewed; 2711 AA.
AC A0A0V0SEY2;
DT 16-MAR-2016, integrated into UniProtKB/TrEMBL.
DT 16-MAR-2016, sequence version 1.
DT 24-JAN-2024, entry version 15.
DE SubName: Full=Small subunit processome component 20-like protein {ECO:0000313|EMBL:KRX25357.1};
GN Name=UTP20 {ECO:0000313|EMBL:KRX25357.1};
GN ORFNames=T07_5922 {ECO:0000313|EMBL:KRX25357.1};
OS Trichinella nelsoni.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia;
OC Trichinellida; Trichinellidae; Trichinella.
OX NCBI_TaxID=6336 {ECO:0000313|EMBL:KRX25357.1, ECO:0000313|Proteomes:UP000054630};
RN [1] {ECO:0000313|EMBL:KRX25357.1, ECO:0000313|Proteomes:UP000054630}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ISS37 {ECO:0000313|EMBL:KRX25357.1};
RA Korhonen P.K., Edoardo P., Giuseppe L.R., Gasser R.B.;
RT "Evolution of Trichinella species and genotypes.";
RL Submitted (JAN-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KRX25357.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JYDL01000012; KRX25357.1; -; Genomic_DNA.
DR STRING; 6336.A0A0V0SEY2; -.
DR Proteomes; UP000054630; Unassembled WGS sequence.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 2.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR046523; UTP20_C.
DR InterPro; IPR011430; UTP20_N.
DR PANTHER; PTHR17695:SF11; SMALL SUBUNIT PROCESSOME COMPONENT 20 HOMOLOG; 1.
DR PANTHER; PTHR17695; UNCHARACTERIZED; 1.
DR Pfam; PF20416; UTP20_C; 1.
DR Pfam; PF07539; UTP20_N; 1.
DR SUPFAM; SSF48371; ARM repeat; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000054630}.
FT DOMAIN 873..1519
FT /note="U3 small nucleolar RNA-associated protein 20 N-
FT terminal"
FT /evidence="ECO:0000259|Pfam:PF07539"
FT DOMAIN 1732..1967
FT /note="U3 small nucleolar RNA-associated protein 20 C-
FT terminal"
FT /evidence="ECO:0000259|Pfam:PF20416"
SQ SEQUENCE 2711 AA; 310874 MW; E27F48D849736FF7 CRC64;
MKHCKSEKKF RFIGFNERIK NIHVDITRQC RSFETAPDDE ETFFAQTLAK QSELNYSKEF
SDFLQDISTL AINTYPLLIF HKDEILRKLK KHLLEKNISS ISSLLELCIA VCRDLHDEFQ
PYFWEIFEVI CDILSVYNDP DVLEACFSTL AYLLKFSWRR MIHNSRDVFQ HFQHLLSHTK
EYIRRFSAEA LVFFIRKAGD FGDIVEYCLS KIDEDNALLL GMAELFIQCI IGPQNNFHGK
AKMVRNVHVT HMMKTLLSFV HISPLKETAY ALLCQVNIAL VQHAVVDTAA IIFDCILDEG
DCIFDKLLRN DNAALDVKVA YLMDLTSVWI EMKQGCLVPF ETKEKIIKNW LMRLLVNGNF
EWKIFNSFIN LGRRIFQSPD VCTKCTILGD FLNLLFDVGA QFEHAVLIMP DSLFVVEVYQ
CRIIRFFASY TDRYLVENEI FKNCRLLASL CKMSSLAETN AVGNGQWPLA TRRLFSPLQK
GTIVNLLDLI FTKFSERFAL AGSDKAEMGI LLPHLICAAL IYPMVTMDRD EHVALTNVTT
NLALSSLVVV GKEYSLALRC LFKAVHQIKE SAKLTEMVIY LAERMLANLH SGAKMAFYLE
ATVSCVEYLN DRLSNFDLQV DVIDSACRAA ISNLWLSDVG TRVLSLRLLR HLCTLGGCRI
GVDFADLCLE AESVPATIHQ YRQRLLVYMK ICSLCKSGSS EKTVLRNWSL IGLVLRFLVG
QFHVNLSLLW NHVIETIATT MEMANDADQC WTVLVDALEF GTANLANSNT SEQCCFSVEE
ENILSRAISE RLGMGESNNS REDFVNFRHL VWLTLDKLAT SSPAPSGQVE RIVDSALSWF
NVEYQFNDMN TIWFDSRVAV PCGSGGDVSV KLQAKTAVAA LNVLGKLSKN HRRLFVRLES
LYTRLLRQTR REVQAAALNC VLGLDDELEI YRPHLSCLLT GDNFCDKLIK LACRFDQLAV
EESERRRRLA TVLLNLVYGL LRSRGSGSTL DLRRHLSQLF AFLGYCNSDE LNEFFELVIS
PIGCHFHQHY SNNQIGQTIQ ALDSVFEGRT VSDLRYGVLG EIVEFLNQMV EKVSDRMDEK
CQRRIFHAAL ILQVIVNWIQ LTAGGSGENG KVHEEETNED GREEDCVWKK RGKKIRKQLD
GLFVKFSTQF GEYPFSDVEL GLLFELVVWP ALVVDDDDNV QGRLCKLFVV WSESELLWPL
LYCSKWNRSP VHCLVSTLGK RNAPAAVHRF AIDLVDGLLQ LDSNPDSCWA TNSEIANFAQ
KYWQVVDVQH RRPDFFVENY AEIIIAYCRE DFEKSGDHRR PIDRRKLDIL CRLCGRLDVA
RSRGCGSSLI TSLFEQLQCR AAAAVDDEIE IQLILFDAVL RLLEHSSNDT LQYLDSVINL
YACLRQRSVR TRLHEIFTAI VADDPEYIKL SQIIDGLNSW NKRQVEEFDY DRRLAAHQQA
NHFLYEVSKR ALKPTLSLLF YNCFHFIELV DDLSLRTNAT HTVDQCFHCM STRLVQDEWN
SILNHIVLPY LTNALKNQND IIRNSFVGIL RSCILKFATF HDQLKPLFQL VDNVDLELDF
FENIRHIQLH RRARALRRLC DQLETSSDPN TTIRPRQIRL YILPLISAYL SDETFSNKHP
SLVDECVHLI SCYCRLANWT SYSCILRKYL HKLKTDLNNQ KLHTRIVVSI LDSFHFINFD
LPSSRKQQHV VDSLSKIILP KLKAALIGRF DQVDDNKFDL HRRRAGKLKL SNETDELIRV
PIALAMIKLL VKLPAHIASS DISRYKKRKE KKRIIHCCSI TILKVCNMLK SRWLEVRNLA
RKTLCSMLNS LGSAYLAVVL KELRSTLKKG FQMHVLCYTV HMLLEITCSS ENFLENTLDA
ASLRDILEIC HGELFGSVAE EKRAKEVLRS VWEAKAVKVF NTYAIIGRFV SKHNLFDVLD
PLKEVLKQSP NHEMLKKISN CLKQFASGVS KNTNLTVETL MVFCYEVLND GLQSLCKNDR
KEQQQQQIEK GKSLRPAPSF LFPPEPGRRG FAVNVSKRTN MHLLISFALD VLLSLLRAGK
LSAVETESLS LLDPFVDNLI LCLNAAYPKI ISVALRCLTS IVKYPLLSLE KNIDSLVKTV
FTLLHNYACL GVACRGDSRD VLRNCFKLIT AVIVYSGGKA VNGEQMDILL RYSEEDLADT
QKQATAFSVV KAIFAKRHYS GTVEAILSKL QNLSITSGLE HVRVQCRQAI GIYFKFNYVK
NKKKNKKQQL FVKAVEFFCV QLNYAEESGR VSALEMLEML LSMAKSNLLP KIDMVVFLHL
SARLQIDQSE RCRNIIALAI KKLISRVSKE KFAEMFNVTL DWLGEDEIAL KQTGALCLSL
LFDENRTELV EKVPIAMQII FSQFQKYLPA LTADDDDDDQ KKKSGCLNML QKSTSDDRCI
DHFLFALLNL CQKIIKAVPL DDDRFTCKWL PFRETLIEIL LKTLLYPHVW VRTSASQFLG
CCLLFIVNNN ECKSVDESDH WNNVLFEFCE KLFEQLKIHH NNEELIEQSV KNLVFIAKQT
PDYNIPQSDQ PKQLKKATGR RTFCLSWMIR RLIRLCRMEL VKNAKLSLLR SFVFKWIAAV
ALQCNDTTLR NNLHTFLPPL YRELTIANND ENLKILSTSV CEIVQNRVGL DAFSTEWNRC
QNFAAAKRLE RKRRLATEVL SDPVEAAKRR IKRQKLKIVA KKKMRTTLSN KPYSSNNEDF
DEMLVEHETF E
//