ID A0A0F8B4R1_CERFI Unreviewed; 2699 AA.
AC A0A0F8B4R1;
DT 22-JUL-2015, integrated into UniProtKB/TrEMBL.
DT 22-JUL-2015, sequence version 1.
DT 24-JAN-2024, entry version 25.
DE SubName: Full=U3 small nucleolar RNA-associated protein 20 {ECO:0000313|EMBL:KKF95355.1};
GN Name=utp20 {ECO:0000313|EMBL:KKF95355.1};
GN ORFNames=CFO_g2296 {ECO:0000313|EMBL:KKF95355.1};
OS Ceratocystis fimbriata f. sp. platani.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Hypocreomycetidae; Microascales; Ceratocystidaceae; Ceratocystis.
OX NCBI_TaxID=88771 {ECO:0000313|EMBL:KKF95355.1, ECO:0000313|Proteomes:UP000034841};
RN [1] {ECO:0000313|EMBL:KKF95355.1, ECO:0000313|Proteomes:UP000034841}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CFO {ECO:0000313|EMBL:KKF95355.1,
RC ECO:0000313|Proteomes:UP000034841};
RA Belbahri L.;
RT "Genome sequence of Ceratocystis platani, a major pathogen of plane
RT trees.";
RL Submitted (APR-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KKF95355.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LBBL01000100; KKF95355.1; -; Genomic_DNA.
DR Proteomes; UP000034841; Unassembled WGS sequence.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 2.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR046523; UTP20_C.
DR InterPro; IPR011430; UTP20_N.
DR PANTHER; PTHR17695:SF11; SMALL SUBUNIT PROCESSOME COMPONENT 20 HOMOLOG; 1.
DR PANTHER; PTHR17695; UNCHARACTERIZED; 1.
DR Pfam; PF20416; UTP20_C; 1.
DR Pfam; PF07539; UTP20_N; 1.
DR SUPFAM; SSF48371; ARM repeat; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000034841}.
FT DOMAIN 904..1511
FT /note="U3 small nucleolar RNA-associated protein 20 N-
FT terminal"
FT /evidence="ECO:0000259|Pfam:PF07539"
FT DOMAIN 1743..1961
FT /note="U3 small nucleolar RNA-associated protein 20 C-
FT terminal"
FT /evidence="ECO:0000259|Pfam:PF20416"
FT REGION 1..25
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2465..2527
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2654..2699
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2465..2486
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2487..2504
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2654..2670
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2671..2686
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2699 AA; 303733 MW; 64B8894A5DAB8A14 CRC64;
MPPPSGRILK KKAKSSTPHQ KNHRWESFTS KISKLHSLDP IKKVRRYDLD ADDDAAEVSY
FSTALHKWSE MNISRSFVSF RREAMPLCET LVQILHFEDR IFGLLSKHIS ANDKESLEPL
LDLLTSFAHD LGARFEKYYA QSLELIVGVA SQPQDVDVIE WTFGALAFLF KYLSKLIVPD
MRPTYDVLSP LLGKTKVPPH IARFAAEALS FLVKKAASPS TRESSLQKLA QHVRKDLYAQ
RGERQFLLFK DGLMTMFAEA CKGNSETVHS NGAAVIISLL RSIPHDEFAL IEECIWTDVV
CGVTTSIMHH SNAAAMAETV TAICEYGNAQ IENQKPASVP WRLLPIIRVY GTLMGVRKGS
RISDWAPLVK SMIQALETIS KTSPTIAISE DEHAPFWNYV MFNIALMWQL APIDSLITSI
TPFTNILNRD PFTRWYIQFC SLFAELDSVR FKSLFLNHFQ SFIATCYADL DNEELLCLLL
PRLHELNVIS SPTDKEGFGI PQGWQAQIVN KFEKLEIYPF PERGAYDKDP RTWRDKCLPK
YASLLHVLES CNVHPSTNAR IAELMLRKLK LSLRPTTTLE SDEVHFIVSR GFHAYLRMVR
GAGSVDASLQ PLLRAALPRF GRSIGFLESM LSYEQHLTAQ EPNPEKIERA SESPIDIETA
VMPLVANLAS PSHELRYNSL LLLRVMFKGQ GQEIHECVDT MVGIEHTPLE LASTRHIGML
MRKLGISFAS IEVPWLQKAV ASFIFGMLTV KLAPVWDYAI EAIQKVTESK VGEEHVSALA
FDWLTVPSPR WTGCATQTTE YGKRYMTDFE CTNVDKTRAV STKVFETVSD AVGKLLAAFD
EKQSIVEDYC DQARGRALKA FAAVPLIAES KSRLFIPHFL SWSVNDEEAA KNMAEETDHQ
VQKSWSLSDR KAMLKVIEQF INPCVLFQHK KVYDALLSLL ENGDVDIQRL ALKGILTWKQ
DGVKPYAERL ETLLDDSKFK NELTLMLQGE QEIHPSHRAE LMPVLLRLLY GRTISKKGIA
SGRHGLQATR LAVLRNLSVA DLGAFIEISL GALRGVKILD GNGQLKTEIV DQEIIPVRQM
FGFLNMASSL ISELGSSVLP YTETLSNSVL YCLFFCVRRL RGIAEEDQDV EELEEQASNN
SLLRSARTAG LKCLNLLFRN ASMFEWTAYA APMVAELITP RLEKLPIESA QGVSNVLQLV
ATWSTLPKAA MFMAIDAQLL PKMMAILAVE KAKDDVKVFS LRVVRNLVKI VELPAAESEF
NELVRTELLE TNMNCVLDHI TIVLEAAASS HELLEACVDT IVEISPYVDA RQNVRSVLDI
STRLLGQPPR KVNPRTKGKI LLVLEKFVAI EGPRLVDSSE LSTKIRDALA SLFSYFKDHE
NRDSLCRVLA VLAKHDKSLA ESSHLCTELN AYADGLSDEP AYDRRLAAYS AIYSARDVPM
TPQQWLPILH NLVFFLKFDE EYGILSSNSA DGLRRFVEDV AGVKDPETRL AYEAELGDIV
LPAMYSGARE PAETVRRECL RVMGHMLVHI PEWEPISDMK GLRKMEEDAS EPVFFFNILS
PSTAKQLEAF DILRAANAEC EISSANLSMF FIPLLEHFIF GRSEGLDPAM GSNATTVLSD
LALSLEFRHY RSILLRYIGY IESKPDFQKQ NIRLMGRFAN SLMVAWKQRV AVSVVGESDV
NNAADTDVVL EMETDTTSET KTEAETQKAH TQRLQLTIPE QEKLNGDIIN NFLPPMMKHL
HEKDESEVSY RVPVGVIVVR LLVILPESEM SSRLASVLTD ICHILRSKAW ESREMARDTL
VKIAVVLGAK YFGFILNELK GALQRGYQLH VLSYTMHSLL LATIPEFGQG SLDYCLSTMV
TIIMDDIFGV IGQEKDADGY TTTMKEIKSS KSQDSMELIS KSASITHLSD LVAPMQQMLL
QKVDLKIVRK IDALLARISA GLIQNPAAES RDSLIFCYEV IKEVYRSQKV EKQPKIDPKL
RRYLIVKGAK KSDRAATSRH TYKLTRFAID LLRAVTKRFD SLRNSTNLAG FVPILGDSIV
DGEEEVKIAV FKLLVTISKV PFSTPNAAGV YKVAVKEATK TISQSHSTTT DAAQAALKFL
SIVLRDRPDI EVRPAAIDVL AEKLKDDLTN TLYRHVTFNF LRAVLDRKIE TAAIYDTLDY
VGTIMITNED KDTRDLARGA FFQFLRDYPQ KKARWTKQMN FIVSNLQYER EGGRISVMEI
MHLLLQKSAD DFVQEVAGTC FLPLVMVVAN DDSEKCRLAA VELVKQIFRR ANTEGTSTFL
GLIRSWASNV ENRSVFSLAL KTMGLYFDAA PTAPENKKDL KLALKLIVDN LPSDATSEAS
DADLPNVVID VVRILTTRVP QTLLSPACAP LWEGISRWMV HPEHSVKLNA TRLISTYLMD
FAANRRADSD VLNGSYGLQL GKEDIEQFAR LAMRILTTRE MEEELASEAG QMLIFLGQHL
AQVDGAHTRA EEEEEYEGRG GAEEKEDSVE QIEEDTDTAE AEAENEPEPT VTAAQLEEEP
EDENDSLDTK SFNLLFLFKK LSFVLRRGTA PRPLAMYPKV AAMEVLETLC RRVAPAALPE
YLKTVMFPLH HLTDPNIAAP FSIDEVFKAK HEGVRTRAQI LMDLLKKRVG IAEYTRTLLE
VRAEVRARRL QRQAKRKIEA VSMPEKHGRD KRKKFERKKV KRQQRGQDHK TMRQSYKGW
//