ID A0A0G0A1I2_TRIHA Unreviewed; 2636 AA.
AC A0A0G0A1I2;
DT 22-JUL-2015, integrated into UniProtKB/TrEMBL.
DT 22-JUL-2015, sequence version 1.
DT 27-MAR-2024, entry version 27.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:KKO99168.1};
GN ORFNames=THAR02_08725 {ECO:0000313|EMBL:KKO99168.1};
OS Trichoderma harzianum (Hypocrea lixii).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Hypocreomycetidae; Hypocreales; Hypocreaceae; Trichoderma.
OX NCBI_TaxID=5544 {ECO:0000313|EMBL:KKO99168.1, ECO:0000313|Proteomes:UP000034112};
RN [1] {ECO:0000313|Proteomes:UP000034112}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=T6776 {ECO:0000313|Proteomes:UP000034112};
RX PubMed=26067977; DOI=10.1128/genomeA.00647-15;
RA Baroncelli R., Piaggeschi G., Fiorini L., Bertolini E., Zapparata A.,
RA Pe M.E., Sarrocco S., Vannacci G.;
RT "Draft whole-genome sequence of the biocontrol agent Trichoderma harzianum
RT T6776.";
RL Genome Announc. 3:E0064715-E0064715(2015).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KKO99168.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JOKZ01000349; KKO99168.1; -; Genomic_DNA.
DR OMA; EGLMAMF; -.
DR OrthoDB; 5482709at2759; -.
DR Proteomes; UP000034112; Unassembled WGS sequence.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 3.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR046523; UTP20_C.
DR InterPro; IPR011430; UTP20_N.
DR PANTHER; PTHR17695:SF11; SMALL SUBUNIT PROCESSOME COMPONENT 20 HOMOLOG; 1.
DR PANTHER; PTHR17695; UNCHARACTERIZED; 1.
DR Pfam; PF20416; UTP20_C; 1.
DR Pfam; PF07539; UTP20_N; 1.
DR SUPFAM; SSF48371; ARM repeat; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000034112}.
FT DOMAIN 901..1511
FT /note="U3 small nucleolar RNA-associated protein 20 N-
FT terminal"
FT /evidence="ECO:0000259|Pfam:PF07539"
FT DOMAIN 1720..1938
FT /note="U3 small nucleolar RNA-associated protein 20 C-
FT terminal"
FT /evidence="ECO:0000259|Pfam:PF20416"
FT REGION 1..25
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2443..2465
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2589..2636
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 7..25
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2589..2611
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2618..2636
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2636 AA; 297069 MW; 59A9075C0CD0E4EE CRC64;
MPARSSGRIE KARKYKNSTP HQKNHRWESF STKIAKFNSL QPLRKVRRHD LESEDLSSTT
SYLQNGLQKW SELNISQPFS SFKREVLPLC DSLPQILHFE ERIMASLAKY ISTQDKEGLE
PLLDLLTAFA HDLGIRFEKY FSQSLDLILA IAGRSQPVEV IEWTFGALAF LFKYLSKLIV
PDLRPTYAVM APLLGKSKHP PFIARFGAEA MSFLVRKAAA PSLRETALVS FVDYVRDDIC
KMVDDRQFML YKDGIMTMFA EAMKGTDRTI HSTGSAIFIE LMNAIPKEEC TLAEITTWTD
LVCGVLTSVI HHTEIQTFRE FEEAIFDSCD AKLKEAHGDD SQWWMVPYIR VLGVLAGVRK
GGRITDWTRL VQQLTAFLST ATRPIDETVE QNELLWALVV VNVAITCHHA PMDSLIPHIS
RLLQSLTREP FMRLFIPFCS YFSELDARRF GSLFRSDFQK FIATHWSQGQ NEDMLCVLLP
QIIENRGFPS AGEKDCCKLP QAWQDQIVSK FENLEISPFP ERGPYNKDPQ VWRDRCLPKY
AALLRILELT AVHPSTNARI AELLLRKLKL ALRPSSTLAS DEVNFIVSQG FHAYLGMSKT
AGSTDPQLAP LLRAAVPRFA RSVGFLRTYL AYEQTFPHGH EVDKHEASSG GDTSVAEDDP
VIQSLVENLS SPSHDIRLAS LELLNKMSNT PEQSQCVEIM LQIEQSPLNL EHTRSIAMYL
RKLGQEYGSL TDTTWLKQGI PNFMFGMLTV NLSPVWDDSI EAMKQVAENK QGEDAIALTA
FRWLDVSSPR WTLPQSETGS SRAFYSDFEC TTLNALEASG KSVQEASGDS KNVMLRTFDD
SQRLVEPFST NAKAQALKVF KALPSIAERR SRQLVPHLFA WAREEEDVHA EEDAQLQAVF
WSLADRKGLV GVFAQFINPR VLFQHEQVYD IMLQLLGNGD LDLQKLALKA ILAWKQDGVK
AYQENLEYLL DEARFKNEIT VFLQGDSVIK YEHRADLMPV LLHLLYGRAI SRKGGVGGRH
GQQTTRLAII RNLSIEDLGG FLGIAVGKLK DVRVVGPAVN RKKLFDEEII SIRKQFGFLN
MALSLISELG TNVSPYMETL VNTILYCLIF SCRRLGGVDT EVDPEAPEEE EKASTHSLLK
STRSTGIKCL IALFQNAQDF QWEPYQDIII EELVAPRHEN LPAETAQGVS GMLQLLSTWS
NLPKAALFLA PHSNALPHGA LPKIISCIAF EKSKESVKIY VLEMLRSLAR LALASASESE
FNEVIKAELL DSNSTAILDN ITTVLRIGDI SNALLETGIE TILSLAPVFQ TLEDVQAVIR
ISSFLLQQPA RRVNPKLKGR ILLVIENFAS LPDAAQDETL LQEVYKTVSS LFSYFKDREN
RSSLCRVLLA ISKQDAEISN VANLCYELNL FKEGRIDEPD YDRRLAAFSS ISSSHDEPWS
PRQWLPVLHN LIFFIRMDEE FGVLSSNAAD GIRRLVQEAS DCQSQETKAV FDEYLKQILM
PSIYGGAREE SETVRREYLR VLGFILTTMP EWAPVADLGG LLNERQEDST EPSFFFDILS
PATAKQMDAL RTLEAANASK EMGSQNLTQF FIPLLEHFIF GRADGGDDHG LGAQAASTIS
NLAMSLNWNH FRTTFHRYIG YIESRQEQQK HTIRLLGKFT DALLLSLPES QDPETMEVDQ
EESSSSAVRR LRLMAPKPAE LATQIMDYFL PPLTKHLHQK DESEVSYRVP VGITIVKLLK
LLPGPQMDQK LAGVLTDICH ILRSKAWEAR EMTRDTLVKI AVLLGPSFFG FILKELRGAL
TKGYQLHVLS YTVHSILVAT VPHFAPGDLN PCLSSIVTVI MDDIFGVVGQ EKDAEGYVSQ
MKEVKSSKSQ DSMELIAKNA SISRLIELVR PLQALLLQKV DLKIVRKIDA LLARITAGLL
QNPATESRDT LVFCYEVIQE VYNARNPEAE PKLDWKTRRY LVQRGAKKSD RGQTSKHTYK
LTRFAFEMLR SILKKYDSIR IPENISGFIP IIGDAILEGE DEVKISAFRL LAVIVKVPFE
TADGANIYKI AVKEATRCIS MSTSTTTDLA QAALKMLAVV LRDRRDIEVK DAAIDMLLGK
LKDDFTEPLY RHITFNFLRS VLDRRIETAT VYDTLDQVGT VMITNDDKDT RDLARGAFFQ
FIRDYPQMRS RWTKQLNFVV ANLKYDREGG RISVMEVIHL LLMKSSSDFI QEIAATCFLP
LFLVLANDDS EKCRLAAEAL IKEIFRKADK ERVISFLNLL RSWLGKDGNA TVVKLAVQLF
GYYFECHEDA VKNQTDFKLI FDKVMTILGS EDVRDVDGGL VGAAIGVVRI FVTSFPEKTL
SANREELWAK LPHCLGHSET SVKLAAIQVI SLYLADFAQR GAGTSAGEDV EGSHGLTLRT
GNVEELVRLA LRALNGTEVN EGLANELGQV LIFLGPRLPI GDSPDSSGAE SEADELDAED
KEEPSAKPKD LQYLFWRLSH ILRKEIRPNA VAIIPKVVAM EVLETICRRS ALERLQPSLK
TILTPLHNLT DPSIPAPFSM DEVFKTKHEG LKTRAQIMMD SLQKKFGTAE YSKQLMSIRD
EVKARRLQRS SKRKIEAVAQ PEKYSKDKRK KFEKNRERKK TRSKEQKVMR QAYKTW
//