ID F2UB75_SALR5 Unreviewed; 2297 AA.
AC F2UB75;
DT 31-MAY-2011, integrated into UniProtKB/TrEMBL.
DT 31-MAY-2011, sequence version 1.
DT 24-JAN-2024, entry version 45.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EGD74087.1};
GN ORFNames=PTSG_12358 {ECO:0000313|EMBL:EGD74087.1};
OS Salpingoeca rosetta (strain ATCC 50818 / BSB-021).
OC Eukaryota; Choanoflagellata; Craspedida; Salpingoecidae; Salpingoeca.
OX NCBI_TaxID=946362 {ECO:0000313|Proteomes:UP000007799};
RN [1] {ECO:0000313|Proteomes:UP000007799}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 50818 {ECO:0000313|Proteomes:UP000007799};
RA Russ C., Cuomo C., Burger G., Gray M.W., Holland P.W.H., King N.,
RA Lang F.B.F., Roger A.J., Ruiz-Trillo I., Young S.K., Zeng Q., Gargeya S.,
RA Alvarado L., Berlin A., Chapman S.B., Chen Z., Freedman E., Gellesch M.,
RA Goldberg J., Griggs A., Gujja S., Heilman E., Heiman D., Howarth C.,
RA Mehta T., Neiman D., Pearson M., Roberts A., Saif S., Shea T., Shenoy N.,
RA Sisk P., Stolte C., Sykes S., White J., Yandava C., Haas B., Nusbaum C.,
RA Birren B.;
RT "Annotation of Salpingoeca rosetta.";
RL Submitted (AUG-2009) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL832967; EGD74087.1; -; Genomic_DNA.
DR RefSeq; XP_004993649.1; XM_004993592.1.
DR EnsemblProtists; EGD74087; EGD74087; PTSG_12358.
DR GeneID; 16074225; -.
DR KEGG; sre:PTSG_12358; -.
DR OMA; CFYMLRH; -.
DR OrthoDB; 5482709at2759; -.
DR Proteomes; UP000007799; Unassembled WGS sequence.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 2.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR046523; UTP20_C.
DR InterPro; IPR011430; UTP20_N.
DR PANTHER; PTHR17695:SF11; SMALL SUBUNIT PROCESSOME COMPONENT 20 HOMOLOG; 1.
DR PANTHER; PTHR17695; UNCHARACTERIZED; 1.
DR Pfam; PF20416; UTP20_C; 1.
DR Pfam; PF07539; UTP20_N; 1.
DR SUPFAM; SSF48371; ARM repeat; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000007799}.
FT DOMAIN 391..1005
FT /note="U3 small nucleolar RNA-associated protein 20 N-
FT terminal"
FT /evidence="ECO:0000259|Pfam:PF07539"
FT DOMAIN 1328..1544
FT /note="U3 small nucleolar RNA-associated protein 20 C-
FT terminal"
FT /evidence="ECO:0000259|Pfam:PF20416"
FT REGION 242..363
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1200..1291
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2069..2101
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2239..2297
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 243..259
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 260..282
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 293..307
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 319..355
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1200..1220
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1233..1249
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1250..1291
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2075..2101
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2297 AA; 259918 MW; 389F5B4EAE3B2763 CRC64;
MHLLRTAEAL PLTLHAHRDK RLQIMRASSG KYLSRTPDML LRLTMAYLLG LHHVNLTLLW
ADAKKALGEF ARYRPLVLWT LFEDKFMRVL DASVEMGEIM TEQARTIGQQ QVTDSAHRIV
DDRLDVLSFR CLVHPLEHRI NLLQKPDWDN NMTQVADRFV NAVCPQPRVD VLKLMQTFAE
LANTWPLLTE QKHRTLVEKF LQLHDDYLVV EEGAGKLQDI SVANAGLAGI SMQSLGIEDL
GGDLDNLSDD ESVSNDDDDD DNDERRRKRS AMQHDGRGGR GGRGGRGGRG GRGGRGGRGR
GRGGRGGRGG RGGRGGRGGR GRGGRDSKRP KRENDTFNDK VDDDDTKRDG TDATNDGDDA
MQAYDPTAAA RRRVFWRRRQ MVRMVRKRRK VALSQMEAML SVLASFNNGK ALMEWERLQA
IVHNFLHHSD ANIRANALKA SASFSRVVRK YLEPMMAVNT EMGPQLTERL KTFNVASPAV
VAPEHVEEVT TVFFRLLQAK MFARKGKSQK YSPKFQRALI LRYLADCRTE GGEGFVLDAF
LHLLVLPFAQ SPTPPVSVAP GAAVDVDALL AEFNVGTDVH AAVPLNRQVG FLMLAESLLK
VFGTRLQPHL PMMVAIMVQM AREAHALLAR RDEVAPRYIR ELRMIRKLSI RRLCSLYNFF
NAKLLEPFAS AVHAHVIGPQ VASLYHECSN AVVPILQLLI TWSEDPNMHR LFNISGSAVL
SRVFEVLSHP RIAFPVEREV IGMADNLLPG TAGRMSKVSE EDKDSSVMEP HIPKLVSSLY
ALMIRQAAEA RDRNSIARFS WVKLSVLARI APFATEEKDA KLLAQLLVPY LEVHTRRVSE
NRKALILSVL ADSFRKFGDS LQYAGVMCTL FGWLATDNAR LALCKTVDAM ANADATELKP
IVDILYDLNA TAKGKLGVSF DFVKRLSALE RVRNAVDEFT RLQLEALIEA VTNLIFLEDF
SLRTGAVETI MVMINSIMGR DNRRSLLHVI VKKRLLLVLK RAIRSDLDMT RAAALELLQF
ITANVSGDGI LADMVPLLPH KGEDETKCFF RGMRSVKQIE RSRALLRIAR TIGMKRVQHK
QQEDDNDVDG SGDADVSRPC TLRFSHRFTQ DTITGFLLPL ALRAIADGGD DADASFVDVA
IDAIGALAGQ MHWPLYLQTL RRHLSTVRRQ RVVEKRMVRA VVAVLDSFHF DTKAEVVAAE
EEGKDTHKKK KGQEVQVKVV KDEEDDDDDD DEEEGGRKKK RKTTTKERDG DEDDEDDEDD
EDKDEDDEDE DDDEDDGDNN DAADAGLDDD EDLLREIDEE DDYDPSAVHK SMVLFILPKL
KKMLTFKQEE RVVVRPIVAT AVVRLLMLLP KRTMHDHLPS TVSKVASVLR DRMDSIRSEA
RYTMVQLAKM LGPVYFDFIL REMQTALRRG YQLHVLGYTI HSILQTILPQ LTTGDLDSCL
EQLLPIFIDD IFGTVAEEKE AEEIPRKLLE MKAEKSYSSF QVVAQFLSPS KFHALMSPLQ
SLLAVTEVRR TVNKVEKILT FVCNGFLDNT AVSTPQLLEF ALTVVDQHIG LSQAGVKLGE
KVKRRDASML IVQPLKARTS RKTRTKTTYN TNAHVIVEFG LSILQGLLKG GVLDFEDTKQ
RDVVNRFAGP LASCLFSKYS KTLSMSAMLW SFILPHIAHM PAMSSKVIEK VFLRLMKLYE
QTTGNQETFT AVTNTLISLI RNATSFRPPQ HHIEKLLSSV HMDMDNPDRQ AIMFDVVRAA
IEREYKLPVV YDIIDRIQEI VVQSQSDTAR KRARETMLAF LKKCEVNSKR VLNHIMFLLT
NLAYKYEAGR KSVMQMLTEI VKNLPESMVV KHADLMFMTF AQHFGNTEES STRSAMQRLL
QLLLSRVPAA KRQEFFDMAV RWLKEEKLPV RSLGLRLLVV FIDQRPRAFK DQAPAILATL
ADIITTAEDL KVYLRDGEVE EQQPGGDRWF LLFQALTCTQ KMVDGVSGSI TLLAKKHADV
WPRLQSHLLY SHAWVRLAAG RLMNTFLQRT CSAEFVSDPE RVNELAQAFT TQVRSRNLGE
EMVEVLVSNL TKVAQLQMAT AARANQLFTT SHEQQQRQQE DDSSDVPRKK KAKKSVWLSD
THAEAKDHEA ADLVLHEGPF EQLMHELTGI AMREVLYREG NMQRRVVFAW YRHVARFLSD
AKKVAAVLPY MTRALFRTME RWDQPEDIQE LAKSTHDLIR GIVGAKRMSN AMSEEKHKVS
AKRAVRRQRE QQELITNPKK ALRRKARSHE LKRDARRRKN AAEKDARIGG MPINPVELKK
LHNKRREHNR KLNLMDE
//