ID I3JPR7_ORENI Unreviewed; 436 AA.
AC I3JPR7;
DT 11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT 17-JUN-2020, sequence version 2.
DT 27-MAR-2024, entry version 67.
DE SubName: Full=HIV Tat-specific factor 1 homolog {ECO:0000313|Ensembl:ENSONIP00000010861.2};
GN Name=LOC100696985 {ECO:0000313|Ensembl:ENSONIP00000010861.2};
OS Oreochromis niloticus (Nile tilapia) (Tilapia nilotica).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC Pseudocrenilabrinae; Oreochromini; Oreochromis.
OX NCBI_TaxID=8128 {ECO:0000313|Ensembl:ENSONIP00000010861.2, ECO:0000313|Proteomes:UP000005207};
RN [1] {ECO:0000313|Proteomes:UP000005207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Broad Institute Genome Assembly Team;
RG Broad Institute Sequencing Platform;
RA Di Palma F., Johnson J., Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Oreochromis niloticus (Nile Tilapia).";
RL Submitted (JAN-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSONIP00000010861.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the HTATSF1 family.
CC {ECO:0000256|ARBA:ARBA00007747}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; I3JPR7; -.
DR STRING; 8128.ENSONIP00000010862; -.
DR Ensembl; ENSONIT00000010870.2; ENSONIP00000010861.2; ENSONIG00000008638.2.
DR eggNOG; KOG1548; Eukaryota.
DR GeneTree; ENSGT00390000009902; -.
DR TreeFam; TF313623; -.
DR Proteomes; UP000005207; Linkage group LG10.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IEA:InterPro.
DR CDD; cd12281; RRM1_TatSF1_like; 1.
DR CDD; cd12282; RRM2_TatSF1_like; 1.
DR Gene3D; 3.30.70.330; -; 2.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR000504; RRM_dom.
DR InterPro; IPR003954; RRM_dom_euk.
DR InterPro; IPR034393; TatSF1-like.
DR InterPro; IPR034392; TatSF1-like_RRM1.
DR PANTHER; PTHR15608:SF0; HIV TAT-SPECIFIC FACTOR 1; 1.
DR PANTHER; PTHR15608; SPLICING FACTOR U2AF-ASSOCIATED PROTEIN 2; 1.
DR Pfam; PF00076; RRM_1; 1.
DR SMART; SM00360; RRM; 2.
DR SMART; SM00361; RRM_1; 1.
DR SUPFAM; SSF54928; RNA-binding domain, RBD; 2.
DR PROSITE; PS50102; RRM; 1.
PE 3: Inferred from homology;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Reference proteome {ECO:0000313|Proteomes:UP000005207};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884, ECO:0000256|PROSITE-
KW ProRule:PRU00176}.
FT DOMAIN 262..347
FT /note="RRM"
FT /evidence="ECO:0000259|PROSITE:PS50102"
FT REGION 18..39
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 74..146
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 228..254
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 368..436
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 74..93
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 98..113
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 120..140
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 238..254
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 383..411
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 415..436
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 436 AA; 49663 MW; 779CB0067F6AD8D9 CRC64;
MSGESDPNKE FMEQLRLQEL YGQRNEDGSD PYTYTDPEDG TVYDWDHEKK AWFPKITEDF
MAAYQANYGF TQESALDGKN TSLTGTNPAA PGTDSKPPQK PPEKEKPEDP APNSASEQHE
TPGKETKQKG EKRKAEQGLP PDISSDEFAE LMSKCGIVMR DPITEEYKVK LYKDKEGNLK
GDGLCCYLKK ESVDLAIRLI DESEVRGYKL HVEAARFELK GQYDASKKKK KSKDYKKKLQ
QQQKQLDWRP EKKGDLRKRH EKVVIIRNMF HPSDFEEDPL VLNEYREDLR SECEKFGEVK
KVILFDRHPD GVASVAFKEP EQADACILSF NGRWFGGRQL SAQLWDGTTD YQVEETTRER
EERLKGWSQF LEGGDGEQQK DSKPAESSTA TEPSEPSSTT EPEQQPKSEP QPSEEQQEEE
VDSTDSSLAG SDDEEA
//