ID R4GD49_ANOCA Unreviewed; 661 AA.
AC R4GD49;
DT 26-JUN-2013, integrated into UniProtKB/TrEMBL.
DT 29-SEP-2021, sequence version 2.
DT 27-MAR-2024, entry version 48.
DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSACAP00000023291.2};
OS Anolis carolinensis (Green anole) (American chameleon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Lepidosauria; Squamata; Bifurcata; Unidentata; Episquamata; Toxicofera;
OC Iguania; Dactyloidae; Anolis.
OX NCBI_TaxID=28377 {ECO:0000313|Ensembl:ENSACAP00000023291.2, ECO:0000313|Proteomes:UP000001646};
RN [1] {ECO:0000313|Ensembl:ENSACAP00000023291.2}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JBL SC #1 {ECO:0000313|Ensembl:ENSACAP00000023291.2};
RG The Genome Sequencing Platform;
RA Di Palma F., Alfoldi J., Heiman D., Young S., Grabherr M., Johnson J.,
RA Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Anolis carolinensis (Green Anole Lizard).";
RL Submitted (DEC-2009) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSACAP00000023291.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the PRP38 family.
CC {ECO:0000256|ARBA:ARBA00006164}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; R4GD49; -.
DR STRING; 28377.ENSACAP00000023291; -.
DR Ensembl; ENSACAT00000030711.2; ENSACAP00000023291.2; ENSACAG00000028682.2.
DR eggNOG; KOG2888; Eukaryota.
DR GeneTree; ENSGT00730000111085; -.
DR HOGENOM; CLU_034151_1_2_1; -.
DR InParanoid; R4GD49; -.
DR Proteomes; UP000001646; Unplaced.
DR Bgee; ENSACAG00000028682; Expressed in forelimb bud and 13 other cell types or tissues.
DR GO; GO:0071011; C:precatalytic spliceosome; IBA:GO_Central.
DR GO; GO:0006397; P:mRNA processing; IEA:UniProtKB-KW.
DR GO; GO:0008380; P:RNA splicing; IEA:UniProtKB-KW.
DR InterPro; IPR005037; PRP38.
DR PANTHER; PTHR23142:SF2; PRE-MRNA-SPLICING FACTOR 38B; 1.
DR PANTHER; PTHR23142; UNCHARACTERIZED; 1.
DR Pfam; PF03371; PRP38; 1.
PE 3: Inferred from homology;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000001646};
KW Spliceosome {ECO:0000256|ARBA:ARBA00022728}.
FT REGION 117..163
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 349..661
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 127..141
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 349..374
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 375..399
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 400..444
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 459..538
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 539..571
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 572..607
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 608..623
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 624..661
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 661 AA; 77292 MW; E564D27D1E5BA4FE CRC64;
MATGSRESCV WVLWEHLPLL PLSFRRRVTA LPPSLASRAS ALRQPLEPFR GLAPPPFARA
PPRLVASGAE GPEPHFRASR HLVGGWRRSL RREPVFFPGR SGPLPASPMA NNCPVLGAGN
CHQPPPQQQQ PAGPPAGAVL SPPPLQLQSA GPKPAASGKQ GNVLPLWGNE KTMNLNPMIL
TNILSSPYFK VQLYELKTYH EVVDEIYFKV THVEPWEKGS RKTAGQTGMC GGVRGVGTGG
IVSTAFCLLY KLFTLKLTRK QVMGLITHTD SPYIRALGFM YIRYTQPPTD LWDWFESFLD
DEEDLDVKAG GGCVMTIGEM LRSFLTKLEW FSTLFPRIPV PVQKNIDQQI KARPRKIKKD
GKEGVEEVDR HTERRRSRSP RRSLSPRRSP RRSRSRSRHR EGRGSSSFDR ELERERERQR
QEREAKERER RQSRSSDRTL ERRRSRSRER HRSRSRDRKG DRRERDRERE KENERSRKKE
RDYDKDRGIE RERERSRERD RSREKSKDRK SKSEIDERRH KDEKDERRHR EEKRDSKKER
RHSRSRSRDR KHRSRSVSRN AGKHSRSRSK EKLSKHKSES KEKSNKRSRS RSRGRTDSSE
KSRKRDRSPS KERSRKRSKS KERSHKHDHS DKDHSNKHNR RSQSTERENQ EKQSKNKDET
V
//