ID U5D5S3_AMBTC Unreviewed; 434 AA.
AC U5D5S3;
DT 11-DEC-2013, integrated into UniProtKB/TrEMBL.
DT 11-DEC-2013, sequence version 1.
DT 24-JAN-2024, entry version 39.
DE RecName: Full=G-patch domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=AMTR_s00059p00129260 {ECO:0000313|EMBL:ERN17560.1};
OS Amborella trichopoda.
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Amborellales; Amborellaceae; Amborella.
OX NCBI_TaxID=13333 {ECO:0000313|EMBL:ERN17560.1, ECO:0000313|Proteomes:UP000017836};
RN [1] {ECO:0000313|Proteomes:UP000017836}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24357323;
RG Amborella Genome Project;
RT "The Amborella genome and the evolution of flowering plants.";
RL Science 342:1241089-1241089(2013).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KI392312; ERN17560.1; -; Genomic_DNA.
DR AlphaFoldDB; U5D5S3; -.
DR STRING; 13333.U5D5S3; -.
DR EnsemblPlants; ERN17560; ERN17560; AMTR_s00059p00129260.
DR Gramene; ERN17560; ERN17560; AMTR_s00059p00129260.
DR eggNOG; KOG0965; Eukaryota.
DR HOGENOM; CLU_624709_0_0_1; -.
DR OMA; HQWKQCE; -.
DR Proteomes; UP000017836; Unassembled WGS sequence.
DR GO; GO:0005654; C:nucleoplasm; IBA:GO_Central.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR GO; GO:0006397; P:mRNA processing; IEA:UniProtKB-KW.
DR GO; GO:0008380; P:RNA splicing; IEA:UniProtKB-KW.
DR Gene3D; 1.10.10.790; Surp module; 1.
DR InterPro; IPR000467; G_patch_dom.
DR InterPro; IPR040169; SUGP1/2.
DR InterPro; IPR000061; Surp.
DR InterPro; IPR035967; SWAP/Surp_sf.
DR PANTHER; PTHR23340; ARGININE/SERINE RICH SPLICING FACTOR SF4/14; 1.
DR PANTHER; PTHR23340:SF0; SURP AND G-PATCH DOMAIN-CONTAINING PROTEIN 1-LIKE PROTEIN; 1.
DR Pfam; PF01585; G-patch; 1.
DR Pfam; PF01805; Surp; 1.
DR SMART; SM00443; G_patch; 1.
DR SMART; SM00648; SWAP; 1.
DR SUPFAM; SSF109905; Surp module (SWAP domain); 1.
DR PROSITE; PS50174; G_PATCH; 1.
DR PROSITE; PS50128; SURP; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000017836}.
FT DOMAIN 148..191
FT /note="SURP motif"
FT /evidence="ECO:0000259|PROSITE:PS50128"
FT DOMAIN 353..400
FT /note="G-patch"
FT /evidence="ECO:0000259|PROSITE:PS50174"
FT REGION 24..149
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 201..318
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 24..61
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 68..82
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 205..242
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 258..273
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 283..298
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 434 AA; 47461 MW; CB83A6EE928A7D8B CRC64;
MDKPGDPKLF VNDGSFMERF KQLQQAQLDK QGSASAESVK PPTFSGLPKN NMTNIVMSKK
VDTGSNDARR PILNSQNPGG KLAFSLKQKP KLVVASKLGV DDEDDDETGH SDGSSKKPKL
DNSIASSMSL DRGEFVPSPP SDPEVKKVAD KLASFVAKNG RHLENVTRQR NPGDTPFKFL
FDVDCADYKY YEYRIVEEEK ELAQTKASQA TSAPSSSSSG GFRPANTPPK ISSYQHTRYQ
TPASALYCGD DEPSSVGRTP SHGESSDQSQ TADPIAMMEY YAKKAAQEER KRPPKQSKDE
MPPPPSLQAP GKKGHHMGDY IPQEELEKFL SACNDAAAQK AAKEAAEKAK IQADNVGHRL
LSKMGWREGE GLGSHKSGRA DPVMAGNVKL NNLGVGAEQP GEVTPEDDIY EQYKKRMMLG
YRYRPNPLVS GSFI
//