ID R1CRX4_EMIHU Unreviewed; 709 AA.
AC R1CRX4;
DT 26-JUN-2013, integrated into UniProtKB/TrEMBL.
DT 26-JUN-2013, sequence version 1.
DT 27-MAR-2024, entry version 55.
DE RecName: Full=G-patch domain-containing protein {ECO:0000259|PROSITE:PS50174};
GN ORFNames=EMIHUDRAFT_107468 {ECO:0000313|EMBL:EOD05015.1};
OS Emiliania huxleyi (Coccolithophore) (Pontosphaera huxleyi).
OC Eukaryota; Haptista; Haptophyta; Prymnesiophyceae; Isochrysidales;
OC Noelaerhabdaceae; Emiliania.
OX NCBI_TaxID=2903 {ECO:0000313|EMBL:EOD05015.1};
RN [1] {ECO:0000313|EMBL:EOD05015.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=CCMP1516 {ECO:0000313|EMBL:EOD05015.1};
RG DOE Joint Genome Institute;
RA Read B., Kegel J., Klute M., Kuo A., Lefebvre S.C., Maumus F., Mayer C.,
RA Miller J., Allen A., Bidle K., Borodovsky M., Bowler C., Brownlee C.,
RA Claverie J.-M., Cock M., De Vargas C., Elias M., Frickenhaus S.,
RA Gladyshev V.N., Gonzalez K., Guda C., Hadaegh A., Herman E.,
RA Iglesias-Rodriguez D., Jones B., Lawson T., Leese F., Lin Y.-C.,
RA Lindquist E., Lobanov A., Lucas S., Malik S.-H.B., Marsh M.E., Mock T.,
RA Monier A., Moreau H., Mueller-Roeber B., Napier J., Ogata H., Parker M.,
RA Probert I., Quesneville H., Raines C., Rensing S., Riano-Pachon D.M.,
RA Richier S., Rokitta S., Salamov A., Sarno A.F., Schmutz J., Schroeder D.,
RA Shiraiwa Y., Soanes D.M., Valentin K., Van Der Giezen M., Van Der Peer Y.,
RA Vardi A., Verret F., Von Dassow P., Wheeler G., Williams B., Wilson W.,
RA Wolfe G., Wurch L.L., Young J., Dacks J.B., Delwiche C.F., Dyhrman S.,
RA Glockner G., John U., Richards T., Worden A.Z., Zhang X., Grigoriev I.V.;
RT "Genome variability drives Emilianias global distribution.";
RL Submitted (JUL-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000013827}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=CCMP1516 {ECO:0000313|Proteomes:UP000013827};
RX PubMed=23760476; DOI=10.1038/nature12221;
RA Read B.A., Kegel J., Klute M.J., Kuo A., Lefebvre S.C., Maumus F.,
RA Mayer C., Miller J., Monier A., Salamov A., Young J., Aguilar M.,
RA Claverie J.M., Frickenhaus S., Gonzalez K., Herman E.K., Lin Y.C.,
RA Napier J., Ogata H., Sarno A.F., Shmutz J., Schroeder D., de Vargas C.,
RA Verret F., von Dassow P., Valentin K., Van de Peer Y., Wheeler G.,
RA Dacks J.B., Delwiche C.F., Dyhrman S.T., Glockner G., John U., Richards T.,
RA Worden A.Z., Zhang X., Grigoriev I.V., Allen A.E., Bidle K., Borodovsky M.,
RA Bowler C., Brownlee C., Cock J.M., Elias M., Gladyshev V.N., Groth M.,
RA Guda C., Hadaegh A., Iglesias-Rodriguez M.D., Jenkins J., Jones B.M.,
RA Lawson T., Leese F., Lindquist E., Lobanov A., Lomsadze A., Malik S.B.,
RA Marsh M.E., Mackinder L., Mock T., Mueller-Roeber B., Pagarete A.,
RA Parker M., Probert I., Quesneville H., Raines C., Rensing S.A.,
RA Riano-Pachon D.M., Richier S., Rokitta S., Shiraiwa Y., Soanes D.M.,
RA van der Giezen M., Wahlund T.M., Williams B., Wilson W., Wolfe G.,
RA Wurch L.L.;
RT "Pan genome of the phytoplankton Emiliania underpins its global
RT distribution.";
RL Nature 499:209-213(2013).
RN [3] {ECO:0000313|EnsemblProtists:EOD05015}
RP IDENTIFICATION.
RG EnsemblProtists;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the TFP11/STIP family.
CC {ECO:0000256|ARBA:ARBA00010900}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KB870222; EOD05015.1; -; Genomic_DNA.
DR RefSeq; XP_005757444.1; XM_005757387.1.
DR STRING; 2903.R1CRX4; -.
DR PaxDb; 2903-EOD05015; -.
DR EnsemblProtists; EOD05015; EOD05015; EMIHUDRAFT_107468.
DR GeneID; 17251092; -.
DR KEGG; ehx:EMIHUDRAFT_107468; -.
DR eggNOG; KOG2184; Eukaryota.
DR HOGENOM; CLU_389553_0_0_1; -.
DR Proteomes; UP000013827; Unassembled WGS sequence.
DR GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0000390; P:spliceosomal complex disassembly; IEA:InterPro.
DR InterPro; IPR000467; G_patch_dom.
DR InterPro; IPR022783; GCFC_dom.
DR InterPro; IPR022159; STIP/TFIP11_N.
DR InterPro; IPR045211; TFP11/STIP/Ntr1.
DR PANTHER; PTHR23329:SF1; TUFTELIN-INTERACTING PROTEIN 11; 1.
DR PANTHER; PTHR23329; TUFTELIN-INTERACTING PROTEIN 11-RELATED; 1.
DR Pfam; PF01585; G-patch; 1.
DR Pfam; PF07842; GCFC; 1.
DR Pfam; PF12457; TIP_N; 1.
DR SMART; SM00443; G_patch; 1.
DR PROSITE; PS50174; G_PATCH; 1.
PE 3: Inferred from homology;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Reference proteome {ECO:0000313|Proteomes:UP000013827};
KW Spliceosome {ECO:0000256|ARBA:ARBA00022728}.
FT DOMAIN 128..174
FT /note="G-patch"
FT /evidence="ECO:0000259|PROSITE:PS50174"
FT REGION 1..116
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 181..226
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 370..395
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 685..709
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 11..25
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 50..64
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 73..92
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 209..226
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 709 AA; 73616 MW; 8AB8F982CFA53F83 CRC64;
MSSSDDDAPA GLGARGRKRS RKASDDAMLG VFGDSSEDEG GGGGGGRGGG RGGRERESKR
PLAEKPVSFA KAAAPSAPPP PAPAPAAVAP ADKAPPSRPS RGGLGSGGAK PLPAEKVDKE
FGAFEAHSKG IGSRLLSKMG WKPGQGIGKA GGGVVNPLEQ KLRPNSMGLG YGGFKETTSK
AKAQQARILH GDGAADPPPS GAELDRANAA AAPRRDNWRR GERRELKVRS AQQLMAEWEG
KARPQRGAPA EAGAAVVDMR GPQPRVHASL ADADAAAAAA GEEAEAAAGM LPELRHNMRM
LVELSEVRMQ QTHRKLRAER DAMRSYSARR DAHAAAARQT ATRLEAIEEV QAALRRCVAG
AKAVRQAQLS SAAPGGGGGG GGGGGGGGGG GGGAASSAAE ASLAAWAEEW GKLRRAHPEE
WSVHKLHEAL ANEWRAADAT SATALLSAWR ALLPAALFGE VLAGQVLPRL VGEVGAWSPS
LGADSAPLHA SLRPWERLLP PPGLKPLFPA IRQKLLALLL RTFFPRWPAA DPDWGEVSQW
YCGWKALLLR PAPASPSTAG GPNEPAIRGA VFMPLGVRSE GRAVFSFGGV QVVLDSDKKL
VYARQLGGGG GGGGGGGGFR PISLKELVAL AGGWRAARAD DWTYPNLPGP TQTYPDVFTP
YPDLPGPTQT YSPRTQTYPD LPIHTPYPDL PRPTQTYSHP VGMQSVQSG
//