Database: UniProt
Entry: R7TG61_CAPTE
ID   R7TG61_CAPTE            Unreviewed;       986 AA.
AC   R7TG61;
DT   24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT   24-JUL-2013, sequence version 1.
DT   27-MAR-2024, entry version 44.
DE   RecName: Full=MI domain-containing protein {ECO:0000259|PROSITE:PS51366};
GN   ORFNames=CAPTEDRAFT_167187 {ECO:0000313|EMBL:ELT90541.1};
OS   Capitella teleta (Polychaete worm).
OC   Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Annelida; Polychaeta;
OC   Sedentaria; Scolecida; Capitellidae; Capitella.
OX   NCBI_TaxID=283909 {ECO:0000313|EMBL:ELT90541.1};
RN   [1] {ECO:0000313|Proteomes:UP000014760}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=I ESC-2004 {ECO:0000313|Proteomes:UP000014760};
RA   Hellsten U., Grimwood J., Chapman J.A., Shapiro H., Aerts A., Otillar R.P.,
RA   Terry A.Y., Boore J.L., Simakov O., Marletaz F., Cho S.-J.,
RA   Edsinger-Gonzales E., Havlak P., Kuo D.-H., Larsson T., Lv J., Arendt D.,
RA   Savage R., Osoegawa K., de Jong P., Lindberg D.R., Seaver E.C.,
RA   Weisblat D.A., Putnam N.H., Grigoriev I.V., Rokhsar D.S.;
RL   Submitted (DEC-2012) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EMBL:ELT90541.1, ECO:0000313|Proteomes:UP000014760}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=I ESC-2004 {ECO:0000313|EMBL:ELT90541.1,
RC   ECO:0000313|Proteomes:UP000014760};
RX   PubMed=23254933; DOI=10.1038/nature11696;
RA   Simakov O., Marletaz F., Cho S.J., Edsinger-Gonzales E., Havlak P.,
RA   Hellsten U., Kuo D.H., Larsson T., Lv J., Arendt D., Savage R.,
RA   Osoegawa K., de Jong P., Grimwood J., Chapman J.A., Shapiro H., Aerts A.,
RA   Otillar R.P., Terry A.Y., Boore J.L., Grigoriev I.V., Lindberg D.R.,
RA   Seaver E.C., Weisblat D.A., Putnam N.H., Rokhsar D.S.;
RT   "Insights into bilaterian evolution from three spiralian genomes.";
RL   Nature 493:526-531(2013).
RN   [3] {ECO:0000313|EnsemblMetazoa:CapteP167187}
RP   IDENTIFICATION.
RG   EnsemblMetazoa;
RL   Submitted (JUN-2015) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Nucleus speckle {ECO:0000256|ARBA:ARBA00004324}.
CC   -!- SIMILARITY: Belongs to the CWC22 family.
CC       {ECO:0000256|ARBA:ARBA00006856}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AMQN01014286; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; KB310862; ELT90541.1; -; Genomic_DNA.
DR   AlphaFoldDB; R7TG61; -.
DR   STRING; 283909.R7TG61; -.
DR   EnsemblMetazoa; CapteT167187; CapteP167187; CapteG167187.
DR   HOGENOM; CLU_006308_1_0_1; -.
DR   OMA; MINQRIV; -.
DR   Proteomes; UP000014760; Unassembled WGS sequence.
DR   GO; GO:0016607; C:nuclear speck; IEA:UniProtKB-SubCell.
DR   GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR   Gene3D; 1.25.40.180; -; 1.
DR   InterPro; IPR016024; ARM-type_fold.
DR   InterPro; IPR003891; Initiation_fac_eIF4g_MI.
DR   InterPro; IPR003890; MIF4G-like_typ-3.
DR   PANTHER; PTHR18034; CELL CYCLE CONTROL PROTEIN CWF22-RELATED; 1.
DR   PANTHER; PTHR18034:SF3; PRE-MRNA-SPLICING FACTOR CWC22 HOMOLOG; 1.
DR   Pfam; PF02847; MA3; 1.
DR   Pfam; PF02854; MIF4G; 1.
DR   SMART; SM00544; MA3; 1.
DR   SMART; SM00543; MIF4G; 1.
DR   SUPFAM; SSF48371; ARM repeat; 1.
DR   PROSITE; PS51366; MI; 1.
PE   3: Inferred from homology;
KW   Reference proteome {ECO:0000313|Proteomes:UP000014760}.
FT   DOMAIN          484..600
FT                   /note="MI"
FT                   /evidence="ECO:0000259|PROSITE:PS51366"
FT   REGION          1..138
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          435..473
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          679..986
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1..27
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        38..138
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        452..466
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        690..749
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        750..839
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        840..855
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        867..976
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   986 AA;  113533 MW;  1D56DF312F40FEDC CRC64;
     MSDSEIRRGS FESESESESE DDALKPSLRS HVVSQADSVN NKRKRRESES PVRNGHSDDG
     GSDGERVQRR NWRENERGRD DRGTFPRFRD HRRDDRRYSS HRYSDRGRDG SRRDDHRDER
     KRKAEGDEGV DENADPKLEA EKLAKRAKKN DDVNTRTGGA YIPPARLRQM QAEITDKGSQ
     PYQRIAWEAL KKSLNGLINK VNVSNIKDIV HELFQENIVR GRGVLVRSIM QAQSASPTFT
     HVYAALVSII NTRFPQIGEL LLKRLVIQFR KGFRRNNKDL CLSSVKFVAH LVNQQVAHEI
     IALEILTLLL ETPSDDSVEV AVGFMKEVGL KMGEVSPRGM HAIFERMRTV LHEGEIDKRV
     QYMVEVMFAV RKDGFKDHPA VLPELDLVDE DDQFTHMLTL EDAVDPEDML NIFKFDPDYE
     AVEEKYKTLK REILEADSSS DEDGSGSGSG SSDSDDEEEG EGEGEEAAAG DSNIIDATET
     NLVALRRVIY LTIQSSLDFE ECVHKMLKID LKPGQEVELC NMILDCCAQQ RTYEKFFGLM
     AQRFCMLDKK YVEPYQKIFI EQYESIHRLE ANKLRNVSKF FAHLLFTDAI SWEVMESIKL
     NEDDTTSSSR IFIKILFQEL SEYMGLPKLN DRLKDETLQT HFEGLLPRDN PRNTRFAINF
     FTSIGLGGLT DDLREHLRTM PKPAPVPQPE EEESSSSSSS SSSSSSSSSS SDSSSDSSDS
     DSSSSSSSSS SSSSSSSSSS SSSSSSSEEE EEQPAPKRES PRGRHEQVAE RERSRQHESG
     GDSRQRVREE RRSRERPRAE RSGREQHQAE KDAINESLQK RQRERGSRAR DEELMREIQK
     ESGSSSSSAD SSSDSSSSDD SEEEQTKEGE QERRRPAREE EEDRRRRPAR EEEEDRRRRP
     AREEEEDRRR RPAREEEEDR RRRPAREERQ SEERQDSRRR RRASSSGDER QGRSQYEERD
     RRNRDRQQQL RRSRSRSGDR RRDRRR
//