ID A0A176VS48_MARPO Unreviewed; 876 AA.
AC A0A176VS48;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 22-FEB-2023, entry version 23.
DE RecName: Full=MIF4G domain-containing protein {ECO:0000259|SMART:SM00543};
GN ORFNames=AXG93_620s1100 {ECO:0000313|EMBL:OAE23243.1};
OS Marchantia polymorpha subsp. ruderalis.
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Marchantiophyta;
OC Marchantiopsida; Marchantiidae; Marchantiales; Marchantiaceae; Marchantia.
OX NCBI_TaxID=1480154 {ECO:0000313|EMBL:OAE23243.1, ECO:0000313|Proteomes:UP000077202};
RN [1] {ECO:0000313|EMBL:OAE23243.1, ECO:0000313|Proteomes:UP000077202}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Tak-1 and cv. Tak-2 {ECO:0000313|Proteomes:UP000077202};
RC TISSUE=Whole gametophyte {ECO:0000313|EMBL:OAE23243.1};
RA Honkanen S., Jones V.A., Morieri G., Champion C., Hetherington A.J.,
RA Kelly S., Saint-Marcoux D., Proust H., Prescott H., Dolan L.;
RT "Mechanisms controlling the formation of the plant cell surface in tip-
RT growing cells are functionally conserved among land plants.";
RL Submitted (MAR-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the NCBP1 family.
CC {ECO:0000256|ARBA:ARBA00007413}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OAE23243.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LVLJ01002901; OAE23243.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A176VS48; -.
DR OrthoDB; 5477544at2759; -.
DR Proteomes; UP000077202; Unassembled WGS sequence.
DR GO; GO:0005846; C:nuclear cap binding complex; IEA:InterPro.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0000339; F:RNA cap binding; IEA:InterPro.
DR GO; GO:0006406; P:mRNA export from nucleus; IEA:InterPro.
DR GO; GO:0006397; P:mRNA processing; IEA:UniProtKB-KW.
DR GO; GO:0008380; P:RNA splicing; IEA:UniProtKB-KW.
DR Gene3D; 1.25.40.180; -; 3.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR027159; CBP80.
DR InterPro; IPR015172; MIF4G-like_typ-1.
DR InterPro; IPR015174; MIF4G-like_typ-2.
DR InterPro; IPR003890; MIF4G-like_typ-3.
DR PANTHER; PTHR12412; CAP BINDING PROTEIN; 1.
DR PANTHER; PTHR12412:SF2; NUCLEAR CAP-BINDING PROTEIN SUBUNIT 1; 1.
DR Pfam; PF02854; MIF4G; 1.
DR Pfam; PF09088; MIF4G_like; 1.
DR Pfam; PF09090; MIF4G_like_2; 1.
DR SMART; SM00543; MIF4G; 1.
DR SUPFAM; SSF48371; ARM repeat; 3.
PE 3: Inferred from homology;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000077202}.
FT DOMAIN 6..231
FT /note="MIF4G"
FT /evidence="ECO:0000259|SMART:SM00543"
FT REGION 694..719
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 704..719
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 876 AA; 98823 MW; FFCDCCC7134DBC43 CRC64;
MSTSWHSLLV RIGERCAAYG GHADAEDHIE TCFNSLLRDL HHHGNEILAL LMQCVEELPH
KDPLYGTLVG LINLENTEFV EKVVTQAHDT LQAALDSGNC NQIRILMRFL TTLMSSNVIA
PGSLIEVFET LVSSAATTID EDGGNPSWQP RADFYVFCIL ASLPWGGLEL AERAPEDLER
VLAGVEAYLT LRKRNAGSAL SVFNIQDEGV EQATEQEDFL EDLWSRIQTL MENSWRVESV
PRPHLAFESR LVTGPSHEFG QVTCPDAPKL PTNPTDVIMG KKREEAELLY PRRVGRLRIF
PAAKTDETMA PIDRFVVEEY LLDVLFFLSG CRKECAAYMV GLPVPFRYEY LMAETVFSQL
LLLPAPPFKL IYYTVVMVDL CKALPGAFPA VVAGAVRSLF EKVGDMDVEC RTRLVQWLSH
HLSNFQFVWP WEEWAHVLEQ PKWSPQRVFA QEVLEKEVRL AYWERIKESL HSSPNLVELL
PPKLTAPIYK YDDPQAVSAE EHRIATELIM MTRGKRRARD IQVYIEEKIL PDYGQKMAIE
VAVQTFLYIG SKSFTHTVGI LEKYGTVLSK LAANNHSWQT VIIESVAQFW KNSAQMTSIV
IDRMMGYRIV SNLAIVAWVF SADNVNRFHT SHHVWEVLEN AINKTNNRTA DLRKDVASVE
KSVDAATAAL AKAVAKVEAA VALLETATDD ETRAGAKSKL EWATSAQKKA KDEESSSQES
LEVKEALLLR ALHEQEALFM AVYQNFATVL TQRLSVPLPS VEVVKTDTEA EHVEDPMAVD
QETTVGAEDG DNVEIDGERR NQKYSVKTNG VSTAEELEVQ EQLTWRKCTL GYLRAITRHY
ATEVWLVMDR IDSEIFTDTV DQLVIQTAYS GLRRSW
//