ID A0A1A6HD85_NEOLE Unreviewed; 1804 AA.
AC A0A1A6HD85;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 13-SEP-2023, entry version 29.
DE RecName: Full=Matrin-type domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=A6R68_17019 {ECO:0000313|EMBL:OBS76528.1};
OS Neotoma lepida (Desert woodrat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC Cricetidae; Neotominae; Neotoma.
OX NCBI_TaxID=56216 {ECO:0000313|EMBL:OBS76528.1, ECO:0000313|Proteomes:UP000092124};
RN [1] {ECO:0000313|EMBL:OBS76528.1, ECO:0000313|Proteomes:UP000092124}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=417 {ECO:0000313|EMBL:OBS76528.1};
RC TISSUE=Liver {ECO:0000313|EMBL:OBS76528.1};
RA Campbell M., Oakeson K.F., Yandell M., Halpert J.R., Dearing D.;
RT "The Draft Genome Sequence and Annotation of the Desert Woodrat Neotoma
RT lepida.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OBS76528.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LZPO01034893; OBS76528.1; -; Genomic_DNA.
DR STRING; 56216.A0A1A6HD85; -.
DR Proteomes; UP000092124; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0008380; P:RNA splicing; IEA:InterPro.
DR CDD; cd12716; RRM1_2_NP220; 1.
DR Gene3D; 3.30.70.330; -; 2.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 1.
DR InterPro; IPR000690; Matrin/U1-C_Znf_C2H2.
DR InterPro; IPR003604; Matrin/U1-like-C_Znf_C2H2.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR000504; RRM_dom.
DR InterPro; IPR033096; ZNF638_RRM1/2.
DR InterPro; IPR022755; Znf_C2H2_jaz.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR15592; MATRIN 3/NUCLEAR PROTEIN 220-RELATED; 1.
DR PANTHER; PTHR15592:SF1; ZINC FINGER PROTEIN 638; 1.
DR Pfam; PF12171; zf-C2H2_jaz; 1.
DR SMART; SM00360; RRM; 2.
DR SMART; SM00451; ZnF_U1; 1.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 1.
DR SUPFAM; SSF54928; RNA-binding domain, RBD; 2.
DR PROSITE; PS50102; RRM; 1.
DR PROSITE; PS50171; ZF_MATRIN; 1.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000092124};
KW RNA-binding {ECO:0000256|PROSITE-ProRule:PRU00176};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 705..786
FT /note="RRM"
FT /evidence="ECO:0000259|PROSITE:PS50102"
FT DOMAIN 1754..1784
FT /note="Matrin-type"
FT /evidence="ECO:0000259|PROSITE:PS50171"
FT REGION 76..133
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 148..204
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 591..683
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 896..937
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1103..1139
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1292..1335
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1358..1392
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1575..1628
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1641..1712
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1776..1804
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 84..103
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 148..181
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 184..204
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 591..611
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 643..665
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 905..928
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1292..1308
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1312..1335
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1368..1392
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1675..1702
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1804 AA; 199528 MW; BE532F7720A355EB CRC64;
MSFVLFLENF APLVNSLSLG IANPLLLGPS PLHLAQIKTQ LALQQPNAIA SHGPTPPYTL
LNQAFLKVAM SRPRFNPRGT FPLQRPRAPN PPGMRPPGPF MRPGSMGLPR FYPAGRARGI
PHRFPGHESY QNMGPQRMNV QVTQHRTDPR LTKEKMDFPE AQQKKGKPHG SRWDDEPHIT
PPVEVKQSSV TQVTEQSPKV QSRYTKESAS SILASFGLSN EDLEELSRYP DEQLTPENMP
LILRDIRMRK MGRRLPNLPS HSRNKETLGK EAVSSNVIDY GHASKYGYTE DPLEVRIYDP
EIPTDEVKNE FRSQQNISAT VPSPNVICNS VFPVEDVFRQ MDFPGESSSQ SFFPVESGTK
MSGIHISGQS VLEPVKSVSQ STSQTVSQTV SQSLIPPSVN QPSFSSELIS ALSQQERIPH
KPVISSADTH VGPRGNKKSY QSETDLPIRS PFGIVKASWL PKFTQAGTQK MKRLPTPSMM
NDYYAASPRI FPHLCSLCNV EYTLIGTLRS SHPEEMRAIG KKMKLQEDVL ILIPXVRGIL
GDQAQVTDSI DLEVQLVIPI DQGVEVREFA IVSFLNTDQD PDPVLHTEVE ITLEETQSPT
DRKKALEDGG QRSTHGTEVN KXKNTEAVDK GLLPAQKPKL PSGTKPSVKS VSSLKSDSNL
GENAAHKSKN LEDDTLPEGK QVSGKGAFIQ RKIRSRKDQS LSSNSILLVS ELPEDGFTEE
DIRKAFQPFG SITDVLLVPC RNEAYLEMEL RKTVTAIMKY IETTPLEING KTKSGKKSLE
AKKSGIIKNK DSNKLVTVPG TLEATENEPV SKEMEEMSVV FISNLPNKGY STEEIYNLAK
PFGGLKDILI LSSHKKAYIE INKKSADSMV KFYTCFPISM DGNQLSISTA PEDVDIKDEK
VTQSTFSPDL KNSPVDESEV QTAADSPSVK PSEAEEETAC NIETETSVQQ ETLGKEESKQ
ALYESDFAIE TLELEAQGAE VSIEIPLVAS TPANNEFFSE NIEESALNQQ MYTSDFVKEE
AEVTNPETEL SISDSVFTEE RNIKGILEDS PSEAEDSFSG IAQPMVEAIA EVDKHETVSE
VLPSACVVTQ VPGSYIEDEK VVSKKDTSEK GSMDDKEENE FNTEETRMDL QVNTEKAEKN
ETDTFVEKLE KIIAAIREKP IESAVIKADP KKGVGQSSKP DETGKTSVLT VSNVCSSKTS
IKAAVVSSPK AKSTTSKTES QKIFLKPVLR EQINAEKKVS AKEFGLLKNT RLDLAESGSK
SKSTQSGVNR GCSARISALQ CKDSKLDYKD ITKQSQETEA KPPIMKRDDS NNKALALQNT
KNSKSTTGRS SKSKEEPLFT FNLDEFVTVD EVIEEVNPSQ AXQNPLKGKR KETFKIPPSP
ELNLKKKKGE TSVPHSVEGE LSFVTLDEIG EEEDAAAQAL VTVDEVIDEE DLNMEEMVKN
SNSLLTLDEL IDQDDCISHS EPKDVTVLSM AEEQDLQQER LVTVDEIGEV EESADITFAT
LNSKRDEGTT VRDSIGFISS QMPEDPSTLV TVDEIQDDRS DFHLVTLDEV TEEDEDSLAD
FKNLKEELNF VTVDEVGDEE DGDNDSEVEL AQGKIDHHTA KKGXRKRRAV DPKKTKLESF
SQVGPGSETV TQDLKTMIER PTAAKTPMKR VRVGKTSPSQ KAVMAEPAKG EEAFQMSEGV
EETELKESEP DEKRKKTEDS SLGKSVTPDV PGAEAKRSSR ILVPPVKLLL ATLALLQAVD
TASRNLHQPQ KAGFFCPICS LFYSDEKAMA NHCKSTRHKQ NTEKFMAKKR KEKEQNETEE
RSSR
//