ID A0A1A6HVH8_NEOLE Unreviewed; 908 AA.
AC A0A1A6HVH8;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 27-MAR-2024, entry version 29.
DE RecName: Full=Arsenite-resistance protein 2 {ECO:0000256|ARBA:ARBA00033161};
GN ORFNames=A6R68_23778 {ECO:0000313|EMBL:OBS82229.1};
OS Neotoma lepida (Desert woodrat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC Cricetidae; Neotominae; Neotoma.
OX NCBI_TaxID=56216 {ECO:0000313|EMBL:OBS82229.1, ECO:0000313|Proteomes:UP000092124};
RN [1] {ECO:0000313|EMBL:OBS82229.1, ECO:0000313|Proteomes:UP000092124}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=417 {ECO:0000313|EMBL:OBS82229.1};
RC TISSUE=Liver {ECO:0000313|EMBL:OBS82229.1};
RA Campbell M., Oakeson K.F., Yandell M., Halpert J.R., Dearing D.;
RT "The Draft Genome Sequence and Annotation of the Desert Woodrat Neotoma
RT lepida.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the ARS2 family.
CC {ECO:0000256|ARBA:ARBA00005407}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OBS82229.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LZPO01008120; OBS82229.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1A6HVH8; -.
DR STRING; 56216.A0A1A6HVH8; -.
DR Proteomes; UP000092124; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR039727; SE/Ars2.
DR InterPro; IPR007042; SERRATE/Ars2_C.
DR InterPro; IPR021933; SERRATE/Ars2_N.
DR PANTHER; PTHR13165; ARSENITE-RESISTANCE PROTEIN 2; 1.
DR PANTHER; PTHR13165:SF0; SERRATE RNA EFFECTOR MOLECULE HOMOLOG; 1.
DR Pfam; PF04959; ARS2; 1.
DR Pfam; PF12066; SERRATE_Ars2_N; 1.
DR SUPFAM; SSF54928; RNA-binding domain, RBD; 1.
PE 3: Inferred from homology;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000092124}.
FT DOMAIN 153..262
FT /note="SERRATE/Ars2 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF12066"
FT DOMAIN 645..883
FT /note="SERRATE/Ars2 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF04959"
FT REGION 1..90
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 272..411
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 571..596
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 865..886
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 272..347
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 363..382
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 383..411
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 908 AA; 103818 MW; 5708863F88B3CF30 CRC64;
MGDSDDEYDR RRRDKFRRER SDYDRSRERD ERRRGDDWND REWDRGRERR SRGEYRDYDR
NRRERFSPPR HELSPPQKRM RRDWDEHSSD PYHSGYDMPY AGGGGGPTYG PPQPWGHPDV
HIMQHHVLPI QARLGSIAEI DLGVPPPVMK SFKEFLLSLD DSVDETEAVK RYNDYKLDFR
RQQMQDFFLA HKDEEWFRSK YHPDEVGKRR QEARGALQNR LKVFLSLMES GWFDNLLLDI
DKADAIVKML DAAVIKMEGG TENDLRILEQ EEEEEQAGKT GEASKKEEGR AGPGPGDGER
KVNDKDEKKE EGKQAENDSS NDDKTKKSEG DGDKEEKKEE AEKEAKKSKK RNRKHSGDDS
FDEGSVSESE SESESGQAEE EKEEAEEALK EKEKPKEEER EKPKDAAGLE CKPRPLHKTC
SLFMRNIAPN ISRAEIISLC KRYPGFMRVA LSEPQPERRF FRRGWVTFDR SVNIKEICWN
LQNIRLRECE LSPGVNRDLT RRVRNINGIT QHKQIVRNDI KLAAKLIHTL DDRTQLWASE
PGTPPVPTSL PSQNPILKNI TDYLIEEVSA EEEELLGSSG GPPPEEPPKE GNPAEMNVER
DEKLIKVLDK LLLYLRIVHS LDYYNTCEYP NEDEMPNRCG IIHVRGPMPP NRISHGEGTL
QVPPSCLLCP TSVFSCPSSL PAPSSLSSSS VIRIVLEWQK TFEEKLTPLL SVRESLSEEE
AQKMGRKDPE QEVEKFVTSN TQELGKDKWL CPLSGKKFKG PEFVRKHIFN KHAEKIEEVK
KEVAFFNNFL TDAKRPALPE IKPAQPPGPA QSLTPGLPYP HQTPQGLMPY GQPRPPILGY
GAGAVRPAVP TGGPPYPHAP YGAGRGNYDA FRGQGGYPGK PRNRMVRGDP RAIVEYRDLD
APDDVDFF
//