ID A0A1A6HLN3_NEOLE Unreviewed; 709 AA.
AC A0A1A6HLN3;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 22-FEB-2023, entry version 21.
DE RecName: Full=DIRP domain-containing protein {ECO:0000259|SMART:SM01135};
DE Flags: Fragment;
GN ORFNames=A6R68_19008 {ECO:0000313|EMBL:OBS78602.1};
OS Neotoma lepida (Desert woodrat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC Cricetidae; Neotominae; Neotoma.
OX NCBI_TaxID=56216 {ECO:0000313|EMBL:OBS78602.1, ECO:0000313|Proteomes:UP000092124};
RN [1] {ECO:0000313|EMBL:OBS78602.1, ECO:0000313|Proteomes:UP000092124}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=417 {ECO:0000313|EMBL:OBS78602.1};
RC TISSUE=Liver {ECO:0000313|EMBL:OBS78602.1};
RA Campbell M., Oakeson K.F., Yandell M., Halpert J.R., Dearing D.;
RT "The Draft Genome Sequence and Annotation of the Desert Woodrat Neotoma
RT lepida.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the lin-9 family.
CC {ECO:0000256|ARBA:ARBA00006732}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OBS78602.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LZPO01027406; OBS78602.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1A6HLN3; -.
DR STRING; 56216.A0A1A6HLN3; -.
DR Proteomes; UP000092124; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0017053; C:transcription repressor complex; IEA:InterPro.
DR GO; GO:0006351; P:DNA-templated transcription; IEA:InterPro.
DR InterPro; IPR033471; DIRP.
DR InterPro; IPR010561; LIN-9/ALY1.
DR InterPro; IPR045831; LIN9_C.
DR PANTHER; PTHR21689; LIN-9; 1.
DR PANTHER; PTHR21689:SF2; PROTEIN LIN-9 HOMOLOG; 1.
DR Pfam; PF06584; DIRP; 1.
DR Pfam; PF19438; LIN9_C; 3.
DR SMART; SM01135; DIRP; 1.
PE 3: Inferred from homology;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000092124}.
FT DOMAIN 353..458
FT /note="DIRP"
FT /evidence="ECO:0000259|SMART:SM01135"
FT REGION 1..66
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 97..202
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 266..305
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 16..30
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 285..302
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:OBS78602.1"
SQ SEQUENCE 709 AA; 79112 MW; FDECF4563C6A8B2C CRC64;
SLVTVRLPSS LPVSGGEGIS ERTQDSSLAV GLHQLGDLSR SPPEGRSQLR FRQKLRRPGP
RVRGADAAAW ALARGKGGTE APETRAVWGA EYALLRAEPA PCKATRNPGP VRRATRPRET
NSLQFPQPQR GPRAGSERRR GGAGGRAGAS GLAPRPPRSA LRGLSARPPP ALLQARSGVE
ARPPEAPSGQ ARAEPDGLAG AGAAGRELLI RTSLADSLRP TMHRGGQPLK KRRGSFKMAE
LDQLPDEKGS LSNTWNEKYS SLQKTPVWKG RNTGPAVEMP FRNSKRSRLF SDEDDRQINT
KSPKRNQRVA MIPQKFTATM STPDKKASQK IGFRLRNLLK LPKAHKWCIY EWFYSNIDKP
LFEGDNDFCV CLKESFPNLK TRKLTRVEWG KIRRLMGKPR RCSSAFFEEE RSALKQKRQK
IRLLQQRKVA DVSQFKDLPD EIPLPLVIGT KVTARLRGVH DGLFTGQIDA VDTLNATYRV
TFDRTGLGTH TIPDYEVLSN EPHETMPIAA FGQKQRPSRF FMTPPRLHYT PPLQSPITDS
DPLLGQSPWR SKISGSDTET LGGFPVEFLI QVTKLSKILM IKKEHIKKLR EMNTEAEKLK
SYSMPIGIEF QRRYATIVLE LEQLNKDLNK PTDLRRRCEE EAQEIVRQAN SSSGQPCVEN
ENLTDLIARL TAILLQIKNN VEIHVAHIQS GLSQMGNLHA FAANNTNRD
//