GenomeNet

Database: UniProt
Entry: A0A1A6HC74_NEOLE
ID   A0A1A6HC74_NEOLE        Unreviewed;       389 AA.
AC   A0A1A6HC74;
DT   05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT   05-OCT-2016, sequence version 1.
DT   03-JUL-2019, entry version 14.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:OBS75871.1};
GN   ORFNames=A6R68_17677 {ECO:0000313|EMBL:OBS75871.1};
OS   Neotoma lepida (Desert woodrat).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha;
OC   Muroidea; Cricetidae; Neotominae; Neotoma.
OX   NCBI_TaxID=56216 {ECO:0000313|EMBL:OBS75871.1, ECO:0000313|Proteomes:UP000092124};
RN   [1] {ECO:0000313|EMBL:OBS75871.1, ECO:0000313|Proteomes:UP000092124}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=417 {ECO:0000313|EMBL:OBS75871.1};
RC   TISSUE=Liver {ECO:0000313|EMBL:OBS75871.1};
RA   Campbell M., Oakeson K.F., Yandell M., Halpert J.R., Dearing D.;
RT   "The Draft Genome Sequence and Annotation of the Desert Woodrat
RT   Neotoma lepida.";
RL   Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|RuleBase:RU004019,
CC       ECO:0000256|SAAS:SAAS00594387}.
CC   -!- SIMILARITY: Belongs to the ETS family.
CC       {ECO:0000256|RuleBase:RU004019, ECO:0000256|SAAS:SAAS00594391}.
CC   -!- CAUTION: The sequence shown here is derived from an
CC       EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is
CC       preliminary data. {ECO:0000313|EMBL:OBS75871.1}.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   -----------------------------------------------------------------------
DR   EMBL; LZPO01035210; OBS75871.1; -; Genomic_DNA.
DR   Proteomes; UP000092124; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR   GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR   CDD; cd08542; SAM_PNT-ETS-1; 1.
DR   Gene3D; 1.10.10.10; -; 1.
DR   Gene3D; 1.10.150.50; -; 1.
DR   InterPro; IPR000418; Ets_dom.
DR   InterPro; IPR003118; Pointed_dom.
DR   InterPro; IPR013761; SAM/pointed_sf.
DR   InterPro; IPR041886; SAM_PNT-ETS-1.
DR   InterPro; IPR036388; WH-like_DNA-bd_sf.
DR   InterPro; IPR036390; WH_DNA-bd_sf.
DR   Pfam; PF00178; Ets; 1.
DR   Pfam; PF02198; SAM_PNT; 1.
DR   PRINTS; PR00454; ETSDOMAIN.
DR   SMART; SM00413; ETS; 1.
DR   SMART; SM00251; SAM_PNT; 1.
DR   SUPFAM; SSF46785; SSF46785; 1.
DR   SUPFAM; SSF47769; SSF47769; 1.
DR   PROSITE; PS00346; ETS_DOMAIN_2; 1.
DR   PROSITE; PS50061; ETS_DOMAIN_3; 1.
DR   PROSITE; PS51433; PNT; 1.
PE   3: Inferred from homology;
KW   Complete proteome {ECO:0000313|Proteomes:UP000092124};
KW   DNA-binding {ECO:0000256|RuleBase:RU004019,
KW   ECO:0000256|SAAS:SAAS00594397};
KW   Nucleus {ECO:0000256|RuleBase:RU004019,
KW   ECO:0000256|SAAS:SAAS00594400};
KW   Reference proteome {ECO:0000313|Proteomes:UP000092124}.
FT   DOMAIN       23    108       PNT. {ECO:0000259|PROSITE:PS51433}.
FT   DOMAIN      319    363       ETS. {ECO:0000259|PROSITE:PS50061}.
SQ   SEQUENCE   389 AA;  44418 MW;  566E573EBBA8963A CRC64;
     MECADVPLLT PSSKEMMSQA LKATFSGFTK EQQRLGIPKD PRQWTETHVR DWVMWAVNEF
     SLKGVDFQKF CMNGASLCAL GKECFLDLAP DFVGDILWEH LEILQKEDVK PYQVNGVNPT
     YPESRYTSDY FISYGIEHAQ CVPPSEFSEP SFITESYQTL HPISSEELLS LKYENDYPSV
     ILRDPLQTDT LQTDYFAIKQ EVLTPDNMCM GRASRGKLGG QDSFESIESY DSCDRLTQSW
     SSQSSFNSLQ RVPSYDSFDS EDYPAALPSH KPKGTFKDYV RDRADLNKDK PVIPAAALAG
     YTGVWGVDLD AAGRYAKLVA RRWGKRKNKP KMNYEKLSRG LRYYYDKNII HKTAGKRYVY
     RFVCDLQSLL GYTPEELHAM LDVKPDADE
//
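The SQ line above declares a length of 389 AA, and the sequence itself follows in space-separated blocks of ten residues until the `//` terminator. A minimal sketch of reading that block and comparing the declared length with the counted one is shown below. The function name `read_sequence` and the `RECORD` string are illustrative assumptions, not part of any UniProt tooling; `RECORD` holds only the ID and SQ portion of the entry, copied from above.

```python
import re

# ID line and SQ block copied from the entry above (other line types omitted).
RECORD = """\
ID   A0A1A6HC74_NEOLE        Unreviewed;       389 AA.
SQ   SEQUENCE   389 AA;  44418 MW;  566E573EBBA8963A CRC64;
     MECADVPLLT PSSKEMMSQA LKATFSGFTK EQQRLGIPKD PRQWTETHVR DWVMWAVNEF
     SLKGVDFQKF CMNGASLCAL GKECFLDLAP DFVGDILWEH LEILQKEDVK PYQVNGVNPT
     YPESRYTSDY FISYGIEHAQ CVPPSEFSEP SFITESYQTL HPISSEELLS LKYENDYPSV
     ILRDPLQTDT LQTDYFAIKQ EVLTPDNMCM GRASRGKLGG QDSFESIESY DSCDRLTQSW
     SSQSSFNSLQ RVPSYDSFDS EDYPAALPSH KPKGTFKDYV RDRADLNKDK PVIPAAALAG
     YTGVWGVDLD AAGRYAKLVA RRWGKRKNKP KMNYEKLSRG LRYYYDKNII HKTAGKRYVY
     RFVCDLQSLL GYTPEELHAM LDVKPDADE
//
"""


def read_sequence(record):
    """Return (declared_length, sequence) from a flat-file record string.

    Hypothetical helper: it only understands the SQ header, the sequence
    lines that follow it, and the '//' terminator.
    """
    declared = None
    chunks = []
    in_sequence = False
    for line in record.splitlines():
        if line.startswith("SQ   "):
            # The SQ header carries the declared length, e.g. "389 AA;".
            match = re.search(r"(\d+) AA;", line)
            if match is None:
                raise ValueError("SQ line has no length field")
            declared = int(match.group(1))
            in_sequence = True
        elif line.startswith("//"):
            # End-of-entry marker closes the sequence block.
            in_sequence = False
        elif in_sequence:
            # Sequence lines are blocks of ten residues separated by spaces.
            chunks.append(line.replace(" ", ""))
    if declared is None:
        raise ValueError("no SQ line found")
    return declared, "".join(chunks)


declared_length, sequence = read_sequence(RECORD)
length_matches = declared_length == len(sequence)
```

Comparing `declared_length` with `len(sequence)` is a cheap way to catch truncated copies of an entry. For anything beyond a quick check, an established flat-file parser is likely a better choice than this hand-rolled sketch.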