ID G7NZ23_MACFA Unreviewed; 771 AA.
AC G7NZ23;
DT 25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT 25-JAN-2012, sequence version 1.
DT 22-FEB-2023, entry version 38.
DE RecName: Full=S1 motif domain-containing protein {ECO:0000259|PROSITE:PS50126};
GN ORFNames=EGM_10864 {ECO:0000313|EMBL:EHH51486.1};
OS Macaca fascicularis (Crab-eating macaque) (Cynomolgus monkey).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9541;
RN [1] {ECO:0000313|EMBL:EHH51486.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CE-4 {ECO:0000313|EMBL:EHH51486.1};
RX PubMed=22002653; DOI=10.1038/nbt.1992;
RA Yan G., Zhang G., Fang X., Zhang Y., Li C., Ling F., Cooper D.N., Li Q.,
RA Li Y., van Gool A.J., Du H., Chen J., Chen R., Zhang P., Huang Z.,
RA Thompson J.R., Meng Y., Bai Y., Wang J., Zhuo M., Wang T., Huang Y.,
RA Wei L., Li J., Wang Z., Hu H., Yang P., Le L., Stenson P.D., Li B., Liu X.,
RA Ball E.V., An N., Huang Q., Zhang Y., Fan W., Zhang X., Li Y., Wang W.,
RA Katze M.G., Su B., Nielsen R., Yang H., Wang J., Wang X., Wang J.;
RT "Genome sequencing and comparison of two nonhuman primate animal models,
RT the cynomolgus and Chinese rhesus macaques.";
RL Nat. Biotechnol. 29:1019-1023(2011).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001277; EHH51486.1; -; Genomic_DNA.
DR AlphaFoldDB; G7NZ23; -.
DR eggNOG; ENOG502QPJ1; Eukaryota.
DR OrthoDB; 2918730at2759; -.
DR Proteomes; UP000009130; Chromosome 2.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR CDD; cd00164; S1_like; 1.
DR Gene3D; 2.40.50.140; Nucleic acid-binding proteins; 1.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 1.
DR InterPro; IPR012340; NA-bd_OB-fold.
DR InterPro; IPR003029; S1_domain.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR InterPro; IPR019734; TPR_repeat.
DR InterPro; IPR039190; TTC14.
DR PANTHER; PTHR23184; TETRATRICOPEPTIDE REPEAT PROTEIN 14; 1.
DR PANTHER; PTHR23184:SF9; TETRATRICOPEPTIDE REPEAT PROTEIN 14; 1.
DR Pfam; PF13414; TPR_11; 1.
DR SMART; SM00028; TPR; 3.
DR SUPFAM; SSF50249; Nucleic acid-binding proteins; 1.
DR SUPFAM; SSF48452; TPR-like; 1.
DR PROSITE; PS50126; S1; 1.
DR PROSITE; PS50005; TPR; 2.
PE 4: Predicted;
KW TPR repeat {ECO:0000256|PROSITE-ProRule:PRU00339}.
FT DOMAIN 126..208
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT REPEAT 307..340
FT /note="TPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
FT REPEAT 341..374
FT /note="TPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
FT REGION 35..56
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 439..458
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 464..551
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 570..634
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 656..749
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 477..497
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 498..514
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 615..634
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 681..743
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 771 AA; 88305 MW; 26F30B77BC08894C CRC64;
MDRDLLRQSL NCHGSSLLSL LRSEQQDNPH FRSLLGSAAE PTRGPPPQQP LQGRKEKRVD
NIEIQKFISK KADLLFALSW KSDAPATSET NEDSEDHYAV MPPLEQFMEI PSMDRRELFF
RDIERGDIVI GRISSVREFG FFMVLICLGS GIMRDIAHLE ITALCPLRDV PSHSNHGDPL
SYYQTGDIIR AGIKDIDRYH EKLAVSLYSS SLPPHLSGIK LGVISSEELP LYYRRSVELN
SNSLESYENI MQSSLGFVNP GVVEFLLEKL GIDESNPPSL MRGLQSKNFS EDDFASALRK
KQSASWALKC VKIGVDYFKV GRHVDAMNEY NKALEIDKQN VEALVARGAL YATKGSLNKA
IEDFELALEN CPTHRNARKY LCQTLVERGG QLEEEDKFLN AESYYKKALA LDETFKDAED
ALQKLHKYMQ KSLELREKQA EKEEKQKTKK IETSAEKLRK LLKEEKRLKK KRRKSTSSSS
SVSSADESVS SSSSSSSSGH KRHKKHKRNR SESSRSSRRH SSRASSSQID QNRKDECYPV
PANTSASFLN HKQEVEKLLG KQDRLQYEKT QIKEKDRCPL SSSSLEIPDD FGGRSEDPGD
FYNSYRTQAG SSKTEKPYKS ERHFSSRRNS SDSFCRNSED KIYGYRRFEK DIEGRKEHYR
RWEPGSVRHS TSPASSDYSW KSVEKYKKYT HSGSRDFSRH EQRYRLNTNQ GEYEREDNYG
EDIKTEVPEE DALSSKEHSE SSVKKNLPQN LLNIFNQIAE FEKEKGNKSK N
//