ID G3R9S9_GORGO Unreviewed; 950 AA.
AC G3R9S9;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 28-FEB-2018, sequence version 2.
DT 27-MAR-2024, entry version 48.
DE SubName: Full=NHS like 2 {ECO:0000313|Ensembl:ENSGGOP00000012174.3};
GN Name=NHSL2 {ECO:0000313|Ensembl:ENSGGOP00000012174.3};
OS Gorilla gorilla gorilla (Western lowland gorilla).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Gorilla.
OX NCBI_TaxID=9595 {ECO:0000313|Ensembl:ENSGGOP00000012174.3, ECO:0000313|Proteomes:UP000001519};
RN [1] {ECO:0000313|Ensembl:ENSGGOP00000012174.3, ECO:0000313|Proteomes:UP000001519}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Scally A.;
RT "Insights into the evolution of the great apes provided by the gorilla
RT genome.";
RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSGGOP00000012174.3, ECO:0000313|Proteomes:UP000001519}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=22398555; DOI=10.1038/nature10842;
RA Scally A., Dutheil J.Y., Hillier L.W., Jordan G.E., Goodhead I.,
RA Herrero J., Hobolth A., Lappalainen T., Mailund T., Marques-Bonet T.,
RA McCarthy S., Montgomery S.H., Schwalie P.C., Tang Y.A., Ward M.C., Xue Y.,
RA Yngvadottir B., Alkan C., Andersen L.N., Ayub Q., Ball E.V., Beal K.,
RA Bradley B.J., Chen Y., Clee C.M., Fitzgerald S., Graves T.A., Gu Y.,
RA Heath P., Heger A., Karakoc E., Kolb-Kokocinski A., Laird G.K., Lunter G.,
RA Meader S., Mort M., Mullikin J.C., Munch K., O'Connor T.D., Phillips A.D.,
RA Prado-Martinez J., Rogers A.S., Sajjadian S., Schmidt D., Shaw K.,
RA Simpson J.T., Stenson P.D., Turner D.J., Vigilant L., Vilella A.J.,
RA Whitener W., Zhu B., Cooper D.N., de Jong P., Dermitzakis E.T.,
RA Eichler E.E., Flicek P., Goldman N., Mundy N.I., Ning Z., Odom D.T.,
RA Ponting C.P., Quail M.A., Ryder O.A., Searle S.M., Warren W.C.,
RA Wilson R.K., Schierup M.H., Rogers J., Tyler-Smith C., Durbin R.;
RT "Insights into hominid evolution from the gorilla genome sequence.";
RL Nature 483:169-175(2012).
RN [3] {ECO:0000313|Ensembl:ENSGGOP00000012174.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CABD030125725; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CABD030125726; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CABD030125727; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CABD030125728; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CABD030125729; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CABD030125730; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CABD030125731; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CABD030125732; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; G3R9S9; -.
DR Ensembl; ENSGGOT00000012527.3; ENSGGOP00000012174.3; ENSGGOG00000012481.3.
DR eggNOG; ENOG502QQ7S; Eukaryota.
DR GeneTree; ENSGT00950000182963; -.
DR HOGENOM; CLU_009104_0_0_1; -.
DR TreeFam; TF333323; -.
DR Proteomes; UP000001519; Chromosome X.
DR Bgee; ENSGGOG00000012481; Expressed in cerebellum and 5 other cell types or tissues.
DR InterPro; IPR024845; NHS-like.
DR PANTHER; PTHR23039; NANCE-HORAN SYNDROME PROTEIN; 1.
DR PANTHER; PTHR23039:SF2; NHS-LIKE PROTEIN 2; 1.
DR Pfam; PF15273; NHS; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000001519}.
FT REGION 1..112
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 128..148
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 179..357
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 395..491
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 537..734
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 764..818
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 853..927
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..43
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 248..274
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 395..420
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 431..461
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 537..557
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 577..598
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 599..613
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 626..641
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 660..695
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 778..796
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 865..894
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 950 AA; 102039 MW; 5948A6E395542C2B CRC64;
MMGNSHHKQP RSKSQSRMHS ATGHSNSPAG SVAHSTTSDI RPSHSVPEGV HGRVAVGQDA
WFPSLTSPVL RTPSSEPDEP HQARSGPNPP GMESMGMVYS VPSSCNGPTE SSFSTSWKGD
AFTYMTPSAT SQSNQVNENG KNPSCGNSWV SLNKVPPLVP KEATTLLVAR DNPAGCSGSA
GYPEHLIQQR RMPERPSEIG LLTSGTSRLE TGPGGASRFR ERSLSVPTDS GTTDVDYDEE
QKANEACALP FASTSSEGSN SADNIASLSA QQEAQHRRQR SKSISLRKAK KKPSPPTRSV
SLVKDEPGLL PEGGSALPKD QRPKSLCLSL EHQGHHSSHP DAQGHPAVPN HKDPESTQFS
HHWYLTDWKS GDTYQSLSSS STATGTTVIE CTQVQGSSES LASPSTSRAT TPSQLSIEVE
AREISSPGRP PGLMSPSSGY SSQSETPTPT VSMSLTLGHL PPPSSSVRVR PVVPERKSSL
PPTSPMEKFP KSRLSFDLPL TSSPNLDLSG MSISIRSKTK VSRHHSETNF GVKLAQKTNP
NQPIMPMVTQ SDLRSVRLRS VSKSEPEDDI ESPEYAEEPR AEEVFTLPER KTKPPVAEKP
PVARRPPSLV HKPPSVPEEY ALTSPTLATP PRSSIQHARP LPQDSYTVVR KPKPSSFPDG
RSPGESTAPS SLVFTPFASS SDAFFSGTQQ PPQGSVEDEG PKVRVLPERI SLQSQEEAEK
KKGKIPPPVP KKPSVLYLPL TSPTAQMEAY VAEPRLPLSP IITLEEDTKC PPTGDDLQSL
GQRVTSTPQA DSEREASPLG SSVEPGTEEK SLISDKTAEW IAEDDDDVFV ASRTTEDLFT
VIHRSKRKLL GWKEPGEAFV GGRTSSHSPI KNTAESPISE STATAGSGSS ANLDAGRNDD
FKALLQKKGS KATPRSRPSA AELLKTTNPL ARRIIAQFSK DYETTDNPST
//