ID A0A2I3HGA4_NOMLE Unreviewed; 661 AA.
AC A0A2I3HGA4;
DT 28-FEB-2018, integrated into UniProtKB/TrEMBL.
DT 28-FEB-2018, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=Sorbin and SH3 domain containing 2 {ECO:0000313|Ensembl:ENSNLEP00000042496.1};
GN Name=SORBS2 {ECO:0000313|Ensembl:ENSNLEP00000042496.1};
OS Nomascus leucogenys (Northern white-cheeked gibbon) (Hylobates leucogenys).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hylobatidae;
OC Nomascus.
OX NCBI_TaxID=61853 {ECO:0000313|Ensembl:ENSNLEP00000042496.1, ECO:0000313|Proteomes:UP000001073};
RN [1] {ECO:0000313|Ensembl:ENSNLEP00000042496.1, ECO:0000313|Proteomes:UP000001073}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Gibbon Genome Sequencing Consortium;
RL Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSNLEP00000042496.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADFV01066245; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADFV01066246; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADFV01066247; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; A0A2I3HGA4; -.
DR Ensembl; ENSNLET00000054347.1; ENSNLEP00000042496.1; ENSNLEG00000011297.2.
DR GeneTree; ENSGT00940000157056; -.
DR Proteomes; UP000001073; Chromosome 7b.
DR CDD; cd11923; SH3_Sorbs2_2; 1.
DR Gene3D; 2.30.30.40; SH3 Domains; 3.
DR InterPro; IPR036028; SH3-like_dom_sf.
DR InterPro; IPR001452; SH3_domain.
DR InterPro; IPR003127; SoHo_dom.
DR PANTHER; PTHR14167:SF56; DREBRIN-LIKE PROTEIN-RELATED; 1.
DR PANTHER; PTHR14167; SH3 DOMAIN-CONTAINING; 1.
DR Pfam; PF00018; SH3_1; 2.
DR Pfam; PF14604; SH3_9; 1.
DR Pfam; PF02208; Sorb; 1.
DR PRINTS; PR00499; P67PHOX.
DR PRINTS; PR00452; SH3DOMAIN.
DR SMART; SM00326; SH3; 3.
DR SMART; SM00459; Sorb; 1.
DR SUPFAM; SSF50044; SH3-domain; 3.
DR PROSITE; PS50002; SH3; 3.
DR PROSITE; PS50831; SOHO; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000001073};
KW SH3 domain {ECO:0000256|ARBA:ARBA00022443, ECO:0000256|PROSITE-
KW ProRule:PRU00192}.
FT DOMAIN 135..196
FT /note="SoHo"
FT /evidence="ECO:0000259|PROSITE:PS50831"
FT DOMAIN 424..483
FT /note="SH3"
FT /evidence="ECO:0000259|PROSITE:PS50002"
FT DOMAIN 499..560
FT /note="SH3"
FT /evidence="ECO:0000259|PROSITE:PS50002"
FT DOMAIN 602..661
FT /note="SH3"
FT /evidence="ECO:0000259|PROSITE:PS50002"
FT REGION 1..72
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 204..309
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 323..425
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..31
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 46..72
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 204..233
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 236..251
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 252..282
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 359..393
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 661 AA; 73885 MW; 5AC0BCA1B95DE5BE CRC64;
MNTGRDSQSP DSAWRSYNDG NQETLNGDAT YSSLAAKGFR SVRPNLQDKR SPTQSQITVN
GNSGGAVSPM SYYQRPFSPS AYSLPASLNS SIVMQHGTSL DSTDTYPQHA QSLDGTTSSS
IPLYRSSEEE KRVTVIKAPH YPGIGPVDES GIPTAIRTTV DRPKDWYKTM FKQIHMVHKP
DDDTDMYHTP YTYNAGLYNP PYSAQSHPAA KTQTYRPLSK SHSDNSTNAF KDASSPVPPP
HVPPPVPPLR PRDRSSTEKH DWDPPDRKVD TRKFRSEPRS IFEYEPGKSS ILQHERPPPK
KPLDYVQDHS SGVFNEASLY QSSIDRSLER PMSSASMASD FRKRRKSEPA VGPPRGLGDQ
SASRTSPGRV DLPGSSTPLT KSFTSSSPSS PSRAKDRESP RSYSSTLIDI GRSAPRERRG
TPEKEKLPAK AVYDFKAQTS KELSFKKGDT VYILRKIDQN WYEGEHHGRV GIFPISYVEK
LTPPEKAQPA RPPPPAQPGE IGEAIAKYNF NADTNVELSL RKGDRVILLK RVDQNWYEGK
IPGTNRQGIF PVSYVEVVKK NTKGAEDYPD PPIPHSYSSD RIHSLSSNKP QRPVFTHENI
QGGGEPFQAL YNYTPRNEDE LELRESDVID VMEKCDDGWF VGTSRRTKFF GTFPGNYVKR
L
//