ID A0A096MVG2_PAPAN Unreviewed; 2915 AA.
AC A0A096MVG2;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 25-MAY-2022, sequence version 3.
DT 24-JAN-2024, entry version 39.
DE SubName: Full=Trinucleotide repeat containing 18 {ECO:0000313|Ensembl:ENSPANP00000003837.3};
GN Name=TNRC18 {ECO:0000313|Ensembl:ENSPANP00000003837.3};
OS Papio anubis (Olive baboon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Papio.
OX NCBI_TaxID=9555 {ECO:0000313|Ensembl:ENSPANP00000003837.3, ECO:0000313|Proteomes:UP000028761};
RN [1] {ECO:0000313|Ensembl:ENSPANP00000003837.3, ECO:0000313|Proteomes:UP000028761}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Liu Y.L., Abraham K.A., Akbar H.A., Ali S.A., Anosike U.A., Aqrawi P.A.,
RA Arias F.A., Attaway T.A., Awwad R.A., Babu C.B., Bandaranaike D.B.,
RA Battles P.B., Bell A.B., Beltran B.B., Berhane-Mersha D.B., Bess C.B.,
RA Bickham C.B., Bolden T.B., Carter K.C., Chau D.C., Chavez A.C.,
RA Clerc-Blankenburg K.C., Coyle M.C., Dao M.D., Davila M.L.D.,
RA Davy-Carroll L.D., Denson S.D., Dinh H.D., Fernandez S.F., Fernando P.F.,
RA Forbes L.F., Francis C.F., Francisco L.F., Fu Q.F., Garcia-Iii R.G.,
RA Garrett T.G., Gross S.G., Gubbala S.G., Hirani K.H., Hogues M.H.,
RA Hollins B.H., Jackson L.J., Javaid M.J., Jhangiani S.J., Johnson A.J.,
RA Johnson B.J., Jones J.J., Joshi V.J., Kalu J.K., Khan N.K., Korchina V.K.,
RA Kovar C.K., Lago L.L., Lara F.L., Le T.-K.L., Lee S.L., Legall-Iii F.L.,
RA Lemon S.L., Liu J.L., Liu Y.-S.L., Liyanage D.L., Lopez J.L.,
RA Lorensuhewa L.L., Mata R.M., Mathew T.M., Mercado C.M., Mercado I.M.,
RA Morales K.M., Morgan M.M., Munidasa M.M., Ngo D.N., Nguyen L.N.,
RA Nguyen T.N., Nguyen N.N., Obregon M.O., Okwuonu G.O., Ongeri F.O.,
RA Onwere C.O., Osifeso I.O., Parra A.P., Patil S.P., Perez A.P., Perez Y.P.,
RA Pham C.P., Pu L.-L.P., Puazo M.P., Quiroz J.Q., Rouhana J.R., Ruiz M.R.,
RA Ruiz S.-J.R., Saada N.S., Santibanez J.S., Scheel M.S., Schneider B.S.,
RA Simmons D.S., Sisson I.S., Tang L.-Y.T., Thornton R.T., Tisius J.T.,
RA Toledanes G.T., Trejos Z.T., Usmani K.U., Varghese R.V., Vattathil S.V.,
RA Vee V.V., Walker D.W., Weissenberger G.W., White C.W., Williams A.W.,
RA Woodworth J.W., Wright R.W., Zhu Y.Z., Han Y.H., Newsham I.N.,
RA Nazareth L.N., Worley K.W., Muzny D.M., Rogers J.R., Gibbs R.G.;
RT "Whole Genome Assembly of Papio anubis.";
RL Submitted (MAR-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSPANP00000003837.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSPANT00000008529.3; ENSPANP00000003837.3; ENSPANG00000018936.4.
DR GeneTree; ENSGT00940000157099; -.
DR HOGENOM; CLU_000573_0_0_1; -.
DR Proteomes; UP000028761; Chromosome 4.
DR Bgee; ENSPANG00000018936; Expressed in aorta and 55 other cell types or tissues.
DR ExpressionAtlas; A0A096MVG2; baseline.
DR GO; GO:0003682; F:chromatin binding; IEA:InterPro.
DR CDD; cd04714; BAH_BAHCC1; 1.
DR Gene3D; 2.30.30.140; -; 1.
DR Gene3D; 2.30.30.490; -; 1.
DR InterPro; IPR001025; BAH_dom.
DR InterPro; IPR043151; BAH_sf.
DR InterPro; IPR048924; BAHCC1-like_Tudor.
DR PANTHER; PTHR12505; PHD FINGER TRANSCRIPTION FACTOR; 1.
DR PANTHER; PTHR12505:SF21; TRINUCLEOTIDE REPEAT-CONTAINING GENE 18 PROTEIN; 1.
DR Pfam; PF01426; BAH; 1.
DR Pfam; PF21744; BAHCC1-like_Tudor; 1.
DR SMART; SM00439; BAH; 1.
DR PROSITE; PS51038; BAH; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000028761}.
FT DOMAIN 2764..2909
FT /note="BAH"
FT /evidence="ECO:0000259|PROSITE:PS51038"
FT REGION 1..27
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 84..212
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 252..304
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 330..436
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 556..627
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 639..676
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 887..951
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 967..1013
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1052..1187
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1227..1260
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1444..1507
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1631..1802
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1859..2097
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2240..2718
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 85..103
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 167..181
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 191..205
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 253..267
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 275..300
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 410..424
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 887..903
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 972..994
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 998..1013
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1091..1114
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1444..1470
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1765..1788
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1911..1929
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1938..1953
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1984..2005
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2252..2266
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2276..2299
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2338..2363
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2386..2432
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2503..2525
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2549..2623
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2670..2684
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2693..2718
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2915 AA; 309363 MW; 3E865E1708A9B9F6 CRC64;
MDGRDFGPQR SVHGPPPPLL SGLAMDSHRV GAATAGRLPA SGLPGPLPPG KYMAGLNLHP
HPGFSHLPSG LYPSYLHLSH LEPPSSGSPL LSQLGQPSIF DTQKGQGPGG DGFYLPTAGA
PGALHSHVPS ARTPGGGHSS GPPAKGSSSR DGPAKERAGR GGEPPLLFGK KDPRARGEEA
SGPRGVVDLT QEARVEGRQD RGPPRLAERL SPFLTESKTK NAALQPSVLT MCNGGAGDAG
LPALVAEAGR GGAKEAARQD EGARLLRRTE TLLPGPRPCP SPLPPPPAPP KGPPAPPAAT
PAGVYTVFRE QGREHRVVAP TFVPSVEVFD ERPGPIQIAS QARDARARER EAGRPGVLQA
PPGSPRPLDR PEGLREKNSV IRSLKRPSPA DAPTVRAARA SPDPRAYLPA KELLKPEADP
RPCERAPRGP AGPAAQQAAK LFGLEPGRPP PTGPEHKWKP FELGNFATTQ MAVLAAQHHH
SRAEEEAAVV AASSSKKAYL DPGAVLPRSA ATCGRPIADM HSTAHGPGEA SAMQSLIKYS
GSFAREAVAV RPGGCGKKSP FGGLGTMKPE PAPTSAGAPR AQGRLPHSGG PAAGGSRQLK
RDPERPESAK AFGREGSGAQ GEAEVRHPPV GIAVAVARQK DSGGSGRLGP GLADQDRSLS
LSNVKGHGRA DEDCVDDRAR HREERLLGAR LDRDQEKLLR ESKELADLAR LHPTSCAPNG
LNPNLMVTGG PALAGSGRWS ADPAAHLATH PWLPRSGSTS MWLAGHPYGL GPPSLHQGMA
PAFPPGLGGS LPSAYQFVRD PQSGQLVVIP SDHLPHFAEL MERATVPPLW PALYPPGRSP
LHHAQQLQLF SQQHFLRQQE FLYLQQQAAQ ALELQRSAQL VQERLKAQEH RAEMEEKGSK
RGLEAAGKAG LATAGPGLLP RKPPGLTAGP AGTYGKAVSP PPSPRASPVA ALKAKVIQKL
EDVSKPPAYA YPATPSSHPT SPPPASPPPT PGITRKEEAP ENVVEKKDLE LEKEAPSPFQ
ALFSDMPPRY PFQALPPHYG RPYPFLLQPT AAADADGLAP DVPLPTDGPE RLALSPEDKP
IRLSPSKITE PLREGPEEEP LAEREVKAEV EDMDEGPTEL PPLESPLPLP AAEAMATPSP
TGGCGGGLLE AQALSATGQS CAEPSECPDF VEGPEPRVDS PGRTEPCTAA LDLGVQLTPE
TLVEAKEEPV EVPVAVPVVE AVPEEGLAQV APSESQPTLE MSDCDVPAGE GQCPSLEPQE
AVPVPGSTCY LEEASSDQFL PSLEDPLAGM NALAAAAELP QARPLPSPGA AGAQALEKLE
AAESLVLEQS FLHGITLLSE IAELELERRS QEIGAERALV ARPSLESLLA AGSHMLREVL
DGPVVDPLKN LRLPRELKPN KKYSWMRKKE ERMYAMKSSL EDMDALELDF RMRLAEVQRQ
YKEKQRELVK LQRRRDSEDR REEPHRSLAR RGPGRPRKRT HAPSALSPPR KRGKSGHSSG
KLSSKSLLTS DDYELGAGIR KRHKGSEEEH DTLVGMGKAR GRNQTWDEHE ASSDFISQLK
IKKKKMASDQ EQLASKLDKA LSLTKQDKLK SPFKFSDSAG GKSKTGGGCG RYLTPYDSLL
GKDRKAMAKG LGLSLKSSRE GKHKRAAKAR KMEVGFKARG QPKSAHSPFA SEVSSYSYNT
DSEEDEEFLK DEWPAQGPSS SKLTPSLLCG MVAKNSKAAG GPKLTKRGLV APRTLKPKPA
TSRKQPFCLL LREAEARSSF SDSSEESFDQ DESSEEEDEE EELEEEDEAS GSGYRLGARE
RALSPGLEES GLGLLARFAA SALPSPTVGP SLSVVQLEAK QKARKKEERQ SLLGTEFEYT
DSESEVKVRK RSPAGLLRPK KGLGEPGPSL AAPTPGARGP GPSPSSPDKA KLAVEKGRKA
RKLRGPKEPG FEAGPEASDD DLWTRRRSER IFLHDASAAA PAPTSTAPAT KASRCAKGGP
LSPRKDTGRA KDRKDPRKKK KGKEAGPGAG VPPPRAPALP SEARAPHASS LTAAKRSKAK
AKGKEVKKEN RGKGGAVSKL MESMAAEEDF EPNQDSSFSE DEHLPRGGAV ERPLTPAPRS
CIIDKDELKD GLRVLIPMDD KLLYAGHVQT VHSPDIYRVV VEGERGNRPH IYCLEQLLQE
AIIDVRPAST RFLPQGTRIA AYWSQQYRCL YPGTVVRGLL DLEDDGDLIT VEFDDGDTGR
IPLSHIRLLP PDYKIQCAEP SPALLVPSAK RRSRKTSKDT GEGKDGGTTG SEEPGAKARG
RGRKPSAKAK GDRAATLEEG SPTDEVPSTP LALEPSSTPG SKKSPPEPVD KRAKTPKARP
APPQPSPAPP AFTSCPAPEP FGELPAPAST LAAAPLITMP ATRPKPKKAR AAEESGAKGP
RRPGEETELL VKLDHEGVTS PKSKKAKEAL LLREDPGAGG WQEPKSLLSL GSYPQAAGSS
EPKAPWPKAT EGDLAQEPGP GLTFEDSGNP KSPDKAQAEQ DGAEESESSS SSSGSSSSSS
SSGSETEGEE EGDKNGDGGC GAGGRNCSAA SSRAASPASS SSSSSSSSSS SSSSSSSSSS
SSSSSSSSSS SSSSSSSSSS SSSSSSSSSS SSSSSSSSTT DEDSSCSSDD EAAPAPTAGP
SAQPALPTKA TKQAGKARPS AHSPGKKASA PQPQAPPPQP TQPLQPKAQA GAKSRPKKRE
GVHLPTTKEL AKRQRLPSVE NRPKIAAFLP ARQLWKWFGK PTQRRGMKGK ARKLFYKAIV
RGKEMIRIGD CAVFLSAGRP NLPYIGRIQS MWESWGNNMV VRVKWFYHPE ETSPGKQFHQ
GQHWDQKSSR SLPAALRVSS QRKDFMERAL YQSSHVDEND VQTVSHKCLV VGLEQYEQML
KTKKYQDSEG LYYLAGTYEP TTGMIFSTDG VPVLC
//