ID A0A096NAF2_PAPAN Unreviewed; 2080 AA.
AC A0A096NAF2;
DT 28-FEB-2018, integrated into UniProtKB/TrEMBL.
DT 25-MAY-2022, sequence version 2.
DT 24-JAN-2024, entry version 34.
DE SubName: Full=Host cell factor C1 {ECO:0000313|Ensembl:ENSPANP00000009660.3};
OS Papio anubis (Olive baboon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Papio.
OX NCBI_TaxID=9555 {ECO:0000313|Ensembl:ENSPANP00000009660.3, ECO:0000313|Proteomes:UP000028761};
RN [1] {ECO:0000313|Ensembl:ENSPANP00000009660.3, ECO:0000313|Proteomes:UP000028761}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Liu Y.L., Abraham K.A., Akbar H.A., Ali S.A., Anosike U.A., Aqrawi P.A.,
RA Arias F.A., Attaway T.A., Awwad R.A., Babu C.B., Bandaranaike D.B.,
RA Battles P.B., Bell A.B., Beltran B.B., Berhane-Mersha D.B., Bess C.B.,
RA Bickham C.B., Bolden T.B., Carter K.C., Chau D.C., Chavez A.C.,
RA Clerc-Blankenburg K.C., Coyle M.C., Dao M.D., Davila M.L.D.,
RA Davy-Carroll L.D., Denson S.D., Dinh H.D., Fernandez S.F., Fernando P.F.,
RA Forbes L.F., Francis C.F., Francisco L.F., Fu Q.F., Garcia-Iii R.G.,
RA Garrett T.G., Gross S.G., Gubbala S.G., Hirani K.H., Hogues M.H.,
RA Hollins B.H., Jackson L.J., Javaid M.J., Jhangiani S.J., Johnson A.J.,
RA Johnson B.J., Jones J.J., Joshi V.J., Kalu J.K., Khan N.K., Korchina V.K.,
RA Kovar C.K., Lago L.L., Lara F.L., Le T.-K.L., Lee S.L., Legall-Iii F.L.,
RA Lemon S.L., Liu J.L., Liu Y.-S.L., Liyanage D.L., Lopez J.L.,
RA Lorensuhewa L.L., Mata R.M., Mathew T.M., Mercado C.M., Mercado I.M.,
RA Morales K.M., Morgan M.M., Munidasa M.M., Ngo D.N., Nguyen L.N.,
RA Nguyen T.N., Nguyen N.N., Obregon M.O., Okwuonu G.O., Ongeri F.O.,
RA Onwere C.O., Osifeso I.O., Parra A.P., Patil S.P., Perez A.P., Perez Y.P.,
RA Pham C.P., Pu L.-L.P., Puazo M.P., Quiroz J.Q., Rouhana J.R., Ruiz M.R.,
RA Ruiz S.-J.R., Saada N.S., Santibanez J.S., Scheel M.S., Schneider B.S.,
RA Simmons D.S., Sisson I.S., Tang L.-Y.T., Thornton R.T., Tisius J.T.,
RA Toledanes G.T., Trejos Z.T., Usmani K.U., Varghese R.V., Vattathil S.V.,
RA Vee V.V., Walker D.W., Weissenberger G.W., White C.W., Williams A.W.,
RA Woodworth J.W., Wright R.W., Zhu Y.Z., Han Y.H., Newsham I.N.,
RA Nazareth L.N., Worley K.W., Muzny D.M., Rogers J.R., Gibbs R.G.;
RT "Whole Genome Assembly of Papio anubis.";
RL Submitted (MAR-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSPANP00000009660.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_009196757.1; XM_009198493.2.
DR STRING; 9555.ENSPANP00000009660; -.
DR Ensembl; ENSPANT00000026085.3; ENSPANP00000009660.3; ENSPANG00000024276.4.
DR GeneID; 100137257; -.
DR CTD; 3054; -.
DR GeneTree; ENSGT00940000161383; -.
DR OMA; PDYGQMK; -.
DR Proteomes; UP000028761; Chromosome X.
DR Bgee; ENSPANG00000024276; Expressed in postnatal subventricular zone and 57 other cell types or tissues.
DR ExpressionAtlas; A0A096NAF2; baseline.
DR GO; GO:0005737; C:cytoplasm; IEA:Ensembl.
DR GO; GO:0071339; C:MLL1 complex; IEA:Ensembl.
DR GO; GO:0043025; C:neuronal cell body; IEA:Ensembl.
DR GO; GO:0044545; C:NSL complex; IEA:Ensembl.
DR GO; GO:0048188; C:Set1C/COMPASS complex; IEA:Ensembl.
DR GO; GO:0003682; F:chromatin binding; IEA:Ensembl.
DR GO; GO:0140297; F:DNA-binding transcription factor binding; IEA:Ensembl.
DR GO; GO:0042802; F:identical protein binding; IEA:Ensembl.
DR GO; GO:0030674; F:protein-macromolecule adaptor activity; IEA:Ensembl.
DR GO; GO:0003713; F:transcription coactivator activity; IEA:Ensembl.
DR GO; GO:0010628; P:positive regulation of gene expression; IEA:Ensembl.
DR GO; GO:0051571; P:positive regulation of histone H3-K4 methylation; IEA:Ensembl.
DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; IEA:Ensembl.
DR GO; GO:0050821; P:protein stabilization; IEA:Ensembl.
DR GO; GO:0043254; P:regulation of protein-containing complex assembly; IEA:Ensembl.
DR CDD; cd00063; FN3; 2.
DR Gene3D; 6.10.250.2590; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 2.
DR Gene3D; 2.120.10.80; Kelch-type beta propeller; 2.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR043536; HCF1/2.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR015915; Kelch-typ_b-propeller.
DR InterPro; IPR006652; Kelch_1.
DR PANTHER; PTHR46003; HOST CELL FACTOR; 1.
DR PANTHER; PTHR46003:SF3; HOST CELL FACTOR 1; 1.
DR Pfam; PF01344; Kelch_1; 1.
DR Pfam; PF13415; Kelch_3; 1.
DR Pfam; PF13854; Kelch_5; 2.
DR SMART; SM00060; FN3; 3.
DR SUPFAM; SSF49265; Fibronectin type III; 1.
DR SUPFAM; SSF117281; Kelch motif; 1.
DR PROSITE; PS50853; FN3; 1.
PE 4: Predicted;
KW Kelch repeat {ECO:0000256|ARBA:ARBA00022441};
KW Reference proteome {ECO:0000313|Proteomes:UP000028761};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 1935..2051
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT REGION 407..434
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1293..1365
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1435..1470
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2039..2080
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 414..432
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1293..1346
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2042..2060
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2080 AA; 213453 MW; FEB4C1FF44074DC6 CRC64;
MASAVSPANS PAVLLQPRWK RVVGWSGPVP RPRHGHRAVA IKELIVVFGG GNEGIVDELH
VYNTATNQWF IPAVRGDIPP GCAAYGFVCD GTRLLVFGGM VEYGKYSNDL YELQASRWEW
KRLKAKTPKN GPPPCPRLGH SFSLVGNKCY LFGGLANDSE DPKNNIPRYL NDLYILELRP
GSGVVAWDIP ITYGVLPPPR ESHTAVVYTE KDNKKSKLVI YGGMSGCRLG DLWTLDIDTL
TWNKPSLSGV APLPRSLHSA TTIGNKMYVF GGWVPLVMDD VKVATHEKEW KCTNTLACLN
LDTMAWETIL MDTLEDNIPR ARAGHCAVAI NTRLYIWSGR DGYRKAWNNQ VCCKDLWYLE
TEKPPPPARV QLVRANTNSL EVSWGAVATA DSYLLQLQKY DIPATAATAT SPTPNPVPSV
PANPPKSPAP AAAAPAVQPL TQVGITLLPQ AAPAPPTTTT IQVLPTVPGS SISVPTAART
QGVPAVLKVT GPQATTGTPL VTMRPTSQAG KAPVTVTSLP AGVRMVVPTQ SAQGTVIGSS
PQMSGMAALA AAAAATQKIP PSSAPTVLSV PAGTTIVKTM AVTPGTTTLP ATVKVASSPV
MVSNPATRML KTAAAQVGTS VSSATNTSTR PIITVHKSGT VTVAQQAQVV TTVVGGVTKT
ITLVKSPISV PGGSALISNL GKVMSVVQTK PVQTSAVTGQ ASTGPVTQII QTKGPLPAGT
ILKLVTSADG KPTTIITTTQ ASGAGTKPTI LGISSVSPST TKPGTTTIIK TIPMSAIITQ
AGATGVTSSP GIKSPITIIT TKVMTSGTGA PAKIITAVPK IATGHGQQGV TQVVLKGAPG
QPGTILRTVP MGGVRLVTPV TVSAVKPAVT TLVVKGTTGV TTLGTVTGTV STSLAGAGGH
STSASLATPI TTLGTIATLS SQVINPTAIT VSAAQTTLTA AGGLTTPTIT MQPVSQPTQV
TLITAPSGVE AQPVHDLPVS ILASPTTEQP TATVTIADSG QGDVQPGTVT LVCSNPPCET
HETGTTNTAT TTVVANLGGH PQPTQVQFVC DRQEAAASLV TSTVGQQNGS VVRVCSNPPC
ETHETGTTTT ATTATSNMAG QHGCSNPPCE THETGTTNTA TTAMSSVGAN HQRDARRACA
AGTPAVIRIS VATGALEAAQ GSKPQCQTRQ TSTTSTTMTV MATGAPCSAG PLLGPSMARE
PGGRGPAFVQ LAPLSSKVRL SSPGSKDLPA GRHSHVANTT AMARSSMGAG EPRTAPACES
LQGGSPSTTV TVTALEALLC PSATVTQVCS NPPCETHETG TTNTATTSNA GSAQRVCSNP
PCETHETGTT HTATTATSNG GTGQPEGGQQ PPAGHPCETH QTTSTGTTMS VSMGALLPDA
TSSHRTLESG LEVAAAPSVT PQAGTALLAP FPTQRVCSNP PCETHETGTT HTATTVTSNM
SSNQDPPPAA SDQGEVESTQ GDSVNITSSS AITTTVSSTL TRAVTTVTQS TPVPGPSVPK
ISSMTETAPR ALTTEVPIPA KITVTIANTE TSDMPFSAVD ILQPPEELQV SPGPRQQLPP
RQLLQSASTA LMGESTEVLS ASQTPELPAA VDLSSTGEPS SGQESASSAV VATVVVQPPP
PAQSEVDQLS LPQELMAEAQ AGTTTLMVTG LTPEELAVTA AAEAAAQAAA TEEAQALAIQ
AVLQAAQQAV MAGTGEPMDT SEAAATVTQA ELGHLSAEGQ EGQATTIPIV LTQQELAALV
QQQQLQEAQA QQQHHHLPTE ALAPADSLND PAIESNCLNE LAGTVPSTVA LLPSTATESL
APSNTFVAPQ PVVVASPAKL QAAATLTEVA NGIESLGVKP DLPPPPSKAP MKKENQWFDV
GVIKGTNVMV THYFLPPDDA VPSDDDSGTV PDYNQLKKQE LQPGTAYKFR VAGINACGRG
PFSEISAFKT CLPGFPGAPC AIKISKSPDG AHLTWEPPSV TSGKIIEYSV YLAIQSSQAG
GELKSSTPAQ LAFMRVYCGP SPSCLVQSSS LSNAHIDYTT KPAIIFRIAA RNEKGYGPAT
QVRWLQETSK DSSGTKPANK RPMSSPEMKS APKKSKADGQ
//