ID A0A0D9R3A5_CHLSB Unreviewed; 2035 AA.
AC A0A0D9R3A5;
DT 27-MAY-2015, integrated into UniProtKB/TrEMBL.
DT 27-MAY-2015, sequence version 1.
DT 27-MAR-2024, entry version 52.
DE SubName: Full=Host cell factor C1 {ECO:0000313|Ensembl:ENSCSAP00000003094.1};
GN Name=HCFC1 {ECO:0000313|Ensembl:ENSCSAP00000003094.1};
OS Chlorocebus sabaeus (Green monkey) (Cercopithecus sabaeus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Chlorocebus.
OX NCBI_TaxID=60711 {ECO:0000313|Ensembl:ENSCSAP00000003094.1, ECO:0000313|Proteomes:UP000029965};
RN [1] {ECO:0000313|Ensembl:ENSCSAP00000003094.1, ECO:0000313|Proteomes:UP000029965}
RP NUCLEOTIDE SEQUENCE.
RA Warren W., Wilson R.K.;
RL Submitted (MAR-2014) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSCSAP00000003094.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AQIB01153645; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_007991315.1; XM_007993124.1.
DR STRING; 60711.ENSCSAP00000003094; -.
DR Ensembl; ENSCSAT00000004853.1; ENSCSAP00000003094.1; ENSCSAG00000006809.1.
DR eggNOG; KOG4152; Eukaryota.
DR GeneTree; ENSGT00940000161383; -.
DR OMA; PDYGQMK; -.
DR OrthoDB; 4642026at2759; -.
DR BioGRID-ORCS; 103232813; 0 hits in 9 CRISPR screens.
DR Proteomes; UP000029965; Chromosome X.
DR Bgee; ENSCSAG00000006809; Expressed in adrenal cortex and 7 other cell types or tissues.
DR GO; GO:0005737; C:cytoplasm; IEA:Ensembl.
DR GO; GO:0071339; C:MLL1 complex; IEA:Ensembl.
DR GO; GO:0043025; C:neuronal cell body; IEA:Ensembl.
DR GO; GO:0044545; C:NSL complex; IEA:Ensembl.
DR GO; GO:0048188; C:Set1C/COMPASS complex; IEA:Ensembl.
DR GO; GO:0003682; F:chromatin binding; IEA:Ensembl.
DR GO; GO:0140297; F:DNA-binding transcription factor binding; IEA:Ensembl.
DR GO; GO:0042802; F:identical protein binding; IEA:Ensembl.
DR GO; GO:0030674; F:protein-macromolecule adaptor activity; IEA:Ensembl.
DR GO; GO:0003713; F:transcription coactivator activity; IEA:Ensembl.
DR GO; GO:0010628; P:positive regulation of gene expression; IEA:Ensembl.
DR GO; GO:0051571; P:positive regulation of histone H3-K4 methylation; IEA:Ensembl.
DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; IEA:Ensembl.
DR GO; GO:0050821; P:protein stabilization; IEA:Ensembl.
DR GO; GO:0043254; P:regulation of protein-containing complex assembly; IEA:Ensembl.
DR CDD; cd00063; FN3; 2.
DR Gene3D; 6.10.250.2590; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 2.
DR Gene3D; 2.120.10.80; Kelch-type beta propeller; 2.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR043536; HCF1/2.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR015915; Kelch-typ_b-propeller.
DR InterPro; IPR006652; Kelch_1.
DR PANTHER; PTHR46003; HOST CELL FACTOR; 1.
DR PANTHER; PTHR46003:SF3; HOST CELL FACTOR 1; 1.
DR Pfam; PF01344; Kelch_1; 1.
DR Pfam; PF13415; Kelch_3; 1.
DR Pfam; PF13854; Kelch_5; 2.
DR SMART; SM00060; FN3; 3.
DR SUPFAM; SSF49265; Fibronectin type III; 1.
DR SUPFAM; SSF117281; Kelch motif; 1.
DR PROSITE; PS50853; FN3; 1.
PE 4: Predicted;
KW Kelch repeat {ECO:0000256|ARBA:ARBA00022441};
KW Reference proteome {ECO:0000313|Proteomes:UP000029965};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 1890..2006
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT REGION 407..434
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1293..1365
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1435..1470
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1487..1515
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1994..2035
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 414..432
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1293..1346
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1997..2015
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2035 AA; 208708 MW; 89BC9FDB5F3C1754 CRC64;
MASAVSPANS PAVLLQPRWK RVVGWSGPVP RPRHGHRAVA IKELIVVFGG GNEGIVDELH
VYNTATNQWF IPAVRGDIPP GCAAYGFVCD GTRLLVFGGM VEYGKYSNDL YELQASRWEW
KRLKAKTPKN GPPPCPRLGH SFSLVGNKCY LFGGLANDSE DPKNNIPRYL NDLYILELRP
GSGVVAWDIP ITYGVLPPPR ESHTAVVYTE KDNKKSKLVI YGGMSGCRLG DLWTLDIDTL
TWNKPSLSGV APLPRSLHSA TTIGNKMYVF GGWVPLVMDD VKVATHEKEW KCTNTLACLN
LDTMAWETIL MDTLEDNIPR ARAGHCAVAI NTRLYIWSGR DGYRKAWNNQ VCCKDLWYLE
TEKPPPPARV QLVRANTNSL EVSWGAVATA DSYLLQLQKY DIPATAATAT SPTPNPVPSV
PANPPKSPAP AAAAPAVQPL TQVGITLLPQ AAPAPPTTTT IQVLPTVPGS SISVPTAART
QGVPAVLKVT GPQATTGTPL VTMRPTSQAG KAPVTVTSLP AGVRMVVPTQ SAQGTVIGSS
PQMSGMAALA AAAAATQKIP PSSAPTVLSV PAGTTIVKTM AVTPGTTTLP ATVKVASSPV
MVSNPATRML KTAAAQVGTS VSSATNTSTR PIITVHKSGT VTVAQQAQVV TTVVGGVTKT
ITLVKSPISV PGGSALISNL GKVMSVVQTK PVQTSAVTGQ ASTGPVTQII QTKGPLPAGT
ILKLVTSADG KPTTIITTTQ ASGAGTKPTI LGISSVSPST TKPGTTTIIK TIPMSAIITQ
AGATGVTSSP GIKSPITIIT TKVMTSGTGA PAKIITAVPK IATGHGQQGV TQVVLKGAPG
QPGTILRTVP MGGVRLVTPV TVSAVKPAVT TLVVKGTTGV TTLGTVTGTV STSLAGAGGH
STSASLATPI TTLGTIATLS SQVINPTAIT VSAAQTTLTA AGGLTTPTIT MQPVSQPTQV
TLITAPSGVE AQPVHDLPVS ILASPTTEQP TATVTIADSG QGDVQPGTVT LVCSNPPCET
HETGTTNTAT TTVVANLGGH PQPTQVQFVC DRQEAAASLV TSTVGQQNGS VVRVCSNPPC
ETHETGTTNT ATTATSNMAG QHGCSNPPCE THETGTTNTA TTAMSSVGAN HQRDARRACA
AGTPAVIRIS VASGALEAAQ GSKPQCQTRQ TSTTSTTMTV MATGAPCSAG PLLGPSMARE
PGGRGPAFVQ LAPLSSKVRL SSPGSKDLPA GRHSHVANTT AMARSSMGAG EPRTAPACES
LQGGSPSTTV TVTALEALLC PSATVTQVCS NPPCETHETG TTNTATTSNA GSAQRVCSNP
PCETHETGTT HTATTATSNG GTGQPEGGQQ PPAGHPCETH QTTSTGTTMS VSMGALLPDA
TSSHRTLESG LEVAAAPSVT PQAGTALLAP FPTQRVCSNP PCETHETGTT HTATTVTSNM
SSNQDPPPAA SDQGEVESTQ GDSVNITSSS AITTTVSSTL TRAVTTVTQS TPVPGPSVPP
PEELQVSPGP RQQLPPRQLL QSASTALMGE STEVLSASQT PELPAAVDLS STGEPSSGQE
SASSAVVATV VVQPPPPAQS EVDQLSLPQE LMAEAQAGTT TLMVTGLTPE ELAVTAAAEA
AAQAAATEEA QALAIQAVLQ AAQQAVMGTG EPMDTSEAAA TVTQAELGHL SAEGQEGQAT
TIPIVLTQQE LAALVQQQQL QEAQAQQQHH HLPTEALAPA DSLNDPAIES NCLNELAGTV
PSTVALLPST ATESLAPSNT FVAPQPVVVA SPAKLQAAAT LTEVANGIES LGVKPDLPPP
PSKAPMKKEN QWFDVGVIKG TNVMVTHYFL PPDDAVPSDD DSGTVPDYNQ LKKQELQPGT
AYKFRVAGIN ACGRGPFSEI SAFKTCLPGF PGAPCAIKIS KSPDGAHLTW EPPSVTSGKI
IEYSVYLAIQ SSQAGGELKS STPAQLAFMR VYCGPSPSCL VQSSSLSNAH IDYTTKPAII
FRIAARNEKG YGPATQVRWL QETSKDSSGT KPANKRPMSS PEMKSAPKKS KADGQ
//