ID G1PPP6_MYOLU Unreviewed; 1994 AA.
AC G1PPP6;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 19-OCT-2011, sequence version 1.
DT 27-MAR-2024, entry version 75.
DE SubName: Full=Host cell factor C1 {ECO:0000313|Ensembl:ENSMLUP00000012995.2};
GN Name=HCFC1 {ECO:0000313|Ensembl:ENSMLUP00000012995.2};
OS Myotis lucifugus (Little brown bat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Chiroptera; Microchiroptera; Vespertilionidae;
OC Myotis.
OX NCBI_TaxID=59463 {ECO:0000313|Ensembl:ENSMLUP00000012995.2, ECO:0000313|Proteomes:UP000001074};
RN [1] {ECO:0000313|Ensembl:ENSMLUP00000012995.2, ECO:0000313|Proteomes:UP000001074}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21993624; DOI=10.1038/nature10530;
RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., Washietl S.,
RA Kheradpour P., Ernst J., Jordan G., Mauceli E., Ward L.D., Lowe C.B.,
RA Holloway A.K., Clamp M., Gnerre S., Alfoldi J., Beal K., Chang J.,
RA Clawson H., Cuff J., Di Palma F., Fitzgerald S., Flicek P., Guttman M.,
RA Hubisz M.J., Jaffe D.B., Jungreis I., Kent W.J., Kostka D., Lara M.,
RA Martins A.L., Massingham T., Moltke I., Raney B.J., Rasmussen M.D.,
RA Robinson J., Stark A., Vilella A.J., Wen J., Xie X., Zody M.C., Baldwin J.,
RA Bloom T., Chin C.W., Heiman D., Nicol R., Nusbaum C., Young S.,
RA Wilkinson J., Worley K.C., Kovar C.L., Muzny D.M., Gibbs R.A., Cree A.,
RA Dihn H.H., Fowler G., Jhangiani S., Joshi V., Lee S., Lewis L.R.,
RA Nazareth L.V., Okwuonu G., Santibanez J., Warren W.C., Mardis E.R.,
RA Weinstock G.M., Wilson R.K., Delehaunty K., Dooling D., Fronik C.,
RA Fulton L., Fulton B., Graves T., Minx P., Sodergren E., Birney E.,
RA Margulies E.H., Herrero J., Green E.D., Haussler D., Siepel A., Goldman N.,
RA Pollard K.S., Pedersen J.S., Lander E.S., Kellis M.;
RT "A high-resolution map of human evolutionary constraint using 29 mammals.";
RL Nature 478:476-482(2011).
RN [2] {ECO:0000313|Ensembl:ENSMLUP00000012995.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAPE02015459; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 59463.ENSMLUP00000012995; -.
DR Ensembl; ENSMLUT00000014286.2; ENSMLUP00000012995.2; ENSMLUG00000014275.2.
DR eggNOG; KOG4152; Eukaryota.
DR GeneTree; ENSGT00940000161383; -.
DR HOGENOM; CLU_002603_0_0_1; -.
DR InParanoid; G1PPP6; -.
DR OMA; PDYGQMK; -.
DR TreeFam; TF314757; -.
DR Proteomes; UP000001074; Unassembled WGS sequence.
DR GO; GO:0005737; C:cytoplasm; IEA:Ensembl.
DR GO; GO:0071339; C:MLL1 complex; IEA:Ensembl.
DR GO; GO:0043025; C:neuronal cell body; IEA:Ensembl.
DR GO; GO:0044545; C:NSL complex; IEA:Ensembl.
DR GO; GO:0048188; C:Set1C/COMPASS complex; IEA:Ensembl.
DR GO; GO:0003682; F:chromatin binding; IEA:Ensembl.
DR GO; GO:0140297; F:DNA-binding transcription factor binding; IEA:Ensembl.
DR GO; GO:0042802; F:identical protein binding; IEA:Ensembl.
DR GO; GO:0030674; F:protein-macromolecule adaptor activity; IEA:Ensembl.
DR GO; GO:0003713; F:transcription coactivator activity; IEA:Ensembl.
DR GO; GO:0010628; P:positive regulation of gene expression; IEA:Ensembl.
DR GO; GO:0051571; P:positive regulation of histone H3-K4 methylation; IEA:Ensembl.
DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; IEA:Ensembl.
DR GO; GO:0050821; P:protein stabilization; IEA:Ensembl.
DR GO; GO:0043254; P:regulation of protein-containing complex assembly; IEA:Ensembl.
DR CDD; cd00063; FN3; 2.
DR Gene3D; 6.10.250.2590; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 2.
DR Gene3D; 2.120.10.80; Kelch-type beta propeller; 2.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR043536; HCF1/2.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR015915; Kelch-typ_b-propeller.
DR InterPro; IPR006652; Kelch_1.
DR PANTHER; PTHR46003; HOST CELL FACTOR; 1.
DR PANTHER; PTHR46003:SF3; HOST CELL FACTOR 1; 1.
DR Pfam; PF01344; Kelch_1; 1.
DR Pfam; PF13415; Kelch_3; 1.
DR Pfam; PF13854; Kelch_5; 2.
DR SMART; SM00060; FN3; 2.
DR SUPFAM; SSF49265; Fibronectin type III; 1.
DR SUPFAM; SSF117281; Kelch motif; 1.
DR PROSITE; PS50853; FN3; 1.
PE 4: Predicted;
KW Kelch repeat {ECO:0000256|ARBA:ARBA00022441};
KW Reference proteome {ECO:0000313|Proteomes:UP000001074};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 1849..1965
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT REGION 407..434
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1212..1239
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1280..1363
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1393..1428
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1445..1475
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1953..1994
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 414..432
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1212..1233
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1280..1302
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1312..1333
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1956..1973
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1994 AA; 204615 MW; 34BD66952DBA2C91 CRC64;
MASAVSPANS AAVLLQPRWK RVVGWSGPVP RPRHGHRAVA IKELIVVFGG GNEGIVDELH
VYNTATNQWF IPAVRGDIPP GCAAYGFVCD GTRLLVFGGM VEYGKYSNDL YELQASRWEW
KRLKAKTPKN GPPPCPRLGH SFSLVGNKCY LFGGLANDSE DPKNNIPRYL NDLYILELRP
GSGVVAWDIP ITYGVLPPPR ESHTAVVYTE KDNKKSKLVI YGGMSGCRLG DLWTLDIETL
TWNKPSLSGV APLPRSLHSA TTIGNKMYVF GGWVPLVMDD VKVATHEKEW KCTNTLACLN
LDTMAWETIL MDTLEDNIPR ARAGHCAVAI NTRLYIWSGR DGYRKAWNNQ VCCKDLWYLE
TEKPPPPARV QLVRANTNSL EVSWGAVATA DSYLLQLQKY DIPATAATAT SPTPNPVPSV
PANPPKSPAP AAAAPAVQPL TQVGITLLPQ AAAAPPTTTT IQVLPPVPGS SISVPAATRT
QGVPAVLKVT GPQATTGTPL VTMRPASQAG KAPVTVTSLP AGVRMVVPSQ SAQGTVIGSN
PQMSGMAALA AAAAATQKIP PSSAPTVLSV PAGTTIVKTV AVTPGTTTLP ATVKVASSPV
MVSNPATRML KTAAAQVGTS VSSAANTSTR PIITVHKSGT VTVAQQAQVV TTVVGGVTKT
ITLVKSPISV PGGSALISNL GKVMSVVQTK PVQTSAVTGQ ASTGPVTQII QTKGPLPAGT
ILKLVTSADG KPTTIITTTQ ASGAGSKPTI LGISSVSPST TKPGTTTIIK TIPMSAIITQ
AGATGVTSSP GIKSPITIIT TKVMTSGTGT PAKIITAVPK IATGHGQQGV TQVVLKGAPG
QPGTILRTVP MGGVRLVTPV TVSAVKPAVT TLVVKGTTGV TTLGTMTGTV STSLAGAGGH
STSASLATPI TTLGTIATLS SQVINPTAIT VSAAQTTLTG AGSLTTPTIT MQPVSQPTQV
TLITAPSGVE AQPVHDLPVS ILASPTTEQP TATVTIADSG QGDVQPGTVT LVCSNPPCET
HETGTTNTAT TTVVANMGGH PQPNQVQFVC DRPDAAASLV TSTVGQQNGN VVRLGCSNPP
CETHETGTTS TATTAMSSIG AGQQQGARHA CVATTIPVVR ISVPTGTLEG AQGFKSSCQT
RQTSATSTTM TVMATGTSRS DSPLLAPSRA LEAAGHSPAF VQLASTTGQV RLGPSSKDSP
VTDLGQLVSM GRQPEAQHTH TTNTPTTARS SMGAGESGEL RGIPTPAYES SSSAAVTVTG
LEALLCSSAT VTQVCSNPPC ETHETGTTHT PTTATSGGGA GQPEGGQQPP ASRPCETHQT
TSTGTTMSVS VGALLPDSTP PLRTLESGLE GAAPPTSASQ ASTSLLTPFP TQRVCSNPPC
ETHETGTTHT ATTVTSNMSS NQDPPPAASD QGEVESTQGD SVNITSSSAI TTTVSSTLTR
AVTTVTQSTP VPGPSVPPPE ELQASPGPRQ QLPPQQLLQP ASTPLMGEST EVLSASQTSE
LQAAVDLSST GDPASGQESA NSAMVATVVV QPPAPTQSEV DQLSLPQELM AEAQAGTTTL
MVTGLTPEEL AVTAAAEAAA QAAATEEAQA LAIQAVLQAA QQAVMGTGEA MDTSEGAGAV
TQAELSHLSA EGQEGQATTI PIVLTQQELA ALVQQQQQLQ EAQAQQQHHH LPTEALAPAD
SLNDPTIEGN CLSELAGAVP STVALLPSTA TESLAPSNTF VAPQPVVGSS PAKLQAAATL
TEVANGIESL GVKPELPPPP SKAPVKKENQ WFDVGVIKGT NVMVTHYFLP PDDAAPSDDD
SGTLPDYNQL KKHELQPGTA YKFRVAGINA CGRGPFSEIS AFKTCLPGFP GAPCAIKISK
SPDGAHLTWE PPSVTSGKIV EYSVYLAIQS SQASGETKSS TPAQLAFMRV YCGPSPSCLV
QSSSLSNAHI DYTTKPAIIF RIAARNEKGY GPATQVRWLQ ETSKDSPGTK PASKRPMSSP
EMKTAPKKSK ADGQ
//