ID A0A2K6M2V7_RHIBE Unreviewed; 1960 AA.
AC A0A2K6M2V7;
DT 28-MAR-2018, integrated into UniProtKB/TrEMBL.
DT 28-MAR-2018, sequence version 1.
DT 24-JAN-2024, entry version 26.
DE SubName: Full=Host cell factor C1 {ECO:0000313|Ensembl:ENSRBIP00000030102.1};
GN Name=HCFC1 {ECO:0000313|Ensembl:ENSRBIP00000030102.1};
OS Rhinopithecus bieti (Black snub-nosed monkey) (Pygathrix bieti).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Colobinae; Rhinopithecus.
OX NCBI_TaxID=61621 {ECO:0000313|Ensembl:ENSRBIP00000030102.1, ECO:0000313|Proteomes:UP000233180};
RN [1] {ECO:0000313|Ensembl:ENSRBIP00000030102.1, ECO:0000313|Proteomes:UP000233180}
RP NUCLEOTIDE SEQUENCE.
RA Wu, C.-I. and Zhang, Y.;
RT "Genome of Rhinopithecus bieti.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSRBIP00000030102.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSRBIT00000054060.1; ENSRBIP00000030102.1; ENSRBIG00000038921.1.
DR GeneTree; ENSGT00940000161383; -.
DR OMA; PDYGQMK; -.
DR Proteomes; UP000233180; Unplaced.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0010468; P:regulation of gene expression; IEA:UniProt.
DR CDD; cd00063; FN3; 2.
DR Gene3D; 6.10.250.2590; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 2.
DR Gene3D; 2.120.10.80; Kelch-type beta propeller; 2.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR043536; HCF1/2.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR015915; Kelch-typ_b-propeller.
DR PANTHER; PTHR46003; HOST CELL FACTOR; 1.
DR PANTHER; PTHR46003:SF3; HOST CELL FACTOR 1; 1.
DR Pfam; PF13415; Kelch_3; 1.
DR Pfam; PF13854; Kelch_5; 2.
DR SMART; SM00060; FN3; 3.
DR SUPFAM; SSF49265; Fibronectin type III; 1.
DR SUPFAM; SSF117281; Kelch motif; 1.
DR PROSITE; PS50853; FN3; 1.
PE 4: Predicted;
KW Kelch repeat {ECO:0000256|ARBA:ARBA00022441};
KW Reference proteome {ECO:0000313|Proteomes:UP000233180};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..1960
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5014409521"
FT DOMAIN 1815..1931
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT REGION 361..388
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1037..1058
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1222..1294
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1364..1399
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1416..1444
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1919..1960
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 368..386
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1222..1275
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1922..1940
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1960 AA; 200690 MW; 03C530FB6100744B CRC64;
MRGMGLLGVL TSASFCSPAT NQWFIPAVRG DIPPGCAAYG FVCDGTRLLV FGGMVEYGKY
SNDLYELQAS RWEWKRLKAK TPKNGPPPCP RLGHSFSLVG NKCYLFGGLA NDSEDPKNNI
PRYLNDLYIL ELRPGSGVVA WDIPITYGVL PPPRESHTAV VYTEKDNKKS KLVIYGGMSG
CRLGDLWTLD IDTLTWNKPS LSGVAPLPRS LHSATTIGNK MYVFGGWVPL VMDDVKVATH
EKEWKCTNTL ACLNLDTMAW ETILMDTLED NIPRARAGHC AVAINTRLYI WSGRDGYRKA
WNNQVCCKDL WYLETEKPPP PARVQLVRAN TNSLEVSWGA VATADSYLLQ LQKYDIPATA
ATATSPTPNP VPSVPANPPK SPAPAAAAPA VQPLTQVGIT LLPQAAPAPP TTTTIQVLPT
VPGSSISVPT AARTQGVPAV LKVTGPQATT GTPLVTMRPT SQAGKAPVTV TSLPAGVRMV
VPTQSAQGTV IGSSPQMSGM AALAAAAAAT QKIPPSSAPT VLSVPAGTTI VKTMAVTPGT
TTLPATVKVA SSPVMVSNPA TRMLKTAAAQ VGTSVSSATN TSTRPIITVH KSGTVTVAQQ
AQVVTTVVGG VTKTITLVKS PISVPGGSAL ISNLGKVMSV VQTKPVQTSA VTGQASTGPV
TQIIQTKGPL PAGTILKLVT SADGKPTTII TTTQASGAGT KPTILGISSV SPSTTKPGTT
TIIKTIPMSA IITQAGATGV TSSPGIKSPI TIITTKVMTS GTGAPAKIIT AVPKIATGHG
QQGVTQVVLK GAPGQPGTIL RTVPMGGVRL VTPVTVSAVK PAVTTLVVKG TTGVTTLGTV
TGTVSTSLAG AGGHSTSASL ATPITTLGTI ATLSSQVINP TAITVSAAQT TLTAAGGLTT
PTITMQPVSQ PTQVTLITAP SGVEAQPVHD LPVSILASPT TEQPTATVTI ADSGQGDVQP
GTVTLVCSNP PCETHETGTT NTATTTVVAN LGGHPQPTQV QFVCDRQEAA ASLVTSTVGQ
QNGSVVRVCS NPPCETHETG TTNTHGCSNP PSMSSVGANH QRDARRACAA GTPAVIRISV
ATGALEAAQG SKPQCQTRQT STTSTTMTVM ATGAPCSAGP LLGPSMAREP GGRGPAFVQL
APLSSKVRLS SPGSKDLPTG RHSHVANTAA MARSSMGAGE PRTAPACESL QGGSPSTTVT
VTALEALLCP SATVTQVCSN PPCETHETGT TNTATTSNAG SAQRVCSNPP CETHETGTTH
TATTATSNGG TGQPEGGQQP PAGHPCETHQ TTSTGTTMSV SVGALLPDAT SSHRTLESGL
EVAAAPSVTP QAGTALLAPF PTQRVCSNPP CETHETGTTH TATTVTSNMS SNQDPPPAAS
DQGEVESTQG DSVNITSSSA ITTTVSSTLT RAVTTVTQST PVPGPSVPPP EELQVSPGPR
QQLPPRQLLQ SASTALMGES TEVLSASQTP ELPAAVDLSS TGEPSSGQES ASSAVVATVV
VQPPPPAQSE VDQLSLPQEL MAEAQAGTTT LMVTGLTPEE LAVTAAAEAA AQAAATEEPA
FLVSSVSLDP GAGTGEPMDT SEAAATVTQA ELGHLSAEGQ EGQATTIPIV LTQQELAALV
QQQQLQEAQA QQQHHHLPTE ALAPADSLND PAIESNCLNE LAGTVPSTVA LLPSTATESL
APSNTFVAPQ PVVVASPAKL QAAATLTEVA NGIESLGVKP DLPPPPSKAP MKKENQWFDV
GVIKGTNVMV THYFLPPDDA VPSDDDSGTV PDYNQLKKQE LQPGTAYKFR VAGINACGRG
PFSEISAFKT CLPGFPGAPC AIKISKSPDG AHLTWEPPSV TSGKIIEYSV YLAIQSSQAG
GELKSSTPAQ LAFMRVYCGP SPSCLVQSSS LSNAHIDYTT KPAIIFRIAA RNEKGYGPAT
QVRWLQETSK DSSGTKPANK RPMSSPEMKS APKKSKADGQ
//