ID A0A2I3RV78_PANTR Unreviewed; 2006 AA.
AC A0A2I3RV78;
DT 28-FEB-2018, integrated into UniProtKB/TrEMBL.
DT 28-FEB-2018, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE SubName: Full=Host cell factor C1 {ECO:0000313|Ensembl:ENSPTRP00000093755.1};
GN Name=HCFC1 {ECO:0000313|Ensembl:ENSPTRP00000068593.1,
GN ECO:0000313|VGNC:VGNC:12398};
OS Pan troglodytes (Chimpanzee).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Pan.
OX NCBI_TaxID=9598 {ECO:0000313|Ensembl:ENSPTRP00000068593.1, ECO:0000313|Proteomes:UP000002277};
RN [1] {ECO:0000313|Ensembl:ENSPTRP00000093755.1, ECO:0000313|Proteomes:UP000002277}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=16136131; DOI=10.1038/nature04072;
RG Chimpanzee sequencing and analysis consortium;
RT "Initial sequence of the chimpanzee genome and comparison with the human
RT genome.";
RL Nature 437:69-87(2005).
RN [2] {ECO:0000313|Ensembl:ENSPTRP00000068593.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC159035; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR Ensembl; ENSPTRT00000077613.1; ENSPTRP00000093755.1; ENSPTRG00000022421.6.
DR Ensembl; ENSPTRT00000094437.1; ENSPTRP00000068593.1; ENSPTRG00000043866.1.
DR VGNC; VGNC:12398; HCFC1.
DR GeneTree; ENSGT00940000161383; -.
DR Proteomes; UP000002277; Unplaced.
DR Bgee; ENSPTRG00000022421; Expressed in hindlimb stylopod muscle and 19 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR CDD; cd00063; FN3; 2.
DR Gene3D; 6.10.250.2590; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 2.
DR Gene3D; 2.120.10.80; Kelch-type beta propeller; 2.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR043536; HCF1/2.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR015915; Kelch-typ_b-propeller.
DR InterPro; IPR006652; Kelch_1.
DR PANTHER; PTHR46003; HOST CELL FACTOR; 1.
DR PANTHER; PTHR46003:SF3; HOST CELL FACTOR 1; 1.
DR Pfam; PF01344; Kelch_1; 1.
DR Pfam; PF13415; Kelch_3; 1.
DR Pfam; PF13854; Kelch_5; 2.
DR SMART; SM00060; FN3; 2.
DR SUPFAM; SSF49265; Fibronectin type III; 1.
DR SUPFAM; SSF117281; Kelch motif; 2.
DR PROSITE; PS50853; FN3; 1.
PE 4: Predicted;
KW Kelch repeat {ECO:0000256|ARBA:ARBA00022441};
KW Reference proteome {ECO:0000313|Proteomes:UP000002277};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 1861..1977
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT REGION 407..434
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1270..1337
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1407..1442
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1459..1487
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1518..1537
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1965..2006
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 414..432
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1270..1318
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1968..1986
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2006 AA; 205931 MW; 99E99CC479176825 CRC64;
MASAVSPANL PAVLLQPRWK RVVGWSGPVP RPRHGHRAVA IKELIVVFGG GNEGIVDELH
VYNTATNQWF IPAVRGDIPP GCAAYGFVCD GTRLLVFGGM VEYGKYSNDL YELQASRWEW
KRLKAKTPKN GPPPCPRLGH SFSLVGNKCY LFGGLANDSE DPKNNIPRYL NDLYILELRP
GSGVVAWDIP ITYGVLPPPR ESHTAVVYTE KDNKKSKLVI YGGMSGCRLG DLWTLDIDTL
TWNKPSLSGV APLPRSLHSA TTIGNKMYVF GGWVPLVMDD VKVATHEKEW KCTNTLACLN
LDTMAWETIL MDTLEDNIPR ARAGHCAVAI NTRLYIWSGR DGYRKAWNNQ VCCKDLWYLE
TEKPPPPARV QLVRANTNSL EVSWGAVATA DSYLLQLQKY DIPATAATAT SPTPNPVPSV
PANPPKSPAP AAAAPAVQPL TQVGITLLPQ AAPAPPTTTT IQVLPTVPGS SISVPTAART
QGVPAVLKVT GPQATTGTPL VTMRPASQAG KAPVTVTSLP AGVRMVVPTQ SAQGTVIGSS
PQMSGMAALA AAAAATQKIP PSSAPTVLSV PAGTTIVKTM AVTPGTTTLP ATVKVASSPV
MVSNPATRML KTAAAQVGTS VSSATNTSTR PIITVHKSGT VTVAQQAQVV TTVVGGVTKT
ITLVKSPISV PGGSALISNL GKVMSVVQTK PVQTSAVTGQ ASTGPVTQII QTKGPLPAGT
ILKLVTSADG KPTTIITTTQ ASGAGTKPTI LGISSVSPST TKPGTTTIIK TIPMSAIITQ
AGATGVTSSP GIKSPITIIT TKVMTSGTGA PAKIITAVPK IATGHGQQGV TQVVLKGAPG
QPGTILRTVP MGGVRLVTPV TVSAVKPAVT TLVVKGTTGV TTLGTVTGTV STSLAGAGGH
STSASLATPI TTLGTIATLS SQVINPTAIT VSAAQTTLTA AGGLTTPTIT MQPVSQPTQV
TLITAPSGVE AQPVHDLPVS ILASPTTEQP TATVTIADSG QGDVQPGTVT LVCSNPPCET
HETGTTNTAT TTVVANLGGH PQPTQVQFVC DRQEAAASLV TSTVGQQNGS VVRHGCSNPP
CETHETGTTN TATTAMSSVG ANHQRDARRA CAAGTPAVIR ISVATGALEA AQGSKPQCQT
RQTSATSTTM TVMATGAPCS AGPLLGPSMA REPGGRSPAF VQLAPLSSKV RLSSPSSKDL
PAGRHSHAVN TAAMTRSSVG AGEPRMAPVC ESLQGGSPST TVTVTALEAL LCPSATVTQV
CSNPPCETHE TGTTNTATTS NAGSAQRVCS NPPCETHETG TTHTATTATS NGGTGQPEGG
QQPPAGRPCE THQTTSTGTT MSVSVGALLP DATSSHRTVE SGLELAAAPS VTPQAGTALL
APFPTQRVCS NPPCETHETG TTHTATTVTS NMSSNQDPPP AASDQGEVES TQGDSVNITS
SSAITTTVSS TLTRAVTTVT QSTPVPGPSV PPPEELQVSP GPRQQLPPRQ LLQSASTALM
GESAEVLSAS QTPELPAAVD LSSTGEPSSG QESASSAVVA TVVVQPPPPT QSEVDQLSLP
QELMAEAQAG TTTLMVTGLT PEELAVTAAA EAALGASARA ARFWPAFLVS SVSLDPGAGT
GEPMDTSEAA ATVTQAELGH LSAEGQEGQA TTIPIVLTQQ ELAALVQQQQ LQEAQAQQQH
HHLPTEALAP ADSLNDPAIE SNCLNELAGT VPSTVALLPS TATESLAPSN TFVAPQPVVV
ASPAKLQAAA TLTEVANGIE SLGVKPDLPP PPSKAPMKKE NQWFDVGVIK GTNVMVTHYF
LPPDDAVPSD DDLGTVPDYN QLKKQELQPG TAYKFRVAGI NACGRGPFSE ISAFKTCLPG
FPGAPCAIKI SKSPDGAHLT WEPPSVTSGK IIEYSVYLAI QSSQAGGELK SSTPAQLAFM
RVYCGPSPSC LVQSSSLSNA HIDYTTKPAI IFRIAARNEK GYGPATQVRW LQETSKDSSG
TKPANKRPMS SPEMKSAPKK SKADGQ
//