ID V8NZA6_OPHHA Unreviewed; 2259 AA.
AC V8NZA6;
DT 19-FEB-2014, integrated into UniProtKB/TrEMBL.
DT 19-FEB-2014, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE SubName: Full=Host cell factor 1 {ECO:0000313|EMBL:ETE67395.1};
GN Name=HCFC1 {ECO:0000313|EMBL:ETE67395.1};
GN ORFNames=L345_06812 {ECO:0000313|EMBL:ETE67395.1};
OS Ophiophagus hannah (King cobra) (Naja hannah).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Lepidosauria; Squamata; Bifurcata; Unidentata; Episquamata; Toxicofera;
OC Serpentes; Colubroidea; Elapidae; Elapinae; Ophiophagus.
OX NCBI_TaxID=8665 {ECO:0000313|EMBL:ETE67395.1, ECO:0000313|Proteomes:UP000018936};
RN [1] {ECO:0000313|EMBL:ETE67395.1, ECO:0000313|Proteomes:UP000018936}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Blood {ECO:0000313|EMBL:ETE67395.1};
RX PubMed=24297900; DOI=10.1073/pnas.1314702110;
RA Vonk F.J., Casewell N.R., Henkel C.V., Heimberg A.M., Jansen H.J.,
RA McCleary R.J., Kerkkamp H.M., Vos R.A., Guerreiro I., Calvete J.J.,
RA Wuster W., Woods A.E., Logan J.M., Harrison R.A., Castoe T.A.,
RA de Koning A.P., Pollock D.D., Yandell M., Calderon D., Renjifo C.,
RA Currier R.B., Salgado D., Pla D., Sanz L., Hyder A.S., Ribeiro J.M.,
RA Arntzen J.W., van den Thillart G.E., Boetzer M., Pirovano W., Dirks R.P.,
RA Spaink H.P., Duboule D., McGlinn E., Kini R.M., Richardson M.K.;
RT "The king cobra genome reveals dynamic gene evolution and adaptation in the
RT snake venom system.";
RL Proc. Natl. Acad. Sci. U.S.A. 110:20651-20656(2013).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ETE67395.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AZIM01001307; ETE67395.1; -; Genomic_DNA.
DR Proteomes; UP000018936; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR CDD; cd00063; FN3; 2.
DR Gene3D; 6.10.250.2590; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 2.
DR Gene3D; 2.120.10.80; Kelch-type beta propeller; 2.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR043536; HCF1/2.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR015915; Kelch-typ_b-propeller.
DR InterPro; IPR006652; Kelch_1.
DR PANTHER; PTHR46003; HOST CELL FACTOR; 1.
DR PANTHER; PTHR46003:SF3; HOST CELL FACTOR 1; 1.
DR Pfam; PF01344; Kelch_1; 1.
DR Pfam; PF13415; Kelch_3; 1.
DR Pfam; PF13854; Kelch_5; 2.
DR SMART; SM00060; FN3; 2.
DR SUPFAM; SSF49265; Fibronectin type III; 1.
DR SUPFAM; SSF117281; Kelch motif; 1.
DR PROSITE; PS50853; FN3; 1.
PE 4: Predicted;
KW Kelch repeat {ECO:0000256|ARBA:ARBA00022441};
KW Reference proteome {ECO:0000313|Proteomes:UP000018936};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 1985..2084
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT REGION 1..20
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 413..435
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 481..504
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1123..1149
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1163..1225
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1285..1310
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1435..1765
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1820..1851
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1888..1932
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2195..2216
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2229..2259
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 420..435
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 489..504
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1168..1225
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1435..1512
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1526..1626
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1641..1683
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1687..1726
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2259 AA; 232769 MW; CA08BDAF8F7DAA81 CRC64;
MASPVVPSGG SPTGGGATAG LLQPRWKRVI GWSGPVPRPR HGHRAVAIKE LIVVFGGGNE
GIVDELHVYN TATNQWFIPA VRGDIPPGCA AYGFVCDGTR LLVFGGMVEY GKYSNDLYEL
QASRWEWKRL KAKMPKNGST PCPRLGHSFS LVGNKCYLFG GLANDSEDPK NNIPRYLNDL
YILELRPGSG VGSWDIPITY GVLPPPRESH TAVVYTEKDN KKSKLVIYGG MSGCRLGDLW
TLDIDSLTWN KPNLRGVAPL PRSLHSATTI GNKMYVFGGW VPLVMDDVKV ATHEKEWKCT
NTLACLNLDS MAWEPILMDT LEENTPRARA GHCAVAINTR LYIWSGRDGY RKAWNNQVCC
KDLWYLETEK PPPPSRVQLV RANTNSLEVS WGSVPTADTY LLQLQKYDIP AASSPAANPV
PSVPVNPPKS PAPAAAAPVP AVQPLAQMGI TLLPQTTAVA AATAATAAAA AAAPAPPTTT
TIQVLPTVPT SPMPGPPPAA AAATPRPQTV PAVLKVTGPQ ATTGAPLVTV RSAGLAGKGP
VTVTSLPAGV RMVVPTQSTQ GTVIGSSSQM SGMAALAAAA AATQKIPPSS APTMLTMPAA
ATIVKTVAVS PGTATLPTTV KVSNPATRML KTAAAQVGTS VSSSTTNTPT RPIITVHKSG
TVTVAQQAQV MTTVVGGVTK TITLVKSPIS VPGGSALISN LGKVMSVVQT KPVQTSAVTG
QASTGPVTQI IQTKGPLPAG TILKLVTSAD GKPTTIITSS QAGGTGTKPT ILGISSVSPN
TTKPSTTTII KTIPMSAIIT QAGATGVTSS SGIKSPITII TTKVMTAGTG TPAKIITAVP
KLTTSHGQQS VTQVVLKGAP GQPGTILRTV PMGGVRLVTP VTVSAVKPTV TTLVVKGTTG
VTTLGTVTGT VSTSLAGTGA HNTNASLATP ITTLGTIATL SSQVINPAAI TVSAAQTTLT
AAGGLTTPTI TMQPVSQPTQ VTLITTPSGV EAQPVHDLPV SILASPTTEQ PTATVTIAEP
GQGESQPNTV TLVCSNPPCE THDTGTTNTA TTTVVGTLGG PAQLQFVCDG QEGGGLQPNG
RVVRICSNPP CEMHETGTTN TATTVLSAGQ KICSNPPCET HETGTTNTAT ISKTERAPSE
PPCHTFQTSA TGTTMTVKPI VGTGQLVCSN PPRETHETGT TNTATTTTSN MGQGQPDDGQ
KELTNTSCQT KQTGSTSTTM TTKTNIPTEH PCAVQIVSFM PAAGLPSAAS GTTVEKSNER
LQALKKVQCE SHHTHTTNTA TTARSLMGQG QPDGEWVGSA NPPCQTQQTN STSTTMSVKI
DVPVQSSAPC KTQSTDTKDT NLQVCSNPPC ETHETGTTNT ATVTTSNLGA TQQVCSNPPC
EIHETGTTNT ATTATSNIAV GQQVCSNPPC ETHETGTTNT ATTATSNIAV GQQVCSNPPC
ETHETGTTNT ATTTTSNLGA EQNEGAQHQH PPVTPPCETQ QTNSTSTTMT PSIGGDGAQS
SSSASAQCGN SELRKTPERE PSGIAGSLPH SSTPRICSNP PCETHETGTT HTATTVTSSM
GANQDQPPAT NGQQGEPEIP VTESPAASSP NTAVVTTVSS TQIRAVTTVT QSTPAPGPSV
PNISSLTDVP AGEAVPPVET PDATSTDSLP APSEPLQPSI EMQTAVATPE PQGQAQVAST
GTPPTEGPLG EPAPHPEPAP PTXEGPLGEP APHPELAPPT EEAPPTPATP TQESVAMVVL
PQATPAAPPS SEVEQLALPQ ELMAESQAGT TTLMVTGLTP EELAVTAAAE AAAQAAATEE
AQALAIQAVL QAAQQAVMGP GEAMDTSEAA ATQAELGHLS SEGPEGQPTA IPIVLTQQDL
AALVQQQQQQ LQEAQAAAAA AAVQQQQQQQ QQQQQQPPPP QHQLPAHLPT EALAPADSLN
DPASESNGLN ELASAVTSTV GLLPPTPTES LAPSNTFVAP QPVVVASPAK LQAAAALTEV
ANGIEPAPVK PEPPTQPPKV VVKKENQWFD VGIVKATNMV VTHYFLPPDD APATDDDSGT
TPDYSQFKKQ ELQPGTAYKF RVAGINACGR GPFSEISAFK TCLPGFPGAP CAIKISKSPD
GAHLTWEPPS VTSGKITEYS VYLAIQSPQA GEPKSAAPAQ LAFMRVYCGP SPSCLVQSAS
LSNAHIDYTT KPAIIFRIAA RNEKGYGPAT QVRWLQESSK DGSMTKPTNK RPLSSSDFLC
LNSPSQPVTP CMEEDGTAES TFHTPWDELL GWGGREKRR
//