GenomeNet

Database: UniProt
Entry: V8NZA6_OPHHA
LinkDB: V8NZA6_OPHHA
Original site: V8NZA6_OPHHA 
ID   V8NZA6_OPHHA            Unreviewed;      2259 AA.
AC   V8NZA6;
DT   19-FEB-2014, integrated into UniProtKB/TrEMBL.
DT   19-FEB-2014, sequence version 1.
DT   27-MAR-2024, entry version 32.
DE   SubName: Full=Host cell factor 1 {ECO:0000313|EMBL:ETE67395.1};
GN   Name=HCFC1 {ECO:0000313|EMBL:ETE67395.1};
GN   ORFNames=L345_06812 {ECO:0000313|EMBL:ETE67395.1};
OS   Ophiophagus hannah (King cobra) (Naja hannah).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Lepidosauria; Squamata; Bifurcata; Unidentata; Episquamata; Toxicofera;
OC   Serpentes; Colubroidea; Elapidae; Elapinae; Ophiophagus.
OX   NCBI_TaxID=8665 {ECO:0000313|EMBL:ETE67395.1, ECO:0000313|Proteomes:UP000018936};
RN   [1] {ECO:0000313|EMBL:ETE67395.1, ECO:0000313|Proteomes:UP000018936}
RP   NUCLEOTIDE SEQUENCE.
RC   TISSUE=Blood {ECO:0000313|EMBL:ETE67395.1};
RX   PubMed=24297900; DOI=10.1073/pnas.1314702110;
RA   Vonk F.J., Casewell N.R., Henkel C.V., Heimberg A.M., Jansen H.J.,
RA   McCleary R.J., Kerkkamp H.M., Vos R.A., Guerreiro I., Calvete J.J.,
RA   Wuster W., Woods A.E., Logan J.M., Harrison R.A., Castoe T.A.,
RA   de Koning A.P., Pollock D.D., Yandell M., Calderon D., Renjifo C.,
RA   Currier R.B., Salgado D., Pla D., Sanz L., Hyder A.S., Ribeiro J.M.,
RA   Arntzen J.W., van den Thillart G.E., Boetzer M., Pirovano W., Dirks R.P.,
RA   Spaink H.P., Duboule D., McGlinn E., Kini R.M., Richardson M.K.;
RT   "The king cobra genome reveals dynamic gene evolution and adaptation in the
RT   snake venom system.";
RL   Proc. Natl. Acad. Sci. U.S.A. 110:20651-20656(2013).
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:ETE67395.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AZIM01001307; ETE67395.1; -; Genomic_DNA.
DR   Proteomes; UP000018936; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   CDD; cd00063; FN3; 2.
DR   Gene3D; 6.10.250.2590; -; 1.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 2.
DR   Gene3D; 2.120.10.80; Kelch-type beta propeller; 2.
DR   InterPro; IPR003961; FN3_dom.
DR   InterPro; IPR036116; FN3_sf.
DR   InterPro; IPR043536; HCF1/2.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR015915; Kelch-typ_b-propeller.
DR   InterPro; IPR006652; Kelch_1.
DR   PANTHER; PTHR46003; HOST CELL FACTOR; 1.
DR   PANTHER; PTHR46003:SF3; HOST CELL FACTOR 1; 1.
DR   Pfam; PF01344; Kelch_1; 1.
DR   Pfam; PF13415; Kelch_3; 1.
DR   Pfam; PF13854; Kelch_5; 2.
DR   SMART; SM00060; FN3; 2.
DR   SUPFAM; SSF49265; Fibronectin type III; 1.
DR   SUPFAM; SSF117281; Kelch motif; 1.
DR   PROSITE; PS50853; FN3; 1.
PE   4: Predicted;
KW   Kelch repeat {ECO:0000256|ARBA:ARBA00022441};
KW   Reference proteome {ECO:0000313|Proteomes:UP000018936};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT   DOMAIN          1985..2084
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   REGION          1..20
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          413..435
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          481..504
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1123..1149
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1163..1225
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1285..1310
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1435..1765
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1820..1851
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1888..1932
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2195..2216
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2229..2259
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        420..435
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        489..504
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1168..1225
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1435..1512
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1526..1626
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1641..1683
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1687..1726
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2259 AA;  232769 MW;  CA08BDAF8F7DAA81 CRC64;
     MASPVVPSGG SPTGGGATAG LLQPRWKRVI GWSGPVPRPR HGHRAVAIKE LIVVFGGGNE
     GIVDELHVYN TATNQWFIPA VRGDIPPGCA AYGFVCDGTR LLVFGGMVEY GKYSNDLYEL
     QASRWEWKRL KAKMPKNGST PCPRLGHSFS LVGNKCYLFG GLANDSEDPK NNIPRYLNDL
     YILELRPGSG VGSWDIPITY GVLPPPRESH TAVVYTEKDN KKSKLVIYGG MSGCRLGDLW
     TLDIDSLTWN KPNLRGVAPL PRSLHSATTI GNKMYVFGGW VPLVMDDVKV ATHEKEWKCT
     NTLACLNLDS MAWEPILMDT LEENTPRARA GHCAVAINTR LYIWSGRDGY RKAWNNQVCC
     KDLWYLETEK PPPPSRVQLV RANTNSLEVS WGSVPTADTY LLQLQKYDIP AASSPAANPV
     PSVPVNPPKS PAPAAAAPVP AVQPLAQMGI TLLPQTTAVA AATAATAAAA AAAPAPPTTT
     TIQVLPTVPT SPMPGPPPAA AAATPRPQTV PAVLKVTGPQ ATTGAPLVTV RSAGLAGKGP
     VTVTSLPAGV RMVVPTQSTQ GTVIGSSSQM SGMAALAAAA AATQKIPPSS APTMLTMPAA
     ATIVKTVAVS PGTATLPTTV KVSNPATRML KTAAAQVGTS VSSSTTNTPT RPIITVHKSG
     TVTVAQQAQV MTTVVGGVTK TITLVKSPIS VPGGSALISN LGKVMSVVQT KPVQTSAVTG
     QASTGPVTQI IQTKGPLPAG TILKLVTSAD GKPTTIITSS QAGGTGTKPT ILGISSVSPN
     TTKPSTTTII KTIPMSAIIT QAGATGVTSS SGIKSPITII TTKVMTAGTG TPAKIITAVP
     KLTTSHGQQS VTQVVLKGAP GQPGTILRTV PMGGVRLVTP VTVSAVKPTV TTLVVKGTTG
     VTTLGTVTGT VSTSLAGTGA HNTNASLATP ITTLGTIATL SSQVINPAAI TVSAAQTTLT
     AAGGLTTPTI TMQPVSQPTQ VTLITTPSGV EAQPVHDLPV SILASPTTEQ PTATVTIAEP
     GQGESQPNTV TLVCSNPPCE THDTGTTNTA TTTVVGTLGG PAQLQFVCDG QEGGGLQPNG
     RVVRICSNPP CEMHETGTTN TATTVLSAGQ KICSNPPCET HETGTTNTAT ISKTERAPSE
     PPCHTFQTSA TGTTMTVKPI VGTGQLVCSN PPRETHETGT TNTATTTTSN MGQGQPDDGQ
     KELTNTSCQT KQTGSTSTTM TTKTNIPTEH PCAVQIVSFM PAAGLPSAAS GTTVEKSNER
     LQALKKVQCE SHHTHTTNTA TTARSLMGQG QPDGEWVGSA NPPCQTQQTN STSTTMSVKI
     DVPVQSSAPC KTQSTDTKDT NLQVCSNPPC ETHETGTTNT ATVTTSNLGA TQQVCSNPPC
     EIHETGTTNT ATTATSNIAV GQQVCSNPPC ETHETGTTNT ATTATSNIAV GQQVCSNPPC
     ETHETGTTNT ATTTTSNLGA EQNEGAQHQH PPVTPPCETQ QTNSTSTTMT PSIGGDGAQS
     SSSASAQCGN SELRKTPERE PSGIAGSLPH SSTPRICSNP PCETHETGTT HTATTVTSSM
     GANQDQPPAT NGQQGEPEIP VTESPAASSP NTAVVTTVSS TQIRAVTTVT QSTPAPGPSV
     PNISSLTDVP AGEAVPPVET PDATSTDSLP APSEPLQPSI EMQTAVATPE PQGQAQVAST
     GTPPTEGPLG EPAPHPEPAP PTXEGPLGEP APHPELAPPT EEAPPTPATP TQESVAMVVL
     PQATPAAPPS SEVEQLALPQ ELMAESQAGT TTLMVTGLTP EELAVTAAAE AAAQAAATEE
     AQALAIQAVL QAAQQAVMGP GEAMDTSEAA ATQAELGHLS SEGPEGQPTA IPIVLTQQDL
     AALVQQQQQQ LQEAQAAAAA AAVQQQQQQQ QQQQQQPPPP QHQLPAHLPT EALAPADSLN
     DPASESNGLN ELASAVTSTV GLLPPTPTES LAPSNTFVAP QPVVVASPAK LQAAAALTEV
     ANGIEPAPVK PEPPTQPPKV VVKKENQWFD VGIVKATNMV VTHYFLPPDD APATDDDSGT
     TPDYSQFKKQ ELQPGTAYKF RVAGINACGR GPFSEISAFK TCLPGFPGAP CAIKISKSPD
     GAHLTWEPPS VTSGKITEYS VYLAIQSPQA GEPKSAAPAQ LAFMRVYCGP SPSCLVQSAS
     LSNAHIDYTT KPAIIFRIAA RNEKGYGPAT QVRWLQESSK DGSMTKPTNK RPLSSSDFLC
     LNSPSQPVTP CMEEDGTAES TFHTPWDELL GWGGREKRR
//
DBGET integrated database retrieval system