ID A0A0N4U887_DRAME Unreviewed; 1040 AA.
AC A0A0N4U887;
DT 09-DEC-2015, integrated into UniProtKB/TrEMBL.
DT 09-DEC-2015, sequence version 1.
DT 27-MAR-2024, entry version 35.
DE SubName: Full=Host cell factor 1 {ECO:0000313|WBParaSite:DME_0000324301-mRNA-1};
GN ORFNames=DME_LOCUS7402 {ECO:0000313|EMBL:VDN57429.1};
OS Dracunculus medinensis (Guinea worm).
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Spirurina; Dracunculoidea; Dracunculidae; Dracunculus.
OX NCBI_TaxID=318479 {ECO:0000313|Proteomes:UP000038040, ECO:0000313|WBParaSite:DME_0000324301-mRNA-1};
RN [1] {ECO:0000313|WBParaSite:DME_0000324301-mRNA-1}
RP IDENTIFICATION.
RG WormBaseParasite;
RL Submitted (FEB-2017) to UniProtKB.
RN [2] {ECO:0000313|EMBL:VDN57429.1, ECO:0000313|Proteomes:UP000274756}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Pathogen Informatics;
RL Submitted (NOV-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; UYYG01001160; VDN57429.1; -; Genomic_DNA.
DR STRING; 318479.A0A0N4U887; -.
DR WBParaSite; DME_0000324301-mRNA-1; DME_0000324301-mRNA-1; DME_0000324301.
DR Proteomes; UP000038040; Unplaced.
DR Proteomes; UP000274756; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR CDD; cd00063; FN3; 2.
DR Gene3D; 6.10.250.2590; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 2.
DR Gene3D; 2.120.10.80; Kelch-type beta propeller; 2.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR043536; HCF1/2.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR015915; Kelch-typ_b-propeller.
DR InterPro; IPR006652; Kelch_1.
DR PANTHER; PTHR46003; HOST CELL FACTOR; 1.
DR PANTHER; PTHR46003:SF1; HOST CELL FACTOR; 1.
DR Pfam; PF01344; Kelch_1; 1.
DR Pfam; PF13415; Kelch_3; 1.
DR Pfam; PF13418; Kelch_4; 1.
DR Pfam; PF13854; Kelch_5; 1.
DR SMART; SM00060; FN3; 2.
DR SUPFAM; SSF49265; Fibronectin type III; 1.
DR SUPFAM; SSF117281; Kelch motif; 1.
PE 4: Predicted;
KW Kelch repeat {ECO:0000256|ARBA:ARBA00022441};
KW Reference proteome {ECO:0000313|Proteomes:UP000274756};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 367..879
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|SMART:SM00060"
FT DOMAIN 895..990
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|SMART:SM00060"
FT REGION 647..718
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 732..769
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 660..711
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 734..769
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1040 AA; 113458 MW; 90B77B9E22464EEF CRC64;
MDSDRRDPMV GTSFSNFPSV RWKKVTNTTG PTPRPRHGHR AVAIKDLMIV FGGGNEGIVD
ELHVYNTATN QWFVPAIRGD VPPGCAAFGI ICDGTRIFLF GGMVEFGRYS AELYELQASR
WQWKHLRVRP PKSGSAGPCE RLGHSFTLAS NQVCYIFGGL ANDSPDPKNN IPRYLNDLWA
LDLKYGTNNL QWECPSTFGT GPSARESHAG VMFEKDGYRK LIIHGGMDGC RLNDLWILDL
NSMTWSNPQP EGIPPLPRSL HSANIIGDRM FIFGGWVPLV LGDSKMDQIE KEWKCTNTLA
SLDLNSMCWE TLSTEIYEDA LPRARAGHSA VVINKRIYIW SGRDGYRKAW NNQVCCKDMW
FLETDRPAAP SKIQLVRATI CGLEVCWNSV PTAEAYLLQL RKYESTPRGT EDGARAVGMV
RMSAAKSQIS SNKLIAIQRS SGGPQVMKVV RSGPHLSSGV ITPTSGGSLL RVVSASKGPQ
TTATSIPKSG TKTIIVTKAT GTGTAARKLF LVQPSTSSGS SVGGVRSNTQ LSLVDSVPQR
ISSKASSVAV VGSEHTSTIS THGTTYTSPM TSNVDTGLPQ NLLDESLMET DSGMGQPDSV
DQIQEPSDSY RTSQVVEFMP TEQASTSLSS ETTQGLNDGV TPEITHDSEA PVEQKINQSS
KEIQNEENEI KSTEDLLGLR NSDEIKENSD HVKENFGKQE IPADDMKESI TESQNIESGD
GVVPFFKVEA PESSVSDNLV NPSASESKSG ENTVESSDNM PVSSTAPSVA DVASENSVAC
DSKVTQTLKV VKVEATSTQN EPYPVKEDPP WFDVGIIKGT SCVVTHFFLP SDMPLEDTYS
TDFEVGMHTS QVGLLRKAEL EPGTAYKFRV AGINACGRGE WSEVTAFKTC LPGFPGAPSN
IKVSKSVRGA HLSWDPPHNI GGRICEYSVY LAVRSSHMGN ETQLAFVRVY VGPESSCVVS
TTDLQVAYID MASKPAIIFR IAARNDKGYG PATQVRWLQD QRQRLPPVIG RQSQPLHQPM
VNDYHGISPV HIQAKRMRLE
//