ID A0A2U9BH34_SCOMX Unreviewed; 1606 AA.
AC A0A2U9BH34;
DT 12-SEP-2018, integrated into UniProtKB/TrEMBL.
DT 12-SEP-2018, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE SubName: Full=Putative host cell factor 1 {ECO:0000313|EMBL:AWP03345.1};
GN ORFNames=SMAX5B_014135 {ECO:0000313|EMBL:AWP03345.1};
OS Scophthalmus maximus (Turbot) (Psetta maxima).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Carangaria; Pleuronectiformes; Pleuronectoidei; Scophthalmidae;
OC Scophthalmus.
OX NCBI_TaxID=52904 {ECO:0000313|EMBL:AWP03345.1, ECO:0000313|Proteomes:UP000246464};
RN [1] {ECO:0000313|EMBL:AWP03345.1, ECO:0000313|Proteomes:UP000246464}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Martinez P.;
RT "Integrating genomic resources of turbot (Scophthalmus maximus) in depth
RT evaluation of genetic and physical mapping variation across individuals.";
RL Submitted (DEC-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP026248; AWP03345.1; -; Genomic_DNA.
DR STRING; 52904.ENSSMAP00000022844; -.
DR Proteomes; UP000246464; Chromosome 6.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR CDD; cd00063; FN3; 2.
DR Gene3D; 6.10.250.2590; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 2.
DR Gene3D; 2.120.10.80; Kelch-type beta propeller; 2.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR043536; HCF1/2.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR015915; Kelch-typ_b-propeller.
DR InterPro; IPR006652; Kelch_1.
DR PANTHER; PTHR46003; HOST CELL FACTOR; 1.
DR PANTHER; PTHR46003:SF3; HOST CELL FACTOR 1; 1.
DR Pfam; PF01344; Kelch_1; 1.
DR Pfam; PF13415; Kelch_3; 1.
DR Pfam; PF13854; Kelch_5; 2.
DR SMART; SM00060; FN3; 3.
DR SUPFAM; SSF49265; Fibronectin type III; 1.
DR SUPFAM; SSF117281; Kelch motif; 1.
DR PROSITE; PS50853; FN3; 2.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Kelch repeat {ECO:0000256|ARBA:ARBA00022441};
KW Reference proteome {ECO:0000313|Proteomes:UP000246464};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 1371..1460
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1462..1577
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT REGION 408..435
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 934..1161
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1567..1606
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 1255..1285
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 412..426
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 934..1137
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1606 AA; 165320 MW; D6B3EEA37E596893 CRC64;
MSAPGSAVSG TTASVLQPRW KRVLGWSGPV PRPRHGHRAV AIKELMVVFG GGNEGIVDEL
HVYNTATNQW FIPAVRGDIP PGCAAYGFVC DGTRLLVFGG MVEYGKYSND LYELQASRWE
WKKLKAKNPK NGPPPCPRLG HSFSLVGNKC YLFGGLANDS EDPKNNIPRY LNDLYTLELR
AGSSVVGWDI PITYGVLPPP RESHTAVVYT EKTSRKSRLI IYGGMSGCRL GDLWTLDIDT
LTWNKPSVNG TAPLPRSLHS ATTITNKMYV FGGWVPLVMD DVKVATHEKE WKCTNTLACL
NLDSMCWETV LMDTLEDNIP RARAGHCAVA INSRLYVWSG RDGYRKAWNN QVCCKDLWYL
ETERPHAPAR VQLVRANTNS LEVSWGAVST ADTYLLQLQK YDIPATPAAA SPAMSTAPTQ
PVNSPKSPAL AAAAPSAQSL PQTAVLKVAA QQSATGTSVV TVRPSQPGKS PVTVTSLPPG
VRMVVPAQTT QGSPIGSSPQ MSGMAALAAA AAATQKIPPS SAGTVLNVPG GATILKTVAV
SPGTTTVKMA SPVMVSNPAT RMLKTAAAQV GTATASSPTN TRPIITVHKS GAVTVAQQAQ
VVTTVVGGVT KTITLVKSPL TMGGSGTLIS NLGKMMSVVQ TKPVQTSAIT GQASTNPLTQ
LIQTKGQLPA GTILKLVTSA DGKPTTIITT SQAGGTGNKP TILNISGVSP TTTKQGTTII
KTIPMSAIMS QSGATGVTSS AGIKTPITIL TTKVMTTGTP GKIITAVPKL ATAAGQQGLT
QVVLKGAPGQ PGTILRTLPM GTVGGVRLVT PVTVSAVKPT VTTLVVKGTT GVTTLGTVTG
TVSSSLAGGT VDSSNASLVT PITTLGTIAT LSSQVISPAA ITVSAAQTSL TSASSLPSST
MTVQNQPTQV TLITTPSGVE AQPVQDLPVS ILASPTSEQP SSTEAGAAGE GSGTVTLSSC
SNPPCETHET GTTNTPSTAT SSMGSDQTST ASGQVQRVCS NPPCETHETG TTNTSSTATS
SMGGDQTSTP SGQVQRVCSN PPCETHETGT TNTATTATCN METGEGTAQQ TEEESEGTSS
TEVASTTATT GTVTNTQGRA ITTVTQSTPA PGPSVPSISS ITEGGSTAAS STEEPMQTDE
AAAAEAAPAE EAATAMETQA EGEAAAAAAA AAAAATDLNL PSELMSEGQG ATLMVTGLSD
EELAVTAAAE AAAQAAATEE AQALAIQAVL QAAQQAVMNE SDSTGESQQP TTIPIMLTQQ
ELAALVQQQQ QLQEAQAAAQ QATVDTSLPT EGLAPADSLN DPSVESNGHN EMAAAVTSAV
ASLLPRTTAE TLAPSSTFAP SVAVASPAKL QAAAALTEVA NGIEGEKQAP QPTPVKPVVK
KENQWFDVGI VKVTNMVVTH FYVPADDSHG DDDSGVMPDY SQMKKMELQP GTAYKFRVAG
INACGRGSFS EISAFKTCLP GFPGAPCAIK ISKSPDGAHL TWEPPSVTSG KIIEYSVYLA
IQSNQTAEAK ASTPAQLAFM RVYCGPNPSC LVQSSSLSNA HIDYTTKPAI IFRIAARNEK
GYGPATQVRW LQESGKDAAS AKPAPKRPGT SPDTKATGPK KARTDQ
//