ID A0A087YBP1_POEFO Unreviewed; 1769 AA.
AC A0A087YBP1;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 2.
DT 27-MAR-2024, entry version 47.
DE SubName: Full=Host cell factor C1 {ECO:0000313|Ensembl:ENSPFOP00000015444.2};
OS Poecilia formosa (Amazon molly) (Limia formosa).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC Poecilia.
OX NCBI_TaxID=48698 {ECO:0000313|Ensembl:ENSPFOP00000015444.2, ECO:0000313|Proteomes:UP000028760};
RN [1] {ECO:0000313|Proteomes:UP000028760}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=female {ECO:0000313|Proteomes:UP000028760};
RA Schartl M., Warren W.;
RL Submitted (OCT-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSPFOP00000015444.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AYCK01010943; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_007565701.1; XM_007565639.2.
DR STRING; 48698.ENSPFOP00000015444; -.
DR Ensembl; ENSPFOT00000015466.2; ENSPFOP00000015444.2; ENSPFOG00000015276.2.
DR GeneID; 103147348; -.
DR KEGG; pfor:103147348; -.
DR CTD; 564853; -.
DR eggNOG; KOG4152; Eukaryota.
DR GeneTree; ENSGT00940000164702; -.
DR OMA; NSETQTH; -.
DR OrthoDB; 4642026at2759; -.
DR Proteomes; UP000028760; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR CDD; cd00063; FN3; 2.
DR Gene3D; 6.10.250.2590; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 2.
DR Gene3D; 2.120.10.80; Kelch-type beta propeller; 2.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR043536; HCF1/2.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR015915; Kelch-typ_b-propeller.
DR InterPro; IPR006652; Kelch_1.
DR PANTHER; PTHR46003; HOST CELL FACTOR; 1.
DR PANTHER; PTHR46003:SF3; HOST CELL FACTOR 1; 1.
DR Pfam; PF01344; Kelch_1; 1.
DR Pfam; PF13415; Kelch_3; 1.
DR Pfam; PF13854; Kelch_5; 2.
DR SUPFAM; SSF49265; Fibronectin type III; 1.
DR SUPFAM; SSF117281; Kelch motif; 2.
DR PROSITE; PS50853; FN3; 2.
PE 4: Predicted;
KW Kelch repeat {ECO:0000256|ARBA:ARBA00022441};
KW Reference proteome {ECO:0000313|Proteomes:UP000028760};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 1532..1623
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1625..1740
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT REGION 408..436
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 457..480
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 934..953
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1080..1099
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1104..1336
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1441..1472
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1731..1769
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 457..473
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1104..1309
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1769 AA; 181720 MW; 230231FFE0AD480B CRC64;
MSAPGSAVSG TTPSVLQPRW KRVLGWSGPV PRPRHGHRAV AIKELMVVFG GGNEGIVDEL
HVYNTATNQW FIPAVRGDIP PGCAAYGFVC DGTRLLVFGG MVEYGKYSND LYELQASRWE
WKKLKAKNPK NGPPPCPRLG HSFSLVGNKC YLFGGLANDS EDPKNNIPRY LNDLYTLELR
PGSSVVGWDI PITYGVLPPP RESHTAVVYT EKTTRKSRLI IYGGMSGCRL GDLWTLDIDT
LTWNKPSVGG TAPLPRSLHS ATTITNKMYV FGGWVPLVMD DVKVATHEKE WKCTNTLACL
NLDSMCWETV LMDTLEDNIP RARAGHCAVA INSRLYVWSG RDGYRKAWNN QVCCKDLWYL
ETERPHAPAR VQLVRANTNS LEVSWGAVST ADTYLLQLQK YDIPATPAAA SPVTSATPSQ
PINSPKSPAP AAAAPSAQSL QQTAVLKVAA QQAATGASVV TVRPSQPGKS PVTMTSLPPG
VRMVVPTQTT QGSPIGSSPQ MSGMAALAAA AAATQKIPPA SGGTVLNVPA GATILKTVAV
SPGTTTVKVA SPVMVSNPAT RMLKTAAAQV GTATASSPTT TRPIITVHKS GAVTVAQQAQ
VVTTVVGGVT KTITLVKSPL TMGSSGTLIS NLGKMMSVVQ TKPVQTSAVT GQASTNPLTQ
IIQTKGPLPA GTILKLVTSA DGKPTTIITT SQAGGTGNKP TILNISGVSP TTTKQGTTII
KTIPMSAIMT QPGATGVTSS AGMKTPITIL TTKVMTTGTP GKIITAVPKL STAAGQQGLT
QVVLKGAPGQ PGTILRTLPM STVGGVRLVT PVTVSSVKPT VTTLVVKGTT GVTTLGTVTG
TVSTSLVGAT ADSSSPSLVT PITTLANIAT LSSQVINQAA ITVSAAQTSL TSSSTLPSST
VTVQNQPTQV TLITTPSGVE AQPVQDLPVS ILASPTSEQP SSTEAGATGD GSGTVTLVCS
NPPCETHETG TTNTATTSSA TIGAGQVCSN PPCETHETGT TNTATTSSAT IGAGQVCSNP
PCETHETGTT NTATTSSATI GAGQVCSNPP CETHETGTTN TATTSSATIG AGQVCSNPPC
ETHETGTTNT ATTASSNMST LRVCSNPPCE THETGTTNTA TTATSNMGGT QQVCSNPPCE
THVTGTTNTA TQASSSMNAG ETGTVQSAHS NPPSETHESG TSDTPSAATS SVGEDQSSTA
TGQIQRVCSN PPYETHETGT TNTATTATCS METGEGTATQ QTEEGAEGTS TTEEAATTAA
TSVATTTQGR AITTVTQSTP APGPSVPSIS SITEGVSTAA SSTEEPMQTE EPASAEAAPA
EEPAAAMETQ AEGEAAAATA LNLPSELMSE AQGTTLMVTG LSDEELAVTA AAEAAAQAAA
TEEAQALAIQ AVLQAAQQAV MNEGDGSGES QQPTNIPIML TQQELAALVQ QQQQLQEAQA
AAQQASVDTS MPTEGLAPAD SLNDPSVESN GHNEMAASVT SAVASLLPRT TAETLAPSST
FAPSISVASP AKLQAAATLA EVANGIEGEK QAPQPVPVKP VVKKENQWFD VGIVKVTNMV
VTHFYVPGDD SQGDDDSGAV PDYSQMKKME LQPGTAYKFR VAGINACGRG TFSEISAFKT
CLPGFPGAPC AIKISKSPDG AHLTWEPPSV TSGKIIEYSV YLAIQSSQTA EAKASTPAQL
AFMRVYCGPN PSCLVQSSSL SNAHIDYTTK PAIIFRIAAR NEKGYGPATQ VRWLQESGKD
ATSAKPAPKR PGTSPDTKTT GPKKARTDQ
//