ID B4HIS0_DROSE Unreviewed; 968 AA.
AC B4HIS0;
DT 23-SEP-2008, integrated into UniProtKB/TrEMBL.
DT 23-SEP-2008, sequence version 1.
DT 06-MAR-2013, entry version 30.
DE SubName: Full=GM23891;
GN Name=GM23891; Synonyms=Dsec\GM23891; ORFNames=Dsec_GM23891;
OS Drosophila sechellia (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
OC Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha;
OC Ephydroidea; Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7238;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Rob3c / Tucson 14021-0248.25;
RX PubMed=17994087; DOI=10.1038/nature06341;
RG Drosophila 12 genomes consortium;
RT "Evolution of genes and genomes on the Drosophila phylogeny.";
RL Nature 450:203-218(2007).
CC -----------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution-NoDerivs License
CC -----------------------------------------------------------------------
DR EMBL; CH480815; EDW42717.1; -; Genomic_DNA.
DR RefSeq; XP_002031731.1; XM_002031695.1.
DR EnsemblMetazoa; FBtr0206876; FBpp0205368; FBgn0178756.
DR GeneID; 6606937; -.
DR KEGG; dse:Dsec_GM23891; -.
DR FlyBase; FBgn0178756; Dsec\GM23891.
DR KO; K00288; -.
DR OrthoDB; EOG4WH71P; -.
DR GO; GO:0005524; F:ATP binding; IEA:InterPro.
DR GO; GO:0004329; F:formate-tetrahydrofolate ligase activity; IEA:InterPro.
DR GO; GO:0004488; F:methylenetetrahydrofolate dehydrogenase (NADP+) activity; IEA:InterPro.
DR GO; GO:0009396; P:folic acid-containing compound biosynthetic process; IEA:InterPro.
DR Gene3D; 3.40.50.720; -; 1.
DR HAMAP; MF_01543; FTHFS; 1; -.
DR HAMAP; MF_01576; THF_DHG_CYH; 1; -.
DR InterPro; IPR000559; Formate_THF_ligase.
DR InterPro; IPR020628; Formate_THF_ligase_CS.
DR InterPro; IPR016040; NAD(P)-bd_dom.
DR InterPro; IPR000672; THF_DH/CycHdrlase.
DR InterPro; IPR020630; THF_DH/CycHdrlase_cat_dom.
DR InterPro; IPR020867; THF_DH/CycHdrlase_CS.
DR InterPro; IPR020631; THF_DH/CycHdrlase_NAD-bd_dom.
DR Pfam; PF01268; FTHFS; 1.
DR Pfam; PF00763; THF_DHG_CYH; 1.
DR Pfam; PF02882; THF_DHG_CYH_C; 1.
DR PRINTS; PR00085; THFDHDRGNASE.
DR PROSITE; PS00721; FTHFS_1; 1.
DR PROSITE; PS00722; FTHFS_2; 1.
DR PROSITE; PS00766; THF_DHG_CYH_1; 1.
DR PROSITE; PS00767; THF_DHG_CYH_2; 1.
PE 3: Inferred from homology;
KW Complete proteome.
SQ SEQUENCE 968 AA; 103408 MW; E7883FDD99B92874 CRC64;
MSAQYQRFLK VLEKWPAEKS KVGSGEWTAE PAIKMSGAKI ISGTAVAKSI REELRNEVTA
MGKQLADFVP GLRIVQVGGR EDSNVYIRMK IKAATEIGID AAHVQLPRSI TEVELLDKIN
DLNEDPRVHG IIVQMPLDCD TTIDSHRITD AVSPEKDVDG LHTVNEGRLA IGDLGGFLPC
TPWGCLELIR RSGVEIAGAR AVVLGRSKIV GTPAAELLKW ANATVTVCHS KTRNLEEITR
SADILVVGIG VAEMVKGSWI KPGAVVIDCG INVKPDASKA SGSKLVGDVD YAEALQVAGH
LTPVPGGVGP MTVAMLMKNT VRSAARFLER LARSQWALQT LPLKPQRPVP SDIVIARAQK
PKDIAVLAKE IGLEAREVSL YGNKKAKISL SVLERLKDKE AGHYVVVAGM TPTPLGEGKT
TTLMGLVQAL GAHKLRNTMA ALRQPSQGPT FGIKGGAAGG GYAQVIPMEE FNLHLTGDIH
AVSAANNLLA AQLDTRIFHE NTQKDKALYD RLVPAIKGQR KFSPIQLRRL QKLGITKTDP
DTLTAEEYGP FARLDIDPDT IMWERVVDIN DRYLRTITVG QSPTEKGISR ETRFSISVAS
EIMAVLALSR SLEDMKQRLA DMVVAFDKRG KPVTADDLGV TGALAVLLKD ALEPNLMQSL
EGTPVLVHAG PFANIAHGCN SIIADEVGLK LVGKDGFVCT EAGFGSDIGM EKFCNIKCRT
SGRKPNAMVL VATVRAIKMH GGGAPVTPGA PLNKQYTEEN LELVQKGLPN LLQHIENGKA
FGMPVVVSLN AHSADTPAEH ELVKKAALEA GAFAAVVSTH WADGGAGAVE LAGAVIKACE
QGNQFRLLYD LDLPLVDKMN KIATTMYGAG KVVLSPAAEE KVKRLTDAGF GNLPICMSKV
SGSFTGDAKV KGAPKGFTLD VEDVYVSAGA GFVVAMCGEV TKMPGLPTRP AIYDIDLNTE
TGEIEGLF
//