ID Q7PZ77_ANOGA Unreviewed; 1492 AA.
AC Q7PZ77;
DT 15-DEC-2003, integrated into UniProtKB/TrEMBL.
DT 23-OCT-2007, sequence version 4.
DT 24-JAN-2024, entry version 97.
DE SubName: Full=AGAP011769-PA {ECO:0000313|EMBL:EAA00365.4};
GN ORFNames=AgaP_AGAP011769 {ECO:0000313|EMBL:EAA00365.4};
OS Anopheles gambiae (African malaria mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=7165 {ECO:0000313|EMBL:EAA00365.4};
RN [1] {ECO:0000313|EMBL:EAA00365.4}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PEST {ECO:0000313|EMBL:EAA00365.4};
RX PubMed=12364791; DOI=10.1126/science.1076181;
RA Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z.,
RA Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V.,
RA Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I.,
RA Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A.,
RA McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D.,
RA O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B.,
RA Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M.,
RA Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA Collins F.H., Hoffman S.L.;
RT "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL Science 298:129-149(2002).
RN [2] {ECO:0000313|EMBL:EAA00365.4}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA00365.4};
RG The Anopheles Genome Sequencing Consortium;
RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:EAA00365.4}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA00365.4};
RX PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT "The Anopheles gambiae genome: an update.";
RL Trends Parasitol. 20:49-52(2004).
RN [4] {ECO:0000313|EMBL:EAA00365.4}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA00365.4};
RX PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5;
RA Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F.,
RA Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.;
RT "Update of the Anopheles gambiae PEST genome assembly.";
RL Genome Biol. 8:R5.1-R5.13(2007).
RN [5] {ECO:0000313|EMBL:EAA00365.4}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA00365.4};
RG VectorBase;
RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EAA00365.4}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAAB01008986; EAA00365.4; -; Genomic_DNA.
DR RefSeq; XP_320741.4; XM_320741.4.
DR STRING; 7165.Q7PZ77; -.
DR PaxDb; 7165-AGAP011769-PA; -.
DR GeneID; 1280871; -.
DR KEGG; aga:AgaP_AGAP011769; -.
DR VEuPathDB; VectorBase:AGAP011769; -.
DR eggNOG; KOG4712; Eukaryota.
DR HOGENOM; CLU_002068_1_0_1; -.
DR OMA; YLYCGAY; -.
DR OrthoDB; 8542at2759; -.
DR PhylomeDB; Q7PZ77; -.
DR ExpressionAtlas; Q7PZ77; differential.
DR GO; GO:0006281; P:DNA repair; IEA:InterPro.
DR CDD; cd11721; FANCD2; 1.
DR InterPro; IPR029448; FANCD2.
DR PANTHER; PTHR32086; FANCONI ANEMIA GROUP D2 PROTEIN; 1.
DR PANTHER; PTHR32086:SF0; FANCONI ANEMIA GROUP D2 PROTEIN; 1.
DR Pfam; PF14631; FancD2; 2.
PE 4: Predicted;
FT REGION 1..32
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 917..939
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1421..1492
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..28
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1466..1492
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1492 AA; 164908 MW; 1CED101320A4D64F CRC64;
MNEIFSQAKR RRHAPMAGND RAGDANEEDD FFGSQFSVPA SQAASQRRYL SQRSNLQRSQ
VVKPGRRPPT NYYESVLFMA GLSLDEPGGV VVLKCEPIVF MHKLKSVLRT NADYPANVDR
FLAGIETTMK DRTNFLKLLE CCQVVPNATA LDDGTMQPVR KPVQESLMKM FLLVDFLQLK
LIELLFAYLS KELGTMATPE GDGAEDSIVG FVLSQLKFID HVQNGELVFE KVFELLGKAN
RQPVVFDEII FSLEDVIDVS KHDDALARLV KLRPRPQDLI TPTTVEVFTG MCLSVETLEM
LRRKVVQYAA DGCPLRYYPA LVKMLLKFNR TEPAENIAGL VREVRRLLDT GEAADTWAAS
TADDCAEQVL RTIYQAISAS AMLYDAWITV IRQLPGAEEH LPVDLLLLTI TTTLNEVKAP
RIRKLTIRKI EQQFFTDAHT DQLVGGCFRR VLQTHLDGFL QLIECCTRER HERVCEFGVA
AMSALFALDQ SGAATDNRVV LSKIVGFICE MASANVASRN DFLIGRCIGA LRKLHESHPK
EIERSAELLL RILDIAPELS LRQYRPLISV IYAAVIPPRP LDDDAELAAI RDNLEIIVKK
QLMCDSKDTK RKGIIGLVQM VYHLSLAPAS DDAAELSSSF DSERTIGTVS EIPSATGRTL
ANLVSTLFLS TNQSPDLLAI CYDELAGMLA QPRPKAAPVW EKTFIMWLSD TITMEFQGTF
LVESDTPLPS RDATGPGGIL LARKLCINEA EEEAANTEAA TIAVNVGGGV LGHTGSRHTA
ICYLAPIFRL MRSLHFVRYG GVLESINALL GCMVVVPEFY GVGEEEERRF SVDTYDETTC
TLLLDIYFHL SNWFRECVSA FVGQDDASIR KKVLERLHEL VTLEHRLGRM LERMVADSEY
IPPLASDFMD PPAAATKKLK PLESSSTRGR PKAGAHDRTV NLDAPLSTQT AAATAPATVG
QFNEQSLLLR CSYALRRMDP ELVRLFTVEL VPARVLELPE QRGTHLALAE YRFVLENVTT
ALEQSSEAEG RYQRCMVERW EAFVERTATL QHRLQEATNR LQPASVQELL PLKSCYLWSL
RLATALLPWK RFREQRGRVE LARVLRLMVL QLTEREDGAS PGDDLHVAEL AKLLLKGLIV
HESCFGDLSI AAQLYRLARA IGECVGDRTS SARSIARFTR SVLIAPDMRT ADSKAAAHFT
TILDGLIETV SLKQVKQLIA KGEATTTTVP DTVRTFASFK RAQYAQLFKG LCRAFLRVLQ
DEIRIRGSGG GGGGGPNRRL ELWETACEAT SELTAVVKRA QQAGSFGVYL RFAQIFIKLF
LKSGLPALET ILRCSAERAS NLLSTLQTTT RYLHNVCCQT KVAKGAGSSA GALAQIPFVR
ESVETLVYRV KAALVANRCS AVFWMGNLKN KDMQGELIAS QLPQPDSEDE EEGEGDGGDG
TNGVALHDVS DIADDLSDDE GRDNGGGSSK ASSSSNQRVG SVSKSSSQSK CF
//