ID W6UIK1_ECHGR Unreviewed; 1676 AA.
AC W6UIK1;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 27.
DE SubName: Full=PERQ amino acid-rich with GYF domain-containing protein 2 {ECO:0000313|EMBL:EUB61300.1};
GN ORFNames=EGR_03786 {ECO:0000313|EMBL:EUB61300.1};
OS Echinococcus granulosus (Hydatid tapeworm).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Cestoda;
OC Eucestoda; Cyclophyllidea; Taeniidae; Echinococcus;
OC Echinococcus granulosus group.
OX NCBI_TaxID=6210 {ECO:0000313|EMBL:EUB61300.1, ECO:0000313|Proteomes:UP000019149};
RN [1] {ECO:0000313|EMBL:EUB61300.1, ECO:0000313|Proteomes:UP000019149}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24013640; DOI=10.1038/ng.2757;
RA Zheng H., Zhang W., Zhang L., Zhang Z., Li J., Lu G., Zhu Y., Wang Y.,
RA Huang Y., Liu J., Kang H., Chen J., Wang L., Chen A., Yu S., Gao Z.,
RA Jin L., Gu W., Wang Z., Zhao L., Shi B., Wen H., Lin R., Jones M.K.,
RA Brejova B., Vinar T., Zhao G., McManus D.P., Chen Z., Zhou Y., Wang S.;
RT "The genome of the hydatid tapeworm Echinococcus granulosus.";
RL Nat. Genet. 45:1168-1175(2013).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EUB61300.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; APAU02000021; EUB61300.1; -; Genomic_DNA.
DR STRING; 6210.W6UIK1; -.
DR EnsemblMetazoa; XM_024493035.1; XP_024352496.1; GeneID_36339501.
DR OMA; ITDYIAM; -.
DR OrthoDB; 5406906at2759; -.
DR Proteomes; UP000019149; Unassembled WGS sequence.
DR Gene3D; 3.30.1490.40; -; 1.
DR InterPro; IPR003169; GYF.
DR InterPro; IPR035445; GYF-like_dom_sf.
DR PANTHER; PTHR14445:SF36; FI03272P-RELATED; 1.
DR PANTHER; PTHR14445; GRB10 INTERACTING GYF PROTEIN; 1.
DR Pfam; PF02213; GYF; 1.
DR SMART; SM00444; GYF; 1.
DR SUPFAM; SSF55277; GYF domain; 1.
DR PROSITE; PS50829; GYF; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000019149}.
FT DOMAIN 406..461
FT /note="GYF"
FT /evidence="ECO:0000259|PROSITE:PS50829"
FT REGION 93..143
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 174..194
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 222..256
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 286..342
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 663..763
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 779..851
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1141..1162
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1244..1278
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1350..1369
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 93..123
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 295..324
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 669..685
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 694..730
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 791..816
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1255..1278
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1676 AA; 183311 MW; 68667962736B6215 CRC64;
MEPTSIAHRY VYTREQLIEI RLAFISSGGP SRYPIADSVI SLDPEKRVLQ RKSKPIVDTA
VLQAKSSKNT ESFYSGGVEI VTCHGSDMTQ YSTGWRRTRR TSGSVSQSPL DAPTSQRHNS
GSSGGEEGCG GGNGSGGNGG GTVRREFDWS LGCGKWRHRS TSGSERGAHK TVFYASGGRG
RGRGGNGSGG SEATEFHREQ GGGVFMGSPG TRGRGFMRRP FQHGGQSTRG GRGGGGFKPF
NNHGTDHGPT EEDFEPHDEY YKHEHPHHHP YLSHHYTSHQ FHKDVGLDDV SQGGEGRWNH
QTSTAAAQLP QMSSSQSTEQ HGGGTPQPPP SPSSLPQTCQ AAVAPTMSES VATALGADLV
TKPIKEGFSF MPNFPSRPLV ESRVQATPPP PPPLPSIPLQ SSQSSSTLWY YRDPSGLVRG
PYDDATMAAW FSAGYFPIQI EVRRECDKVF SRITDYIAML GRFPFVGSSD LPPILEHRAA
MLASTGSAAV APPPLPSIAG MVPQAPPLLP QQMNLLQTPL FQSSVALPST AASASMPQPP
QISAAVVGGQ SPLVNPNQEV SGQVSQGAES SFIHYGDTRS TLQASKSIFD LNSLTKATLS
EMVRLQRDAQ RLCERVSALG MDQNLLAKSF AALSLGAVEN TASASTPPNI IDLQQLEAAI
HRQQQEENEQ LKQLQQQQAA IESVSPPPSS PVKEKENIKV EDDTGKLAKV EPTPEKKQAA
RKDKTVKVDQ PPKTPPKTEL PTPMATIAKK KKKKSKAAAV GVDAEKVVEE KETVTIVAGA
SLERDSDWIV VPSTGATSHT PAPASSTSSQ QEQDQQQQPK KSKKKRKPKP TASELQQQAW
EQEEQRRRRM NAERIAQAEA EAAAAAAAEV AEAAAASVKI SAAARRQQEE AQRLLQQRKA
REAAMREAAA NLAHLKLPAA ARWGAASSSN GGGGAPTSMA DIFASQMQEE EFARAKPAPP
ATFASKLAGG LCGSASSTAP TPVATKSKVF VPVEVPQKAL TQSKQVASTP KMAATMSKTN
VISIWDLPTD SNAIKAANSK KTKKKKANAS THLLTGPIIS LAPKARAELV RWCEGQLSGF
PRHNVDIPTL ISLLCDIEEA EDVLECIESS FGKSQRVSKF SKAFVEKRAN LMEQQFMDPI
DASESTGEQT SPDTFQTEGN NEDNTFDIDW SGEVSKWCEG QILRESTESL TRLLLEQVKE
QHLQIAEMRA NEARHQSSIE DIRRSYDKEL LRLQTLLTKH QTRSGISLNG SSKEVKDVGA
EEHASEDPKS TSRETELETE IDLQNHLIEG FQRENEALMK ENKELKQKLA STLIPSEDAN
AIAERLVHEN ARLQIGFQQV KDELNALKST HESAPKNEKK ENSTHNIQEE LEQSLKRLEE
ERDLNRGLQD FLHNLEAELK QTKEEQSRLQ SLLSTSDSKR MKEVKDRQNR IVTLNAELKR
VRQDYITAMQ RVQEKMRWQT NTQSISRRHK LEIMKLKMEL EKKQCMRSSK SKVENPFLTT
VTMDTQTDHI QVSTAETQTA VTPKDVPDVA TPAGGGTEHD FAHGDGDTRY SALVEELQEE
LRASHQRSKR LEAQLNTLSR VHMAVITGRS PKEALNWDSG VKMGKLNFEG HLNNERPVNS
KDECSASYWR SIAEKRTHDV ALLSREMDKL TNLLTEISTR KSLTAGLSTQ SAKGTV
//