ID A0A1I8GYJ3_9PLAT Unreviewed; 2782 AA.
AC A0A1I8GYJ3;
DT 18-JAN-2017, integrated into UniProtKB/TrEMBL.
DT 18-JAN-2017, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=ETS domain-containing protein {ECO:0000313|WBParaSite:maker-uti_cns_0003560-snap-gene-0.2-mRNA-1};
OS Macrostomum lignano.
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes;
OC Rhabditophora; Macrostomorpha; Macrostomida; Macrostomidae; Macrostomum.
OX NCBI_TaxID=282301 {ECO:0000313|Proteomes:UP000095280, ECO:0000313|WBParaSite:maker-uti_cns_0003560-snap-gene-0.2-mRNA-1};
RN [1] {ECO:0000313|WBParaSite:maker-uti_cns_0003560-snap-gene-0.2-mRNA-1}
RP IDENTIFICATION.
RG WormBaseParasite;
RL Submitted (NOV-2016) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|RuleBase:RU004019}.
CC -!- SIMILARITY: Belongs to the ETS family. {ECO:0000256|ARBA:ARBA00005562,
CC ECO:0000256|RuleBase:RU004019}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR WBParaSite; maker-uti_cns_0003560-snap-gene-0.2-mRNA-1; maker-uti_cns_0003560-snap-gene-0.2-mRNA-1; maker-uti_cns_0003560-snap-gene-0.2.
DR Proteomes; UP000095280; Unplaced.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR Gene3D; 1.10.10.1450; -; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 1.
DR InterPro; IPR000418; Ets_dom.
DR InterPro; IPR041426; Mos1_HTH.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR InterPro; IPR036390; WH_DNA-bd_sf.
DR PANTHER; PTHR46060; MARINER MOS1 TRANSPOSASE-LIKE PROTEIN; 1.
DR PANTHER; PTHR46060:SF3; MARINER MOS1 TRANSPOSASE-LIKE PROTEIN; 1.
DR Pfam; PF00178; Ets; 1.
DR Pfam; PF17906; HTH_48; 1.
DR PRINTS; PR00454; ETSDOMAIN.
DR SMART; SM00413; ETS; 1.
DR SUPFAM; SSF46785; Winged helix' DNA-binding domain; 1.
DR PROSITE; PS00346; ETS_DOMAIN_2; 1.
DR PROSITE; PS50061; ETS_DOMAIN_3; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|RuleBase:RU004019};
KW Nucleus {ECO:0000256|RuleBase:RU004019}.
FT DOMAIN 543..626
FT /note="ETS"
FT /evidence="ECO:0000259|PROSITE:PS50061"
FT REGION 351..437
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1027..1151
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1365..1439
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1502..1539
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1847..1869
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1895..1921
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 356..379
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1054..1078
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1087..1101
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1521..1539
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1851..1866
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2782 AA; 302390 MW; 7149E240C697251D CRC64;
MSTRSLRRGS SAVKANWCCA HGGLPEVHRF SRATSRPPLV LAPLVLAPPG AAPLVLAPCA
GPAPGAGAPG AGALVLAPLV LAPLVLAPLV LAPLVLAPLV LRPWCWRPWC WRPGAGAPGA
GAPAGAGALG AGALGAGASG AGASGAGASG AGASGRLLAP LVLAPLVLRL WCWRLWPLKS
LVLAPLAAKK DYDRNIKVAT RQTQQEVRQG TRPPRNWFYF ARARDAWRGP ADSNRTASTT
SAAELLMELS HSSTPPTVEP MSGPPPPIST GASGISLTSG TRFFSAGGGA SMFPQHRPVA
AADSEMFHDF PRPAATNSSL LFDSMKSLPP PPPPSQLPSL FQSQLLLQKS MPPPQAAPTL
TLPSPMPPPP PPMPPPSQQQ QQPSFYRSPF ARASPPLCPG QDSELYGPAS FQNGGAIMVP
PTIESGEPSL QNPKQSKQQE VLMLAELGQL PLPAMESWDP KQRMFWLKCM LTKHLHQPNP
AELAERVQKL RMVETCIRAQ PNTPLESYIN IFGEIFDKRI ALKLQYELDA WKQAASPEKV
GGDHLWEHIL RLLMNEDPTV EWLNIQEGLF RFVDSSGAAR SWGMTKGNMN MSYEKMSRSI
RQYYTDDKNG LIQKATEYPK LHYRFNMFHP EVVSFLKKNF MEVCQRHAFL CTVVNSIHFR
IGPELESRFE LVESWLRASS AVARECCSSL SSSASSRSTD SIVGDSAWCC WACGGAAAFL
GAFLEANGWR PDRLWVPMRG GAQRERDSCR SSAARLGAVF NRNGAAGGAL GCGCGRGDEG
WLALLQLNCR FSYDTYDKLL LMFNWWWRPS RSINSCIEDE LACLANKAEQ MRVISKKAVY
LAKAMLVSSW SNEPQWLRRN PARRPVRFLQ LFLGLQPVGI AVAQLSLVAH LQQMLQLRSA
SAAEANRLLQ SSARLAPIGQ PVRSRDPELI GRWNLIGYQA ASENGSSGWL VPAVRPQISL
LYSSSSCTFS ARTFSLRFSA VFNLRLAVIF EELSLNQHQP AACQDDHQQQ GQQSILHYAV
RLCGTGSEGP AGPAGRSQPT DSTVAKEGPA GSAGRSDQSY QTQNTLRPAS SAGGSKGTAS
SEPKGPAKSG RSDEAQRARL ERKRAKQRAK RQAKKGAAGA KAAASTMDAE ATGVPRQPGG
PAGGSSSSSA AALPGAAAAA AEAETTAQQM ATANKEVPSG ATFAAKAKAK IPGVIIQGRD
QDWTTDQLQK VWSAVDSYLI ELTVEEGISI GVERMVLRST FVLIAPSSEE DARRLLQRLP
TVSLDADLGG ALFLREGQRP KTIPYVVFVP AKSTAAGPDV IRRVLLRLNP DLPASGLVYH
RKVRRGETGN SIVLGLSSTW ASRYPNGSSF QLGALKLKLR RGKSAKGKAT ANPGISGKSG
AQSKAQGKAQ ADPKAKAASR PSASSSSATA VTAPAEPREP NLPVGEGVED EAGSSESELS
SATLHCVSAS VNLMKIAELY RPGLILIQEP WMRNARYLLN SLGHPAIQLI VVQLRLRRRR
RRRRWGSGGR SKRGLSIFES HPRRSGSSSH GSKVETVNTA SSDSAAAAAN RILQQLEKVG
LLLLHSRQLS HQLLPDRGRP GLEHKRCHIA WRDIHCLQGG CGGVGQLHDT VPAGSLLQKR
GAIVASGLRL RDQHHLVEQK HLALLEFQLA NSGINSRIVQ VSGQVAHVEH SFAAHLAQGV
LHGSSRIRRC TRSQAARAGP PRPSLRAFST MTPTRRLYRM QRCTGFIRPL VSTACRALAS
SSLFGRTAER LLSLGKPLAE GGARLLLRRR RLLLLRRLAQ NCGRQLTFAV IAEQPNGGDL
ARQEFDKLAQ LTSRHCELAG LGSLLALRIS RYSSLIGAPA GHMPNLHGSL STCSPPPPPP
PPPLSDSLTA TSLAKTMNKS NRNTWRSRPF RISCRGSQTT NESCRSTSPN LARRSPSGRP
SSMLRYEQLS RVGLINAVLN IERSQLLAKA DKAVPGVLRQ EAAGVFLQFA KKGAVMKLTR
SVLINSFNMY RRFLSPASCL CSALESNELM PLLAEPEAPE VTLLTEPRPD EALVLESCCC
GCFCSAGWEW PFLAKLVAMP YRISTFCAVS GGTADELEQG IDDSFTLSPR LQHLTLGLGQ
LVLIADYKSF NRAGVLTSAA VQVLHEGLRG APGEPAAAQE RVLLYVVLLG PFRDLLHSIW
AGQIRARVLR TAVVLLQILQ HLRHLDSAEK FVPTKLQTLL KIAHIGAPWM IGFHLFQFTM
ELNREQNRLL IWYCHKRKLT ALETHAELLA TLEAQAPSYA TVTRWYREFQ AGRTSFSDDP
RPGRPPTAVT EENIAAVQEL IHQDPRITTD MLAMNVGIGS AAVDTILHDH LRVRKLCARW
IPHILTDAQK QARMEFCQFL IDRYEHGSHQ RRSEVITGDE TFLYHFDVET KRSSAEWVPE
GGQRPLKARR SRSQGKRMFA IFFDSQGVVA MVKLEGQATV TARWYTEECL PVLAPLQLSI
EPSPALFNIL SLAVQLSQQP LQPAQRVLLV RDVKIHADRL EHVNQFTTFQ QNFFGTFHRL
NKPAATSSAS RRLAIALVKL SIRRLRFSAS VNRPSVSISA DGTRSSTALI SLSAFTMVVS
ALAMDLPTYT CLAKNGSPCS LSQASTWAQS CLKCKEDSRW YRCFFRSLLA ESRQRCECSR
RWIRLKQCWR LTSKLISATR ALGAGVCSAL PNIAEAVQQE GALLLHPIGH PTDVLFEQAQ
LVGLLQGHPG GLPFVNQETA QLVQVLNSIC DSRGQLLEVG VHADPAAADR FALGAAAEPL
GQTGRLGLQR VHRGLDNLSD LH
//