ID A0A267GYT4_9PLAT Unreviewed; 1299 AA.
AC A0A267GYT4;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 27-MAR-2024, entry version 18.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:PAA90449.1};
GN ORFNames=BOX15_Mlig002049g1 {ECO:0000313|EMBL:PAA90449.1};
OS Macrostomum lignano.
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes;
OC Rhabditophora; Macrostomorpha; Macrostomida; Macrostomidae; Macrostomum.
OX NCBI_TaxID=282301 {ECO:0000313|EMBL:PAA90449.1, ECO:0000313|Proteomes:UP000215902};
RN [1] {ECO:0000313|EMBL:PAA90449.1, ECO:0000313|Proteomes:UP000215902}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DV1 {ECO:0000313|EMBL:PAA90449.1};
RC TISSUE=Whole organism {ECO:0000313|EMBL:PAA90449.1};
RA Berezikov E.;
RT "A platform for efficient transgenesis in Macrostomum lignano, a flatworm
RT model organism for stem cell research.";
RL Submitted (JUN-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PAA90449.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NIVC01000110; PAA90449.1; -; Genomic_DNA.
DR STRING; 282301.A0A267GYT4; -.
DR Proteomes; UP000215902; Unassembled WGS sequence.
DR Gene3D; 1.10.1410.40; -; 1.
DR Gene3D; 1.25.40.20; Ankyrin repeat-containing domain; 3.
DR InterPro; IPR002110; Ankyrin_rpt.
DR InterPro; IPR036770; Ankyrin_rpt-contain_sf.
DR PANTHER; PTHR24178:SF9; ANK_REP_REGION DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24178; MOLTING PROTEIN MLT-4; 1.
DR Pfam; PF12796; Ank_2; 3.
DR SMART; SM00248; ANK; 11.
DR SUPFAM; SSF48403; Ankyrin repeat; 2.
DR PROSITE; PS50297; ANK_REP_REGION; 8.
DR PROSITE; PS50088; ANK_REPEAT; 8.
PE 4: Predicted;
KW ANK repeat {ECO:0000256|PROSITE-ProRule:PRU00023};
KW Reference proteome {ECO:0000313|Proteomes:UP000215902}.
FT REPEAT 42..74
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 75..107
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 217..249
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 253..286
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 287..320
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 321..353
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 354..386
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 387..419
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REGION 724..757
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 726..740
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1299 AA; 142145 MW; 559582B70F33BBCC CRC64;
MTDPPEPQKF ESVLESVLEL RDCGLLTEWL SRGGSPDEQD SDGRSFLHVA SFLGNAEAAR
LLVEASANVD LAAADGSTPL HEAVLGGSER LTRLLLRAGA SPAARTVSGA LPMHLAATIE
SGDDIARLLA GPTCAATDSR QQPPPVLGAA GPLPTPLFLC VLLGRAGAVD ALTAAGADPN
CCASKLELEA LWKLPKSSSL LFCLIIALIS GQWSLRDGGP AMSAAARLGN AEIVRILLSA
AADPDRRNPN CSLGSTPLHA ATQSRHCIDV IRQLISAGVN VDITAEDGFT PVFHAAASGY
APEAVAVLLR AGANVDSSLN TGGRPLHFAV WNGYQEVAKL LISARANVDA RNSDGDTPLH
HAIYKRRAEL VAMLLRAGAR VDLLNNRGQT PLALAVDKTS PEIVSELIRA GASADMSSNN
NKDLVKEALE CRQLSMAHML IDASAKIDEK KSSEWFKLAL MAGFTEALQL LIDAGVRLDQ
TEPMGLMLSF FTSFQENASV RKLLAAGADI PADLTSFFFA ISAPDPSNPA AAPTVRSAEE
QRQFSSRLSE AMRAAGFKQA NAAVQSGAAA ALQDILRDLM DDKDTFVFGS FADGWGSSLE
ILNGEISRDS DIDVTFMESG RLLCLDCGSA GWTACSGCGR DAIRVQCENG HAYYEHGSTG
PNMARNQPFE FTTIFSGTHV NIDLISAVTC CRYPRIELLQ LGYSNRRGQP SQMPQRILNQ
LEQEALSHSP KEDSEQEALS HSPKCHAVAA SPPHKPPGSC MRVSTTLLER AVMHSLTTVQ
GQFFILVKFL IKKVISIEMK VSGLKTYMAK NLLFYMLDET PEEEWKPDNL LQLVRQSLQL
LVAMMESSDS DTVCMKHFFL RDADVYFKKG QQPKKDIVDA VNGVIDELPR WLHQFQGQLR
ENASNSVRFQ PFQLLDEFCT SRDVQRYSGY SSICGLVRHC LLKLGSDRRG QSESEHLSEL
LQLIGELPDC ARSARESLRL MAHLKFHAPL DAAGAAAGLT GQESSGIECP ESPGGGSELL
SVDEAKELVW RHLHETDSAN VFHFFFQDKV DFGFLPAASK KYLHNLKLLL LDFFRLYDSL
TGRESALVER QNIIRNSPAT VSEVCQRSDA FNWAEMIDDG TFFNLFKFNL DVIHQANPEL
FQNFISLLLA NPKFRDRSSD LYIESFRRFY RQLPPEDIRY LRAAVGSPEE LMQEMLQLSF
DTPSRQPFQL LFSNEACRSL VAQSMIRALD KALAEAESLQ AGAEPGEMQQ SNCLLDEGAE
RERRLQNFLK LEQHWMQLMM GSSDSDTVGI HGLQEKPLD
//