ID A0A267E523_9PLAT Unreviewed; 1548 AA.
AC A0A267E523;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 27-MAR-2024, entry version 17.
DE RecName: Full=MACPF domain-containing protein {ECO:0000259|PROSITE:PS51412};
GN ORFNames=BOX15_Mlig029702g1 {ECO:0000313|EMBL:PAA55987.1};
OS Macrostomum lignano.
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes;
OC Rhabditophora; Macrostomorpha; Macrostomida; Macrostomidae; Macrostomum.
OX NCBI_TaxID=282301 {ECO:0000313|EMBL:PAA55987.1, ECO:0000313|Proteomes:UP000215902};
RN [1] {ECO:0000313|EMBL:PAA55987.1, ECO:0000313|Proteomes:UP000215902}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DV1 {ECO:0000313|EMBL:PAA55987.1};
RC TISSUE=Whole organism {ECO:0000313|EMBL:PAA55987.1};
RA Berezikov E.;
RT "A platform for efficient transgenesis in Macrostomum lignano, a flatworm
RT model organism for stem cell research.";
RL Submitted (JUN-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PAA55987.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NIVC01002678; PAA55987.1; -; Genomic_DNA.
DR STRING; 282301.A0A267E523; -.
DR Proteomes; UP000215902; Unassembled WGS sequence.
DR CDD; cd00185; TNFRSF; 1.
DR Gene3D; 2.10.50.10; Tumor Necrosis Factor Receptor, subunit A, domain 2; 7.
DR InterPro; IPR006212; Furin_repeat.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR020864; MACPF.
DR InterPro; IPR001368; TNFR/NGFR_Cys_rich_reg.
DR InterPro; IPR011641; Tyr-kin_ephrin_A/B_rcpt-like.
DR PANTHER; PTHR46967; INSULIN-LIKE GROWTH FACTOR BINDING PROTEIN,N-TERMINAL; 1.
DR PANTHER; PTHR46967:SF2; KERATIN-ASSOCIATED PROTEIN 16-1-LIKE; 1.
DR Pfam; PF07699; Ephrin_rec_like; 5.
DR Pfam; PF01823; MACPF; 1.
DR SMART; SM01411; Ephrin_rec_like; 8.
DR SMART; SM00261; FU; 4.
DR SMART; SM00208; TNFR; 3.
DR SUPFAM; SSF57184; Growth factor receptor domain; 3.
DR PROSITE; PS51412; MACPF_2; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000215902};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..20
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 21..1548
FT /note="MACPF domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012921700"
FT DOMAIN 171..595
FT /note="MACPF"
FT /evidence="ECO:0000259|PROSITE:PS51412"
FT REGION 318..346
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 471..517
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 471..490
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 500..517
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1548 AA; 167724 MW; 98A7D4FC5F64544B CRC64;
MHACLAAVIA IALLGFQVEA AEKDTSKTVD ANALEQYGPA VDAIADAISN SDKKTKDLLD
DLTQSTDGQA TPLITKEILK NMLNERSAKT QKAGTRSISL QAEQGFNGFS NYQPSGWVMT
ETVGEPDKVT EDAEDKMPVY VYPMGVGPVL MNGCTGASYP KTSKCYSSKV PGNGNTICMK
LLDAAAYIGV GFDGRGFYKI ESRKASLVQR YCSNMAKFLG KDVPDTMNVF GIFETTAETK
SFRSADEYMS YIRDKAGLSE QRNLFLSEAR QHGTSEASGF NLGAIGGIVG GAIGAIAGGP
AGAALGATLG GAVGGSISSG SGESSGDSSS TTSSQSASKN SQYSQAARKQ GSTEDFFAIM
YVSVIMYEIS LDDVKPNDIN FAALRDYMSL PESYFSVGAA SKFQDFLLRW GTHFIRSGKF
GGQLKITKRA KTNKFASSEE FASLAETEFQ SMFSSLQSKY SQQTTRTGFL GFGGKKTTTT
KSSSAKSESQ YRSTRNERNQ ESHSSSGSEF TQTTVEAQGG TPEIAEALVD FYTPAFKQLY
HRWIASINDY VKPFEFKLKP IDQLFDMNMD DLFPAGNRDF GCVGSAGSKV NILTEPKTRR
RYYLDEETYF ANNKTKTKLV KIYCKYKNQQ DFQNALKRRR LSLEKAVAAY MTEGPFPTSS
HELKAGQPGC ESSTLAKIDV FNSNNNTLTF LQLKKSPFLI AFDLPQDIPN LLKSQAKYVM
TYLKDRWFAT EPGHHHAHLY EGCQLEGFKP AETKICIWSI ILTYEEETGF FHIDPLDFIA
SKRRNPNLPD WIEGTTFARA SRIEGQKGAN STAAASIPCN VRWLNSHRLD PDREENCLYF
TAATAGDIFV IFASIPRDHT TWYYLQINMD GVTFYKAMKP QKRDYQKGMG TLGDKNLYQS
YFLCANQANG SILMQFGKAG SNSEIGTVYS GYKFRSDSQY SRLSFYTFGA GSKSVSLMDI
HLTKDMPQVE CLNNNYKMVD GHCILDCHKE CIDCNLAKDD TSCFQCRNVR VELWNSIGGR
RVRTVQCLPE CPVGYQLSDS KENLCVPCQP GTFKNTTGNH NCLACAPGCF SASQATVNCM
LCSPGNFASK PGSVQCQQCH VGTASSFNGS TSCGACSAGL FAAQRGQTEC TPCPVGQFNK
AERQTQCQLC SAGFFADKPG QKSCSACPAG TSSAEGSVSC ANNCKRGEFS PAEGIPCAKC
PAGSFTPSAG SVNCTSCAEG SYSSSARSAS CIPCPAGSFA DQISSTNCSQ CPAGSFSPSA
GSVNCTFCAK GSYSSRAGSA SCTPCPRGFY QPLANQTHCK SAMPGWYVDT EDRSEQRMCP
LGKYSGAAAS ECTVCPSGTF ADRVGSTNCS QCPAGSWSNA GSASCQLCKG GSFSVGNAAN
CTLCPAGTFS NSSGSSECSK CPAGTYASES GSQKCEYCNL FSLSRHSGMT HCDQCPDSGS
TFGTDTCYPN LRMRCWISFG ECEKTIDGDL DTYGTAIMSD HVPKGAEIFR TYFGYKLDRV
SSVKGFLIWR NKKSRMSFKN LMLKLLNSVV DCTFHTTKPA LVRILNMW
//