ID A0A0L0DPP5_THETB Unreviewed; 3703 AA.
AC A0A0L0DPP5;
DT 11-NOV-2015, integrated into UniProtKB/TrEMBL.
DT 11-NOV-2015, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE RecName: Full=PA14 domain-containing protein {ECO:0000259|PROSITE:PS51820};
GN ORFNames=AMSG_09648 {ECO:0000313|EMBL:KNC53996.1};
OS Thecamonas trahens ATCC 50062.
OC Eukaryota; Apusozoa; Apusomonadida; Apusomonadidae; Thecamonas.
OX NCBI_TaxID=461836 {ECO:0000313|EMBL:KNC53996.1, ECO:0000313|Proteomes:UP000054408};
RN [1] {ECO:0000313|EMBL:KNC53996.1, ECO:0000313|Proteomes:UP000054408}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 50062 {ECO:0000313|EMBL:KNC53996.1,
RC ECO:0000313|Proteomes:UP000054408};
RG The Broad Institute Genome Sequencing Platform;
RA Russ C., Cuomo C., Shea T., Young S.K., Zeng Q., Koehrsen M., Haas B.,
RA Borodovsky M., Guigo R., Alvarado L., Berlin A., Bochicchio J.,
RA Borenstein D., Chapman S., Chen Z., Freedman E., Gellesch M., Goldberg J.,
RA Griggs A., Gujja S., Heilman E., Heiman D., Hepburn T., Howarth C., Jen D.,
RA Larson L., Mehta T., Park D., Pearson M., Roberts A., Saif S., Shenoy N.,
RA Sisk P., Stolte C., Sykes S., Thomson T., Walk T., White J., Yandava C.,
RA Burger G., Gray M.W., Holland P.W.H., King N., Lang F.B.F., Roger A.J.,
RA Ruiz-Trillo I., Lander E., Nusbaum C.;
RT "The Genome Sequence of Thecamonas trahens ATCC 50062.";
RL Submitted (MAY-2010) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL349484; KNC53996.1; -; Genomic_DNA.
DR RefSeq; XP_013754197.1; XM_013898743.1.
DR STRING; 461836.A0A0L0DPP5; -.
DR EnsemblProtists; KNC53996; KNC53996; AMSG_09648.
DR GeneID; 25568067; -.
DR OrthoDB; 462435at2759; -.
DR Proteomes; UP000054408; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:InterPro.
DR GO; GO:0007154; P:cell communication; IEA:InterPro.
DR Gene3D; 2.60.40.2030; -; 10.
DR InterPro; IPR038081; CalX-like_sf.
DR InterPro; IPR003644; Calx_beta.
DR InterPro; IPR037524; PA14/GLEYA.
DR PANTHER; PTHR24216:SF65; PAXILLIN-LIKE PROTEIN 1; 1.
DR PANTHER; PTHR24216; PAXILLIN-RELATED; 1.
DR Pfam; PF03160; Calx-beta; 11.
DR SUPFAM; SSF141072; CalX-like; 11.
DR PROSITE; PS51820; PA14; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000054408};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..18
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 19..3703
FT /note="PA14 domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5005537722"
FT DOMAIN 471..643
FT /note="PA14"
FT /evidence="ECO:0000259|PROSITE:PS51820"
FT REGION 1971..2009
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2256..2294
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2541..2579
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2826..2864
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3111..3149
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3396..3434
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3684..3703
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1978..2007
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2263..2292
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2548..2577
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2833..2862
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3118..3147
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3403..3432
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3688..3703
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3703 AA; 375549 MW; D64CF13DA1BE19EB CRC64;
MAGIILAIMV VLATTVKGDG AYDTCAVASI SGRLWSRQWL FAASSYTQAD ATDVSMWAIF
DTVLGGRLVE TRFEGLAGFV YELGATKLGI PSKTPSYKSA LTSEGHTISP EIPLYLKYPT
HVELGNTLPE VIFDGVVYVS VGCGEYSAEC QTLVEGQIEP RFRFETDAGG GRYRIVCDTS
GDGIFNIVDG SDTVLSGDTV DGVNEVPWDG KDASGVQVAD GNYICEVWVL LAEVHFPLVD
VETVYPGMRM YSYFNGVAKP QVMYWSDEDV NPAINMPNGE PGATTSGCCG VNSTYDTNAV
PNVNARAWGN FVAGSKGDQA IINTFSFVER SVSLPISISV GATEACPEPT VRVLDKTIVE
GDSGSTDVVI DLVLTSAISS GIAISYTTTD GTANGGLDYV PNSGVINMTP GSYCYSVTVV
LTGDTDLEND EIFYFDISQM SGPPVTIAKP RGVITILNDD DSCPVVSTPP LEWPGLLMEV
FDCVDGESIA SLTSSSSYID RKPGCIDSQV TTLSAAPAMG ACKQTLHASV DYGFSLSGWL
RVGATNTYTF YVAADFAIEL WVGANEASLA RVAYNEAPNA GPTAYTETAS QASTPMALSL
GDLVYIRVLY KEGDEDYPAD DFFSVAWSGS GFGDTTEVIA SGGSYGETLF HNATKAVALE
DASSCCTSRI VETTVESVRL DSDPCSGALS ANITSTIPLT PQPASVSIGF APDAPREFEY
VQDDLFSDIG AASTSGQVFA GRVRQPRTVK VRISSGARNA FERSNDGLVV PNFRVLGMST
WTTGLIFDGV DVPPGATITN AYIAMEGDND NGQDTGTGVN IPIYAFAADT AGAMPTNNYG
ITSLAPTGAV VTWTDVPAFI TGVDYVSPDI SPVVQELVSR PGWRSGNDML IVFGLSTGTA
NLRVIASFEN PTSAPLLVIE YHTGFGSADT YTGAVYLADA SDTSVTNAQQ LSGGFRMSFV
LAQAASYTVT LQHALVANNL DAVGERVSSL LKINGGYVGN FVGSEGVSVL EELTSSGTSA
PSTSSFVIAL PSGANTIEFG AYLSSRSSSD DYAVALFDQF GLSTDSYVTQ CGAYDLSYNY
EWWAPSCSGT GPCSSTSSVS GICDTSVQLF ANNEKCCIWI DVAVPKEIPS VGLNVSTAAA
SEPGAGFVDI EIGLTLAFDA EIDIQVTLAA LNGTAFAGVD FDLLSPVVNF PAGVETVPVI
VRIYQDNVYE YWMESAPAET FDIQIVDTPT VSTTEPTDES GSRNVSITIV MGSPAPSPVT
VDVVTSVAVG SLATGDVDFE VVTTTVTIPG GSSSAVFDAV VYGDAWYDDG ETFDVTLTGV
SGPATLGGTL TTVVTIGDSG PAPTLTTRVV SGADASTGSG SQSVTADVSE GDATSKDVVL
ELALSRPAEH PTVVDVVVYP TSTATNGTDV DVLSTEGRNR GPFTIGGLVS TYNVSAVIYG
ETVVEGTEVF DFGLVATYGV SGVTSTSNAS VSIVGMAVAA VSTTEPTDES GSRNVSITIV
MGSPAPSPVT VDVVTSVAVG SLATGDVDFE VVTTTVTIPG GSSSAVFDAV VYGDAWYDDG
ETFDVTLTGV SGPATLGGTL TTVVTIGDSG PAPTLTTRVV SGADASTGSG SQSVTADVSE
GDATSKDVVL ELALSRPAEH PTVVDVVVYP TSTATNGTDV DVLSTEGRNR GPFTIGGLVS
TYNVSAVIYG ETVVEGTEVF DFGLVATYGV SGVTSTSNAS VSIVGMAVAA VSTTEPTDES
GSRNVSITIV MGSPAPSPVT VDVVTSVAVG SLATGDVDFE VVTTTVTIPG GSSSAVFDAV
VYGDAWYDDG ETFDVTLTGV SGPATLGGTL TTVVTIGDSG PAPTLTTRVV SGADASTGSG
SQSVTADVSE GDATSKDVVL ELALSRPAEH PTVVDVVVYP TSTATNGTDV DVLSTEGRNR
GPFTIGGLVS TYNVSAVIYG ETVVEGTEVF DFGLVATYGV SGVTSTSNAS VSIVNDDFSP
PPPSPPPPPP PPSPPPPPPP SPPPPFGQVG MAVAAVSTTE PTDESGSRNV SITIVMGSPA
PSPVTVDVVT SVAVGSLATG DVDFEVVTTT VTIPGGSSSA VFDAVVYGDA WYDDGETFDV
TLTGVSGPAT LGGTLTTVVT IGDSGPAPTL TTRVVSGADA STGSGSQSVT ADVSEGDATS
KDVVLELALS RPAEHPTVVD VVVYPTSTAT NGTDVDVLST EGRNRGPFTI GGLVSTYNVS
AVIYGETVVE GTEVFDFGLV ATYGVSGVTS TSNASVSIVN DDFSPPPPSP PPPPPPPSPP
PPPPPSPPPP FGQVGMAVAA VSTTEPTDES GSRNVSITIV MGSPAPSPVT VDVVTSVAVG
SLATGDVDFE VVTTTVTIPG GSSSAVFDAV VYGDAWYDDG ETFDVTLTGV SGPATLGGTL
TTVVTIGDSG PAPTLTTRVV SGADASTGSG SQSVTADVSE GDATSKDVVL ELALSRPAEH
PTVVDVVVYP TSTATNGTDV DVLSTEGRNR GPFTIGGLVS TYNVSAVIYG ETVVEGTEVF
DFGLVATYGV SGVTSTSNAS VSIVNDDFSP PPPSPPPPPP PPSPPPPPPP SPPPPFGQVG
MAVAAVSTTE PTDESGSRNV SITIVMGSPA PSPVTVDVVT SVAVGSLATG DVDFEVVTTT
VTIPGGSSSA VFDAVVYGDA WYDDGETFDV TLTGVSGPAT LGGTLTTVVT IGDSGPAPTL
TTRVVSGADA STGSGSQSVT ADVSEGDATS KDVVLELALS RPAEHPTVVD VVVYPTSTAT
NGTDVDVLST EGRNRGPFTI GGLVSTYNVS AVIYGETVVE GTEVFDFGLV ATYGVSGVTS
TSNASVSIVN DDFSPPPPSP PPPPPPPSPP PPPPPSPPPP FGQVGMAVAA VSTTEPTDES
GSRNVSITIV MGSPAPSPVT VDVVTSVAVG SLATGDVDFE VVTTTVTIPG GSSSAVFDAV
VYGDAWYDDG ETFDVTLTGV SGPATLGGTL TTVVTIGDSG PAPTLTTRVV SGADASTGSG
SQSVTADVSE GDATSKDVVL ELALSRPAEH PTVVDVVVYP TSTATNGTDV DVLSTEGRNR
GPFTIGGLVS TYNVSAVIYG ETVVEGTEVF DFGLVATYGV SGVTSTSNAS VSIVNDDFSP
PPPSPPPPPP PPSPPPPPPP SPPPPFGQVG MAVAAVSTTE PTDESGSRNV SITIVMGSPA
PSPVTVDVVT SVAVGSLATG DVDFEVVTTT VTIPGGSSSA VFDAVVYGDA WYDDGETFDV
TLTGVSGPAT LGGTLTTVVT IGDSGPAPTL TTRVVSGADA STGSGSQSVT ADVSEGDATS
KDVVLELALS RPAEHPTVVD VVVYPTSTAT NGTDVDVLST EGRNRGPFTI GGLVSTYNVS
AVIYGETVVE GTEVFDFGLV ATYGVSGVTS TSNASVSIVN DDFSPPPPSP PPPPPPPSPP
PPPPPSPPPP FGQVGMAVAA VSTTEPTDES GSRNVSITIV MGSPAPSPVT VDVVTSVAVG
SLATGDVDFE VVTTTVTIPG GSSSAVFDAV VYGDAWYDDG ETFDVTLTGV SGPATLGGTL
TTVVTIGDSG PAPTLTTRVV SGADASTGSG SQSVTADVSE GDATSKDVVL ELALSRPAEH
PTVVDVVVYP TSTATNGTDV DVLSTEGRNR GPFTIGGLVS TYNVSAVIYG ETVVEGTEVF
DFGLVATYGV SGVTSTSNAS VSIVNDDFSP PPPSPSLPLR LRT
//