ID G3WZA2_SARHA Unreviewed; 627 AA.
AC G3WZA2;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 65.
DE SubName: Full=Peroxisomal biogenesis factor 5 like {ECO:0000313|Ensembl:ENSSHAP00000020757.2};
GN Name=PEX5L {ECO:0000313|Ensembl:ENSSHAP00000020757.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000020757.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000020757.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000020757.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the peroxisomal targeting signal receptor
CC family. {ECO:0000256|ARBA:ARBA00005348}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3WZA2; -.
DR STRING; 9305.ENSSHAP00000020757; -.
DR Ensembl; ENSSHAT00000020923.2; ENSSHAP00000020757.2; ENSSHAG00000017604.2.
DR eggNOG; KOG1125; Eukaryota.
DR GeneTree; ENSGT00940000155931; -.
DR HOGENOM; CLU_013516_5_0_1; -.
DR TreeFam; TF315044; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 1.
DR InterPro; IPR024111; PEX5/PEX5L.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR InterPro; IPR019734; TPR_repeat.
DR PANTHER; PTHR10130; PEROXISOMAL TARGETING SIGNAL 1 RECEPTOR PEX5; 1.
DR PANTHER; PTHR10130:SF1; PEX5-RELATED PROTEIN; 1.
DR Pfam; PF13432; TPR_16; 1.
DR Pfam; PF13181; TPR_8; 2.
DR SMART; SM00028; TPR; 5.
DR SUPFAM; SSF48452; TPR-like; 1.
DR PROSITE; PS50005; TPR; 3.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW TPR repeat {ECO:0000256|PROSITE-ProRule:PRU00339}.
FT REPEAT 361..394
FT /note="TPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
FT REPEAT 475..508
FT /note="TPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
FT REPEAT 509..542
FT /note="TPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
FT REGION 120..168
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 187..237
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 131..148
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 187..203
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 204..230
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 627 AA; 69901 MW; C29C38CA14819E7D CRC64;
MDFWRLGKKS KEKGYGKLSS DEDIEIIVDQ NQGKSSRAAD KAVAMVMKEI PREESAEEKP
LLTMTSQLVN EQQESRPLLS PSIDDFLCET KAEAIARPVT SNTAVLSTGL DLLDLSDPVS
QTQNKTKKLE ATPKSSSHKK KADGSDLIST DAEQRGQPLR IPETSSLDLD IQTQLDKWDE
VKFHGDRNSK SHAMAERKST SSRTGSKELL WSSESRSQPE LTSGKSALNS ESAPDLDLVP
PAQARLTKEQ RWGSALLSRN HSLEEEFERA KAAVESDTEF WDKMQAEWEE MARRNWISEN
QEAQNQVTIS ASEKGYYFHT ENPFKDWPGA FEEGLKRLKE GDLPVTILFM EAAILQDPGD
AEAWQFLGVT QAENENEQAA IVALQRCLEL QPNNLKALMA LAVSYTNTGH QQDACSALKN
WIRQNPKYKY LVKNKKASPG PTRRMSKSPV DSSVLEGVKE LYLEAAHQNG EMIDPDLQTG
LGVLFHLSGE FSRAIDAFNA ALTVRPEDYS LWNRLGATLA NGDRSEEAVE AYTRALEIQP
GFIRSRYNLG ISCINLGAYR EAVSNFLTAL SLQRKSRNQQ QVPHPAISGN IWAALRIALS
MMDQPELFQA ANLGDLDVLL RAFNLDP
//