ID A7S5M3_NEMVE Unreviewed; 812 AA.
AC A7S5M3;
DT 02-OCT-2007, integrated into UniProtKB/TrEMBL.
DT 02-OCT-2007, sequence version 1.
DT 27-MAR-2024, entry version 68.
DE RecName: Full=PAX-interacting protein 1 {ECO:0000256|ARBA:ARBA00023858};
DE AltName: Full=PAX transactivation activation domain-interacting protein {ECO:0000256|ARBA:ARBA00030146};
GN ORFNames=NEMVEDRAFT_v1g207173 {ECO:0000313|EMBL:EDO41034.1};
OS Nematostella vectensis (Starlet sea anemone).
OC Eukaryota; Metazoa; Cnidaria; Anthozoa; Hexacorallia; Actiniaria;
OC Edwardsiidae; Nematostella.
OX NCBI_TaxID=45351 {ECO:0000313|EMBL:EDO41034.1, ECO:0000313|Proteomes:UP000001593};
RN [1] {ECO:0000313|EMBL:EDO41034.1, ECO:0000313|Proteomes:UP000001593}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CH2 X CH6 {ECO:0000313|Proteomes:UP000001593};
RX PubMed=17615350; DOI=10.1126/science.1139158;
RA Putnam N.H., Srivastava M., Hellsten U., Dirks B., Chapman J., Salamov A.,
RA Terry A., Shapiro H., Lindquist E., Kapitonov V.V., Jurka J.,
RA Genikhovich G., Grigoriev I.V., Lucas S.M., Steele R.E., Finnerty J.R.,
RA Technau U., Martindale M.Q., Rokhsar D.S.;
RT "Sea anemone genome reveals ancestral eumetazoan gene repertoire and
RT genomic organization.";
RL Science 317:86-94(2007).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DS469583; EDO41034.1; -; Genomic_DNA.
DR RefSeq; XP_001633097.1; XM_001633047.1.
DR AlphaFoldDB; A7S5M3; -.
DR STRING; 45351.A7S5M3; -.
DR EnsemblMetazoa; EDO41034; EDO41034; NEMVEDRAFT_v1g207173.
DR eggNOG; KOG2043; Eukaryota.
DR HOGENOM; CLU_009382_0_0_1; -.
DR InParanoid; A7S5M3; -.
DR OMA; HVICESM; -.
DR PhylomeDB; A7S5M3; -.
DR Proteomes; UP000001593; Unassembled WGS sequence.
DR GO; GO:0044666; C:MLL3/4 complex; IBA:GO_Central.
DR CDD; cd17710; BRCT_PAXIP1_rpt2; 1.
DR CDD; cd17711; BRCT_PAXIP1_rpt3; 1.
DR CDD; cd17730; BRCT_PAXIP1_rpt4; 1.
DR CDD; cd17712; BRCT_PAXIP1_rpt5; 1.
DR CDD; cd18432; BRCT_PAXIP1_rpt6_like; 1.
DR Gene3D; 3.40.50.10190; BRCT domain; 5.
DR InterPro; IPR001357; BRCT_dom.
DR InterPro; IPR036420; BRCT_dom_sf.
DR PANTHER; PTHR23196; PAX TRANSCRIPTION ACTIVATION DOMAIN INTERACTING PROTEIN; 1.
DR PANTHER; PTHR23196:SF37; PAX-INTERACTING PROTEIN 1; 1.
DR Pfam; PF00533; BRCT; 1.
DR Pfam; PF16589; BRCT_2; 1.
DR Pfam; PF12738; PTCB-BRCT; 2.
DR Pfam; PF16770; RTT107_BRCT_5; 1.
DR SMART; SM00292; BRCT; 4.
DR SUPFAM; SSF52113; BRCT domain; 5.
DR PROSITE; PS50172; BRCT; 4.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000001593}.
FT DOMAIN 123..187
FT /note="BRCT"
FT /evidence="ECO:0000259|PROSITE:PS50172"
FT DOMAIN 371..428
FT /note="BRCT"
FT /evidence="ECO:0000259|PROSITE:PS50172"
FT DOMAIN 448..521
FT /note="BRCT"
FT /evidence="ECO:0000259|PROSITE:PS50172"
FT DOMAIN 595..676
FT /note="BRCT"
FT /evidence="ECO:0000259|PROSITE:PS50172"
FT REGION 210..237
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 559..580
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 812 AA; 89126 MW; 84D94D7A860C0D32 CRC64;
MAGLSNSPKQ LFSDVHFFIT SSCSEALFVL KLGASGIKGA PPPSQSFFAC YGTASEAIAL
IQNLKKVKQQ LKQGGARREF YISDLVTHVI TDSADFPQHS EAEEFNLPIV LIPSGDLNIL
WAMVTYHGGS VQLSLTSDCT HLVIPKPIGE KYKCALKYPG LIKIVTPSWL IDSIRKDKLL
PEEDYMPAEP PAPSPSVTAK NMEINQSPAA TTDVNTQGSS ANENIQCKTS PHDTIPPPTT
VDVGMTSLET KPPVVTKLVT STAEIATSLA STQANSTPAF TTLSNQPFTM SGAETRTPMQ
MPRQSEGHAT AAAVIETPSN VALDSTEKKG QTEGPPCVQS NGQETESHWC VFAVSDYQDL
MEHSALATWT EVIELHGGTV SPFYTPRCTH LICLHQCGKL FSLALKDGKI IVTAHWLNDV
LLKGSLFPPC NPLHLPTPFE KKLPECAGMS ISVTGFTGQE RQLVRNMVFM IGANYTGYLM
RTNTHLVCRS SESLKYKKAR EWGIPCINAK WLSDIVSGGK VPPCALTRYS QYGTSDELSI
TSSLVEPLLE PWKDFKPDTS MDYPQKRPAE EIDENRNAKK RRLTESSTVE INSNTPHVLF
TGLPQSHVYQ LQRMVLNLGG KLAESPQTCT HLVTNKIVRT VKFLSAISVC QHLVTTAWLQ
KSREVKHFVD PSLYPLQDLA SEKEYGIDIK QSLKRARERR CLQGIQVYVS PNVEPCPASM
KEIIESAGGQ MLTNIPSKQS LTTLKTLKTN EGHPALVVVT CPTDVAKCID FIRCGYASLI
TFLEEARGCS FDQEKHIHIS RLLPYLAGFS GL
//