ID W5L2R2_ASTMX Unreviewed; 674 AA.
AC W5L2R2;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 2.
DT 27-MAR-2024, entry version 47.
DE SubName: Full=Si:dkeyp-117b8.4 {ECO:0000313|Ensembl:ENSAMXP00000014124.2};
OS Astyanax mexicanus (Blind cave fish) (Astyanax fasciatus mexicanus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Characiformes;
OC Characoidei; Characidae; Astyanax.
OX NCBI_TaxID=7994 {ECO:0000313|Ensembl:ENSAMXP00000014124.2, ECO:0000313|Proteomes:UP000018467};
RN [1] {ECO:0000313|Proteomes:UP000018467}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RA Jeffery W., Warren W., Wilson R.K.;
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000018467}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RX PubMed=25329095; DOI=10.1038/ncomms6307;
RA McGaugh S.E., Gross J.B., Aken B., Blin M., Borowsky R., Chalopin D.,
RA Hinaux H., Jeffery W.R., Keene A., Ma L., Minx P., Murphy D., O'Quin K.E.,
RA Retaux S., Rohner N., Searle S.M., Stahl B.A., Tabin C., Volff J.N.,
RA Yoshizawa M., Warren W.C.;
RT "The cavefish genome reveals candidate genes for eye loss.";
RL Nat. Commun. 5:5307-5307(2014).
RN [3] {ECO:0000313|Ensembl:ENSAMXP00000014124.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; W5L2R2; -.
DR Ensembl; ENSAMXT00000014124.2; ENSAMXP00000014124.2; ENSAMXG00000013722.2.
DR eggNOG; KOG1121; Eukaryota.
DR GeneTree; ENSGT00940000158431; -.
DR HOGENOM; CLU_009123_12_2_1; -.
DR InParanoid; W5L2R2; -.
DR Proteomes; UP000018467; Unassembled WGS sequence.
DR Bgee; ENSAMXG00000013722; Expressed in brain and 14 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR InterPro; IPR008906; HATC_C_dom.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR003656; Znf_BED.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR PANTHER; PTHR46169; DNA REPLICATION-RELATED ELEMENT FACTOR, ISOFORM A; 1.
DR PANTHER; PTHR46169:SF23; INNER CENTROMERE PROTEIN A-LIKE ISOFORM X1-RELATED; 1.
DR Pfam; PF05699; Dimer_Tnp_hAT; 1.
DR Pfam; PF02892; zf-BED; 1.
DR SMART; SM00614; ZnF_BED; 1.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 1.
DR SUPFAM; SSF140996; Hermes dimerisation domain; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50808; ZF_BED; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000018467};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00027}.
FT DOMAIN 30..80
FT /note="BED-type"
FT /evidence="ECO:0000259|PROSITE:PS50808"
FT REGION 638..658
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 674 AA; 75422 MW; A46BAEB8E92CF85B CRC64;
MELPIQRRPG VLGTPILRRR MDRMLPCTER RTSKVWDHYT QLSMYRVECN HCKRQLSFHN
STTSMREHLG RKHSIRDGAV PPPNPANVHN PAALHQQMVQ PNILNSRFAT GCSDKRAGVF
TDLILEMVFR DLQPLSVVEE RGFRLLLSCL EPNYPVPSPS LLGSLLWHRY HILKQCLQRH
LQTGLAPRCL ALCTEHWRSV EGCGVDGSGQ SYLTVSAHFV DSNWRLARCV LETRPIPEYK
RNNGLAKFAD ILKAVLSEFN LPENYVFCAL KLCVQEGLYV ETVRQALADA RGIVLHFQHD
AKAAAALNQK AEAANKGAAR LVLDDPSRWA TAIDMCESLL ELKWVVSSVL EEQKAAPNLA
DHQWRLLHEL VPVLRTVRIA ASFLSEDINA AISALMPCLQ GVSRLLGQNM AECSCPVVRG
VMERIRTGME KRWRLSDEEA LLESPAVLSS FLDPRFKEMR FLSPHARSKL HDKVKELLSV
QAFNDDGTVD QEIDSGLEVG RGNEAEGGPA VLGLDDPLPI PTVASLDSPE SCPSAEEGDS
IELQQNENFA GVSSPEQVNE NVLVGLSPLH WWRNKEHRFP AVARLARKYL AIPATGIPAD
RAFAPRESAI AHRRAMMGPK HLDHVLFLHQ NCDYVEQLKG GSSGHRENDH NSNVSGNQSR
ESLYQTLVSY DNKV
//