ID A0A286XF52_CAVPO Unreviewed; 267 AA.
AC A0A286XF52;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE SubName: Full=Myeloid zinc finger 1 {ECO:0000313|Ensembl:ENSCPOP00000023984.1};
GN Name=MZF1 {ECO:0000313|Ensembl:ENSCPOP00000023984.1};
OS Cavia porcellus (Guinea pig).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Hystricomorpha; Caviidae;
OC Cavia.
OX NCBI_TaxID=10141 {ECO:0000313|Ensembl:ENSCPOP00000023984.1, ECO:0000313|Proteomes:UP000005447};
RN [1] {ECO:0000313|Proteomes:UP000005447}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=2N {ECO:0000313|Proteomes:UP000005447};
RX PubMed=21993624; DOI=10.1038/nature10530;
RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., Washietl S.,
RA Kheradpour P., Ernst J., Jordan G., Mauceli E., Ward L.D., Lowe C.B.,
RA Holloway A.K., Clamp M., Gnerre S., Alfoldi J., Beal K., Chang J.,
RA Clawson H., Cuff J., Di Palma F., Fitzgerald S., Flicek P., Guttman M.,
RA Hubisz M.J., Jaffe D.B., Jungreis I., Kent W.J., Kostka D., Lara M.,
RA Martins A.L., Massingham T., Moltke I., Raney B.J., Rasmussen M.D.,
RA Robinson J., Stark A., Vilella A.J., Wen J., Xie X., Zody M.C., Baldwin J.,
RA Bloom T., Chin C.W., Heiman D., Nicol R., Nusbaum C., Young S.,
RA Wilkinson J., Worley K.C., Kovar C.L., Muzny D.M., Gibbs R.A., Cree A.,
RA Dihn H.H., Fowler G., Jhangiani S., Joshi V., Lee S., Lewis L.R.,
RA Nazareth L.V., Okwuonu G., Santibanez J., Warren W.C., Mardis E.R.,
RA Weinstock G.M., Wilson R.K., Delehaunty K., Dooling D., Fronik C.,
RA Fulton L., Fulton B., Graves T., Minx P., Sodergren E., Birney E.,
RA Margulies E.H., Herrero J., Green E.D., Haussler D., Siepel A., Goldman N.,
RA Pollard K.S., Pedersen J.S., Lander E.S., Kellis M.;
RT "A high-resolution map of human evolutionary constraint using 29 mammals.";
RL Nature 478:476-482(2011).
RN [2] {ECO:0000313|Ensembl:ENSCPOP00000023984.1}
RP IDENTIFICATION.
RC STRAIN=2N {ECO:0000313|Ensembl:ENSCPOP00000023984.1};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00187}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAKN02042125; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; A0A286XF52; -.
DR Ensembl; ENSCPOT00000037269.1; ENSCPOP00000023984.1; ENSCPOG00000009921.4.
DR VEuPathDB; HostDB:ENSCPOG00000009921; -.
DR GeneTree; ENSGT00950000182890; -.
DR Proteomes; UP000005447; Unassembled WGS sequence.
DR Bgee; ENSCPOG00000009921; Expressed in frontal cortex and 13 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR CDD; cd07936; SCAN; 1.
DR Gene3D; 1.10.4020.10; DNA breaking-rejoining enzymes; 1.
DR InterPro; IPR003309; SCAN_dom.
DR InterPro; IPR038269; SCAN_sf.
DR PANTHER; PTHR45935; PROTEIN ZBED8-RELATED; 1.
DR PANTHER; PTHR45935:SF27; SCAN BOX DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF02023; SCAN; 1.
DR SMART; SM00431; SCAN; 1.
DR SUPFAM; SSF47353; Retrovirus capsid dimerization domain-like; 1.
DR PROSITE; PS50804; SCAN_BOX; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU00187};
KW Reference proteome {ECO:0000313|Proteomes:UP000005447}.
FT DOMAIN 39..121
FT /note="SCAN box"
FT /evidence="ECO:0000259|PROSITE:PS50804"
FT REGION 1..21
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 139..164
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 178..227
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 246..267
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 267 AA; 29842 MW; 12AE519C99B4FE58 CRC64;
MRLTMLGPPE DNEPVMVKLE DSEEEEEAAL WDLEPEAARL RFRGFCYEEA VGPQETLVQL
RELCHQWLQP ELHSKEQIME LLVLEQFLGV LPPEIQAQVQ ERQPSSPGEA ADLVDRLRWE
LGGPRRWVTV QVQGQEVLSE KMEPSSFQPL PQTPDPGRET PPGAVEELPL AFQVKEEPEV
TEEPELLESG PLPVPALLPE AQGYETALEL TSPHSDTRPE GPSWREQPRA LWHEDSGGLF
FPLCDPNRFR GAGSTEPTPP ASLGLGG
//