ID G3W2C2_SARHA Unreviewed; 378 AA.
AC G3W2C2;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 63.
DE SubName: Full=Ly1 antibody reactive {ECO:0000313|Ensembl:ENSSHAP00000009577.2};
GN Name=LYAR {ECO:0000313|Ensembl:ENSSHAP00000009577.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000009577.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000009577.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000009577.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000256|ARBA:ARBA00004496}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_012407796.1; XM_012552342.1.
DR AlphaFoldDB; G3W2C2; -.
DR STRING; 9305.ENSSHAP00000009577; -.
DR Ensembl; ENSSHAT00000009661.2; ENSSHAP00000009577.2; ENSSHAG00000029797.1.
DR GeneID; 100918492; -.
DR CTD; 55646; -.
DR eggNOG; KOG2186; Eukaryota.
DR GeneTree; ENSGT00390000003477; -.
DR HOGENOM; CLU_057137_0_1_1; -.
DR OrthoDB; 5490568at2759; -.
DR TreeFam; TF314925; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR Gene3D; 3.30.1490.490; -; 1.
DR InterPro; IPR039999; LYAR.
DR InterPro; IPR041010; Znf-ACC.
DR InterPro; IPR014898; Znf_C2H2_LYAR.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR PANTHER; PTHR13100:SF10; CELL GROWTH-REGULATING NUCLEOLAR PROTEIN; 1.
DR PANTHER; PTHR13100; CELL GROWTH-REGULATING NUCLEOLAR PROTEIN LYAR; 1.
DR Pfam; PF17848; zf-ACC; 1.
DR Pfam; PF08790; zf-LYAR; 1.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 2.
DR PROSITE; PS51804; ZF_C2HC_LYAR; 2.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU01145}.
FT DOMAIN 3..28
FT /note="Acetyl-coA carboxylase zinc finger"
FT /evidence="ECO:0000259|Pfam:PF17848"
FT DOMAIN 31..58
FT /note="Zinc finger C2H2 LYAR-type"
FT /evidence="ECO:0000259|Pfam:PF08790"
FT REGION 145..305
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 165..232
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 240..305
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 378 AA; 43567 MW; AAC829315FB251EE CRC64;
MVFFTCNACG ESVKKGQVEK HVNICKNCQY LSCIDCGKDF WGDEYKKHVK CITEDQKYGG
KGYEVKTSKG DVKQQEWIQK IHEVMKKPSV SPKLREILEK VSGFDNIPRK KAKFQNWMKN
SLKIYNDSLQ EQVWDIFSEA TRNVENQHSE NKVKAAPPEG GEKPAGQPEV KKNKRERKEE
RQKSRKKEKK ELKLENHQEN SSDPKSKKRK RAQAEGDAGR EEKPEISNAE KRKNKKKKAK
KECAPDAEVN GDGSQDGERM AEEAPRAPEK RKRKHSEEEA DSKKKKMRPT EEASVGDEEA
EAPVKGKFNW KGTIKAVLKQ APDNEISVKK LRKKVLAQYY AMTSDHHKSE EELLVIFNKK
ISKNPTFRLL KEKVKLLK
//