ID G3S2M0_GORGO Unreviewed; 282 AA.
AC G3S2M0;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 28-FEB-2018, sequence version 2.
DT 27-MAR-2024, entry version 53.
DE SubName: Full=Centromere protein V like 2 {ECO:0000313|Ensembl:ENSGGOP00000022318.2};
OS Gorilla gorilla gorilla (Western lowland gorilla).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Gorilla.
OX NCBI_TaxID=9595 {ECO:0000313|Ensembl:ENSGGOP00000022318.2, ECO:0000313|Proteomes:UP000001519};
RN [1] {ECO:0000313|Ensembl:ENSGGOP00000022318.2, ECO:0000313|Proteomes:UP000001519}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Scally A.;
RT "Insights into the evolution of the great apes provided by the gorilla
RT genome.";
RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSGGOP00000022318.2, ECO:0000313|Proteomes:UP000001519}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=22398555; DOI=10.1038/nature10842;
RA Scally A., Dutheil J.Y., Hillier L.W., Jordan G.E., Goodhead I.,
RA Herrero J., Hobolth A., Lappalainen T., Mailund T., Marques-Bonet T.,
RA McCarthy S., Montgomery S.H., Schwalie P.C., Tang Y.A., Ward M.C., Xue Y.,
RA Yngvadottir B., Alkan C., Andersen L.N., Ayub Q., Ball E.V., Beal K.,
RA Bradley B.J., Chen Y., Clee C.M., Fitzgerald S., Graves T.A., Gu Y.,
RA Heath P., Heger A., Karakoc E., Kolb-Kokocinski A., Laird G.K., Lunter G.,
RA Meader S., Mort M., Mullikin J.C., Munch K., O'Connor T.D., Phillips A.D.,
RA Prado-Martinez J., Rogers A.S., Sajjadian S., Schmidt D., Shaw K.,
RA Simpson J.T., Stenson P.D., Turner D.J., Vigilant L., Vilella A.J.,
RA Whitener W., Zhu B., Cooper D.N., de Jong P., Dermitzakis E.T.,
RA Eichler E.E., Flicek P., Goldman N., Mundy N.I., Ning Z., Odom D.T.,
RA Ponting C.P., Quail M.A., Ryder O.A., Searle S.M., Warren W.C.,
RA Wilson R.K., Schierup M.H., Rogers J., Tyler-Smith C., Durbin R.;
RT "Insights into hominid evolution from the gorilla genome sequence.";
RL Nature 483:169-175(2012).
RN [3] {ECO:0000313|Ensembl:ENSGGOP00000022318.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the Gfa family. {ECO:0000256|ARBA:ARBA00005495}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CABD030124596; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; G3S2M0; -.
DR STRING; 9593.ENSGGOP00000022318; -.
DR Ensembl; ENSGGOT00000023627.2; ENSGGOP00000022318.2; ENSGGOG00000026953.2.
DR GeneTree; ENSGT00390000003183; -.
DR HOGENOM; CLU_087799_0_0_1; -.
DR InParanoid; G3S2M0; -.
DR OMA; ACPREQQ; -.
DR Proteomes; UP000001519; Chromosome X.
DR GO; GO:0016846; F:carbon-sulfur lyase activity; IEA:InterPro.
DR Gene3D; 2.170.150.70; -; 1.
DR InterPro; IPR006913; CENP-V/GFA.
DR InterPro; IPR011057; Mss4-like_sf.
DR PANTHER; PTHR28620; CENTROMERE PROTEIN V; 1.
DR PANTHER; PTHR28620:SF3; CENTROMERE PROTEIN V-LIKE PROTEIN 1; 1.
DR Pfam; PF04828; GFA; 1.
DR SUPFAM; SSF51316; Mss4-like; 1.
DR PROSITE; PS51891; CENP_V_GFA; 1.
PE 3: Inferred from homology;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000001519};
KW Zinc {ECO:0000256|ARBA:ARBA00022833}.
FT DOMAIN 143..256
FT /note="CENP-V/GFA"
FT /evidence="ECO:0000259|PROSITE:PS51891"
FT REGION 1..33
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 75..105
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 250..282
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 87..101
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 268..282
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 282 AA; 30958 MW; 7970EBA1F5D0E617 CRC64;
WARAPEAAGA MGRVRNRTTA QRRRRKRPGD PPAACAAIAV TGASRAQYPR VQVGVGSHAA
AKRWLGRWRR KRRWRRVRKA GPRDLLPSAP TPDPPGPAPS PKDLDLGAQR ERWETFRKLR
GLSCEGAAKV LLDTFEYPGL VHHTGGCHCG AVRFAVWAPA DLRVVDCSCR LCRKKQHRHF
LVPASRFTLL QGAESIVTYR SNTHPALHSF CSRCGVQSFH AAVSDPRVYG VAPHCLDEGT
VRSVVIEEVG GGDPGEEAAE EHKAIHKTSS QSAPACPREQ EQ
//