ID F7BFK5_HORSE Unreviewed; 680 AA.
AC F7BFK5;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 10-APR-2019, sequence version 2.
DT 27-MAR-2024, entry version 55.
DE SubName: Full=Asteroid homolog 1 {ECO:0000313|Ensembl:ENSECAP00000011775.2};
GN Name=ASTE1 {ECO:0000313|VGNC:VGNC:56714};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000011775.2, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000011775.2, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000011775.2,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000011775.2}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000011775.2};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the asteroid family.
CC {ECO:0000256|ARBA:ARBA00007398}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; F7BFK5; -.
DR STRING; 9796.ENSECAP00000011775; -.
DR PaxDb; 9796-ENSECAP00000011775; -.
DR Ensembl; ENSECAT00000014715.3; ENSECAP00000011775.2; ENSECAG00000014026.4.
DR VGNC; VGNC:56714; ASTE1.
DR GeneTree; ENSGT00390000010145; -.
DR HOGENOM; CLU_017330_1_0_1; -.
DR InParanoid; F7BFK5; -.
DR OMA; AHQWNCP; -.
DR TreeFam; TF324582; -.
DR Proteomes; UP000002281; Chromosome 16.
DR Bgee; ENSECAG00000014026; Expressed in blood and 23 other cell types or tissues.
DR ExpressionAtlas; F7BFK5; baseline.
DR GO; GO:0004518; F:nuclease activity; IEA:InterPro.
DR Gene3D; 3.40.50.1010; 5'-nuclease; 1.
DR InterPro; IPR026832; Asteroid.
DR InterPro; IPR029060; PIN-like_dom_sf.
DR InterPro; IPR006085; XPG_DNA_repair_N.
DR PANTHER; PTHR15665; ASTEROID PROTEIN; 1.
DR PANTHER; PTHR15665:SF1; PROTEIN ASTEROID HOMOLOG 1; 1.
DR Pfam; PF00752; XPG_N; 1.
DR SUPFAM; SSF88723; PIN domain-like; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000002281}.
FT DOMAIN 1..97
FT /note="XPG N-terminal"
FT /evidence="ECO:0000259|Pfam:PF00752"
FT REGION 621..645
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 680 AA; 77205 MW; 79FF8DFDCE648171 CRC64;
MGIRGLMSFV EDHSNEFFID LKLRDTKIII DGYALFHRLC FNSNLELRYG GDYDSFADVV
QKFFESLFAC NICPYVVLDG GCDISDKKLT TLKDRAREKI QMAHSLSVGG GGYVCPLLIR
EVFIQVLIKL HVCFVQCFSE ADRDIMTLAN HWNCPVLSSD SDFCIFDLKS GFCPLNSFQW
RNMNTLKGTR EHYIPAKCFS LDALCHHFSR MNKALLPLFA VLCGNDHINL PIMETFLSKV
RLPLGGASSK GRRHHRVLGL LNWLSQFANP TEALDNVLQY LPKKNRENVK ELLCCSMEEY
QPSQVKLQDF FQYGAYACPD ALNLALPEWV LVALAKGQLS PFISDALVLR RTILQTQVEN
MQQPSAHRIS LPIRQTIYGL LLNASAHLEN TSWNALPLQR LAFSEVERIH KNIKTSIVDA
VEVPKDHSDL STLTELSLAR RQILLLETLK VKQAVLDPIP PSLKLPIAVS CYWLQHTEAK
AKLHHLQALL LGMLMGPLHA IIHSPDKEDL REDGAKMLYE EFQRVKEQTR PGTRLDLDTA
HIFSQWQCCL QMGVYLNQLL STPLPEPDLT RLYSGSLVHG LSRQLLTTTS AESLLSMCSE
AKQLYDHLFN ATRSHAPAEL FLPKGKSNPK KKRQKKRGTS WSKNRLGATS DTRCWYEGSN
RFGLLMVENL EEHIETSEFE
//