ID W5Q822_SHEEP Unreviewed; 691 AA.
AC W5Q822;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 51.
DE RecName: Full=SCAN box domain-containing protein {ECO:0008006|Google:ProtNLM};
GN Name=ZBED9 {ECO:0000313|Ensembl:ENSOARP00000018864.1};
OS Ovis aries (Sheep).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Caprinae; Ovis.
OX NCBI_TaxID=9940 {ECO:0000313|Ensembl:ENSOARP00000018864.1, ECO:0000313|Proteomes:UP000002356};
RN [1] {ECO:0000313|Ensembl:ENSOARP00000018864.1, ECO:0000313|Proteomes:UP000002356}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Texel {ECO:0000313|Ensembl:ENSOARP00000018864.1,
RC ECO:0000313|Proteomes:UP000002356};
RX PubMed=20809919; DOI=10.1111/j.1365-2052.2010.02100.x;
RA Archibald A.L., Cockett N.E., Dalrymple B.P., Faraut T., Kijas J.W.,
RA Maddox J.F., McEwan J.C., Hutton Oddy V., Raadsma H.W., Wade C., Wang J.,
RA Wang W., Xun X.;
RT "The sheep genome reference sequence: a work in progress.";
RL Anim. Genet. 41:449-453(2010).
RN [2] {ECO:0000313|Ensembl:ENSOARP00000018864.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00187}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMGL01059151; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; W5Q822; -.
DR SMR; W5Q822; -.
DR STRING; 9940.ENSOARP00000018864; -.
DR PaxDb; 9940-ENSOARP00000018864; -.
DR Ensembl; ENSOART00000019131.1; ENSOARP00000018864.1; ENSOARG00000017580.1.
DR eggNOG; KOG0017; Eukaryota.
DR eggNOG; KOG1721; Eukaryota.
DR HOGENOM; CLU_023794_0_0_1; -.
DR OMA; VHENPRR; -.
DR Proteomes; UP000002356; Chromosome 20.
DR Bgee; ENSOARG00000017580; Expressed in embryo and 12 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd15489; PHD_SF; 1.
DR CDD; cd07936; SCAN; 1.
DR Gene3D; 1.10.4020.10; DNA breaking-rejoining enzymes; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR003309; SCAN_dom.
DR InterPro; IPR038269; SCAN_sf.
DR PANTHER; PTHR45935:SF2; KRAB-A DOMAIN-CONTAINING PROTEIN 2; 1.
DR PANTHER; PTHR45935; PROTEIN ZBED8-RELATED; 1.
DR Pfam; PF02023; SCAN; 1.
DR SMART; SM00431; SCAN; 1.
DR SUPFAM; SSF47353; Retrovirus capsid dimerization domain-like; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50804; SCAN_BOX; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU00187};
KW Reference proteome {ECO:0000313|Proteomes:UP000002356}.
FT DOMAIN 52..134
FT /note="SCAN box"
FT /evidence="ECO:0000259|PROSITE:PS50804"
FT DOMAIN 366..533
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
SQ SEQUENCE 691 AA; 79044 MW; 264A28D85F4CC805 CRC64;
MAAAPGTIPA LAGHGSEEQG EIIKIKVKEE DPIWDQASFL QKNLSYSREL SRQRFRQFCY
QETPGPREAL SQLRELCRLW LSPETHTKEQ ILELLVLEQL LTILPEELQA WVREHHPESG
EEVVTVLEDL ERELDEPRQQ VPQGTYEQEV PMEEKTFLDT AKESLGTHLQ SIEDRMECES
PESHLLQDNG SFLWLSMMSQ SMGDNNFSSL DTNEAEIEPE NMREKFFRSL AVLLENKSNN
TKIFSKAKYC QLIREVKEAK AKEKKESIDY QRLARFDVII VQGHEKLIEA INGETDKIRF
YLHSEDLFDI LHDTHLSIGH GGRTRMEKEL QAKYKNITKE VIMLYLTLCK PCQQKNSKLK
KVLTSKSIKE VNSRCQVDLI DMQLNPDGQY KFIMHYQDFR TNLSFLRSLK SKRPEEVARA
LLDIFTIVGA PSVLQSDSGR EFSSQIVSEL SNIWPELKIV HGNPQACQSL SSINQVNEDI
QNRIISWMQA NNSSHWAEFL WFIQMTQNQP HHRRMQQTLC EGACSSEAKL GLSHSQSTEE
LVASLNTENE LEQADKESEN TARAQHEENI EIGTDSSDIE EILSITPKVA QNTVPEGRLN
FLSCVSCGKE CIGANSCESC CRNIHAICGV PSQRETDGYC NKITCSLCYQ TTTMKRKPDE
VPRSFTVQPF KMLKPSETPF SSDKAGDWVR I
//