ID A0A3Q1LY41_BOVIN Unreviewed; 632 AA.
AC A0A3Q1LY41;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 10-APR-2019, sequence version 1.
DT 27-MAR-2024, entry version 24.
DE RecName: Full=HTH CENPB-type domain-containing protein {ECO:0000259|PROSITE:PS51253};
GN Name=LOC101903385 {ECO:0000313|Ensembl:ENSBTAP00000062440.1};
OS Bos taurus (Bovine).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Bovinae; Bos.
OX NCBI_TaxID=9913 {ECO:0000313|Ensembl:ENSBTAP00000062440.1, ECO:0000313|Proteomes:UP000009136};
RN [1] {ECO:0000313|Ensembl:ENSBTAP00000062440.1, ECO:0000313|Proteomes:UP000009136}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Hereford {ECO:0000313|Ensembl:ENSBTAP00000062440.1,
RC ECO:0000313|Proteomes:UP000009136};
RA Rosen B.D., Bickhart D.M., Koren S., Schnabel R.D., Hall R., Zimin A.,
RA Dreischer C., Schultheiss S., Schroeder S.G., Elsik C.G., Couldrey C.,
RA Liu G.E., Van Tassell C.P., Phillippy A.M., Smith T.P.L., Medrano J.F.;
RT "ARS-UCD1.2.";
RL Submitted (MAR-2018) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSBTAP00000062440.1}
RP IDENTIFICATION.
RC STRAIN=Hereford {ECO:0000313|Ensembl:ENSBTAP00000062440.1};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the tigger transposable element derived protein
CC family. {ECO:0000256|ARBA:ARBA00010881}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_015313992.1; XM_015458506.1.
DR RefSeq; XP_015323377.1; XM_015467891.1.
DR AlphaFoldDB; A0A3Q1LY41; -.
DR SMR; A0A3Q1LY41; -.
DR STRING; 9913.ENSBTAP00000062440; -.
DR PaxDb; 9913-ENSBTAP00000056213; -.
DR Ensembl; ENSBTAT00000066936.1; ENSBTAP00000062440.1; ENSBTAG00000054839.1.
DR GeneID; 101903385; -.
DR VEuPathDB; HostDB:ENSBTAG00000054839; -.
DR GeneTree; ENSGT00940000163154; -.
DR InParanoid; A0A3Q1LY41; -.
DR OMA; YNIMTAV; -.
DR OrthoDB; 2967227at2759; -.
DR Proteomes; UP000009136; Chromosome 18.
DR Bgee; ENSBTAG00000054839; Expressed in oviduct epithelium and 94 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0003677; F:DNA binding; IBA:GO_Central.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 1.
DR InterPro; IPR004875; DDE_SF_endonuclease_dom.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR006600; HTH_CenpB_DNA-bd_dom.
DR InterPro; IPR007889; HTH_Psq.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR PANTHER; PTHR19303:SF61; HTH CENPB-TYPE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR19303; TRANSPOSON; 1.
DR Pfam; PF04218; CENP-B_N; 1.
DR Pfam; PF03184; DDE_1; 1.
DR Pfam; PF03221; HTH_Tnp_Tc5; 1.
DR SMART; SM00674; CENPB; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 2.
DR PROSITE; PS51253; HTH_CENPB; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000009136}.
FT DOMAIN 143..222
FT /note="HTH CENPB-type"
FT /evidence="ECO:0000259|PROSITE:PS51253"
FT REGION 1..23
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 516..558
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 632 AA; 68948 MW; 8900BA4F2F87B47D CRC64;
MRRLCSPSGP FEAAGPCRGP EAEMEWQERP RGHCGTTLGT ADGQREESVL RCMAQGGSLK
PPQPQGLGKA PLGVGLRHSA KRDRKSITLH VKLEVLRRFE EGEKLTQIAR ALGLATSTVA
SIRVNKDRIR ASSQAAAPVC TTQLTRCRGA LMGHMERLLS LWIEEQKRQN LPVSTLLIQD
QARRLFAQLQ HEQGGGSRAE TFGASNGWFA RFKVRHNVLL TEEPAVADAQ AAARYPAVLR
AILEEGCYSP RQVFNVDETG LFWKRLPERM LLALEGTAGP GPKASKDHLT LLLGGNAAGD
FKLKPLLVYP SENPRALRGC SKASLPVVWR SNRNDWLTPV IFQEWFTSCF CPAVESYCAS
HGLPHRALLL LDSAPCHPAH LGGLSAHVRV EFLPKNTSTL IQPMNQGVIT AFKAQYLRRT
LSQLAQEMGG ADRPSVWEFW RSYTVMTAVD NIAEAWTELQ PAAMNSAWRK LWPECVLAGA
PEPSAVPQLP RSIETLASRT GLGDVAEADV SHLLQAHGEP TPTPLGTDGG HARGPQLPCQ
CGKGLASRRP ESEATGGAEA EDTLVVALCS EHLARALSHF AAGLQVLSEN DPNRERSLWV
ARAVHCALAH LRELLRERRR QARAAAGPPE AP
//