ID K7FY13_PELSI Unreviewed; 1225 AA.
AC K7FY13;
DT 09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT 09-JAN-2013, sequence version 1.
DT 27-MAR-2024, entry version 74.
DE SubName: Full=AT-rich interaction domain 4B {ECO:0000313|Ensembl:ENSPSIP00000012923.1};
GN Name=ARID4B {ECO:0000313|Ensembl:ENSPSIP00000012923.1};
OS Pelodiscus sinensis (Chinese softshell turtle) (Trionyx sinensis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Testudinata; Testudines; Cryptodira; Trionychia;
OC Trionychidae; Pelodiscus.
OX NCBI_TaxID=13735 {ECO:0000313|Ensembl:ENSPSIP00000012923.1, ECO:0000313|Proteomes:UP000007267};
RN [1] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RG Soft-shell Turtle Genome Consortium;
RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RX PubMed=23624526; DOI=10.1038/ng.2615;
RA Wang Z., Pascual-Anaya J., Zadissa A., Li W., Niimura Y., Huang Z., Li C.,
RA White S., Xiong Z., Fang D., Wang B., Ming Y., Chen Y., Zheng Y.,
RA Kuraku S., Pignatelli M., Herrero J., Beal K., Nozawa M., Li Q., Wang J.,
RA Zhang H., Yu L., Shigenobu S., Wang J., Liu J., Flicek P., Searle S.,
RA Wang J., Kuratani S., Yin Y., Aken B., Zhang G., Irie N.;
RT "The draft genomes of soft-shell turtle and green sea turtle yield insights
RT into the development and evolution of the turtle-specific body plan.";
RL Nat. Genet. 45:701-706(2013).
RN [3] {ECO:0000313|Ensembl:ENSPSIP00000012923.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGCU01078959; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01078960; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01078961; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01078962; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01078963; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01078964; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01078965; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01078966; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01078967; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01078968; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 13735.ENSPSIP00000012923; -.
DR Ensembl; ENSPSIT00000012985.1; ENSPSIP00000012923.1; ENSPSIG00000011632.1.
DR eggNOG; KOG2744; Eukaryota.
DR eggNOG; KOG3001; Eukaryota.
DR GeneTree; ENSGT00940000158149; -.
DR HOGENOM; CLU_007419_0_0_1; -.
DR OMA; XPEEESS; -.
DR TreeFam; TF106427; -.
DR Proteomes; UP000007267; Unassembled WGS sequence.
DR GO; GO:0005829; C:cytosol; IEA:Ensembl.
DR GO; GO:0005739; C:mitochondrion; IEA:Ensembl.
DR GO; GO:0005654; C:nucleoplasm; IEA:Ensembl.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0006306; P:DNA methylation; IEA:Ensembl.
DR GO; GO:0097368; P:establishment of Sertoli cell barrier; IEA:Ensembl.
DR GO; GO:0071514; P:genomic imprinting; IEA:Ensembl.
DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; IEA:Ensembl.
DR GO; GO:0007283; P:spermatogenesis; IEA:Ensembl.
DR GO; GO:0006366; P:transcription by RNA polymerase II; IEA:Ensembl.
DR CDD; cd16883; ARID_ARID4B; 1.
DR Gene3D; 2.30.30.140; -; 3.
DR Gene3D; 1.10.150.60; ARID DNA-binding domain; 1.
DR InterPro; IPR012603; ARID4A/B_PWWP.
DR InterPro; IPR028853; ARID4B_ARID/BRIGHT.
DR InterPro; IPR001606; ARID_dom.
DR InterPro; IPR036431; ARID_dom_sf.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR025995; Tudor-knot.
DR PANTHER; PTHR13964:SF24; AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 4B; 1.
DR PANTHER; PTHR13964; RBP-RELATED; 1.
DR Pfam; PF01388; ARID; 1.
DR Pfam; PF08169; RBB1NT; 1.
DR Pfam; PF11717; Tudor-knot; 1.
DR SMART; SM01014; ARID; 1.
DR SMART; SM00501; BRIGHT; 1.
DR SUPFAM; SSF46774; ARID-like; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 1.
DR PROSITE; PS51011; ARID; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000007267};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 212..304
FT /note="ARID"
FT /evidence="ECO:0000259|PROSITE:PS51011"
FT REGION 29..77
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 173..211
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 349..490
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 547..728
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 741..799
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 818..1125
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1165..1225
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 54..77
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 183..211
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 349..377
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 378..394
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 395..445
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 446..479
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 557..571
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 579..624
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 630..662
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 708..728
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 819..840
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 904..919
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 921..970
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 991..1021
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1056..1103
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1106..1122
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1185..1225
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1225 AA; 138405 MW; 0E69BD6FB1B440AB CRC64;
MVTVFDDGDE KXXXXLCLKG ERHFAESETL DQLPLTNPEH FGTPVIGKKT NRGRRSNHIP
EEESSSSSSD EDEDDRKQTD ELLGKVVCVD YVNVDKKKIL WFPALVVCPD CSDDIAVKKD
NVLVRSFKDG KFASVPRKDV REISETSSKP DALLKQAFDQ ALEFHKNRTV PSNWKTELKE
ESSSSEAEEE EEDEKEKEDN SSEEEEEIEP FPEERENFLQ QLYKFMEDRG TPINKRPVLG
YRNLNLFKLF RLVHKLGGFD NIESGAVWKQ VYQDLGIPVL NSAAGYNVKC AYKKYLYGFE
EYCMSANIEF QMALPEKVTN KSCKECEKEK ELKMREEPEQ DVKEITAVKE EHKEEEEEVL
IQQEETKPAE NDSECTENDK PTFVGNQKNM EESIYTQSDQ EKEFSSIKTE DEANLGDKEE
EKIKQMEILN TNMEAEEKEK SGDDTNKEED EEEEEAEEDE EDDEEEEEDN NDNNEEEEFE
CYPPGMKVQV RYGRGKNQKM YEASIKDSDV EGGEVLYLVH YCGWNVRYDE WIKADKIVRP
ADKNVPKIKH RKKIKNKTDK EKDEKYSPKN CKLRRLSKPP FHTSTSPETG SKFDSTEAKN
SEQAPVKSIE ITSILNGLQA SESSADDSEQ EDDQSAHNVH NDGKEESQDE TISQDKNELC
PKEEQSSSSP QEEGKSLADV APSKSVSKSP DRLQKELEEL SDDTDYEGED EATKKRKEVK
KEVVDKTTRL QVKRGKRRYC VTEECVKTAS PNRKDEKSKS KDSHSLEHSS NSSSDEDEDD
KSKIKITPTK KYNGLEEKRK SLRTSAFYSG FSEVAEKRIK LLNNSDERSQ NTRAKDRKDV
WSSIQGQWPK KTLKELFSDS DTEAAASPPR PASEDAAVEE QLQTLTEEVS LPSSELEKPL
LASTDVKPVE EKPTEMNDKK VDFPSSGSNS VLNTPPTTPE SPSSVTVTES NRHQSSVPVS
ETLAPNQEEI RSIKSETDST IEVDSVVGEL QDLQSEGNIS PTGFDASVSS SSSNQPEPEH
TEKVCTGQKR LKDVQGGGSS SKKQKRSHKT SVVNNKKKSK PTNSSDSEEL SAGESMTKSQ
PVKSVSTGMK SHHTKSPART QSPGKCGKNG EKESDLKEQN NRLPKVYKWS FQMSDLENLT
SAERITILQE KLQEIRKHYM SLKSEVASID RRRKRLKKKE RESAATTSSS SSSPSSSSIT
AAVMLTLAEP SMSSSSQNGM SVECR
//