ID A0A093FGN2_GAVST Unreviewed; 1092 AA.
AC A0A093FGN2;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 24-JAN-2024, entry version 30.
DE RecName: Full=AT-rich interactive domain-containing protein 5B {ECO:0000256|ARBA:ARBA00013841};
DE Flags: Fragment;
GN ORFNames=N328_01139 {ECO:0000313|EMBL:KFV53466.1};
OS Gavia stellata (Red-throated diver) (Colymbus stellatus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Gaviiformes; Gaviidae; Gavia.
OX NCBI_TaxID=37040 {ECO:0000313|EMBL:KFV53466.1, ECO:0000313|Proteomes:UP000054313};
RN [1] {ECO:0000313|EMBL:KFV53466.1, ECO:0000313|Proteomes:UP000054313}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_N328 {ECO:0000313|EMBL:KFV53466.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the ARID5B family.
CC {ECO:0000256|ARBA:ARBA00010608}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KK626250; KFV53466.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A093FGN2; -.
DR Proteomes; UP000054313; Unassembled WGS sequence.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR CDD; cd16885; ARID_ARID5B; 1.
DR Gene3D; 1.10.150.60; ARID DNA-binding domain; 1.
DR InterPro; IPR030408; ARID5B_ARID/BRIGHT_DNA-bd.
DR InterPro; IPR001606; ARID_dom.
DR InterPro; IPR036431; ARID_dom_sf.
DR PANTHER; PTHR13964:SF37; AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 5B; 1.
DR PANTHER; PTHR13964; RBP-RELATED; 1.
DR Pfam; PF01388; ARID; 1.
DR SMART; SM01014; ARID; 1.
DR SMART; SM00501; BRIGHT; 1.
DR SUPFAM; SSF46774; ARID-like; 1.
DR PROSITE; PS51011; ARID; 1.
PE 3: Inferred from homology;
KW Activator {ECO:0000256|ARBA:ARBA00023159};
KW Reference proteome {ECO:0000313|Proteomes:UP000054313};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 228..320
FT /note="ARID"
FT /evidence="ECO:0000259|PROSITE:PS51011"
FT REGION 155..187
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 320..483
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 621..649
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 790..845
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 941..968
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 320..391
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 803..845
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 941..960
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KFV53466.1"
FT NON_TER 1092
FT /evidence="ECO:0000313|EMBL:KFV53466.1"
SQ SEQUENCE 1092 AA; 121926 MW; A403F735489B7B7A CRC64;
QDEVIAVSEK VTVKLEDLAK WAQSDFSKWK CGFRAEPVKP MDVGKNGQKE ALTRYRQSTL
NSGLNFKDIL KEKADLGEDD EDSNLLILSY PQYCRYRSML KRIQDKPSSI LTDQFVLALG
GIAVTSKNPQ IFYCRDTFDH PTLIENESIC DEFAPNLKGR PRKKKPCPQR RDSLNGIKDS
NNNSESKAVA KVKCEAKSAL PKPKSNNSNC KKGSSEDKSK IAIGEECRAD EQAFLVALYK
YMKERKTPIE RIPYLGFKQI NLWTMFQAAQ KLGGYETITA RRQWKHIYDE LGGNPGSTSA
ATCTRRHYER LILPYERFIK GEEDKPLPPV KPRKQDNSSQ EGEAKTKVSG TKRIKNENQK
SKKEKDNAQK PQDASEVSSE QEKDQESADQ KNFPEHPTAG EMKQPMQGPP SLLPETARPP
PLEKTDLTEN STNSEKAKEE VQHSSAFSSI SVPPEEDTVL DATVAKRLHP PADALEDTKP
EQRLHKAFTD SLESEPPEMP FTAFPVQLST QSDMEDDKLP EMADYIANCT VKVDQLGNED
IHNALKQTPK VLVVQNFDMF KEKELPGSMN DDSTFGYTPL LYSKGNPGIM SPLAKKKLLS
QVSGAALSCS YPYGSPPPLI SKKKLNGRDE LSSGISQGPH APNSDPVAIN RPSVIQHVQS
FKTKEERKSI NDVFKHDMLS KPDPQRCDFS KHHLSSLAES YVPKTDIQDC KDKMSEKRAL
QHSHVPTFLA DFYSSPHLHS LYRHTEHHLN NEQTSKYLPR DMFRESENIS TFTQHKHQEK
LNLNYRPSLH QQEKKAAVEA SSDDQPTDLS LPKSTHKQTA KAPGSSLPHS SMAQQEGKGI
SPFQAASSQA VSLDCNPKAC RVSPMAMTAP KKHSELLHRS GKQQAQRLEN LRKMEGMVHP
IISRRTSPQN VGAARPLKRS LEDLDKVISE KKIRAVSPLH LPKETPVKDK VPDPEGEGSK
PVHGLHSGSM LESHKFPLSA PIFPGLYPGS LCTGLNNRLP PGYSHPLQYL KNQTVLSPLM
QPLALHSFMV QRQFLTSPAN SQQLYRHLAA ATPVGSSYGD LLHNSIYPLA AINPQAAFPP
SQLSSVHPST KL
//