GenomeNet

Database: UniProt
Entry: A0A5N4CY88_CAMDR
LinkDB: A0A5N4CY88_CAMDR
Original site: A0A5N4CY88_CAMDR 
ID   A0A5N4CY88_CAMDR        Unreviewed;       144 AA.
AC   A0A5N4CY88;
DT   26-FEB-2020, integrated into UniProtKB/TrEMBL.
DT   26-FEB-2020, sequence version 1.
DT   27-MAR-2024, entry version 15.
DE   SubName: Full=General transcription factor II-I repeat domain-containing protein 1 {ECO:0000313|EMBL:KAB1263737.1};
DE   Flags: Fragment;
GN   ORFNames=Cadr_000023764 {ECO:0000313|EMBL:KAB1263737.1};
OS   Camelus dromedarius (Dromedary) (Arabian camel).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Artiodactyla; Tylopoda; Camelidae; Camelus.
OX   NCBI_TaxID=9838 {ECO:0000313|EMBL:KAB1263737.1, ECO:0000313|Proteomes:UP000299084};
RN   [1] {ECO:0000313|EMBL:KAB1263737.1, ECO:0000313|Proteomes:UP000299084}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Drom800 {ECO:0000313|EMBL:KAB1263737.1};
RC   TISSUE=Blood {ECO:0000313|EMBL:KAB1263737.1};
RX   PubMed=30972949; DOI=.1111/1755-0998.13020;
RA   Elbers J.P., Rogers M.F., Perelman P.L., Proskuryakova A.A.,
RA   Serdyukova N.A., Johnson W.E., Horin P., Corander J., Murphy D.,
RA   Burger P.A.;
RT   "Improving Illumina assemblies with Hi-C and long reads: an example with
RT   the North African dromedary.";
RL   Mol. Ecol. Resour. 19:1015-1026(2019).
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KAB1263737.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; JWIN03000018; KAB1263737.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A5N4CY88; -.
DR   Proteomes; UP000299084; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR   Gene3D; 3.90.1460.10; GTF2I-like; 1.
DR   InterPro; IPR004212; GTF2I.
DR   InterPro; IPR036647; GTF2I-like_rpt_sf.
DR   PANTHER; PTHR46304; GENERAL TRANSCRIPTION FACTOR II-I REPEAT DOMAIN-CONTAINING PROTEIN 1; 1.
DR   PANTHER; PTHR46304:SF1; GENERAL TRANSCRIPTION FACTOR II-I REPEAT DOMAIN-CONTAINING PROTEIN 1; 1.
DR   Pfam; PF02946; GTF2I; 1.
DR   SUPFAM; SSF117773; GTF2I-like repeat; 1.
DR   PROSITE; PS51139; GTF2I; 1.
PE   4: Predicted;
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000299084};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Transcription {ECO:0000256|ARBA:ARBA00023163};
KW   Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT   REGION          74..112
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        93..112
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:KAB1263737.1"
SQ   SEQUENCE   144 AA;  15979 MW;  D1EECD998C508F4F CRC64;
     EALGLNRPVL VPYKLIRDSP DAVEVTGLPD DIPFRNPNTY DIHRLEKILK AREHVRMVII
     NQLQPFAEIC NDTKVPAKDS SIPKRKRKRV SEGNSVSSSS SSSSSSSSNP ESVASTNQIS
     LVQWPMYMVD YAGLNVQLPG PLNY
//
DBGET integrated database retrieval system