ID A0A091I5F4_CALAN Unreviewed; 562 AA.
AC A0A091I5F4;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 27-MAR-2024, entry version 35.
DE SubName: Full=Lens epithelium-derived growth factor {ECO:0000313|EMBL:KFP02708.1};
DE Flags: Fragment;
GN ORFNames=N300_15513 {ECO:0000313|EMBL:KFP02708.1};
OS Calypte anna (Anna's hummingbird) (Archilochus anna).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Caprimulgimorphae; Apodiformes;
OC Trochilidae; Calypte.
OX NCBI_TaxID=9244 {ECO:0000313|EMBL:KFP02708.1, ECO:0000313|Proteomes:UP000054308};
RN [1] {ECO:0000313|EMBL:KFP02708.1, ECO:0000313|Proteomes:UP000054308}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_N300 {ECO:0000313|EMBL:KFP02708.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the HDGF family.
CC {ECO:0000256|ARBA:ARBA00005309}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KL218151; KFP02708.1; -; Genomic_DNA.
DR RefSeq; XP_008494467.1; XM_008496245.1.
DR AlphaFoldDB; A0A091I5F4; -.
DR STRING; 9244.A0A091I5F4; -.
DR Proteomes; UP000054308; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR CDD; cd20151; PWWP_PSIP; 1.
DR Gene3D; 2.30.30.140; -; 1.
DR Gene3D; 1.20.930.10; Conserved domain common to transcription factors TFIIS, elongin A, CRSP70; 1.
DR InterPro; IPR036218; HIVI-bd_sf.
DR InterPro; IPR021567; LEDGF_IBD.
DR InterPro; IPR000313; PWWP_dom.
DR InterPro; IPR035441; TFIIS/LEDGF_dom_sf.
DR PANTHER; PTHR12550; HEPATOMA-DERIVED GROWTH FACTOR-RELATED; 1.
DR PANTHER; PTHR12550:SF49; JIL-1 ANCHORING AND STABILIZING PROTEIN, ISOFORM A; 1.
DR Pfam; PF11467; LEDGF; 1.
DR Pfam; PF00855; PWWP; 1.
DR SMART; SM00293; PWWP; 1.
DR SUPFAM; SSF140576; HIV integrase-binding domain; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 1.
DR PROSITE; PS50812; PWWP; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054};
KW Reference proteome {ECO:0000313|Proteomes:UP000054308}.
FT DOMAIN 1..50
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT REGION 47..381
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 476..562
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 47..66
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 75..101
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 131..163
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 240..294
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 332..381
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 476..504
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 505..525
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 526..562
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 562
FT /evidence="ECO:0000313|EMBL:KFP02708.1"
SQ SEQUENCE 562 AA; 62648 MW; E48E0CBFD092833A CRC64;
MKGYPHWPAR VDEVPDGAVK PPTNKMPIFF FGTHETAFLG PKDIFPYSEN KDKYGKPNKR
KGFNEGLWEI DNNPKVKFSH QQSHPAVNTP IKETVQESSQ EPAEGSEEKA GAKRRKSSVP
KLSPKGDNNM PTEAETEEKE MDTSKEDDLP SDKTSKEDVV KTSDASLPKV ARRGRKRKVE
KQGETEEAAA AVATVVGGAA PVPASPKVSP KRGRPTASEA KVPKPRGRPK LVKPSCLSES
DSVNEEEKAK KKGPEEKPKK QGKKDEESQK EEEKSKKEFD KKEGKKEAEP KRKNTAKVGS
ASASDSEGEG EEQEGDKKKK GGRSLQSAHR RNIIRGQHDR DAAERKRKQE EQTESDSQSK
EEGKKTEAKK MEKKRETSMD SRLQRIHAEI KNSLKIDNLD VNRCIEALDE LASLQITMQQ
AQKHTEMILT LKKIRKFKVS QVIMEKSTML YNKFKTMFLV GEGDSVLSQV LNKSLAEQKQ
HEEANKTKEQ WKKGASKKPE KDQTSSKVLN GGCEAQDTSQ APNAGENTEE KKDKHEAGSK
KKAGGEEKEL EKPAKDSAFE GK
//