ID A0A091JKF4_EGRGA Unreviewed; 974 AA.
AC A0A091JKF4;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 22-FEB-2023, entry version 29.
DE SubName: Full=Huntingtin-interacting protein 1 {ECO:0000313|EMBL:KFP21067.1};
DE Flags: Fragment;
GN ORFNames=Z169_00516 {ECO:0000313|EMBL:KFP21067.1};
OS Egretta garzetta (Little egret).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Pelecaniformes; Ardeidae; Egretta.
OX NCBI_TaxID=188379 {ECO:0000313|EMBL:KFP21067.1, ECO:0000313|Proteomes:UP000053119};
RN [1] {ECO:0000313|EMBL:KFP21067.1, ECO:0000313|Proteomes:UP000053119}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_Z169 {ECO:0000313|EMBL:KFP21067.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the SLA2 family.
CC {ECO:0000256|ARBA:ARBA00010135}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KK502172; KFP21067.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A091JKF4; -.
DR STRING; 188379.A0A091JKF4; -.
DR Proteomes; UP000053119; Unassembled WGS sequence.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-KW.
DR GO; GO:0003779; F:actin binding; IEA:UniProtKB-KW.
DR GO; GO:0030276; F:clathrin binding; IEA:InterPro.
DR GO; GO:0005543; F:phospholipid binding; IEA:InterPro.
DR GO; GO:0006897; P:endocytosis; IEA:InterPro.
DR Gene3D; 1.20.5.1700; -; 1.
DR Gene3D; 1.25.40.90; -; 1.
DR Gene3D; 6.10.250.920; -; 1.
DR Gene3D; 1.20.1410.10; I/LWEQ domain; 1.
DR InterPro; IPR011417; ANTH_dom.
DR InterPro; IPR013809; ENTH.
DR InterPro; IPR008942; ENTH_VHS.
DR InterPro; IPR032422; HIP1_clath-bd.
DR InterPro; IPR035964; I/LWEQ_dom_sf.
DR InterPro; IPR002558; ILWEQ_dom.
DR InterPro; IPR030224; Sla2_fam.
DR PANTHER; PTHR10407; HUNTINGTIN INTERACTING PROTEIN 1; 1.
DR PANTHER; PTHR10407:SF14; HUNTINGTIN-INTERACTING PROTEIN 1; 1.
DR Pfam; PF07651; ANTH; 1.
DR Pfam; PF16515; HIP1_clath_bdg; 1.
DR Pfam; PF01608; I_LWEQ; 1.
DR SMART; SM00273; ENTH; 1.
DR SMART; SM00307; ILWEQ; 1.
DR SUPFAM; SSF48464; ENTH/VHS domain; 1.
DR SUPFAM; SSF109885; I/LWEQ domain; 1.
DR PROSITE; PS50942; ENTH; 1.
DR PROSITE; PS50945; I_LWEQ; 1.
PE 3: Inferred from homology;
KW Actin-binding {ECO:0000256|ARBA:ARBA00023203};
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Cytoplasm {ECO:0000256|ARBA:ARBA00022490};
KW Reference proteome {ECO:0000313|Proteomes:UP000053119}.
FT DOMAIN 1..116
FT /note="ENTH"
FT /evidence="ECO:0000259|PROSITE:PS50942"
FT DOMAIN 723..964
FT /note="I/LWEQ"
FT /evidence="ECO:0000259|PROSITE:PS50945"
FT REGION 452..473
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 502..524
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 930..959
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 456..473
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KFP21067.1"
FT NON_TER 974
FT /evidence="ECO:0000313|EMBL:KFP21067.1"
SQ SEQUENCE 974 AA; 108371 MW; 65193EF565D5099F CRC64;
QTVSINKAIN AQEVAVKEKH ARNILPEAGC RTNELVNRLP LSGNAVLCWK FCHVFHKLLR
DGHSNVLKDS VRYKNELSDM SRMWGHLSEG YGQLCSIYLK LLRTKMEFHT KNPRFPGNLQ
MSDRQLDEAG ENDVNNFFQL TVEMFDYLEC ELNLFQTVFS SLDMSRSVSV TAAGQCRLAP
LIQVILDCSH LYDYTVKLLF KLHSCLPADT LQGHRDRFLE QFRKLKDLFY RSSNLQYFKR
LIQIPQLPEN PPNFLRASAL SEHISPVVVI PAEASSPDSE PITDLVEMDT ASQSLFDNKF
DDIFGSSFNS DPFNFNSQNG MKKDDKDRLI EQLYGEIAAL KEELENFKAE SARGVVQLRG
RASELEAELA EQRHLKQQAQ DESEFLRTEL EELKKQREDT EKAQRSLTEI ERRAQANEQR
YSKLKEKYSE LVQNHADLLR KNAEVTKQVT AARQAQGDVE REKKELEDSF QRSQEQAEVL
DTLKREVAAS RQELQVLQGT LESSAQAGAE QSTRIAGLEQ ERDSLSRAAE RHGEEMAALR
AELQGLREVL SLGAVTPALA WQESGEQALQ RRLAEEQFAL LRGTAREAER MVQDAPRLSP
ACCLPADCLL SRTLAASECV ERLRDAHGKY LSNCAAMGSL LPCLALFAHL VSDTLLQGSA
TSHVAPMEPA DRLLEMCKQC GSEAVSYLSA LQDPGTVESA DCSPVTACLG RISTIGEELR
PRGLDVKQEE LGDLVDKEMA ATAAAIETAS ARIEEMLSKA RAGDTGVKLE VNERILGSCT
GLMQAIHILV LASKDLQREI VESGRGAASP KEFYAKNSRW TEGLISASKA VGWGATVMVD
AADLVVQGKG TFEELMVCSR EIAASTAQLV AASKVKADKD STNLCKLQQA SRGVNQATAG
VVASTKAGKS QVEEKDSMDF SSMTLTQIKR QEMDSQVRVL ELENQLQKER QKLGELRKKH
YELAGVAEGW EEDG
//