ID A0A182K378_9DIPT Unreviewed; 1238 AA.
AC A0A182K378;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 30.
DE RecName: Full=TATA-binding protein interacting (TIP20) domain-containing protein {ECO:0000259|Pfam:PF08623};
OS Anopheles christyi.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=43041 {ECO:0000313|EnsemblMetazoa:ACHR005213-PA, ECO:0000313|Proteomes:UP000075881};
RN [1] {ECO:0000313|Proteomes:UP000075881}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ACHKN1017 {ECO:0000313|Proteomes:UP000075881};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Besansky N., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles christyi ACHKN1017.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:ACHR005213-PA}
RP IDENTIFICATION.
RC STRAIN=ACHKN1017 {ECO:0000313|EnsemblMetazoa:ACHR005213-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SIMILARITY: Belongs to the CAND family.
CC {ECO:0000256|ARBA:ARBA00007657}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A182K378; -.
DR STRING; 43041.A0A182K378; -.
DR EnsemblMetazoa; ACHR005213-RA; ACHR005213-PA; ACHR005213.
DR VEuPathDB; VectorBase:ACHR005213; -.
DR OrthoDB; 68829at2759; -.
DR Proteomes; UP000075881; Unassembled WGS sequence.
DR GO; GO:0010265; P:SCF complex assembly; IEA:InterPro.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 1.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR039852; CAND1/CAND2.
DR InterPro; IPR021133; HEAT_type_2.
DR InterPro; IPR013932; TATA-bd_TIP120.
DR PANTHER; PTHR12696:SF0; CULLIN-ASSOCIATED NEDD8-DISSOCIATED PROTEIN 1; 1.
DR PANTHER; PTHR12696; TIP120; 1.
DR Pfam; PF08623; TIP120; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR PROSITE; PS50077; HEAT_REPEAT; 1.
PE 3: Inferred from homology;
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 47..84
FT /note="HEAT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00103"
FT DOMAIN 1041..1194
FT /note="TATA-binding protein interacting (TIP20)"
FT /evidence="ECO:0000259|Pfam:PF08623"
FT REGION 311..335
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 315..335
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1238 AA; 138815 MW; 887AEF0C0FF90E78 CRC64;
MASYQIANLL EKMTSNDKDF RFMATNDLMT ELQKDSIKLD DESEKKVVRM VLRLLEDKNG
EVQNLAVKCL GPLVNKVKEN QVETIVDLLC ANMVSNNEQL RDISSIGLKT VISELPQSSN
SLVPNVCQRI TGKLSVAIEK EDVSVQLEAL DILSDLLSRF GDLLVPFHEL ILKALVPQLG
SARQAVRKRT IVALSHLLTT CNNNAYNKVI EHLLDGLEKP QNPGTIRTYI QCLAAICRQA
GHRLCNHIER VMFLLNQYSL RDDDELREFC LQACEAFVQR CPEAIMPHIP TIVDLCLKYI
TYDPNYNYEA DDGEGGTSME MEDDEEIDSE EYSDDDDMSW KVRRSAAKCL ESVISTRHEL
LEEFYKTLSP ALIARFKERE ENVKSDIFHA YIALLKSTRP MGDDIGHDPD SMEQIPGPIS
MLTDQVPTIV KAVQPLMREK SVKTRQDCFL LLRELLNALP GALSNHIDQL MSGIHYSLND
KNSTSNMKID ALGFVYCMLV GHNPQVFHSH IQLLVPLVVN AVFDPFYKIA TEALLVLQQL
VKVIRPVDVQ TTFDFTPYVS QLYTSTLQKL RSPEVDQEVK ERAIACMGQI IANMGDVLQP
ELVTCLPLFM ERLRNEVTRL SSVKALTMIA ASPLRVNLSP IIGEVIPVLG SFLRKNQRAL
KLNSLTLLDT LVTHYSQFLD PKLLRGAVGE VPPLLSESDL HVAQLSLVLL TSVARQQPEA
LVGVHEQILQ EVMTLVRSPL LQGTALNCTL KLFQALVQAQ LPGLSYRHLL GMLMNPVYNQ
QQQGGSPLHK QAYHSLAKCI AALTLQVPNE ALTVAGEFLR EIQNRRNDSH LMFYLLTIGE
IGRHFNLHTI DTLAQTILNC FSASSEDVKG AASHALGAIA VGNLNHYLPF ILNEIEAQPK
RQYLLLHSLK EVISSLSTSK AGLEQLLPSV PSIWTQLFKH CECSEEGSRN VVAECLGKLV
LVNPEELLPR LQMALQSESA LMRTAVVSAI KFTISDQPQP IDPLLRQCIG QFLFALQDPE
PSVRRVALVA FNSAVHNKPS LVRDLLPELL PQLYSETKVK KELIREVEMG PFKHTVDDGL
DIRKAAFECM YTLLEQGLDR VDIMQFLEHV QAGLRDHYDI KMLTYLMTAR LAALCPNAVL
QKLDQFVEPL RATCTLKVKA NSVKQEYEKQ DELKRSALRA VAALLQIPKA DKNIYLAEFL
ILIRSSSELQ PLLESVQKDS SGQLNNNIDG RDLSMDQS
//