ID A0A0L0D232_THETB Unreviewed; 862 AA.
AC A0A0L0D232;
DT 11-NOV-2015, integrated into UniProtKB/TrEMBL.
DT 11-NOV-2015, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE SubName: Full=Ankyrin {ECO:0000313|EMBL:KNC46175.1};
GN ORFNames=AMSG_00294 {ECO:0000313|EMBL:KNC46175.1};
OS Thecamonas trahens ATCC 50062.
OC Eukaryota; Apusozoa; Apusomonadida; Apusomonadidae; Thecamonas.
OX NCBI_TaxID=461836 {ECO:0000313|EMBL:KNC46175.1, ECO:0000313|Proteomes:UP000054408};
RN [1] {ECO:0000313|EMBL:KNC46175.1, ECO:0000313|Proteomes:UP000054408}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 50062 {ECO:0000313|EMBL:KNC46175.1,
RC ECO:0000313|Proteomes:UP000054408};
RG The Broad Institute Genome Sequencing Platform;
RA Russ C., Cuomo C., Shea T., Young S.K., Zeng Q., Koehrsen M., Haas B.,
RA Borodovsky M., Guigo R., Alvarado L., Berlin A., Bochicchio J.,
RA Borenstein D., Chapman S., Chen Z., Freedman E., Gellesch M., Goldberg J.,
RA Griggs A., Gujja S., Heilman E., Heiman D., Hepburn T., Howarth C., Jen D.,
RA Larson L., Mehta T., Park D., Pearson M., Roberts A., Saif S., Shenoy N.,
RA Sisk P., Stolte C., Sykes S., Thomson T., Walk T., White J., Yandava C.,
RA Burger G., Gray M.W., Holland P.W.H., King N., Lang F.B.F., Roger A.J.,
RA Ruiz-Trillo I., Lander E., Nusbaum C.;
RT "The Genome Sequence of Thecamonas trahens ATCC 50062.";
RL Submitted (MAY-2010) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL349433; KNC46175.1; -; Genomic_DNA.
DR RefSeq; XP_013763150.1; XM_013907696.1.
DR AlphaFoldDB; A0A0L0D232; -.
DR STRING; 461836.A0A0L0D232; -.
DR EnsemblProtists; KNC46175; KNC46175; AMSG_00294.
DR GeneID; 25560107; -.
DR eggNOG; KOG0241; Eukaryota.
DR eggNOG; KOG4177; Eukaryota.
DR OrthoDB; 5475373at2759; -.
DR Proteomes; UP000054408; Unassembled WGS sequence.
DR Gene3D; 1.25.40.20; Ankyrin repeat-containing domain; 1.
DR Gene3D; 2.30.30.190; CAP Gly-rich-like domain; 1.
DR InterPro; IPR002110; Ankyrin_rpt.
DR InterPro; IPR036770; Ankyrin_rpt-contain_sf.
DR InterPro; IPR036859; CAP-Gly_dom_sf.
DR InterPro; IPR000938; CAP-Gly_domain.
DR PANTHER; PTHR24201; ANK_REP_REGION DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24201:SF2; ANKYRIN REPEAT DOMAIN-CONTAINING PROTEIN 42; 1.
DR Pfam; PF13637; Ank_4; 1.
DR Pfam; PF13857; Ank_5; 1.
DR Pfam; PF01302; CAP_GLY; 1.
DR SMART; SM00248; ANK; 3.
DR SMART; SM01052; CAP_GLY; 1.
DR SUPFAM; SSF48403; Ankyrin repeat; 1.
DR SUPFAM; SSF74924; Cap-Gly domain; 1.
DR PROSITE; PS50297; ANK_REP_REGION; 2.
DR PROSITE; PS50088; ANK_REPEAT; 2.
DR PROSITE; PS50245; CAP_GLY_2; 1.
PE 4: Predicted;
KW ANK repeat {ECO:0000256|ARBA:ARBA00023043, ECO:0000256|PROSITE-
KW ProRule:PRU00023}; Reference proteome {ECO:0000313|Proteomes:UP000054408};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 21..55
FT /note="CAP-Gly"
FT /evidence="ECO:0000259|PROSITE:PS50245"
FT REPEAT 570..602
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 642..675
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REGION 84..139
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 176..311
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 346..366
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 696..721
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 746..862
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 103..117
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 220..243
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 780..823
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 862 AA; 91363 MW; F12CFBD08B53ECFA CRC64;
MDHPGQVRFI GTTGFGPAGK VWIGVEMADP IGNCDGSIDG VRYFSAAPDR ALFVEPEGLK
VLPPCLPPPA QSLARDGGCV VPASRYALPP QPTLAAPARQ PSRRRRIQQE IPPEARRKPR
RRRPANPPPA TTSDDDTLPP ELMSILEEAS AIGASPKHTD QLIGAVLHYL RDDDGDEAGG
TVNGGSGLPP QVHRRRRRAP QPEPHPSPVA APAKLDLLSP TEVSDHDEVQ SKVRHTTARR
EQCQRQPRHR RRRRKAPSPA PALPSLAELA SAPAPLSPDF GLSPHASRSY SHEPGTPGSP
ARVQPGSSFD VESDASLLEP VRRMETMLTS IQQLLAKTDD EVAAEAAAEA TVGEPSGPRP
WPRAPKVIYP ARQPKHPRAL EEVRRMEAMN DKILEAELIA KTAQERQIAS VMEPFAVTYA
IDAVIEVGKL WTFHEIQLGL EPTSLTLAVV AEPMWALASL TVSNTNHEPT MASAEWRLSP
VVSGSSLAFT DCSGTFFVGV ARHRPLPGED VALPVTIHID VGGYVAPADD LTLQWRVARA
DDINLARVDA YHYLESSIAD YHVDAIEPHL GRTALHFAAE AGALELAQWL LSRQADANAR
DRYGNTPLFA AARGAQHNPA GALQMVELLL DSGGVVYARN LRGQSPLHIA AATERSGAIA
ALLLARRADP GARDADGQTP VDVGRDAGHG RVVRLFDESD SSPPPEPDSP VGSRLLSPSL
RSQPSTSLAA VSISDVSASC NSSVISGDGG EYGSEYGAPS TPRRINPMSH APSRDLASYV
ITPSKASNRN SGYSSDSSDS SNSSSSSSPS SNSSSSDSPS SPAMTPVPAD PAATPEPALD
SGSASSETEA SALGLEAGDG IM
//