ID R7UKP0_CAPTE Unreviewed; 1107 AA.
AC R7UKP0;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 41.
DE RecName: Full=HEAT repeat-containing protein 6 {ECO:0000256|ARBA:ARBA00015263};
GN ORFNames=CAPTEDRAFT_228236 {ECO:0000313|EMBL:ELU07074.1};
OS Capitella teleta (Polychaete worm).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Annelida; Polychaeta;
OC Sedentaria; Scolecida; Capitellidae; Capitella.
OX NCBI_TaxID=283909 {ECO:0000313|EMBL:ELU07074.1};
RN [1] {ECO:0000313|Proteomes:UP000014760}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=I ESC-2004 {ECO:0000313|Proteomes:UP000014760};
RA Hellsten U., Grimwood J., Chapman J.A., Shapiro H., Aerts A., Otillar R.P.,
RA Terry A.Y., Boore J.L., Simakov O., Marletaz F., Cho S.-J.,
RA Edsinger-Gonzales E., Havlak P., Kuo D.-H., Larsson T., Lv J., Arendt D.,
RA Savage R., Osoegawa K., de Jong P., Lindberg D.R., Seaver E.C.,
RA Weisblat D.A., Putnam N.H., Grigoriev I.V., Rokhsar D.S.;
RL Submitted (DEC-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:ELU07074.1, ECO:0000313|Proteomes:UP000014760}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=I ESC-2004 {ECO:0000313|EMBL:ELU07074.1,
RC ECO:0000313|Proteomes:UP000014760};
RX PubMed=23254933; DOI=10.1038/nature11696;
RA Simakov O., Marletaz F., Cho S.J., Edsinger-Gonzales E., Havlak P.,
RA Hellsten U., Kuo D.H., Larsson T., Lv J., Arendt D., Savage R.,
RA Osoegawa K., de Jong P., Grimwood J., Chapman J.A., Shapiro H., Aerts A.,
RA Otillar R.P., Terry A.Y., Boore J.L., Grigoriev I.V., Lindberg D.R.,
RA Seaver E.C., Weisblat D.A., Putnam N.H., Rokhsar D.S.;
RT "Insights into bilaterian evolution from three spiralian genomes.";
RL Nature 493:526-531(2013).
RN [3] {ECO:0000313|EnsemblMetazoa:CapteP228236}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (JUN-2015) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMQN01007193; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; KB300180; ELU07074.1; -; Genomic_DNA.
DR AlphaFoldDB; R7UKP0; -.
DR STRING; 283909.R7UKP0; -.
DR EnsemblMetazoa; CapteT228236; CapteP228236; CapteG228236.
DR HOGENOM; CLU_007141_1_0_1; -.
DR OMA; VRTNAAW; -.
DR Proteomes; UP000014760; Unassembled WGS sequence.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 3.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR025283; DUF4042.
DR PANTHER; PTHR13366:SF0; HEAT REPEAT-CONTAINING PROTEIN 6; 1.
DR PANTHER; PTHR13366; MALARIA ANTIGEN-RELATED; 1.
DR Pfam; PF13251; DUF4042; 1.
DR SUPFAM; SSF48371; ARM repeat; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000014760}.
FT DOMAIN 353..533
FT /note="DUF4042"
FT /evidence="ECO:0000259|Pfam:PF13251"
FT REGION 271..321
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 650..671
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 271..286
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1107 AA; 122518 MW; BF2CFF69ACC78F86 CRC64;
MEHELCFDQN PFAAGEVRKF RELRLKLLKL VYKDNEEYLA ELTELLDALN SLEYSHKFVR
EETVGELLAQ CCYLVPLANE RLVSKFCLLI CNLVNKQQLV LVERLETVTE YVVRGLHKCT
TWVVFDLLRA LGVIVYENTG RLTQYHEDLL GEHGLLIRLC TSQDENVLRA VIQCIENLTI
REDCQVRCFR CLLSLLQTPK PTAMETMTHC KLLMCGLKGM QNLLVVTHCL PADALGPLLA
VLKVLMFHGL PGCPTNIPLS LYPTPLTQFD PSTSSSAATT VTPKKASNVK KGKKKQAQRK
KKGGGEEEGE EEEKPNTEDV YAMSWSKVSS SDSEYSDAEG GVQASRTKNT YSRVRQCAFC
SFHALIKATE RRVMFGYWTC FIPDSPGEAA MAHTLLAAIL KDPSTKSRRA AIAVLTVLLE
SSRQFLMAAE DSDQYRSAPY TPFSVTLGHM IKEIHTCLLQ ALSSEASLLA ITQIIKCLAT
LILNVPYQRL QAGLLTAVVT QIPAFISHRD PNICVACLTC LGSVASSQAQ PTEVNALLLP
PRLTEAPPSE QATASGELSW VLKICVKNTA PSLLLDPAER SSSKTAAVQP LPVRLESLQL
LSQLAKSHFG VLRPRLTLIR DLIQLCFMDK DPAVQLHGAK LLDGVSTVVQ QQQQQQQEEG
EGEAPPALPR EPMSKQQMLD LWSPLLAGPV PNLLQTATYE AVKYTLCDCL ANVGPSVFAL
LPREQHILCL TLLLGLASDE DFHVRAAAVR ALGVYVRYPC LREDVCFMAD AANAVLTALD
EQSLVVRAKA AWALGNLSNA IVLNRSHEAF VHGFSDMLLL KLFEVATAAS KDSDRVRSNA
VRALGNLLRY LPPKSHGEVG LQSEALMFMF AATDSCRFKQ TVEAAIESLI TNVATGTMKV
RWNSCYALGN MLQNEPLLLD SQAADSARMR SKVFRTLAEA VHDCRNFKVR INAALALAVP
SRRSHYGSDF VLIWQSLLRA LQVLEENSDF TEFRYLNSLI EQIGGSLVHL AAILEADDLS
ALAKDSQHQA LLLCYLEKYK QLIHENHSVS GAQRRTALSE AERNVRSLQP HREEVFLDAL
SVAFDDCIPD AVELKNPQKS AFVQSYD
//