GenomeNet

Database: UniProt
Entry: A0A2G5CUS5_AQUCA
LinkDB: A0A2G5CUS5_AQUCA
Original site: A0A2G5CUS5_AQUCA 
ID   A0A2G5CUS5_AQUCA        Unreviewed;      1124 AA.
AC   A0A2G5CUS5;
DT   31-JAN-2018, integrated into UniProtKB/TrEMBL.
DT   31-JAN-2018, sequence version 1.
DT   24-JAN-2024, entry version 14.
DE   RecName: Full=DUF4042 domain-containing protein {ECO:0000259|Pfam:PF13251};
GN   ORFNames=AQUCO_03700336v1 {ECO:0000313|EMBL:PIA35013.1};
OS   Aquilegia coerulea (Rocky mountain columbine).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; Ranunculales; Ranunculaceae; Thalictroideae;
OC   Aquilegia.
OX   NCBI_TaxID=218851 {ECO:0000313|EMBL:PIA35013.1, ECO:0000313|Proteomes:UP000230069};
RN   [1] {ECO:0000313|EMBL:PIA35013.1, ECO:0000313|Proteomes:UP000230069}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Goldsmith {ECO:0000313|Proteomes:UP000230069};
RA   Hodges S., Kramer E., Nordborg M., Tomkins J., Borevitz J., Derieg N.,
RA   Yan J., Mihaltcheva S., Hayes R.D., Rokhsar D.;
RT   "WGS assembly of Aquilegia coerulea Goldsmith.";
RL   Submitted (SEP-2017) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KZ305054; PIA35013.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A2G5CUS5; -.
DR   Proteomes; UP000230069; Unassembled WGS sequence.
DR   Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 2.
DR   InterPro; IPR011989; ARM-like.
DR   InterPro; IPR016024; ARM-type_fold.
DR   InterPro; IPR025283; DUF4042.
DR   PANTHER; PTHR13366:SF0; HEAT REPEAT-CONTAINING PROTEIN 6; 1.
DR   PANTHER; PTHR13366; MALARIA ANTIGEN-RELATED; 1.
DR   Pfam; PF13251; DUF4042; 1.
DR   SUPFAM; SSF48371; ARM repeat; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000230069}.
FT   DOMAIN          418..602
FT                   /note="DUF4042"
FT                   /evidence="ECO:0000259|Pfam:PF13251"
FT   REGION          346..385
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          390..409
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        360..374
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1124 AA;  123226 MW;  11D60DC95785C9BD CRC64;
     MAMMDEAAAA ANVVRLWRTA FLSLRDETTT SNSSRRPITI MSETELLIQS HLLHKLIFSQ
     FNLLLTAAPK LSPHEVTSDV ILLVELANSV ANATTTTATS ADTFLHTCRL IHDVSCCVCL
     QFNSASWALI LDFLQSLFTC LSPSPINIIN NQLLLPTTST TTTADTPLFH ILHLLRNMVN
     QYGTKCLVPE NTHLLTLLLH IVAFTNSHLL PSPYSSPNAV AHKLHGNNLW EYQILSFTMI
     SDTLSRLGGS SFSPQMWQST LELLRKVTDS LVSNSFLVED TVMSRFYTSL LHCLHLVLSD
     PKGSLSEHVA GFVAALRMFF AYGVTIRSPI AFSDATHKQK KFNHLNRELA ESPGPERGAY
     RPPHLRKREG KSMPSLKATR SQTSVDNESY ALGFSSSDSE HSDSDGFGKD MDNLHSSKTR
     LAAIICIQDL CLADPKAVTA HLTMLLPTND VLQPRKHEAT LMTCLLFDPA LKTRMASASA
     LAAMLSGPSS VFLQVAEYKE STKCGSFTAL SSSLGQTLMQ LHSGILYLVQ REVHNGLLAS
     LFKVLILLIS ATPYARMPEE LLPEVISALR LRMINGFPSR TDQSGLMAML LNCLGAAFST
     SPPSLQVQES LQEEISTDLL GVPGKQSLLA LIFQYSERAT NPTISFEALQ VVRAVSHNYP
     KIMAACWLQV STLTYGLLRA TTAGVSNFES STRPLKGNVE NSVALLGERC IMAAAKVLDE
     CLRAISGFKG TEDVLDDRSL DTPFTSDCTK TKRISSAPSY GLEIDGPESS KRTHTQEFSG
     SQQWCEAIEK HLPLVMFHSS AMVRAASVTC FAGITSSVFF SLTKENQDFI LSSSINAALN
     DEAPSVRSAA CRAIGVISCF PQISRRAEIL EKFIHAIETN TRDPLVSVRI TASWALANIC
     DLLRHRASDF DLDMCSTDSK TYSRSIALLA ESALRLSKDG DKIKSNAVRA LGNLSRFVRF
     TSISTSQYEG ILGGPSLKDV LGDPHWLGRM VQAFVSCVTT GNVKVRWNVC HALSNLFLNE
     TLKLEDMAWA PSVFSILLLL LRDSSNFKIR IHAAAALAVP SSRLDYGSSF ADVVQGLEHV
     LETLGSDQVT APSSFKYRDA LEKQVKPMKI DIVDLTCAQP CFMH
//
DBGET integrated database retrieval system