ID F2U2R9_SALR5 Unreviewed; 851 AA.
AC F2U2R9;
DT 31-MAY-2011, integrated into UniProtKB/TrEMBL.
DT 31-MAY-2011, sequence version 1.
DT 24-JAN-2024, entry version 31.
DE RecName: Full=Macro domain-containing protein {ECO:0000259|Pfam:PF01661};
GN ORFNames=PTSG_11895 {ECO:0000313|EMBL:EGD81913.1};
OS Salpingoeca rosetta (strain ATCC 50818 / BSB-021).
OC Eukaryota; Choanoflagellata; Craspedida; Salpingoecidae; Salpingoeca.
OX NCBI_TaxID=946362 {ECO:0000313|Proteomes:UP000007799};
RN [1] {ECO:0000313|Proteomes:UP000007799}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 50818 {ECO:0000313|Proteomes:UP000007799};
RA Russ C., Cuomo C., Burger G., Gray M.W., Holland P.W.H., King N.,
RA Lang F.B.F., Roger A.J., Ruiz-Trillo I., Young S.K., Zeng Q., Gargeya S.,
RA Alvarado L., Berlin A., Chapman S.B., Chen Z., Freedman E., Gellesch M.,
RA Goldberg J., Griggs A., Gujja S., Heilman E., Heiman D., Howarth C.,
RA Mehta T., Neiman D., Pearson M., Roberts A., Saif S., Shea T., Shenoy N.,
RA Sisk P., Stolte C., Sykes S., White J., Yandava C., Haas B., Nusbaum C.,
RA Birren B.;
RT "Annotation of Salpingoeca rosetta.";
RL Submitted (AUG-2009) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL832960; EGD81913.1; -; Genomic_DNA.
DR RefSeq; XP_004996096.1; XM_004996039.1.
DR AlphaFoldDB; F2U2R9; -.
DR EnsemblProtists; EGD81913; EGD81913; PTSG_11895.
DR GeneID; 16076684; -.
DR KEGG; sre:PTSG_11895; -.
DR InParanoid; F2U2R9; -.
DR OrthoDB; 4270351at2759; -.
DR Proteomes; UP000007799; Unassembled WGS sequence.
DR Gene3D; 3.40.220.10; Leucine Aminopeptidase, subunit E, domain 1; 1.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR002589; Macro_dom.
DR InterPro; IPR043472; Macro_dom-like.
DR Pfam; PF01661; Macro; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR SUPFAM; SSF52949; Macro domain-like; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000007799}.
FT DOMAIN 618..680
FT /note="Macro"
FT /evidence="ECO:0000259|Pfam:PF01661"
FT REGION 55..110
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 738..851
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 55..72
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 89..103
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 748..775
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 824..851
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 851 AA; 91275 MW; 64D8ACB035EAEFD9 CRC64;
MEKAKEIFIA TLGSSNPTTA AVYMNLGVAY TDSGVPIPQS LKDTLSRIQP FANTRMAEQE
QEQEQGEQQQ QGGGDDEAHG VAAKPTAVHD PHQYKKDGDA GSDYNKSSDA SSETTLQRVF
VLETTDTHAS VSHKFVARLC YPNDVHVAVL DPKLGGLLSD LVDAFTKSSV AKDLDVLAAA
SMGNRICFTG SNAVPVWSQA KLFASFLAPA FHPFARSDRV LGDDKIMEHV LNDYGLRLVR
YPTPGLLPHA DTAHGAMRDH PAHRIARGTP AGADTHVRPG RRRVMLLEDS LDYYLDQDRE
RGLSYINSTI GAHIKSQMKI SVTGMQYRFL DSQLLELARK FRVYVRKPSA EDRDACASSV
PVTVEGMQRQ LTAFDNAVQD TLKEHLCHVL GLPVIDTSHV PEMSEADVTF LKRKLFNYLT
KASRTLCRSK DGVVSYVAPQ DVIGTPSERN STFHTIAHIQ PTRCVCRVPV SVTKELVAGQ
QAEAAAQEGV GADIGYTAST GAASATANAS AATASSASSA AGDDGPVDLS AHAVLEGPRK
EVAAASKVLL LTTCVKLFPA KVKQAVWEVE PYQYHTTSET LPAVQLRAEI REGSIINDRD
MNAERSGPLA IKRGARDHHV LHVVAPNFAP EFGSTVTQTA ADLTTAYRAL LTTARDNNIR
SLATCALGCG AFRCSPYASA CTLFEVIDEV GRDGTLFDTI RTFEKYYGEK VGSGLYEDLQ
RSALDGAFTK YRAAIAAKHK RSNADNDDDN DGGDDDDDDD DDDDDDDDDD GGDGGDGGDG
NGGGDNDGVG GGGHDGDQSQ RPLPPQNDVP GDGAVAENND YGVNGIRGSP QDNNNNPNNN
NNNHNILGNA S
//