ID F7W0X1_SORMK Unreviewed; 997 AA.
AC F7W0X1;
DT 21-SEP-2011, integrated into UniProtKB/TrEMBL.
DT 21-SEP-2011, sequence version 1.
DT 27-MAR-2024, entry version 42.
DE SubName: Full=WGS project CABT00000000 data, contig 2.18 {ECO:0000313|EMBL:CCC11423.1};
GN ORFNames=SMAC_02081 {ECO:0000313|EMBL:CCC11423.1};
OS Sordaria macrospora (strain ATCC MYA-333 / DSM 997 / K(L3346) / K-hell).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Sordariomycetidae; Sordariales; Sordariaceae; Sordaria.
OX NCBI_TaxID=771870 {ECO:0000313|EMBL:CCC11423.1, ECO:0000313|Proteomes:UP000001881};
RN [1] {ECO:0000313|EMBL:CCC11423.1, ECO:0000313|Proteomes:UP000001881}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC MYA-333 / DSM 997 / K(L3346) / K-hell
RC {ECO:0000313|Proteomes:UP000001881};
RC TISSUE=Mycelium {ECO:0000313|EMBL:CCC11423.1};
RX PubMed=20386741; DOI=10.1371/journal.pgen.1000891;
RA Nowrousian M., Stajich J., Chu M., Engh I., Espagne E., Halliday K.,
RA Kamerewerd J., Kempken F., Knab B., Kuo H.C., Osiewacz H.D., Poeggeler S.,
RA Read N., Seiler S., Smith K., Zickler D., Kueck U., Freitag M.;
RT "De novo assembly of a 40 Mb eukaryotic genome from short sequence reads:
RT Sordaria macrospora, a model organism for fungal morphogenesis.";
RL PLoS Genet. 6:E1000891-E1000891(2010).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CCC11423.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CABT02000018; CCC11423.1; -; Genomic_DNA.
DR RefSeq; XP_003350368.1; XM_003350320.1.
DR AlphaFoldDB; F7W0X1; -.
DR STRING; 771870.F7W0X1; -.
DR GeneID; 10807922; -.
DR KEGG; smp:SMAC_02081; -.
DR VEuPathDB; FungiDB:SMAC_02081; -.
DR eggNOG; KOG3780; Eukaryota.
DR HOGENOM; CLU_008578_1_0_1; -.
DR InParanoid; F7W0X1; -.
DR OMA; WETEYGN; -.
DR OrthoDB; 3450674at2759; -.
DR Proteomes; UP000001881; Unassembled WGS sequence.
DR Gene3D; 2.60.40.640; -; 1.
DR InterPro; IPR014752; Arrestin-like_C.
DR InterPro; IPR011022; Arrestin_C-like.
DR PANTHER; PTHR11188; ARRESTIN DOMAIN CONTAINING PROTEIN; 1.
DR PANTHER; PTHR11188:SF17; FI24305P1-RELATED; 1.
DR Pfam; PF02752; Arrestin_C; 1.
DR SMART; SM01017; Arrestin_C; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000001881}.
FT DOMAIN 505..723
FT /note="Arrestin C-terminal-like"
FT /evidence="ECO:0000259|SMART:SM01017"
FT REGION 1..59
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 76..128
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 887..977
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 8..43
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 94..114
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 933..955
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 997 AA; 110505 MW; 5B7F9D137AD66383 CRC64;
MFPTGPQQRI RQRRQPYSSS ARGSIYYTPT TGRNRRSVPA SPASPGTPSF PSPLIGPRQR
RPHSIHIIAY PAGFIPHELR PDPLAEKDRK RKEQKDRKAR KKAEKKHNKR RKLEEKARER
RSEQLSPVRL LPPAGEAVHK VIDKVLSVFS IKAKNPPKSA TLASQATDTA ARTRRSRPSS
FIGHFRRSKY ETTVRPEPTS DQSTMTAVAS IPASPVVLPN NRSSMVSACT CKSAVSSVTE
LQKPVASGSG VTCSIVLAEP NVFLTGFDHN DRAREENHGA SALLRGKLQL NVSKNVKLKS
VTLKLIGKAR TEWPEGIPPN KTETYEEQVL RTQSLVFFQA IHGGQWETEY GNQCTYSLKG
SGCSHNPRSS LSSNFSNGSF SHLTGKSRHS TSLTAKELKR LSLQSVNSRS FGKGESPFAS
QIHAKGYKVF QAGTYEYTFE LPIDHHQLET TKLQYGSVKW ELEATVERAG AFKPNLHGTK
EVSIVRLPDS MSLEMSEPIS ISRQWEDQLH YDIMISGKSF PIGAKIPIAF KLTPLAKVQV
HKLKVFVTES IEYWTNDRHV TRKDAGRKIL LLEKTAGKPL DKQFETSDFR ILSGGELSPE
VREEARRSAA ARRMMQAART NAPPPPLPNP TENLLGDLDL GLETFWGSTE LEMNVQIPTC
AMMSRDKNLR LHPDCSWKNV NVFHWIKIVM RISRLDPEDP LGKRRRHFEI SIDSPFTVLN
CRATQANTAL PQYGQDGPMP FERQQTSCGC PDADVLDRSA SSSFGQLQLV EQDGLSPSGS
RLGLNRLSAS VLPSIPQAAH VHDSHAPGGL PVNRARGHTL PSPLEAQRPI HLIRCPSYNP
PAFDADEPPP PLQADLITPP PHYDAIIGTP SVDGLADYFA RLADYEDTER PQMGDRSGSQ
ESVATLRGIP RPTSRDEGFF ASNEVVTGSD LPVNKDYDDD SSSSADEEEG PARPHRGGRV
NVANPRTPGG RLIPSRSLEI ERPVMRLDMG SLTTRRR
//