ID A0A087Y938_POEFO Unreviewed; 1706 AA.
AC A0A087Y938;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 2.
DT 27-MAR-2024, entry version 40.
DE SubName: Full=Atrophin 1 {ECO:0000313|Ensembl:ENSPFOP00000014541.2};
OS Poecilia formosa (Amazon molly) (Limia formosa).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC Poecilia.
OX NCBI_TaxID=48698 {ECO:0000313|Ensembl:ENSPFOP00000014541.2, ECO:0000313|Proteomes:UP000028760};
RN [1] {ECO:0000313|Proteomes:UP000028760}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=female {ECO:0000313|Proteomes:UP000028760};
RA Schartl M., Warren W.;
RL Submitted (OCT-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSPFOP00000014541.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AYCK01003764; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_007550222.1; XM_007550160.2.
DR STRING; 48698.ENSPFOP00000014541; -.
DR Ensembl; ENSPFOT00000014563.2; ENSPFOP00000014541.2; ENSPFOG00000014520.2.
DR GeneID; 103136758; -.
DR KEGG; pfor:103136758; -.
DR CTD; 1822; -.
DR eggNOG; KOG2133; Eukaryota.
DR GeneTree; ENSGT00940000153615; -.
DR OMA; TQDEYYH; -.
DR OrthoDB; 4273315at2759; -.
DR Proteomes; UP000028760; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR InterPro; IPR002951; Atrophin-like.
DR PANTHER; PTHR13859:SF35; ATROPHIN-1 ISOFORM X1; 1.
DR PANTHER; PTHR13859; ATROPHIN-RELATED; 1.
DR Pfam; PF03154; Atrophin-1; 3.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000028760};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT REGION 1..260
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 277..1011
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1057..1100
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1114..1150
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1370..1560
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..45
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 62..78
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 81..175
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 176..225
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 226..260
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 280..310
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 311..327
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 338..354
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 355..400
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 416..442
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 448..487
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 493..516
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 546..572
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 605..621
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 645..690
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 749..765
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 766..792
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 793..807
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 808..833
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 842..964
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1057..1087
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1114..1128
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1373..1387
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1394..1451
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1452..1470
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1471..1490
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1498..1534
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1706 AA; 180353 MW; DD30A52F5C7B35CE CRC64;
MKTRTHKESM PMRSGRRRGA SEERRGRRPH TSPTRPERTD RQTQRGGGEE LAGNRFSCRS
QGHDSSESEG EELVSPPKRQ KVQDSAPNPN PPTSTHLTES STTSTVPPPT SAASQSRESD
NEDGQSQGSR SSAVGSLANS SSSLSSGRDI DQDNRSSSPS LSASPLGSLD SDSDGPDSPK
QGEREKGKEV GAGKVTGEDR RTLREGRGEE SCGDGEKRDV ETIEDSSSLK PPSTPCSSSS
LTPSVRGGDS SNDSNSGRKS YFSLDSKLMC KVEYGSPAGV ESNRMTSKAS TQCINKTTIS
GGDFSHNSPN IPHSLPPPLP PPPALKPLEL GGQNLPAEVK TERDKTEKTE KLMDKAQSTP
PSLLPQCGPQ PLSQSQPQTQ PSSHPHHYSS SWQGGAATGC QGSWGYSRYP GSHHPQHQPP
VQQQQLPSVY NPPSSRHSSS HPSYLPHPHP HPHREYLPRY AGGGGDRERG PTGERERGVR
GECGGRELTR EFSAPVGNSS NSNGGGSSNN GCGAMSVPNS IPAREFGGMP VGQNREYQGS
GRDGPNLGPD RRDFGSAFRD REREREREGG REFNLPNQNQ NRDFGPTVPT GGHPRDKDGN
RWSEFGSQTR EVVSNSNPNN NSIPPGNPPS STSGLPVAPM LNRDPPASPQ NNLSHPSHSS
LPQHPHSHPP NSSNRDFPPP MDQTQMPSTG ADHFHRDYPP SGGKDFPAGA PPSAGTNREY
LSSPGVTPNL GREYPGTGGT QHAHPPHPHY QPGPKDRERD SNLRESALYQ SRGGPNQPPA
LSPSSSSSHH GQYPHPPPQP PVHPPQSSHS QAPQSTMGPS TRPPHYQSSA QTPPTPLSPL
PSPSTNQMGA FSSFPSGSSS APTSQLPVPG VSSSCSPGCR PSSFHGTLNN HPQFSGTYHS
NGSNGSTMAN SSGNSSAVSS SSNTNSQAPS PQNVSKAPPP LSNSTNNNTN VSTPALTSSL
PGVEGHSDSG LPPTAVIKEE PPEDREETES PPPVLRSPSP EPKHVDIPIH ASQSARFHKV
LDRGSRNSCA RSDVLFVPLD GSKLWKKRNE MIERARREVE QRARDLREKE RERERERERE
LDRHLQNQKD VSAAGGGRQG SSLFFPPSSS IILDPSSSSA SSSANVVAHP PAHPQHHPSH
PHAHLPPTHH LHPSLSHAIP HSLLLPTMGG ASAVVGGPQG ALGIGLGGPY LGPDTPALRT
LSEYARPHAM SPLGAASRAQ AHHPQVHHGH PHVHPSFFLP QLQNHALSHP HHLPTDAATA
AAILGFLYGG SLEGGPGVPG HPGVAGGPVP GGIGGAGLGG MGFPHAMAAH RDRLKPGFEF
KSDERVYPAG SIPDPAAIAL AHSHAHAHSN AHAHAHSLLL AGGAAANEVS LYGTPPPPAP
PGPPHLQNPT LAQVTRPPQP PAPQSLSNPP PSSLLPPSLP SHPSSAPLAA PSAPAPPSAP
PAAPPQPAPP TSNSSSLHHP VPHSSFPNSL SSHVPPPPAP AAPPETYPTP TRSPASYERD
RSGARERERE RDRAALPAFG DRERERERER ERGGSGGGGG AGGGNGGGTG GGGGGENLGR
LQMLNVTPHH HQHSHIHSHL HLHQQDTAAG GVHPLMDPLA SGSPLARLPY PGAALGTPIL
AHPLTDSEVL RQQLFGAPFR DLPQPSSLTG PMSAAHQLQA MQQAQSAELQ IQRLALEQQW
IHHHHHHSLT QDEYYSHLKK ESDKTL
//