ID B7G829_PHATC Unreviewed; 4825 AA.
AC B7G829;
DT 10-FEB-2009, integrated into UniProtKB/TrEMBL.
DT 10-FEB-2009, sequence version 1.
DT 08-NOV-2023, entry version 50.
DE RecName: Full=SAP domain-containing protein {ECO:0000259|PROSITE:PS50800};
GN ORFNames=PHATRDRAFT_48676 {ECO:0000313|EMBL:EEC45464.1};
OS Phaeodactylum tricornutum (strain CCAP 1055/1).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC Bacillariophyceae; Bacillariophycidae; Naviculales; Phaeodactylaceae;
OC Phaeodactylum.
OX NCBI_TaxID=556484 {ECO:0000313|EMBL:EEC45464.1, ECO:0000313|Proteomes:UP000000759};
RN [1] {ECO:0000313|EMBL:EEC45464.1, ECO:0000313|Proteomes:UP000000759}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCAP 1055/1 {ECO:0000313|EMBL:EEC45464.1,
RC ECO:0000313|Proteomes:UP000000759};
RX PubMed=18923393; DOI=10.1038/nature07410;
RA Bowler C., Allen A.E., Badger J.H., Grimwood J., Jabbari K., Kuo A.,
RA Maheswari U., Martens C., Maumus F., Otillar R.P., Rayko E., Salamov A.,
RA Vandepoele K., Beszteri B., Gruber A., Heijde M., Katinka M., Mock T.,
RA Valentin K., Verret F., Berges J.A., Brownlee C., Cadoret J.P.,
RA Chiovitti A., Choi C.J., Coesel S., De Martino A., Detter J.C., Durkin C.,
RA Falciatore A., Fournet J., Haruta M., Huysman M.J., Jenkins B.D.,
RA Jiroutova K., Jorgensen R.E., Joubert Y., Kaplan A., Kroger N., Kroth P.G.,
RA La Roche J., Lindquist E., Lommer M., Martin-Jezequel V., Lopez P.J.,
RA Lucas S., Mangogna M., McGinnis K., Medlin L.K., Montsant A.,
RA Oudot-Le Secq M.P., Napoli C., Obornik M., Parker M.S., Petit J.L.,
RA Porcel B.M., Poulsen N., Robison M., Rychlewski L., Rynearson T.A.,
RA Schmutz J., Shapiro H., Siaut M., Stanley M., Sussman M.R., Taylor A.R.,
RA Vardi A., von Dassow P., Vyverman W., Willis A., Wyrwicz L.S.,
RA Rokhsar D.S., Weissenbach J., Armbrust E.V., Green B.R., Van de Peer Y.,
RA Grigoriev I.V.;
RT "The Phaeodactylum genome reveals the evolutionary history of diatom
RT genomes.";
RL Nature 456:239-244(2008).
RN [2] {ECO:0000313|Proteomes:UP000000759}
RP GENOME REANNOTATION.
RC STRAIN=CCAP 1055/1 {ECO:0000313|Proteomes:UP000000759};
RG Diatom Consortium;
RA Grigoriev I., Grimwood J., Kuo A., Otillar R.P., Salamov A., Detter J.C.,
RA Lindquist E., Shapiro H., Lucas S., Glavina del Rio T., Pitluck S.,
RA Rokhsar D., Bowler C.;
RL Submitted (AUG-2008) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM000620; EEC45464.1; -; Genomic_DNA.
DR RefSeq; XP_002183246.1; XM_002183210.1.
DR PaxDb; 2850-Phatr48676; -.
DR GeneID; 7194909; -.
DR KEGG; pti:PHATRDRAFT_48676; -.
DR eggNOG; ENOG502RWHN; Eukaryota.
DR InParanoid; B7G829; -.
DR OrthoDB; 1952247at2759; -.
DR Proteomes; UP000000759; Chromosome 18.
DR Gene3D; 1.10.720.30; SAP domain; 6.
DR InterPro; IPR003034; SAP_dom.
DR InterPro; IPR036361; SAP_dom_sf.
DR PANTHER; PTHR46551; SAP DOMAIN-CONTAINING RIBONUCLEOPROTEIN; 1.
DR PANTHER; PTHR46551:SF1; SAP DOMAIN-CONTAINING RIBONUCLEOPROTEIN; 1.
DR Pfam; PF02037; SAP; 2.
DR SMART; SM00513; SAP; 6.
DR SUPFAM; SSF68906; SAP domain; 3.
DR PROSITE; PS50800; SAP; 4.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000000759}.
FT DOMAIN 693..727
FT /note="SAP"
FT /evidence="ECO:0000259|PROSITE:PS50800"
FT DOMAIN 826..860
FT /note="SAP"
FT /evidence="ECO:0000259|PROSITE:PS50800"
FT DOMAIN 934..968
FT /note="SAP"
FT /evidence="ECO:0000259|PROSITE:PS50800"
FT DOMAIN 1140..1174
FT /note="SAP"
FT /evidence="ECO:0000259|PROSITE:PS50800"
FT REGION 49..73
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1293..1313
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1368..1426
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1540..1567
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1731..1758
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1774..1801
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1921..1948
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1969..1993
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2109..2138
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2155..2185
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2468..2489
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2530..2567
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3331..3423
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3453..3517
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3601..3679
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3772..3795
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3814..3835
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3888..3924
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 4397..4463
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 4554..4796
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 430..491
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 3549..3593
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 3357..3377
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3381..3395
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3396..3423
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3601..3643
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3650..3679
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3888..3905
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4414..4432
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4562..4623
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4624..4700
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4740..4757
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 4825 AA; 532781 MW; 61B3169BAC7CDF89 CRC64;
MGRSFMIIVP TCVLVGVVGS LLLLSETEAL AGPASFRINL LTTASPLRMA PENSQDFPSS
VSLKTPRPPK RAPSIDRLRS LHEEYEPFFL RNGGTALLDE ETDEESRLSP RQGFLQSRYD
HWQECTVKAL KAELSVRKLQ VSGKKAALVE RLALDDLDNT PDRLVQKAAK TLAQEVFVHP
FTTTEVEKGV PADAIKGAAL TAATLSYLAG NSIVLSGAAA LGAAYLAISP GSAGDAVRAI
GTSAWSSTEV FVDVVKKIGP EHIGETTVGL LHRLSAAAQQ TQFLLQQKSY QSGSSASNNA
IARDKADAID VTITNDTGST ESDEASPTFA FAEKDQIVPK ADTPVVAPAK TKDDRVNRAL
LSYRIELEQT ASQKRLKQRK EQASRGLLAA RLSLETALKE RIIVEQARLA EEARLAEEAR
LAEEVKLAEE ARLAEEARIA EETRRMAEAK VAEEARIAEE ARLAEEAEQS RLAEEARVAK
IKARVAAQEE ESRLAREAQV TAEAQRVALE AQQATVLEIP EAETPTEAVF ETEASSVFDE
VGFSEEDWAA SILAAQKSID GTIVGSDDEE QDTDETESKA SWEAAKLLAE ELSPSEREDL
GKAAREAVEA MELNMNAKIQ EKAVERETWA QEVVEDEGAE DDDENNLDMF FDNEGFDMEA
LAQAARQAVE RYDAESTGET ERESWSESSL RDWASYRVAD LRNELSTRGL PAIGKKMELV
AALEAADLAL SNGEISSSAG VQSEGSGPVM AEEKELLEVE DEYGIMEFED DTAFTFEEDD
DDDENLDDIL PPMEDLAALA AAARAAVRDQ EDLFNTDVIH GSTTDWSQFK MVDLRNELTM
RGLPTVGKKT DLIAALAQSD LDQESAAVMD DDEEEEEYGS FEYDATTLVG AADDKEADLD
SLFGGRGSDL ETLAATAEAV LEMEKPVTSL GRDWSKLTVA QLRTELDKRG LPTVGKKADL
VVALESADRE LDGNAEEEKD VDNLSSNEHV FTVNHDYDDE LSEDDLLNDL LHANGADREA
LAAAARASVD REGVLPEPST DWSRLSPTEL RIELDNRGLP TVGRKGDLVA SLQASDRDLE
REIAQLDRED RVGGLGDLDM AAVARAAREA VKRFESVEEP SDEDLLEIEK EPLLSSATDY
GSLTLAELKD ELRQRGLPLS GNKADLIAKL TASDQVSLVA HRESNEENLR GVSPGLCFGN
GSIRDSPTGA DTRRPVCRPV RRTLLVRRDS GRERRADGFE WILNASSPGC VSRQFRTVNP
WFVVPARFGR TERVTALVPE SLSSIRSRSH RPSRRYHAIP PDDPGWTRAP PDANPWWERS
SVAESRSTPA DPLTTTVPGA WTIVAPQTFD DRPSDATTWV ATDTVRRTGV TPRLPTTPLP
PAAVSANPPL DDDDVASVDA SPQTPVRAET EPSPVVPSRQ RPINNNNNNE AALVLKPVSM
ENALARPKSP EQELKGKSIA PEQVQKVTER LSEEMESQSI GDSLRRIHSL SEKVGSRAAQ
AAKDLRESPQ LPALASRLSD AWTNVAKSRQ EVWDQKMTNR QEAVRAPVSS EDGSVTEVPS
NNDDERPFFL LPSIDVSSLY ASKQQNMDHA MPDGSVEEAS GLAKSGRFPK QSHTEAALVP
SLVSAENTEN TLTVSLKSRY SGEVIAGLGK MQEQVQKVTE RLSEEMESQS IGDSLRQIHS
VSEKVGSRAA QIAKDLRESP QLPALASRLS DAWTNVAKSR QEVWDQKMTN RQEAVRAPVS
SEDGSVTEVP SNNDDERPFF LVPSIDVSSL YASKQQDMDH AVPDGSVEEA SLAKSDRSAK
QSHTEAALVP SLVSAENTEN TLTVSLKSRY SGEVIAGLGK MQEQVQKVTE RLSEEMESQS
IGDSLRQIHS VSEKVGSRAA QIAKDLRESP QLPALASRLS DAWTNVAKSR QEVWDQKMTN
RQEAVRAPVS SEDGSVTEVP SNNDDERPFF LVPSIDVSSL YASKQQNMDH AMPDGSVEEA
SLAKSDRSAK QSHTEAALVP SLVSAENTEN TLTVSAKSRY SGEVIAGLGK MQEQVQKVTE
RLSEEMESQS IGDSLRRIHS VSEKVGSRAA QAAKDLRESP QLPALASRLS DAWTNVAKSR
QEVWDQKMTN RQEAVRAPVS SEDGSVTEVP SNNDDERPFF LVPSIDVSSL YASKQQDMDH
AMPDGSVEEA SLAKSDRSPK QSHTEAALVP SLVSAENTEN TLTVSLKSRY SGEVIAGLGK
MQEQVQKVTE RLSEEMESQS IGDSLRQIHS VSEEVGSRAA QAAKDLRESP QLPALASRLS
DAWTNVAKSG QELWGQKMTN RQRAVGAQAS SEDRIIPGLL NSGSDERPFF LVPSIDVSPL
YARHQRSVLD ISAGVAVEPV STKNGCREED ASSEAGLQLE QLSDEIAHRS TSGKLQFRSY
RQRALNFSNR FDMLSDQVGY REAQVSKSLQ QSLYSQTKAS GLSGAFPSTT QGDDRSVSEP
EEMIILESLN ESQGGAGSPD DDDGEAIADD IPGINISSIL VSEPSAIQVQ QLSKFQTKSS
GICEVLTNAT QGGDRNVTDP QGPSFLDEDR GGAGSPNYDD DEPNIDEIPS INVSSIFESE
SSAIREVLRQ SPENQPECIS TRKVSTNTRK DWVRSVTDPQ RPLVSVSWNE NLEGAGLPDD
ETTMDDILSI NVSSVYVSEP NAVGDVSVEL TDGLLLPKQV RQRPSLNDSA PLSWQGTSRD
AKVSTTTLSK WKWGALLRVG NFRCHLEAVT GRLAESYERT QKFLIDLVSS VEYDKAWTHA
FLQKSQGLTR ENVTLMSDDK SDAVGESLGN EKRASSNSAS VWKAEASTLV SVNRSILEDE
KSAASISQED SGAQVAFPVR RNTQRNQYFE AQPIAQRLFE QILPSVAGEP LRKDVPGYII
QSAVFSTFTW SFVLQRNDLW TSMWLATGAS YLSVTTGWQG DLVRGWSIAV YELIDFGRSD
LAVWARDSAN EFAALAPFQR RIPTPPKNPP PRVLLVWDHS ELFFLVPKSV KAERRTRSLM
EYRFELEAAD RERRRERVAE RNARCLLAAR LELRARRQTI KAPILLPEAQ SFTTVPKTLP
AIDLMEQLFF LVPKSVKAEQ RSRALLEFRI RLESEERKRR RKSVAERNPR SLLVARLQQR
EWQEALSPLD DLPSLPNSTA HIESEVGEIV VVQKTLGSER KKRVREVNNT AKLFEYYREQ
LKALEGRVLA GIRQQSICKQ ERTQRALLES RLKFHATQRK MAGSGRSQRR LIQEQQARAD
QARAANMAWW AEENRAWQNA GPEWHAQLAA DARQDRRIQE AQAVDRVRLT EKVTSQFESN
RIDQEKRLAR EACNTDEWTV TDEARVAVAD RPAREDRAAK RKKQSKAQLN TKIAAEAQRV
KETRSEQKAK QQEAELPQKI SRELPSQQNS KQQNAQKAEE AKRARKLVSE QKGKQQEAQL
AHKAEEAKRV QEILSGQKAK QHKLELVEKA VEAKHVHDIR SEQQVKQRKF ELAQKAAEEQ
DQRNLDKSGM EDKRREKLSS EQKEKQRKGQ KAEKAKLVQD VLSEQEAKQE AWLAQKADEA
KRVQEILSGQ KAKQRKLELA QKVAEAQREQ DIRSEQKAKQ RQLELDKRAI EAKHVRDIIS
NQKTKQRQLE LAQKEADARR EQGIRSEQKA KQLQQELDQK AADAKRVQAA LSEQKTKQRE
AQRGGASRQR KKQQRGEVKR IEENRSIVWA TLLEKARAGQ EGTRKNAETK LKVEATRVPP
EEVLTDNQRK ILKAKKVEEE RIVKQALLAE EKRIAERERK KKQARLIEEQ RNAELTAERE
RKAEQARLAE EKRNDKVAAE QQLLLERAAK AEELRRKEEH EAQATQAAER RKREEVRVDN
VKAHVAAGQA KEESLRVARK QRIATEHAEE QRVLIAALEE ETRLQEEAKR KRIADEEERR
QLVQAPRQQR MDGSGEKPLE ADHPRVISDT FAVQSVPSKA DKDSLAAELP ADEVIDLVAG
SVPAGVKGVN PDGTEEASRD NENGIVRGFE EDSRQAMHHF KDNETRLALE EPESTKSRVV
SEDASPAVWV PKATRSQDET FVDIFTYSKP REGGRNSAIL EMKDTRAAPQ RARQAAQLFV
TLALSRPAIF RAAAAARQSS VLALVNVLDA GYFSFVPTAG PVYFSSMRGA VVSLVSMRWK
VREVLPRAAA LQACSLCLSI LAAPAASFSV ATSHRAVTAL ASRPPRLPDP LPWSSPPPWS
RSPVHLEEPV ATWVDAVARP VSERRERSVS STNTHSSVCF GAIAVSKEVD LDWADDTDTI
SEDDSVKTDR RPLTDQATWV DPDAGLWFVR DEPKSIPTSK NLPRWDPSLP TEESMAGAFL
VVTSLGASVG GIDLPTSCVL GCLAAYLTTR PGAAGTMARR GGTLCYYLTA QAIQTIQDLH
ASGRVQTTIR TLAESVRTSG ESVTMESRAD FDTDTDTNGT VASNPESSDD TVVSSLHQDD
DKTELVFPGN ESSYGDTISS DQDMKDEDTV SFHEDVDTND KLASSNQDSG NVALVSSSYQ
DDEKTELVLP DKESDYGDTI SSDLDMKDDG TVFSHEEIDS NDKIASSYQD SGDVALVSSP
HQDNNRKDHT VSSSHLNDES TEMVWSDEHT KDDDTVSSSQ DNKGDKRVSS SHMAIDTDDT
IESSSRDSGS MASVPSHQEN DNINMDDTIS PYSKHNVDDS DSIPSFDISS LQPIPSIDIS
SMYAPGASQT LDQQQLGLPR PSSPEDLDLI AVSAPTRTPT PRTDSEGTHP VREGPVFDNT
PNRQLLSEYQ SVTDQIDPSP VDDVRREKIP NAETDALLSV DREVPQAKSK TSVPLEMADE
MMAEPSNNIP REPVWTIWPV DQPFR
//