ID A9V6D4_MONBE Unreviewed; 2640 AA.
AC A9V6D4;
DT 05-FEB-2008, integrated into UniProtKB/TrEMBL.
DT 05-FEB-2008, sequence version 1.
DT 06-MAR-2013, entry version 26.
DE SubName: Full=Predicted protein;
GN ORFNames=27806;
OS Monosiga brevicollis (Choanoflagellate).
OC Eukaryota; Choanoflagellida; Codonosigidae; Monosiga.
OX NCBI_TaxID=81824;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MX1 / ATCC 50154;
RX PubMed=18273011; DOI=10.1038/nature06617;
RG JGI Sequencing;
RA King N., Westbrook M.J., Young S.L., Kuo A., Abedin M., Chapman J.,
RA Fairclough S., Hellsten U., Isogai Y., Letunic I., Marr M., Pincus D.,
RA Putnam N., Rokas A., Wright K.J., Zuzow R., Dirks W., Good M.,
RA Goodstein D., Lemons D., Li W., Lyons J.B., Morris A., Nichols S.,
RA Richter D.J., Salamov A., Bork P., Lim W.A.,
RA Manning G., Miller W.T., McGinnis W., Shapiro H., Tjian R.,
RA Grigoriev I.V., Rokhsar D.;
RT "The genome of the choanoflagellate Monosiga brevicollis and the
RT origin of metazoans.";
RL Nature 451:783-788(2008).
CC -----------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution-NoDerivs License
CC -----------------------------------------------------------------------
DR EMBL; CH991562; EDQ87045.1; -; Genomic_DNA.
DR RefSeq; XP_001748284.1; XM_001748232.1.
DR ProteinModelPortal; A9V6D4; -.
DR GeneID; 5893523; -.
DR KEGG; mbr:MONBRDRAFT_27806; -.
DR eggNOG; NOG308387; -.
DR GO; GO:0005643; C:nuclear pore; IEA:InterPro.
DR GO; GO:0008565; F:protein transporter activity; IEA:InterPro.
DR GO; GO:0006886; P:intracellular protein transport; IEA:InterPro.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR014797; CKK_domain.
DR InterPro; IPR014877; CRM1_C_dom.
DR InterPro; IPR001494; Importin-beta_N.
DR InterPro; IPR011033; PRC_barrel-like.
DR Pfam; PF08683; CAMSAP_CKK; 1.
DR Pfam; PF08767; CRM1_C; 1.
DR SMART; SM01051; CAMSAP_CKK; 1.
DR SUPFAM; SSF48371; ARM-type_fold; 1.
DR SUPFAM; SSF50346; PRCH_cytoplasmic; 1.
DR PROSITE; PS51508; CKK; 1.
DR PROSITE; PS50166; IMPORTIN_B_NT; 1.
PE 4: Predicted;
KW Complete proteome; Reference proteome.
SQ SEQUENCE 2640 AA; 290159 MW; 387AD6161DA55EB1 CRC64;
MAQGSYLQQH QAQAKAHHFE AAAAARAMHT AMTQPPPSAS SHIPHSLNSS HISHISHSPK
IPFHHVISLQ LTCSPMARPD PRSRPGSSHS GPSPDMAGLS HEAVLAAHTS KRPTSAAAMR
RQHNRPYSAR RSTDQSQLAN TRRVVSARPQ LSTTTDKHAT VSREHPMRTT TTLQDEQPHA
DDLADLDRFE AMHVAPQTPP PRRTQSATSP ANVKAGQARY SELVSHLRSP HSQCPPSVPS
QYPPSVPPLP CLIHNLLASH LPCLSSSLSL FLAQKQQAEE LRTTSGPANR RARPISALHA
HKSSRPESNT SNLRRAGATP ESDNPHSPGP SHYHSADARA PLQPTGQSPT EHPTSSHVQA
NDHRLPAHPN SGDARSHAHL HRQSLSRPIR DTNSAGSHRR PHASQHASPD WEDFDLVFGP
EDRIYHHNDP RRHGDHGTEM DTPSDADEEP RSEPEATPPV ASGPGTLPKP SPLQQQSLSS
APPADGNTNN FMSKAQPAVE PPTAATIEPL FPPPAATPND AEDTVVVHFG KGYGTAARAP
APTHRASIPA PHTQTKKPIA RPGRASLTVP AAHDSREVSA LASTHFTVNM APAFGSPSPV
ASADSPSPSA QRRGSTSSAM AYHTRLAEVL PKGRRSLPGP ATASGRRLSV LINATLEASE
QAERRQSQKA NATVLAAARA AARAAAAPGA VDSPAEDAAG RPQPDHESRN EEQDLSNGDA
VKMERGGDPN HSEDSDAEYE DEPDEEPDDT ASRDSSQAPR VSVQIRRDGV DDEVLPALTR
LESTTAMQRL RQVVAASLSS DSEHPDDDMT APSARRNNGR SPFHPALRSP SPMQGTDEDR
QDDDEVVVAA ALKRLATKRS EMQHHAIERF RRSTSTLDVP TGVSLLEQEE RNGATLSPEP
GAGTSRHGIT PPRQAAVNAS PPGPGAGSPE HGMAPLPRRP TPSATTHFFA RHMPQIEEQD
ASDSGSAAVS AGEVEPVQGP RASSAVRRNR RAVATSMSTA PGVQASLHLD RLVPLAPPNG
PDHLQDSAAE IQNPHARRHL ESAVQQVLNQ QGAGQLDSST QEGLVNSILR AFDAAQAPTG
LSQRSLELHA QSSPTAGAPS RTVKYSGPRA INGDARLNGS DGHSDAERRS AQDLNRSLMS
DESATTRSYT SGIASTAEMQ RHHEHKHHQQ QQHQYHQQQH HQQQQYQHHQ QQQHQQQHQH
HQQQQQQQQQ QLQQQQQQQQ QQSDPFQASS PRLAPGSATP NNEGRVRPTA VEFTVALSDT
LPRERQAEDA AKRRERLLAV QRNRLAAQRR ARSGEENTVS SHTTTARNGS HEQDNSSSNN
NKNDSSSRYH HQRHHVAQDD SSDLYHGSVH DEATPSWAQQ RHGGTVSVRY PASTADDEGP
SMHSTSRTQY RDASSPTMRG RTRRSDRSRH GSNAPSGRST AQPEGVPLPA VTLRDNRGIC
RNAIKSPACL GGGAINADRR GKALAAVEAS SLAHFVVVLR DPNNLKFRAI YGLAPERPTV
LQKVFGAGPK VIPADMVAQL FRYDSGAKEF KPLPSRDLSV CDGVAVSVLR RVATGIMDQA
SLTEFERICH VFYEGTTDAQ ERQQAQQILM SFDERPNALE QARTILEQSS QSYAQFIAAS
AITASVTKTM SPLTPADRLQ LRSFLYEYLL TKPSVDQFII TEVTKCIARL TKVSWCDADE
AGNFEARTIL EDTARFFDRG DVYMTIGVMI LNANVCEMSQ SDSVRGMTKH RKISASFRDE
VLFPIFQQSL NMIDAVTAKK VNVADPGRLL NWILQLTKNC LSFDFIGTAG DDSTDDLRTV
QAPTAWRSTI TQETLLPVLF QLYMNLEAPL STHALGILVQ MASIRRTIFN QEQRATHLDQ
LLQGICQIFQ TQQGFKDPGN YHEFCRLLAR LKTNFQLAEL IASKYYEEIV TGMANFTIVS
LTNWQYAPNS LHYVLGLWDR MIHGIPYLKP DHTHQHNLHV FAPRILDAFI QSRMSAVELV
LQNQMDDPLE DQPLLETQLK QAAVIARCEY AEGCRMLVER FDAVGTTYMQ NLTASGPSAP
ATRLAEGQMT WLVYIIGAVL GARSVSVLHD DQDQFDGELI CRCLKLLNAL QEQTQARNAP
VSEQIDIAMI NFFQQLRINY IGEHMNRSVR MQACLEQQLG LGDETALLNL IIEKIISNLR
VWVDGDRILE QTLKLFSDFC LSFNVVRKLV KLQSVQFILA NHTPSNFPFL VHALGRIMTH
EFSEEDQRFE QFMAPLAAVG QQIAQQLQMN GSPRNMELRA LALGFVRDLR GLVFACTTRS
AYMMLFEWIY PDYLQLLVKC AGLFALDSDV ANPILKCMCE LVHNRNSRLQ FGISSPNGIL
LFRETRRVAC PTAPGYCVLG NMLQAYGEQL LQTSVPANGD VYREKYKGIA VCFNILRWAL
TGDYVNFGVF SLYGDAALDR ALGIFFKMLA AIPLEDLNSY PKLSKGYYSL LQAVAKDHTH
CFAQLPADLF SYVIATVADG IQSVTTTIST HCCTTLDFLI TFVVTRRARS KPDMEASVIG
NLLEQCNDKL GEMLYDMFAS VMFEECRNQW SLSRPMLGHF EAVKMRLAQN LAGQKQQVVS
EAFEGLMAKI EPNLSMKNRD RFTANLATFR RQATMISGGN GPSQNGTNYA SSHNPDVMVS
//