Database: Pfam
Entry: bCoV_NSP3_N
#=GF ID   bCoV_NSP3_N
#=GF AC   PF12379.10
#=GF DE   Betacoronavirus replicase NSP3, N-terminal
#=GF PI   DUF3655; Corona_NSP3a;
#=GF AU   Gavin OL;
#=GF AU   Chuguransky S;0000-0002-0520-0736
#=GF SE   Prosite
#=GF GA   26.70 26.70;
#=GF TC   28.50 143.50;
#=GF NC   25.60 24.10;
#=GF BM   hmmbuild HMM.ann SEED.ann
#=GF SM   hmmsearch -Z 57096847 -E 1000 --cpu 4 HMM pfamseq
#=GF TP   Family
#=GF RN   [1]
#=GF RM   18367524
#=GF RT   Proteomics analysis unravels the functional repertoire of
#=GF RT   coronavirus nonstructural protein 3.
#=GF RA   Neuman BW, Joseph JS, Saikatendu KS, Serrano P, Chatterjee A,
#=GF RA   Johnson MA, Liao L, Klaus JP, Yates JR 3rd, Wuthrich K, Stevens
#=GF RA   RC, Buchmeier MJ, Kuhn P;
#=GF RL   J Virol. 2008;82:5279-5294.
#=GF RN   [2]
#=GF RM   31776274
#=GF RT   Nucleocapsid Protein Recruitment to Replication-Transcription
#=GF RT   Complexes Plays a Crucial Role in Coronaviral Life Cycle.
#=GF RA   Cong Y, Ulasli M, Schepers H, Mauthe M, V'kovski P, Kriegenburg
#=GF RA   F, Thiel V, de Haan CAM, Reggiori F;
#=GF RL   J Virol. 2020; [Epub ahead of print]
#=GF RN   [3]
#=GF RM   23943763
#=GF RT   Severe acute respiratory syndrome coronavirus nonstructural
#=GF RT   proteins 3, 4, and 6 induce double-membrane vesicles.
#=GF RA   Angelini MM, Akhlaghpour M, Neuman BW, Buchmeier MJ;
#=GF RL   mBio. 2013; [Epub ahead of print]
#=GF RN   [4]
#=GF RM   17728234
#=GF RT   Nuclear magnetic resonance structure of the N-terminal domain of
#=GF RT   nonstructural protein 3 from the severe acute respiratory
#=GF RT   syndrome coronavirus.
#=GF RA   Serrano P, Johnson MA, Almeida MS, Horst R, Herrmann T, Joseph
#=GF RA   JS, Neuman BW, Subramanian V, Saikatendu KS, Buchmeier MJ,
#=GF RA   Stevens RC, Kuhn P, Wuthrich K;
#=GF RL   J Virol. 2007;81:12049-12060.
#=GF DR   INTERPRO; IPR024358;
#=GF DR   SO; 0100021; polypeptide_conserved_region;
#=GF CC   This family corresponds to the N-terminal domain of NSP3
#=GF CC   (non-structural protein 3, also written nsp3) of
#=GF CC   Betacoronavirus; NSP3 is encoded on the replicase polyprotein.
#=GF CC   The family includes the NSP3a domain, which comprises the
#=GF CC   ubiquitin-like 1 (UB1) and glutamic acid-rich acidic (AC)
#=GF CC   hypervariable domains [1]. NSP3a interacts with numerous other
#=GF CC   proteins involved in replication and transcription and may serve
#=GF CC   as a scaffolding protein for these processes. The N-terminal
#=GF CC   NSP3a domain interacts with the N (nucleocapsid) protein to
#=GF CC   colocalise genomic RNA with the nascent replicase-transcriptase
#=GF CC   complex at the earliest stages of infection, an interaction that
#=GF CC   is essential for the virus [3]. The C-terminal Glu-rich
#=GF CC   subdomain is best described as a flexible tail attached to the
#=GF CC   globular UB1 subdomain [4].
#=GF CC   The family is found in association with Pfam:PF08716,
#=GF CC   Pfam:PF01661, Pfam:PF05409, Pfam:PF06471, Pfam:PF08717,
#=GF CC   Pfam:PF06478, Pfam:PF09401, Pfam:PF06460, Pfam:PF08715,
#=GF CC   Pfam:PF08710.
#=GF SQ   19
#=GS Q6UZF5_SARS/881-1029      AC Q6UZF5.1
#=GS B8Q8S4_SARS/881-1029      AC B8Q8S4.1
#=GS A0A0U1WHG0_SARS/881-1026  AC A0A0U1WHG0.1
#=GS A0A0K1Z0N1_SARS/881-1029  AC A0A0K1Z0N1.1
#=GS R1A_SARS/881-1029         AC P0C6U8.1
#=GS R1AB_SARS/881-1029        AC P0C6X7.1
#=GS R1AB_SARS/881-1029        DR PDB; 2ACF A; 184-211;
#=GS R1AB_SARS/881-1029        DR PDB; 2ACF C; 184-211;
#=GS R1AB_SARS/881-1029        DR PDB; 2FAV A; 2-30;
#=GS R1AB_SARS/881-1029        DR PDB; 2GRI A; 63-112;
#=GS R1AB_SARS/881-1029        DR PDB; 2FAV C; 3-30;
#=GS R1AB_SARS/881-1029        DR PDB; 2IDY A; 63-112;
#=GS R1AB_SARS/881-1029        DR PDB; 2FAV B; 3-30;
#=GS R1AB_SARS/881-1029        DR PDB; 2ACF B; 184-211;
#=GS R1AB_SARS/881-1029        DR PDB; 2ACF D; 184-211;
#=GS E0XIZ2_9BETC/881-1012     AC E0XIZ2.1
#=GS R1A_SARS2/880-1051        AC P0DTC1.1
#=GS R1AB_SARS2/880-1051       AC P0DTD1.1
#=GS R1AB_SARS2/880-1051       DR PDB; 6YWL A; 3-29;
#=GS R1AB_SARS2/880-1051       DR PDB; 6W02 B; 3-29;
#=GS R1AB_SARS2/880-1051       DR PDB; 6VXS B; 3-29;
#=GS R1AB_SARS2/880-1051       DR PDB; 6WEN A; 3-29;
#=GS R1AB_SARS2/880-1051       DR PDB; 6VXS A; 3-29;
#=GS R1AB_SARS2/880-1051       DR PDB; 6YWL B; 3-29;
#=GS R1AB_SARS2/880-1051       DR PDB; 6YWL E; 3-29;
#=GS R1AB_SARS2/880-1051       DR PDB; 6YWM A; 3-29;
#=GS R1AB_SARS2/880-1051       DR PDB; 6YWM B; 3-29;
#=GS R1AB_SARS2/880-1051       DR PDB; 6YWK C; 3-29;
#=GS R1AB_SARS2/880-1051       DR PDB; 6YWM C; 3-29;
#=GS R1AB_SARS2/880-1051       DR PDB; 6WEY A; 207-233;
#=GS R1AB_SARS2/880-1051       DR PDB; 6W6Y B; 3-29;
#=GS R1AB_SARS2/880-1051       DR PDB; 6YWL D; 3-29;
#=GS R1AB_SARS2/880-1051       DR PDB; 6YWK D; 3-29;
#=GS R1AB_SARS2/880-1051       DR PDB; 6YWK B; 3-29;
#=GS R1AB_SARS2/880-1051       DR PDB; 6W02 A; 4-29;
#=GS R1AB_SARS2/880-1051       DR PDB; 6W6Y A; 3-29;
#=GS R1AB_SARS2/880-1051       DR PDB; 6WCF A; 4-29;
#=GS R1AB_SARS2/880-1051       DR PDB; 6YWL C; 3-29;
#=GS R1AB_SARS2/880-1051       DR PDB; 6YWK A; 3-29;
#=GS R1AB_SARS2/880-1051       DR PDB; 6YWK E; 3-29;
#=GS R1AB_BC279/881-1027       AC P0C6V9.1
#=GS R1A_BC279/881-1027        AC P0C6F5.1
#=GS B8Q8S5_SARS/881-1029      AC B8Q8S5.1
#=GS B8Q8U7_SARS/881-1029      AC B8Q8U7.1
#=GS F2YDB1_SARS/881-1029      AC F2YDB1.1
#=GS A0A0K1YZY7_SARS/881-1029  AC A0A0K1YZY7.1
#=GS R9QTB2_SARS/881-1022      AC R9QTB2.1
#=GS A0A0U1WHK4_SARS/881-1029  AC A0A0U1WHK4.1
#=GS B8Q8U6_SARS/881-1029      AC B8Q8U6.1
#=GS F2YDB2_SARS/881-1029      AC F2YDB2.1
Q6UZF5_SARS/881-1029                 VKTLQPVSDLLTNMGIDLDEWSVATFYLFDDAGEENFSSRMYCSFYPPDEEEEDDAECEEEE.I.DETCEHEYGTEDDYQGLPLEFGASAETVRVE.EEEEEDWLDDTTEQ.......S............EIEP..EPEPTP...EEPVNQFTGYLKLTDNVAIKCVDIVKEAQS..
B8Q8S4_SARS/881-1029                 VKTLQPVSDLLTNMGIDLDEWSVATFYLFDDAGEENFSSRMYCSFYPPDEEEEDDAECEEEE.I.DETCEHEYGTEDDYQGLPLEFGASAETVRVE.EEEEEDWLDDTTEQ.......S............EIEP..EPEPTP...EEPVNQFTGYLKLTDNVAIKCVDIVKEAQS..
A0A0U1WHG0_SARS/881-1026             VKTLQPVSDLLTKMGIDLDEWSVATFYLFDDAGEEKLSSRMYCSFYPPDEEE-DCEECEDEEeVpCETCEHEYGTEDDYKGLPLECGSSIEIQQVE.D-EEEDWLDDAG--.......-............EAEP..EPESLP...EEPVNQFTGYFKLTDNVAIKCVDIVKEAQS..
A0A0K1Z0N1_SARS/881-1029             VKTLQPVSDLLTEMGVVLDEWSVATFYLFDDAGEENLSSRMYCSFYPPDEEEEDDVECEEEE.I.DETCEHEYGTEDDYQGLPLEFGASAETVQVE.EEEEEDWLDDTTEQ.......S............EIEP..EPESTP...EEPVNQFTGYLKLTDNVAIKCVDIVKEAQS..
R1A_SARS/881-1029                    VKTLQPVSDLLTNMGIDLDEWSVATFYLFDDAGEENFSSRMYCSFYPPDEEEEDDAECEEEE.I.DETCEHEYGTEDDYQGLPLEFGASAETVRVE.EEEEEDWLDDTTEQ.......S............EIEP..EPEPTP...EEPVNQFTGYLKLTDNVAIKCVDIVKEAQS..
R1AB_SARS/881-1029                   VKTLQPVSDLLTNMGIDLDEWSVATFYLFDDAGEENFSSRMYCSFYPPDEEEEDDAECEEEE.I.DETCEHEYGTEDDYQGLPLEFGASAETVRVE.EEEEEDWLDDTTEQ.......S............EIEP..EPEPTP...EEPVNQFTGYLKLTDNVAIKCVDIVKEAQS..
#=GR R1AB_SARS/881-1029        SS    HHTTXSXHHHHHHHTXXHHHHTTSXXEEEXSSSSSXXSSBXEEESSXSXXXXXXXXXXXXXX.X.XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX.XXXXXXXXXXXXXX.......X............XXXX..XXXXXX...XX-S----S-EESSSSEEEEES-HHHHHHH..
E0XIZ2_9BETC/881-1012                VKTLQPISELLTPMGIDLDEWSVAKFYLFDESGEAVLSSHMYCSFYPPDEEEEEDLE-----.-.-ESEDVEYGTEDDYTGAPLEFGASSTVEQDEvHDEEEDWLAPQ-EE.......S............----..------...EVLYDQFTDYHKLTDNVFIKCADIVEES--lk
R1A_SARS2/880-1051                   IKTLQPVSELLTPLGIDLDEWSMATYYLFDESGEFKLASHMYCSFYPPDEDEE-EGDCEEEE.F.EPSTQYEYGTEDDYQGKPLEFGATSAALQPE.EEQEEDWLDDDSQQtvgqqdgSednqtttiqtivEVQPqlEMELTPvvqTIEVNSFSGYLKLTDNVYIKNADIVEEAK-k.
R1AB_SARS2/880-1051                  IKTLQPVSELLTPLGIDLDEWSMATYYLFDESGEFKLASHMYCSFYPPDEDEE-EGDCEEEE.F.EPSTQYEYGTEDDYQGKPLEFGATSAALQPE.EEQEEDWLDDDSQQtvgqqdgSednqtttiqtivEVQPqlEMELTPvvqTIEVNSFSGYLKLTDNVYIKNADIVEEAK-k.
#=GR R1AB_SARS2/880-1051       SS    XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX-XXXXXXXX.X.XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX.XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTT---SEEESSSSEEEEES-HHHHHH-H.
R1AB_BC279/881-1027                  VKTLQPVSDLLTNMGIDLDEWSVATFYLFDDAGEEKLSSRMYCSFYPPDEEEDCEEYEDEEE.IpEETCEHEYGTEDDYKGLPLEFGASTEIQQVD.EEEEEDWLEEAI--.......-............AAKP..EPEPLP...EEPVNQFTGYLKLTDNVAIKCVDIVKEAQ-h.
R1A_BC279/881-1027                   VKTLQPVSDLLTNMGIDLDEWSVATFYLFDDAGEEKLSSRMYCSFYPPDEEEDCEEYEDEEE.IpEETCEHEYGTEDDYKGLPLEFGASTEIQQVD.EEEEEDWLEEAI--.......-............AAKP..EPEPLP...EEPVNQFTGYLKLTDNVAIKCVDIVKEAQ-h.
B8Q8S5_SARS/881-1029                 VKTLQPVSDLLTNMGIDLDEWSVATFYLFDDAGEENFSSRMYCSFYPPDEEEEDDAECEEEE.I.DETCEHEYGTEDDYQGLPLEFGASAETVRVE.EEEEEDWLDDTTEQ.......S............EIEP..EPEPTP...EEPVNQFTGYLKLTDNVAIKCVDIVKEAQS..
B8Q8U7_SARS/881-1029                 VKTLQPVSDLLTNMGIDLDEWSVATFYLFDDAGEENFSSRMYCSFYPPDEEEEDDAECEEEE.I.DETCEHEYGTEDDYQGLPLEFGASAETVRVE.EEEEEDWLDDTTEQ.......S............EIEP..EPEPTP...EEPVNQFTGYLKLTDNVAIKCVDIVKEAQS..
F2YDB1_SARS/881-1029                 VKTLQPVSDLLTNMGIDLDEWSVATFYLFDDAGEENFSSRMYCSFYPPDEEEEDDAECEEEE.I.DETCEHEYGTEDDYQGLPLEFGASAETVRVE.EEEEEDWLDDTTEQ.......S............EIEP..EPEPTP...EEPVNQFTGYLKLTDNVAIKCVDIVKEAQS..
A0A0K1YZY7_SARS/881-1029             VKTLQPVSDLLTEMGVVLDEWSVATFYLFDDAGEENLSSRMYCSFYPPDEEEEDDVECEEEE.I.DETCEHEYGTEDDYQGLPLEFGASAETVQVE.EEEEEDWLDDTTEQ.......S............EIEP..EPESTP...EEPVNQFTGYLKLTDNVAIKCVDIVKEAQS..
R9QTB2_SARS/881-1022                 VKTLQPVSDLLTNMGIDLDEWSVATFYLFDDAGEEKLSSRMYCSFYPPDEEEECEEYEEEEE.VpEQTCEHEYGTEDDYKGLPLEFGAST--QQVD.EEEEEDWLD-----.......-............EAEP..EPESLS...EEPVNQFTGYLKLTDNVAIKCVDIVKEAQS..
A0A0U1WHK4_SARS/881-1029             VKTLQPVSDLLNNMGIDLDEWSVATFYLFDDAGEEKLSSRMYCSFYPPDEEEDCDEYEEEEE.VtEESCAHEYGTEEDYQGLPLEFGASTE-MQVE.EEAEEDWLGDATEL.......S............EHEL..EPEPTP...EESVNQFTGYLKLTDNVAIKCVDIVKEAQ-n.
B8Q8U6_SARS/881-1029                 VKTLQPVSDLLTNMGIDLDEWSVATFYLFDDAGEENFSSRMYCSFYPPDEEEEDDAECEEEE.I.DETCEHEYGTEDDYQGLPLEFGASAETVRVE.EEEEEDWLDDTTEQ.......S............EIEP..EPEPTP...EEPVNQFTGYLKLTDNVAIKCVDIVKEAQS..
F2YDB2_SARS/881-1029                 VKTLQPVSDLLTNMGIDLDEWSVATFYLFDDAGEENFSSRMYCSFYPPDEEEEDDAECEEEE.I.DETCEHEYGTEDDYQGLPLEFGASAETVRVE.EEEEEDWLDDTTEQ.......S............EIEP..EPEPTP...EEPVNQFTGYLKLTDNVAIKCVDIVKEAQS..
#=GC SS_cons                         HHTTXSXHHHHHHHTXXHHHHTTSXXEEEXSSSSSXXSSBXEEESSXSXXXXX-XXXXXXXX.X.XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX.XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTT---SEEESSSSEEEEES-HHHHHHHH.
#=GC seq_cons                        VKTLQPVSDLLTNMGIDLDEWSVATFYLFDDAGEEshSSRMYCSFYPPDEEEEDDsECEEEE.I.-ETCEHEYGTEDDYQGLPLEFGASuEslpVE.EEEEEDWLDDsTEQ.......S............ElEP..EPEPTP...EEPVNQFTGYLKLTDNVAIKCVDIVKEAQS..
//
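
The record above uses Stockholm-style markup: `#=GF` lines annotate the family, `#=GS` lines annotate individual sequences, `#=GR` and `#=GC` lines carry per-residue and per-column annotation, and unprefixed rows hold the aligned sequences, named `ID/start-end`. A minimal Python sketch of how such a record might be read is given below. It is not the Pfam or HMMER tooling; the function names (`parse_stockholm`, `thresholds`, `residue_count`, `span_length`) are hypothetical, and it assumes that `.` and `-` are the only gap characters and that the two numbers on a `GA`, `TC`, or `NC` line are separated by whitespace and terminated by an optional `;`.

```python
import re
from collections import defaultdict

# Aligned rows are named "ID/start-end", e.g. "R1AB_SARS/881-1029".
NAME_RE = re.compile(r"^(\S+)/(\d+)-(\d+)$")


def parse_stockholm(text):
    """Return (gf, gs, seqs) for one record.

    gf   maps a two-letter #=GF tag to the list of its values, in order.
    gs   maps a sequence name to a list of (tag, value) pairs from #=GS lines.
    seqs maps a sequence name to its aligned row (gaps kept).
    """
    gf = defaultdict(list)
    gs = defaultdict(list)
    seqs = {}
    for raw in text.splitlines():
        line = raw.rstrip()
        if not line or line == "//":
            continue
        if line.startswith("#=GF "):
            parts = line.split(None, 2)
            if len(parts) == 3:
                gf[parts[1]].append(parts[2])
        elif line.startswith("#=GS "):
            parts = line.split(None, 3)
            if len(parts) == 4:
                gs[parts[1]].append((parts[2], parts[3]))
        elif line.startswith("#"):
            continue  # #=GR, #=GC and any header line are skipped in this sketch
        else:
            parts = line.split()
            # Only accept rows whose first field looks like ID/start-end,
            # so stray page text is not mistaken for a sequence.
            if len(parts) == 2 and NAME_RE.match(parts[0]):
                # Interleaved blocks of the same sequence are concatenated.
                seqs[parts[0]] = seqs.get(parts[0], "") + parts[1]
    return dict(gf), dict(gs), seqs


def thresholds(gf, tag):
    """Parse a GA/TC/NC value such as '26.70 26.70;' into a tuple of floats."""
    return tuple(float(x) for x in gf[tag][0].rstrip(";").split())


def residue_count(aligned_row):
    """Count residues in an aligned row; '.' and '-' are treated as gaps."""
    return sum(1 for ch in aligned_row if ch.isalpha())


def span_length(name):
    """Length implied by the start-end coordinates in a row name."""
    match = NAME_RE.match(name)
    if match is None:
        raise ValueError(f"not an ID/start-end name: {name!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return end - start + 1
```

One possible consistency check with these helpers is to compare `residue_count(row)` with `span_length(name)` for every entry in `seqs`, since the coordinates in a row name should cover exactly the residues shown in that row.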