Database: UniProt
Entry: A0A2A4K2T6_HELVI
ID   A0A2A4K2T6_HELVI        Unreviewed;      1335 AA.
AC   A0A2A4K2T6;
DT   20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT   20-DEC-2017, sequence version 1.
DT   13-SEP-2023, entry version 20.
DE   RecName: Full=Protein shuttle craft {ECO:0008006|Google:ProtNLM};
GN   ORFNames=B5V51_4888 {ECO:0000313|EMBL:PCG78214.1};
OS   Heliothis virescens (Tobacco budworm moth).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Noctuoidea;
OC   Noctuidae; Heliothinae; Heliothis.
OX   NCBI_TaxID=7102 {ECO:0000313|EMBL:PCG78214.1, ECO:0000313|Proteomes:UP000218220};
RN   [1] {ECO:0000313|EMBL:PCG78214.1, ECO:0000313|Proteomes:UP000218220}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=HvINT- {ECO:0000313|EMBL:PCG78214.1};
RC   TISSUE=Whole body {ECO:0000313|EMBL:PCG78214.1};
RA   Fritz M.L., Deyonke A.M., Papanicolaou A., Micinski S., Westbrook J.,
RA   Gould F.;
RT   "Contemporary evolution of a Lepidopteran species, Heliothis virescens, in
RT   response to modern agricultural practices.";
RL   Submitted (SEP-2017) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   -!- SIMILARITY: Belongs to the NFX1 family.
CC       {ECO:0000256|ARBA:ARBA00007269}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:PCG78214.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; NWSH01000227; PCG78214.1; -; Genomic_DNA.
DR   STRING; 7102.A0A2A4K2T6; -.
DR   Proteomes; UP000218220; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR   CDD; cd06008; NF-X1-zinc-finger; 7.
DR   CDD; cd02643; R3H_NF-X1; 1.
DR   Gene3D; 3.30.1370.50; R3H-like domain; 1.
DR   InterPro; IPR034078; NFX1_fam.
DR   InterPro; IPR001374; R3H_dom.
DR   InterPro; IPR036867; R3H_dom_sf.
DR   InterPro; IPR034076; R3H_NF-X1.
DR   InterPro; IPR000967; Znf_NFX1.
DR   InterPro; IPR019787; Znf_PHD-finger.
DR   InterPro; IPR001841; Znf_RING.
DR   PANTHER; PTHR12360; NUCLEAR TRANSCRIPTION FACTOR, X-BOX BINDING 1 NFX1; 1.
DR   PANTHER; PTHR12360:SF12; TRANSCRIPTIONAL REPRESSOR NF-X1; 1.
DR   Pfam; PF01424; R3H; 1.
DR   Pfam; PF01422; zf-NF-X1; 8.
DR   SMART; SM00393; R3H; 1.
DR   SMART; SM00438; ZnF_NFX; 9.
DR   SUPFAM; SSF82708; R3H domain; 1.
DR   SUPFAM; SSF57850; RING/U-box; 1.
DR   PROSITE; PS51061; R3H; 1.
DR   PROSITE; PS50016; ZF_PHD_2; 1.
DR   PROSITE; PS50089; ZF_RING_2; 1.
PE   3: Inferred from homology;
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000218220};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Transcription {ECO:0000256|ARBA:ARBA00023163};
KW   Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW   Zinc {ECO:0000256|ARBA:ARBA00022833};
KW   Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW   ProRule:PRU00175}.
FT   DOMAIN          511..563
FT                   /note="PHD-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50016"
FT   DOMAIN          514..561
FT                   /note="RING-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50089"
FT   DOMAIN          1130..1198
FT                   /note="R3H"
FT                   /evidence="ECO:0000259|PROSITE:PS51061"
FT   REGION          143..501
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1222..1246
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1266..1335
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        143..225
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        226..289
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        290..315
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        377..476
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        477..493
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1281..1311
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1335 AA;  149112 MW;  795EA013BEDE4A7C CRC64;
     MSQWNNTYAY NNQYQAWNGD PNVQYVNQAY YPNRPDQANQ YVSFNEFISQ MQSNGAPAAS
     NYNNVQYESY PTRQYNYQNM PATPHNPQLD SYGYAATTTN IAVTEAYPMN VQNQYNPVAS
     DPSAYTNAMI LKSNLTPTAT EFVPKSSMMT PSTSTQNIPE SSTVHDDNES RSNYSSVNET
     RNQYSSGNDS QSNYSSINEP RNTSSSSSDT NWRERSQNSQ KNSEPIHKTE SYHRPQEHSR
     NQESNGRYRD GHYRNNESNG RNYESKNRNK ESNSHNQEQE DRQYESSSRH QDSSSRNYES
     NNRNYDSSNK RGQGKSNYKS KNKDDARTFY NSAISKDSQD VRNGRGEGSG RGKNWNGTQR
     LRAMERNSME DEQYANTYLQ GRDDRTEREN RDRDRAQERE NRERERALER ENRDRERAQE
     RENRERDRAL ERENRDRERA MERELRDRDR DNRERERENR ERERENRERD RMSKSDNGPS
     PARSKSKYNF DQANKEMTQR ERLSEQLDKG TLECLVCCDR VKQFDQVWSC SNCYHVLHLR
     CIRKWAMSSM VEGKWRCPAC QNTNEAIPTE YRCMCGAMRA PEYQRGAAGA HTCGRACRRA
     RACPHPCTLL CHPGPCPPCQ ATVVKQCGCG AESRSILCSS KLPQVCGRTC GRTLECGVHT
     CAKDCHEGPC DTCTETVEQV CYCPAAKSRS VPCTAETGGV SSWECGDACG RVLACGAHVC
     RAKCHAPPCP ACQLLPKYVQ TCPCGNTQLA KDSRKSCTDP IPLCGNICAK PLNCGPAGDK
     HFCKLNCHEG PCPECPDKTV LQCLCGHSSR EVPCADLPAM YNNVMCQKKC NKKLSCGRHR
     CRTLCCAATS HRCGVVCGRT LSCQTHRCEE FCHTGHCAPC PRVSFEELTC ECGAEVILPP
     VRCGTRPPAC SAPCIRERPC KHPPHHSCHS GDCPPCVVLT TKRCHGDHEE RKTIPCSQEE
     FSCGLPCGKP LPCGKHTCIK TCHKGPCDTG KCTQPCMEKR PSCGHPCAAP CHVAGGTACP
     STAPCRRPVR ATCPCSRRHA ERACADNARD LAKMMSALAA TKMSEGGSVD LSDVQRPGNM
     LKTLECDDEC RVEARTRQLA LALQIRNPDV SAKLAPRYSE HVRATAAREP AFANQIHDKL
     TELVQLAKKS KQKTRAHSFP SMNWQKRQFI HELCEHFGCE SVAYDAEPNR NVVATADREK
     SWLPAMSVLE VLAREAGKRR VPGPVLRAPP AASPAAPAPS HSTQHSVTNA ELLAVLAREA
     GKRRVPGPVL RAPPAASPAA PAPSHSTQHS VTKSTGGWAT LTSTNAWAAR SQPKPQPAAA
     PPAEKIDYFD NPPDN
//
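
The FT (feature table) lines above follow a regular layout: a feature key and a start..end location on one line, then /qualifier="value" continuation lines. As a minimal sketch of how those lines could be read programmatically, the Python below parses an excerpt copied from this entry. The names parse_features and entry_length are hypothetical helpers written for this illustration, not part of any UniProt tool, and the regular expressions assume simple start..end locations and qualifiers that fit on a single line, as they do in this record.

```python
import re

# Excerpt copied from the entry above (ID, AC and a few FT lines only);
# it is not the full record.
EXCERPT = """\
ID   A0A2A4K2T6_HELVI        Unreviewed;      1335 AA.
AC   A0A2A4K2T6;
FT   DOMAIN          511..563
FT                   /note="PHD-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50016"
FT   DOMAIN          514..561
FT                   /note="RING-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50089"
FT   DOMAIN          1130..1198
FT                   /note="R3H"
FT                   /evidence="ECO:0000259|PROSITE:PS51061"
FT   REGION          143..501
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
//
"""

# Assumption: a feature starts with "FT", three spaces, a key, then "start..end".
FEATURE_START = re.compile(r"^FT   (\S+)\s+(\d+)\.\.(\d+)\s*$")
# Assumption: each qualifier sits on one line as /name="value".
QUALIFIER = re.compile(r'^FT\s+/(\w+)="(.*)"\s*$')


def parse_features(text):
    """Return one dict per FT feature that has a simple start..end location."""
    features = []
    for line in text.splitlines():
        start = FEATURE_START.match(line)
        if start:
            kind, first, last = start.groups()
            features.append({"type": kind, "start": int(first), "end": int(last)})
            continue
        qual = QUALIFIER.match(line)
        if qual and features:
            # Attach the qualifier to the most recently opened feature.
            features[-1][qual.group(1)] = qual.group(2)
    return features


def entry_length(text):
    """Read the sequence length (in residues) from the ID line, or None."""
    m = re.search(r"^ID\s+\S+\s+\w+;\s+(\d+) AA\.", text, re.MULTILINE)
    return int(m.group(1)) if m else None


features = parse_features(EXCERPT)
length = entry_length(EXCERPT)
for f in features:
    span = f["end"] - f["start"] + 1  # locations are 1-based and inclusive
    print(f'{f["type"]:<8} {f.get("note", ""):<12} {f["start"]}..{f["end"]} ({span} aa)')
```

Running the same functions over the whole record rather than the excerpt would also pick up the COMPBIAS features. A location that is not a plain start..end pair, or a qualifier wrapped over several lines, is silently skipped by this sketch and would need extra handling.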