ID A0A2A4K2T6_HELVI Unreviewed; 1335 AA.
AC A0A2A4K2T6;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 13-SEP-2023, entry version 20.
DE RecName: Full=Protein shuttle craft {ECO:0008006|Google:ProtNLM};
GN ORFNames=B5V51_4888 {ECO:0000313|EMBL:PCG78214.1};
OS Heliothis virescens (Tobacco budworm moth).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Noctuoidea;
OC Noctuidae; Heliothinae; Heliothis.
OX NCBI_TaxID=7102 {ECO:0000313|EMBL:PCG78214.1, ECO:0000313|Proteomes:UP000218220};
RN [1] {ECO:0000313|EMBL:PCG78214.1, ECO:0000313|Proteomes:UP000218220}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=HvINT- {ECO:0000313|EMBL:PCG78214.1};
RC TISSUE=Whole body {ECO:0000313|EMBL:PCG78214.1};
RA Fritz M.L., Deyonke A.M., Papanicolaou A., Micinski S., Westbrook J.,
RA Gould F.;
RT "Contemporary evolution of a Lepidopteran species, Heliothis virescens, in
RT response to modern agricultural practices.";
RL Submitted (SEP-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the NFX1 family.
CC {ECO:0000256|ARBA:ARBA00007269}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PCG78214.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NWSH01000227; PCG78214.1; -; Genomic_DNA.
DR STRING; 7102.A0A2A4K2T6; -.
DR Proteomes; UP000218220; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0003676; F:nucleic acid binding; IEA:UniProtKB-UniRule.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd06008; NF-X1-zinc-finger; 7.
DR CDD; cd02643; R3H_NF-X1; 1.
DR Gene3D; 3.30.1370.50; R3H-like domain; 1.
DR InterPro; IPR034078; NFX1_fam.
DR InterPro; IPR001374; R3H_dom.
DR InterPro; IPR036867; R3H_dom_sf.
DR InterPro; IPR034076; R3H_NF-X1.
DR InterPro; IPR000967; Znf_NFX1.
DR InterPro; IPR019787; Znf_PHD-finger.
DR InterPro; IPR001841; Znf_RING.
DR PANTHER; PTHR12360; NUCLEAR TRANSCRIPTION FACTOR, X-BOX BINDING 1 NFX1; 1.
DR PANTHER; PTHR12360:SF12; TRANSCRIPTIONAL REPRESSOR NF-X1; 1.
DR Pfam; PF01424; R3H; 1.
DR Pfam; PF01422; zf-NF-X1; 8.
DR SMART; SM00393; R3H; 1.
DR SMART; SM00438; ZnF_NFX; 9.
DR SUPFAM; SSF82708; R3H domain; 1.
DR SUPFAM; SSF57850; RING/U-box; 1.
DR PROSITE; PS51061; R3H; 1.
DR PROSITE; PS50016; ZF_PHD_2; 1.
DR PROSITE; PS50089; ZF_RING_2; 1.
PE 3: Inferred from homology;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000218220};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00175}.
FT DOMAIN 511..563
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 514..561
FT /note="RING-type"
FT /evidence="ECO:0000259|PROSITE:PS50089"
FT DOMAIN 1130..1198
FT /note="R3H"
FT /evidence="ECO:0000259|PROSITE:PS51061"
FT REGION 143..501
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1222..1246
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1266..1335
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 143..225
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 226..289
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 290..315
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 377..476
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 477..493
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1281..1311
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1335 AA; 149112 MW; 795EA013BEDE4A7C CRC64;
MSQWNNTYAY NNQYQAWNGD PNVQYVNQAY YPNRPDQANQ YVSFNEFISQ MQSNGAPAAS
NYNNVQYESY PTRQYNYQNM PATPHNPQLD SYGYAATTTN IAVTEAYPMN VQNQYNPVAS
DPSAYTNAMI LKSNLTPTAT EFVPKSSMMT PSTSTQNIPE SSTVHDDNES RSNYSSVNET
RNQYSSGNDS QSNYSSINEP RNTSSSSSDT NWRERSQNSQ KNSEPIHKTE SYHRPQEHSR
NQESNGRYRD GHYRNNESNG RNYESKNRNK ESNSHNQEQE DRQYESSSRH QDSSSRNYES
NNRNYDSSNK RGQGKSNYKS KNKDDARTFY NSAISKDSQD VRNGRGEGSG RGKNWNGTQR
LRAMERNSME DEQYANTYLQ GRDDRTEREN RDRDRAQERE NRERERALER ENRDRERAQE
RENRERDRAL ERENRDRERA MERELRDRDR DNRERERENR ERERENRERD RMSKSDNGPS
PARSKSKYNF DQANKEMTQR ERLSEQLDKG TLECLVCCDR VKQFDQVWSC SNCYHVLHLR
CIRKWAMSSM VEGKWRCPAC QNTNEAIPTE YRCMCGAMRA PEYQRGAAGA HTCGRACRRA
RACPHPCTLL CHPGPCPPCQ ATVVKQCGCG AESRSILCSS KLPQVCGRTC GRTLECGVHT
CAKDCHEGPC DTCTETVEQV CYCPAAKSRS VPCTAETGGV SSWECGDACG RVLACGAHVC
RAKCHAPPCP ACQLLPKYVQ TCPCGNTQLA KDSRKSCTDP IPLCGNICAK PLNCGPAGDK
HFCKLNCHEG PCPECPDKTV LQCLCGHSSR EVPCADLPAM YNNVMCQKKC NKKLSCGRHR
CRTLCCAATS HRCGVVCGRT LSCQTHRCEE FCHTGHCAPC PRVSFEELTC ECGAEVILPP
VRCGTRPPAC SAPCIRERPC KHPPHHSCHS GDCPPCVVLT TKRCHGDHEE RKTIPCSQEE
FSCGLPCGKP LPCGKHTCIK TCHKGPCDTG KCTQPCMEKR PSCGHPCAAP CHVAGGTACP
STAPCRRPVR ATCPCSRRHA ERACADNARD LAKMMSALAA TKMSEGGSVD LSDVQRPGNM
LKTLECDDEC RVEARTRQLA LALQIRNPDV SAKLAPRYSE HVRATAAREP AFANQIHDKL
TELVQLAKKS KQKTRAHSFP SMNWQKRQFI HELCEHFGCE SVAYDAEPNR NVVATADREK
SWLPAMSVLE VLAREAGKRR VPGPVLRAPP AASPAAPAPS HSTQHSVTNA ELLAVLAREA
GKRRVPGPVL RAPPAASPAA PAPSHSTQHS VTKSTGGWAT LTSTNAWAAR SQPKPQPAAA
PPAEKIDYFD NPPDN
//