ID A0A0V1I645_9BILA Unreviewed; 3651 AA.
AC A0A0V1I645;
DT 16-MAR-2016, integrated into UniProtKB/TrEMBL.
DT 16-MAR-2016, sequence version 1.
DT 27-MAR-2024, entry version 44.
DE RecName: Full=Homeobox protein cut-like {ECO:0000256|RuleBase:RU361129};
GN Name=Polr2j {ECO:0000313|EMBL:KRZ17397.1};
GN ORFNames=T11_10573 {ECO:0000313|EMBL:KRZ17397.1};
OS Trichinella zimbabwensis.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia;
OC Trichinellida; Trichinellidae; Trichinella.
OX NCBI_TaxID=268475 {ECO:0000313|EMBL:KRZ17397.1, ECO:0000313|Proteomes:UP000055024};
RN [1] {ECO:0000313|EMBL:KRZ17397.1, ECO:0000313|Proteomes:UP000055024}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ISS1029 {ECO:0000313|EMBL:KRZ17397.1};
RA Korhonen P.K., Edoardo P., Giuseppe L.R., Gasser R.B.;
RT "Evolution of Trichinella species and genotypes.";
RL Submitted (JAN-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
CC ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the CUT homeobox family.
CC {ECO:0000256|ARBA:ARBA00008190, ECO:0000256|RuleBase:RU361129}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KRZ17397.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JYDP01000007; KRZ17397.1; -; Genomic_DNA.
DR STRING; 268475.A0A0V1I645; -.
DR Proteomes; UP000055024; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:1990904; C:ribonucleoprotein complex; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:0005525; F:GTP binding; IEA:UniProtKB-KW.
DR GO; GO:0003924; F:GTPase activity; IEA:InterPro.
DR GO; GO:0030154; P:cell differentiation; IEA:UniProt.
DR GO; GO:0006397; P:mRNA processing; IEA:UniProtKB-KW.
DR GO; GO:0008380; P:RNA splicing; IEA:UniProtKB-KW.
DR CDD; cd00637; 7tm_classA_rhodopsin-like; 1.
DR CDD; cd04098; eEF2_C_snRNP; 2.
DR CDD; cd04090; EF2_II_snRNP; 2.
DR CDD; cd01683; EF2_IV_snRNP; 2.
DR CDD; cd00086; homeodomain; 1.
DR CDD; cd16264; snRNP_III; 2.
DR CDD; cd04167; Snu114p; 2.
DR Gene3D; 3.30.230.10; -; 2.
DR Gene3D; 3.30.70.240; -; 2.
DR Gene3D; 3.30.70.870; Elongation Factor G (Translational Gtpase), domain 3; 2.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR Gene3D; 1.10.260.40; lambda repressor-like DNA-binding domains; 3.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 2.
DR Gene3D; 2.40.30.10; Translation factors; 2.
DR Gene3D; 3.90.1430.10; Yeast translation eEF2 (G' domain); 2.
DR InterPro; IPR003350; CUT_dom.
DR InterPro; IPR035647; EFG_III/V.
DR InterPro; IPR000640; EFG_V-like.
DR InterPro; IPR004161; EFTu-like_2.
DR InterPro; IPR031950; EFTUD2_N.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR010982; Lambda_DNA-bd_dom_sf.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR020568; Ribosomal_Su5_D2-typ_SF.
DR InterPro; IPR014721; Ribsml_uS5_D2-typ_fold_subgr.
DR InterPro; IPR044121; Snu114_GTP-bd.
DR InterPro; IPR000795; T_Tr_GTP-bd_dom.
DR InterPro; IPR009000; Transl_B-barrel_sf.
DR InterPro; IPR005517; Transl_elong_EFG/EF2_IV.
DR InterPro; IPR035655; U5-116kDa_C.
DR PANTHER; PTHR42908:SF6; 116 KDA U5 SMALL NUCLEAR RIBONUCLEOPROTEIN COMPONENT; 1.
DR PANTHER; PTHR42908; TRANSLATION ELONGATION FACTOR-RELATED; 1.
DR Pfam; PF02376; CUT; 3.
DR Pfam; PF00679; EFG_C; 2.
DR Pfam; PF03764; EFG_IV; 2.
DR Pfam; PF16004; EFTUD2; 2.
DR Pfam; PF00009; GTP_EFTU; 2.
DR Pfam; PF03144; GTP_EFTU_D2; 2.
DR Pfam; PF00046; Homeodomain; 1.
DR SMART; SM01109; CUT; 3.
DR SMART; SM00838; EFG_C; 2.
DR SMART; SM00889; EFG_IV; 2.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF54980; EF-G C-terminal domain-like; 4.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR SUPFAM; SSF47413; lambda repressor-like DNA-binding domains; 3.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 2.
DR SUPFAM; SSF54211; Ribosomal protein S5 domain 2-like; 2.
DR SUPFAM; SSF50447; Translation proteins; 2.
DR PROSITE; PS51042; CUT; 3.
DR PROSITE; PS51722; G_TR_2; 2.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; GTP-binding {ECO:0000256|ARBA:ARBA00023134};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Membrane {ECO:0000256|SAM:Phobius};
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000055024};
KW Ribonucleoprotein {ECO:0000313|EMBL:KRZ17397.1};
KW Transcription {ECO:0000256|RuleBase:RU361129};
KW Transcription regulation {ECO:0000256|RuleBase:RU361129};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 2474..2497
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2527..2550
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2570..2593
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2626..2648
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2654..2676
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 126..337
FT /note="Tr-type G"
FT /evidence="ECO:0000259|PROSITE:PS51722"
FT DOMAIN 1692..1779
FT /note="CUT"
FT /evidence="ECO:0000259|PROSITE:PS51042"
FT DOMAIN 1973..2060
FT /note="CUT"
FT /evidence="ECO:0000259|PROSITE:PS51042"
FT DOMAIN 2126..2213
FT /note="CUT"
FT /evidence="ECO:0000259|PROSITE:PS51042"
FT DOMAIN 2241..2301
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 2805..3016
FT /note="Tr-type G"
FT /evidence="ECO:0000259|PROSITE:PS51722"
FT DNA_BIND 2243..2302
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..39
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1588..1623
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1826..1859
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2058..2080
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2373..2409
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2699..2719
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 1341..1368
FT /evidence="ECO:0000256|SAM:Coils"
SQ SEQUENCE 3651 AA; 411099 MW; F96835E731212BE9 CRC64;
MDGDLYDEFG NYIGPELESD LSDSDSEYDE VHQQNDEADE TGAVVEMNGT ADAEAVVLHE
DKKYYPTALE IYGPDVESVV QEEDTQPLTE PIVKPVRPKK FQAIERDLPE TVYNKEYMAD
LMDCPNLIRN VALAGHLHHG KTCFVDCLVE QTHPTFVRRD DRDTRYTDTL NTEYERGVGI
KAMPITVVMQ DFRNKSFLLN VFDTPGHINF SDEISAAFRM CDGVVVFVDA HEGVMMSTER
TLKCAVQENL AITLCINKID RLILELKLPP TDAYLKLRHT IDEVNVLLKS FSTSHEYQTL
SPVNGNVFFA SGKYNICFTL LSFANLYKEI YPDIQPKDFA KRLWGDIYLN SKTREFQRKA
PSGDKPRTFV EFILEPIYKI FSQIVGEVDD TLPRVMSELN IRLTKEEQKL NIRPLLSLIY
RRFFGNFTSF VDVISQHIPS PAENAKNKIQ LIYTGPMSGQ LVDGMLNCNP DGPLMVYTTK
NYATPDAASF YVFGRVMSGT LHAQQDVRIL GENYSIEDEE DSQTLTVGRL WIFESRYNVE
VSRVPAGNWV LIEGIDEPVV KTSTITDVYL NEDVYIFRPL KFSTQSVIKI AIEPVNPSEL
PKMLDGLRKV NKSYPLLSTR VEESGEHVIF GTGELYLDCV LHDVRKVFSE IDIKVADPVV
AFCETVVETS SLKCFCETPN KRNKLTMVCE PMERGLAEDI ENQLITLSMN RKNLSSILQE
KYNWDLLAAR SIWAFGPDYV GPNILVDDTL PSEVDKSLLN SIRESVVQGF QWAAREGPLC
EEPIRNVKFK ILDAQISSEV LHRGGGQIIP TARRVAYSAF LMATPRLMEP YFHIEVIAPA
DCVSAVYTVL ARRRGHVTQD LPIPGSPLYT IKAFIPAIDS FGFETDLRTH TQGQAFSLSE
FHHWQIVPGD PLDRSIYIRP LEPQPATHLA REFMIKTRRR KGLSEDVSVS KFFDDPMLLE
LAKQDQEHLQ LDRLAVSVIG GCEVMLKKMA QIQEKLTGLG EQREKHNEAV LQGLRRILNT
LTVNHSIINV ASAQRSSRLQ IESKLKNLVK ANRQYSGNTQ EYVKNYPIST PDQHADCKED
SVRRRMVVVN RTLLQEELEK DKERFDVDFN DAEAQKFEME QLTLKKTLAT EKSDIDKLQA
QITKISDLQR TFVENICSQE EIIDKIGQSA VSSVDFMRIS NNAIRQAIGK KASDRFWFMF
ITINPVEKAN PHSISSQCKR SLSLRSYAYN TIDVEAICEF WKTFDLPALQ KELDETAAEL
ANRQDESECS RQKLIENSRE YKKNTPEDIR KRATHLLKSF QAEVDWLSKR SKAAEAAFLK
LYKQVIEWPN PAPALKMASI SKKLAKRMQD LESENRYLRS ELEALRSQMY HLAEKDAIIL
HLREMLKNLE CKLTAEFEAK LSHQEEAGRI MYENRERQLQ KFNLFLCQSL KDARERAEAM
RLTLSELSKA KLIAANHSSA YVDSDTIMTS PSQLNNGDNA VADFGSVVEN CISPMNNVKT
DNEENSLTNS EHGAKMFHFS EACENGRENN SLQLNAESSD SDEHVSQVQQ YVDMMASVVG
SELVNSYREW SCTVGLELEL DAAQKAMTTS DQVSSPGTSV PPLPAVPTAC PAPESSTLSA
SSLNAEPVTD LAGDTGVDEL KKRSAPGVQL NNGSEAGLSG AGVELEAVGM SSADWANVRY
LQSVLAWHVD QQSKLSGDEP LDTAEIARQC RRVLTEHNIG QRLIAKYVLN QSQGAVSELL
SKPKKWSSLT ERSKNSFRRL KAWLSDQRAV LALRNISPKR TPCSARDRYA NRSQMDSATE
ARIVHILQKA KQAREEVQLP LVVSDRGAAS STTVRPMSAS PVSSSNGEGS NVSRDRRCQS
ASTVQSVSGA SFRCRPGRYR HDDIPKSKIH EIYERELAKL RNHDGSLLIT SDQQTPTVPP
LQPSDLSVVV SNSTLVPAFT GMSSSTSSAG SNRCASASPP TRTFKAVLPP ITQEQFERFG
RINTEKLVQR VKEHLLQYSI SQRVFGEQVL GLSQGSVSDL LARPKAWHML TQKGREPFIR
MHAFLEDNEL LKKLITSRKA SQNSSNGGGG GGGSRSSSRG SQIEIQAPGF EGSAVFTERS
LNNGGQPTSK MVLVNETPTS PLSTTTICDS YAVPLDTVAI VEETRAVLAA HSIGQKLFGE
AVLHLSQGFV SDLLSKPKPW EALSAKRKEA LLRMQAWLKD ADRIKKINEY QARKNFKRLA
NCDPIQRDSA ISGQYVAYSQ LDVKRPRVTL TKAQKEQLIE AFNREPYPSV QMTSHLSVRL
GLHVSTVANW FQNRRMRQKA ALQQNTAMAD NAHQVDRLCN GITVCNVAGR EEPVAPAVPP
LFSAVFNSAD LLNQISLTNG LDLAANSLRS AGLETSNNHT SSRTSSTSTT GSSSYGEPVT
SASGGEQPTS NYLRRTILHR METNLVKPDV SWVTEEDRSE AINRIENRIK EKDQIDKNAS
NLFEKKSLFV LSTWILRVVF GLLACLTNCF YIILAIYEKS IFKSQKIVAA IRQTKLLKAV
NEASRNYMLI ACIILSLIDL ICMWINAYLL RMKEISAACY RSAVVGRFYY LFHFSCCILL
AYCSLCFYII GLIQLKSAFK NRLKAANHLI QKQRLNRESL VTKKTFILIG FTVLLQGIPN
TLRIYFFYNK SHSWISDIAL TINAFALSLY AIYYIVTSDV AKSLKKKFGN YIGPELESDL
SDSDSEYDEV HQQNDEADET GAVVEMNGTA DAEAVVLHED KKYYPTALEI YGPDVESVVQ
EEDTQPLTEP IVKPVRPKKF QAIERDLPET VYNKEYMADL MDCPNLIRNV ALAGHLHHGK
TCFVDCLVEQ THPTFVRRDD RDTRYTDTLN TEYERGVGIK AMPITVVMQD FRNKSFLLNV
FDTPGHINFS DEISAAFRMC DGVVVFVDAH EGVMMSTERT LKCAVQENLA ITLCINKIDR
LILELKLPPT DAYLKLRHTI DEVNVLLKSF STSHEYQTLS PVNGNVFFAS GKYNICFTLL
SFANLYKEIY PDIQPKDFAK RLWGDIYLNS KTREFQRKAP SGDKPRTFVE FILEPIYKIF
SQIVGEVDDT LPRVMSELNI RLTKEEQKLN IRPLLSLIYR RFFGNFTSFV DVISQHIPSP
AENAKNKIQL IYTGPMSGQL VDGMLNCNPD GPLMVYTTKN YATPDAASFY VFGRVMSGTL
HAQQDVRILG ENYSIEDEED SQTLTVGRLW IFESRYNVEV SRVPAGNWVL IEGIDEPVVK
TSTITDVYLN EDVYIFRPLK FSTQSVIKIA IEPVNPSELP KMLDGLRKVN KSYPLLSTRV
EESGEHVIFG TGELYLDCVL HDVRKVFSEI DIKVADPVVA FCETVVETSS LKCFCETPNK
RNKLTMVCEP MERGLAEDIE NQLITLSMNR KNLSSILQEK YNWDLLAARS IWAFGPDYVG
PNILVDDTLP SEVDKSLLNS IRESVVQGFQ WAAREGPLCE EPIRNVKFKI LDAQISSEVL
HRGGGQIIPT ARRVAYSAFL MATPRLMEPY FHIEVIAPAD CVSAVYTVLA RRRGHVTQDL
PIPGSPLYTI KAFIPAIDSF GFETDLRTHT QGQAFSLSEF HHWQIVPGDP LDRSIYIRPL
EPQPATHLAR EFMIKTRRRK GLSEDVSVSK FFDDPMLLEL AKQDVNISNI R
//