ID A0A0V0W6B3_9BILA Unreviewed; 1102 AA.
AC A0A0V0W6B3;
DT 16-MAR-2016, integrated into UniProtKB/TrEMBL.
DT 16-MAR-2016, sequence version 1.
DT 27-MAR-2024, entry version 24.
DE SubName: Full=CGG triplet repeat-binding protein 1 {ECO:0000313|EMBL:KRX71200.1};
GN Name=Cggbp1 {ECO:0000313|EMBL:KRX71200.1};
GN ORFNames=T06_5215 {ECO:0000313|EMBL:KRX71200.1};
OS Trichinella sp. T6.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia;
OC Trichinellida; Trichinellidae; Trichinella.
OX NCBI_TaxID=92179 {ECO:0000313|EMBL:KRX71200.1, ECO:0000313|Proteomes:UP000054673};
RN [1] {ECO:0000313|EMBL:KRX71200.1, ECO:0000313|Proteomes:UP000054673}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ISS34 {ECO:0000313|EMBL:KRX71200.1};
RA Korhonen P.K., Edoardo P., Giuseppe L.R., Gasser R.B.;
RT "Evolution of Trichinella species and genotypes.";
RL Submitted (JAN-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KRX71200.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JYDK01000251; KRX71200.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0V0W6B3; -.
DR STRING; 92179.A0A0V0W6B3; -.
DR Proteomes; UP000054673; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:InterPro.
DR GO; GO:0003690; F:double-stranded DNA binding; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IEA:InterPro.
DR Gene3D; 2.20.25.240; -; 1.
DR InterPro; IPR033375; Cggbp1.
DR InterPro; IPR007021; DUF659.
DR InterPro; IPR008906; HATC_C_dom.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR007588; Znf_FLYWCH.
DR PANTHER; PTHR32344; U1-TYPE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR32344:SF1; U1-TYPE DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF05699; Dimer_Tnp_hAT; 1.
DR Pfam; PF04937; DUF659; 2.
DR Pfam; PF04500; FLYWCH; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 2.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000054673};
KW Zinc {ECO:0000256|ARBA:ARBA00022833}.
FT DOMAIN 165..322
FT /note="DUF659"
FT /evidence="ECO:0000259|Pfam:PF04937"
FT DOMAIN 426..481
FT /note="FLYWCH-type"
FT /evidence="ECO:0000259|Pfam:PF04500"
FT DOMAIN 731..888
FT /note="DUF659"
FT /evidence="ECO:0000259|Pfam:PF04937"
FT DOMAIN 1061..1096
FT /note="HAT C-terminal dimerisation"
FT /evidence="ECO:0000259|Pfam:PF05699"
SQ SEQUENCE 1102 AA; 125529 MW; 9292842019155277 CRC64;
MNCAVKRVHF IFPQSVDFSF THFLLRLLCR SYSRSATSTS FKMPKVKSKP SKKLIDLVNE
YGSDILSTDN TVLFCKACGK TINHEKKYFV YQHLQTAKHK SATEKMKTES KQACLLNTFV
ADSSSKSQFP ADLCMAFIDA GIPLWKLENK SLRSFLEKYT KQHIPSESSL RKHYIDSNFN
NVMDRVRREV AYNKIWISID ETIDPVGRFV ANVVIGTLEA DQPSKEYLLT SEVLEKSNSS
TIAQLFTSSL AVLWPEGIRH ENVLLFVTDA APYMKKAAGA LKVLFPNMLH LTCLAHGLHR
IAEHIRCLFP DVDRLISNMK KVFLKAPSRV QLFKEMAPEV PLPPQPVLTR WGTWLSAVFY
YAANFKKIQE IISCFEEEES TAVKIVHEIM QKESLLCDLV FIASNFTNFV PAITYLEKRS
ETLLRFVRNR GGSTSLVSQG RTYKLRYTNK QKKHWVCSKG REGCKGVIWT NLDVTYVITQ
KDHIESCPVN EHLAYKMEKK AVLKKRSAEE TKSILAIYDE VSAASAVPST FGHFPPSRRI
IFKRVKSTMY SHRSKRYLKL PEHRRDQQIP DAFRTTMAGE DFLPWQSASR HILVLATGSN
IRLMATRRTW ALDGTFKIVP QWYQQLFTIH AFLAGKLVLA VYCLCTDKDI PTYGFILSRS
GITGNPQPTE KMKTESKQAC LLNTFVADSS SKSQFPADLC MAFIDAGIPL WKLENKSLRS
FLEKYTKQHI PSESSLRKHY IDSNFNNVMD RVRREVAYNK IWISIDETID PVGRFVANVV
IGTLEADQPS KEYLLTSEVL EKSNSSTIAQ LFTSSLAVLW PEGIRHENVL LFVTDAAPYM
KKAAGALKVL FPNMLHLTCL AHGLHRIAEH IRCLFPDVDR LISNMKKVFL KAPSRVQLFK
EMAPEVPLPP QPVLTRWGTW LSAVFYYAAN FKKIQEIISC FEEEESTAVK IVHEIMQKES
LLCDLVFIAS NFTNFVPAIT YLEKRSETLV DRLQAFDEVI DNIHKIPGIV GEDIKSKCDK
VISANKDLKE IKSIAEVLKG NSNAQVIGMN IESAVCFKYA PVTSAEVERS FSQLKYILSD
RRYSLTPDNL KKMLVIMCNQ TR
//