ID A0A077ZI82_TRITR Unreviewed; 699 AA.
AC A0A077ZI82;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 29-OCT-2014, sequence version 1.
DT 27-MAR-2024, entry version 33.
DE RecName: Full=Gypsy retrotransposon integrase-like protein 1 {ECO:0000256|ARBA:ARBA00039658};
GN ORFNames=TTRE_0000784901 {ECO:0000313|EMBL:CDW59514.1};
OS Trichuris trichiura (Whipworm) (Trichocephalus trichiurus).
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia;
OC Trichinellida; Trichuridae; Trichuris.
OX NCBI_TaxID=36087 {ECO:0000313|EMBL:CDW59514.1};
RN [1] {ECO:0000313|EMBL:CDW59514.1}
RP NUCLEOTIDE SEQUENCE.
RA Aslett M.;
RL Submitted (JAN-2014) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:CDW59514.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Foth B.J., Tsai I.J., Reid A.J., Bancroft A.J., Nichol S., Tracey A.,
RA Holroyd N., Cotton J.A., Stanley E.J., Zarowiecki M., Liu J.Z.,
RA Huckvale T., Cooper P.J., Grencis R.K., Berriman M.;
RT "The whipworm genome and dual-species transcriptomics of an intimate host-
RT pathogen interaction.";
RL Submitted (MAR-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the XPA family. {ECO:0000256|ARBA:ARBA00005548}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; HG806616; CDW59514.1; -; Genomic_DNA.
DR STRING; 36087.A0A077ZI82; -.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003684; F:damaged DNA binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR GO; GO:0006289; P:nucleotide-excision repair; IEA:InterPro.
DR CDD; cd21076; DBD_XPA; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR Gene3D; 3.90.530.10; XPA C-terminal domain; 1.
DR InterPro; IPR009061; DNA-bd_dom_put_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000465; XPA.
DR InterPro; IPR022656; XPA_C.
DR InterPro; IPR037129; XPA_sf.
DR InterPro; IPR022652; Znf_XPA_CS.
DR NCBIfam; TIGR00598; rad14; 1.
DR PANTHER; PTHR47266; ENDONUCLEASE-RELATED; 1.
DR PANTHER; PTHR47266:SF26; RIBONUCLEASE H; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF05181; XPA_C; 1.
DR Pfam; PF01286; XPA_N; 1.
DR SUPFAM; SSF46955; Putative DNA-binding domain; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 3: Inferred from homology;
KW DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW DNA repair {ECO:0000256|ARBA:ARBA00023204};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022771};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 98..259
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 392..460
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 513..533
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 392..409
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 437..457
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 699 AA; 80988 MW; B71CE4851895E57E CRC64;
MLIANWDRLR IRNGILWREW YETDGSSRLQ FVVPKQMVNK LLRLCHSAPT AGHLGQEKML
WRVRERFYWP GYRSDVKRYI RTCWECNTRG SXXHKGRALL QPQVATYRWQ RLVVDISGPL
PTTASGNRYI LIVMDAFSKF VEAIPMPNQE ATTVAGVLVQ DVFCRYGVPE VLHTDQGSQL
FQNVCQELGI KKTRTTPYHP SGNGQAERMN RTIWDMLAKS IIMEERNWHM VLPKVMMAYR
ATPHSSTGQS PYRMMFGRQC RLPEDIIDRD STPTRAPKRY VEQLIRALDE VNGKLQSRIA
KEAARQKRYY DRGANPQQFK VGDLVFLFLP RVLQGRNKKF RKPWVGPYVI ISQLSPVTYR
IQRCSYRRDV QVVHADRLKS CPDDIRFQEK KLRSPLTDQR TRRARRDHGS LQGIGPSHQH
NAYQPPRMIL VDYQNHSGSL DERQQLSPST SQLPAPERPK RSSRILLGAF NRNPTFHNTL
FLLVEYNKGI KSIEPAASLP SYAKAGGFLV EEDGDNRPCS SRQRSRPQPD FEPLPETDSD
LICLECSKKL STAFLYSMFG CCVCDSCRAG KEKYKLLTRT EAKSHYLLKD CDLDARKPPL
RYLNRKNPRS PRFGDMKLYL RAQVEERAIA VWGSNEALEE ARVKRTLNNE ANKQSRYNRK
LIVQLKLGPN SFVVIWHKFK IVFSIFLPIF ASEPKKEMP
//