ID A0A498M6T2_LABRO Unreviewed; 522 AA.
AC A0A498M6T2;
DT 05-JUN-2019, integrated into UniProtKB/TrEMBL.
DT 05-JUN-2019, sequence version 1.
DT 27-MAR-2024, entry version 13.
DE SubName: Full=DNA repair protein {ECO:0000313|EMBL:RXN16608.1};
GN ORFNames=ROHU_027449 {ECO:0000313|EMBL:RXN16608.1};
OS Labeo rohita (Indian major carp) (Cyprinus rohita).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;
OC Cyprinidae; Labeoninae; Labeonini; Labeo.
OX NCBI_TaxID=84645 {ECO:0000313|EMBL:RXN16608.1, ECO:0000313|Proteomes:UP000290572};
RN [1] {ECO:0000313|EMBL:RXN16608.1, ECO:0000313|Proteomes:UP000290572}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DASCIFA01 {ECO:0000313|EMBL:RXN16608.1};
RC TISSUE=Testis {ECO:0000313|EMBL:RXN16608.1};
RA Das P., Kushwaha B., Joshi C.G., Kumar D., Nagpure N.S., Sahoo L.,
RA Das S.P., Bit A., Patnaik S., Meher P.K., Jayasankar P., Koringa P.G.,
RA Patel N.V., Hinsu A.T., Kumar R., Pandey M., Agarwal S., Srivastava S.,
RA Singh M., Iquebal M.A., Jaiswal S., Angadi U.B., Kumar N., Raza M.,
RA Shah T.M., Rai A., Jena J.K.;
RT "Draft genome sequence of Rohu Carp (Labeo rohita).";
RL Submitted (MAR-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the XPG/RAD2 endonuclease family. XPG subfamily.
CC {ECO:0000256|ARBA:ARBA00005283}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RXN16608.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QBIY01012778; RXN16608.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A498M6T2; -.
DR STRING; 84645.A0A498M6T2; -.
DR Proteomes; UP000290572; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0004519; F:endonuclease activity; IEA:InterPro.
DR GO; GO:0003697; F:single-stranded DNA binding; IEA:InterPro.
DR GO; GO:0006289; P:nucleotide-excision repair; IEA:InterPro.
DR CDD; cd09868; PIN_XPG_RAD2; 1.
DR Gene3D; 3.40.50.1010; 5'-nuclease; 1.
DR InterPro; IPR029060; PIN-like_dom_sf.
DR InterPro; IPR006084; XPG/Rad2.
DR InterPro; IPR001044; XPG/Rad2_eukaryotes.
DR InterPro; IPR006085; XPG_DNA_repair_N.
DR PANTHER; PTHR16171:SF7; DNA EXCISION REPAIR PROTEIN ERCC-5; 1.
DR PANTHER; PTHR16171; DNA REPAIR PROTEIN COMPLEMENTING XP-G CELLS-RELATED; 1.
DR Pfam; PF00752; XPG_N; 1.
DR PRINTS; PR00853; XPGRADSUPER.
DR PRINTS; PR00066; XRODRMPGMNTG.
DR SMART; SM00485; XPGN; 1.
DR SUPFAM; SSF88723; PIN domain-like; 1.
PE 3: Inferred from homology;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000290572}.
FT DOMAIN 1..98
FT /note="XPG N-terminal"
FT /evidence="ECO:0000259|SMART:SM00485"
FT REGION 132..174
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 247..395
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 409..522
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 136..156
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 271..297
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 316..331
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 355..369
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 423..454
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 455..522
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 522 AA; 58031 MW; 04999629DCE75877 CRC64;
MGVHGLWKLL ESTGKPINPE TLEGKILAVD ISIWLNQAVK GVRDRDGNAV QNAHLLTLFH
RLCKLLFFRI RPVFVYDGDA PLLKKQTLAI RRQRREELSR ESKQTNEKLL QTFLKRQAIK
AALGDKSQEA IPSLSSVRRD EQDDMYVLPA LPPDEDRDES SSEEERDGDE SLQTYGRFQR
SGEFSQYQLA GLLQRNKLNL RLEGVEQEMN QRSSGGADQP YDQSKDHDME IRRLVSEDSS
HYILIKGSQK KASAPDSGPA PALWSSCPWE RKGRPKGKPE PLWRPVTEDD ENKPSSSSSC
PEEEPPALEG APPSPRSLQA IQSAMMDSSS EEEGSENGRV SPRTLQAIQS AMTDPADAHK
RRTYVITSSS EDEEEAVVPD RSDMTEVTTG GGVSPRTLMA IQKALGDEAV EQTDLRSGSD
GEVPGISVSS QESSESKHQS RTPLILSSAV SGSQEETREP RDAAVNHLVE QRHPSASSDR
KPVHADRTDQ TEENKVRSED EEESSAEGTN THRDTSRDYI LK
//