ID C6JS92_SORBI Unreviewed; 1822 AA.
AC C6JS92;
DT 01-SEP-2009, integrated into UniProtKB/TrEMBL.
DT 01-SEP-2009, sequence version 1.
DT 27-MAR-2024, entry version 64.
DE RecName: Full=Integrase catalytic domain-containing protein {ECO:0000259|PROSITE:PS50994};
DE Flags: Fragment;
GN Name=Sb0139s002040 {ECO:0000313|EMBL:EES20269.1};
GN ORFNames=SORBIDRAFT_0139s002040 {ECO:0000313|EMBL:EES20269.1};
OS Sorghum bicolor (Sorghum) (Sorghum vulgare).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; PACMAD clade;
OC Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum.
OX NCBI_TaxID=4558 {ECO:0000313|EMBL:EES20269.1};
RN [1] {ECO:0000313|EMBL:EES20269.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=19189423; DOI=10.1038/nature07723;
RA Paterson A.H., Bowers J.E., Bruggmann R., Dubchak I., Grimwood J.,
RA Gundlach H., Haberer G., Hellsten U., Mitros T., Poliakov A., Schmutz J.,
RA Spannagl M., Tang H., Wang X., Wicker T., Bharti A.K., Chapman J.,
RA Feltus F.A., Gowik U., Grigoriev I.V., Lyons E., Maher C.A., Martis M.,
RA Narechania A., Otillar R.P., Penning B.W., Salamov A.A., Wang Y., Zhang L.,
RA Carpita N.C., Freeling M., Gingle A.R., Hash C.T., Keller B., Klein P.,
RA Kresovich S., McCann M.C., Ming R., Peterson D.G., Mehboob-ur-Rahman,
RA Ware D., Westhoff P., Mayer K.F., Messing J., Rokhsar D.S.;
RT "The Sorghum bicolor genome and the diversification of grasses.";
RL Nature 457:551-556(2009).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL002733; EES20269.1; -; Genomic_DNA.
DR RefSeq; XP_002489074.1; XM_002489029.1.
DR HOGENOM; CLU_002362_0_0_1; -.
DR ExpressionAtlas; C6JS92; baseline and differential.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IEA:InterPro.
DR CDD; cd09272; RNase_HI_RT_Ty1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR044977; RLT1-3.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR013103; RVT_2.
DR InterPro; IPR028942; WHIM1_dom.
DR InterPro; IPR028941; WHIM2_dom.
DR PANTHER; PTHR36968; HOMEOBOX-DDT DOMAIN PROTEIN RLT2; 1.
DR PANTHER; PTHR36968:SF5; HOMEOBOX-DDT DOMAIN PROTEIN RLT2; 1.
DR Pfam; PF07727; RVT_2; 1.
DR Pfam; PF15612; WHIM1; 1.
DR Pfam; PF15613; WSD; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242}.
FT DOMAIN 941..1038
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 1..70
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 286..305
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 325..357
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 378..431
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 747..806
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1141..1186
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1232..1267
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 35..70
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 747..767
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1157..1174
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1250..1266
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:EES20269.1"
SQ SEQUENCE 1822 AA; 196007 MW; DD2884A47CB2EFD6 CRC64;
VEDDKDPPLA VKAQDEVPST TTVIGIRSEL DSVGKSDAAD TSNDSPLGGS SANHEVAPGD
SENTQIDESN QVEPWVRALA EGDYYDLSVE ERLNALVALV GVATEGNSIR GVLEKQMWAE
AQLDKRRSKE EFASRVQYNS DMGLKADIYQ ENNATEISST PACDVYKEND GHVGTINSCE
MDDQHNQGNF GSMAYERNGI GQEILATPDT SYVQQYAYAD KTRSQLKSYI GHRAEQLYVY
RSLPLGQDRR RNRYWQFTTS ASPNDLGSGR IFFESKDGCW RVIDSEEATS RRRRPAAAAM
AGRAPPPLPY LQQFPALPYP SWARAAAGGQ PRPPLQQPAG ADVAARGEAA MHADAQPADA
ARPSLLLAAV AGAEPDAAAA AAGGQPRPPL QQPAGADAAA RGEAAMAAAR AAAARGAHQD
RAPVDGTRAD AAEDAAAASK LQLGTNDDAA AAAYARGAAA AIAAGLGMPL DMTDLPRTLL
TAGGALSRGL AAGPRAAVGA APFPAPPPPP VMHRPDSTLI AALVTARAAA AEGRARVREA
ALAWERERDA ADALARQIAD AEQFLGLPAS PDVGTTSSGS TSVVWHDPAD PHVVQLHYLA
GGVQNIRLLV PVVLEPESPS YARWRDLVLL TLRRYALDDH VLLDTAGAVP TPSWLRLDSV
VLSWILGTIS LDLHDLVRNT PSARGAWLAL EGQFLGNAEA RALRLDASFR TFVQGDLSVS
EYCRQMKAWV HLRFGVLVVL LDCSRCHSST SSRRPTTVVT SGSSSPRAER GWGGGRGGRR
RRGGGRGAGR GGNTTPLPPP RGAPWPSYHH PWSGRISMWP FQASGGEPRP PAAMLAGAPP
GFPSVTPWAA PFPASSWATP PTPLPGSAGW DQAALAHSFS TMALTPPVGP EWVADSGATY
HTTPNPGILS SVRPPSPSLP SSIMVANGSC LPVTSFGLTI KAVQCDNGRE FDNSTSRAFF
LSHGVQLRMS CPYTSSQNGK AERMIRTTND TVRTLLLQAS LPARFWAESL HTSTYLLNRL
PSAACPAPTP HHALFGTPPR YDHLRVFGCA CYPNTTATAP HKLAPRSTLC VFLGYSPDHK
GYRCYDLTSR RVLISRHVVF DESIFPFSTT TTPASTSEHD LSSVFPTDPV VEPPFPVFPA
GTATSPVVRD TSGPLPCPGP EVSPSGPAPA PDAGPGSAPS TSAPPVRFAQ PVRVYQRRAP
DVGSVPSTPA PPARFAQPVR VYQRRARLAP LPPAAPVAPS SLGSPAPSAT SSPPATPTPP
PRHPATRAVT PVYHPPLLHR HPRHVHPMVT RHAAGTLQPR ALAAMPGDSQ VSPVPSSVRE
ALLDPHWRRA MEEEYAALLA NQTWDLVPRP PSSNIVTGKW IWTHKRRADG TLERYKARWV
LRGFTQRLGV DYDETFSPVV KPATVRTVLS LALTRGWPVH QLDVKNAFLH GVLTETVYCS
QPAGFVDSSC PDMVCRLKKS LYGLKQAPRA WNHRFAAFLL TLGFVEAKSD TSLFIYHYGA
ETAYLLLYVD DIVLTASSES LLRRIIASLQ QEFAMKDLGQ LHHFLGVTVE PHPAGLLLHQ
RQYTLDILER AGMTDCKPCS TPVDTQGKLS EAEGTPVTDP TAYRSLAGAL QYLTFTRPDI
TYAVQQICLH MHDPREPHLT ALKRILRYLR GSVDFGLLLH RRSSSTELVV YTDADWAGCP
DTRRSTSGYA VFLGGNLVSW SSKRQPVVSR SSAEAEYRAV ANGVAEASWL RQLLAELHSP
LSQSALVYCD NVSAVYLSTN PVQHQRTKHV EIDLHFVRDR VAVGDVRVLH VPTTSQFADI
FTKGLPSSTF AEFRSSLNIT SG
//