ID A0A151SXQ5_CAJCA Unreviewed; 1124 AA.
AC A0A151SXQ5;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE SubName: Full=Retrotransposable element Tf2 {ECO:0000313|EMBL:KYP59599.1};
GN ORFNames=KK1_015035 {ECO:0000313|EMBL:KYP59599.1};
OS Cajanus cajan (Pigeon pea) (Cajanus indicus).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Cajanus.
OX NCBI_TaxID=3821 {ECO:0000313|EMBL:KYP59599.1, ECO:0000313|Proteomes:UP000075243};
RN [1] {ECO:0000313|EMBL:KYP59599.1, ECO:0000313|Proteomes:UP000075243}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Asha {ECO:0000313|Proteomes:UP000075243};
RX PubMed=22057054; DOI=10.1038/nbt.2022;
RA Varshney R.K., Chen W., Li Y., Bharti A.K., Saxena R.K., Schlueter J.A.,
RA Donoghue M.T., Azam S., Fan G., Whaley A.M., Farmer A.D., Sheridan J.,
RA Iwata A., Tuteja R., Penmetsa R.V., Wu W., Upadhyaya H.D., Yang S.P.,
RA Shah T., Saxena K.B., Michael T., McCombie W.R., Yang B., Zhang G.,
RA Yang H., Wang J., Spillane C., Cook D.R., May G.D., Xu X., Jackson S.A.;
RT "Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop
RT of resource-poor farmers.";
RL Nat. Biotechnol. 30:83-89(2012).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM003612; KYP59599.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A151SXQ5; -.
DR OMA; HACLANL; -.
DR Proteomes; UP000075243; Chromosome 10.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.10.20.370; -; 1.
DR Gene3D; 3.30.70.270; -; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR Gene3D; 4.10.60.10; Zinc finger, CCHC-type; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR PANTHER; PTHR45835:SF105; IPP TRANSFERASE; 1.
DR PANTHER; PTHR45835; YALI0A06105P; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF08284; RVP_2; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SMART; SM00343; ZnF_C2HC; 2.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Reference proteome {ECO:0000313|Proteomes:UP000075243};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 205..220
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 768..931
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 137..181
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 218..247
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 220..240
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KYP59599.1"
SQ SEQUENCE 1124 AA; 127226 MW; 45B36F5E5C9EDFBF CRC64;
ERRLTYVVYM LVGEAEHWWR NTYQMIAARG VIVDWECFKT AFMEKYFPES VRNAKEAEFL
QLHQGGLSVS EYALRFEHLA RFYSQTVSEA WKCRRFAEGL KYELKKMVVP MAITEFSALV
EKAKIVERLE EGNSVIRTTE GLAGSKRKGG SHKKPYDRPQ PQGSPVIRQP YGVASGGKQS
GSTTLRCYRC AGPHLIRDCS HTVSRCFRCQ QMGHESFNCP TRNKQEKDAQ KSDVQRGDAQ
RGDRPTTAGR IFAMTGAEAS TSSDANVIFL FDSGASHSFI SYACVDILGV PVCDLGLKLL
VSTPASTSVV ASELCVNCPI VVNEKKYKVN LICLPLIDID IILGMDWLST NRILIDCANR
LLIFPQEEDE LLISASQAKL LLKEGAECRF LLAAMSIETE KIIAKIDVVR DFAEVFPDEV
PGLPPVREME FSINLVPGAG PVSIAPYRMA PAELAELKGQ LEDLLEKQLV RPSVSPWGAP
VLLVKKKDGG SRLCVDYRQL NKLTIKNKYP LPRIDDLMDQ LRGASVFSKI DLRSGYHQIR
VKEGDIPKTA FRTRRFIEGF SKIVAPLTQL TRKEQPFIWT DTCEQSFVEL KKRLTTSPVL
VLPDSGEPFD VYCDASHQGL GCVLMQHEKV VAYASRQLKN HERNYPTHDL ELAAVVGLKQ
LQDTELVKLL GLLGTEKAIG FELGEDGILR FKGRICLPQD AELKKAVLDE GHKSRLSIHP
GMTKMYQDLK KTFWWSGMKR EIAEYVAGCL TCQKAKVEHQ KPSGLMQQME IPEWKWDSIT
MDFIVGLPRS ARNSDAIWVI VDQLTKCAHF LPVNIKWSLE KLTQLYIREI VRLHGVPSSI
ISDRDPRFTS RFWQSLHQAL GTKLKLSSAY HPQTDGQSER TIQSLEDLLR ACVLDHLGSW
EEVLPLVEFT YNNSFHASIG MAPFEALYGR RCRTPLCWYQ NGESVIVGPE LILQTTEKVK
IIQEKMRTAQ SRQKSYADKR RKPLEFAEGE HVFLKVTPTS GVGRALKARK LTPRFVGPYQ
IIQRVGPVAY RLALPSSLSN LHDVFHVSQL RKYVHDPSHV VEMDEVQVKE NLTYEKKPVA
IIDHKLTELR GKSIKLVKIL WDATTGEATW EVESQFKEQY PYLF
//