ID A0A1U8MW39_GOSHI Unreviewed; 1091 AA.
AC A0A1U8MW39;
DT 10-MAY-2017, integrated into UniProtKB/TrEMBL.
DT 10-MAY-2017, sequence version 1.
DT 27-MAR-2024, entry version 24.
DE SubName: Full=Uncharacterized protein LOC107941939 {ECO:0000313|RefSeq:XP_016731056.1};
GN Name=LOC107941939 {ECO:0000313|RefSeq:XP_016731056.1};
OS Gossypium hirsutum (Upland cotton) (Gossypium mexicanum).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium.
OX NCBI_TaxID=3635 {ECO:0000313|Proteomes:UP000189702, ECO:0000313|RefSeq:XP_016731056.1};
RN [1] {ECO:0000313|Proteomes:UP000189702}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. TM-1 {ECO:0000313|Proteomes:UP000189702};
RX PubMed=25893780; DOI=10.1038/nbt.3208;
RA Li F., Fan G., Lu C., Xiao G., Zou C., Kohel R.J., Ma Z., Shang H., Ma X.,
RA Wu J., Liang X., Huang G., Percy R.G., Liu K., Yang W., Chen W., Du X.,
RA Shi C., Yuan Y., Ye W., Liu X., Zhang X., Liu W., Wei H., Wei S., Huang G.,
RA Zhang X., Zhu S., Zhang H., Sun F., Wang X., Liang J., Wang J., He Q.,
RA Huang L., Wang J., Cui J., Song G., Wang K., Xu X., Yu J.Z., Zhu Y., Yu S.;
RT "Genome sequence of cultivated Upland cotton (Gossypium hirsutum TM-1)
RT provides insights into genome evolution.";
RL Nat. Biotechnol. 33:524-530(2015).
RN [2] {ECO:0000313|RefSeq:XP_016731056.1}
RP IDENTIFICATION.
RC TISSUE=Leaf {ECO:0000313|RefSeq:XP_016731056.1};
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_016731056.1; XM_016875567.1.
DR AlphaFoldDB; A0A1U8MW39; -.
DR STRING; 3635.A0A1U8MW39; -.
DR PaxDb; 3635-A0A1U8MW39; -.
DR Proteomes; UP000189702; Unplaced.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:InterPro.
DR CDD; cd06222; RNase_H_like; 1.
DR CDD; cd01650; RT_nLTR_like; 1.
DR Gene3D; 3.60.10.10; Endonuclease/exonuclease/phosphatase; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR036691; Endo/exonu/phosph_ase_sf.
DR InterPro; IPR005135; Endo/exonuclease/phosphatase.
DR InterPro; IPR044730; RNase_H-like_dom_plant.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR002156; RNaseH_domain.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR PANTHER; PTHR46890:SF32; NON-LTR RETROLELEMENT REVERSE TRANSCRIPTASE-LIKE PROTEIN; 1.
DR PANTHER; PTHR46890; NON-LTR RETROLELEMENT REVERSE TRANSCRIPTASE-LIKE PROTEIN-RELATED; 1.
DR Pfam; PF03372; Exo_endo_phos; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF13456; RVT_3; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF56219; DNase I-like; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000189702}.
FT DOMAIN 359..633
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
SQ SEQUENCE 1091 AA; 124532 MW; 3B5EC79FF5479D42 CRC64;
MKILSWNVRG LGSPRTVHRL RHWLRLYNPH IVFFMETKVD SRRMEQIRRR CGFLNGIEVD
ATGSRGGLCL AWRRDIGIRI QSYSKSHIDV LIEDVERGKN WRFISFYGSP YSNKEESWNL
LRRLSNARGL PREEGRMEAF RTVLEDCNLR DIGFSGRWFT WEIGNLPETN IRERLDRGVA
EDKRIFARGF QFEAWWVLEE SFYSEVKSIW GMAKGDVLSK MESLKRGLTI WADKIQKIKK
GRKMATQKRQ KNFINSLQSD DGKETDDLSE MEEITGCYFQ KLFSAGRKGD YDRIMAGIKQ
CIFEEDNQKL KENYTKEKIC VALSELGPTK APGEDGFPII FYQKCWSIIG EEVTSFCLTL
LNGGMDVTSI NKTNIILIAK IPNPVNISNF RPISLCNVLY KLLAKVIANR LRLVMNRCID
EAQSAFVPRR LISDNVLLAY EVLHSLKNKR IGKKGLMAVK LDMSKAYDRV EWSFVEEIMK
KLGFDPEWVE KLMKCVSTVS YSVVLNGVNG ERFFPSRGLR QGDPLSPFLF LFCGEGISSL
LRQAMEVNLS RGVRVSRNGP LISHLLFADD CILFGEATER GATVLKNILK EYEISSGQCV
NYNNSINPER YLGLPNLVGR GKKAAFQRLK DSLKQKIDNW SVKFLSQGGK EILVGKKRGK
KGIHWCMWEE VCDLKEAGGL GFRKLDKFNI ALLAKQAWRL INYPDSLIGR VLKAKYYPNA
CFPKAQLGNL PSLTWKSIWS ARGILEKGLC WRVGKGDRID VWEDLWISGN EDDRLQNHQR
DENIKLVSDL INADKREWKA DILSTTFNAE VVKKILQIPL ATTATEDFQL QMLDVLVVNW
KRRTAIISSV NALQQKKFGD SHSYHGRGTT EQCRIFCCGL WTIWTSRNKL VYENRQTTGS
DISYKISDFF AELKGIQEKK LILADDGAPR TEESSTRTSI YFDAAFDQQN ARSASGLLVR
GEGGEILVSK SVIHTNIATP FAAEAHAGLQ ALELGRSMWL TYLQIKGYSK TIIKKCQNSE
QDKSVIGALI RDIQELRTTF NSICFCYIPR NANIVVHSIA IEALKKGEEH YLVGAIPNTV
RLVAEKMNPR F
//