ID A0A1U8I8W3_GOSHI Unreviewed; 1110 AA.
AC A0A1U8I8W3;
DT 10-MAY-2017, integrated into UniProtKB/TrEMBL.
DT 10-MAY-2017, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=Uncharacterized protein LOC107894016 isoform X1 {ECO:0000313|RefSeq:XP_016674687.1};
DE SubName: Full=Uncharacterized protein LOC107894016 isoform X2 {ECO:0000313|RefSeq:XP_016674688.1};
DE SubName: Full=Uncharacterized protein LOC107894017 isoform X1 {ECO:0000313|RefSeq:XP_016674689.1};
DE SubName: Full=Uncharacterized protein LOC107894017 isoform X2 {ECO:0000313|RefSeq:XP_016674690.1};
GN Name=LOC107894017 {ECO:0000313|RefSeq:XP_016674689.1,
GN ECO:0000313|RefSeq:XP_016674690.1};
GN Synonyms=LOC107894016 {ECO:0000313|RefSeq:XP_016674687.1,
GN ECO:0000313|RefSeq:XP_016674688.1};
OS Gossypium hirsutum (Upland cotton) (Gossypium mexicanum).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium.
OX NCBI_TaxID=3635 {ECO:0000313|Proteomes:UP000189702, ECO:0000313|RefSeq:XP_016674690.1};
RN [1] {ECO:0000313|Proteomes:UP000189702}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. TM-1 {ECO:0000313|Proteomes:UP000189702};
RX PubMed=25893780; DOI=10.1038/nbt.3208;
RA Li F., Fan G., Lu C., Xiao G., Zou C., Kohel R.J., Ma Z., Shang H., Ma X.,
RA Wu J., Liang X., Huang G., Percy R.G., Liu K., Yang W., Chen W., Du X.,
RA Shi C., Yuan Y., Ye W., Liu X., Zhang X., Liu W., Wei H., Wei S., Huang G.,
RA Zhang X., Zhu S., Zhang H., Sun F., Wang X., Liang J., Wang J., He Q.,
RA Huang L., Wang J., Cui J., Song G., Wang K., Xu X., Yu J.Z., Zhu Y., Yu S.;
RT "Genome sequence of cultivated Upland cotton (Gossypium hirsutum TM-1)
RT provides insights into genome evolution.";
RL Nat. Biotechnol. 33:524-530(2015).
RN [2] {ECO:0000313|RefSeq:XP_016674687.1, ECO:0000313|RefSeq:XP_016674688.1}
RP IDENTIFICATION.
RC TISSUE=Leaf {ECO:0000313|RefSeq:XP_016674687.1,
RC ECO:0000313|RefSeq:XP_016674688.1};
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_016674687.1; XM_016819198.1.
DR RefSeq; XP_016674688.1; XM_016819199.1.
DR RefSeq; XP_016674689.1; XM_016819200.1.
DR RefSeq; XP_016674690.1; XM_016819201.1.
DR PaxDb; 3635-A0A1U8I8W3; -.
DR Proteomes; UP000189702; Chromosome 15.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR Gene3D; 4.10.60.10; Zinc finger, CCHC-type; 1.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR001878; Znf_CCHC.
DR PANTHER; PTHR45835:SF105; IPP TRANSFERASE; 1.
DR PANTHER; PTHR45835; YALI0A06105P; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF08284; RVP_2; 1.
DR Pfam; PF00098; zf-CCHC; 1.
DR SMART; SM00343; ZnF_C2HC; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Reference proteome {ECO:0000313|Proteomes:UP000189702};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 372..387
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 737..900
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 26..49
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 74..110
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 312..336
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 392..437
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 78..99
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 404..424
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1110 AA; 129190 MW; E17C6F420427C5D1 CRC64;
MCMIRSNLCY FYRYSPIIKM SNLPERTDQE EVNSRAQNSE QRASSDVPIS QMREHELKNM
IYGIMNQWYT EMRQERNQAE PPPPPTAPSM VPPVAPPPFT TTESSKRSPL ERLRKLGAEE
FRGRTDDDPV KAEYWLQSLI RIFKEMACSP DDYLRCAVSL LKEEAYSWWE TIEAVVPAEK
ISWEFFQNEF KKKYVGRRYL DKKKREFLDL RQGNKSVVEY EREFVYLSKY ARDIVSTEEE
MCIRFEEGLN DEIRMMIGGN EIREFVVLSD RAQKIEEVYN RKMQRDRKSK EPFKRGASKS
FSDFPVKKSR EEINRTTSVS GRSGRDRPRQ SDFRVFDRPV ASVSSVQNAS RPKCQYCGRY
HFGECRTKMG ACYKCGATDH LIRDCPRLQK DEVEQKEIQR TIPQRSRRSG QSSATGTTRS
GTRESVGRSE NRAPARTYAI RAREEATAPD VIAGHSVMVN LICRNCPLKV KGYEFPADLM
LLPFREFDII LGMDWLMRHD AVVNCRDKQI SLKCQTGDVI SVGSENMGDT VRIISALSAQ
RLLRKGNEAF LAYILDTRGS DLKLEQVPVV NEFPDVFPEE LPGLPPDREV EFVIDVIPGT
TPISMTPYRM APAELKAKPV FFQRIRELQD EDPKLMLKRQ MVQNELSLEY SIDENGMLYY
RNRICVPNNL DLKNDILSEA HSSMCSIHPG STKMYCDLKK MYWWPGMKRE ICEYVARCLI
CQQVKAEHQV PTGLLQPIMI PEWKWEHVTM DFVSGLPVTP KKKDSIWVIV DRLTKSAHFI
PVRTDYQLEK LAELYVSEIV RLHGVPISII SDRDPRFTSR FWSKLQEALG TKLNFSTAFH
PQTDGQSERV IQILEDMLRC CILEFGGSWE RYLPLAEFAY NNSYQTSIKM APFEALYGRK
CRTPLYWTEL SESKLVGVDL IRETEEKVRI IRDCLKAASD RQKSYADLKR RDIEFSVGDR
VFLKVSPWKK VLRFGRKGKL SPRFIGPYEI IERIGPVAYR LALPRELENI HNVFHVSMLR
RYRSDPSHVI PHTEIELQPD MTYSEEPVKI LAREVKELRN KRVPLVKVLW NRHGSEEATW
ETEELMRFQY PNLFPDREPE TSRGKEKVAD
//