ID A0A1U8N8I0_GOSHI Unreviewed; 580 AA.
AC A0A1U8N8I0;
DT 10-MAY-2017, integrated into UniProtKB/TrEMBL.
DT 10-MAY-2017, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE SubName: Full=Uncharacterized protein LOC107944788 {ECO:0000313|RefSeq:XP_016734104.1};
GN Name=LOC107944788 {ECO:0000313|RefSeq:XP_016734104.1};
OS Gossypium hirsutum (Upland cotton) (Gossypium mexicanum).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium.
OX NCBI_TaxID=3635 {ECO:0000313|Proteomes:UP000189702, ECO:0000313|RefSeq:XP_016734104.1};
RN [1] {ECO:0000313|Proteomes:UP000189702}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. TM-1 {ECO:0000313|Proteomes:UP000189702};
RX PubMed=25893780; DOI=10.1038/nbt.3208;
RA Li F., Fan G., Lu C., Xiao G., Zou C., Kohel R.J., Ma Z., Shang H., Ma X.,
RA Wu J., Liang X., Huang G., Percy R.G., Liu K., Yang W., Chen W., Du X.,
RA Shi C., Yuan Y., Ye W., Liu X., Zhang X., Liu W., Wei H., Wei S., Huang G.,
RA Zhang X., Zhu S., Zhang H., Sun F., Wang X., Liang J., Wang J., He Q.,
RA Huang L., Wang J., Cui J., Song G., Wang K., Xu X., Yu J.Z., Zhu Y., Yu S.;
RT "Genome sequence of cultivated Upland cotton (Gossypium hirsutum TM-1)
RT provides insights into genome evolution.";
RL Nat. Biotechnol. 33:524-530(2015).
RN [2] {ECO:0000313|RefSeq:XP_016734104.1}
RP IDENTIFICATION.
RC TISSUE=Leaf {ECO:0000313|RefSeq:XP_016734104.1};
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_016734104.1; XM_016878615.1.
DR AlphaFoldDB; A0A1U8N8I0; -.
DR STRING; 3635.A0A1U8N8I0; -.
DR PaxDb; 3635-A0A1U8N8I0; -.
DR Proteomes; UP000189702; Unplaced.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 4.10.60.10; Zinc finger, CCHC-type; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR032567; LDOC1-rel.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR001878; Znf_CCHC.
DR PANTHER; PTHR15503; LDOC1 RELATED; 1.
DR PANTHER; PTHR15503:SF43; REVERSE TRANSCRIPTASE RNASE H-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF08284; RVP_2; 1.
DR Pfam; PF00098; zf-CCHC; 1.
DR SMART; SM00343; ZnF_C2HC; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Reference proteome {ECO:0000313|Proteomes:UP000189702};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 233..248
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT REGION 178..208
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 257..289
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 580 AA; 65591 MW; 50AFD74004A942F6 CRC64;
MESTKRILQQ LDCTPRECLI CAVSLLQGEA YLWWESVVRH LPESQITWDL FQKEFQKKYI
GEMYIEDKKQ EFLLLQQGDM SVIDYEREFS RLSRYASEFI PTEADSCKRF LRGLRDEIKV
QLVSHRITEL VDLIERAKMV EQVLGLDKKT EVVRPTGKRT GTTSSNPQPK RLKEFQSGWR
SSFRSDRGGR SRGKQTMIST GSVRGPSREI DIPDYQHCGK KHRGECWKLT RGCFRCGSTD
HFIRDCSKVD STVPVTSQRS VSTARGRGLG RGGSISRGGS IRRSSDIATQ QSEAKVPARA
YVVRTREEGD AHDVVTGIFL LYSEPVYALI DPGSSHSYIN SKLVELGKFN SEISRVTVEV
SSPLGQTVLV NQVCPRCPLI IQNKTFPIDL LIMPFGDFDI ILGMDWLAEH GVVLDCYKKK
FSIQTEDGDR IEVNGIRTNG SARIISAIKA NKLLQRGCTA YLAYVINSDL VGSQCSKIRT
VCEFPDVFPE ELPGLPPDRE VEFAIEVYPG TAPISIPPYR MSPTELKELK VQLQDLLDRG
FIRPSISPWG APVLFVKKKD GSMRLCIDYR QLNKVTIKNR
//