ID A0A1R3KR00_9ROSI Unreviewed; 440 AA.
AC A0A1R3KR00;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 27-MAR-2024, entry version 16.
DE RecName: Full=Retrotransposon gag protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=COLO4_05398 {ECO:0000313|EMBL:OMP09516.1};
OS Corchorus olitorius.
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Grewioideae; Apeibeae; Corchorus.
OX NCBI_TaxID=93759 {ECO:0000313|EMBL:OMP09516.1, ECO:0000313|Proteomes:UP000187203};
RN [1] {ECO:0000313|Proteomes:UP000187203}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. O-4 {ECO:0000313|Proteomes:UP000187203};
RA Alam M., Haque M.S., Islam M.S., Emdad E.M., Islam M.M., Ahmed B.,
RA Halim A., Hossen Q.M.M., Hossain M.Z., Ahmed R., Khan M.M., Islam R.,
RA Rashid M.M., Khan S.A., Rahman M.S., Alam M., Yahiya A.S., Khan M.S.,
RA Azam M.S., Haque T., Lashkar M.Z.H., Akhand A.I., Morshed G., Roy S.,
RA Uddin K.S., Rabeya T., Hossain A.S., Chowdhury A., Snigdha A.R.,
RA Mortoza M.S., Matin S.A., Hoque S.M.E., Islam M.K., Roy D.K., Haider R.,
RA Moosa M.M., Elias S.M., Hasan A.M., Jahan S., Shafiuddin M., Mahmood N.,
RA Shommy N.S.;
RT "Corchorus olitorius genome sequencing.";
RL Submitted (SEP-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OMP09516.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AWUE01012334; OMP09516.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1R3KR00; -.
DR STRING; 93759.A0A1R3KR00; -.
DR OrthoDB; 953779at2759; -.
DR Proteomes; UP000187203; Unassembled WGS sequence.
DR CDD; cd00303; retropepsin_like; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR InterPro; IPR032567; LDOC1-rel.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR PANTHER; PTHR15503:SF22; CCHC-TYPE DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR PANTHER; PTHR15503; LDOC1 RELATED; 1.
DR Pfam; PF08284; RVP_2; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000187203}.
SQ SEQUENCE 440 AA; 49490 MW; E0ECAC9401113369 CRC64;
MAPKVDRSDT SITEALHALT ETFNTQFQEL RASQIELKQS LDTKLDLAIA DFNKKLAIRE
SPSSSSLPSH KYDGVLGTFP GLTPPNTTNP LFKPKTPKFF LSSFDGTNVH AWLFQAEQYF
KFYSITHEQR VPMVHFFMTG EAAAWYQWMY KNNQLSDWES SRAIEICFGP SKFLNPQSAL
FKLRQTGTEV ITFKPITLAH AFELAKHIES KLFESRQTPS RAPPRAFQTP VQKPIPPTSY
PIRRLSPTKM QARHSKGLCF NCDEQFKPGH RYKTTLFLLL QTEDDFYDPL ISLETSKTTD
CLPLSSLPLP PPPKMPLIPE EKTPDFQVSL HTLHGIASHS CLKLTGVIHG HSSTILIDSG
STHNLVQPRV VRHLGLAVEP APPLTVRVWN GDVLRCAGKI SMLKVDLQNM VFSLYLFLLD
IHGADSIGHP MVGPTRLNLC
//