ID A0A5N6KYY5_9ROSI Unreviewed; 1327 AA.
AC A0A5N6KYY5;
DT 26-FEB-2020, integrated into UniProtKB/TrEMBL.
DT 26-FEB-2020, sequence version 1.
DT 27-MAR-2024, entry version 13.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:KAB8388288.1};
GN ORFNames=FH972_024764 {ECO:0000313|EMBL:KAB8388288.1};
OS Carpinus fangiana.
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fagales; Betulaceae; Carpinus.
OX NCBI_TaxID=176857 {ECO:0000313|EMBL:KAB8388288.1, ECO:0000313|Proteomes:UP000327013};
RN [1] {ECO:0000313|EMBL:KAB8388288.1, ECO:0000313|Proteomes:UP000327013}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Cfa_2016G {ECO:0000313|EMBL:KAB8388288.1};
RC TISSUE=Leaf {ECO:0000313|EMBL:KAB8388288.1};
RA Yang X., Wang Z., Zhang L., Hao G., Liu J., Yang Y.;
RT "A chromosomal-level reference genome of Carpinus fangiana (Coryloideae,
RT Betulaceae).";
RL Submitted (JUN-2019) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KAB8388288.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; VIBQ01000024; KAB8388288.1; -; Genomic_DNA.
DR Proteomes; UP000327013; Unassembled WGS sequence.
DR GO; GO:0004527; F:exonuclease activity; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR CDD; cd18673; PIN_XRN1-2-like; 1.
DR Gene3D; 1.25.40.1050; -; 1.
DR Gene3D; 3.40.50.12390; -; 2.
DR InterPro; IPR027073; 5_3_exoribonuclease.
DR InterPro; IPR041412; Xrn1_helical.
DR InterPro; IPR004859; Xrn1_N.
DR PANTHER; PTHR12341:SF79; 5'-3' EXORIBONUCLEASE 4-LIKE ISOFORM X1; 1.
DR PANTHER; PTHR12341; 5'->3' EXORIBONUCLEASE; 1.
DR Pfam; PF17846; XRN_M; 2.
DR Pfam; PF03159; XRN_N; 1.
PE 4: Predicted;
KW Exonuclease {ECO:0000256|ARBA:ARBA00022839};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Nuclease {ECO:0000256|ARBA:ARBA00022722};
KW Reference proteome {ECO:0000313|Proteomes:UP000327013}.
FT DOMAIN 1..244
FT /note="Xrn1 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF03159"
FT DOMAIN 316..409
FT /note="Xrn1 helical"
FT /evidence="ECO:0000259|Pfam:PF17846"
FT DOMAIN 445..735
FT /note="Xrn1 helical"
FT /evidence="ECO:0000259|Pfam:PF17846"
FT REGION 1107..1126
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1327 AA; 148533 MW; 2E994B1C9B21ECAB CRC64;
MGIPAFYRWL VDRYPLAVVS AIEDEPPVTD TTRPNPNGDE FDNLYLDMNG IVHPCFHPDG
LPPPKSYEDI FLAVFNYIDR IFSIVRPRKL LYLAIDGVAP RAKMNQQRSR RFRAAKDAAD
EASGREWKLE AEEERSVSLE QSKKLDSNVI TPGTEFMVLL SSALRYYVHL RMNRDAGWRE
IKVILSDANV PGEGEHKIMS YIRLQRNLPG YDPNTRHCLY GLDADLIMLA LATHEIHCSI
LREDVRRASL NDKSLKHAKN SLPINRQEER GNSKQMEVVG ENLEDYVSRQ KFQFLNIWVL
REYLTLDMKV KGRKQKAERQ IDDFVFMCLF VGNDFLPHIP SLEISEGAID LLMMVYKKEF
VKMGGYLTNS FKVNLERVEH FVHAVGSHEC AIFRRRSQVE KDRKLQLRRF LGKKRARCYI
NRMRQNPPKS RAVGIQTPSN CTFISANAVV DKIKLGEMGW KERFYAEKFE AESEDDRNKI
QRDAVFKYIE GICWVMRYYY EGVCSWQWFY PYHYAPFASD FHGFGQLKIE FTLGKPFKPF
DQLLSVLPAA SAHSLPLFYR KLMTDMSSPL LDFYPTDFEL DMNGKRFLWQ AICKLPFIDE
ARLLSEITKV EHTLTDEERR RNSFSLDVLF VHISHPLAVK IRSLCEKKSC HLNLPEAKVK
RKINPEFSGG MNGYMYISGE PVQPAEIYSP IEEMEMILNN EVLSVFFKCP RFHSHIPRQS
SGVKLPRKSV RKQDILAPPV LWHEKSAVLG RIFSERSTHK SISGRRLAKL AHKLLSKYYN
LKSQKAWGSM ELADHIGADG LTSHSNLEET VRVGGTGHIE NCIDGVIPDN HVGEDKLSCH
LNLEEMVCAR TEGDNNCFDG KAKKRIWSEC NGHSTDGVVD GSACSSNLEG TVTVGGAGGD
ETCINIKSRK RRRKGHKLRD TLENNVGAEK LECPLNLGPT VCEVGIEADE NSVDNGVPNN
QIGETGASGL ACNSVAPDKK CVNGKSRKQK RKRKESNAIV VLDNHEGAKD FAMEPTCCDV
GTEADKNSVD NGVPNNHIGE TGASGLACHS VLDNHEGAMD LAMGPTVCKV GTEADKNFVD
NGVPNNHIGE TGASGLACHS VAPDKKCVNG KSRKQKRKQK QRQNNAIVVP DNHEGAKDLA
FPSDLLPTVC AAGVEADKNR VDNGTPIKLV GETGVNGLAC ANEVRDYGKS SKRKRKRKQS
NIILVADNHV GAKGLTRPSN WMPTVCDVGT GAVKCVDHGV PEKQVGETGE NGFAGANGVP
DKNCVDDKLG RQEGSKVPVP WMKNIAKEGL ACTSNWGPTV CEERIIADEN CVDSGFPDNH
LVVESQI
//