ID A0A251U0L6_HELAN Unreviewed; 1069 AA.
AC A0A251U0L6;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 24-JAN-2024, entry version 17.
DE SubName: Full=Occludin domain-containing protein {ECO:0000313|EMBL:KAF5793376.1};
DE SubName: Full=Putative occludin-like domain-containing protein {ECO:0000313|EMBL:OTG16890.1};
GN ORFNames=HannXRQ_Chr09g0276331 {ECO:0000313|EMBL:OTG16890.1},
GN HanXRQr2_Chr09g0416731 {ECO:0000313|EMBL:KAF5793376.1};
OS Helianthus annuus (Common sunflower).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Asteroideae;
OC Heliantheae alliance; Heliantheae; Helianthus.
OX NCBI_TaxID=4232 {ECO:0000313|EMBL:OTG16890.1, ECO:0000313|Proteomes:UP000215914};
RN [1] {ECO:0000313|EMBL:KAF5793376.1, ECO:0000313|Proteomes:UP000215914}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. SF193 {ECO:0000313|Proteomes:UP000215914};
RC TISSUE=Leaves {ECO:0000313|EMBL:KAF5793376.1};
RX PubMed=28538728; DOI=10.1038/nature22380;
RA Badouin H., Gouzy J., Grassa C.J., Murat F., Staton S.E., Cottret L.,
RA Lelandais-Briere C., Owens G.L., Carrere S., Mayjonade B., Legrand L.,
RA Gill N., Kane N.C., Bowers J.E., Hubner S., Bellec A., Berard A.,
RA Berges H., Blanchet N., Boniface M.C., Brunel D., Catrice O., Chaidir N.,
RA Claudel C., Donnadieu C., Faraut T., Fievet G., Helmstetter N., King M.,
RA Knapp S.J., Lai Z., Le Paslier M.C., Lippi Y., Lorenzon L., Mandel J.R.,
RA Marage G., Marchand G., Marquand E., Bret-Mestries E., Morien E.,
RA Nambeesan S., Nguyen T., Pegot-Espagnet P., Pouilly N., Raftis F.,
RA Sallet E., Schiex T., Thomas J., Vandecasteele C., Vares D., Vear F.,
RA Vautrin S., Crespi M., Mangin B., Burke J.M., Salse J., Munos S.,
RA Vincourt P., Rieseberg L.H., Langlade N.B.;
RT "The sunflower genome provides insights into oil metabolism, flowering and
RT Asterid evolution.";
RL Nature 546:148-152(2017).
RN [2] {ECO:0000313|EMBL:OTG16890.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC TISSUE=Leaves {ECO:0000313|EMBL:OTG16890.1};
RA Langlade N., Munos S.;
RT "Sunflower complete genome.";
RL Submitted (FEB-2017) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:KAF5793376.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Leaves {ECO:0000313|EMBL:KAF5793376.1};
RA Gouzy J., Langlade N., Munos S.;
RT "Helianthus annuus Genome sequencing and assembly Release 2.";
RL Submitted (JUN-2020) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MNCJ02000324; KAF5793376.1; -; Genomic_DNA.
DR EMBL; CM007898; OTG16890.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A251U0L6; -.
DR STRING; 4232.A0A251U0L6; -.
DR EnsemblPlants; mRNA:HanXRQr2_Chr09g0416731; mRNA:HanXRQr2_Chr09g0416731; HanXRQr2_Chr09g0416731.
DR Gramene; mRNA:HanXRQr2_Chr09g0416731; mRNA:HanXRQr2_Chr09g0416731; HanXRQr2_Chr09g0416731.
DR InParanoid; A0A251U0L6; -.
DR OMA; NGSHYNN; -.
DR OrthoDB; 1215502at2759; -.
DR Proteomes; UP000215914; Chromosome 9.
DR Gene3D; 6.10.140.340; -; 1.
DR InterPro; IPR010844; Occludin_ELL.
DR PANTHER; PTHR38372; DENTIN SIALOPHOSPHOPROTEIN-LIKE PROTEIN; 1.
DR PANTHER; PTHR38372:SF2; DENTIN SIALOPHOSPHOPROTEIN-LIKE PROTEIN; 1.
DR Pfam; PF07303; Occludin_ELL; 1.
DR SUPFAM; SSF144292; occludin/ELL-like; 1.
DR PROSITE; PS51980; OCEL; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000215914}.
FT DOMAIN 959..1067
FT /note="OCEL"
FT /evidence="ECO:0000259|PROSITE:PS51980"
FT REGION 1..62
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 221..307
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 402..655
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 667..778
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 799..826
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 839..945
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 23..62
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 245..307
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 421..442
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 499..532
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 561..628
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 702..731
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 754..771
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 810..826
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 894..945
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1069 AA; 119370 MW; 8EA6C73D3F454672 CRC64;
MYGGSAKLGG RGRGGGPVKR NIHSAFQPSS VQRPSATPAG TGHRNRSNTP AAAAATTSTA
EESFSLVRNN PLNFGMIIRL TPVLIEEIKR LEAEGGVARM RFDSSANNPN GNVINVGDKE
FRFTWSQETG DLCDIYEERR TGDGDGLLVE SGGAWRKLNV QRELDESVKN HVKMRTVEAE
RKHKSRKAII LDHKNPSMKN QMKALAAAEV NNAWRGSYKQ KKDLPFKKTK PETSSAVIPL
KSGGKAGFSS STPSKVRAST SPLQMTPEQS GVPLSPLRSN NANKSHANRE DATLTQSSKE
NASTSEREMF NRVPVPAGIV QNKPGSNERF GNKPTDPQSL LISLLMGKPQ GMNLKALEKA
IGETIPKSVK QIEPILKKIA VFQAPGRYVL KPDVELESVK NPLFESGSSP ENDNHHREAT
TAPESSFPSR TDDVTETEQP SHLISKPYED LNILENNDIG NPSPDALSDK KVSHNNEDHA
VSSSRSGSDS DSESDSSDSE SNSRSPVGSK SGNSSDSDSD SDASLNSKQG SDEDVDIMSD
DDKEPNQNLQ PFNHELGYAP HNMLDLEKDL FEDNKDEDNY ADDTKNLFNN HQESEVHGHA
KKVVSRSNTS KRGSDEKHFD ESENAKRLKS GNSSRSSVLR SPDGPGMMSR TVGDLSDYDY
ENVNNREFLG NSTLDSPRSG PRSIDLNARA KPPADMDNIV RFSERGPQDN RVNKETRDED
GQPKDRRPPK NSGGKPPGSH QKKHGALIGK NKEPELLSTS QIKNSPADLK KSTVINGRGP
ALQRELSDLE MGELRENWHE EKRFGKNNSF KQSENKSSSD YWNLDESTVK PNDVVNLPKK
VVSDDHVDDF TRFNGKQSLS RTDHLKSGSQ NDKGKHNEAG SEGYTVNTES QRKGHVGGPH
KHEKQVMPTK DKKRHKSKDI GEKKKDFRLI NSRDNSETKR REMESCSDDS ITSYIKYEKD
EPEMKGPIRD ISQYNEYVQE YHEKYDCYHT LNKILESYRN EFQTFGRDLE LAKGKDIERY
NKILEQLMES YRQCGMKHKR LKKIFVVLHH ELQHLKEMIR DFVEKQTKG
//