ID I3K7I4_ORENI Unreviewed; 2395 AA.
AC I3K7I4;
DT 11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT 17-JUN-2020, sequence version 2.
DT 27-MAR-2024, entry version 59.
DE SubName: Full=Uncharacterized LOC100700562 {ECO:0000313|Ensembl:ENSONIP00000017079.2};
GN Name=LOC100700562 {ECO:0000313|Ensembl:ENSONIP00000017079.2};
OS Oreochromis niloticus (Nile tilapia) (Tilapia nilotica).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC Pseudocrenilabrinae; Oreochromini; Oreochromis.
OX NCBI_TaxID=8128 {ECO:0000313|Ensembl:ENSONIP00000017079.2, ECO:0000313|Proteomes:UP000005207};
RN [1] {ECO:0000313|Proteomes:UP000005207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Broad Institute Genome Assembly Team;
RG Broad Institute Sequencing Platform;
RA Di Palma F., Johnson J., Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Oreochromis niloticus (Nile Tilapia).";
RL Submitted (JAN-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSONIP00000017079.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the beta/gamma-crystallin family.
CC {ECO:0000256|ARBA:ARBA00009646}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_019206314.1; XM_019350769.1.
DR STRING; 8128.ENSONIP00000054278; -.
DR Ensembl; ENSONIT00000017094.2; ENSONIP00000017079.2; ENSONIG00000013581.2.
DR GeneID; 100700562; -.
DR KEGG; onl:100700562; -.
DR CTD; 55057; -.
DR eggNOG; ENOG502R2J7; Eukaryota.
DR GeneTree; ENSGT00940000157740; -.
DR HOGENOM; CLU_002147_1_0_1; -.
DR OrthoDB; 5354696at2759; -.
DR TreeFam; TF331078; -.
DR Proteomes; UP000005207; Linkage group LG22.
DR GO; GO:0030246; F:carbohydrate binding; IEA:UniProtKB-KW.
DR CDD; cd00161; RICIN; 1.
DR Gene3D; 2.80.10.50; -; 1.
DR Gene3D; 2.60.20.10; Crystallins; 6.
DR InterPro; IPR001064; Beta/gamma_crystallin.
DR InterPro; IPR011024; G_crystallin-like.
DR InterPro; IPR035992; Ricin_B-like_lectins.
DR InterPro; IPR000772; Ricin_B_lectin.
DR PANTHER; PTHR11818; BETA/GAMMA CRYSTALLIN; 1.
DR PANTHER; PTHR11818:SF42; RICIN B-TYPE LECTIN DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF00030; Crystall; 6.
DR Pfam; PF00652; Ricin_B_lectin; 1.
DR SMART; SM00458; RICIN; 1.
DR SMART; SM00247; XTALbg; 6.
DR SUPFAM; SSF49695; gamma-Crystallin-like; 3.
DR SUPFAM; SSF50370; Ricin B-like lectins; 1.
DR PROSITE; PS50915; CRYSTALLIN_BETA_GAMMA; 6.
DR PROSITE; PS50231; RICIN_B_LECTIN; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000005207};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 1723..1764
FT /note="Beta/gamma crystallin 'Greek key'"
FT /evidence="ECO:0000259|PROSITE:PS50915"
FT DOMAIN 1895..1941
FT /note="Beta/gamma crystallin 'Greek key'"
FT /evidence="ECO:0000259|PROSITE:PS50915"
FT DOMAIN 1942..1984
FT /note="Beta/gamma crystallin 'Greek key'"
FT /evidence="ECO:0000259|PROSITE:PS50915"
FT DOMAIN 2037..2079
FT /note="Beta/gamma crystallin 'Greek key'"
FT /evidence="ECO:0000259|PROSITE:PS50915"
FT DOMAIN 2128..2171
FT /note="Beta/gamma crystallin 'Greek key'"
FT /evidence="ECO:0000259|PROSITE:PS50915"
FT DOMAIN 2218..2259
FT /note="Beta/gamma crystallin 'Greek key'"
FT /evidence="ECO:0000259|PROSITE:PS50915"
FT REGION 1..110
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 126..190
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 239..273
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 313..382
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 523..545
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 746..806
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1011..1048
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1087..1112
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1221..1275
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1292..1312
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1354..1469
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1481..1503
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1748..1781
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 21..59
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 149..164
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 247..262
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 313..380
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 758..795
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1017..1048
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1242..1266
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1405..1469
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1757..1781
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2395 AA; 264292 MW; 1D0D7AE7CE361505 CRC64;
MAKRSSTRKS LKSLFSKSEA NLDESVEKDV QKNGGEKKRF KLFKFKIQSK NDSEKSANES
QKMLRAGANS KTADDGEPSA DNNGPHSKPP PVYATAPRVK AKELSYSETD LRKPKRFATI
SFGLRKKKKK DEENISKSTF GLPTQGIEER EEIPVEREPG NRKKFSMSQP ELDSNSKFDI
PSPPPVAANE TDSYFTLSAQ SQSVPVITVN TQQESKGPLT NASDLSDVQR QTLRAPIASI
PELQLDDTGS LEEKENIPVH DSSSRNDAKS ALLTATSETT SVVRTQAEHT LLASQPPTVL
DSSAYEAATV DSGNNSYDYI PQENESTSGT LPYHQSDAPD TVDSTISNSE SSLTSVDNSN
YTDVTGPSPD PQNDKLASSA DSAANHHEIF ISHTERSFPE TAKSVSTTSA VSSESSTAPV
NQDKVYGALY ESLFPQNFTS EVISSLLSPS PHICTDTTLL DTKTEFVIVK THNESHAETN
TQHSQTLTSR DLDLAAGGTI DAPDGSASIG EIKGSSFSES SRRSFSASQI QTVPENQNRY
VPTSLSATSK YSDSAGIPYS ELTRSRSELK SDNKSLVTDS VWSGPLCQNS APPISDYETS
HGIDAQVTDS KRRVILVKEL VTDETNVYPG CSSSVIDKMN LVKDLKGEIE MTDGFSGPTV
RNDRLLSPAY LSVGSDDDST KEIYYSAEED NAEESGDEEM FTIDEREESI VGLREEIVQH
SDKEISQRDE GDFRVVIVKM EQKEGKIDDE RKGEDYNGQS EVQQQLEEQK SGNQVHEMGT
SESSVKEQST TVWPQMKGEE EEEGRKDLLA TPVQQVSTMM VCGFAPPSSE QHGQVEESGG
NWTADLETED HSNGENLFPT EVEETLPNKD FTDSGCIHAA SSYTKEEDVI VHIITEEQVP
LIVNADKTAE GNKDTEISLV ITNSTLTDTA GAAEAVMGNL THSAHLQSDA TEIEHNRVPR
STEWVDTITQ SPDRNNTVQV LEQVAVELSA SNADTHHLDS DTAEGLCEHP KEAQPDLNEG
SLTTSRTTTA SSESQQTDDC QDSADKMNSG YRSLSTKLLI KTSSLSKEDE AKHRFRKLSL
INDADATDNS QDEVDFRKTS TSTESNGTDL DSDYKWKNRF EGVSQYKPSK TENYTFSDSL
TYSISDTSSS NYSSSSSALP DATAYTHFTD SFSSPSLPSL EEPITLDNRL SDRTEFYLSR
ESNPVKEPAD DWRRSLVEQE EPAAAAGEML GRREGESESV KSAWESRQLP VSGLSSTQQT
SYSNTVSLEE EDHDSSRFTG VFTATLVEAV SDPVATTPST PPASPDEDST SQFDMEILVD
TLKSMGPSLR QRNVSVRPAN PGLISALPPI VEDAPSPISS DVPDSVSSLK TKMEGTGAPA
ESLNGLYTLP ADLGLKRTSP RDSRSPLELI RQNQDGQPGT RGLNLPFRAS TLNSTVMRKS
SDSSSEDLKS PVLNGNEVPP SPTTSSRLDH SVLFGNYKPS STVQTQENGK AHRSVFRTSS
LPETSKDRLT AGLKELGDLS AGTESGGSRF ERLSFLMNSS SSLSGSLSGA EDLSTHMSRP
ASLAIGSPPA SNSPTRLLSP TGSIDFHRPF ANTDTPLSIF SQTSQMAMGV GSGTMGTPIL
QRSLSSDAAV GVHSSLFSNI NGKSQFQSQP TEPDRNLLSK YRAFPDAYLT KEKEHGKLNP
RPGKMYIFDR PDMCGQRIEV RGDVIDATPW KLQETISIRV VRGGWVLYEK PDFKGEKIAL
DEGDIEITYP FSPPEEQQEN RQDEGEEQNG DTNDEQKETK PARRFIIGSI RRAVRDYSVP
EIALFPEEEA EGKKVIFRDT SDDARIFGFP IKANSIIINA GLWLVFAQPF FQGVPRVLEV
GGYANPAAWG VEQPYVGSLH PLKIGEPKVE DLNDPKMVIY EKPYFTGKSR TITTNMRDFM
TRIERQQTTF MYSVGSLKVL GGIWVGYEKE GFRGHQYLLE EGEYHDWRVW GGCDAELRSV
RVIRADLTDP AMVMFEQLEE DQEGDQEEKT FEVTEAIPDV ELFGYKTSTR SIEVISGAWI
AYSHVDFSGH QYILEKGFYN NCADWGSQDT RICSVQPILL APNDSSRSRD EIKLYCEPDL
QGECYIFNRK QEEVPEKLVT KSCRVTGRSW VLYENEQFSG NLYVLSEGDY PNLTSMGCPP
DCSIRSFKIV PLLFSVPSIS LFGLECLEGR EITTDTEILS MVEEGFNNHI LSLRVNSGCW
VICEHSNYRG RQFLLEPIEI TNWPKFSSLH TIGSMYPVRQ KRHFFRIKNT ERGHFLSVQG
GVEELKSGRV VVTPEVEPLS DIWFYQDGLI KNKLSPTMSL QVVGNIEPAA KVVLWTETRQ
PVQTWTAQMK GLITSNTFPG MVLDVKGGKT YDKDHVVIMP ENDERPSQQW EIELL
//