ID G5B4U1_HETGA Unreviewed; 1303 AA.
AC G5B4U1;
DT 14-DEC-2011, integrated into UniProtKB/TrEMBL.
DT 14-DEC-2011, sequence version 1.
DT 27-MAR-2024, entry version 63.
DE SubName: Full=Attractin {ECO:0000313|EMBL:EHB04302.1};
DE Flags: Fragment;
GN ORFNames=GW7_03208 {ECO:0000313|EMBL:EHB04302.1};
OS Heterocephalus glaber (Naked mole rat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Hystricomorpha; Bathyergidae;
OC Heterocephalus.
OX NCBI_TaxID=10181 {ECO:0000313|EMBL:EHB04302.1, ECO:0000313|Proteomes:UP000006813};
RN [1] {ECO:0000313|EMBL:EHB04302.1, ECO:0000313|Proteomes:UP000006813}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21993625; DOI=10.1038/nature10533;
RA Kim E.B., Fang X., Fushan A.A., Huang Z., Lobanov A.V., Han L.,
RA Marino S.M., Sun X., Turanov A.A., Yang P., Yim S.H., Zhao X.,
RA Kasaikina M.V., Stoletzki N., Peng C., Polak P., Xiong Z., Kiezun A.,
RA Zhu Y., Chen Y., Kryukov G.V., Zhang Q., Peshkin L., Yang L., Bronson R.T.,
RA Buffenstein R., Wang B., Han C., Li Q., Chen L., Zhao W., Sunyaev S.R.,
RA Park T.J., Zhang G., Wang J., Gladyshev V.N.;
RT "Genome sequencing reveals insights into physiology and longevity of the
RT naked mole rat.";
RL Nature 479:223-227(2011).
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00460}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JH168482; EHB04302.1; -; Genomic_DNA.
DR STRING; 10181.G5B4U1; -.
DR eggNOG; KOG1388; Eukaryota.
DR InParanoid; G5B4U1; -.
DR Proteomes; UP000006813; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0030246; F:carbohydrate binding; IEA:UniProtKB-KW.
DR CDD; cd00041; CUB; 1.
DR CDD; cd00055; EGF_Lam; 2.
DR Gene3D; 2.120.10.80; Kelch-type beta propeller; 2.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 1.
DR InterPro; IPR001304; C-type_lectin-like.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR000859; CUB_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR015915; Kelch-typ_b-propeller.
DR InterPro; IPR006652; Kelch_1.
DR InterPro; IPR002049; LE_dom.
DR InterPro; IPR002165; Plexin_repeat.
DR InterPro; IPR016201; PSI.
DR InterPro; IPR035914; Sperma_CUB_dom_sf.
DR PANTHER; PTHR46376:SF3; ATTRACTIN; 1.
DR PANTHER; PTHR46376; LEUCINE-ZIPPER-LIKE TRANSCRIPTIONAL REGULATOR 1; 1.
DR Pfam; PF00431; CUB; 1.
DR Pfam; PF01344; Kelch_1; 2.
DR Pfam; PF13854; Kelch_5; 1.
DR Pfam; PF00059; Lectin_C; 1.
DR Pfam; PF01437; PSI; 2.
DR SMART; SM00034; CLECT; 1.
DR SMART; SM00042; CUB; 1.
DR SMART; SM00180; EGF_Lam; 2.
DR SMART; SM00423; PSI; 5.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF117281; Kelch motif; 1.
DR SUPFAM; SSF49854; Spermadhesin, CUB domain; 1.
DR PROSITE; PS50041; C_TYPE_LECTIN_2; 1.
DR PROSITE; PS01180; CUB; 1.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS50026; EGF_3; 1.
DR PROSITE; PS01248; EGF_LAM_1; 1.
DR PROSITE; PS50027; EGF_LAM_2; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Kelch repeat {ECO:0000256|ARBA:ARBA00022441};
KW Laminin EGF-like domain {ECO:0000256|ARBA:ARBA00023292,
KW ECO:0000256|PROSITE-ProRule:PRU00460};
KW Lectin {ECO:0000256|ARBA:ARBA00022734};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000006813};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius}.
FT TRANSMEM 1153..1177
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 10..126
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 124..161
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 673..797
FT /note="C-type lectin"
FT /evidence="ECO:0000259|PROSITE:PS50041"
FT DOMAIN 941..986
FT /note="Laminin EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50027"
FT DISULFID 128..138
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 132..149
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 151..160
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 958..967
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 970..984
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:EHB04302.1"
SQ SEQUENCE 1303 AA; 145781 MW; 9D85514B63BB398D CRC64;
GWVGEQCQHC GGRFRLTGSS GFVTDGPGNY KYKTKCTWLI EGQPNRIMRL RFNHFATECS
WDHLYVYDGD SIYAPLIAAF SGLIVPERDS NETVPEVVVT SGYALLHFFS DAAYNLTGFN
ITYNFDMCPN NCSARGVCKI SNSSNTVECE CYENWKGESC DIPYCTDNCG FPHRGICNSS
DVRGCSCFSE WQGPGCSVPV PANQSFWMRE ECSNLKLPRA SHKAVVIGNI MWIVGGYMFN
HSDYNMVLAY DLVSKEWLPL NRSVNNVVVR YGHSLALYKD KIYMYGGKID STGNVTNELR
VLHIHNESWA LLTPKAKEQY AVVGHSAHII TLKTGRVVML VIFGHCPLYG YISSVQEYDL
DKNTWNILQT QGALVQGGYG HSSVYDHKTK ALYIHGGYKA FSANKYRLTD DLYRYEVDTQ
MWTILKDSRF FRYLHTAVIV SGTMLVFGGN THNDTSMSHG AKCFSSDFMA YDIACDRWSV
LPRPDLHHDV NRFGHSAVLH NSTMYVFGGF NSLLLSDILV FTLEQCETHR SEAACIAAGP
GVRCVWDVET SQCVSWELAT EEQAKKLKSE CFSKRTLDHD KCDQNTDCYS CTANTNDCHW
CSDHCAPRNH SCTEGQISIF KYENCPKDNP MYYCNKKTSC RSCALDQNCQ WEPRNQECIA
LPENICGTGW HLVGNSCLKI TTAKENYDNA KLSCRNHNAF LASLTTQKKV EFVLKQLRMM
QSSQTMSKLT LTPWVGLRKI NVSYWCWEDM SPFTNSLLQW MPSEPSDAGF CGILSEPSTR
GLKAATCINP LNGSICERPA NHSAKQCRTP CALRTACGEC TSSSSECMWC SNMKQCVDSN
AYVASFPFGQ CMEWYTMSSC PPENCSGYCT CSHCLEQPGC GWCTDPSNTG KGKCIEGSYK
GPVKMPSQAS TGITYPQPLL NSSMCLEDGR YNWSFIHCPA CQCNGHSKCI NQSICEKCEN
LTTGKHCETC ISGYYGDPTN GGKCQPCRCN GHASLCNTNT GKCFCTTKGV KGDECQLCEV
ENRYQGNPLK GTCYYTLLID YQFTFSLSQE DDRYYTAINF VATPDEQNRD LDMFINASKN
FNLNITWAAS FSAGTQAGEE MPVVSKTNIK EYKDSFSNEK FDFRNHPNIT FFVYVSNFTW
PIAFSQHSNF MDLVQFFVTF FSCFLSLLLV AAVVWKIKQS CWASRRREQL LREMQQMASR
PFASVNVALE TDEEPPDLIG GSIKTVPKPI ALEPCFGNKA AVLSVFVRLP RGLGGIPPPG
QSGLAVASAL VDISQQMPIV YKEKSGAVRN RKQQPPAQPG TCI
//