ID G1NKN8_MELGA Unreviewed; 1223 AA.
AC G1NKN8;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 29-SEP-2021, sequence version 3.
DT 27-MAR-2024, entry version 84.
DE RecName: Full=Attractin {ECO:0008006|Google:ProtNLM};
OS Meleagris gallopavo (Wild turkey).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; Phasianidae;
OC Meleagridinae; Meleagris.
OX NCBI_TaxID=9103 {ECO:0000313|Ensembl:ENSMGAP00000014009.3, ECO:0000313|Proteomes:UP000001645};
RN [1] {ECO:0000313|Ensembl:ENSMGAP00000014009.3, ECO:0000313|Proteomes:UP000001645}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=20838655; DOI=10.1371/journal.pbio.1000475;
RA Dalloul R.A., Long J.A., Zimin A.V., Aslam L., Beal K., Blomberg L.A.,
RA Bouffard P., Burt D.W., Crasta O., Crooijmans R.P., Cooper K.,
RA Coulombe R.A., De S., Delany M.E., Dodgson J.B., Dong J.J., Evans C.,
RA Frederickson K.M., Flicek P., Florea L., Folkerts O., Groenen M.A.,
RA Harkins T.T., Herrero J., Hoffmann S., Megens H.J., Jiang A., de Jong P.,
RA Kaiser P., Kim H., Kim K.W., Kim S., Langenberger D., Lee M.K., Lee T.,
RA Mane S., Marcais G., Marz M., McElroy A.P., Modise T., Nefedov M.,
RA Notredame C., Paton I.R., Payne W.S., Pertea G., Prickett D., Puiu D.,
RA Qioa D., Raineri E., Ruffier M., Salzberg S.L., Schatz M.C., Scheuring C.,
RA Schmidt C.J., Schroeder S., Searle S.M., Smith E.J., Smith J.,
RA Sonstegard T.S., Stadler P.F., Tafer H., Tu Z.J., Van Tassell C.P.,
RA Vilella A.J., Williams K.P., Yorke J.A., Zhang L., Zhang H.B., Zhang X.,
RA Zhang Y., Reed K.M.;
RT "Multi-platform next-generation sequencing of the domestic turkey
RT (Meleagris gallopavo): genome assembly and analysis.";
RL PLoS Biol. 8:E1000475-E1000475(2010).
RN [2] {ECO:0000313|Ensembl:ENSMGAP00000014009.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00460}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G1NKN8; -.
DR Ensembl; ENSMGAT00000014929.3; ENSMGAP00000014009.3; ENSMGAG00000013246.3.
DR GeneTree; ENSGT00940000157346; -.
DR HOGENOM; CLU_003930_0_0_1; -.
DR TreeFam; TF321873; -.
DR Proteomes; UP000001645; Chromosome 4.
DR Bgee; ENSMGAG00000013246; Expressed in gizzard and 17 other cell types or tissues.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0030246; F:carbohydrate binding; IEA:UniProtKB-KW.
DR CDD; cd00041; CUB; 1.
DR CDD; cd00055; EGF_Lam; 2.
DR Gene3D; 2.120.10.80; Kelch-type beta propeller; 2.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 1.
DR InterPro; IPR001304; C-type_lectin-like.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR000859; CUB_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR015915; Kelch-typ_b-propeller.
DR InterPro; IPR006652; Kelch_1.
DR InterPro; IPR002049; LE_dom.
DR InterPro; IPR002165; Plexin_repeat.
DR InterPro; IPR016201; PSI.
DR InterPro; IPR035914; Sperma_CUB_dom_sf.
DR PANTHER; PTHR46376:SF3; ATTRACTIN; 1.
DR PANTHER; PTHR46376; LEUCINE-ZIPPER-LIKE TRANSCRIPTIONAL REGULATOR 1; 1.
DR Pfam; PF00431; CUB; 1.
DR Pfam; PF01344; Kelch_1; 2.
DR Pfam; PF13964; Kelch_6; 1.
DR Pfam; PF01437; PSI; 2.
DR SMART; SM00034; CLECT; 1.
DR SMART; SM00042; CUB; 1.
DR SMART; SM00180; EGF_Lam; 2.
DR SMART; SM00423; PSI; 5.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF117281; Kelch motif; 1.
DR SUPFAM; SSF49854; Spermadhesin, CUB domain; 1.
DR PROSITE; PS50041; C_TYPE_LECTIN_2; 1.
DR PROSITE; PS01180; CUB; 1.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS50026; EGF_3; 1.
DR PROSITE; PS01248; EGF_LAM_1; 1.
DR PROSITE; PS50027; EGF_LAM_2; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Kelch repeat {ECO:0000256|ARBA:ARBA00022441};
KW Laminin EGF-like domain {ECO:0000256|ARBA:ARBA00023292,
KW ECO:0000256|PROSITE-ProRule:PRU00460};
KW Lectin {ECO:0000256|ARBA:ARBA00022734};
KW Membrane {ECO:0000256|ARBA:ARBA00023136};
KW Reference proteome {ECO:0000313|Proteomes:UP000001645};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989}.
FT DOMAIN 1..113
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 111..148
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 655..779
FT /note="C-type lectin"
FT /evidence="ECO:0000259|PROSITE:PS50041"
FT DOMAIN 923..968
FT /note="Laminin EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50027"
FT DISULFID 115..125
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 119..136
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 138..147
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 940..949
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 952..966
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
SQ SEQUENCE 1223 AA; 135889 MW; B663044B296F2388 CRC64;
AFKGSSEGYV TDGPGNYKYK TKCTWLIEGR PNTILRLRFN HFATECSWDH LYVYDGDSIY
APLLAAFSGL IVPEKDSNET VPEVVATSGY ALLHFFSDAA YNLTGFNITY NFNMCPNNCS
GRGECRLNNS SNALECECAK YWKGEACDIP YCTDDCGAPE RGFCNFNDTK ACVCSAGWQG
PGCSIPVPAN RSFWTREEYS LPKLPRASHK TIIHDNKMWI VGGYVFNHSD SQKVLAYDLI
SEEWLPLNNT VNSVEMRYGH SLALHKDDIY MYGGKIDATG NVSSQLWVFN IPRQSWTQAA
PKAKEQYAVV GHSAHIVTLE DESVVMLVIF GHCPLYGYIS NVQEYNLVTN TWSILQTSGA
LVQGGYGHSS VYDPHTRSIY IHGGYKAFSA NKYRLADDLY RYEVDSRMWT ILKDSRFFRY
LHTAVIMSGT MLVFGGNTHN DTSMSHGAKC FSSDFMAYDI ACNKWSVLPR PSLHHDVNRF
GHSAVLYNST MYVFGGFNSL LLSDILKYTP ERCEAFTNET SCTHAGPGVR CVWAPARPGC
VPWEMATVQQ QQKVLEDFVD NEKCDQITDC YSCTANTNNC QWCTDQCISM HNNCTEEQVP
ITAYENCPKD NPAYYCNKKT SCKSCAMDQN CQWEPRNQEC IALPENICGT NWHLVSNSCL
KITNAKENYD HAKLSCRSSG ASLASLTTQK KVEFVLKELQ KMQSSVSASL LSLTPWVGLR
KINVSYWCWE DMSPFTNTLL QWLPSEPSDA GFCGYLAEPS SQGLKAATCI NEVNGSVCER
AANHSAKQCR TPCALRTMCG ECTSGSSECM WCSNMKQCVD SNAYVASFPY GQCMEWYTMS
SCPPENCSGY CTCSHCLEQP GCGWCTDPSN TGKGKCIDGA YRGAVKIPTP SATGKQSLEP
VLNVSMCAGE HNYNWSFIQC PACQCNGHSK CINESICEKC ENLTTGKHCE TCISGYYGDP
TNGGTCQPCK CNGHASVCNT NTGKCFCTTK GIKGDECQLC EVENRYQGNP LKGTCYYTLL
IDYQFTFSLS QEDDRYYTAI NFVATPEEQN RDLDMFINAS KNFNLNITWA TSFAAGTQAG
EEIPVVSRTN IKEYKDSFSN EKFDFRNNPN ITFFVYVSNF TWPIKIQVGA LCLCGIISIY
FPFLLSNVAI QLCCVLSKLK QLSVHRHSTT AAANNNSGQP WAPACSGLIQ VSTQCAQSDR
GVPLKDRAIA GTGTEMTNKA LCV
//