GenomeNet

Database: UniProt
Entry: G1NIV3_MELGA
LinkDB: G1NIV3_MELGA
Original site: G1NIV3_MELGA 
ID   G1NIV3_MELGA            Unreviewed;       931 AA.
AC   G1NIV3;
DT   19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT   29-SEP-2021, sequence version 3.
DT   27-MAR-2024, entry version 71.
DE   RecName: Full=von Willebrand factor D and EGF domains {ECO:0008006|Google:ProtNLM};
OS   Meleagris gallopavo (Wild turkey).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; Phasianidae;
OC   Meleagridinae; Meleagris.
OX   NCBI_TaxID=9103 {ECO:0000313|Ensembl:ENSMGAP00000013221.3, ECO:0000313|Proteomes:UP000001645};
RN   [1] {ECO:0000313|Ensembl:ENSMGAP00000013221.3, ECO:0000313|Proteomes:UP000001645}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=20838655; DOI=10.1371/journal.pbio.1000475;
RA   Dalloul R.A., Long J.A., Zimin A.V., Aslam L., Beal K., Blomberg L.A.,
RA   Bouffard P., Burt D.W., Crasta O., Crooijmans R.P., Cooper K.,
RA   Coulombe R.A., De S., Delany M.E., Dodgson J.B., Dong J.J., Evans C.,
RA   Frederickson K.M., Flicek P., Florea L., Folkerts O., Groenen M.A.,
RA   Harkins T.T., Herrero J., Hoffmann S., Megens H.J., Jiang A., de Jong P.,
RA   Kaiser P., Kim H., Kim K.W., Kim S., Langenberger D., Lee M.K., Lee T.,
RA   Mane S., Marcais G., Marz M., McElroy A.P., Modise T., Nefedov M.,
RA   Notredame C., Paton I.R., Payne W.S., Pertea G., Prickett D., Puiu D.,
RA   Qioa D., Raineri E., Ruffier M., Salzberg S.L., Schatz M.C., Scheuring C.,
RA   Schmidt C.J., Schroeder S., Searle S.M., Smith E.J., Smith J.,
RA   Sonstegard T.S., Stadler P.F., Tafer H., Tu Z.J., Van Tassell C.P.,
RA   Vilella A.J., Williams K.P., Yorke J.A., Zhang L., Zhang H.B., Zhang X.,
RA   Zhang Y., Reed K.M.;
RT   "Multi-platform next-generation sequencing of the domestic turkey
RT   (Meleagris gallopavo): genome assembly and analysis.";
RL   PLoS Biol. 8:E1000475-E1000475(2010).
RN   [2] {ECO:0000313|Ensembl:ENSMGAP00000013221.3}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; G1NIV3; -.
DR   Ensembl; ENSMGAT00000014123.3; ENSMGAP00000013221.3; ENSMGAG00000012558.3.
DR   GeneTree; ENSGT00940000160835; -.
DR   HOGENOM; CLU_002130_0_0_1; -.
DR   InParanoid; G1NIV3; -.
DR   TreeFam; TF351702; -.
DR   Proteomes; UP000001645; Chromosome 7.
DR   Bgee; ENSMGAG00000012558; Expressed in breast and 2 other cell types or tissues.
DR   GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR   CDD; cd00054; EGF_CA; 2.
DR   Gene3D; 2.10.25.10; Laminin; 3.
DR   InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR   InterPro; IPR000742; EGF-like_dom.
DR   InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR   InterPro; IPR018097; EGF_Ca-bd_CS.
DR   InterPro; IPR001846; VWF_type-D.
DR   PANTHER; PTHR14949; EGF-LIKE-DOMAIN, MULTIPLE 7, 8; 1.
DR   PANTHER; PTHR14949:SF52; VON WILLEBRAND FACTOR D AND EGF DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF00008; EGF; 2.
DR   Pfam; PF00094; VWD; 1.
DR   SMART; SM00181; EGF; 3.
DR   SMART; SM00179; EGF_CA; 3.
DR   SMART; SM00216; VWD; 1.
DR   SUPFAM; SSF57196; EGF/Laminin; 2.
DR   PROSITE; PS00010; ASX_HYDROXYL; 1.
DR   PROSITE; PS00022; EGF_1; 2.
DR   PROSITE; PS01186; EGF_2; 1.
DR   PROSITE; PS50026; EGF_3; 3.
DR   PROSITE; PS01187; EGF_CA; 2.
DR   PROSITE; PS51233; VWFD; 1.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00076};
KW   EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW   ProRule:PRU00076}; Reference proteome {ECO:0000313|Proteomes:UP000001645}.
FT   DOMAIN          86..259
FT                   /note="VWFD"
FT                   /evidence="ECO:0000259|PROSITE:PS51233"
FT   DOMAIN          793..832
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          834..869
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          871..910
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   REGION          329..384
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        329..366
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        367..382
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   DISULFID        822..831
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        838..848
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        859..868
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ   SEQUENCE   931 AA;  102868 MW;  65F25B1809852502 CRC64;
     MFTLDSQLLG PPNIVLSACQ VDLVPAPCSQ GSCAAAVVTV TAVTDFAQDG NRISHIRAEP
     VGHRDVLWRA HASKDVKVTV QDLPTGNCYS FTDPHIITFD GWRYDNYKIG TFLLCQSTSR
     AFEVHVRQWD CGGHHSATAC NCGVAAWEGS DVVRLDACNG HFQDSRPQLN IQSTEASPQV
     KILTSYGGRK ITILFPSGAF IRADVSEWGM GLTVRTPSSD FNSTRGLCGL FDGISHNDLN
     NVPEEDFIEE WRIPPGKSLF DKTPASSEEK QRKNYCRCQK ESTKSMPLVK MLNAFQMQSS
     GCHYDNVDYT FAIPYLDITS EFVTHSGKEF ASRDDGERSP KSFDQRSLPK SVKKRGSHEE
     RLKPFSQHAS MKNNSSLNFT KPAEELQRPK RQEDYFEYSA PHPLHSPSQT DTESFAYFFP
     EDYFEGIRLK LPLGWPTPNG LTSAKAQEIC HGILANSTIG LVCKALLGKL IDEAINICML
     DLQLKDDVAW VRALIALLEN ECERRVLRNR GEVFHVGSQS TATQEEILTI LRCPAFCNGN
     GQCTDLGCQC FEDHSSYDCS TAKKQALEIT SLENKGLCDI RTSDCSRVRV FGVGFKDSPH
     LRCEVTRLIH LNGEWISREQ EITQADFLSS KAVDCQIPLL NITETEAVHF VAGDEPFARW
     QVKVGISSYD NTCNIDGLCY GEGESSPASP CLLCEPDISK FTWSINENNL PPVFQAPSSQ
     LLTFIGENFV YQLTAVDPEG SAVLFILEAG PQDARLSPAG LLIWKVDSEE MQTFEFTVSD
     ECNALSRYTV EVRVKPCSCL NGGTCVTNIK FPPGLGEYLC LCPNGFDGGL CQEDINECKS
     NPCKSGTCVD GVDSYACQCP PGLGGLTCQE DKNECEEGLC FPGVSCMNTF GSYVCGICPS
     GMEGNGKTCK CKSMICYNRR IYFLISKIHV C
//
DBGET integrated database retrieval system