Database: UniProt
Entry: A0A151NXR4_ALLMI
ID   A0A151NXR4_ALLMI        Unreviewed;       724 AA.
AC   A0A151NXR4;
DT   08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT   08-JUN-2016, sequence version 1.
DT   24-JAN-2024, entry version 20.
DE   SubName: Full=von Willebrand factor C and EGF domain-containing protein {ECO:0000313|EMBL:KYO41666.1};
GN   Name=VWCE {ECO:0000313|EMBL:KYO41666.1};
GN   ORFNames=Y1Q_0006411 {ECO:0000313|EMBL:KYO41666.1};
OS   Alligator mississippiensis (American alligator).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC   Alligator.
OX   NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO41666.1};
RN   [1] {ECO:0000313|EMBL:KYO41666.1, ECO:0000313|Proteomes:UP000050525}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO41666.1};
RX   PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA   St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA   Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA   Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA   Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA   Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA   McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA   Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA   Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA   Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA   Ray D.A.;
RT   "Sequencing three crocodilian genomes to illuminate the evolution of
RT   archosaurs and amniotes.";
RL   Genome Biol. 13:415-415(2012).
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KYO41666.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AKHW03001628; KYO41666.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A151NXR4; -.
DR   STRING; 8496.A0A151NXR4; -.
DR   eggNOG; KOG1216; Eukaryota.
DR   eggNOG; KOG1217; Eukaryota.
DR   Proteomes; UP000050525; Unassembled WGS sequence.
DR   GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR   CDD; cd00054; EGF_CA; 1.
DR   Gene3D; 6.20.200.20; -; 5.
DR   Gene3D; 2.10.70.10; Complement Module, domain 1; 1.
DR   Gene3D; 2.10.25.10; Laminin; 4.
DR   InterPro; IPR026823; cEGF.
DR   InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR   InterPro; IPR000742; EGF-like_dom.
DR   InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR   InterPro; IPR018097; EGF_Ca-bd_CS.
DR   InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR   InterPro; IPR001007; VWF_dom.
DR   PANTHER; PTHR47333; VON WILLEBRAND FACTOR C AND EGF DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR47333:SF1; VON WILLEBRAND FACTOR C AND EGF DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF12662; cEGF; 2.
DR   Pfam; PF00093; VWC; 2.
DR   SMART; SM00181; EGF; 4.
DR   SMART; SM00179; EGF_CA; 3.
DR   SMART; SM00214; VWC; 6.
DR   SMART; SM00215; VWC_out; 3.
DR   SUPFAM; SSF57603; FnI-like domain; 6.
DR   SUPFAM; SSF57184; Growth factor receptor domain; 1.
DR   PROSITE; PS00010; ASX_HYDROXYL; 3.
DR   PROSITE; PS01186; EGF_2; 2.
DR   PROSITE; PS50026; EGF_3; 2.
DR   PROSITE; PS01187; EGF_CA; 2.
DR   PROSITE; PS01208; VWFC_1; 4.
DR   PROSITE; PS50184; VWFC_2; 5.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW   ProRule:PRU00076}; Reference proteome {ECO:0000313|Proteomes:UP000050525};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..21
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           22..724
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5007586463"
FT   DOMAIN          181..219
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          220..262
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          313..365
FT                   /note="VWFC"
FT                   /evidence="ECO:0000259|PROSITE:PS50184"
FT   DOMAIN          423..484
FT                   /note="VWFC"
FT                   /evidence="ECO:0000259|PROSITE:PS50184"
FT   DOMAIN          490..550
FT                   /note="VWFC"
FT                   /evidence="ECO:0000259|PROSITE:PS50184"
FT   DOMAIN          551..609
FT                   /note="VWFC"
FT                   /evidence="ECO:0000259|PROSITE:PS50184"
FT   DOMAIN          609..667
FT                   /note="VWFC"
FT                   /evidence="ECO:0000259|PROSITE:PS50184"
SQ   SEQUENCE   724 AA;  77356 MW;  996CDA7546E74322 CRC64;
     MLAGLLLRAA CVLAALPAAQ ARLYPGRKKP GSFAVERRRM GPHVCFSGFG SGCCPGWMPS
     PGSGQCTLPL CSFGCGNGLC IAPNVCSCPN GKQGITCLDP PSACGEYGCD LTCNHGGCQE
     VARVCPLGFS MVETANGVRC TDINECLSAS CEGLCVNTEG GFVCECGPGM QLSSDRHSCQ
     DTDECLATPC QHWCKNSIGS YRCSCRPGYH LHGNRHSCVD VNECRWPSEK RACQHSCHNT
     PGSFLCTCRP GYRLSGDRVS CEGLPKTILA PSPILQSLQH PPTLLLLPPD SGRHLLAPKG
     SLPSRTPALA PAPGALWEHD SRWMEPSCLS CTCKGGHVLC EVVTCRISCS HPVPPKNGEC
     CPSCTGCLYN GVTRAEGDVF SLSSGNCTVC VCLAGNVSCI SPECPPGSCH SPAQSDCCSC
     QTAKCKFQGR TYAHGEEFSL DGDDCTTCVC RSGEVECSFT PCPVLECPRQ DWLLVPGQCC
     FSCQEPVPVS GCFVDDNGVE FPIGQIWSPG DPCELCICQA DGLVSCKRTE CLETCPHPIQ
     IPGQCCPDCS AGCTYMGRVF YNNETFPSVL DPCLSCICLL GSVACSPVDC TVFCTYPFHP
     EGECCPVCND CNYEGRKVVN GQTFVPEAEP CIHCTCQFGE VSCEKRPCPI SCTESSTTPT
     DCCLGCQVSQ VLLQSSRGPG QLDDVKAASK MMNRTCLNRQ DLQEDPSSSY RILTIHLQHH
     HHCL
//
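
The record above follows the UniProtKB flat-file layout: a two-letter line code, the line content starting at column 6, a feature table (FT) with `start..end` coordinates, and a sequence block that runs from the SQ line to the `//` terminator. A minimal Python sketch of reading that layout follows. The names `parse_flatfile`, `features`, and `SAMPLE` are hypothetical helpers introduced here for illustration, not part of any UniProt or GenomeNet tool, and `SAMPLE` is a shortened excerpt of the entry with the sequence truncated to its first 30 residues. The sketch assumes single-line qualifiers and simple `start..end` locations; it does not handle wrapped qualifiers or fuzzy positions.

```python
import re

# Shortened excerpt of the entry above; the sequence is truncated for illustration.
SAMPLE = """\
ID   A0A151NXR4_ALLMI        Unreviewed;       724 AA.
FT   SIGNAL          1..21
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   DOMAIN          181..219
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
SQ   SEQUENCE   724 AA;  77356 MW;  996CDA7546E74322 CRC64;
     MLAGLLLRAA CVLAALPAAQ ARLYPGRKKP
//
"""


def parse_flatfile(text):
    """Group the lines of one record by line code and join the sequence block."""
    fields = {}
    seq_parts = []
    started = False   # becomes True at the ID line, so page headers are skipped
    in_seq = False    # becomes True after the SQ line
    for line in text.splitlines():
        if not started:
            if not line.startswith("ID   "):
                continue
            started = True
        if line.startswith("//"):
            break  # end of record
        if in_seq:
            seq_parts.append(line.replace(" ", ""))
            continue
        code, body = line[:2], line[5:]
        if code == "SQ":
            in_seq = True
        fields.setdefault(code, []).append(body)
    fields["sequence"] = "".join(seq_parts)
    return fields


def features(fields):
    """Turn FT lines into dicts with key, start, end, and single-line qualifiers."""
    feats = []
    for body in fields.get("FT", []):
        key_line = re.match(r"(\S+)\s+(\d+)\.\.(\d+)\s*$", body)
        if key_line:
            feats.append({
                "key": key_line.group(1),
                "start": int(key_line.group(2)),
                "end": int(key_line.group(3)),
            })
        elif feats:
            qualifier = re.match(r'\s+/(\w+)="(.*)"\s*$', body)
            if qualifier:
                feats[-1][qualifier.group(1)] = qualifier.group(2)
    return feats


record = parse_flatfile(SAMPLE)
for feat in features(record):
    print(feat["key"], feat["start"], feat["end"], feat.get("note", ""))
```

Run against the full record rather than the excerpt, the length of `record["sequence"]` can be compared with the residue count stated on the ID and SQ lines as a basic integrity check.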