ID A0A151NXR4_ALLMI Unreviewed; 724 AA.
AC A0A151NXR4;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 24-JAN-2024, entry version 20.
DE SubName: Full=von Willebrand factor C and EGF domain-containing protein {ECO:0000313|EMBL:KYO41666.1};
GN Name=VWCE {ECO:0000313|EMBL:KYO41666.1};
GN ORFNames=Y1Q_0006411 {ECO:0000313|EMBL:KYO41666.1};
OS Alligator mississippiensis (American alligator).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC Alligator.
OX NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO41666.1};
RN [1] {ECO:0000313|EMBL:KYO41666.1, ECO:0000313|Proteomes:UP000050525}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO41666.1};
RX PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA Ray D.A.;
RT "Sequencing three crocodilian genomes to illuminate the evolution of
RT archosaurs and amniotes.";
RL Genome Biol. 13:415-415(2012).
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KYO41666.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AKHW03001628; KYO41666.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A151NXR4; -.
DR STRING; 8496.A0A151NXR4; -.
DR eggNOG; KOG1216; Eukaryota.
DR eggNOG; KOG1217; Eukaryota.
DR Proteomes; UP000050525; Unassembled WGS sequence.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR CDD; cd00054; EGF_CA; 1.
DR Gene3D; 6.20.200.20; -; 5.
DR Gene3D; 2.10.70.10; Complement Module, domain 1; 1.
DR Gene3D; 2.10.25.10; Laminin; 4.
DR InterPro; IPR026823; cEGF.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR001007; VWF_dom.
DR PANTHER; PTHR47333; VON WILLEBRAND FACTOR C AND EGF DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR47333:SF1; VON WILLEBRAND FACTOR C AND EGF DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF12662; cEGF; 2.
DR Pfam; PF00093; VWC; 2.
DR SMART; SM00181; EGF; 4.
DR SMART; SM00179; EGF_CA; 3.
DR SMART; SM00214; VWC; 6.
DR SMART; SM00215; VWC_out; 3.
DR SUPFAM; SSF57603; FnI-like domain; 6.
DR SUPFAM; SSF57184; Growth factor receptor domain; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 3.
DR PROSITE; PS01186; EGF_2; 2.
DR PROSITE; PS50026; EGF_3; 2.
DR PROSITE; PS01187; EGF_CA; 2.
DR PROSITE; PS01208; VWFC_1; 4.
DR PROSITE; PS50184; VWFC_2; 5.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Reference proteome {ECO:0000313|Proteomes:UP000050525};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..724
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5007586463"
FT DOMAIN 181..219
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 220..262
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 313..365
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 423..484
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 490..550
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 551..609
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 609..667
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
SQ SEQUENCE 724 AA; 77356 MW; 996CDA7546E74322 CRC64;
MLAGLLLRAA CVLAALPAAQ ARLYPGRKKP GSFAVERRRM GPHVCFSGFG SGCCPGWMPS
PGSGQCTLPL CSFGCGNGLC IAPNVCSCPN GKQGITCLDP PSACGEYGCD LTCNHGGCQE
VARVCPLGFS MVETANGVRC TDINECLSAS CEGLCVNTEG GFVCECGPGM QLSSDRHSCQ
DTDECLATPC QHWCKNSIGS YRCSCRPGYH LHGNRHSCVD VNECRWPSEK RACQHSCHNT
PGSFLCTCRP GYRLSGDRVS CEGLPKTILA PSPILQSLQH PPTLLLLPPD SGRHLLAPKG
SLPSRTPALA PAPGALWEHD SRWMEPSCLS CTCKGGHVLC EVVTCRISCS HPVPPKNGEC
CPSCTGCLYN GVTRAEGDVF SLSSGNCTVC VCLAGNVSCI SPECPPGSCH SPAQSDCCSC
QTAKCKFQGR TYAHGEEFSL DGDDCTTCVC RSGEVECSFT PCPVLECPRQ DWLLVPGQCC
FSCQEPVPVS GCFVDDNGVE FPIGQIWSPG DPCELCICQA DGLVSCKRTE CLETCPHPIQ
IPGQCCPDCS AGCTYMGRVF YNNETFPSVL DPCLSCICLL GSVACSPVDC TVFCTYPFHP
EGECCPVCND CNYEGRKVVN GQTFVPEAEP CIHCTCQFGE VSCEKRPCPI SCTESSTTPT
DCCLGCQVSQ VLLQSSRGPG QLDDVKAASK MMNRTCLNRQ DLQEDPSSSY RILTIHLQHH
HHCL
//