ID A0A151MR48_ALLMI Unreviewed; 2572 AA.
AC A0A151MR48;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE RecName: Full=VWFA domain-containing protein {ECO:0000259|PROSITE:PS50234};
GN ORFNames=Y1Q_0019407 {ECO:0000313|EMBL:KYO27005.1};
OS Alligator mississippiensis (American alligator).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC Alligator.
OX NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO27005.1};
RN [1] {ECO:0000313|EMBL:KYO27005.1, ECO:0000313|Proteomes:UP000050525}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO27005.1};
RX PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA Ray D.A.;
RT "Sequencing three crocodilian genomes to illuminate the evolution of
RT archosaurs and amniotes.";
RL Genome Biol. 13:415-415(2012).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KYO27005.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AKHW03005461; KYO27005.1; -; Genomic_DNA.
DR STRING; 8496.A0A151MR48; -.
DR eggNOG; KOG3544; Eukaryota.
DR Proteomes; UP000050525; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0004867; F:serine-type endopeptidase inhibitor activity; IEA:InterPro.
DR CDD; cd01472; vWA_collagen; 5.
DR CDD; cd01450; vWFA_subfamily_ECM; 3.
DR Gene3D; 4.10.410.10; Pancreatic trypsin inhibitor Kunitz domain; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 9.
DR InterPro; IPR002223; Kunitz_BPTI.
DR InterPro; IPR036880; Kunitz_BPTI_sf.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF70; PH DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF00014; Kunitz_BPTI; 1.
DR Pfam; PF00092; VWA; 9.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00327; VWA; 9.
DR SUPFAM; SSF57362; BPTI-like; 1.
DR SUPFAM; SSF53300; vWA-like; 10.
DR PROSITE; PS50234; VWFA; 9.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Reference proteome {ECO:0000313|Proteomes:UP000050525}.
FT DOMAIN 58..232
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 253..425
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 453..628
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 661..828
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 844..1017
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1031..1214
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1228..1403
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1986..2168
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 2192..2383
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 1636..1948
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2478..2504
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1718..1759
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2478..2494
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2572 AA; 284284 MW; E4699FA1AE90DD95 CRC64;
MEWADCWERK AKETKLKSFT REDLRMDNCK LLLILLFAAA FCFTDAQTTA CRKATSADIV
FLVETSSRIG QENFQKMKDF LYTLVSSLDV GNDQVRVGLA QYSHEPYKVF LLNQYSLKSD
ILEQIKNLPN RSGGTYTGTA LDFIRTEYFT RAAGSRAEEN VPQVVILVSV GESNDEVRTQ
AKELKVRGIS LYVVGINDRD PTELKKISSR PFKKFLFRTD SFDGLQDLST SLLQTMCFAI
ESQIQAFTKH YGDVVFLVDS SVHMGSSTFE QVKQFAYHVV EQLDVGIDKF RIGLAQYSTE
SQGEFFLNTY ANKEDVLNHI QEYVAFMGGP LQTGSALKFL REAFFTEDAG SRFSEGTPQF
AVVITSAKSE DEVLESALKL KEMGVKVISI GVQNSDRQEM EVIVTSPWVY QVDEGDSISQ
LHQDIINILE PPVQQHHEIM KMPEVCATVS IADIVFLVDE SSSIGLRNFK LMRDFLFTII
NVLHISPNNV RVGLVLYSDE PRLEFTLDTF ENKLEILHYL QKLPYRGGKT YTGAALDFLR
KDVFTRKAGS RKKQGVQQIA VVITGGQSLD DFTKPATKLR RSGVDIYVVG TQNAFESSQL
NKIASHPPRK HVANLESFLQ LSNIGRKIKK RICSEMMVQS FAIPVQTRMM KEGCVEIEEA
DIYFLIDGSG SIMPSDFQDM KTFMNEMIDV FQVGADRVRF GVVQYESIPR TQFEIGQYNT
MVQLKAAVRA IQQMGGGTKT GDALRYMKSL FAKASRTNVP QILIVITDGK SQDEVTRAAE
ELRQEGIIIY AIGIKQAVQE ELKDIARSED RMFFVNDFDS LKHIKHDIVH DICSSKACEN
VKADIIFLVD GSESIHPVDF QKMKDFMQLI VNRSDIGTDK VRIGLLQFSS QAKEEFQLNS
YSTKPGLRRA ISEIRQLRSG TLTGKALAFA SSYFDKSKGG RPEIKQYLIM ITDGEAQDSV
GEPAKMIRDK GITVYAIGVL QANETQLVEI SGTPGKVLFE DNFDSLIFLE KQILSEICKP
EDLCKRTEVA DIIFVIHGSS SITDLQFKSV QQLMMALVND SVVGKNNVQF GAVVYSVNSK
ERFSLNEYST KLYVREAIFN LRPLPEQGLQ IFTARALNFT RERFGVAYGG RASSHGVSQI
LVLITDRPTA PSDSYNLAAV AKSLKLDGIN IFAVGVDRAS RTELEQIVGE RERVLFALSY
SDLESLHGNL AHKICDKSRP VCENQAADVV FLIDGSESIS STNFSTMKSF MKEIVSRFHI
AENKARVGVV QYSEDPQKEF YLNEFYLETA IKEQIDSIRQ FKSSTFAGKG LRFVKRLFEP
AHGGRKNQNI PQTLIVITDG YSSDPVSEAA LALRNDGIYV FAVGIGILRA TELLQIAGNV
QRVFLVENFA RLERIERTIV KEVCDSSDRP SQDCNIDVSV GIDVSGPVRS TPALHLKKQL
QTDLPRFLQQ VESLTNVCCI SESQLNIRFK YQVFGPTGLP LFDSNFEKYN EEIIQKFLAA
QITVDTYLNA RFLQLLWEDS FEVASANVKV LLVFTDGLDD PVEVLRIAAD SLLLKGLDAL
LMVGLDNMPN LSDLQEIEFG RGLGNKHPLS IRFSDHPGLL QRELENVAER KCCHVACKCY
GEIGFHGIYG NPGKKGIPGF RGSPGHPGEE GGIGERGPRG INGTHGDKGC PGVRGLKGAR
GYRGSQGERG IDGIDGIDGE KGEQGSPGPS GEKGSTGRRG GKGPRGESGE RGEPGLRGDH
GDSGTDNYIR GHKGEKGKPG QQGEPGTDGV QGGLGPKGSD GERGRRGAQG LQGIQGDLGE
EGSPGISGPQ GPQGSRGPGG ISGLRGTQGI PGCRGNPGPP GESGSIGNPG PRGRKGEPGA
PGEKGLLGPP GSRGLPGLDG NDGYGFQGEK GTKGVTGFPG LPGSQGEEGN PGSPGNKGSK
GVRGRRGNAG IQGPMGNPGE RGPPGPMGTR GPLGIAAVAP CELVNFTREN CPCSSDKSKC
PVYPTEVVFA FDMSEDVTQV AFAKMRHVVL SLLKKIRISE SNCPTGARVS IVSYNTNTQY
LIRFSEFRNH KLLLEAVQSI PLERSSGRRN IGMAMRFVAR NVFKRIRQGV LTRRVAIFFA
NGPSQDAASI NTAVLEFSAL DIVPVVIAFN EVPNVRHAFS NDDTGRFQLF VWENQQNEHL
ERIEYCALCY DKCKPDINCE VPFPPPVMVN MDIAYIMDSS RNIASEEFET AKDFVSTMVD
HFIIAPQPSL AGARVALVQH APRDFTPSSG RQPVNPEFDL VTYSSKNVMK KYIQESVHQL
EGPSAIGHAL QWTVENIFFK APSPKQYRVI FTIVGSKTSA WDRQKLKKAA LEAKCQGFIM
FTLALGNKVS DSELIDLSSS PTDQHLLQMG RVLNLEMAYA EKFTWAFLNL LKREMNSYPT
PELQEECERL DQGDAQEQVA VFESIPFPQF DELDPGRSLE EREPTETTLL GEITETTQEL
ENVREKEYDY EGGEYFTEGQ KEEGERKGYE GGQENNEENL EETVGTGTAL TNHDACVVAQ
ETGECQDYVL KWYYDKEQKT CGQFWHITGN ARANITTHYL TSLLEICFKV KS
//