GenomeNet

Database: UniProt
Entry: A0A151MR48_ALLMI
LinkDB: A0A151MR48_ALLMI
Original site: A0A151MR48_ALLMI 
ID   A0A151MR48_ALLMI        Unreviewed;      2572 AA.
AC   A0A151MR48;
DT   08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT   08-JUN-2016, sequence version 1.
DT   27-MAR-2024, entry version 26.
DE   RecName: Full=VWFA domain-containing protein {ECO:0000259|PROSITE:PS50234};
GN   ORFNames=Y1Q_0019407 {ECO:0000313|EMBL:KYO27005.1};
OS   Alligator mississippiensis (American alligator).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC   Alligator.
OX   NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO27005.1};
RN   [1] {ECO:0000313|EMBL:KYO27005.1, ECO:0000313|Proteomes:UP000050525}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO27005.1};
RX   PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA   St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA   Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA   Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA   Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA   Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA   McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA   Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA   Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA   Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA   Ray D.A.;
RT   "Sequencing three crocodilian genomes to illuminate the evolution of
RT   archosaurs and amniotes.";
RL   Genome Biol. 13:415-415(2012).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KYO27005.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AKHW03005461; KYO27005.1; -; Genomic_DNA.
DR   STRING; 8496.A0A151MR48; -.
DR   eggNOG; KOG3544; Eukaryota.
DR   Proteomes; UP000050525; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0004867; F:serine-type endopeptidase inhibitor activity; IEA:InterPro.
DR   CDD; cd01472; vWA_collagen; 5.
DR   CDD; cd01450; vWFA_subfamily_ECM; 3.
DR   Gene3D; 4.10.410.10; Pancreatic trypsin inhibitor Kunitz domain; 1.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 9.
DR   InterPro; IPR002223; Kunitz_BPTI.
DR   InterPro; IPR036880; Kunitz_BPTI_sf.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24020:SF70; PH DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF00014; Kunitz_BPTI; 1.
DR   Pfam; PF00092; VWA; 9.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00327; VWA; 9.
DR   SUPFAM; SSF57362; BPTI-like; 1.
DR   SUPFAM; SSF53300; vWA-like; 10.
DR   PROSITE; PS50234; VWFA; 9.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Reference proteome {ECO:0000313|Proteomes:UP000050525}.
FT   DOMAIN          58..232
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          253..425
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          453..628
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          661..828
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          844..1017
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          1031..1214
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          1228..1403
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          1986..2168
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          2192..2383
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   REGION          1636..1948
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2478..2504
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1718..1759
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2478..2494
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2572 AA;  284284 MW;  E4699FA1AE90DD95 CRC64;
     MEWADCWERK AKETKLKSFT REDLRMDNCK LLLILLFAAA FCFTDAQTTA CRKATSADIV
     FLVETSSRIG QENFQKMKDF LYTLVSSLDV GNDQVRVGLA QYSHEPYKVF LLNQYSLKSD
     ILEQIKNLPN RSGGTYTGTA LDFIRTEYFT RAAGSRAEEN VPQVVILVSV GESNDEVRTQ
     AKELKVRGIS LYVVGINDRD PTELKKISSR PFKKFLFRTD SFDGLQDLST SLLQTMCFAI
     ESQIQAFTKH YGDVVFLVDS SVHMGSSTFE QVKQFAYHVV EQLDVGIDKF RIGLAQYSTE
     SQGEFFLNTY ANKEDVLNHI QEYVAFMGGP LQTGSALKFL REAFFTEDAG SRFSEGTPQF
     AVVITSAKSE DEVLESALKL KEMGVKVISI GVQNSDRQEM EVIVTSPWVY QVDEGDSISQ
     LHQDIINILE PPVQQHHEIM KMPEVCATVS IADIVFLVDE SSSIGLRNFK LMRDFLFTII
     NVLHISPNNV RVGLVLYSDE PRLEFTLDTF ENKLEILHYL QKLPYRGGKT YTGAALDFLR
     KDVFTRKAGS RKKQGVQQIA VVITGGQSLD DFTKPATKLR RSGVDIYVVG TQNAFESSQL
     NKIASHPPRK HVANLESFLQ LSNIGRKIKK RICSEMMVQS FAIPVQTRMM KEGCVEIEEA
     DIYFLIDGSG SIMPSDFQDM KTFMNEMIDV FQVGADRVRF GVVQYESIPR TQFEIGQYNT
     MVQLKAAVRA IQQMGGGTKT GDALRYMKSL FAKASRTNVP QILIVITDGK SQDEVTRAAE
     ELRQEGIIIY AIGIKQAVQE ELKDIARSED RMFFVNDFDS LKHIKHDIVH DICSSKACEN
     VKADIIFLVD GSESIHPVDF QKMKDFMQLI VNRSDIGTDK VRIGLLQFSS QAKEEFQLNS
     YSTKPGLRRA ISEIRQLRSG TLTGKALAFA SSYFDKSKGG RPEIKQYLIM ITDGEAQDSV
     GEPAKMIRDK GITVYAIGVL QANETQLVEI SGTPGKVLFE DNFDSLIFLE KQILSEICKP
     EDLCKRTEVA DIIFVIHGSS SITDLQFKSV QQLMMALVND SVVGKNNVQF GAVVYSVNSK
     ERFSLNEYST KLYVREAIFN LRPLPEQGLQ IFTARALNFT RERFGVAYGG RASSHGVSQI
     LVLITDRPTA PSDSYNLAAV AKSLKLDGIN IFAVGVDRAS RTELEQIVGE RERVLFALSY
     SDLESLHGNL AHKICDKSRP VCENQAADVV FLIDGSESIS STNFSTMKSF MKEIVSRFHI
     AENKARVGVV QYSEDPQKEF YLNEFYLETA IKEQIDSIRQ FKSSTFAGKG LRFVKRLFEP
     AHGGRKNQNI PQTLIVITDG YSSDPVSEAA LALRNDGIYV FAVGIGILRA TELLQIAGNV
     QRVFLVENFA RLERIERTIV KEVCDSSDRP SQDCNIDVSV GIDVSGPVRS TPALHLKKQL
     QTDLPRFLQQ VESLTNVCCI SESQLNIRFK YQVFGPTGLP LFDSNFEKYN EEIIQKFLAA
     QITVDTYLNA RFLQLLWEDS FEVASANVKV LLVFTDGLDD PVEVLRIAAD SLLLKGLDAL
     LMVGLDNMPN LSDLQEIEFG RGLGNKHPLS IRFSDHPGLL QRELENVAER KCCHVACKCY
     GEIGFHGIYG NPGKKGIPGF RGSPGHPGEE GGIGERGPRG INGTHGDKGC PGVRGLKGAR
     GYRGSQGERG IDGIDGIDGE KGEQGSPGPS GEKGSTGRRG GKGPRGESGE RGEPGLRGDH
     GDSGTDNYIR GHKGEKGKPG QQGEPGTDGV QGGLGPKGSD GERGRRGAQG LQGIQGDLGE
     EGSPGISGPQ GPQGSRGPGG ISGLRGTQGI PGCRGNPGPP GESGSIGNPG PRGRKGEPGA
     PGEKGLLGPP GSRGLPGLDG NDGYGFQGEK GTKGVTGFPG LPGSQGEEGN PGSPGNKGSK
     GVRGRRGNAG IQGPMGNPGE RGPPGPMGTR GPLGIAAVAP CELVNFTREN CPCSSDKSKC
     PVYPTEVVFA FDMSEDVTQV AFAKMRHVVL SLLKKIRISE SNCPTGARVS IVSYNTNTQY
     LIRFSEFRNH KLLLEAVQSI PLERSSGRRN IGMAMRFVAR NVFKRIRQGV LTRRVAIFFA
     NGPSQDAASI NTAVLEFSAL DIVPVVIAFN EVPNVRHAFS NDDTGRFQLF VWENQQNEHL
     ERIEYCALCY DKCKPDINCE VPFPPPVMVN MDIAYIMDSS RNIASEEFET AKDFVSTMVD
     HFIIAPQPSL AGARVALVQH APRDFTPSSG RQPVNPEFDL VTYSSKNVMK KYIQESVHQL
     EGPSAIGHAL QWTVENIFFK APSPKQYRVI FTIVGSKTSA WDRQKLKKAA LEAKCQGFIM
     FTLALGNKVS DSELIDLSSS PTDQHLLQMG RVLNLEMAYA EKFTWAFLNL LKREMNSYPT
     PELQEECERL DQGDAQEQVA VFESIPFPQF DELDPGRSLE EREPTETTLL GEITETTQEL
     ENVREKEYDY EGGEYFTEGQ KEEGERKGYE GGQENNEENL EETVGTGTAL TNHDACVVAQ
     ETGECQDYVL KWYYDKEQKT CGQFWHITGN ARANITTHYL TSLLEICFKV KS
//
DBGET integrated database retrieval system