ID A0A1A6GU61_NEOLE Unreviewed; 1080 AA.
AC A0A1A6GU61;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 13-SEP-2023, entry version 17.
DE RecName: Full=VWFA domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=A6R68_02587 {ECO:0000313|EMBL:OBS68872.1};
OS Neotoma lepida (Desert woodrat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC Cricetidae; Neotominae; Neotoma.
OX NCBI_TaxID=56216 {ECO:0000313|EMBL:OBS68872.1, ECO:0000313|Proteomes:UP000092124};
RN [1] {ECO:0000313|EMBL:OBS68872.1, ECO:0000313|Proteomes:UP000092124}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=417 {ECO:0000313|EMBL:OBS68872.1};
RC TISSUE=Liver {ECO:0000313|EMBL:OBS68872.1};
RA Campbell M., Oakeson K.F., Yandell M., Halpert J.R., Dearing D.;
RT "The Draft Genome Sequence and Annotation of the Desert Woodrat Neotoma
RT lepida.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OBS68872.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LZPO01075859; OBS68872.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1A6GU61; -.
DR STRING; 56216.A0A1A6GU61; -.
DR Proteomes; UP000092124; Unassembled WGS sequence.
DR CDD; cd01461; vWA_interalpha_trypsin_inhibitor; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR013694; VIT.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR46299:SF1; VON WILLEBRAND FACTOR A DOMAIN-CONTAINING PROTEIN 5B1; 1.
DR PANTHER; PTHR46299; VON WILLEBRAND FACTOR A DOMAIN-CONTAINING PROTEIN 5B2-RELATED; 1.
DR Pfam; PF13757; VIT_2; 1.
DR Pfam; PF13768; VWA_3; 1.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS51468; VIT; 1.
DR PROSITE; PS50234; VWFA; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000092124}.
FT DOMAIN 1..96
FT /note="VIT"
FT /evidence="ECO:0000259|PROSITE:PS51468"
FT DOMAIN 281..449
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 511..536
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 564..588
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 605..633
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 831..859
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 960..988
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 511..530
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 605..621
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1080 AA; 119242 MW; E85D966ECE8335BC CRC64;
MKYQQLKRRG LFVYPMDEYT TVVGFEAVIA DRVVTIQIKD KAKLDKSHLD IQSATVTGLM
TAPSGVRVDL ERILFVVNLG TIAPMENVTV FISTSSELPT LPSGAVRVVL PAICAPTVPQ
FCSHRYGSSS QQAQGKDPHC FGIQTMDSWN GLCLATLLDT EVTNPMEYEF NFQLEIRGPY
LLAGVESPTH EVRADAAPSA HSAKSIIITL ANKHTFDRPV EILLHPSEPH MPHVLVEKGD
MTLGEFDQHL KGKADFIRGT KKDGSAERKS VQPNLRKTHG EFIFLVDRSS SVSKTSVQCV
KEAMLVALKS LRLDCLFNIV GFGSTFKTVF ASSQIYNEEN LAIACNCIQR IRADMGSTNI
LSPLKWIFRQ PVHQGHPRLL FLITDGAVSN TGKVLELVRN HASSTRCYSF GIGPSVCYRL
VKGLASVSKG SAEFLTEGER LQPKMIKSLK KAMAPMLSDV TVEWVFPETT EVLISPVSTS
SLFPGDRLMG YGIVIKGESL DVFRRRRAYS TNQISSQKGP RATTASDPTG TARRYPLRKA
KVQDLANQSS SGSQRLQIDL QPLLNSGHDL SQGPRLHGPG ARRPSLLPQG CQSLLFSGQE
PQAWSPMQEL DRGSSPKSAP DSRSSEDLEL PLHPSTFETE ISSDWEPMAE SEEQVSPCRP
ATPGPVLGKA LVKGLCDNQC LQWEVSFELE PPALQRGNAQ NADMWSETFH HLAARAIIRD
FEQLAEREDE IELGSNRRYQ VNAVHTSKAC NVISKYTAFV PVDINKRRYL PTVVKYPNSG
AMQQMVSFRN LTRQWKRASA NVGRSQTMLR EQASAAGDNK FQTLALEESP TSTISKLLSP
SREKHTGAEG PPHNLSTSAP SSIKASEALF VTKLNLHKPR LLTQAAKGFL SKSLAKAPEP
IPSIQSFDYI PLVSLQLASG AFLLDQAFCE AIQIPMEKLK WTSPFMCLRM SLVTHRHESK
SQSPQHGIGL SSPCPPCEDT SLESQAESSS GLDPSVLLEC TGKLWATAVA LAWLEHSSAS
YFIEWELVAA KASLWLEQQK VPEGRTLSTL KATARQLFVL LRHWDENLEF NMLCYNPNYV
//