ID A0A452GFK6_9SAUR Unreviewed; 2734 AA.
AC A0A452GFK6;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSGAGP00000000348.1};
OS Gopherus agassizii (Agassiz's desert tortoise).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Testudinata; Testudines; Cryptodira; Durocryptodira;
OC Testudinoidea; Testudinidae; Gopherus.
OX NCBI_TaxID=38772 {ECO:0000313|Ensembl:ENSGAGP00000000348.1, ECO:0000313|Proteomes:UP000291020};
RN [1] {ECO:0000313|Proteomes:UP000291020}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=28562605;
RA Tollis M., DeNardo D.F., Cornelius J.A., Dolby G.A., Edwards T.,
RA Henen B.T., Karl A.E., Murphy R.W., Kusumi K.;
RT "The Agassiz's desert tortoise genome provides a resource for the
RT conservation of a threatened species.";
RL PLoS ONE 12:e0177708-e0177708(2017).
RN [2] {ECO:0000313|Ensembl:ENSGAGP00000000348.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 38772.ENSGAGP00000000348; -.
DR Ensembl; ENSGAGT00000000392.1; ENSGAGP00000000348.1; ENSGAGG00000000288.1.
DR Proteomes; UP000291020; Unassembled WGS sequence.
DR GO; GO:0007596; P:blood coagulation; IEA:UniProtKB-KW.
DR CDD; cd19941; TIL; 4.
DR CDD; cd01450; vWFA_subfamily_ECM; 3.
DR Gene3D; 2.10.25.10; Laminin; 4.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 3.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR002919; TIL_dom.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR001007; VWF_dom.
DR InterPro; IPR001846; VWF_type-D.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF361; VON WILLEBRAND FACTOR; 1.
DR Pfam; PF08742; C8; 4.
DR Pfam; PF01826; TIL; 3.
DR Pfam; PF00092; VWA; 3.
DR Pfam; PF00094; VWD; 4.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00832; C8; 4.
DR SMART; SM00327; VWA; 3.
DR SMART; SM00214; VWC; 5.
DR SMART; SM00215; VWC_out; 2.
DR SMART; SM00216; VWD; 4.
DR SUPFAM; SSF57603; FnI-like domain; 2.
DR SUPFAM; SSF57567; Serine protease inhibitors; 5.
DR SUPFAM; SSF53300; vWA-like; 3.
DR PROSITE; PS50234; VWFA; 3.
DR PROSITE; PS01208; VWFC_1; 3.
DR PROSITE; PS50184; VWFC_2; 3.
DR PROSITE; PS51233; VWFD; 4.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Reference proteome {ECO:0000313|Proteomes:UP000291020};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..20
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 21..2734
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5019367448"
FT DOMAIN 28..196
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 381..555
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 860..1027
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 1270..1446
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1496..1663
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1684..1865
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1942..2118
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 2251..2324
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 2425..2491
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 2577..2642
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
SQ SEQUENCE 2734 AA; 305104 MW; 3BB8FBE931ED02E9 CRC64;
MICFTVLLMQ LALSIVTTDG QTEQPSLFRC SLFGDDHIKT FDESFYDFAG DCSYLLAGDC
HKRSFTLIGD YQNGRRNGFS VYLGEYFDIH LALDGTVTQG DKRVSLPFAS HGIFVETEAG
YYKLSSDEHG FVVKIDIDVN IQILFTDKHY NKTCGLCGNF NYFAEDDFRT QEGTLVESSY
DFANSWALHS EDKRCKRVRA PSNTCNISSE FANKDIMERC QLLRTSSVFS KCHHRVDPEL
FIGLCEEDLC TCAQDMHCHC PTFLEYARSC AQQGVIVDGW PNDSSCKPRC PVGMEYKECV
SPCVKTCQSL NINEVCQGQC ADGCSCPVGK LLDGDSCVDS SECSCIHSGM RYPPGSSISQ
DCNSCICRRG IWICSNEECP GECSVTGQSH FKSFDNKHFT FSGICQYLFA KDCKENSFSV
IIETVQCADD LDAVCTRSAS VRIQDMENSI IKMKHGGVIS LNGQDVQIPL TQGALRIQRI
MMSSVRLTYG EDLQIDWDGR GKLLVKLSPM YTDRICGLCG NFNGNEGDDF LTPSGLVEAL
VEDFGNTWKL NGDCQDLLKQ DNDPCSLNPR LAKYAEDSCS VLMSSIFVPC HHEVSPSPYL
KNCRYDVCSC SDGKDCLCTA VSTYATACAR KGVLIKWREP DFCSMTCPEG QIYQQCGTPC
NQTCRSLSYP DVGCNEFCME GCYCPHGQYI DEHGDCVPKS QCSCYYDGEI FQPDDVFSDH
YTMCYCENGF MHCTTNRLPG AFLPSVFFDH QPSARIKRSL SCKAPMDKFV CPPNNPRAEG
IECTKTCQNY DLECASHSCI SGCLCPKGMV RHENKCLVPE RCPCFHNGRE YAKGETVNKD
CNTCVCRGRK WDCTDNVCDA TCSVIGKAHY LTFDGLKYMF PGDCQYVLVQ DYCEDESGTF
RVLISNDGCG LTGEKCTKRL TILFDNGEIE LFNGNVNIKQ RLRDETNFEV LKSGRYYILL
LGKGISVTWD LAMGVSVILK GHFRDQVCGL CGNFDGIQNN DLTSRNNNLE VDPIDFGNSW
KVSPGCADVR KVPQGQTMTT SLCNGNLMKQ MMVETSCSIL ISELFKECKK LVNPELYMDI
CMYDTCACES VGDCACFCDA IAAYAHICAQ KGAVVHWRSS TLCPQSCEDL NKHELEYQCE
WRYNSCGPAC PITCQHFEPA VCPLQCVEGC HVHCPEGKIL DELSQSCIDP ESCPVCILEG
VRISHGKRVV LNKDDDEHCL SCHCEGKNLT CSACESEDKE EGIITPTPVA TEETTIDTGT
RVYSCSKMMD LAFLMDGSNK LSEKDFKLLK AFIISMMEKL HISQKKIRVS VLEYHTGSNI
YLGLKDIKTQ SQMRKIVQNI KYTGGDVASA TEVLKYIVFH VFGKAPRTNA ARIALLLTAS
KSPGNIQRIL TLLQKKKVTV IPVGIGPSIS MEQIKLIERQ WPENKAFIMN NVQELMENKD
EIINYLCDLV PEESDVLTTT TQKPITIPPR ATTVTVRNQS LATTSFPGKG IYKVLDIAFV
VEGSEKVGEE NFNIIKEFIA KVIRKMNIGE ETIHITIIQY SYTVTVEYSF SEAQSKHDII
EMVRKIQYQG GNATNTGNAL NYVSKHTFTT DNGGRRQVPH LVYMVTANPA TDVISRVASD
INVIPIGITP NANFQELEKI SQPHTPIIIE GYNQLIRETP DLVLKTCCSN ESIIPSEFCN
RPMDVMFLLD GSSSVGASEF QEMKSFVKAF IENYGISRNS TQVSVLQYAR AKTLEISWNM
PQESENLLNR VSSIQQREQG PNKLGEAVNF AIQHAMSEAH GGRPNASKIA VIIVSEKSED
PVNSAAYSAS INRVTLFPIG VGNRYDEEQL RTLAGQSGTD RIIKLRRFED LPDMVTLDDA
FVHKLCKEPA RECIDEDGNK RRPGDKWTLL DSCHFVTCSP GGHMSLESHR INCVKMQKPT
CHNNLPAVKT EETCGCRWTC PCKCMGSSTR HIVTFDGLDF KLTGNCSYTL FQDKEHDIEM
ILHNGACSST PKLNCMNAIE VKHHGKSIQL SGDMTVTVNR EMTTVPYMDD YFEVNIYGAI
MHEIRFSQLG HNFTFTPRNN EFILHLNPRS FSSQIYGLCG VCDQNNGNDF MLRNGSVTSD
SSTFIQEWTV KEPGKICEIK RDDRCTEHAT TKCNILGSPQ FAQCRGIISP DMFYSACEEN
SCYEDEICEV IASYAHVCRT NGICVVWRSP EFCAMKCPNS LIYDHCRTSC TKHCENSTSI
SVCKDYPIEG CFCPPGQVTL NSYCVDEEVC TECIAEDGTH YQHTETWIPS NEPCKICMCL
ENRIINCTTQ PCPTAKPAVC GPCEVSRLRQ DSDQCCPKYE CVCDLVTCDL PAVPVCEDGL
QPVLTNPGEC RPTYSCACKQ EECKLEHIPS CPPHRELTVK KTQCCDEYEC TCSCTNSTVS
CPAGYLSTSV TNDCGCTTTN CIPDKVCVHH NVVYPTGKSW EEVCRDCTCT DMEDAVTGLH
IAECLEKECS TICPQGYTYV NREGECCGKC QKTMCEEQNS WSRGDVDVHL HEVGTEWRSP
FSPCIINKCV RVNDEVFVEK RNVSCSQMDA PTCHLGYELR CDRITNCCPS CRCEPVNGCV
LNGTILGVGE RLMLDQCTNC QCSLQRGLPM NFKLTCRKTT CEPCPKSYRM EEISGSCCGK
CVPTSCGIKL RDGRILYLKP NETVQDGCDS HSCKVNEKRE FIWEKRITGC PPFDSRRCLA
EGVSSEVGPA CMFLNIFCHV YGALYKSRMC IRDG
//