GenomeNet

Database: UniProt
Entry: A0A226EHK2_FOLCA
ID   A0A226EHK2_FOLCA        Unreviewed;       846 AA.
AC   A0A226EHK2;
DT   25-OCT-2017, integrated into UniProtKB/TrEMBL.
DT   25-OCT-2017, sequence version 1.
DT   27-MAR-2024, entry version 16.
DE   SubName: Full=Collagen alpha-1(XIV) chain {ECO:0000313|EMBL:OXA56810.1};
GN   ORFNames=Fcan01_08492 {ECO:0000313|EMBL:OXA56810.1};
OS   Folsomia candida (Springtail).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Collembola;
OC   Entomobryomorpha; Isotomoidea; Isotomidae; Proisotominae; Folsomia.
OX   NCBI_TaxID=158441 {ECO:0000313|EMBL:OXA56810.1, ECO:0000313|Proteomes:UP000198287};
RN   [1] {ECO:0000313|EMBL:OXA56810.1, ECO:0000313|Proteomes:UP000198287}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=VU population {ECO:0000313|EMBL:OXA56810.1,
RC   ECO:0000313|Proteomes:UP000198287};
RC   TISSUE=Whole body {ECO:0000313|EMBL:OXA56810.1};
RA   Faddeeva A., Derks M.F., Anvar Y., Smit S., Van Straalen N., Roelofs D.;
RT   "The genome of Folsomia candida.";
RL   Submitted (DEC-2015) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:OXA56810.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; LNIX01000003; OXA56810.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A226EHK2; -.
DR   STRING; 158441.A0A226EHK2; -.
DR   Proteomes; UP000198287; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   CDD; cd01450; vWFA_subfamily_ECM; 1.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR   InterPro; IPR031993; DUF4789.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24020:SF70; PH DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF16033; DUF4789; 1.
DR   Pfam; PF00092; VWA; 1.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00327; VWA; 1.
DR   SUPFAM; SSF53300; vWA-like; 1.
DR   PROSITE; PS50234; VWFA; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000313|EMBL:OXA56810.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000198287};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..20
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           21..846
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5012488761"
FT   DOMAIN          190..361
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   REGION          457..513
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   846 AA;  95190 MW;  D1FF095F79A321EB CRC64;
     MIQIQGQLII FIGVLALVYS QSIDETLTPR SKGADVSEAV VRKIQRSGIF PHDQNLLRRL
     SYVESRDGED DKTYRPDYFG GIWQLDEVLF NQTKNVPGLN TIYQRLDATW NIQWNSVTWA
     DLLVPMYSGL GARLYLHTLS LSNRPIPGTI DSQAEYWRLN YNRADQNATT YKDLVKRLES
     DGGVCEGLMD ICIVLDGSRS IGAGAFIRAK GFVADLVQTF SLNSTRVAFV LYSDNAQKIF
     DFSNTLTPSA MNTTIRNVRY PNGNTNTPAG LLMAVTFFNQ ATPRPGVPQV VATFTDGNSN
     RGNLTVAVAA VKTANLTSFA VGIGNEQDIR ESELQQIALN DSSRVFRVND YEALAEFFFQ
     MNKATCEVPQ EPEIGDEHNG TLNKNEKRYF KYQIPDAGFT LNLLTSGQIS GWYSYTEQTP
     NSAVHDGLIL THTFIQSAME NSVVHIALQG DSVDPTEYNI RTDEGNQVTS TTPQSTTTYT
     TSTTPTSASS TTPTSASSTT PAPTSSTTLE TTTPNSAKNV IASIVSVVLD HNFTNVSARD
     RKIRATIPGT GWTSMEYFPP DFMDYENSNR IYGGNDKQPP IFQRDQITGR ISHTDDKDQF
     ALKNQPEQFP LPKIIDRNGY IKPTKQSVGT VDITPPFETH YTYPFESKSE FICPQNDIAG
     YKRFAYSRKH NKCFEVGYRG PCKPNMKFFL DRSGSQFGKC DCDPEVGYNC GRPLVFLDDW
     CYPVYSQGPC HHNDWLVLDQ GQPRCERNAC SWQQTFDKIK ETDCLHWVYM NGNCHLTSTR
     AFCERDEVLF WDVYDSRATD PTCIPAWSYP ADFMNYRIDK RRIDPALPCF PGQRQTIARQ
     CQPNYK
//