ID A0A226EHK2_FOLCA Unreviewed; 846 AA.
AC A0A226EHK2;
DT 25-OCT-2017, integrated into UniProtKB/TrEMBL.
DT 25-OCT-2017, sequence version 1.
DT 27-MAR-2024, entry version 16.
DE SubName: Full=Collagen alpha-1(XIV) chain {ECO:0000313|EMBL:OXA56810.1};
GN ORFNames=Fcan01_08492 {ECO:0000313|EMBL:OXA56810.1};
OS Folsomia candida (Springtail).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Collembola;
OC Entomobryomorpha; Isotomoidea; Isotomidae; Proisotominae; Folsomia.
OX NCBI_TaxID=158441 {ECO:0000313|EMBL:OXA56810.1, ECO:0000313|Proteomes:UP000198287};
RN [1] {ECO:0000313|EMBL:OXA56810.1, ECO:0000313|Proteomes:UP000198287}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=VU population {ECO:0000313|EMBL:OXA56810.1,
RC ECO:0000313|Proteomes:UP000198287};
RC TISSUE=Whole body {ECO:0000313|EMBL:OXA56810.1};
RA Faddeeva A., Derks M.F., Anvar Y., Smit S., Van Straalen N., Roelofs D.;
RT "The genome of Folsomia candida.";
RL Submitted (DEC-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OXA56810.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LNIX01000003; OXA56810.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A226EHK2; -.
DR STRING; 158441.A0A226EHK2; -.
DR Proteomes; UP000198287; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR CDD; cd01450; vWFA_subfamily_ECM; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR031993; DUF4789.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF70; PH DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF16033; DUF4789; 1.
DR Pfam; PF00092; VWA; 1.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS50234; VWFA; 1.
PE 4: Predicted;
KW Collagen {ECO:0000313|EMBL:OXA56810.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000198287};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..20
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 21..846
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012488761"
FT DOMAIN 190..361
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 457..513
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 846 AA; 95190 MW; D1FF095F79A321EB CRC64;
MIQIQGQLII FIGVLALVYS QSIDETLTPR SKGADVSEAV VRKIQRSGIF PHDQNLLRRL
SYVESRDGED DKTYRPDYFG GIWQLDEVLF NQTKNVPGLN TIYQRLDATW NIQWNSVTWA
DLLVPMYSGL GARLYLHTLS LSNRPIPGTI DSQAEYWRLN YNRADQNATT YKDLVKRLES
DGGVCEGLMD ICIVLDGSRS IGAGAFIRAK GFVADLVQTF SLNSTRVAFV LYSDNAQKIF
DFSNTLTPSA MNTTIRNVRY PNGNTNTPAG LLMAVTFFNQ ATPRPGVPQV VATFTDGNSN
RGNLTVAVAA VKTANLTSFA VGIGNEQDIR ESELQQIALN DSSRVFRVND YEALAEFFFQ
MNKATCEVPQ EPEIGDEHNG TLNKNEKRYF KYQIPDAGFT LNLLTSGQIS GWYSYTEQTP
NSAVHDGLIL THTFIQSAME NSVVHIALQG DSVDPTEYNI RTDEGNQVTS TTPQSTTTYT
TSTTPTSASS TTPTSASSTT PAPTSSTTLE TTTPNSAKNV IASIVSVVLD HNFTNVSARD
RKIRATIPGT GWTSMEYFPP DFMDYENSNR IYGGNDKQPP IFQRDQITGR ISHTDDKDQF
ALKNQPEQFP LPKIIDRNGY IKPTKQSVGT VDITPPFETH YTYPFESKSE FICPQNDIAG
YKRFAYSRKH NKCFEVGYRG PCKPNMKFFL DRSGSQFGKC DCDPEVGYNC GRPLVFLDDW
CYPVYSQGPC HHNDWLVLDQ GQPRCERNAC SWQQTFDKIK ETDCLHWVYM NGNCHLTSTR
AFCERDEVLF WDVYDSRATD PTCIPAWSYP ADFMNYRIDK RRIDPALPCF PGQRQTIARQ
CQPNYK
//