Database: UniProt
Entry: A0A3Q2DDI1_CYPVA
ID   A0A3Q2DDI1_CYPVA        Unreviewed;       575 AA.
AC   A0A3Q2DDI1;
DT   10-APR-2019, integrated into UniProtKB/TrEMBL.
DT   10-APR-2019, sequence version 1.
DT   27-MAR-2024, entry version 24.
DE   SubName: Full=Collagen, type XXI, alpha 1 {ECO:0000313|Ensembl:ENSCVAP00000016887.1};
OS   Cyprinodon variegatus (Sheepshead minnow).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Ovalentaria; Atherinomorphae; Cyprinodontiformes; Cyprinodontidae;
OC   Cyprinodon.
OX   NCBI_TaxID=28743 {ECO:0000313|Ensembl:ENSCVAP00000016887.1, ECO:0000313|Proteomes:UP000265020};
RN   [1] {ECO:0000313|Ensembl:ENSCVAP00000016887.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; A0A3Q2DDI1; -.
DR   STRING; 28743.ENSCVAP00000016887; -.
DR   Ensembl; ENSCVAT00000025398.1; ENSCVAP00000016887.1; ENSCVAG00000019932.1.
DR   GeneTree; ENSGT00940000153769; -.
DR   OMA; YVSTQRF; -.
DR   Proteomes; UP000265020; Unplaced.
DR   CDD; cd01472; vWA_collagen; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24020:SF20; COLLAGEN ALPHA-1(XXI) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   Pfam; PF00092; VWA; 1.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00210; TSPN; 1.
DR   SMART; SM00327; VWA; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   SUPFAM; SSF53300; vWA-like; 1.
DR   PROSITE; PS50234; VWFA; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000265020};
KW   Signal {ECO:0000256|ARBA:ARBA00022729}.
FT   DOMAIN          7..180
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   REGION          402..557
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        436..450
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        507..521
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   575 AA;  61384 MW;  078F235971D5D5D2 CRC64;
     CRTAPSDLVF ILDGSWSVVD INFEIVKGWL VNITTSFNIG QKFTQVGVVQ YSDDPVLEIP
     LGKFSSKKDL IRAMENIEYM GGNTRTGTAI KFATDKLFGL SERGPAGISR IAVVLTDGKS
     QDEVLKAAEA ARKKGVILFA IGVGPETEQD ELRDIANKPS STYVFSVEDY KAISRIIQVI
     RQKLTVCPAK IPTDSRDEKG FDILLNLNLA KKAKKTQGSL FVNKAYEVTS AVDLSEATSL
     FPDGLPPSYV FVATLRYKGS VATEKWDLWR IQTLDGEPQM AVTLDGLDNT VMFTTTSNAP
     SGIQTVKFSQ QTMLFDEKWH QLRLLVTEED VTLYVDNMEI ETQPLEPSVG IFINGKTQVG
     KYVNKEATVP FEIQKLRIYC DPAQNMRETA CEIPGVYSCQ KGIAGDPGQR GPEGQTGRPG
     QPGRSGPTGP QGIRGDTGPP GPPGPEGRSV SPGPQGPAGP QGLRGLPGLS GSRGHPGRPG
     RPGIIGLKGE PGYKGEKGNP GQVMVGDQGP PGPPPIGPQG YSKTGPPGKP GLPGQNGAEG
     KPGNPGVPGQ PGVCDPSLCY ASMMRRSSFS KGPNY
//