GenomeNet

Database: UniProt
Entry: A0A087X3T4_POEFO
ID   A0A087X3T4_POEFO        Unreviewed;      1015 AA.
AC   A0A087X3T4;
DT   29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT   26-NOV-2014, sequence version 2.
DT   27-MAR-2024, entry version 42.
DE   SubName: Full=Collagen type VI alpha 2 chain {ECO:0000313|Ensembl:ENSPFOP00000000437.2};
GN   Name=COL6A2 {ECO:0000313|Ensembl:ENSPFOP00000000437.2};
OS   Poecilia formosa (Amazon molly) (Limia formosa).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC   Poecilia.
OX   NCBI_TaxID=48698 {ECO:0000313|Ensembl:ENSPFOP00000000437.2, ECO:0000313|Proteomes:UP000028760};
RN   [1] {ECO:0000313|Proteomes:UP000028760}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=female {ECO:0000313|Proteomes:UP000028760};
RA   Schartl M., Warren W.;
RL   Submitted (OCT-2013) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSPFOP00000000437.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AYCK01018496; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   AlphaFoldDB; A0A087X3T4; -.
DR   STRING; 48698.ENSPFOP00000000437; -.
DR   Ensembl; ENSPFOT00000000438.2; ENSPFOP00000000437.2; ENSPFOG00000000302.2.
DR   eggNOG; KOG3544; Eukaryota.
DR   GeneTree; ENSGT00940000155682; -.
DR   OMA; ELYRDDY; -.
DR   Proteomes; UP000028760; Unassembled WGS sequence.
DR   CDD; cd00198; vWFA; 1.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 3.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF00092; VWA; 3.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00327; VWA; 3.
DR   SUPFAM; SSF53300; vWA-like; 3.
DR   PROSITE; PS50234; VWFA; 3.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000028760};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..23
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           24..1015
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5001832278"
FT   DOMAIN          37..226
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          609..796
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          828..1010
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   REGION          250..579
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        518..532
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1015 AA;  107224 MW;  C8423992959AFB8D CRC64;
     CWQRRLLKRV YVLTCFLIIK ADTQQLLCST IIDCPIKLFF TIDTSETIAL QESPPGSLVE
     SIKEFMKIFA QKLEDEEYRG QIQITWSFGG LHFSQKQVLF SQFTTRQSFI RNLSQIKYLG
     KGTYIDCALT NMTRQMTYSG KQAVLFSVVI TDGHVTGSPC GGIKVAGDRA RDQGIHMFSV
     AASTSIDELG MMEIASSPTE VYRDDYIVME IVNGRPRMNT ETIDRIIKVM KYQAYLQCYK
     PTCLAVPGNP GRKGASGPKG IKGDRGIKGP KGYKGNQGDP GIEGSIGLPG QKGKTGFKGE
     KGEIGLIGAK GAVGTPGKNG ADGQKGKIGR IGAPGCKGDP GEKGPDGLPG DPGDSGLSGK
     EGEKGDIGLP GKNGPPGPVG NPGLTGEKGH PGNPGPPGER GFPGISGKPG LKGELGRRGD
     PGRKGAPGQD GAPGPKGDRG ASGDRGRPGE AGVKGAKGDQ GLPGPRGWPG EPGSVGGNGI
     VGPPGDAGLR GNPGPPGPQG DNGRPGFSYP GPRGPTGERG DPGKRGPRGG RGECGAKGEP
     GAKGAPGEPG EPGQPGEPGE RGPPGDPGKD GAPGPAGDPG LTDCDVMTYI RETCGCCDSD
     CEKHCGALDI VFVIDSSESV GMTNFTLEKN FVINTINRMG SMASDPTSPT GTRVGVVQFS
     HEGTFEAIRL DDPSIDSMSS FKTAVKNLQW IAGGTFTPSA LKFAYDNLIR DSKRARANVS
     VVVVTDGRFD PRDDDSKLRY LCNDPNVVVN AIGVGDMFDK EHDSETLVSI ACDNKNRITE
     MKRYSDLVAD NFIQKMETVL CPDPVIKCPD LPCKTELDVA PCVGRPVELV FLLDGSERLG
     MENFGHARHF VQMVANALTM ARNRNDQNGA RLALTEFGNE NENQVAFLLT HDQKAITSGL
     SGLHYLDASS AVGPAIFKAI DEILGKGPTR KTRRGAEVSF VFITDGVTNI TNLDKAASAM
     ASEHIFSTVI ATGSDVDEEA LTKLVMGDQT AIFKSQTFSD VLQPSFFDRF IRWVC
//