ID A0A087X3T4_POEFO Unreviewed; 1015 AA.
AC A0A087X3T4;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 2.
DT 27-MAR-2024, entry version 42.
DE SubName: Full=Collagen type VI alpha 2 chain {ECO:0000313|Ensembl:ENSPFOP00000000437.2};
GN Name=COL6A2 {ECO:0000313|Ensembl:ENSPFOP00000000437.2};
OS Poecilia formosa (Amazon molly) (Limia formosa).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC Poecilia.
OX NCBI_TaxID=48698 {ECO:0000313|Ensembl:ENSPFOP00000000437.2, ECO:0000313|Proteomes:UP000028760};
RN [1] {ECO:0000313|Proteomes:UP000028760}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=female {ECO:0000313|Proteomes:UP000028760};
RA Schartl M., Warren W.;
RL Submitted (OCT-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSPFOP00000000437.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AYCK01018496; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; A0A087X3T4; -.
DR STRING; 48698.ENSPFOP00000000437; -.
DR Ensembl; ENSPFOT00000000438.2; ENSPFOP00000000437.2; ENSPFOG00000000302.2.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000155682; -.
DR OMA; ELYRDDY; -.
DR Proteomes; UP000028760; Unassembled WGS sequence.
DR CDD; cd00198; vWFA; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 3.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF00092; VWA; 3.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00327; VWA; 3.
DR SUPFAM; SSF53300; vWA-like; 3.
DR PROSITE; PS50234; VWFA; 3.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000028760};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..1015
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5001832278"
FT DOMAIN 37..226
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 609..796
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 828..1010
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 250..579
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 518..532
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1015 AA; 107224 MW; C8423992959AFB8D CRC64;
CWQRRLLKRV YVLTCFLIIK ADTQQLLCST IIDCPIKLFF TIDTSETIAL QESPPGSLVE
SIKEFMKIFA QKLEDEEYRG QIQITWSFGG LHFSQKQVLF SQFTTRQSFI RNLSQIKYLG
KGTYIDCALT NMTRQMTYSG KQAVLFSVVI TDGHVTGSPC GGIKVAGDRA RDQGIHMFSV
AASTSIDELG MMEIASSPTE VYRDDYIVME IVNGRPRMNT ETIDRIIKVM KYQAYLQCYK
PTCLAVPGNP GRKGASGPKG IKGDRGIKGP KGYKGNQGDP GIEGSIGLPG QKGKTGFKGE
KGEIGLIGAK GAVGTPGKNG ADGQKGKIGR IGAPGCKGDP GEKGPDGLPG DPGDSGLSGK
EGEKGDIGLP GKNGPPGPVG NPGLTGEKGH PGNPGPPGER GFPGISGKPG LKGELGRRGD
PGRKGAPGQD GAPGPKGDRG ASGDRGRPGE AGVKGAKGDQ GLPGPRGWPG EPGSVGGNGI
VGPPGDAGLR GNPGPPGPQG DNGRPGFSYP GPRGPTGERG DPGKRGPRGG RGECGAKGEP
GAKGAPGEPG EPGQPGEPGE RGPPGDPGKD GAPGPAGDPG LTDCDVMTYI RETCGCCDSD
CEKHCGALDI VFVIDSSESV GMTNFTLEKN FVINTINRMG SMASDPTSPT GTRVGVVQFS
HEGTFEAIRL DDPSIDSMSS FKTAVKNLQW IAGGTFTPSA LKFAYDNLIR DSKRARANVS
VVVVTDGRFD PRDDDSKLRY LCNDPNVVVN AIGVGDMFDK EHDSETLVSI ACDNKNRITE
MKRYSDLVAD NFIQKMETVL CPDPVIKCPD LPCKTELDVA PCVGRPVELV FLLDGSERLG
MENFGHARHF VQMVANALTM ARNRNDQNGA RLALTEFGNE NENQVAFLLT HDQKAITSGL
SGLHYLDASS AVGPAIFKAI DEILGKGPTR KTRRGAEVSF VFITDGVTNI TNLDKAASAM
ASEHIFSTVI ATGSDVDEEA LTKLVMGDQT AIFKSQTFSD VLQPSFFDRF IRWVC
//