ID K7G4Y9_PELSI Unreviewed; 1156 AA.
AC K7G4Y9;
DT 09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT 09-JAN-2013, sequence version 1.
DT 27-MAR-2024, entry version 49.
DE RecName: Full=Collagen type VII alpha 1 chain {ECO:0008006|Google:ProtNLM};
OS Pelodiscus sinensis (Chinese softshell turtle) (Trionyx sinensis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Testudinata; Testudines; Cryptodira; Trionychia;
OC Trionychidae; Pelodiscus.
OX NCBI_TaxID=13735 {ECO:0000313|Ensembl:ENSPSIP00000015350.1, ECO:0000313|Proteomes:UP000007267};
RN [1] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RG Soft-shell Turtle Genome Consortium;
RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RX PubMed=23624526; DOI=10.1038/ng.2615;
RA Wang Z., Pascual-Anaya J., Zadissa A., Li W., Niimura Y., Huang Z., Li C.,
RA White S., Xiong Z., Fang D., Wang B., Ming Y., Chen Y., Zheng Y.,
RA Kuraku S., Pignatelli M., Herrero J., Beal K., Nozawa M., Li Q., Wang J.,
RA Zhang H., Yu L., Shigenobu S., Wang J., Liu J., Flicek P., Searle S.,
RA Wang J., Kuratani S., Yin Y., Aken B., Zhang G., Irie N.;
RT "The draft genomes of soft-shell turtle and green sea turtle yield insights
RT into the development and evolution of the turtle-specific body plan.";
RL Nat. Genet. 45:701-706(2013).
RN [3] {ECO:0000313|Ensembl:ENSPSIP00000015350.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGCU01138587; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01138588; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01138589; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01138590; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01138591; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01138592; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01138593; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; K7G4Y9; -.
DR STRING; 13735.ENSPSIP00000015350; -.
DR Ensembl; ENSPSIT00000015422.1; ENSPSIP00000015350.1; ENSPSIG00000013727.1.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000153769; -.
DR HOGENOM; CLU_275826_0_0_1; -.
DR OMA; IARTHTM; -.
DR Proteomes; UP000007267; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 8.
DR Gene3D; 2.60.40.10; Immunoglobulins; 9.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR46708:SF2; FIBRONECTIN TYPE-III DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR46708; TENASCIN; 1.
DR Pfam; PF00041; fn3; 8.
DR Pfam; PF00092; VWA; 1.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 9.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 6.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS50853; FN3; 8.
DR PROSITE; PS50234; VWFA; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000007267};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..19
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 20..1156
FT /note="Collagen type VII alpha 1 chain"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003904977"
FT DOMAIN 34..207
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 228..324
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 414..503
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 504..592
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 595..683
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 686..773
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 776..864
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 869..957
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 958..1046
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
SQ SEQUENCE 1156 AA; 124696 MW; 995AB1DBE0197657 CRC64;
MRRWILFLAV LLAPAPALAQ KRNQGTCINV YAADIVFLVD GSSSIGRANF RMIRSFMEDL
VRPFVHVVGE RGVRFGAVQY SDDPRLEFTL SQHPNGTEVR RAIQQLSYKT GNTRTGAGLR
YIADNFFGPT QIRPGVPQVC ILITDGKSQD DTEQPSVKLK AQGTKVFAVG IKNADRAELM
SVASTPSDDY FFYVNDFKIL STLLPLVSRR VCASTGGVLQ TGSQVYSGPS NLVFVEQAAD
ALRIRWTAAG GPVTGYKVQY VPLTSLGQQV TAEMKEVSLS PGETSTVLRQ LRGGTDYLVT
VIAQYANSIG ESVSGTGRTE ALSGVLNFRV VEAGPSFLRL AWAAALESLQ GYRITYMAQG
EAQAEEMSLG ANSVSIMLSN LRPNTDYVVT LQPVSQQQTV APTRITGRTL RLERVQQLTV
ENISQQSMVV TWRGVSGATG YRVSWGLPSG QDIRKFDVDA SKNSYLLTGL QPGTDYLLTV
MPLYGQIEGP PASIRRRTET GIVQSLRTVI LGPTSIQVLW NIIRDARGYR LEWKRATGAE
PPQTVTLPTN IDRYQLTGLQ PATEYRITLY TLYDGREVAT PVTISQTGVE PQVGSISDLR
VVGTVGKRLR LAWGGVPGAT EYKIVLRSSQ DGSETTRQIP GTQTMLELDD LSEEVTYIVR
VSALIGRREG SAVPLSIRIG PQPVGSITNL RVAEIRPNQL RVTWSGLPGA TSYKLTWRAS
DGQEISRVLP ADRTSFSIEG LQAGAAYVVG VSALVGDREG SPVTIPVRTA PEQVGMVSSL
KVLSSRSNVV RVTWVGVPGA TAYRVVWSRR DGGSESSKLV SGDTSSFDIL NLEGGVSYTV
KVTALIGNRE GDPVSIVVTT PAEVAPVQPV GNLQVIDSSE QRVRLTWSPA PGSSGYRLSW
RPVAGGPERS QLLPPNVRSY DIEGLEAGVR YEIRITSLVG SQESEPVGIA VSTAPLGRVT
NFRVTETQDD SVTLAWTPAP GATGYLLTWK LPQEGGEMQR MLPGSATSHQ VSGLRLGHRY
VFTIQPLFGS TRGMESSVTD RTVCRDARGD VIFLVHGTRD SAYSAEAVRA LLTNTVSALG
RLGPDATQVW GAGLWVSPLT HLTHMPLVLP PALGAASGHP VSPLPLAWDR HSAPCATRSP
AAVPWELPPA LPGSTC
//