ID M7BIX5_CHEMY Unreviewed; 3178 AA.
AC M7BIX5;
DT 29-MAY-2013, integrated into UniProtKB/TrEMBL.
DT 29-MAY-2013, sequence version 1.
DT 27-MAR-2024, entry version 40.
DE SubName: Full=Collagen alpha-1(VII) chain {ECO:0000313|EMBL:EMP35630.1};
GN ORFNames=UY3_07249 {ECO:0000313|EMBL:EMP35630.1};
OS Chelonia mydas (Green sea-turtle) (Chelonia agassizi).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Testudinata; Testudines; Cryptodira; Durocryptodira;
OC Americhelydia; Chelonioidea; Cheloniidae; Chelonia.
OX NCBI_TaxID=8469 {ECO:0000313|EMBL:EMP35630.1, ECO:0000313|Proteomes:UP000031443};
RN [1] {ECO:0000313|Proteomes:UP000031443}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23624526; DOI=10.1038/ng.2615;
RA Wang Z., Pascual-Anaya J., Zadissa A., Li W., Niimura Y., Huang Z., Li C.,
RA White S., Xiong Z., Fang D., Wang B., Ming Y., Chen Y., Zheng Y.,
RA Kuraku S., Pignatelli M., Herrero J., Beal K., Nozawa M., Li Q., Wang J.,
RA Zhang H., Yu L., Shigenobu S., Wang J., Liu J., Flicek P., Searle S.,
RA Wang J., Kuratani S., Yin Y., Aken B., Zhang G., Irie N.;
RT "The draft genomes of soft-shell turtle and green sea turtle yield insights
RT into the development and evolution of the turtle-specific body plan.";
RL Nat. Genet. 45:701-706(2013).
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC -!- SIMILARITY: Belongs to the sauvagine/corticotropin-releasing
CC factor/urotensin I family. {ECO:0000256|ARBA:ARBA00009287}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KB528245; EMP35630.1; -; Genomic_DNA.
DR STRING; 8469.M7BIX5; -.
DR Proteomes; UP000031443; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR GO; GO:0005179; F:hormone activity; IEA:InterPro.
DR CDD; cd00063; FN3; 9.
DR CDD; cd01450; vWFA_subfamily_ECM; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 10.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR000187; CRF.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR Pfam; PF01391; Collagen; 14.
DR Pfam; PF00473; CRF; 1.
DR Pfam; PF00041; fn3; 10.
DR Pfam; PF00092; VWA; 2.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 10.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 7.
DR SUPFAM; SSF53300; vWA-like; 2.
DR PROSITE; PS50853; FN3; 9.
DR PROSITE; PS50234; VWFA; 2.
PE 3: Inferred from homology;
KW Collagen {ECO:0000313|EMBL:EMP35630.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000031443};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}.
FT DOMAIN 32..205
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 226..322
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 412..501
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 510..608
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 611..699
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 701..789
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 792..879
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 882..970
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 975..1063
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1064..1152
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1156..1327
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 540..560
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1349..2128
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2148..2933
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1432..1448
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1543..1571
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1602..1621
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1706..1721
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1896..1910
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2031..2061
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2160..2177
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2245..2265
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2305..2334
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2486..2505
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2643..2657
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2680..2711
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2860..2874
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3178 AA; 324170 MW; E9A1DFC427B2B9FE CRC64;
MRGERDVSQA TVNKQLQGGN GRGTCINVYA ADIVFLVDGS SSIGRANFRM IRAFMEDLVR
PFVHVVGESA VRFGAVQYSD DPRVEFTFSQ HPNGTEVKRA IQQLSYKGGN TRTGAGLRYI
ADNFFGPTQI RAGVPKVCIL ITDGKSQDDT EQPSVKLKAQ GTKVFAVGIK NADSAELTRV
ASTPTEDYFF YVNDFKILGT LLPLVSQRVC ASTGGVLQTG PRVYSGPSNL VFVEQTVNML
RIRWTAAGGP VTGYKVQYVP LTGLGQQVTA EMQEVNLSPG ETSAVLSQLI GGTDYLVTVI
AQYANSIGES VSGKGRTGAL SGVSNFRVVE AGPSFLRLAW GAALENLQGY RITYMARGDA
QAEEMSLGAN AVSVTLSNLR PNTDYVVTLQ PISQRQMAAP AQLTGRTLRL EGVQQLSVQN
ISQQSMLVTW RGVSGATGYR VSWGLPSGQD IRKFDVDASK NSYLLTGLQP DTDYLLTVIS
LYGQVEGPPA SIRRRTETGI VQSLRTVILG PTSIQVAWNI IRDARGYRLE WRRATASDCL
ISAPPSPPPS HSGFNEDPQT VSLPTNINSY QLTGLQPATD YRITLYTLYD GREVATPVTI
SQTGAEPPVG SISDLRVIDT VGKRIRLAWT GVPGATEYKI VLRSSQDGTD RTRQIPGTQT
MLELEDLREE VTYVVRVSAL IGRREGSAVP ISVRIEPSVS SVSNLQVVPV GVGRVRLLWS
AMPRATEYKL VITNRQDGSE ETRRIPGNRN FFELQDLKEG VTYLVQVTTL VGSQESDPAT
ITVQLDPPSV GSVTNLRVAE IRPNQLRVAW SGLPGASGYK LTWRASDGQE ISRVLPADRT
SFSIEELQAG AVYVIGVSAL VGSREGSPVT IAARTAPEQV GMVSSLKILS SRSNVVRVTW
VGVPGATAYK VVWSRRDGGS ESSEVVSGDT SSFDILNLEG GVSYTVKVTA LIGNREGDPV
SIVVTTPAEV APAQPVGNLR VIDSSEQRIR LTWSPAPGST GYRLSWRPAD GGPERSQLLT
PNVNSYDIEG LEAGERYEIR ITSLVGSRES ETVGIAANTA PLGRVTSFRV TETRDDSVTL
AWTPAPGATG YLLTWKLPRE GGETQRTLPG SATSHQVSGL RLGHRYLFTI RPLFGSVKGA
ESTLTDRTVC RDVRGDIIFL VHGTRDSAYS AEAVRALLSN TVSALGQLGP DAAQVGLVIY
SYRSIPWVLL TXXXXXXXXX XTMRYEEPSG NALGAAINFA RTYVLSPSAG RRPGVPGVLV
VLADSPSGDD AIGPAREIKA TGIQVLAVGM DGVDHEQLRR IVTSEDPRNV FYVKDSRAGL
SELEDRLSST LCRVTVTDRL EPCTVQCPKV SPTPVSTFAA SSHEEGAAGE RLQKAGPLLE
LGEGRDGSPP GAVGMGHPQL EPCTVQCPKG EKGEPGQTGQ KGRVGQPGPP GHPGLNGLPG
PPGPIGPQGA PGEIIERPGE KGDRGFPGVD GIPGSPGRPG NSGSPGQPGR QGLPGLRGSP
GDQGPVGPPG LRGEKGEPGD PGVIVNGGGR LPGRKGEPGS PGNPGLPGNP GPRGVVGDPG
PPGPSGPPGP SGPAGEFVKG AKGERGERGP PGLIDGVPPG GEPGTPGLPG DPGPRGPPGP
PGQKGDKGDG EEGFPGPPGR PGDPGDRGDG EEGFPGPPGR PGDPGDRGDG EEGFPGPPGR
PGDPGDRGPR GPPGEQGSKG DRGQPGELGE AGEKGDRGLP GPEGTKGEPG APGRLGPTGR
EGDQGAPGEP GKPASSVSGV KGEKGPPGFS VPGPPGPKGE QGDRGIVGLT GKSGPKGDPG
EPGEKGELGR PGAPGQMGLR GKEGERGEKG DEGTLGLPGN RGLPGEKGDQ GDPGDDGRNG
SPGVPGAKGD RGEPGLPGPP GRIVDSGAVG GTGERGEKGE PGDPGEDGMK GAKGEAGVPG
LPGERGIEGP RGPPGARGDP GDRGQSGDKG DRGPPGLDGR NGLDGKPGQM GPAGQRGDPG
KQGDPGRDGL PGLRGEQGAP GAIGPPGPPG LAGKPGDDGK PGLNGKNGED GTPGEDGRKG
DKGEAGAPGR DGQEGPKGER GDRGAPGPLG PPGVPGVPGQ VGPPGQGAPG LAGVAGQKGD
RGEAGSKGEQ GRPGDPGQRG EPGTVSNVER ALEAYGIKIA SLREITGAYD GSTDPFLPYP
DRRRGQKGDR GDSGERGPPG KEGVMGFPGE RGPKGDKGDQ GPAGPQGPMG RAIGERGPEG
PPGQAGEPGK PGIPGVPGRA GELGEAGRPG EKGDRGEKGD RGEPGRDGVQ GPPGPPGPKA
DVVEGSLSGL PGERGPTGPK GAKGEPAVDG ERGPKGDKGE PGQKGDRGEA GEKGRDGSPG
LPGERGLAGP EGKPGNQGDP GPPGTAGLVG PPGAQGPPGI KGDVGEPGSS IRGLPGPQGS
VGLPGPSGPP GLVGPQGTQG LPGQVGETGK PGVPGRDGVP GKEGEPGLPG KMAIVGPPGM
KGEKGAPGGI EGNLLGEPGA KGERGLPGPR GEKGEPGRQG EPGDPGEDGV KGSPGVKGEK
GSVGIGLQGP PGQDGPQGLK GDTGLPGPPG SPGLPGIAGT PGQPGLRAEN GQPGPPGPPG
ERGLIGFPGR DGTSGSPGPP GPPGPAGAQG IPGLKGDKGN VGAGQPGPRG ERGDPGPRGP
PGNRGERGDK GDIGAMGPKG DKGDTVIVEG PAGVRGSKGE PGDRGLKGME GEKGDKGDAG
LPGEKGGRGE QGEKGSTGFP GARGPGGQKG EVGESGDPGE SGLPGKDGIP GARGEKGDIG
PLGMRGPKGD RGPKGACGQD GDKGGKGDPG IPGRMGLPGR KGELGELGMP GTPGIPGKEG
LMGPKGDRGF DGQQGAKGDQ GEKGDRGTRG IIGSPGPRGN DGAPGPPGPP GSVGPKGPEG
IQGQKGERGP PGESAVGTRG VPGIPGERGD QGNPGLEGSR GEKGDPGMTE QEIRAFVRQE
MSQHCACGGQ FSSSEPRLLP NYPSTQPFLS VNAHLVPVLK LSHAEEEEGH EVRVVNTNDP
EYEHVYAMED YEESLEADGT ESTMLSDARA SKKMGREMLQ NIYDAEANFH QTGPEKRDNP
VSRETMENSK ILNLLGAKGL SVANGSRHSS GDEAGSSHKE TRWALLEESA KRTLSLLPSL
SHKKAMIVKK SSPGTKFSLS LDVPTHILKI LIDLAKAKQM RAKAAANAEL MAQIGRRK
//