ID A0A3P9D5V9_9CICH Unreviewed; 2850 AA.
AC A0A3P9D5V9;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE SubName: Full=Collagen alpha-1(XII) chain {ECO:0000313|Ensembl:ENSMZEP00005029963.1};
OS Maylandia zebra (zebra mbuna).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC Pseudocrenilabrinae; Haplochromini; Maylandia; Maylandia zebra complex.
OX NCBI_TaxID=106582 {ECO:0000313|Ensembl:ENSMZEP00005029963.1, ECO:0000313|Proteomes:UP000265160};
RN [1] {ECO:0000313|Ensembl:ENSMZEP00005029963.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSMZET00005030906.1; ENSMZEP00005029963.1; ENSMZEG00005022304.1.
DR GeneTree; ENSGT00940000154923; -.
DR Proteomes; UP000265160; Unplaced.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 15.
DR CDD; cd01482; vWA_collagen_alphaI-XII-like; 2.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 16.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 4.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF70; PH DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF00041; fn3; 15.
DR Pfam; PF00092; VWA; 4.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 16.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 4.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 12.
DR SUPFAM; SSF53300; vWA-like; 4.
DR PROSITE; PS50853; FN3; 15.
DR PROSITE; PS50234; VWFA; 4.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000265160};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..2850
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5018021063"
FT DOMAIN 24..110
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 127..299
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 323..412
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 427..603
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 621..712
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 714..805
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 848..938
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 939..1027
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1029..1119
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1140..1312
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1324..1413
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1414..1502
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1503..1593
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1671..1760
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1761..1851
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1852..1944
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1945..2024
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2026..2120
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2145..2322
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 2102..2132
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2565..2584
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2611..2698
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2741..2837
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2102..2116
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2117..2132
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2668..2682
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2748..2763
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2778..2799
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2850 AA; 310832 MW; 7D910C69C69A1818 CRC64;
MKSRLSLAAV AAVLVLLLSS SQAQEQDLRF KILSENSVEM MWRRPRSRIQ GYRIEVTSDT
EPTKEFTLPA SATKTSISDL TPDVDYVVTI TAYSGSEQSL PISGQITQRC FCHCCPGPRC
SASVIGDVVF LVDGSWSVGR PNFKYIRSFI SAAAGAFHIG EEKTRVGVVQ YSDDARTEFN
LNEHQTRPAL LRAIGSMPYK GGNTNTGDAL DYVLRNVFSE ASGARNSLPK VLVIITDGKS
EDPVASYAKQ LRSTGVEIFV LGIQQADEEE MRLMASTPYS SHIYSVATFD MIKNVQKQLI
TRMCAGVEDQ LNVVVSGDEF VEPASNLQVQ EVISKSMRLM WDPSIGDVSG YKIQLIPMMA
GSMRQELYVG AGQTSVVVRG LSADTEYQIV LFALKGLTPS EPITAMQKTL PVRVSLECSL
GVDVQADVVL LVDGSYSIGL ANFAKVKAFL EVLVNTFDIG PDKVQISLVQ YSRDPHPEFY
LNTHHDLNAV VKAVRNFPYR GGSTNTGRAM TYVKEKIFQA ARGARANVPR VTILITDGKS
SDAFQEPATR LRNADVEIFA VGVKDAVRSE LEAIANPPAE THVYTVEDFD AFQRISKELI
QSICLRIEQE LRNILQRQLI SPKDLTFSEI GSRSFRTSWK TDATNVESYL VQFRPADDPD
GHFVSMSVPG DSLTAFLPYL NPYIRYEVNI YAHYEKGESL PVTGYQTTLE EQGPVSNLRV
SEETTDSFRV SWLAAPGPVI RYRLNYEPVD DESRRMETTT TGREITTVLH ELRPQTTYRV
TVTPEYPSGP GTPLQTVGTT KEGNSWQAGD HYLCVLCTCR LNVLRSAVKP MLDFIRTLFS
ALEIYGGPPK NLVTSDVTDT SFVASWTAAP GNVNTYRVQW KSMFSQEIQE KTVPGDETST
VLEGLTPETL YQVSVVAAYR HKESEPLTGS EITDASADGK RLRVSDETEK TMRVTWTPAP
GNLLNYRLKY VPHNGGKEVV QKIPAKATST IMKNLKPATT YNITVLPIYK RREGKARQGV
GTTLSPYKSP RNLLTSEPTR TSFRVSWDPA PGEVRGYKVT FHPTGNDIDL EELLVGPYDN
TVVLEELRSG TKYSVAVFGM FDGGESLPLA GEEKTTLKDE PESPHVTLSD AQCKTTAKAD
IVLLVDGSWS IGRINFKTIR SFIGRMVSVF DIGPERVQIG LAQYSGDPKT EWHLNAHPTK
ESLMNAVANL PYKGGNTMTG MALNFILQNN FKPNVGLRPD SRKIGVLITD GKSQDEVIVN
SQNLRDSDIE LYAIGVKNAD ENELRSIASD PDDIHMYNVN DFKFLLDIVD ELSENLCNSV
KGSAPTNLVT SEVTQSSFRA TWTPPDGPVD QFRVTYVMAA GGPTEEVLVD GSVNTLVLQK
LSHRTEYIVN VYSVVGEVNS EPLKGTETTL PRPAAGQLRI SDVTHSTMRL NWDAAPGPVR
KYIITYKPEE GEVKEIEVNG DITTLVLSSL ISQTEYDLAV TPIYDDGPAT PMLGSAITDV
VPAPKNLRFS EVTQSSFRVT WEHGGPDVSL YRVGWVKRGE TDFQQDILNS DETSHVLENL
DPDTDYSVTV TAIYPDESES EDLMGNERTL LPASPKNLRV FNATTTTLTA KWEPAPGAVQ
NYKITYRPVA GGEQMVSTTN IILRNLMPDT PYTVSVLPVY PAREGKQALG GVKNLQVTDP
TTSSLKVRWE PAEGNVRQYR IFYVPASGGA EDMEQVSGGT TNTILRNLLS DTPYTVTVVP
VYPEGEGLRQ SDKGKTLPRM PPRNIKVYNP TPNTLNVRWE PATGQVQQYR VAYAPLTGAS
PSESVLVSGS INNAFLDNLI PDTPYSVTVT ALYADGEGSA VKDNGKTLPR AGPRNMRVFD
ATTSTLTIGW DHAEGPVRQY KISYAPMTGD PITESTLVLG NRNNAMLQNL LPDTPYNITV
EAIYAEGPGG SLNGNGRTIG MLSPRNLRIS DEWYTRFRVA WDPVTAPIQG YRLIYTPPID
FFVGDVSSYT LTNLQPGTTY DVKVLAQYTS GVSAPLIGQG TTLYLNVTNI ETYAIGHDKF
CIKWTPHRAA TSYRIKLNPA DPSSKGQHEI TIPAGLPQYC FDGLSPDALY TATVFVQTPN
LEGPGVSTKA RTLQEPTPDP TTPPTPTAPP TIPPGWAVCK GAKADVVFLI DGSWSIGEES
FTKVVHFVSD MIAAFDVIGP SGMQVSFVQY SDNAKTEFKL KAYKDKGLAM AAPSYIRYRG
GNTKTGEALK HTYEKAFSLE NGMRRNVPKV VVAITDGRSQ DEVKKNAARL QHAGYSVFAI
GVADMDFIEL QQIGSKPSER HVFVVDDFDA FNTIKENLIT FICETATSCY VPLNFCEWIY
FTWMLEAFNL TEKTYSYIKG VSMEPGSFNS YTAYRLHKNA FLTQPTTDIH PLGLPHAYTI
IMMFRLLPDT PKEAFDIWQV SSKDHKPETG VTLDPSSQTV SFYNKDESGE LQRVTFDTRE
VKKIFHGSFH KLHILVSPTN VQLNIDCQVV AEKPIKAAGN MSSDGYQVLG KMSKSIGSKG
ESATFQLQMF DIVCNLAWTS RDRCCDLPSR RDEMKCPSLP NACTCTSEAN GPPGPQGPYT
STGTRDINNK FLPCLNISLV CILQGPLGPR GEIGPPGPMG LPGPQGPSGL SIPGEAGRPG
PKGDPGDSGL PGVSIGPQGK EGPAGPRGAP GPMGPPGSPG VPGTTGKPGN PGDSGPPVSL
LAGLCGDFAP QNMMRSIARQ VCGQLVQEQM TRVNSLLNQI PSNMYRNNNP GPPGPPGPPG
NQGPRGEPGA TGRNGFPGSS GLPGQQGERG TSPSGESRTG PPGPSGPAGS RGPPGRPGYA
GVRGPPGPPG YCDSSQCVGI PYNGQGYIGM
//