ID A0A3L8SB41_CHLGU Unreviewed; 3272 AA.
AC A0A3L8SB41;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE RecName: Full=CO7A1 protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=DV515_00009730 {ECO:0000313|EMBL:RLV99448.1};
OS Chloebia gouldiae (Gouldian finch) (Erythrura gouldiae).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Passeriformes; Passeroidea; Passeridae;
OC Chloebia.
OX NCBI_TaxID=44316 {ECO:0000313|EMBL:RLV99448.1, ECO:0000313|Proteomes:UP000276834};
RN [1] {ECO:0000313|EMBL:RLV99448.1, ECO:0000313|Proteomes:UP000276834}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Red01 {ECO:0000313|EMBL:RLV99448.1};
RC TISSUE=Muscle {ECO:0000313|EMBL:RLV99448.1};
RX PubMed=30282656;
RA Toomey M.B., Marques C.I., Andrade P., Araujo P.M., Sabatino S.,
RA Gazda M.A., Afonso S., Lopes R.J., Corbo J.C., Carneiro M.;
RT "A non-coding region near Follistatin controls head colour polymorphism in
RT the Gouldian finch.";
RL Proc. R. Soc. B 285:0-0(2018).
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC -!- SIMILARITY: Belongs to the sauvagine/corticotropin-releasing
CC factor/urotensin I family. {ECO:0000256|ARBA:ARBA00009287}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RLV99448.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QUSF01000032; RLV99448.1; -; Genomic_DNA.
DR STRING; 44316.ENSEGOP00005008986; -.
DR Proteomes; UP000276834; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR GO; GO:0005179; F:hormone activity; IEA:InterPro.
DR GO; GO:0004867; F:serine-type endopeptidase inhibitor activity; IEA:InterPro.
DR GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 9.
DR CDD; cd22627; Kunitz_collagen_alpha1_VII; 1.
DR CDD; cd01450; vWFA_subfamily_ECM; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 9.
DR Gene3D; 4.10.410.10; Pancreatic trypsin inhibitor Kunitz domain; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR000187; CRF.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR002223; Kunitz_BPTI.
DR InterPro; IPR036880; Kunitz_BPTI_sf.
DR InterPro; IPR020901; Prtase_inh_Kunz-CS.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR37456:SF5; -; 1.
DR PANTHER; PTHR37456; SI:CH211-266K2.1; 1.
DR Pfam; PF01391; Collagen; 14.
DR Pfam; PF00473; CRF; 1.
DR Pfam; PF00041; fn3; 8.
DR Pfam; PF00014; Kunitz_BPTI; 1.
DR Pfam; PF00092; VWA; 2.
DR PRINTS; PR00759; BASICPTASE.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 9.
DR SMART; SM00131; KU; 1.
DR SMART; SM00327; VWA; 2.
DR SUPFAM; SSF57362; BPTI-like; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 6.
DR SUPFAM; SSF53300; vWA-like; 2.
DR PROSITE; PS00280; BPTI_KUNITZ_1; 1.
DR PROSITE; PS50279; BPTI_KUNITZ_2; 1.
DR PROSITE; PS50853; FN3; 9.
DR PROSITE; PS50234; VWFA; 2.
PE 3: Inferred from homology;
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000276834};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..3272
FT /note="CO7A1 protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5017998624"
FT DOMAIN 37..210
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 232..328
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 329..417
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 419..506
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 507..595
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 600..688
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 711..811
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 814..902
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 906..994
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 995..1083
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1087..1261
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 3056..3106
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT REGION 1265..1520
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1591..1631
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1701..1788
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1879..2101
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2122..2425
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2464..2866
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3170..3230
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1310..1325
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1401..1420
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1454..1468
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1879..1896
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1975..1989
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2005..2022
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2139..2169
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2209..2238
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2294..2311
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2401..2425
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2589..2603
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2616..2651
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3177..3191
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3272 AA; 337183 MW; A0144FBF15C369CB CRC64;
MSSQLLLLAV LPALLRPPAA VAQKRSQAVC EDVLEADIAF LVDGSSSIGR NNFRAVRTFM
EELVGPFVQV VGEKAVRFAM AQYSDDPRVE FSFSQHTDGT SVRRAIQQLS YKGGNTRTGA
GFRYIADNFF GPTQRRPGVP QICILITDGK SQDDAEGPAA KLKSQGIKVF AVGIKNADRK
ELIRVASMPT DAFFFYVGDF KLLDTLVPLI TRRVCTTVGG TLRLLDGPSH TGPSNLEIPE
QGLDHLQIRW RAASGPITGY RVQYVPLTGL GQPIMAERQE VSVGPRETNT VLRGLRVGME
YLVTVTAQYA NSIGESVSGR ARTQSRAGSV LDFRVVETGP TFLRLAWQPG PEPPRGYSLS
YAVQGAPQRE EKSLAASAHS ATLSDLQPDT EYVVMLQPHY AQQPAVPATL TARTRHLVGV
KHLTVHNVSA QSMLLAWQSV SGATGYRLSW ATLTGQDRHR VDLDARQMSH ALVGLQPNTD
YVVTVAPLFR QLEGPAATVR QRTEAGSVQT LQTNILGPTS IQVLWTSARD ARGYRLEWKR
ATGLEPPRTV SLPSSTNTYQ LMGLKPGTEY RITLYTLYDG GEVATPVTTF QTGVEAPVGA
VSDLRLVEES GRWVRLSWTG VLGATEYKVV VRNNQDGTER TRRIPGSQTG LELGDLREGV
TYLVRVSALA GSREGNAATL TVRLSDAVVL WAGAEYPDVG SISDLRVMEA GPSQLRVTWR
GLPGAGGYLL TWQGSDGECM DHGVLWQQCW SPSYPSHLLG LKKSRFLPAD PTTFTIEGLH
ANIVYTISVS AIVDGREGSP VTATGQIVPE QVGKVTQLEV QASRSNIARV TWVGVPGATA
YRVVWSRRDG GLENSRRVPG HTNYFDIPNL EGGVSYTVKV TALIGNREGN PVSVVVTTPE
AVPLRPVSGF QVTEASEHHL HLTWLPVAGS TGYRLFWRLA EGGPQHSQQL PATSSSYALS
GLEPGRHYQI SITSLAGSRE SEPATITAIT TAPSHITSLR VTEVQRDSVT LTWTPVPGAS
GYILSWSPPA AGGEKGQTLP STASSQQVSG LRLGQRYTFT LRPLLGSAPG AEVSVSERTV
CRDALGDVVF LVHGTRNSSS GANAVRTLLS NTVSALGRLG PEGTQVALAT YSYRSLPWLL
LNRSSDLPTV LEQIRAMRYE DPSGNAIGAA ITFARTYLLS PGAGRRLGVP AVLVILADSP
SGDDAITVAR DVKAGGVQVL AVGLEGADQE QLQRMVTSED PRYIYHGRDL AELEGELTDD
LCTIISTKPA PKPKPCTVKC PKGEKGDRGE AGLQGRMGPP GLPGQPGRHG LPGSPGPPGP
QGPPGESAEG PGKKGDRGIQ GIPGPRGDPG TRGPMGLTGL KGEKGEPGEP RVITDGVRGL
PGQKGEPGVP GSLGSPGIPG PRGPLGDPGP PGPLGPMGLP GPPGEFVKGE KGDRGERGPP
GFVDGAIPRG DTGPPGLPGD PGPRGPAGPP GLKGEKGDGI EGFPGPPGRP GDPGDRVSFL
REGSSGTTGG AGQEGEPCRD PRDALAMPLA ERRGFWVLCQ DPGCPPKVLL LLTLGFSYTR
ETVGHRVSWE RRARRETAVC RAQRVRRVRW ESQGTQGHQA ERHCQPPEVS NLATGSSRTD
WPPGGKGEGK VGSQWGGRGL LGKGLAVTPV SLQGDPGPPG RPAPSVAGVG EKVTLAQLCP
SPSGHGCGLG RGIAQLDSSW SQAWSTAAGD RGFPGPEGPP GPKGETGDKG ASGPPGLSIP
GPAGPKGEQG DRGIVGLPGR SGPKGDPGEP GEKGEPGRAG TPGQIGLRGK EVVWDKGLES
EERKVTKAPR VKRVCLASLE TGVPGAFLAT VAPPGRRVTW GTQDLKAGMA ALEHQGARVT
VGTQGLPDLQ DELSMLASEE WGERGDPGDP GEDGAKGARG DAGSPGLPGE RGVEGPRGPP
GTRGDRGPPG LDGRNGLEGK AGPPGPAGLR GDPGKPGDPG RDGLPGLHGE QGPPGPTGPL
GPPGVPGKPG EDGKPGLNGK NGEDGTPGED GRKGDKGDAG LPGREGRSGA KGEQGDRGVP
GPLGPPGLPG VPGQVGPPGQ GSPGLRGVAG PKGDPGEPGL PGEPGRPGNP GARGDPGTTV
NIERGLESLG IKVSSLKELT GAYDGSSDSF PPAVERLRGQ KGSRGDPGER GPPGREGDRG
EKGDRGEQGR DGLPGPPGPP GPKVDVVEGS LMGLPGERGP IGPKGAKGEP GAEGERGPKG
DKGEGGLRGD RGEPGEKGRD GNPGLPGERG LAGPEGKPGS TGPPGTQGPA GAKGDPGEPG
SSIRGLPGPQ GNMGLPGPLG PPGPGGPPGV PGLPGQVGES GKPGVPGRDG VPGKDGEAGV
PGKMGLPGPS GPAGPKGEPG DAGAPGQAIA GPPGAKGEKG EPALLEGVLL GEPGSKGDRG
LPGPKGEKGE PGRLGEPGDP GEDVSRRHIG VVGGYWGQWM LVGAGICAPA TYCPGLPGGL
VSSQGAKGSS GAKGEKGSVG VGARGPPGQD GPPGLKGDTG LPGLPGPPGL AGTAGMPGQP
GLRGDNGQPG PPGPPGERGL IGFPGRDGAA GPPGPVGPPG PAGIQGAPGL KGDKGAPGAG
LPGARGERGD PGPRGEDGRP GPEGDRGPAG LPGNRGERGD KGERGLKGTE GDKGDKGEQG
MLGEKGTRGE QGEKGSMGFP GARGPSGQKG EVGAPGEPGE PGQPGRDGIP GARGEKGDMG
PLGMRGLKGD RGMKGACGLN GDKGEKGEPG IPGRSGLPGR KGEPGELGLS GPPGIPGKEG
LMGPKGDRGF DGQQGAKGDQ GEKGDRGAPG VMGGPGPRGS DGAPGPPGPS GSVGPRGPEG
MQGQKGERGP PGQSVPGARG MPGIPGERGE QGSPGTHGLR GEKGEPGMTE EEIRAFVRQE
MSQHCACGGH FPQHQDSREH SGCSSSFLSP CPTPVAQKKP RHPGGWGAQA RSSFHPAQGV
GRIVRQEKEH DQTGDGESGG SVRSGDMGMC ASLCPPLSLS CWAPKSFSTQ PFLAGSAHIV
PVLKLSHAEE DEGQDRQITL DNDDLTYDSM DGEEDDYDEV PEMDSLEQPT VDAEPCRLPL
DEGQCQRYTL RWYYNQRVAQ CRPFVYSGCQ GNLNRFDSKE ECELHCGQRP GPDSRGSAGS
KAPLATLQGR MLLSFLLLLG TPTRAWKGPS HEWLLPAPQT VDGGKLMRQE STGKMLPDPS
KVRRGDDGSG SGSHPDEASL PLLEGSERQA LPWLMSPTTK REAPRKKGRK VSLSFDVHTH
LLKILLDLAR EKELQAKAAA NAELMARLGR RR
//