ID A0AAD9V535_ACRCE Unreviewed; 2772 AA.
AC A0AAD9V535;
DT 29-MAY-2024, integrated into UniProtKB/TrEMBL.
DT 29-MAY-2024, sequence version 1.
DT 28-JAN-2026, entry version 8.
DE SubName: Full=Collagen alpha-1(XXVII) chain {ECO:0000313|EMBL:KAK2561573.1};
GN ORFNames=P5673_015557 {ECO:0000313|EMBL:KAK2561573.1};
OS Acropora cervicornis (Staghorn coral).
OC Eukaryota; Metazoa; Cnidaria; Anthozoa; Hexacorallia; Scleractinia;
OC Astrocoeniina; Acroporidae; Acropora.
OX NCBI_TaxID=6130 {ECO:0000313|EMBL:KAK2561573.1, ECO:0000313|Proteomes:UP001249851};
RN [1] {ECO:0000313|EMBL:KAK2561573.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=K2 {ECO:0000313|EMBL:KAK2561573.1};
RX PubMed=37804092;
RA Selwyn J.D., Vollmer S.V.;
RT "Whole genome assembly and annotation of the endangered Caribbean coral
RT Acropora cervicornis.";
RL G3 (Bethesda) 0:0-0(2023).
RN [2] {ECO:0000313|EMBL:KAK2561573.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=K2 {ECO:0000313|EMBL:KAK2561573.1};
RX PubMed=37769073;
RA Vollmer S.V., Selwyn J.D., Despard B.A., Roesel C.L.;
RT "Genomic signatures of disease resistance in endangered staghorn corals.";
RL Science 381:1451-1454(2023).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KAK2561573.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JARQWQ010000032; KAK2561573.1; -; Genomic_DNA.
DR Proteomes; UP001249851; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF914; OTOLIN-1; 1.
DR Pfam; PF01391; Collagen; 5.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:KAK2561573.1};
KW Reference proteome {ECO:0000313|Proteomes:UP001249851};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 280..468
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT REGION 1..22
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 504..557
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 575..606
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 668..723
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 755..818
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 897..916
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1012..1150
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1284..1305
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1319..1514
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1559..1640
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1675..2518
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 504..520
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 521..533
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 543..557
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 582..606
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 668..685
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 693..703
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 710..723
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 770..782
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1013..1026
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1038..1054
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1055..1069
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1075..1110
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1116..1127
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1130..1145
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1284..1294
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1319..1328
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1352..1416
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1417..1434
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1439..1449
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1457..1482
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1491..1510
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1608..1621
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1694..1707
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1716..1731
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1797..1809
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1846..1855
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1961..1981
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2140..2159
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2164..2173
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2241..2250
FT /note="Gly residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2260..2282
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2284..2293
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2314..2327
FT /note="Gly residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2328..2343
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2361..2376
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2379..2394
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2407..2416
FT /note="Gly residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2420..2429
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2455..2470
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2472..2482
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2483..2495
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2772 AA; 288937 MW; 9BDD2BDAEDED1DB9 CRC64;
MEQPRSDDPS LDSAQKAKER SWDKQQPAVC RLALAVNAKL NLSIVAFQTF YGGDYEPQIP
DLCLFELEED GSGHREVIEI AGRLLEMADW IRKTLRNYRL QPWSVIRERN KRCGENADFD
TSHSRYRLNI ILMPTAKSLC VSVTDYRFKE KLLRYTLGEL GSLKVVGLNI AALRPLTLDS
CREMRDGLGI WITPCGFQIP DPGFWITEVT GFQIFAVFWI PNAVILKSKD EYICQTFLDS
RRKYFLLAEL IDRSSRFSSE KLCVPVAGTP YGTPDSNLRD IDLLGTIGEP LPFGVSFTKG
PNKSPAFHFR SYANVGRFAS YIFPKQFFAE FSITLTIKPK RVRRGLVFSI LPHFRTDKVL
LALEIRDVKG ASVVRLTHAS SEKSKTVFDF EVPDISNKWT WLGFSVKKDG VVFYLNCEEA
VTKFQQSFLG ELSLPPYSVL YIGRAGWSRD SRSSAFESLS YPLVVSWVNC FIHRVISLSW
PCTRIQAKPK NWSVRQIPLP TMSTTPTVTT SLTTAKTTPK LETSSIGAPT DLRSTGRSEE
QTSEPPTTRY NDSGNTVWAS TGQRTAAAST ITTDGISIEE VSPTSEGVET DKPSQQASID
APTASSSNIK ALVTEGDIPS TLNAPSTAAR TDRSNDDCLA TNGEIKELSR GVDSIVTEGS
TVDITTEDSA ASKATDTMRP TTEKASTALE AVTTEPLTST TATTKRKDST GLSQEESASK
VVTITTEAST VGAFSTETSH EESSSVQVSV TIAPSTENTL STDSSDKTEA STQEFSYSSK
DISATEAVKQ GSPASVEGVL TDHPTEKTLS TEVEGTRERP ITDVTTESST ESGTKTAEEE
MAATDAPNLY TLSTKEEVDT DLHTVQTLST KAVSSTQRAT EFTKGVVIED VTTEETLSTS
INPTKRPTED SEPASELTTM KILTEESWPR TEALTTDLSS EAAFTSKEVT ASPTEDAITE
MATAEYATTQ FGKEESSLFG TNPKSVPTEY SWATSEAVSS EFPTKDSLLS EAYATSSAAS
GSQPLTSEPV ETKFQTEDSP TTVEDSVTVT ESPTKNMSST KGEVSTQHST EYKTTEIAAT
KSSSKAETTP TESSSSTTQE IITEFTTEEI PSSRKALTED SSSKDITPES EFNATKPQTE
NNPTTAEDLM IYTDVPTEYT SSTRGVMSLE QSTDDSLLTK SASEYSEVGT TEPPTGEPVT
VTGVRAESMS STRKLLPQHQ TDAAFSVTTT AFESEFTTKE LATEESLSAM ETVTTEFPTE
ESLAKTATTP TNEDTFSTRV AMETQPTTRL SSSGHFDAST KPGIEYSSTM RKTNATQSEF
GASTSAANTV EPRPQATLST NAAVVTKDQT GEDSSTSDPV TTAPPTTHSL PTDAPSSSQR
FTVKSSPSSE MATTALATEG SSSQSEQLTT QFRSGDNSST SATATTAKNS NITSERPTEI
SSLRTTESEL PSKVPFTDTT KLSSRPQSET HSIGVSTTSS VVAHSKVNPK GTLTTDQQIA
TTRSKSTVTL RTGDDFNEPT EAIIARIEGV KQDHGKDSST TSPNTLWSSY ASTVKAKTLT
TTSSTVSRKD DSIVGPRGPS GPPGSPGSKG QKGESIIGPQ GPRGVRGFRG EKGERGSKGD
VGEPGPKGES GDQGAPGPAI FVDSGEEVLT VKGQKGKPGP PATIVREINC TKGECDQHIQ
GPKGEPGKNG LPGVPGPAGL PGPPGAPGEP GLLLDPETKL PIDPPKGDKG ESGMSGVNGS
KGEPGQKGQR GLIGLPGKKG DRGPVGIPGL PGKNGAVGMM GMPGVGIPGV PGPPGEDGSK
GEEGKKGQMG EKGSPGTEGK EGPVGDMGPR GPRGPKGAKG DSVIGPPGPP GPPGPRGDGS
MVDDIFGPAK GNPGNPGPPG PPGNITNVDE LASYLGKHGA KGEKGASGQK GLKGEKGAEG
LNGFKGNVGP MGPRGPVGPS GPPGLPGNVS TAGAPGSKGE PGVDGKPGRP GESGTKGEKG
DSGIAGFPGV KGRPGQKGEV GPQGLGREGK KGQRGEQGLQ GDVGQKGEKG DQGIGIKGEP
GKPGQTGSPG PPGPPGGSSL PPMKGQKGSV GEVGFPGSKG VTGKKGQIGF PGSKGEKGAQ
GPSGKGETGH SGPAGQKGNT GSKGEKGEEG DGGPQGNKGT KGDKGDRGEK GLDGRKGDTG
EGIPGPPGPP GPPGVVTDVG DKHSSAQIKA QKGKKVTEWQ NVHLKGRGTT GEKGTKGEKG
SRGAIGPKGE PGFAGLRGEP GVKGQGGAAG KDGPPGPVGP TGTPGEIGPQ GPVGFPGKKG
VAGPPGPPGP PGTPGIQGEK GEPGVGDPFI GSFEGIGGNG NGNGKLGNGE KGEQGAHGEK
GTKGDSGATG APGPEGPQGP VGPQGIAGPP GFPGEFGPEG EKGESGKPGE KGEEGQPGAP
GKQGVHGSKG GKGDTGFPGE KGRAGEKGDS GQTGPQGPKG EPGSVTTQLG SQGQPGPPGE
PGPSGTPGPR GPAGSPGSTS VIPGPPGPPG LPGPPGKNGK NGKNEHIGHI SQQDDPNMNP
LSPAVVVFDS QVSLVKAVET VNIGTLAFVL SDELLYIKSS GGWRTVVLGP TVFAASTATP
KVTQPTSTSQ LRMIALNAPQ TGRMMSTNGA DEMCWRQAHA ANLTGEFTAF ISNRFQHVYT
IVPIRNRNLP VGNLQGDQLF PSWNSIFSKR YAFNPEIPIY SFNGTDVLKS TIWPFKHIWH
GSKENGNKRD RKNCRSWLSR SSQDFSTATA LNGRVPALDQ EEFPCSSNLI VLCVEVNPRR
RGKLTLRSKH KF
//