ID A0A8X7X661_POLSE Unreviewed; 663 AA.
AC A0A8X7X661;
DT 14-DEC-2022, integrated into UniProtKB/TrEMBL.
DT 14-DEC-2022, sequence version 1.
DT 28-JAN-2026, entry version 9.
DE SubName: Full=MARCO protein {ECO:0000313|EMBL:KAG2460847.1};
DE Flags: Fragment;
GN Name=Marco {ECO:0000313|EMBL:KAG2460847.1};
GN ORFNames=GTO96_0011021 {ECO:0000313|EMBL:KAG2460847.1};
OS Polypterus senegalus (Senegal bichir).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Polypteriformes; Polypteridae; Polypterus.
OX NCBI_TaxID=55291 {ECO:0000313|EMBL:KAG2460847.1, ECO:0000313|Proteomes:UP000886611};
RN [1] {ECO:0000313|EMBL:KAG2460847.1, ECO:0000313|Proteomes:UP000886611}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Bchr_013 {ECO:0000313|EMBL:KAG2460847.1};
RX PubMed=33545088;
RA Bi X., Wang K., Yang L., Pan H., Jiang H., Wei Q., Fang M., Yu H., Zhu C.,
RA Cai Y., He Y., Gan X., Zeng H., Yu D., Zhu Y., Jiang H., Qiu Q., Yang H.,
RA Zhang Y.E., Wang W., Zhu M., He S., Zhang G.;
RT "Tracing the genetic footprints of vertebrate landing in non-teleost ray-
RT finned fishes.";
RL Cell 0:0-0(2021).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KAG2460847.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JAATIS010004753; KAG2460847.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A8X7X661; -.
DR Proteomes; UP000886611; Unassembled WGS sequence.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR Gene3D; 3.40.1620.70; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR008906; HATC_C_dom.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24637; COLLAGEN; 1.
DR PANTHER; PTHR24637:SF421; CUTICLE COLLAGEN DPY-2; 1.
DR Pfam; PF01391; Collagen; 1.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF05699; Dimer_Tnp_hAT; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000886611};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..24
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 25..663
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5036446409"
FT DOMAIN 539..578
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 580..635
FT /note="HAT C-terminal dimerisation"
FT /evidence="ECO:0000259|Pfam:PF05699"
FT REGION 138..166
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 414..459
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 419..437
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KAG2460847.1"
FT NON_TER 663
FT /evidence="ECO:0000313|EMBL:KAG2460847.1"
SQ SEQUENCE 663 AA; 71169 MW; B646E8F2399EC5FA CRC64;
MTAAVLPLIW RVVTIAMLVV MTEAQWWNLF NTEPRTTAPA STVRTTTSSQ LLDQSTSKEM
LGGHITSEES VLEQTTLHRS PGLTTDFKEN LKMDTTSDDA LFKQTASNDM LTVHSTFIGD
RLLEHSTTDD ILEAQTTFDD SDLEQTTSGD SLDVETTSEG TVQEQTTPAD LELGHTVSED
TVTVQATSED FELEQTPFED TLSEQITSDH AEMEETTSGD TELEDTAQVT VRVHTTIEGT
HLEKSISENP SLNIQTSSDH NEMGQTTVEV SHLENRITEN PSISIHKISD HTEIARTTPQ
DIVLVQTIFD DIDLEQSTYG RSVLEQTTSS GGVHKKATSN GNSMEQTAFG ISLEALTADV
SPQNKQMGKK STNRTPLGWT ATKEGVTESD ILCYFLLQGD RGPKGDQGYR GLKGDPGLPG
FPGPPGIPGP IGPPGPPGLS GTCGDAMPSE HPHPSGFPGP SLPCDCPKLP GVPGPPGLPG
LPGLPGPPGP SVPSGAPVIS ETCHSACMGH QGKGRPLIPG PIGLPGPPGP PGTQLHTWVF
ESTSALLASS PIIPEGSMVY IKQEGSVFVR IADGWQQLKY REDVFPNLRV GLHILLTVGT
SIASCERSFS KLKRILSYLR SSMAQKRLSA LALLSVVREV TDSRHFGELI DKFAAAKARK
NFL
//