GenomeNet

Database: UniProt
Entry: A0A8X7X661_POLSE
ID   A0A8X7X661_POLSE        Unreviewed;       663 AA.
AC   A0A8X7X661;
DT   14-DEC-2022, integrated into UniProtKB/TrEMBL.
DT   14-DEC-2022, sequence version 1.
DT   28-JAN-2026, entry version 9.
DE   SubName: Full=MARCO protein {ECO:0000313|EMBL:KAG2460847.1};
DE   Flags: Fragment;
GN   Name=Marco {ECO:0000313|EMBL:KAG2460847.1};
GN   ORFNames=GTO96_0011021 {ECO:0000313|EMBL:KAG2460847.1};
OS   Polypterus senegalus (Senegal bichir).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Polypteriformes; Polypteridae; Polypterus.
OX   NCBI_TaxID=55291 {ECO:0000313|EMBL:KAG2460847.1, ECO:0000313|Proteomes:UP000886611};
RN   [1] {ECO:0000313|EMBL:KAG2460847.1, ECO:0000313|Proteomes:UP000886611}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Bchr_013 {ECO:0000313|EMBL:KAG2460847.1};
RX   PubMed=33545088;
RA   Bi X., Wang K., Yang L., Pan H., Jiang H., Wei Q., Fang M., Yu H., Zhu C.,
RA   Cai Y., He Y., Gan X., Zeng H., Yu D., Zhu Y., Jiang H., Qiu Q., Yang H.,
RA   Zhang Y.E., Wang W., Zhu M., He S., Zhang G.;
RT   "Tracing the genetic footprints of vertebrate landing in non-teleost ray-
RT   finned fishes.";
RL   Cell 0:0-0(2021).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KAG2460847.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; JAATIS010004753; KAG2460847.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A8X7X661; -.
DR   Proteomes; UP000886611; Unassembled WGS sequence.
DR   GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR008906; HATC_C_dom.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24637; COLLAGEN; 1.
DR   PANTHER; PTHR24637:SF421; CUTICLE COLLAGEN DPY-2; 1.
DR   Pfam; PF01391; Collagen; 1.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF05699; Dimer_Tnp_hAT; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000886611};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..24
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           25..663
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5036446409"
FT   DOMAIN          539..578
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          580..635
FT                   /note="HAT C-terminal dimerisation"
FT                   /evidence="ECO:0000259|Pfam:PF05699"
FT   REGION          138..166
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          414..459
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        419..437
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:KAG2460847.1"
FT   NON_TER         663
FT                   /evidence="ECO:0000313|EMBL:KAG2460847.1"
SQ   SEQUENCE   663 AA;  71169 MW;  B646E8F2399EC5FA CRC64;
     MTAAVLPLIW RVVTIAMLVV MTEAQWWNLF NTEPRTTAPA STVRTTTSSQ LLDQSTSKEM
     LGGHITSEES VLEQTTLHRS PGLTTDFKEN LKMDTTSDDA LFKQTASNDM LTVHSTFIGD
     RLLEHSTTDD ILEAQTTFDD SDLEQTTSGD SLDVETTSEG TVQEQTTPAD LELGHTVSED
     TVTVQATSED FELEQTPFED TLSEQITSDH AEMEETTSGD TELEDTAQVT VRVHTTIEGT
     HLEKSISENP SLNIQTSSDH NEMGQTTVEV SHLENRITEN PSISIHKISD HTEIARTTPQ
     DIVLVQTIFD DIDLEQSTYG RSVLEQTTSS GGVHKKATSN GNSMEQTAFG ISLEALTADV
     SPQNKQMGKK STNRTPLGWT ATKEGVTESD ILCYFLLQGD RGPKGDQGYR GLKGDPGLPG
     FPGPPGIPGP IGPPGPPGLS GTCGDAMPSE HPHPSGFPGP SLPCDCPKLP GVPGPPGLPG
     LPGLPGPPGP SVPSGAPVIS ETCHSACMGH QGKGRPLIPG PIGLPGPPGP PGTQLHTWVF
     ESTSALLASS PIIPEGSMVY IKQEGSVFVR IADGWQQLKY REDVFPNLRV GLHILLTVGT
     SIASCERSFS KLKRILSYLR SSMAQKRLSA LALLSVVREV TDSRHFGELI DKFAAAKARK
     NFL
//