GenomeNet

Database: UniProt
Entry: A0A8S1F5M5_9PELO
LinkDB: A0A8S1F5M5_9PELO
Original site: A0A8S1F5M5_9PELO 
ID   A0A8S1F5M5_9PELO        Unreviewed;      1178 AA.
AC   A0A8S1F5M5;
DT   12-OCT-2022, integrated into UniProtKB/TrEMBL.
DT   12-OCT-2022, sequence version 1.
DT   28-JAN-2026, entry version 16.
DE   RecName: Full=Fibronectin type-III domain-containing protein {ECO:0000259|PROSITE:PS50853};
GN   ORFNames=CBOVIS_LOCUS9117 {ECO:0000313|EMBL:CAB3407148.1};
OS   Caenorhabditis bovis.
OC   Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC   Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC   Caenorhabditis.
OX   NCBI_TaxID=2654633 {ECO:0000313|EMBL:CAB3407148.1, ECO:0000313|Proteomes:UP000494206};
RN   [1] {ECO:0000313|EMBL:CAB3407148.1, ECO:0000313|Proteomes:UP000494206}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Laetsch R D., Stevens L., Kumar S., Blaxter L. M.;
RL   Submitted (APR-2020) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:CAB3407148.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CADEPM010000006; CAB3407148.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A8S1F5M5; -.
DR   OrthoDB; 5983381at2759; -.
DR   Proteomes; UP000494206; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   CDD; cd00063; FN3; 2.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 1.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 2.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR003961; FN3_dom.
DR   InterPro; IPR036116; FN3_sf.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF845; COLLAGEN ALPHA-1(XXII) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   Pfam; PF00041; fn3; 2.
DR   SMART; SM00060; FN3; 2.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   SUPFAM; SSF49265; Fibronectin type III; 1.
DR   PROSITE; PS50853; FN3; 2.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Reference proteome {ECO:0000313|Proteomes:UP000494206};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..19
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           20..1178
FT                   /note="Fibronectin type-III domain-containing protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5035789673"
FT   DOMAIN          27..121
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          131..229
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   REGION          735..867
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          955..989
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        786..801
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        820..833
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        958..967
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        980..989
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1178 AA;  129285 MW;  2B1E3AC469ED4AE5 CRC64;
     MRTKGVSLTL IFILRNANAI FKHENYVPNA PQNVRVKTQA TSAVLWWEAP SDTSVLIRGY
     TVEYGEESIS KRIVIEGPDS TSFTVTRLEP DTNYVFAVSA YNEADGEDGA KVMLTAKTLP
     LDGYTSEKLW PPTSVRSKIL DDNVAIVSWD DPNAEQSSEN SIDAHQRQYI VNYGIHGEEI
     QQKVRSNAKS VKLTNLLPGK EYEVAVKVAA GNGMESSWSI RDLFVVPEVK KPYEASKFDW
     SCHIDDENMC SLSLSPHWKF CAAREDHHTK RMSGVCPKTS YPHHTALIST PPIELPNAQN
     LCLHMQYALL QFHPGHMKVE IIGGSNASTR NVVWKTKMAN VKTHIIQNLY LPIARQINPF
     KIAISVKWDI EHQIPRLIFH HFAVSSGGCA ERTIDTETEV DLLAPVEASL NADSRVFRAK
     GIDSLPAIGI QRGVEIAVPY RLYLPRNFFK QFSLLATVKP MDKMGGYLFA AVNAFDSAVD
     IGLLIEPSGT KQSNISFIIR STAIASFIVE DFSQEWTQFA LEVSPTTVTF YFKCQRYGTK
     HISSLPNMNF DEAEKLYIAS AGPIIDNGFE VMRVELIAIL LLAAGLANCH DAIDGDDDDE
     VLLTENWEQL AFIDNETDER NPTVYQIQLA HDNVHLRHVR SSNEDTSIPP VEPIEGAIQE
     LKLIDDASQG ASQCDEQWID GSGSAEMSNA PEVEVRKSQI LPPIPSPPPP LPNPIQQEDQ
     ITVAKMLPQT CQQVCKGEIG PMGPPGPKGS DGMDGVPGSH GAPGRPGEPG PMGPPGLQGE
     RGDQGPPGPA GMPGLPGPPG PVAAHVNQVG GQSVAAVEGP PGPPGQPGLP GPPGRDGFHG
     EQGPMGPPGP PGPRGEDGRS VENLSEADVE RIARRVAALS KYNKVAENVH KPSVDFGRAS
     MTPNIYATTV ELFASARATS VGQLAFATSS QQLFIRVNNG WKEVQLTHFH PAVEARKSAQ
     NSRQNDVGEN PPMSIWQPRP VEHPQRDGAH NKDRKIHMIA LSNPTNGNMH GLRGADLQCY
     REARAAGFTT TFRAMLSNNV QDLVKIVHHA DADTPVVNLA GQHLFPSWRA FINGAHISQA
     QLFSFDRHDV LHDSKWPDKR IWHGSKADGT RGETYCDGWR RADANQHSLA GFVFSNTSLL
     QSATRVACDT KLVVLCVENM SKYHIEKILR LRRLQHDF
//
DBGET integrated database retrieval system