ID A0A3M0J5G7_HIRRU Unreviewed; 1237 AA.
AC A0A3M0J5G7;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE RecName: Full=TSP C-terminal domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=DUI87_33821 {ECO:0000313|EMBL:RMB89806.1};
OS Hirundo rustica rustica.
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Passeriformes; Sylvioidea; Hirundinidae;
OC Hirundo.
OX NCBI_TaxID=333673 {ECO:0000313|EMBL:RMB89806.1, ECO:0000313|Proteomes:UP000269221};
RN [1] {ECO:0000313|EMBL:RMB89806.1, ECO:0000313|Proteomes:UP000269221}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Chelidonia {ECO:0000313|EMBL:RMB89806.1};
RC TISSUE=Blood {ECO:0000313|EMBL:RMB89806.1};
RA Formenti G., Chiara M., Poveda L., Francoijs K.-J., Bonisoli-Alquati A.,
RA Canova L., Gianfranceschi L., Horner D.S., Saino N.;
RT "A high quality draft genome assembly of the barn swallow (H. rustica
RT rustica).";
RL Submitted (JUL-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the thrombospondin family.
CC {ECO:0000256|ARBA:ARBA00009456}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RMB89806.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QRBI01000268; RMB89806.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A3M0J5G7; -.
DR STRING; 333673.A0A3M0J5G7; -.
DR Proteomes; UP000269221; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:InterPro.
DR GO; GO:0005509; F:calcium ion binding; IEA:UniProtKB-UniRule.
DR GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR CDD; cd00054; EGF_CA; 2.
DR CDD; cd16077; TSP-5cc; 1.
DR Gene3D; 1.20.5.10; -; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.10.25.10; Laminin; 3.
DR Gene3D; 4.10.1080.10; TSP type-3 repeat; 2.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR003367; Thrombospondin_3-like_rpt.
DR InterPro; IPR017897; Thrombospondin_3_rpt.
DR InterPro; IPR008859; Thrombospondin_C.
DR InterPro; IPR039081; TSP-5_cc.
DR InterPro; IPR024665; TSP/COMP_coiled-coil.
DR InterPro; IPR046970; TSP/COMP_coiled-coil_sf.
DR InterPro; IPR028974; TSP_type-3_rpt.
DR PANTHER; PTHR10199:SF88; CARTILAGE OLIGOMERIC MATRIX PROTEIN; 1.
DR PANTHER; PTHR10199; THROMBOSPONDIN; 1.
DR Pfam; PF11598; COMP; 1.
DR Pfam; PF07645; EGF_CA; 2.
DR Pfam; PF02412; TSP_3; 6.
DR Pfam; PF05735; TSP_C; 1.
DR SMART; SM00181; EGF; 3.
DR SMART; SM00179; EGF_CA; 3.
DR SUPFAM; SSF58006; Assembly domain of cartilage oligomeric matrix protein; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 2.
DR SUPFAM; SSF103647; TSP type-3 repeat; 3.
DR PROSITE; PS50026; EGF_3; 1.
DR PROSITE; PS01187; EGF_CA; 1.
DR PROSITE; PS51234; TSP3; 3.
DR PROSITE; PS51236; TSP_CTER; 1.
PE 3: Inferred from homology;
KW Calcium {ECO:0000256|ARBA:ARBA00022837, ECO:0000256|PROSITE-
KW ProRule:PRU00634}; Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Reference proteome {ECO:0000313|Proteomes:UP000269221};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 381..418
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REPEAT 777..812
FT /note="TSP type-3"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00634"
FT REPEAT 836..871
FT /note="TSP type-3"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00634"
FT REPEAT 973..1008
FT /note="TSP type-3"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00634"
FT DOMAIN 1012..1226
FT /note="TSP C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51236"
FT REGION 242..284
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 774..794
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 834..984
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 242..261
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 893..920
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 922..937
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 938..953
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1237 AA; 137092 MW; 43186C02D5A9B8B6 CRC64;
MDVVVPDTCE GLRWWRDHSE GDVVVPDTCE GLRWWRDHSE GDVVVPDTCE GLRWWTGHSK
GDLVAPDTCE GLRWWTGHSE GDVVAPDTCE GLRLGTGHSE GDVVAPDTCE VLRWWTGHSE
GDVVAPDTCE GLRSGTGHSE GDVVAPDTCE GLRSGTGHSE GDVVAPDTYQ ELEFTGQSHG
KRRNSVSVDS SWSQPGLSSL ENLWNASGTP WQGDTGNKQS FLLCSCAELG LEMIYREFTA
GLGKESDESK ENKDPSRPLH SAATRKDAAF SRNPGNALGT GGEVGPEMLQ EMRETNRVLL
EVRDLLKQQI KEITFLKNTV MECDACGMRP EVTGPVITMT QFGRCVPNPC FPGVPCTESA
GGFRCGPCPA GYSGNGTHCT DINECNANPC FPKVQCINTN PGFRCDPCPP GFTGQLLEGV
GMAFARANKQ VCTDINECET GAARNCVPNS ICINTRGSYK CGACKPGFVG DQISGCRSQA
APGTRRCPNG EISPCHEKAE CIVERDGSLS CQTGVWFHPM EIGVWFHPME IGVWFLPMEI
GVWFHPMEIG VWFHPVEIGV WFHPMEMGVW FHPMDIGVWF HPMEIGVWFH PVEIGVWFHP
MEMGVWFHPM DIGVWFHPME IGVWFHPVEI GVWFHPVEIG VWFHPMEIGV WFHPVEIGVC
FLPMEIGVWF HPMEIGVWFL PIDTGVWFLP MEIGVCFHPI DIGVCFHLME IGVWFHPIDT
GVWSLPVEIG CLVGWAGNGY VCGKDTDIDG VPDEKQRCSD KKCRKDNCVT VPNSGQEDAD
RDGIGDACDD DADGDGIPNA EDNCVYTRNT DQRNADKDNF GDACDNCRQV KNNDQRDIDG
DGKGDECDDD MDGDGIKNPM DNCRRVPNPD QRDGDGDGVG DACDSCPTLS NPDQKDTDHD
LVGDVCDTNQ DSDGDGHQDS RDNCPTVPNS SQVDTDNDGL GDECDEDDDD DGIPDFRPPG
PDNCRLVPNP GQEDSDGDGV GNVCEDDFDR DMVIDRIDVC PENAEVTLTD FRAFQTVVLD
PEGDAQIDPN WIVLNQGMEI VQTMNSDPGL AVGYTAFNGV DFEGTFHVNT ATDDDYAGFI
FGYQDSSSFY VVMWKQMEQT YWQANPFRAV AEPGIQLKAV KSKTGPGEYL RNSLWHTGDT
TDQVKLLWKD PRNSGWKDKT SYRWFLQHRP QVGYIRARFY EGPEVVADTG VVLDTTMRGG
RLGVFCFSQE NIIWSNLRYR CNDTIPEDYE TFRVQQD
//