ID A0A151NG01_ALLMI Unreviewed; 1452 AA.
AC A0A151NG01;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 31.
DE RecName: Full=Ovostatin {ECO:0008006|Google:ProtNLM};
GN ORFNames=Y1Q_0018185 {ECO:0000313|EMBL:KYO35579.1};
OS Alligator mississippiensis (American alligator).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC Alligator.
OX NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO35579.1};
RN [1] {ECO:0000313|EMBL:KYO35579.1, ECO:0000313|Proteomes:UP000050525}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO35579.1};
RX PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA Ray D.A.;
RT "Sequencing three crocodilian genomes to illuminate the evolution of
RT archosaurs and amniotes.";
RL Genome Biol. 13:415-415(2012).
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC -!- SIMILARITY: Belongs to the protease inhibitor I39 (alpha-2-
CC macroglobulin) family. {ECO:0000256|ARBA:ARBA00010952}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KYO35579.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AKHW03003163; KYO35579.1; -; Genomic_DNA.
DR RefSeq; XP_006277342.1; XM_006277280.3.
DR GeneID; 102565486; -.
DR KEGG; amj:102565486; -.
DR CTD; 2; -.
DR eggNOG; KOG1366; Eukaryota.
DR OrthoDB; 2970602at2759; -.
DR Proteomes; UP000050525; Unassembled WGS sequence.
DR GO; GO:0005615; C:extracellular space; IEA:InterPro.
DR GO; GO:0004867; F:serine-type endopeptidase inhibitor activity; IEA:UniProtKB-KW.
DR CDD; cd02897; A2M_2; 1.
DR Gene3D; 1.50.10.20; -; 1.
DR Gene3D; 2.20.130.20; -; 2.
DR Gene3D; 2.60.120.1540; -; 1.
DR Gene3D; 2.60.40.1930; -; 2.
DR Gene3D; 2.60.40.1940; -; 1.
DR Gene3D; 2.60.40.690; Alpha-macroglobulin, receptor-binding domain; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 2.
DR InterPro; IPR009048; A-macroglobulin_rcpt-bd.
DR InterPro; IPR036595; A-macroglobulin_rcpt-bd_sf.
DR InterPro; IPR011625; A2M_N_BRD.
DR InterPro; IPR041813; A2M_TED.
DR InterPro; IPR047565; Alpha-macroglob_thiol-ester_cl.
DR InterPro; IPR011626; Alpha-macroglobulin_TED.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR014756; Ig_E-set.
DR InterPro; IPR001599; Macroglobln_a2.
DR InterPro; IPR019742; MacrogloblnA2_CS.
DR InterPro; IPR002890; MG2.
DR InterPro; IPR041555; MG3.
DR InterPro; IPR040839; MG4.
DR InterPro; IPR008930; Terpenoid_cyclase/PrenylTrfase.
DR PANTHER; PTHR11412; MACROGLOBULIN / COMPLEMENT; 1.
DR PANTHER; PTHR11412:SF170; OVOSTATIN; 1.
DR Pfam; PF00207; A2M; 1.
DR Pfam; PF07703; A2M_BRD; 1.
DR Pfam; PF07677; A2M_recep; 1.
DR Pfam; PF01835; MG2; 1.
DR Pfam; PF17791; MG3; 1.
DR Pfam; PF17789; MG4; 1.
DR Pfam; PF07678; TED_complement; 1.
DR SMART; SM01360; A2M; 1.
DR SMART; SM01359; A2M_N_2; 1.
DR SMART; SM01361; A2M_recep; 1.
DR SMART; SM01419; Thiol-ester_cl; 1.
DR SUPFAM; SSF49410; Alpha-macroglobulin receptor domain; 1.
DR SUPFAM; SSF81296; E set domains; 1.
DR SUPFAM; SSF48239; Terpenoid cyclases/Protein prenyltransferases; 1.
DR PROSITE; PS00477; ALPHA_2_MACROGLOBULIN; 1.
PE 3: Inferred from homology;
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Protease inhibitor {ECO:0000256|ARBA:ARBA00022690};
KW Reference proteome {ECO:0000313|Proteomes:UP000050525};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Serine protease inhibitor {ECO:0000256|ARBA:ARBA00022900};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..17
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 18..1452
FT /note="Ovostatin"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5007586031"
FT DOMAIN 447..594
FT /note="Alpha-2-macroglobulin bait region"
FT /evidence="ECO:0000259|SMART:SM01359"
FT DOMAIN 726..816
FT /note="Alpha-2-macroglobulin"
FT /evidence="ECO:0000259|SMART:SM01360"
FT DOMAIN 1353..1440
FT /note="Alpha-macroglobulin receptor-binding"
FT /evidence="ECO:0000259|SMART:SM01361"
SQ SEQUENCE 1452 AA; 162799 MW; 77B98DA3B69F0A72 CRC64;
MWLRFLLGAF LLHVTAGQTP ELQYVLMVPS VLQNDSPYQL CLQFLNLNES VSVSIVLEYN
AVNTTIFDEI MKKKDAFQCN TITVPRATFS PLAFITFSAV GRTVRLLERR SVAIQNTDSI
VFIQTDKPIY KPGQTVMFRV VALNTSFRPV QEMYPLITIQ DPQGNRIFQW LDVTSQTAIV
QLMFQLIKEP ILGDYQITVE KRSGDKIRHT FTAKEYVLPK FELKINAPKT ISVINPDFTV
KVCGMYTYGQ PVEGTIQLSV CRNFNLYGAC KKDPICQAVT KQLNKDGCLS QVFSSKIFEL
SRSGYWMSLD VKATVTEKGT GVQISDSAYV PITQVLGSVR FENMDRYYKR GLPYFGQIKV
VDKDDSPIKN EVVQLFLSEK NICNYTTDDN GTVQFKIDTS EMFNPEFSLK AVYKTSDVCH
MEGWLVPFYT EAFFSIQRFY SWTNSFVRIE PVWKELNCGL NKLITVHYIL NKKQYRGATS
VNFFYLGMAR GKIILHGKKE VNVGDALKGA FSISLTINEK LAPVLQLLVY TLHPAREIVA
DSARIQIEKC FENKVQLKFS QEEAVPASNV SLLIKAAANS HCALRAVDQS VLLLKPEQQL
SAETVYSLLP LQDLFGYYFK NLNLEDDRKD PCIPTDNIFH NGLYYTPVTS NLGPDVYMFF
KEMGMKVFTN SRLRQPVVCE SERYRPEFLS RPGVADFSTV HMAGMSGVNA REAKVIETVR
KFFPDTWIWD LVPVGSTGKA NLTFTVPDTI TEWKASMFCV AEEAGFGISV PASLTAFQMF
FVEMTLPYSI IRGEDFLLRA NVFNYMGTCN QVKVFLANSE DYLVQLLSPS DADGCVCSNE
QKTYVWKIIS KNIGEVTFNI TAEILDGGSC QGKSAGDLVI RWKDTLIRTL LVEPEGIEKE
VTQSSLICTK DGAASRSMSL KLPTNVVEGS DRAFFSVIGD ILGTAMQNLH QLLQMPFGCG
EQNMVLFAPN IYILDYLNKT RQLSEETKSK AIGYLVSGYQ KQMSYKHPDG SYSIFGSRDE
EGNTWLTAFV YKCLAQGNRY IYIDSNVQEQ TLVWLSGKQK PDGCFQSVGK LFNNALKGGV
DDELSLSAYI TIALVEAGLP TSHTVVRNGL FCLEAASEKG ISKVYVQALL AYAFCLIGNQ
AKCNFFLKEL DKSAKEVGGT IHWEREEKPL TESFPSFSAR APSAEVEMTA YVLLALLNKP
NRTLGDLTRA SQIVQWVVRQ QNPYGGFSST QDTVIALQAL ADYGAASYSE VGRNTVSISS
SKPFKKVFIV NNRNRLLLQQ APLPDVPGNY SLEVNGSGCV FVQTTLRYNI PLPQKASGFT
LSVKVENASC ANPLGLKFDI IISTSYTGKR NVSNMAIVDV QMLSGFVPVK SSWQKLLDDN
TVMQVETKQN HILFYIDSVS RVKTRISFTV EQELSVFNLK PVPVLIYDYY EPDENAIAKY
KMPCNETISE SS
//