ID A0A226MEF7_CALSU Unreviewed; 2368 AA.
AC A0A226MEF7;
DT 25-OCT-2017, integrated into UniProtKB/TrEMBL.
DT 25-OCT-2017, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE RecName: Full=NTR domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=ASZ78_013716 {ECO:0000313|EMBL:OXB53663.1};
OS Callipepla squamata (Scaled quail).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; Odontophoridae;
OC Callipepla.
OX NCBI_TaxID=9009 {ECO:0000313|EMBL:OXB53663.1, ECO:0000313|Proteomes:UP000198323};
RN [1] {ECO:0000313|EMBL:OXB53663.1, ECO:0000313|Proteomes:UP000198323}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Texas {ECO:0000313|EMBL:OXB53663.1,
RC ECO:0000313|Proteomes:UP000198323};
RC TISSUE=Leg muscle {ECO:0000313|EMBL:OXB53663.1};
RA Oldeschulte D.L., Halley Y.A., Bhattarai E.K., Brashear W.A., Hill J.,
RA Metz R.P., Johnson C.D., Rollins D., Peterson M.J., Bickhart D.M.,
RA Decker J.E., Seabury C.M.;
RT "Disparate Historic Effective Population Sizes Predicted by Modern Levels
RT of Genome Diversity for the Scaled Quail (Callipepla squamata) and the
RT Northern Bobwhite (Colinus virginianus): Inferences from First and Second
RT Generation Draft Genome Assemblies for Sympatric New World Quail.";
RL Submitted (JUL-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC -!- SIMILARITY: Belongs to the protease inhibitor I39 (alpha-2-
CC macroglobulin) family. {ECO:0000256|ARBA:ARBA00010952}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OXB53663.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MCFN01001103; OXB53663.1; -; Genomic_DNA.
DR STRING; 9009.A0A226MEF7; -.
DR Proteomes; UP000198323; Unassembled WGS sequence.
DR GO; GO:0005615; C:extracellular space; IEA:InterPro.
DR GO; GO:0004867; F:serine-type endopeptidase inhibitor activity; IEA:UniProtKB-KW.
DR Gene3D; 1.50.10.20; -; 3.
DR Gene3D; 2.20.130.20; -; 1.
DR Gene3D; 2.60.40.1930; -; 4.
DR Gene3D; 2.60.40.1940; -; 1.
DR Gene3D; 6.20.50.160; -; 1.
DR Gene3D; 2.60.40.690; Alpha-macroglobulin, receptor-binding domain; 2.
DR Gene3D; 2.60.40.10; Immunoglobulins; 4.
DR InterPro; IPR009048; A-macroglobulin_rcpt-bd.
DR InterPro; IPR036595; A-macroglobulin_rcpt-bd_sf.
DR InterPro; IPR011625; A2M_N_BRD.
DR InterPro; IPR047565; Alpha-macroglob_thiol-ester_cl.
DR InterPro; IPR011626; Alpha-macroglobulin_TED.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR014756; Ig_E-set.
DR InterPro; IPR001599; Macroglobln_a2.
DR InterPro; IPR019742; MacrogloblnA2_CS.
DR InterPro; IPR002890; MG2.
DR InterPro; IPR041555; MG3.
DR InterPro; IPR040839; MG4.
DR InterPro; IPR008930; Terpenoid_cyclase/PrenylTrfase.
DR PANTHER; PTHR11412; MACROGLOBULIN / COMPLEMENT; 1.
DR PANTHER; PTHR11412:SF150; ZGC:171445 PROTEIN-RELATED; 1.
DR Pfam; PF00207; A2M; 1.
DR Pfam; PF07703; A2M_BRD; 2.
DR Pfam; PF07677; A2M_recep; 2.
DR Pfam; PF01835; MG2; 2.
DR Pfam; PF17791; MG3; 2.
DR Pfam; PF17789; MG4; 2.
DR Pfam; PF07678; TED_complement; 2.
DR SMART; SM01360; A2M; 1.
DR SMART; SM01359; A2M_N_2; 2.
DR SMART; SM01361; A2M_recep; 1.
DR SMART; SM01419; Thiol-ester_cl; 2.
DR SUPFAM; SSF49410; Alpha-macroglobulin receptor domain; 2.
DR SUPFAM; SSF81296; E set domains; 2.
DR SUPFAM; SSF48239; Terpenoid cyclases/Protein prenyltransferases; 2.
DR PROSITE; PS00477; ALPHA_2_MACROGLOBULIN; 2.
PE 3: Inferred from homology;
KW Protease inhibitor {ECO:0000256|ARBA:ARBA00022690};
KW Reference proteome {ECO:0000313|Proteomes:UP000198323};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Serine protease inhibitor {ECO:0000256|ARBA:ARBA00022900};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..17
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 18..2368
FT /note="NTR domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012556393"
FT DOMAIN 434..584
FT /note="Alpha-2-macroglobulin bait region"
FT /evidence="ECO:0000259|SMART:SM01359"
FT DOMAIN 668..758
FT /note="Alpha-2-macroglobulin"
FT /evidence="ECO:0000259|SMART:SM01360"
FT DOMAIN 1157..1245
FT /note="Alpha-macroglobulin receptor-binding"
FT /evidence="ECO:0000259|SMART:SM01361"
FT DOMAIN 1779..1927
FT /note="Alpha-2-macroglobulin bait region"
FT /evidence="ECO:0000259|SMART:SM01359"
SQ SEQUENCE 2368 AA; 267520 MW; 7D45EEBFB7DEA8CF CRC64;
MWSSILSAIT LIWIVAASSE LQYVLLVPTV VRSNSPQTAC VQFHSVSEPL SLSVVLEYSN
VQTTLFKEFV TKNDYFVCHE FKVPPHTSDP LAFISFSAKS NAVNLTERRL VAIENVHDTL
FIQTDKPIYK PGQKVMFRVV TLDSQFRPVQ ETYPRIIVKD PEQNQIFQWL DVSSTHGIIQ
LSFPLIEEPI LGSYHIIVEK KSGDEEHEYF TVEEYVLPKF EMTTNMPRRI SFFDEEIRVN
VCALGGEKQY GPTIKIFACD TAEGQGSLGN DGCLNTVVST KTFQLYRSYT RMYASFNIET
IITENGTGIQ MKNSDYVAVS QENDRVMFRN MDQYYRRGIP YFGEITVTNA DGKPVAGRVV
VLEVNGEYQA NYTTDENGTA AFSLDTSNFF NPSVKLRATQ APDDCEDLFM WRNDHESQAL
FFVRRFYSRT NSFVRIEPVR EKVSCGQQKM INIYYVLSRN RYTNATHTDF YYVVMAKGQI
VLSGQKQVRI SHASAAPWGT FAITLAITEK LTPTSGLLLY TVHPDGEIVA DSSWIHSDIC
FKNKLQLEFS EKQAFPGSKI NIHLEAAANS CCALRAVDQS VFLLQREREL SAESVYYRFR
LSDLYGYYHN GLNLQDDKPE ECTPVKTTFF DGLYYEPVNV SHDGDVYRIF MITAAGVVNT
VRKYFPETWI WDLVHTDSTG EANVFYTIPD TITEWKASAF CLQDVAGFGI SSPVSLTAFQ
PFFVDLTLPY SVIRGEKFNV IANIFNYLNK CIQISATLAE SCDYKTEVLS PEGNSATVCA
NERKTYIWSA SPLSLGEVKF TVTAEAKLNT KAAKNSTSAE EEISHMDTLI QKLLVEPEGV
KKELTQSSLI CTKGTTVSEP VLLNLPRNAV QGSARAYFSV IGDILGTALR NMENLLHMPY
GCGEQNMALF TPNIYVLDYL NKTGQLTEEI GVKGTGYLTT AEKEERYQYF LEKLERRATR
VGGSVYWQRE NKPPAENFPA FYSRAPSAEI EITSYALMAL LNKAKLMPDD LSYISHIVYW
LVKQQNPYGG FSSSQDTVVA LQALAQYGYL TFSKESNNTI KVNFMEIPKK TFQVNDENRF
LLQQTSLPIV PGNYSVEVYG TGCVYMQTTL RYNIHLPKKL AGFFLSVEPA NVVCTSNFPP
KFDLVFSASY TGNRNVSNMA IIDVKMLSGF IPNRSSLKKL QYQDSVVDHV DIKNDHIFFY
LQKLSQTEVS FSFTLEQSLP VSDIKPAPVH VYDYYETGLV FFTFQLDTDG CLSQVLSSKI
FELNRTGYRR NLDVKAIFTE KGTGLQLTAT QSIYITQVIS SLQFENVDHH YRRGIPYVGQ
ILQKRRKEIT FSVEQDFIEA HLKPAPVQIY DYYETEMLLQ KWINESKGLI MEGVSFDLVR
QYMVLLPFLV HTDSPEKVCI QLTHLNESVT LSATLEYQGE NRSLIDDVVS EKDLFTCIPY
SISKLNSTSV ALLTVTVKGE TLQFRRRKSV LVKNPESLVF VQTDKPIYKP GQTVLFRIVS
LDKDFHPLNE KFPFVYVQDP QRNRVYQWQG VELETGLTQL SFPLTSDPIQ GSYKIVVQKN
FISHVEHSFT VEEYVLPKYE VLVKLPKMIT IEDMEFPVST CGLYTYGKPV PGLVNVQVCR
KFSHSASHCY RKEGEAVCEE FTRQADARGC VSAVVRTKIF QLRRRGYEMS IEVQGKITED
GTGRCPIEVK LVDGNDSPIA NETIRISVNG DLYKGNYTTD EQGQSWFSLN TTTFTEASLE
IRAEYKPELN CYDSDWITPS YEHAMRRISR FYSPSKSFLK IEPKLEMLSC GSSTEIQVHY
IFTPEAIEHQ RKIVIYYLVM AKGSIMLADT HDLTVNPGNA YGIFQLTLPA EVNIAPLAQM
LVYTTSPSGE VIASTAEFQV ENCLPNKVNL SFVSEKSLPA SNTSLKLHSS PRSLCALHAV
DRSVLLMKPE EELSPSSVYD LLPVKELRGY SFKDYYLEEE DVNPCVSLDN ILLNGFTYVP
ISPDGEADAY DIFKLLGLKV FTSNKIHKPE VCQHYTAHLM ERSYGGSISA SQLLDDSDYA
VLEGMDAGNP VETIRKYFPE TWIWDIVSVN LHETPLYNLQ VSVSLAESTN FLAAPAEKEE
ESYCICLNER KTVAWSVTAK TLGLMEFLVS TEALQNQQPC GNATVMTPEK GQKDIVIRQL
LVEFSCVLFL PEKSVSESVA LALPENVVDG SARAYFLVLG DTMGAAMQNL HQLLEMPFGC
GEQNMVLFAP NIYVLDYLNK TGQLSEEIKS KAIGYLVNGY QRQLKYKHWD GSYSTFGPHF
GQVGNTWLTA FVLKSFARAR SHIFIEEKHI QDALIWLSQK QKENGCFCSS GVLLNNAMKG
GVNDEITLTA YITIALLEIP LPVTVRVT
//