ID A0A2I3M8W2_PAPAN Unreviewed; 2768 AA.
AC A0A2I3M8W2;
DT 28-FEB-2018, integrated into UniProtKB/TrEMBL.
DT 25-MAY-2022, sequence version 2.
DT 27-MAR-2024, entry version 31.
DE RecName: Full=Thyroglobulin {ECO:0000256|ARBA:ARBA00017326};
GN Name=TG {ECO:0000313|Ensembl:ENSPANP00000032197.2};
OS Papio anubis (Olive baboon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Papio.
OX NCBI_TaxID=9555 {ECO:0000313|Ensembl:ENSPANP00000032197.2, ECO:0000313|Proteomes:UP000028761};
RN [1] {ECO:0000313|Ensembl:ENSPANP00000032197.2, ECO:0000313|Proteomes:UP000028761}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Liu Y.L., Abraham K.A., Akbar H.A., Ali S.A., Anosike U.A., Aqrawi P.A.,
RA Arias F.A., Attaway T.A., Awwad R.A., Babu C.B., Bandaranaike D.B.,
RA Battles P.B., Bell A.B., Beltran B.B., Berhane-Mersha D.B., Bess C.B.,
RA Bickham C.B., Bolden T.B., Carter K.C., Chau D.C., Chavez A.C.,
RA Clerc-Blankenburg K.C., Coyle M.C., Dao M.D., Davila M.L.D.,
RA Davy-Carroll L.D., Denson S.D., Dinh H.D., Fernandez S.F., Fernando P.F.,
RA Forbes L.F., Francis C.F., Francisco L.F., Fu Q.F., Garcia-Iii R.G.,
RA Garrett T.G., Gross S.G., Gubbala S.G., Hirani K.H., Hogues M.H.,
RA Hollins B.H., Jackson L.J., Javaid M.J., Jhangiani S.J., Johnson A.J.,
RA Johnson B.J., Jones J.J., Joshi V.J., Kalu J.K., Khan N.K., Korchina V.K.,
RA Kovar C.K., Lago L.L., Lara F.L., Le T.-K.L., Lee S.L., Legall-Iii F.L.,
RA Lemon S.L., Liu J.L., Liu Y.-S.L., Liyanage D.L., Lopez J.L.,
RA Lorensuhewa L.L., Mata R.M., Mathew T.M., Mercado C.M., Mercado I.M.,
RA Morales K.M., Morgan M.M., Munidasa M.M., Ngo D.N., Nguyen L.N.,
RA Nguyen T.N., Nguyen N.N., Obregon M.O., Okwuonu G.O., Ongeri F.O.,
RA Onwere C.O., Osifeso I.O., Parra A.P., Patil S.P., Perez A.P., Perez Y.P.,
RA Pham C.P., Pu L.-L.P., Puazo M.P., Quiroz J.Q., Rouhana J.R., Ruiz M.R.,
RA Ruiz S.-J.R., Saada N.S., Santibanez J.S., Scheel M.S., Schneider B.S.,
RA Simmons D.S., Sisson I.S., Tang L.-Y.T., Thornton R.T., Tisius J.T.,
RA Toledanes G.T., Trejos Z.T., Usmani K.U., Varghese R.V., Vattathil S.V.,
RA Vee V.V., Walker D.W., Weissenberger G.W., White C.W., Williams A.W.,
RA Woodworth J.W., Wright R.W., Zhu Y.Z., Han Y.H., Newsham I.N.,
RA Nazareth L.N., Worley K.W., Muzny D.M., Rogers J.R., Gibbs R.G.;
RT "Whole Genome Assembly of Papio anubis.";
RL Submitted (MAR-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSPANP00000032197.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the type-B carboxylesterase/lipase family.
CC {ECO:0000256|ARBA:ARBA00005964}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00500}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 9555.ENSPANP00000032197; -.
DR Ensembl; ENSPANT00000033906.2; ENSPANP00000032197.2; ENSPANG00000011411.3.
DR GeneTree; ENSGT00940000159300; -.
DR OMA; IQCDGPP; -.
DR Proteomes; UP000028761; Chromosome 8.
DR Bgee; ENSPANG00000011411; Expressed in thyroid gland and 17 other cell types or tissues.
DR ExpressionAtlas; A0A2I3M8W2; baseline.
DR GO; GO:0005615; C:extracellular space; IEA:Ensembl.
DR GO; GO:0005179; F:hormone activity; IEA:UniProtKB-KW.
DR GO; GO:0042802; F:identical protein binding; IEA:Ensembl.
DR GO; GO:0042446; P:hormone biosynthetic process; IEA:UniProtKB-KW.
DR GO; GO:0015705; P:iodide transport; IEA:Ensembl.
DR GO; GO:0031641; P:regulation of myelination; IEA:Ensembl.
DR GO; GO:0030878; P:thyroid gland development; IEA:Ensembl.
DR GO; GO:0006590; P:thyroid hormone generation; IEA:Ensembl.
DR CDD; cd00191; TY; 8.
DR Gene3D; 3.40.50.1820; alpha/beta hydrolase; 1.
DR Gene3D; 4.10.800.10; Thyroglobulin type-1; 10.
DR Gene3D; 2.10.50.10; Tumor Necrosis Factor Receptor, subunit A, domain 2; 1.
DR InterPro; IPR029058; AB_hydrolase.
DR InterPro; IPR002018; CarbesteraseB.
DR InterPro; IPR019819; Carboxylesterase_B_CS.
DR InterPro; IPR016324; Thyroglobulin.
DR InterPro; IPR000716; Thyroglobulin_1.
DR InterPro; IPR036857; Thyroglobulin_1_sf.
DR InterPro; IPR011641; Tyr-kin_ephrin_A/B_rcpt-like.
DR PANTHER; PTHR14093; HLA CLASS II GAMMA CHAIN; 1.
DR PANTHER; PTHR14093:SF19; THYROGLOBULIN; 1.
DR Pfam; PF00135; COesterase; 1.
DR Pfam; PF07699; Ephrin_rec_like; 1.
DR Pfam; PF00086; Thyroglobulin_1; 10.
DR PIRSF; PIRSF001831; Thyroglobulin; 1.
DR SMART; SM01411; Ephrin_rec_like; 1.
DR SMART; SM00211; TY; 10.
DR SUPFAM; SSF53474; alpha/beta-Hydrolases; 1.
DR SUPFAM; SSF57610; Thyroglobulin type-1 domain; 11.
DR PROSITE; PS00941; CARBOXYLESTERASE_B_2; 1.
DR PROSITE; PS00484; THYROGLOBULIN_1_1; 6.
DR PROSITE; PS51162; THYROGLOBULIN_1_2; 11.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00500}; Hormone {ECO:0000256|ARBA:ARBA00022702};
KW Iodination {ECO:0000256|ARBA:ARBA00022653};
KW Reference proteome {ECO:0000313|Proteomes:UP000028761};
KW Signal {ECO:0000256|SAM:SignalP};
KW Sulfation {ECO:0000256|ARBA:ARBA00022641};
KW Thyroid hormone {ECO:0000256|ARBA:ARBA00022920};
KW Thyroid hormones biosynthesis {ECO:0000256|ARBA:ARBA00022534}.
FT SIGNAL 1..19
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 20..2768
FT /note="Thyroglobulin"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5035291110"
FT DOMAIN 31..92
FT /note="Thyroglobulin type-1"
FT /evidence="ECO:0000259|PROSITE:PS51162"
FT DOMAIN 93..160
FT /note="Thyroglobulin type-1"
FT /evidence="ECO:0000259|PROSITE:PS51162"
FT DOMAIN 161..245
FT /note="Thyroglobulin type-1"
FT /evidence="ECO:0000259|PROSITE:PS51162"
FT DOMAIN 298..358
FT /note="Thyroglobulin type-1"
FT /evidence="ECO:0000259|PROSITE:PS51162"
FT DOMAIN 605..658
FT /note="Thyroglobulin type-1"
FT /evidence="ECO:0000259|PROSITE:PS51162"
FT DOMAIN 659..726
FT /note="Thyroglobulin type-1"
FT /evidence="ECO:0000259|PROSITE:PS51162"
FT DOMAIN 727..921
FT /note="Thyroglobulin type-1"
FT /evidence="ECO:0000259|PROSITE:PS51162"
FT DOMAIN 1005..1073
FT /note="Thyroglobulin type-1"
FT /evidence="ECO:0000259|PROSITE:PS51162"
FT DOMAIN 1074..1145
FT /note="Thyroglobulin type-1"
FT /evidence="ECO:0000259|PROSITE:PS51162"
FT DOMAIN 1146..1210
FT /note="Thyroglobulin type-1"
FT /evidence="ECO:0000259|PROSITE:PS51162"
FT DOMAIN 1511..1565
FT /note="Thyroglobulin type-1"
FT /evidence="ECO:0000259|PROSITE:PS51162"
FT REGION 1829..1851
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2728..2768
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2752..2768
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 63..70
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00500"
FT DISULFID 72..92
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00500"
FT DISULFID 131..138
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00500"
FT DISULFID 140..160
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00500"
FT DISULFID 164..183
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00500"
FT DISULFID 338..358
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00500"
FT DISULFID 638..658
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00500"
FT DISULFID 1042..1049
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00500"
FT DISULFID 1181..1188
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00500"
FT DISULFID 1190..1210
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00500"
SQ SEQUENCE 2768 AA; 304757 MW; B7BD86C283499A7D CRC64;
MALVLEIFSL LASVCWVSAN IFEYQVDAQP LRPCELQRER AFLKQADYVP QCAEDGSFQT
VQCQNDGRSC WCVGADGSEV LGSRQPGRPV ACLSFCQLQK QQILLSGYIN STDTSYLPQC
QDSGDYTPVQ CDVQQVQCWC VDAEGMEVYG TRQLGRPKRC PRSCEIRNRR LLHGVGDKSP
PQCSAEGEFM PVQCKFVNTT DMMIFDLVHS YNRFPDAFVT FSSFQRRFPE VSGYCHCADS
QGRELAETGL ELLLDEIYDT IFAGLDLPST FTETTLYRIL QRRFLAVQSV ISGRFRCPTK
CEVERFTATN FGHPYVPSCR RNGDYQAVQC QTEGPCWCVD VQGKEIHGTR QQGEPPSCAE
GRSCASKRQQ ALSRLYFGTS GYFSQHDLFS SPEKRWASPR VARFATSCPP TIKELFVDSG
LLRPMVEGQS QQFSVSESLL KEAIRAIFPS RGLARLALRF TTNPKRLQQN LFGGKFLVNL
GQFNLSGALG TRGTFNFSQF FQQLGLASFS NGGRLEDLAK PVSVGLDSNS STGTPEAAKK
DVAMNKPIVG SFGFEINLQE NQNALKFLAS LLELPEFLLF LQHAISVPED VARDLGDVME
TVLRSQTCEQ TPERLFVPSC TTEGSYEDVQ CFAGECWCVD SWGKELPGSR VRGGQPRCPT
DCEKQRARMQ SLMGSQPAGS SLFVPACTSE GYFLPVQCFN SECYCVDAEG QAIPGTRSAM
GKPKKCPTPC QLQAEQAFLR TVQALLSNSS MLPTLSDTYI PQCSADGQWR QVQCDGPPEQ
VFEWYRRWEA QNKGQELMPA ELLVKIMSYR EAASGNFGLF IQSLYEAGQQ GVFPVLSQYP
SLQDVPLAAL EGNRSQSREN VLLDPYLFWQ ILNGQLSRYP GPYSDFSTPL AHFDLRNCWC
VDEAGQELEG TRAEPSKLPT CPGSCEEAKL LVLQFIRETE EIVSASNSSR FPLGESFLVA
KGIRLRNEDL GLPPLFPLRE ALAEQFLRGS DYAIRLAAQS TLSFYQRCRF SLDDSAGASA
LLRLGPYVPQ CDAFGSWEPV QCHTGTGHCW CVDEKGGFIP ASLTARSLQI PQCPTTCMKS
RTSGLLSSWK QARSRGNPSP KDLFIPACLE TGEYDRLQAS EGGTWCVDPA SGEELLPGSN
SSAQCPSLCN VLKSGVLSRR VGSGYVPACR EEDGGFSPVQ CDQAQGSCWC VTDSGEEVPE
TRVAGSQPAC ESPQCPLPFN TLEVVGGTIL CETASGPTGA AIQQCQLLCR QGSRSVFPTG
PLICSLESRR WESQLPQPRA CQRPQLWQTI QTQGHFQLQL PPGKMCSADY AGLLQAFQVF
ILDELTARGF CQIQVKTFGT PVSIPVCDNS SVQVGCLTRE YLGVNVTWKS RLEDIPVASL
PDLHDIERAL VGKDLLGRFT DLIQSGSFQL HLDSKTFPAE TIRFLQGDHF GTSPRTWFGC
LEGFYQVLTS EASQDGLGCV KCPEGSYSQD EECIPCPVGF YQEQAGSLAC VPCPAGRMTI
SAGAFSQTHC VTDCQRNEAG LQCDQNGEYR ASQRDRGSGK AFCVDGEGWR LPWSETEAPL
EDSQCLMLQK FEKAPESKVI FDASAPVAVR SKVPDSEFPV MQCLTDCAED EACSFLTVSM
TEPEISCDFY AWTSDNVACM TSDQKQDALG NSKATSFESL RCQVKVRSRG QDSPAVYLKK
GQGSTTTLQK SFEPTGFQNM LSGLYNPIVF SASGANLTDA HLFCLLACDR DLCCDGFVLT
QVQGVAIICG LLSSPNVLLC NVKDWMDPSE ARANATCPGV TYDQESRQVT LRLGGQEFIK
SLTPLEGTQD TFTNFQQVYL WKDSDMGSRP ESMGCRKDTV PRPASPTETG LTTDLFSPVD
LDQVIVNGNR SLPSRKHWLF KHLFSAQQAN LWCLSRCVQE HSFCQFAEIT ESASLYFTCT
LYPEAQVCDD ILESNDQGCR LILPQRPKAL FQKKVILEDK VKNFYTRLPF QKLMGISIRN
KVPMSEKSIS NGFFECERRC DADPCCTGFG FLNVSQLKGG EVTCLTLNSL GLQMCSEENG
GAWRILDCGS PDIEVHTYPF GWYQKPIAQN NAPSFCPSVV LPSLTEKVSL DSWQSLALSS
VVVDPSIRHF DVAHVSTAAT SNFSAVRDLC LSECSQHEDC LITTLQTQPG AVRCMFYADT
HSCTYSLQGQ NCRLLLREEA THIYRKPGIS LLSYEASVPS VLIVTHGRLL GRSQAIQVGT
SWKQVDQFLG VPYAAPPLAE RRFRAPEPLN WTGSWDASKP RASCWQPGTR TSTSPGVSED
CLYLNVFIPQ NVAPNASVLV FFHNTMDREG SEGWPAIDGS FLAAVGNLIV VTASYRVGVF
GFLSSGSGEV SGNWGLLDQV AALTWVQTHI RGFGGDPRRV SLAADRAGAD VASIHLLMAR
ATNSQLFRRA VLMGGSALSP AAIISHERAQ QQAVALAKEV SCPVSSSQEV VSCLRQKPAN
ILNDAQTKLL AVSGPFHYWG PVIDGQFLRE PPARALKRSL RAEVDLLIGS SQDDGLINRA
KAVKQFEESQ GRTSSKTAFY QALQNSLGGE DSDARVEAAA TWYYSLEHST DDYASFSRAL
ENATRDYFII CPIIDMASAW AKRARGNVFM YHAPESYGRG SLELLADVQF AFGLPFYPAY
EGQFSLEEKS LSLKIMQYFS HFIRSGNPNY PYEFSRKVPT FATPWPDFVP YAGGENYKEF
SALLPNRQGL KKADCSFWSK YISSLKASAD GAKGGQSAES EEEELTAGSG LTEDLLSLQE
PGSKSYSK
//