ID A0A182NH72_9DIPT Unreviewed; 1478 AA.
AC A0A182NH72;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 31.
DE RecName: Full=Titin {ECO:0008006|Google:ProtNLM};
OS Anopheles dirus.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=7168 {ECO:0000313|EnsemblMetazoa:ADIR006995-PA, ECO:0000313|Proteomes:UP000075884};
RN [1] {ECO:0000313|Proteomes:UP000075884}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=WRAIR2 {ECO:0000313|Proteomes:UP000075884};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Walton C., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles dirus WRAIR2.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:ADIR006995-PA}
RP IDENTIFICATION.
RC STRAIN=WRAIR2 {ECO:0000313|EnsemblMetazoa:ADIR006995-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 7168.A0A182NH72; -.
DR EnsemblMetazoa; ADIR006995-RA; ADIR006995-PA; ADIR006995.
DR VEuPathDB; VectorBase:ADIR006995; -.
DR OrthoDB; 4232090at2759; -.
DR Proteomes; UP000075884; Unassembled WGS sequence.
DR GO; GO:0009653; P:anatomical structure morphogenesis; IEA:UniProt.
DR GO; GO:0030154; P:cell differentiation; IEA:UniProt.
DR CDD; cd00063; FN3; 2.
DR CDD; cd00096; Ig; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 4.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR036179; Ig-like_dom_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR013098; Ig_I-set.
DR InterPro; IPR003599; Ig_sub.
DR InterPro; IPR003598; Ig_sub2.
DR InterPro; IPR010629; Ins_allergen.
DR PANTHER; PTHR13817:SF185; STRETCHIN-MLCK, ISOFORM U; 1.
DR PANTHER; PTHR13817; TITIN; 1.
DR Pfam; PF00041; fn3; 2.
DR Pfam; PF07679; I-set; 2.
DR Pfam; PF06757; Ins_allergen_rp; 1.
DR PRINTS; PR00014; FNTYPEIII.
DR SMART; SM00060; FN3; 2.
DR SMART; SM00409; IG; 2.
DR SMART; SM00408; IGc2; 2.
DR SUPFAM; SSF49265; Fibronectin type III; 2.
DR SUPFAM; SSF48726; Immunoglobulin; 2.
DR PROSITE; PS50853; FN3; 2.
DR PROSITE; PS50835; IG_LIKE; 2.
PE 4: Predicted;
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..15
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 16..1478
FT /note="Titin"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5013289112"
FT DOMAIN 235..335
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 341..424
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 434..527
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 532..625
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT REGION 203..246
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 609..665
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 709..732
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 745..782
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 867..1004
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1020..1361
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 633..647
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 717..731
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 748..767
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 868..892
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 893..909
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 910..924
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 925..959
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 970..996
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1020..1039
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1089..1103
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1117..1144
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1167..1184
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1210..1227
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1282..1306
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1309..1327
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1337..1361
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1478 AA; 162661 MW; 65907B9BABE83B38 CRC64;
MKFIILLALF GAAFGQNLRA EVDQLLPFLN MEQVRSIYQR YVQTDAQVGE IWSFLQSAET
DAAWRVLIST PELQEISAWT EVRGVSIRDY LNGIAAMLGL TPITRSVNSK SPASRSWSAM
MDEIRAATDL AGAEALARSF IANPASEFGE LYRMTQARRA GFTSIIQHPD VTRFSAQLRA
FGVDVDQIIA RFQAVVKAPE RLASPSGTMG NQPTKHTTGK PKKAVHWKSA EPPAPPGKPA
LVPGSPSSAP DIVTIRWSRP VSDGGSPILG YVVEQRRTGS PHWVRACPSL VQQPELSLSG
LEPGWRYQFR IMAENIVGRS ESSELSDPLT VTLQRNAISA PRFVHELFDA TTVENEKVEF
RVQVAGTPAP QISWFKDGFE IYSSRRARIV TEADVSVLVI HQAALTDEGE IKCAATNRAG
HTVTRCHLTI DAVPKIRLPR QYEDGLIIEA EEVIRMKVGL AGRPPPLVEW CHNGEPLDND
GRHEIVTTDK NSTLKVSSTK RADRGEYHIR ALNKLGEDNA SFLVTVTARP EPPGRVTVAI
SYGKTATLSW SAPGDDGGCK IGNYIVEYYR VGWDVWLKAA TCRQLTATLS DLIEGSQYRF
RVKAENPYGL SDPSEESDVL FVPDPKRGIT DASKMKTQPS TLPRKRRDLS QSPLRESNAK
KAFTPEPYGR DEIMREMSYG TPLDLELGHG YTTAASPGVP ALIVTEPAPL AEPESESESI
GPSTNVGRGP AASVTPLSLK AMQKFSKEPS PLSLNDGNST DQMGSGSVTG GEEAPRAQQP
NAKAAIHNSS EFMLVLYNEQ EAKKSTRNST FDFELDELVA PPPPLSLSAP ELNVEPPPAP
IMRAGVSSTE LLYERAMARF YQAVAYEESE NQRKEAEQEQ QALRRRQAQE QPQDDKSTVS
TTSVSNRLAD RRSSLRKRLS GDKESIVKQS SFEQDQAVLP PPQQQQQQQQ LSHSTLAEET
IAEEPSPGVD QVDSYTTLPL GYSESEESVS SSMSSMIEEL KRREQQQKLL EQQVLVLPKR
RGFMDDDETD EEPYHPGGER MRSPYRNPDP SQAIEVLTRP MPLPDPNFVP KPILKRPSTE
VLAKQGEPPA NKPTSSSTSS TTLLAGPPPP PKQPTPEREK SPVKTPTPPK KAEEKPRQEE
PEHPGPEAAA LGQKQPPAQQ DIKRAIVSEA ARQRRLETRQ SSIEESRAVA DFYSDMVQQI
ESSKKPRKLP LYMSPDEVRK LNEREYSRSS SRASSQSPLP PPDPYLLRGR TSRSPSLATD
HPVVVQRGSR PPRESKILRQ RSASIETESI AASSSRPSST VPKDPAESSA GQIRDEENGR
ISRAVEKKRA SLSGGAGVRN RSRSNSAART SVAQSSQLTT KVQPTVPVAA EPVTARTAAI
TNKQRTPEPV PAPVSKPIAS EAVEPKLAAA ILPEEGDLKP TISYLTDVAL FAIACWLYLF
KNPKLAIPVI LLIMYRHIRE AVKDKMPAWM RRRTEVTS
//