ID R0LKQ5_ANAPL Unreviewed; 899 AA.
AC R0LKQ5;
DT 26-JUN-2013, integrated into UniProtKB/TrEMBL.
DT 26-JUN-2013, sequence version 1.
DT 22-FEB-2023, entry version 32.
DE SubName: Full=G patch domain-containing protein 1 {ECO:0000313|EMBL:EOB01003.1};
DE Flags: Fragment;
GN ORFNames=Anapl_00369 {ECO:0000313|EMBL:EOB01003.1};
OS Anas platyrhynchos (Mallard) (Anas boschas).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Galloanserae; Anseriformes; Anatidae;
OC Anatinae; Anas.
OX NCBI_TaxID=8839 {ECO:0000313|EMBL:EOB01003.1, ECO:0000313|Proteomes:UP000296049};
RN [1] {ECO:0000313|Proteomes:UP000296049}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23749191; DOI=10.1038/ng.2657;
RA Huang Y., Li Y., Burt D.W., Chen H., Zhang Y., Qian W., Kim H., Gan S.,
RA Zhao Y., Li J., Yi K., Feng H., Zhu P., Li B., Liu Q., Fairley S.,
RA Magor K.E., Du Z., Hu X., Goodman L., Tafer H., Vignal A., Lee T.,
RA Kim K.W., Sheng Z., An Y., Searle S., Herrero J., Groenen M.A.,
RA Crooijmans R.P., Faraut T., Cai Q., Webster R.G., Aldridge J.R.,
RA Warren W.C., Bartschat S., Kehr S., Marz M., Stadler P.F., Smith J.,
RA Kraus R.H., Zhao Y., Ren L., Fei J., Morisson M., Kaiser P., Griffin D.K.,
RA Rao M., Pitel F., Wang J., Li N.;
RT "The duck genome and transcriptome provide insight into an avian influenza
RT virus reservoir species.";
RL Nat. Genet. 45:776-783(2013).
CC -!- SIMILARITY: Belongs to the GPATCH1 family.
CC {ECO:0000256|ARBA:ARBA00008600}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KB743139; EOB01003.1; -; Genomic_DNA.
DR AlphaFoldDB; R0LKQ5; -.
DR Proteomes; UP000296049; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0006397; P:mRNA processing; IEA:InterPro.
DR InterPro; IPR011666; DUF1604.
DR InterPro; IPR000467; G_patch_dom.
DR PANTHER; PTHR13384; G PATCH DOMAIN-CONTAINING PROTEIN 1; 1.
DR PANTHER; PTHR13384:SF19; G PATCH DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF07713; DUF1604; 1.
DR Pfam; PF01585; G-patch; 1.
DR PROSITE; PS50174; G_PATCH; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000296049}.
FT DOMAIN 128..148
FT /note="G-patch"
FT /evidence="ECO:0000259|PROSITE:PS50174"
FT REGION 169..204
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 631..899
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 631..652
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 653..672
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 767..796
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 848..885
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:EOB01003.1"
FT NON_TER 899
FT /evidence="ECO:0000313|EMBL:EOB01003.1"
SQ SEQUENCE 899 AA; 101000 MW; 70387B1D64B3B0C2 CRC64;
GERLKKPVPL QEQTVKDAKG RYQRFHGAFT GGFSAGYFNT VGTKEGWTPS AFISSRQKRA
DRTVLGPEDF MDEEDLSEFG IAPKDITTTD DFASKAKDRI KEKARQIAGV VAAIPGTTAF
DDLIGPSKIT IGVELLRKMG WKDGQGIGPR VKRKPRRQKP DPEVKIYGCA LPPGLSEGSE
DEEDEYQPEN VTFAPKDVMP VDLTPKENVH GLGYKGLDPT QALFGVSGRE HVNLFAGSED
PSNSLGDLRH NRGRKLGITG QAFGVGALEE EDDDIYATET LSKYDTVLKD EEPGDGLYGW
TAPKQYKYKK RSDTEVKYIG KILDGFSLAS KSSAPNKIYL PPDLPRNYRP VHYFRPVIAA
GNENYHLQKA LEESTGKLGS NTTQQSRHAL NASQRREQLG EAVLKGPARS VMEYLSEKDR
ERLKEVKQAS EQQMKAKTLP QPSRNSRFQP ASADDGFQKW QMLLGGQIAN AGSSDFKPFA
KDPEKQKRYE NFVRSLKQGE KDTLERHLDP TMTEWERGRE QEEFFRAAMF YKSSNSTLSS
RFTRAKYEDD VDKVEVPRDQ ENDTDDKETA VKMKMFGKLT RDKFEWHPEK LLCKRFNVPD
PYPNSSIVGL PKVKRDKYSV FNFLTLPEPT TSVTQETNEK NQQNSSLSKP KKPSRWDVSD
KEKEKKDSIS EFISLARSKA DLQQKPPVPT TEESKSKADL QQKPPEPTTE ECGTRASETL
PSEVADEDKD QEEESRPSMD LFKAIFVSSS DEKSSSSDEE SEEEQQPTTS VTDSETAKQV
NLPDSSSSNA QDNVSAKAEP GVSLLPASKQ ELDAAEEFGP KLPPAFPFGS TWQQETAVPA
SFPGPSRKEK HKKNREKQKT KRERKHKKEK KKKHRKHKTK GKHKNKKSEK DSSSDTTDS
//