ID A0A151NCQ9_ALLMI Unreviewed; 4049 AA.
AC A0A151NCQ9;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=Fibrocystin isoform B {ECO:0000313|EMBL:KYO34399.1};
GN Name=PKHD1 {ECO:0000313|EMBL:KYO34399.1};
GN ORFNames=Y1Q_0012989 {ECO:0000313|EMBL:KYO34399.1};
OS Alligator mississippiensis (American alligator).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC Alligator.
OX NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO34399.1};
RN [1] {ECO:0000313|EMBL:KYO34399.1, ECO:0000313|Proteomes:UP000050525}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO34399.1};
RX PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA Ray D.A.;
RT "Sequencing three crocodilian genomes to illuminate the evolution of
RT archosaurs and amniotes.";
RL Genome Biol. 13:415-415(2012).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KYO34399.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AKHW03003504; KYO34399.1; -; Genomic_DNA.
DR STRING; 8496.A0A151NCQ9; -.
DR eggNOG; ENOG502QR85; Eukaryota.
DR Proteomes; UP000050525; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR CDD; cd00102; IPT; 1.
DR CDD; cd00603; IPT_PCSR; 4.
DR Gene3D; 2.60.40.10; Immunoglobulins; 5.
DR Gene3D; 2.160.20.10; Single-stranded right-handed beta-helix, Pectin lyase-like; 2.
DR InterPro; IPR019316; G8_domain.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR014756; Ig_E-set.
DR InterPro; IPR002909; IPT_dom.
DR InterPro; IPR037524; PA14/GLEYA.
DR InterPro; IPR006626; PbH1.
DR InterPro; IPR012334; Pectin_lyas_fold.
DR InterPro; IPR011050; Pectin_lyase_fold/virulence.
DR PANTHER; PTHR46769:SF1; FIBROCYSTIN; 1.
DR PANTHER; PTHR46769; POLYCYSTIC KIDNEY AND HEPATIC DISEASE 1 (AUTOSOMAL RECESSIVE)-LIKE 1; 1.
DR Pfam; PF10162; G8; 2.
DR Pfam; PF01833; TIG; 8.
DR SMART; SM01225; G8; 2.
DR SMART; SM00429; IPT; 6.
DR SMART; SM00710; PbH1; 7.
DR SUPFAM; SSF81296; E set domains; 7.
DR SUPFAM; SSF51126; Pectin lyase-like; 2.
DR PROSITE; PS51484; G8; 2.
DR PROSITE; PS51820; PA14; 1.
PE 4: Predicted;
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000050525};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius}.
FT TRANSMEM 3888..3910
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 356..515
FT /note="PA14"
FT /evidence="ECO:0000259|PROSITE:PS51820"
FT DOMAIN 1982..2103
FT /note="G8"
FT /evidence="ECO:0000259|PROSITE:PS51484"
FT DOMAIN 2776..2902
FT /note="G8"
FT /evidence="ECO:0000259|PROSITE:PS51484"
FT REGION 1..24
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3921..3940
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 4001..4029
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4005..4024
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 4049 AA; 441880 MW; 111D8F92BCB96F0D CRC64;
MPLSSEGWKG SKLDKGSPKI PKASAVTPKS SLEMNALIPF FLNVELLLLT VTSAKVFIDP
REGSIAGGTW ITLRLDDCLG LTSGQLGLWY STNGSPSEVS LVNPTLPRVV CDVHPLHFDV
STIRCKTRSL LQEGVYHPEV TSNGQVISTS GRMTGDNCTF KFAAAQTPVV YQINPPSGVP
GSLIEVYGQV IAGSYETMDF DTDYIDGPVI LEAEDDGWIS LCSFANKQNR SIYPIQGEHG
TGTLKCRVEG NYIGSQNISF SVFNKGKSVV HKDAWLISAK QELFLYQTHP EIVSVFPTSG
SLRGGTRLTI TGDFFDHPLK VTVAGVPCEN EYISPWKIIC ITGPVDKTRR LSAPQPGNRG
LLFEVWDDTV DTGLTEATPG YRWQFVPNAS SPAVFLSQAK QPFRSRLCGF FVAPQTNNYT
FWILADGKAS LYLSLTEDPG SKIKIASLPA GVSQWSEHWE KNWNQSWQPK SQKFELTGGS
RYYLEALHHG TAPSTGVKVG IQLHNTWLNP DVINTYHREK HEIQASAHQL PEIQMLTLSG
TGWFSLSWDN ISSNTISTNA TADQVQTVIE ELLSVKCDTE PSSAKFLFRN GFENVSDQKD
FCTKGYVVSG VEPFCGRFSI CSPKYLVRTP PSTLPRYDVT EYTHVCFAHK GYISNTLHVL
VSYINTFLRT VKKNLTCHWR TQENSLESWE FTCVDLWRSC VNNSELLLDL QANSTVLVHQ
IVLLFPEVDE EPSGKFYLDE IIISDRQVTV FQRDPKPAHP GGNIIEAVSV VGSPPIYNVS
LLVAGCGGHL PLFSLRGTLP SQGSEEDDSL FASLENANIS LRVQRLQTAS PPIGGTFQIL
LPNTVISGVP VHISSHHLHK LLQSSTDNST ARYFNASDFT VTKDSSTCYE AIWTLTWKAK
TGDLPNVISV SAENLTGLNP TVAARVVYDG GVFIGPIFGD MLATCNNSTQ VVVVVNEVLA
NCSGSCSFRY CQEATPLVSD VEYLLDDGLH TKVFIRGSGF TEDHQALRIL VNDTTCYVTA
SNRTDLVCQM KMLPVGLYQV MLLVRPYGFA LNASTGGDIY LRIEPKLVAV EPSRASEIGG
CPVILRGTGF DGISLVLFGS QVCPVNTNNS NSMRIECEVP SRGEDDFNVH VTLIARCQST
VFANVFIYDP SLNPAIVALS RNRSSIAGGQ ILQIGLSSFA AYTGSNVQVQ IGDAAADILG
KTAHGLDVVL PGLAAGWYNV SVILNGILIG SNGVESLIQY ISEIFNIEPC CGSFLGGTML
TISGTGFSPN PALVSVTIDR QTCDVTYLME EAIWCRTPPA AGLSHVQSQD ISAQVLVFIG
SRSTTHTSSL SLQSGDITFT YQRALTPGVS AVEVEMRNGS LHLGIHGLNV TNSLAKLGDS
ACELKSQHTN KSTTFSQCSL PMNTLEPGIY PIRVLQKQLG YANLTASLQD IRVTPQISTV
FPSQGSACGG LVLTISGFAL TSQRNSVRVS LGGNYSCEIK SSVYNMIICT LLPRDQSLGA
WQLPEAYPVL NVTVTVNGIS SICLANCTLH LLEEWTPSVD AVTWEINGTF TDVMLSGQRL
VWATDKPVVR MGNQAPCHVT YWNETSVRCW AASVYAGEHA LSMPNSRRGQ ACFRTGSRIV
SIAPYVLRFY PQHFGGNGGG RLTIEGAAFQ GRSQTSVIIG NHPCLITRAT YNAIQCTVPP
GHGTKALWLE VNSLSYPLGE ISYREEFTPA FLSLLPVVGL LLTVKVSRIT AVENMRVSVG
DSPCTNVNGN RTTLQCLAPQ LPAGEHHIVG HDLLRGWASS NLTFISRLAV TSVHHNFGCL
GGGAVHLHGT GFSPESTSVA ICDAPCVTLG PVTATDLSCL APRLDASLAI LCSLKHSSED
CQETGATYIK CDIRVTVGTD SVTGPAPYIY LCDDDILPWD QTTGDSSLPY FTGLFFSPKV
ERDEVLIYNS SCNITMETEA EMECEGANQP ITAKITEIRK SWGQNTQRHA RLRFCGPWSK
SSSWLDGCPP QDGDNVTVER GQTLLLDTIT GILNLLHVKG GKLLFGGPGP VGLHAHYILV
SDGGKLQVGS PNAPFCCKAH IHLYGSLHTP NFFPFGAKFL AVRNGTLSIH GWVPNVVFTH
LKSAAHVNDT RLVLAEPVDW QSGDEVIVSG TGPGDGEWQE EIVTVEAVNN TELYLRSPLR
FPHGFEEEQM GGQHLSLSAV VALLSRRIVV QGNVTGERMS HLRMCAAAGV SGDASGCLYK
RSEKKLGSQE MGAVVMMQAF QGEESHIRLE GVQFQHVGQA FQQHLSALTI AGTARLTDSY
IRGCSVWDSF GRGLGISGTS DLSVDNNVFY NISGHGLLLG GWLEQGNKIR HNILIGLSGT
DGLSNLEAVS PAGIYIQAPA NQIEGNMVCA AGYGYFFHLS PKRPSQTPVL SFSKNTAHSC
TRYGLLVYPE YQPQCANSLG PVLFQSFMAW RNQGGAQIFR SSNLELQNFQ IHSCKEFGID
IVESLGNTSV TNGLLLGHLG HKQDASCMSA GLKTPKRREL LVSNTTFMNF DSSMCTAIST
CSGCSRGQGG FTVRAERLKF LNSPNQVLFP FPHSAILEDL DGSITEQKGS HLLASLNILA
TSCVVSANFS QAAASSVCGR DVIFHRMSIG LNEAPDAPYN LTVTNNNNKT TTVNYVSDTL
SNLYGWMALL LDKEAYTLIF DNPLFNKQLQ YSATFDNFAD GNYLLMEHRN LSSTIEVTVL
CGTRRGQPLQ SLPSHIYHKS CDWFFNRKLG KLTYLVTGED LIQVTFKEER VSIPAPAPSD
GILKWSMPES WSGVGRGWGG YNHSIPAPGE DVIILPNRAV LVDTTLPPLR GLYVLGRLEF
PINSSNVLSA ACVVVAGGEL KVGTFHHPLE RGLNLLIFLR ASDGIYCDRL DGINVHPGTI
GVYGKVQMHS AYPKKSWTHL GADIAPGNER ILLADEVDWS HGGNIVISSS SYEPHQAEVV
TVKEIRNHSV KIHERLLHRH IGRSHNTEDG RQLPLAAEVG LLTRNIQIKS DTICSGRLLV
GRFRNANGVE YAGALQLLNV ELLNFGPSHL PAIDFRNVSQ GSAVIASSIH QTCGGGIQSV
ASHGIVLRDN VMFSTVGPGI DLEGQNHSLT RNLIILSKQP EGSPNWVAGV KVNLIDGAYL
LGNVVAGSER IAFHIKGQEC SLARDQYIEN VAHSSLHGVH LYRGDGFQNC TRITGFLSYK
NYDYGVMFHL ESSVVMDNMT LVDNAVGVLP VVCSSFEQHC LRKEYIKFRN SVIVATSKTF
DCIKDRIKPL SGDSTSRDRA PRYPRRGRIG ILWPAFASDT SRWPDKPWHK IRNDPSVSGI
MTLQEVTFTG FTKSCYSDDM DICIMSNPDS TGIMHPITSE KMRMLHVNEK NKLYFHTLQT
SNEYEDMVFP EMRCESSRKA LFKDLDGSAL GLEPPVSVFP KSDWEWPQFY LHAGIYREDS
KCVYKPSTQG YFCKETDHAL LILESLEVDA DGERPSPVMS VTGSFVESFS ASASHSSCCS
SGHPQSFYSV LPSNKLTKVC FAGPTPLTMR LHLNSGQSFT RLFLAVFYNE PQSLHVFRQG
KYIPSTSSFV SSDAVAGTNY FSFEDNLLYI LLHGDEPVEI YTRHSLLIAF TITTTIGEEE
QINVVHHLAD FLQIGHELVR IVHNGAGSES TLKVISANAR KRTRLCPTMT SCMAFHSRDD
GENTWAGPTS MRRLRPSGTA TSSSVMIIEV GDPPSLVRNS LVSSLSDERL QSLASILIIA
HQTGELQDVL DIPTDTLMLM WSASPSPEGC SGRNGSGLTP GSCLYARPYN ISVQVQPSDG
EMGKELPVQP QIIFLDKQGQ RVETLGLPSE PWVVEAYLKG SSKAALKGHT TVEVQHGWAC
FTDLAVSSSG TDWYLIFTVS SPPGAKFTAE SQPFTIFPIA MGEKSNPILA VVLSSVASVV
VLGLFVFCWV KKSKSNKTKT GRANALQAES NIKSSPIHRP NNSTCVQLQC KQEENDRDVA
GVEADMEANG KQGSRVGEVK EQHPQTFKAK SLGKMNIVEH QMDSGEKSSM TKKCGDRYEP
RARTSPWDSF ELQQLGLKEF SEWKDVSQE
//