ID   K1529_HUMAN Reviewed; 1646 AA.
AC   Q9P1Z9; Q2KHR6; Q5VV25; Q68DP5; Q69YV9; Q6AHY0;
DT   15-JAN-2008, integrated into UniProtKB/Swiss-Prot.
DT   15-JAN-2008, sequence version 2.
DT   05-OCT-2010, entry version 48.
DE   RecName: Full=Uncharacterized protein KIAA1529;
GN   Name=KIAA1529;
OS   Homo sapiens (Human).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
OC   Catarrhini; Hominidae; Homo.
OX   NCBI_TaxID=9606;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1).
RC   TISSUE=Brain;
RX   MEDLINE=20277482; PubMed=10819331; DOI=10.1093/dnares/7.2.143;
RA   Nagase T., Kikuno R., Ishikawa K., Hirosawa M., Ohara O.;
RT   "Prediction of the coding sequences of unidentified human genes. XVII.
RT   The complete sequences of 100 new cDNA clones from brain which code
RT   for large proteins in vitro.";
RL   DNA Res. 7:143-150(2000).
RN   [2]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORMS 2 AND 3).
RC   TISSUE=Testis;
RX   PubMed=17974005; DOI=10.1186/1471-2164-8-399;
RA   Bechtel S., Rosenfelder H., Duda A., Schmidt C.P., Ernst U.,
RA   Wellenreuther R., Mehrle A., Schuster C., Bahr A., Bloecker H.,
RA   Heubner D., Hoerlein A., Michel G., Wedler H., Koehrer K.,
RA   Ottenwaelder B., Poustka A., Wiemann S., Schupp I.;
RT   "The full-ORF clone resource of the German cDNA consortium.";
RL   BMC Genomics 8:399-399(2007).
RN   [3]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=15164053; DOI=10.1038/nature02465;
RA   Humphray S.J., Oliver K., Hunt A.R., Plumb R.W., Loveland J.E.,
RA   Howe K.L., Andrews T.D., Searle S., Hunt S.E., Scott C.E., Jones M.C.,
RA   Ainscough R., Almeida J.P., Ambrose K.D., Ashwell R.I.S.,
RA   Babbage A.K., Babbage S., Bagguley C.L., Bailey J., Banerjee R.,
RA   Barker D.J., Barlow K.F., Bates K., Beasley H., Beasley O., Bird C.P.,
RA   Bray-Allen S., Brown A.J., Brown J.Y., Burford D., Burrill W.,
RA   Burton J., Carder C., Carter N.P., Chapman J.C., Chen Y., Clarke G.,
RA   Clark S.Y., Clee C.M., Clegg S., Collier R.E., Corby N., Crosier M.,
RA   Cummings A.T., Davies J., Dhami P., Dunn M., Dutta I., Dyer L.W.,
RA   Earthrowl M.E., Faulkner L., Fleming C.J., Frankish A.,
RA   Frankland J.A., French L., Fricker D.G., Garner P., Garnett J.,
RA   Ghori J., Gilbert J.G.R., Glison C., Grafham D.V., Gribble S.,
RA   Griffiths C., Griffiths-Jones S., Grocock R., Guy J., Hall R.E.,
RA   Hammond S., Harley J.L., Harrison E.S.I., Hart E.A., Heath P.D.,
RA   Henderson C.D., Hopkins B.L., Howard P.J., Howden P.J., Huckle E.,
RA   Johnson C., Johnson D., Joy A.A., Kay M., Keenan S., Kershaw J.K.,
RA   Kimberley A.M., King A., Knights A., Laird G.K., Langford C.,
RA   Lawlor S., Leongamornlert D.A., Leversha M., Lloyd C., Lloyd D.M.,
RA   Lovell J., Martin S., Mashreghi-Mohammadi M., Matthews L., McLaren S.,
RA   McLay K.E., McMurray A., Milne S., Nickerson T., Nisbett J.,
RA   Nordsiek G., Pearce A.V., Peck A.I., Porter K.M., Pandian R.,
RA   Pelan S., Phillimore B., Povey S., Ramsey Y., Rand V., Scharfe M.,
RA   Sehra H.K., Shownkeen R., Sims S.K., Skuce C.D., Smith M.,
RA   Steward C.A., Swarbreck D., Sycamore N., Tester J., Thorpe A.,
RA   Tracey A., Tromans A., Thomas D.W., Wall M., Wallis J.M., West A.P.,
RA   Whitehead S.L., Willey D.L., Williams S.A., Wilming L., Wray P.W.,
RA   Young L., Ashurst J.L., Coulson A., Blocker H., Durbin R.M.,
RA   Sulston J.E., Hubbard T., Jackson M.J., Bentley D.R., Beck S.,
RA   Rogers J., Dunham I.;
RT   "DNA sequence and analysis of human chromosome 9.";
RL   Nature 429:369-374(2004).
RN   [4]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Mural R.J., Istrail S., Sutton G.G., Florea L., Halpern A.L.,
RA   Mobarry C.M., Lippert R., Walenz B., Shatkay H., Dew I., Miller J.R.,
RA   Flanigan M.J., Edwards N.J., Bolanos R., Fasulo D., Halldorsson B.V.,
RA   Hannenhalli S., Turner R., Yooseph S., Lu F., Nusskern D.R.,
RA   Shue B.C., Zheng X.H., Zhong F., Delcher A.L., Huson D.H.,
RA   Kravitz S.A., Mouchard L., Reinert K., Remington K.A., Clark A.G.,
RA   Waterman M.S., Eichler E.E., Adams M.D., Hunkapiller M.W., Myers E.W.,
RA   Venter J.C.;
RL   Submitted (JUL-2005) to the EMBL/GenBank/DDBJ databases.
RN   [5]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 4).
RC   TISSUE=Uterus;
RX   PubMed=15489334; DOI=10.1101/gr.2596504;
RG   The MGC Project Team;
RT   "The status, quality, and expansion of the NIH full-length cDNA
RT   project: the Mammalian Gene Collection (MGC).";
RL   Genome Res. 14:2121-2127(2004).
RN   [6]
RP   PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-1007, AND MASS
RP   SPECTROMETRY.
RC   TISSUE=Cervix carcinoma;
RX   PubMed=17924679; DOI=10.1021/pr070152u;
RA   Yu L.-R., Zhu Z., Chan K.C., Issaq H.J., Dimitrov D.S., Veenstra T.D.;
RT   "Improved titanium dioxide enrichment of phosphopeptides from HeLa
RT   cells and high confident phosphopeptide identification by cross-
RT   validation of MS/MS and MS/MS/MS spectra.";
RL   J. Proteome Res. 6:4150-4162(2007).
CC   -!- SUBCELLULAR LOCATION: Membrane; Single-pass membrane protein
CC       (Potential).
CC   -!- ALTERNATIVE PRODUCTS:
CC       Event=Alternative splicing; Named isoforms=4;
CC       Name=1;
CC         IsoId=Q9P1Z9-1; Sequence=Displayed;
CC       Name=2;
CC         IsoId=Q9P1Z9-2; Sequence=VSP_030503, VSP_030507, VSP_030508,
CC                                  VSP_030509;
CC         Note=No experimental confirmation;
CC       Name=3;
CC         IsoId=Q9P1Z9-3; Sequence=VSP_030502;
CC         Note=No experimental confirmation;
CC       Name=4;
CC         IsoId=Q9P1Z9-4; Sequence=VSP_030504, VSP_030505, VSP_030506;
CC         Note=No experimental confirmation;
CC   -!- SEQUENCE CAUTION:
CC       Sequence=BAA96053.1; Type=Erroneous initiation;
CC       Sequence=CAH18175.1; Type=Erroneous initiation;
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; AB040962; BAA96053.1; ALT_INIT; mRNA.
DR   EMBL; AL137557; CAH10701.1; -; mRNA.
DR   EMBL; CR627453; CAH10534.1; -; mRNA.
DR   EMBL; CR749320; CAH18175.1; ALT_INIT; mRNA.
DR   EMBL; AL512590; CAH71603.1; -; Genomic_DNA.
DR   EMBL; CH471105; EAW58840.1; -; Genomic_DNA.
DR   EMBL; BC112930; AAI12931.1; -; mRNA.
DR   IPI; IPI00292836; -.
DR   IPI; IPI00470917; -.
DR   IPI; IPI00740702; -.
DR   IPI; IPI00877755; -.
DR   RefSeq; NP_065944.2; -.
DR   UniGene; Hs.435629; -.
DR   ProteinModelPortal; Q9P1Z9; -.
DR   STRING; Q9P1Z9; -.
DR   PRIDE; Q9P1Z9; -.
DR   Ensembl; ENST00000357054; ENSP00000349562; ENSG00000197816.
DR   Ensembl; ENST00000375206; ENSP00000364352; ENSG00000197816.
DR   GeneID; 100499483; -.
DR   KEGG; hsa:57653; -.
DR   UCSC; uc004axe.1; human.
DR   UCSC; uc004axg.1; human.
DR   CTD; 57653; -.
DR   GeneCards; GC09P100000; -.
DR   HGNC; HGNC:29303; KIAA1529.
DR   HOVERGEN; HBG097391; -.
DR   OMA; SFRQYLE; -.
DR   OrthoDB; EOG9MPM8J; -.
DR   PhylomeDB; Q9P1Z9; -.
DR   NextBio; 64400; -.
DR   ArrayExpress; Q9P1Z9; -.
DR   Bgee; Q9P1Z9; -.
DR   CleanEx; HS_KIAA1529; -.
DR   Genevestigator; Q9P1Z9; -.
DR   GO; GO:0016021; C:integral to membrane; IEA:UniProtKB-KW.
PE   1: Evidence at protein level;
KW   Alternative splicing; Coiled coil; Complete proteome; Membrane;
KW   Phosphoprotein; Polymorphism; Transmembrane; Transmembrane helix.
FT   CHAIN         1   1646       Uncharacterized protein KIAA1529.
FT                                /FTId=PRO_0000315233.
FT   TRANSMEM     35     57       Helical; (Potential).
FT   COILED      270    339       Potential.
FT   COILED     1420   1487       Potential.
FT   COMPBIAS    834    958       Glu-rich.
FT   MOD_RES    1007   1007       Phosphoserine.
FT   VAR_SEQ       1   1523       Missing (in isoform 3).
FT                                /FTId=VSP_030502.
FT   VAR_SEQ       1    139       Missing (in isoform 2).
FT                                /FTId=VSP_030503.
FT   VAR_SEQ     504    543       Missing (in isoform 4).
FT                                /FTId=VSP_030504.
FT   VAR_SEQ     641    722       KSFETLADQTEWQSSHLFKYFQEVVQLWEAHQSELLVQELE
FT                                LEKRMEQHRQKHSLESQVQEAHLDRLLDQLRQQSDKETLAF
FT                                -> VRAGDCSTGELRNPSRLWQIRQSGRVRTSSSISRRWYN
FT                                CGRHTRASCWCRSWSWRRGWSSTGRSTAWRARCRRPTSIGS
FT                                WTN (in isoform 4).
FT                                /FTId=VSP_030505.
FT   VAR_SEQ     723   1646       Missing (in isoform 4).
FT                                /FTId=VSP_030506.
FT   VAR_SEQ    1016   1016       Q -> QLRAGFFEHLEKWFDQCSLNTRVTVATKINELDSEL
FT                                ELHLHLHQPRAQQIEKDIHNVRAAELLLHQEQLDSHCAGVT
FT                                ETLKKKRLMFCQFQEEQNVRSKNFRLKIYDMEHIFLNATRS
FT                                QKLVTLSNTLHQELLSYVDVTQVSLRSFRQYLEESLGKLRY
FT                                SNIEFIKHCR (in isoform 2).
FT                                /FTId=VSP_030507.
FT   VAR_SEQ    1252   1252       T -> TGRGAWACGSRGSSEAGAGGAVCSPPVLCSCPGPSS
FT                                PK (in isoform 2).
FT                                /FTId=VSP_030508.
FT   VAR_SEQ    1279   1289       Missing (in isoform 2).
FT                                /FTId=VSP_030509.
FT   VARIANT     301    301       P -> H (in dbSNP:rs7864805).
FT                                /FTId=VAR_038151.
FT   VARIANT     322    322       S -> R (in dbSNP:rs17855671).
FT                                /FTId=VAR_038152.
FT   VARIANT     373    373       L -> H (in dbSNP:rs10981558).
FT                                /FTId=VAR_038153.
FT   VARIANT     548    548       P -> R (in dbSNP:rs61261278).
FT                                /FTId=VAR_061250.
FT   VARIANT     917    917       E -> K (in dbSNP:rs12353306).
FT                                /FTId=VAR_038154.
FT   VARIANT     995    995       S -> C (in dbSNP:rs2061634).
FT                                /FTId=VAR_038155.
FT   VARIANT    1146   1146       F -> L (in dbSNP:rs3747495).
FT                                /FTId=VAR_038156.
FT   VARIANT    1518   1518       D -> N (in dbSNP:rs2306093).
FT                                /FTId=VAR_038157.
FT   CONFLICT    348    348       L -> F (in Ref. 2; CAH18175).
FT   CONFLICT    422    422       K -> M (in Ref. 2; CAH18175).
FT   CONFLICT    579    579       R -> W (in Ref. 1; BAA96053).
FT   CONFLICT   1083   1083       L -> P (in Ref. 2; CAH18175).
FT   CONFLICT   1177   1177       N -> I (in Ref. 2; CAH18175).
FT   CONFLICT   1596   1596       E -> D (in Ref. 2; CAH18175).
SQ   SEQUENCE   1646 AA;  191100 MW;  6A8AF5761D522615 CRC64;
     MWHGNHVQPG ATHRPNQGLE MLQGLGIGMK AFHNFNYFLF FYNVLLGLGA CLSRLLISCL
     LGMWLIARID RTIMQSGYEG ADMGFSAWIG MLYMDHYHIN PVLVSFCHIL ITNHREKKLQ
     QSTKYWCLNQ SAESLRICAM RGGENRPPAR VQSSSEELEL RHQSLDAFPG RRLPGRGIQP
     AAKMSSVGKV TQVPNGKAYQ QIFQAEVQLV HSLAATRKRA AERSVTLKSG RIPMMKKVET
     PEGEVMSPRQ QKWMHSLPND WIMENPVLHR EKERAKREKA RESENTIAAR EVRGLMDTIV
     PEKISTSTFQ RQAEHKRKSY ESALASFQEE IAQVGKEMEP LIVDTGGLFL KKLTESDEEM
     NRLFLKVEND TNLEDYTIQA LLELWDKVAG RLLLRKQEIK ELDEALHSLE FSRTDKLKSV
     LKKYAEVIEK TSYLMRPEVY RLINEEAMVM NYALLGNRKA LAQLFVNLME STLQQELDSR
     HRWQGLVDTW KALKKEALLQ SFSEFMASES IHTPPAVTKE LEVMLKTQNV LQQRRLKHLC
     TICDLLPPSY SKTQLTEWHS SLNSLNKELD TYHVDCMMRI RLLYEKTWQE CLMHVQNCKK
     QLLDWKAFTE EEAETLVNQF FFQMVGALQG KVEEDLELLD KSFETLADQT EWQSSHLFKY
     FQEVVQLWEA HQSELLVQEL ELEKRMEQHR QKHSLESQVQ EAHLDRLLDQ LRQQSDKETL
     AFHLEKVKDY LKNMKSRYEC FHTLLTKEVM EYPAIMLKEL NSYSSALSQY FFVREIFEQN
     LAGEVIFKFR QPEAHEKPSQ KRVKKLRKKQ GSKEDMTRSE ESISSGTSTA RSVEEVEEEN
     DQEMESFITE EVLGQQKKSP LHAKMDESKE GSIQGLEEMQ VEREGSLNPS LNEENVKGQG
     EKKEESEEED EKEEEEEEEK LEEEKEEKEA QEEQESLSVG EEEDKEEGLE EIYYEDMESF
     TISSGNTYFV FVPLEEEHCR KSHSTFSAMF INDTSSAKFI EQVTIPSRLI LEIKKQLFSE
     GGNFSPKEIN SLCSRLEKEA ARIELVESVI MLNMEKLENE YLDQANDVIN KFESKFHNLS
     VDLIFIEKIQ RLLTNLQVKI KCQVAKSNSQ TNGLNFSLQQ LQNKIKTCQE SRGEKTTVTT
     EELLSFVQTW KEKLSQRIQY LNCSLDRVSM TELVFTNTIL KDQEEDSDIL TSSEALEEEA
     KLDVVTPESF TQLSRVGKPL IEDPAVDVIR KLLQLPNTKW PTHHCDKDPS QTGFKRHRCQ
     PENSGKKAVP SASATSAGSL QTTHPPLSHS FTPHPKPNKM ERKYRVLGDK PPPAAEDFKG
     IILTLLWESS ENLLTVAEEF YRKEKRPVTR PDCMCDTFDQ CAENISKKIL EYQSQANKYH
     NSCLIELRIQ IRRFEELLPQ VCWLVMENFK EHHWKKFFTS VKEIRGQFEE QQKRLEKRKD
     KNAQKLHLNL GHPVHFQEME SLHLSEEERQ EELDSMIRMN KEKLEECTRR NGQVFITNLA
     TFTEKFLLQL DEVVTIDDVQ VARMEPPKQK LSMLIRRKLA GLSLKEESEK PLIERGSRKW
     PGIKPTEVTI QNKILLQPTS SISTTKTTLG HLAAVEARDA VYLKYLASFE EELKRIQDDC
     TSQIKEAQRW KDSWKQSLHT IQGLYV
//