ID   ZN789_HUMAN             Reviewed;         425 AA.
AC   Q5FWF6; A4D282; Q6ZMZ9;
DT   10-JUL-2007, integrated into UniProtKB/Swiss-Prot.
DT   10-JUL-2007, sequence version 3.
DT   05-OCT-2010, entry version 57.
DE   RecName: Full=Zinc finger protein 789;
GN   Name=ZNF789;
OS   Homo sapiens (Human).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
OC   Catarrhini; Hominidae; Homo.
OX   NCBI_TaxID=9606;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 2).
RC   TISSUE=Spleen;
RX   PubMed=14702039; DOI=10.1038/ng1285;
RA   Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R.,
RA   Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H.,
RA   Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S.,
RA   Yamamoto J., Saito K., Kawai Y., Isono Y., Nakamura Y., Nagahari K.,
RA   Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., Shiratori A.,
RA   Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H., Sugawara M.,
RA   Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E., Omura Y.,
RA   Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M., Yamazaki M.,
RA   Ninomiya K., Ishibashi T., Yamashita H., Murakawa K., Fujimori K.,
RA   Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y., Ishida S.,
RA   Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., Kusano J.,
RA   Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O., Nomura Y.,
RA   Togiya S., Komai F., Hara R., Takeuchi K., Arita M., Imose N.,
RA   Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S.,
RA   Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S.,
RA   Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O.,
RA   Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H.,
RA   Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B.,
RA   Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y.,
RA   Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T.,
RA   Ono T., Yamada K., Fujii Y., Ozaki K.,
RA   Hirao M., Ohmori Y.,
RA   Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S.,
RA   Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T.,
RA   Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M.,
RA   Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T.,
RA   Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K.,
RA   Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R.,
RA   Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.;
RT   "Complete sequencing and characterization of 21,243 full-length human
RT   cDNAs.";
RL   Nat. Genet. 36:40-45(2004).
RN   [2]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   MEDLINE=22616434; PubMed=12690205; DOI=10.1126/science.1083423;
RA   Scherer S.W., Cheung J., MacDonald J.R., Osborne L.R., Nakabayashi K.,
RA   Herbrick J.-A., Carson A.R., Parker-Katiraee L., Skaug J., Khaja R.,
RA   Zhang J., Hudek A.K., Li M., Haddad M., Duggan G.E., Fernandez B.A.,
RA   Kanematsu E., Gentles S., Christopoulos C.C., Choufani S.,
RA   Kwasnicka D., Zheng X.H., Lai Z., Nusskern D.R., Zhang Q., Gu Z.,
RA   Lu F., Zeesman S., Nowaczyk M.J., Teshima I., Chitayat D., Shuman C.,
RA   Weksberg R., Zackai E.H., Grebe T.A., Cox S.R., Kirkpatrick S.J.,
RA   Rahman N., Friedman J.M., Heng H.H.Q., Pelicci P.G., Lo-Coco F.,
RA   Belloni E., Shaffer L.G., Pober B., Morton C.C., Gusella J.F.,
RA   Bruns G.A.P., Korf B.R., Quade B.J., Ligon A.H., Ferguson H.,
RA   Higgins A.W., Leach N.T., Herrick S.R., Lemyre E., Farra C.G.,
RA   Kim H.-G., Summers A.M., Gripp K.W., Roberts W., Szatmari P.,
RA   Winsor E.J.T., Grzeschik K.-H., Teebi A., Minassian B.A., Kere J.,
RA   Armengol L., Pujana M.A., Estivill X., Wilson M.D., Koop B.F.,
RA   Tosi S., Moore G.E., Boright A.P., Zlotorynski E., Kerem B.,
RA   Kroisel P.M., Petek E., Oscier D.G., Mould S.J., Doehner H.,
RA   Doehner K., Rommens J.M., Vincent J.B., Venter J.C., Li P.W.,
RA   Mural R.J., Adams M.D., Tsui L.-C.;
RT   "Human chromosome 7: DNA sequence and biology.";
RL   Science 300:767-772(2003).
RN   [3]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1), AND VARIANT
RP   ALA-77.
RC   TISSUE=Lymph;
RX   PubMed=15489334; DOI=10.1101/gr.2596504;
RG   The MGC Project Team;
RT   "The status, quality, and expansion of the NIH full-length cDNA
RT   project: the Mammalian Gene Collection (MGC).";
RL   Genome Res. 14:2121-2127(2004).
CC   -!- FUNCTION: May be involved in transcriptional regulation.
CC   -!- SUBCELLULAR LOCATION: Nucleus (Probable).
CC   -!- ALTERNATIVE PRODUCTS:
CC       Event=Alternative splicing; Named isoforms=2;
CC       Name=1;
CC         IsoId=Q5FWF6-1; Sequence=Displayed;
CC       Name=2;
CC         IsoId=Q5FWF6-2; Sequence=VSP_026561;
CC         Note=No experimental confirmation available. Ref.1 (BAD18576)
CC         sequence differs from that shown due to frameshifts in positions
CC         85 and 271;
CC   -!- SIMILARITY: Belongs to the krueppel C2H2-type zinc-finger protein
CC       family.
CC   -!- SIMILARITY: Contains 8 C2H2-type zinc fingers.
CC   -!- SIMILARITY: Contains 1 KRAB domain.
CC   -!- SEQUENCE CAUTION:
CC       Sequence=EAL23876.1; Type=Erroneous gene model prediction;
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; AK131429; BAD18576.1; ALT_FRAME; mRNA.
DR   EMBL; CH236956; EAL23875.1; -; Genomic_DNA.
DR   EMBL; CH236956; EAL23876.1; ALT_SEQ; Genomic_DNA.
DR   EMBL; BC089424; AAH89424.2; -; mRNA.
DR   IPI; IPI00412697; -.
DR   IPI; IPI00746127; -.
DR   RefSeq; NP_998768.2; -.
DR   UniGene; Hs.440384; -.
DR   HSSP; P17028; 1X6E.
DR   ProteinModelPortal; Q5FWF6; -.
DR   SMR; Q5FWF6; 5-59, 198-311, 225-340, 255-393, 282-421.
DR   PRIDE; Q5FWF6; -.
DR   Ensembl; ENST00000331410; ENSP00000331927; ENSG00000198556.
DR   GeneID; 285989; -.
DR   UCSC; uc003uqq.1; human.
DR   UCSC; uc003uqr.1; human.
DR   CTD; 285989; -.
DR   GeneCards; GC07P099072; -.
DR   HGNC; HGNC:27801; ZNF789.
DR   HPA; HPA029500; -.
DR   eggNOG; prNOG17995; -.
DR   HOGENOM; HBG717200; -.
DR   HOVERGEN; HBG018163; -.
DR   InParanoid; Q5FWF6; -.
DR   OMA; AKSYECS; -.
DR   NextBio; 95914; -.
DR   ArrayExpress; Q5FWF6; -.
DR   Bgee; Q5FWF6; -.
DR   CleanEx; HS_ZNF789; -.
DR   Genevestigator; Q5FWF6; -.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR   GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR   GO; GO:0006355; P:regulation of transcription, DNA-dependent; IEA:InterPro.
DR   GO; GO:0006350; P:transcription; IEA:UniProtKB-KW.
DR   InterPro; IPR001909; Krueppel-associated_box.
DR   InterPro; IPR007087; Znf_C2H2.
DR   InterPro; IPR015880; Znf_C2H2-like.
DR   InterPro; IPR013087; Znf_C2H2/integrase_DNA-bd.
DR   Gene3D; G3DSA:3.30.160.60; Znf_C2H2/integrase_DNA-bd; 8.
DR   Pfam; PF01352; KRAB; 1.
DR   Pfam; PF00096; zf-C2H2; 4.
DR   SMART; SM00349; KRAB; 1.
DR   SMART; SM00355; ZnF_C2H2; 8.
DR   SUPFAM; SSF109640; Krueppel-associated_box; 1.
DR   PROSITE; PS50805; KRAB; 1.
DR   PROSITE; PS00028; ZINC_FINGER_C2H2_1; 8.
DR   PROSITE; PS50157; ZINC_FINGER_C2H2_2; 8.
PE   2: Evidence at transcript level;
KW   Alternative splicing; Complete proteome; DNA-binding; Metal-binding;
KW   Nucleus; Polymorphism; Repeat; Transcription;
KW   Transcription regulation; Zinc; Zinc-finger.
FT   CHAIN         1    425       Zinc finger protein 789.
FT                                /FTId=PRO_0000293697.
FT   DOMAIN       11     82       KRAB.
FT   ZN_FING     201    223       C2H2-type 1.
FT   ZN_FING     229    251       C2H2-type 2.
FT   ZN_FING     257    279       C2H2-type 3.
FT   ZN_FING     285    307       C2H2-type 4.
FT   ZN_FING     313    335       C2H2-type 5.
FT   ZN_FING     341    363       C2H2-type 6.
FT   ZN_FING     369    391       C2H2-type 7.
FT   ZN_FING     397    419       C2H2-type 8.
FT   VAR_SEQ       1     95       Missing (in isoform 2).
FT                                /FTId=VSP_026561.
FT   VARIANT      77     77       T -> A (in dbSNP:rs6962772).
FT                                /FTId=VAR_052903.
FT   CONFLICT    398    398       Q -> R (in Ref. 1; BAD18576).
SQ   SEQUENCE   425 AA;  49984 MW;  00BF9778A9340CFA CRC64;
     MFPPARGKEL LSFEDVAMYF TREEWGHLNW GQKDLYRDVM LENYRNMVLL GFQFPKPEMI
     CQLENWDEQW ILDLPRTGNR KASGSACPGS EARHKMKKLT PKQKFSEDLE SYKISVVMQE
     SAEKLSEKLH KCKEFVDSCR LTFPTSGDEY SRGFLQNLNL IQDQNAQTRW KQGRYDEDGK
     PFNQRSLLLG HERILTRAKS YECSECGKVI RRKAWFDQHQ RIHFLENPFE CKVCGQAFRQ
     RSALTVHKQC HLQNKPYRCH DCGKCFRQLA YLVEHKRIHT KEKPYKCSKC EKTFSQNSTL
     IRHQVIHSGE KRHKCLECGK AFGRHSTLLC HQQIHSKPNT HKCSECGQSF GRNVDLIQHQ
     RIHTKEEFFQ CGECGKTFSF KRNLFRHQVI HTGSQPYQCV ICGKSFKWHT SFIKHQGTHK
     GQIST
//