Title of Invention

AN IN VITRO METHOD FOR ENHANCING THE LEVEL AND/OR QUALITY OF PROTEIN PRODUCTION

Abstract Disclosed herein are methods and compositions for enhanced protein production and over expression from an exogenous sequence using engineered zinc finger proteins to inactivate first and second genes in a cell including the exogenous sequence.
Full Text AN IN VITRO METHOD FOR ENHANCING THE LEVEL AND/OR QUALITY OF PROTEIN PRODUCTION

WO 2006/033859 PCT/US2005/032157
COMPOSITIONS AND METHODS FOR PROTEIN PRODUCTION
CROSS-REFERENCE TO RELATED APPLICATIONS
This application claims the benefit of United States provisional patent
applications No. 60/610,853 (filed September 16, 2004) and 60/661,841 (filed March
15,2005) the disclosures of which are incorporated by reference in their entireties for
all purposes.
STATEMENT OF RIGHTS TO INVENTIONS
MADE UNDER FEDERALLY SPONSORED RESEARCH
Not applicable.
TECHNICAL FIELD
The present disclosure is in the field of transcriptional regulation, particularly
protein expression and overproduction, for example, large-scale production of
therapeutic proteins.
BACKGROUND
Controlled regulation of transcription is useful in the fields of research,
diagnostics and therapeutics. For example, regulation of transcription of a nucleotide
sequence encoding a protein can facilitate production of large amounts of
recombinant protein for use as a therapeutic, for screening, for lead optimization and
for target validation.
Typical expression systems are characterized by a host cell line comprising an
expression vector containing a heterologous promoter operatively linked to a cDNA
encoding a gene product of interest. One such system for protein expression (and
overexpression) uses the SRa promoter, which is composed of a fusion between the
SV40 early promoter and the R segment and part of the U5 sequence form the long
terminal repeat of human T-cell leukemia virus type 1. Takebe et al. (1988) Mol.
Cell. Biol 8:466-472. Another overexpression system utilizes a human
cytomegalovirus (CMV) immediate early promoter. U.S. Patents 5,168,062 and
5,385,839. Both of these promoters are regulated by endogenous, naturally-occurring
1

WO 2006/033859 PCT/US2005/032157
transcription factors, whose availability or activity may limit the amount of
transcription, and hence the amount of protein produced, in these systems. Moreover,
overexpression of the naturally-occurring transcription factors that regulate these
promoters could lead to aberrant expression of genes normally regulated by these
factors, with potential detrimental effects on expression of the desired gene product.
Additional methods for protein production involve integration of promoters or
other regulatory sequences into a chromosome adjacent to a gene whose expression is
to be regulated. See, e.g., U.S. Patents 5,272,071; 5,641,670; 5,733,761; 5,968,502
and 6,361,972. These, too, depend of the action of endogenous transcription factors
and thus are subject to the potential limitations discussed above for the SRa and CMV
systems. Moreover, they also suffer from difficulties in achieving precisely targeted
chromosomal integration of exogenous polynucleotides.
Expression systems which yield levels of protein that are higher than those
obtained by using the SRa and CMV promoters, and which do not depend on random
integration of exogenous polynucleotide sequences, would be desirable. Moreover,
expression systems utilizing exogenous transcription factors allow the design of
customized transcription factors, provide greater flexibility and provide the potential
to obtain higher levels of expression of gene products of interest.
SUMMARY
In one aspect, a method for regulating the transcription of a nucleotide
sequence in a cell is provided, the method comprising expressing, in the cell, a protein
that binds to a target site comprising SEQ ID NO:1, wherein SEQ ID NO:1 is
operatively linked to the nucleotide sequence. In certain embodiments, a plurality of
target sites are operatively linked to the nucleotide sequence.
In another aspect, described herein is a method for regulating transcription of
first and second nucleotide sequences in a cell, the method comprising expressing, in
the cell, a protein that binds to a target site comprising SEQ ID NO:1 and wherein
SEQ ID NO:1 is operatively linked to the first and second nucleotide sequences. A
plurality of target sites may be operatively linked the first nucleotide sequence, to the
second nucleotide sequence or to both the first and second nucleotide sequences.
In any of them methods described herein, the protein may comprise, for
example, SEQ ID NO:25 or equivalents thereof (e.g., SEQ ID NO:2, SEQ ID NO:3,
SEQ ID NO:7, SEQ ID NO:8). Thus, described herein are methods for regulating
2

WO 2006/033859 PCT/US2005/032157
transcription of a first (and/or second) nucleotide sequence(s) in a cell by expressing
SEQ ID NO:25 or an equivalent in the cell. In certain embodiments, the protein
further comprises a transcriptional activation domain (e.g., VP16).
In any of the methods described herein, the nucleotide sequence can encode a
mRNA that is translated to yield one or more protein(s). Furthermore, in any of the
methods described herein, the nucleotide sequence(s) may comprises a cDNA
sequence, for example one or more cDNA sequence encoding antibody polypeptides
(e.g., antibody heavy chain or light chain). In additional embodiments, the nucleotide
sequence can encode a RNA molecule that is not translated into protein such as, for
example, siRNA, micro RNA, rRNA, tRNA, snRNA or scRNA.
In another aspect, described herein are methods for regulating the transcription
of a nucleotide sequence in a cell, the method comprising expressing, in the cell, a
protein comprising SEQ ID NO:25, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:7,
SEQ ID NO:8 or equivalents, wherein the protein binds to a target site; wherein the
target site is operatively linked to the nucleotide sequence. In certain embodiments,
the target site comprises SEQ ID NO: 1. One or more target sites may be operatively
linked to the nucleotide sequence. In certain embodiments, the nucleotide sequence
comprises a cDNA sequence. The protein may further comprise a transcriptional
activation domain, for example a VP16 domain.
In yet another aspect, described herein is a method for regulating the
transcription of first and second nucleotide sequences in a cell, the method
comprising: expressing, in the cell, a protein comprising SEQ ID NO:25, SEQ ID
NO:2, SEQ ID NO:3, SEQ ID NO:7, SEQ ID NO:8 or equivalents, wherein the
protein binds to a target site; wherein the target site is operatively linked to both of the
first and second nucleotide sequences. In certain embodiments, the target site
comprises SEQ ID NO: 1. One or more target sites may be operatively linked to the
first and/or second nucleotide sequences. In certain embodiments, the first and/or
nucleotide sequence comprises a cDNA sequence, for example a cDNA sequence
encoding antibody polypeptides (e.g., antibody light chains and/or antibody heavy
chains). The protein may further comprise a transcriptional activation domain, for
example a VP16 domain.
In yet another aspect, polypeptides comprising SEQ ID NO:25 are provided.
In certain embodiments, the polypeptides comprise SEQ ID NO:2 or SEQ ID NO:7 or
equivalents thereof.
3

WO 2006/033859 PCT/US2005/032157
In a still further aspect, polynucleotides encoding any of the polypeptides
disclosed herein are provided.
In yet another aspect, cells comprising any of the polypeptides and/or
polynucleotides disclosed herein are provided.
In another aspect, provided herein are vectors comprising SEQ ID NO:1; cells
comprising any of these vectors as well as a cell comprising one or more copies of
SEQ ID NO:1 integrated into its genome.
These and other embodiments will be readily apparent to one of skill in the art
upon reading the present disclosure.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 shows the nucleotide sequence (SEQ ID NO:9) of a hybrid SV40-R-
U5 promoter (SRα promoter), synthesized as described in Example 1. The target site
for the various ZFPs described in Example 2 is underlined.
Figure 2 shows the amino acid sequence of the 2392/00 protein (SEQ ID
NO:10). The domains in the protein are as follows. Amino acids 3-9; nuclear
localization sequence; amino acids 15-109: zinc finger domain; amino acids 119-185:
VP16 transcriptional activation domain; amino acids 196-203: FLAG epitope tag.
Figure 3 shows the sequence of a polynucleotide (SEQ ID NO: 11) encoding
the 2392/00 protein. Portions of the sequence encoding the various domains in the
protein are as follows. Nucleotides 7-27: nuclear localization sequence; nucleotides
43-327: zinc finger domain; nucleotides 355-555: VP16 transcriptional activation
domain; nucleotides 586-609: FLAG epitope tag.
Figure 4 shows the amino acid sequence of the 2392/10 protein (SEQ ID
NO: 12). The domains in the protein are as follows. Amino acids 3-9: nuclear
localization sequence; amino acids 15-109: zinc finger domain; amino acids 119-185:
VP16 transcriptional activation domain; amino acids 196-203: FLAG epitope tag.
Figure 5 shows the sequence of a polynucleotide (SEQ ID NO: 13) encoding
the 2392/10 protein. Portions of the sequence encoding the various domains in the
protein are as follows. Nucleotides 7-27: nuclear localization sequence; nucleotides
43-327: zinc finger domain; nucleotides 355-555: VP16 transcriptional activation
domain; nucleotides 586-609: FLAG epitope tag.
4

WO 2006/033859 PCT/US2005/032157
Figure 6 shows the amino acid sequences of a number of three-finger zinc
finger domains that were obtained from a two-hybrid selection system to recognize
the target sequence GCTGTGGAA (SEQ ID NO:1). See Example 2 for details.
Figure 7 shows levels of immunoglobulin kappa chain mRNA in cells
containing an integrated transcription unit comprising a SRα promoter and kappa
chain cDNA. Cells were transfected with plasmids encoding a VP16 activation
domain (NVF) or the 2393/00 ZFP fused to the VP16 activation domain (2392-
VP16). Numbers on the abscissa refer to nanograms of DNA transfected. See
Example 3 for details.
Figure 8 shows levels of secreted antibody in two cell lines (A and B)
transfected with a plasmid encoding the 2392/00 ZFP-VP16 fusion protein (2392),
compared to cells transfected with a plasmid encoding the VP16 activation domain
(NVF). GFP: cells transfected with a plasmid encoding a green-fluorescent protein;
Mock: mock-transfected cells; ntf: non-transfected cells. See Example 3 for details.
Figure 9 shows levels of immunoglobulin gamma heavy chain and
immunoglobulin kappa light chain mRNA in cells containing amplified gamma and
kappa chain cDNAs, both under the transcriptional control of the SRα promoter
("High Producer Line" in Figure). Results from cells transfected with a plasmid
encoding the 2392/00-VP16 fusion protein are indicated by "ZFP" along the abscissa;
non-transfected cells are indicated by "NT." mRNA levels were normalized to those
of GAPDH. See Example 3 for details.
Figure 10 shows levels of immunoglobulin G secreted from cells stably
transfected with sequences encoding the 2392/00-VP16 fusion protein (labeled "ZFP"
in the Figure). Secreted IgG levels from untransfected cells (labeled "Control") are
also shown.
Figure 11 shows relative IgG levels in cells transfected with two different
plasmids containing both a heavy chain-encoding transcription unit and a light chain-
encoding transcription unit. In one of the plasmids, each transcription unit is under
the transcriptional control of a SRα promoter (denoted SRa in the figure). In the other
plasmid, each transcription unit is under the transcriptional control of a SRα promoter
to which 8 additional copies of SEQ ID NO:1 have been appended (denoted SRa in
the figure). 2392/10-7 refers to a clonal isolate of CHO cells stably transfected with a
nucleic acid encoding the 2392/10-VP16 fusion protein. DG44 refers to the parental,
untransfected CHO cell line.
5

WO 2006/033859 PCT/US2005/032157
Figure 12 shows levels of green fluorescent protein mRNA (normalized to
GAPDH mRNA) in 2392/10-7 cells, which contain sequences encoding a ZFP-VP16
fusion protein. Cells were transfected with plasmids containing various modified
CMV promoters operatively linked to sequences encoding green fluorescent protein
(GFP). The number of copies of target site SEQ ID NO:1 inserted adjacent to the
CMV promoter in each construct is shown below the graph, as is the orientation of the
inserted target sites. The rightmost bar in each pair shows GFP mRNA levels in the
ZFP-containing cell line; the leftmost bar shows GFP mRNA levels in a parental cell
line that does not express the ZFP.
Figure 13 shows levels of erythropoietin (Epo) secreted from cells transfected
with different Epo expression constructs. SRαZ6 refers to a construct in which Epo
expression is controlled by a SRα promoter containing seven copies of SEQ ID NO:1.
CMV refers to a construct in which Epo expression is controlled by a CMV promoter.
CMVz10 refers to a construct in which Epo expression is controlled by a CMV
promoter containing ten copies of SEQ 3D NO: 1. The rightmost bar in each pair
shows levels of Epo secreted from the ZFP-containing cell line 2392/10-7; the
leftmost bar show levels of Epo secreted from a parental cell line that does not
express the ZFP. See Example 9 for details.
DETAILED DESCRIPTION
General
Practice of the methods, as well as preparation and use of the compositions
disclosed herein employ, unless otherwise indicated, conventional techniques in
molecular biology, biochemistry, chromatin structure and analysis, computational
chemistry, cell culture, recombinant DNA and related fields as are within the skill of
the art. These techniques are fully explained in the literature. See, for example,
Sambrook et al. MOLECULAR CLONING: A LABORATORY MANUAL, Second edition,
Cold Spring Harbor Laboratory Press, 1989 and Third edition, 2001; Ausubel et al.,
CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & Sons, New York,
1987 and periodic updates; the series METHODS IN ENZYMOLOGY, Academic Press,
San Diego; Wolffe, CHROMATIN STRUCTURE AND FUNCTION, Third edition,
Academic Press, San Diego, 1998; METHODS INENZYMOLOGY, Vol. 304,
"Chromatin" (P.M. Wassarman and A. P. Wolffe, eds.), Academic Press, San Diego,
6

WO 2006/033859 PCT/US2005/032157
1999; and METHODS IN MOLECULAR BIOLOGY, Vol. 119, "Chromatin Protocols"
(P.B. Becker, ed.) Humana Press, Totowa, 1999.
Definitions
The terms "nucleic acid," "polynucleotide," and "oligonucleotide" are used
interchangeably and refer to a deoxyribonucleotide or ribonucleotide polymer, in linear or
circular conformation, and in either single- or double-stranded form. For the purposes of
the present disclosure, these terms are not to be construed as limiting with respect to the
length of a polymer. The terms can encompass known analogues of natural nucleotides, as
well as nucleotides that are modified in the base, sugar and/or phosphate moieties (e.g.,
phosphorothioate backbones). In general, an analogue of a particular nucleotide has the
same base-pairing specificity; i.e., an analogue of A will base-pair with T.
The terms "polypeptide," "peptide" and "protein" are used interchangeably to refer
to a polymer of amino acid residues. The term also applies to amino acid polymers in
which one or more amino acids are chemical analogues or modified derivatives of a
corresponding naturally-occurring amino acids.
Techniques for determining nucleic acid and amino acid sequence identity are
known in the art. Typically, such techniques include determining the nucleotide
sequence of the mRNA for a gene and/or determining the amino acid sequence of a
protein encoded by a gene or mRNA, and comparing these sequences to a second
nucleotide or amino acid sequence. Genomic sequences can also be determined and
compared in this fashion. In general, identity refers to an exact nucleotide-to-
nucleotide or amino acid-to-amino acid correspondence of two polynucleotides or
polypeptide sequences, respectively. Two or more sequences (polynucleotide or
amino acid) can be compared by determining their percent identity. The percent
identity of two sequences, whether nucleic acid or amino acid sequences, is the
number of exact matches between two aligned sequences divided by the length of the
shorter sequences and multiplied by 100. An approximate alignment for nucleic acid
sequences is provided by the local homology algorithm of Smith and Waterman,
Advances in Applied Mathematics 2:482-489 (1981). This algorithm can be applied
to amino acid sequences by using the scoring matrix developed by Dayhoff Atlas of
Protein Sequences and Structure, M.O. Dayhoff ed., 5 suppl. 3:353-358, National
Biomedical Research Foundation, Washington, D.C., USA, and normalized by
Gribskov, Nucl. Acids Res. 14(6);6745-6763 (1986). An exemplary implementation
7

WO 2006/033859 PCT/US2005/032157
of this algorithm to determine percent identity of a sequence is provided by the
Genetics Computer Group (Madison, WI) in the "BestFit" utility application. The
default parameters for this method are described in the Wisconsin Sequence Analysis
Package Program Manual, Version 8 (1995) (available from Genetics Computer
Group, Madison, WI). A preferred method of establishing percent identity in the
context of the present disclosure is to use the MPSRCH package of programs
copyrighted by the University of Edinburgh, developed by John F. Collins and Shane
S. Sturrok, and distributed by IntelliGenetics, Inc. (Mountain View, CA). From this
suite of packages the Smith-Waterman algorithm can be employed where default
parameters are used for the scoring table (for example, gap open penalty of 12, gap
extension penalty of one, and a gap of six). From the data generated the "Match"
value reflects sequence identity. Other suitable programs for calculating the percent
identity or similarity between sequences are generally known in the art, for example,
another alignment program is BLAST, used with default parameters. For example,
BLASTN and BLASTP can be used using the following default parameters: genetic
code = standard; filter = none; strand = both; cutoff = 60; expect =10; Matrix =
BLOSUM62; Descriptions = 50 sequences; sort by = HIGH SCORE; Databases =
non-redundant, GenBank + EMBL + DDBJ + PDB + GenBank CDS translations +
Swiss protein + Spupdate + PIR. Details of these programs can be found at the
following internet address: http://www.ncbi.nlm.gov/cgi-bin/BLAST. With respect to
sequences described herein, the range of desired degrees of sequence identity is
approximately 20% to 100% and any integer value therebetween. Typically the
percent identities between sequences are at least 70-75%, preferably 80-82%, more
preferably 85-90%, even more preferably 92%, still more preferably 95%, and most
preferably 98% sequence identity.
Alternatively, the degree of sequence similarity between polynucleotides can
be determined by hybridization of polynucleotides under conditions that allow
formation of stable duplexes between homologous regions, followed by digestion
with single-stranded-specific nuclease(s), and size determination of the digested
fragments. Two nucleic acid, or two polypeptide sequences are substantially
homologous to each other when the sequences exhibit at least about 70%-75%,
preferably 80%-82%, more preferably 85%-90%, even more preferably 92%, still
more preferably 95%, and most preferably 98% sequence identity over a defined
length of the molecules, as determined using the methods above. As used herein,
8

WO 2006/033859 PCT/US2005/032157
substantially homologous also refers to sequences showing complete identity to a
specified DNA or polypeptide sequence. DNA sequences that are substantially
homologous can be identified in a Southern hybridization experiment under, for
example, stringent conditions, as defined for that particular system. Defining
appropriate hybridization conditions is within the skill of the art. See, e.g., Sambrook
et al., supra; Nucleic Acid Hybridization: A Practical Approach, editors B.D. Hames
and S.J. Higgins, (1985) Oxford; Washington, DC; IRL Press).
Selective hybridization of two nucleic acid fragments can be determined as
follows. The degree of sequence identity between two nucleic acid molecules affects
the efficiency and strength of hybridization events between such molecules. A
partially identical nucleic acid sequence will at least partially inhibit the hybridization
of a completely identical sequence to a target molecule. Inhibition of hybridization of
the completely identical sequence can be assessed using hybridization assays that are
well known in the art (e.g., Southern (DNA) blot, Northern (RNA) blot, solution
hybridization, or the like, see Sambrook, et al., Molecular Cloning: A Laboratory
Manual, Second Edition, (1989) Cold Spring Harbor, N.Y.). Such assays can be
conducted using varying degrees of selectivity, for example, using conditions varying
from low to high stringency. If conditions of low stringency are employed, the
absence of non-specific binding can be assessed using a secondary probe that lacks
even a partial degree of sequence identity (for example, a probe having less than
about 30% sequence identity with the target molecule), such that, in the absence of
non-specific binding events, the secondary probe will not hybridize to the target.
When utilizing a hybridization-based detection system, a nucleic acid probe is
chosen that is complementary to a reference nucleic acid sequence, and then by
selection of appropriate conditions the probe and the reference sequence selectively
hybridize, or bind, to each other to form a duplex molecule. A nucleic acid molecule
that is capable of hybridizing selectively to a reference sequence under moderately
stringent hybridization conditions typically hybridizes under conditions that allow
detection of a target nucleic acid sequence of at least about 10-14 nucleotides in
length having at least approximately 70% sequence identity with the sequence of the
selected nucleic acid probe. Stringent hybridization conditions typically allow
detection of target nucleic acid sequences of at least about 10-14 nucleotides in length
having a sequence identity of greater than about 90-95% with the sequence of the
selected nucleic acid probe. Hybridization conditions useful for probe/reference
9

WO 2006/033859 PCT/US2005/032157
sequence hybridization, where the probe and reference sequence have a specific
degree of sequence identity, can be determined as is known in the art (see, for
example, Nucleic Acid Hybridization: A Practical Approach, editors B.D. Hames and
S.J. Higgins, (1985) Oxford; Washington, DC; IRL Press).
Conditions for hybridization are well-known to those of skill in the art.
Hybridization stringency refers to the degree to which hybridization conditions
disfavor the formation of hybrids containing mismatched nucleotides, with higher
stringency correlated with a lower tolerance for mismatched hybrids. Factors that
affect the stringency of hybridization are well-known to those of skill in the art and
include, but are not limited to, temperature, pH, ionic strength, duration of the
hybridization reaction and concentration of organic solvents such as, for example,
formamide and dimethylsulfoxide. As is known to those of skill in the art,
hybridization stringency is increased by higher temperatures, lower ionic strength and
lower solvent concentrations.
With respect to stringency conditions for hybridization, it is well known in the art
that numerous equivalent conditions can be employed to establish a particular stringency
by varying, for example, the following factors: the length and nature of the sequences,
base composition of the various sequences, concentrations of salts and other hybridization
solution components, the presence or absence of blocking agents in the hybridization
solutions (e.g., dextran sulfate, and polyethylene glycol), hybridization reaction
temperature and time parameters, as well as, varying wash conditions. The selection of a
particular set of hybridization conditions is selected following standard methods in the art
(see, for example, Sambrook, et al., Molecular Cloning: A Laboratory Manual Second
Edition, (1989) Cold Spring Harbor, N.Y.).
"Binding" refers to a sequence-specific, non-covalent interaction between
macromolecules (e.g., between a protein and a nucleic acid). Not all components of a
binding interaction need be sequence-specific (e.g., contacts with phosphate residues
in a DNA backbone), as long as the interaction as a whole is sequence-specific. Such
interactions are generally characterized by a dissociation constant (Kd) of 10-6 M-1 or
lower. "Affinity" refers to the strength of binding: increased binding affinity being
correlated with a lower Kd.
A "binding protein" is a protein that is able to bind non-covalently to another
molecule. A binding protein can bind to, for example, a DNA molecule (a DNA-binding
protein), an RNA molecule (an RNA-binding protein) and/or a protein molecule (a
10

WO 2006/033859 PCT/US2005/032157
protein-binding protein). In the case of a protein-binding protein, it can bind to itself (to
form homodimers, homotrimers, etc.) and/or it can bind to one or more molecules of a
different protein or proteins. A binding protein can have more than one type of binding
activity. For example, zinc finger proteins have DNA-binding, RNA-binding and protein-
binding activity.
A "zinc finger DNA binding protein" (or zinc finger binding domain) is a protein,
or a domain within a larger protein, that binds DNA in a sequence-specific manner through
one or more zinc fingers, which are regions of amino acid sequence within the binding
domain whose structure is stabilized through coordination of a zinc ion. The term zinc
finger DNA binding protein is often abbreviated as zinc finger protein or ZFP.
Zinc finger binding domains can be "engineered" to bind to a predetermined
nucleotide sequence. Non-limiting examples of methods for engineering zinc finger
proteins are design and selection.
A designed zinc finger protein is a protein not occurring in nature whose
design/composition results principally from rational criteria. Rational criteria for
design include application of substitution rules and computerized algorithms for
processing information in a database storing information of existing ZFP designs and
binding data. See, for example, US Patents 6,140,081; 6,453,242; and 6,534,261;
see also WO 98/53058; WO 98/53059; WO 98/53060; WO 02/016536 and
WO 03/016496.
A "selected" zinc finger protein is a protein not found in nature whose
production results primarily from an empirical process such as phage display,
interaction trap or hybrid selection. See e.g., US 5,789,538; US 5,925,523;
US 6,007,988; US 6,013,453; US 6,200,759; WO 95/19431; WO 96/06166;
WO 98/53057; WO 98/54311; WO 00/27878; WO 01/60970 WO 01/88197 and
WO 02/099084.
Thus, an "engineered zinc finger protein" refers to a protein that contains one
or more zinc fingers, that has been constructed to bind in a sequence-specific fashion
to a predetermined nucleotide sequence. Generally, engineered zinc finger proteins
are non-naturally-occurring proteins and/or contain naturally-occurring zinc fingers in
non-naturally-occurring arrangements and/or combinations. Methods for engineering
the binding specificity of zinc fingers and constructing engineered zinc finger proteins
include, but are not limited to, rational design, randomization/selection techniques,
polysome selection, cis-display, one- and two-hybrid systems, and selection from
11

WO 2006/033859 PCT/US2005/032157
randomized libraries of both engineered and naturally-occurring zinc fingers. See, for
example, U.S. Patents 5,789,538; 6,007,988; 6,013,453; 6,140,466; 6,242,568;
6,410,248; 6,453,242; 6,479,626; 6,503,717; 6,534,261; 6,706,470; 6,733,970;
and 6,746,838; U.S. Patent Application publications 2003/0044957; 2003/0068675;
2003/0104526; 2003/0108880; 2003/0166141 and 2004/0091991; and PCT
Publications WO 98/53057; WO 98/53058; WO 98/53059; WO 98/53060; WO
01/53480; WO 01/88197; and WO 02/77227, the disclosures of which are
incorporated herein by reference in their entireties for all purposes. The term
"engineered zinc finger protein" does not refer to a cloned, naturally-occurring zinc
finger protein.
"Chromatin" is the nucleoprotein structure comprising the cellular genome.
Cellular chromatin comprises nucleic acid, primarily DNA, and protein, including
histones and non-histone chromosomal proteins. The majority of eukaryotic cellular
chromatin exists in the form of nucleosomes, wherein a nucleosome core comprises
approximately 150 base pairs of DNA associated with an octamer comprising two
each of histones H2A, H2B, H3 and H4; and linker DNA (of variable length
depending on the organism) extends between nucleosome cores. A molecule of
histone H1 is generally associated with the linker DNA. For the purposes of the
present disclosure, the term "chromatin" is meant to encompass all types of cellular
nucleoprotein, both prokaryotic and eukaryotic. Cellular chromatin includes both
chromosomal and episomal chromatin.
A "chromosome," is a chromatin complex comprising all or a portion of the
genome of a cell. The genome of a cell is often characterized by its karyotype, which
is the collection of all the chromosomes that comprise the genome of the cell. The
genome of a cell can comprise one or more chromosomes.
An "episome" is a replicating nucleic acid, nucleoprotein complex or other
structure comprising a nucleic acid that is not part of the chromosomal karyotype of a
cell. Examples of episomes include plasmids and certain viral genomes.
A "target site" or "target sequence" is a nucleic acid sequence that defines a
portion of a nucleic acid to which a binding molecule (e.g., a binding protein) will
bind, provided sufficient conditions for binding exist. For example, the sequence
5'-GAATTC-3' is a target site for the Eco RI restriction endonuclease.
An "accessible region" is a site in cellular chromatin in which a target site
present in the nucleic acid can be bound by an exogenous molecule which recognizes
12

WO 2006/033859 PCT/US2005/032157
the target site. Without wishing to be bound by any particular theory, it is believed
that an accessible region is one that is not packaged into a nucleosomal structure. The
distinct structure of an accessible region can often be detected by its sensitivity to
chemical and enzymatic probes, for example, nucleases.
An "exogenous" molecule is a molecule that is not normally present in a cell,
but can be introduced into a cell by one or more genetic, biochemical or other
methods. "Normal presence in the cell" is determined with respect to the particular
developmental stage and environmental conditions of the cell. Thus, for example, a
molecule that is present only during embryonic development of muscle is an
exogenous molecule with respect to an adult muscle cell. Similarly, a molecule
induced by heat shock is an exogenous molecule with respect to a non-heat-shocked
cell. An exogenous molecule can comprise, for example, a functioning version of a
malfunctioning endogenous molecule or a malfunctioning version of a normally-
functioning endogenous molecule.
An exogenous molecule can be, among other things, a small molecule, such as
is generated by a combinatorial chemistry process, or a macromolecule such as a
protein, nucleic acid, carbohydrate, lipid, glycoprotein, lipoprotein, polysaccharide,
any modified derivative of the above molecules, or any complex comprising one or
more of the above molecules. Nucleic acids include DNA and RNA, can be single- or
double-stranded; can be linear, branched or circular; and can be of any length.
Nucleic acids include those capable of forming duplexes, as well as triplex-forming
nucleic acids. See, for example, U.S. Patent Nos. 5,176,996 and 5,422,251. Proteins
include, but are not limited to, DNA-binding proteins, transcription factors, chromatin
remodeling factors, methylated DNA binding proteins, polymerases, methylases,
demethylases, acetylases, deacetylases, kinases, phosphatases, integrases,
recombinases, ligases, topoisomerases, gyrases and helicases.
An exogenous molecule can be the same type of molecule as an endogenous
molecule, e.g., an exogenous protein or nucleic acid. For example, an exogenous
nucleic acid can comprise an infecting viral genome, a plasmid or episome introduced
into a cell, or a chromosome that is not normally present in the cell. Methods for the
introduction of exogenous molecules into cells are known to those of skill in the art
and include, but are not limited to, lipid-mediated transfer (i.e., liposom.es, including
neutral and cationic lipids), electroporation, direct injection, cell fusion, particle
13

WO 2006/033859 PCT/US2005/032157
bombardment, calcium phosphate co-precipitation, DEAE-dextran-mediated transfer
and viral vector-mediated transfer.
By contrast, an "endogenous" molecule is one that is normally present in a
particular cell at a particular developmental stage under particular environmental
conditions. For example, an endogenous nucleic acid can comprise a chromosome,
the genome of a mitochondrion, chloroplast or other organelle, or a naturally-
occurring episomal nucleic acid. Additional endogenous molecules can include
proteins, for example, transcription factors and enzymes.
A "fusion" molecule is a molecule in which two or more subunit molecules are
linked, preferably covalently. The subunit molecules can be the same chemical type
of molecule, or can be different chemical types of molecules. Examples of the first
type of fusion molecule include, but are not limited to, fusion proteins (for example, a
fusion between a ZFP DNA-binding domain and a transcriptional regulatory domain)
and fusion nucleic acids (for example, a nucleic acid encoding the fusion protein
described supra). Examples of the second type of fusion molecule include, but are
not limited to, a fusion between a triplex-forming nucleic acid and a polypeptide, and
a fusion between a minor groove binder and a nucleic acid.
Expression of a protein (e.g., a fusion protein) in a cell can result from
delivery of the protein to the cell or by delivery of a polynucleotide encoding the
protein to a cell, wherein the polynucleotide is transcribed, and the transcript is
translated, to generate the protein. Delivery to the cell of a RNA molecule, which is
subsequently translated in the cell, can also be used to express a protein in a cell.
Trans-splicing, polypeptide cleavage and polypeptide ligation can also be involved in
expression of a protein in a cell. Methods for polynucleotide and polypeptide delivery
to cells are known in the art and exemplary methods are presented elsewhere in this
disclosure.
A "gene," for the purposes of the present disclosure, includes a DNA region
encoding a gene product (see infra), as well as all DNA regions which regulate the
production of the gene product, whether or not such regulatory sequences are adjacent
to coding and/or transcribed sequences. Accordingly, a gene includes, but is not
necessarily limited to, promoter sequences, terminators, translational regulatory
sequences such as ribosome binding sites and internal ribosome entry sites, enhancers,
silencers, insulators, boundary elements, replication origins, matrix attachment sites
and locus control regions.
14

WO 2006/033859 PCT/US2005/032157
"Gene expression" refers to the conversion of the information, contained in a
gene, into a gene product. A gene product can be the direct transcriptional product of
a gene (e.g., rnRNA, tRNA, rRNA, antisense RNA, dsRNA, ribozyme, structural
RNA or any other type of RNA), a processed transcript such as, for example siRNA,
or a protein produced by translation of a mRNA. Gene products also include RNAs
which are modified, by processes such as capping, polyadenylation, methylation, and
editing, and proteins modified by, for example, methylation, acetylation,
phosphorylation, ubiquitination, ADP-ribosylation, myristilation, and glycosylation.
"Modulation" of gene expression refers to a change in the activity of a gene.
Modulation of expression can include, but is not limited to, gene activation and gene
repression.
"Eucaryotic" cells include, but are not limited to, fungal cells (such as yeast),
plant cells, animal cells, mammalian cells and human cells.
A "region of interest" is any region of cellular chromatin, such as, for
example, a gene or a non-coding sequence within or adjacent to a gene, in which it is
desirable to bind an exogenous molecule. Binding can be for the purposes of, e.g.,
transcriptional regulation. A region of interest can be present in a chromosome, an
episome, an organellar genome (e.g., mitochondrial, chloroplast), or an infecting viral
genome, for example. A region of interest can be within the coding region of a gene,
within transcribed non-coding regions such as, for example, leader sequences, trailer
sequences or introns, or within non-transcribed regions, either upstream or
downstream of the coding region. A region of interest can be as small as a single
nucleotide pair or up to 2,000 nucleotide pairs in length, or any integral value of
nucleotide pairs.
The terms "operative linkage" and "operatively linked" (or "operably linked")
are used interchangeably with reference to a juxtaposition of two or more components
(such as sequence elements), in which the components are arranged such that the
components function normally and allow the possibility that at least one of the
components can mediate a function that is exerted upon at least one of the other
components. By way of illustration, a transcriptional regulatory sequence, such as a
promoter, is operatively linked to a coding sequence if the transcriptional regulatory
sequence controls the level of transcription of the coding sequence in response to the
presence or absence of one or more transcriptional regulatory factors. A
transcriptional regulatory sequence is generally operatively linked in cis with a coding
15

WO 2006/033859 PCT/US2005/032157
sequence, but need not be directly adjacent to it. For example, an enhancer is a
transcriptional regulatory sequence that is operatively linked to a coding sequence,
even though they are not contiguous.
With respect to fusion polypeptides, the term "operatively linked" can refer to
the fact that each of the components performs the same function in linkage to the
other component as it would if it were not so linked. For example, with respect to a
fusion polypeptide in which a ZFP DNA-binding domain is fused to a transcriptional
regulatory domain, the ZFP DNA-binding domain and the regulatory domain are in
operative linkage if, in the fusion polypeptide, the ZFP DNA-binding domain portion
is able to bind its target site and/or its binding site, while the regulatory domain is able
to modulate (e.g., activate or repress) transcription.
A "functional equivalent" or "functional fragment" of a protein, polypeptide or
nucleic acid is a protein, polypeptide or nucleic acid whose sequence is not identical
to the full-length protein, polypeptide or nucleic acid, yet retains the same function as
the full-length protein, polypeptide or nucleic acid. A functional fragment can
possess more, fewer, or the same number of residues as the corresponding native
molecule, and/or can contain one ore more amino acid or nucleotide substitutions.
Methods for determining the function of a nucleic acid (e.g., coding function, ability
to hybridize to another nucleic acid) are well-known in the art. Similarly, methods for
determining protein function are well-known. For example, the DNA-binding
function of a polypeptide can be determined, for example, by filter-binding,
electrophoretic mobility-shift, or immunoprecipitation assays. DNA cleavage can be
assayed by gel electrophoresis. See Ausubel et at, supra. The ability of a protein to
interact with another protein can be determined, for example, by co-
immunoprecipitation, two-hybrid assays or complementation, both genetic and
biochemical. See, for example, Fields et al. (1989) Nature 340:245-246; U.S. Patent
No. 5,585,245 and PCT WO 98/44350.
Engineered Zinc Finger Proteins and Target Sequences
Disclosed herein are compositions and methods for regulation of transcription,
which are useful, for example, for enhanced production of RNA and/or protein.
These include fusion proteins comprising an engineered zinc finger protein and a
functional domain such as, for example, a transcriptional activation domain. Suitable
functional domains are known in the art and include, without limitation,
16

WO 2006/033859 PCT/US2005/032157
transcriptional activation domains such as, for example, VP16, VP64 and p65.
Moreover, one or more of the same or different functional domains (e.g.,
transcriptional activation domains) can be present in a given fusion protein. See co-
owned U.S. Patent Application Publication No. 2002/0160940, incorporated by
reference, for disclosure of exemplary functional domains.
In certain embodiments, a zinc finger protein is engineered to bind to a
sequence comprising the target sequence GCTGTGGAA (SEQ ID NO: 1). This
sequence is present in the SRα promoter (Takebe et al, supra), a promoter commonly
used for protein production, but one or more copies SEQ ID NO:1 can be inserted in
or adjacent to any promoter known in the art (e.g., a CMV promoter).
An exemplary three-finger zinc finger protein, SBS2392/00, that has been
engineered to bind to SEQ ID NO:1 has the amino acid sequence:
KKKQHICHIQGCGKVYGORSNLVRHLRWHTGERPFMCTWSYCGKRFTRSDA
LSRHKRTHTGEKKFACPECPKRFMOSSDLRRHIKTHQNK (SEQ ID NO:2).
The underlined amino acid residues in SEQ ID NO:2 correspond to residues -
1 through +6 with respect to the start of the alpha-helical portion of a zinc finger and
are denoted the "recognition regions" because one or more of these residues
participate in sequence specificity of nucleic acid binding. Accordingly, proteins
comprising the same three recognition regions in a different polypeptide backbone
sequence are considered equivalents to the protein identified as SEQ ED NO:2, since
they will have the same DNA-binding specificity.
Thus, in certain embodiments, the three recognition regions (underlined in
SEQ ID NO:2 above) can be placed in any zinc finger backbone (see, e.g., U.S.
Patents 6,453,242 and 6,534,261) and the resulting protein can be used to regulate
transcription, e.g., to enhance protein production. Accordingly, engineered zinc
finger proteins having the following sequence can be used in the disclosed methods:
C-X2-4-C-X5-QRSNLVR-H-X3-5-H-X7-C-X2-4-C-X5-RSDALSR-H-X3-5-H-X7-C-X2-4-
C-X5-QSSDLRR-H-X3-5-H (SEQ ID NO:3).
Within the recognition region, residues -1, +3 and +6 are primarily
responsible for protein-nucleotide contacts. Accordingly, non-limiting examples of
additional equivalents include proteins comprising three zinc fingers wherein the first
finger contains a Q residue at -1, a N residue at +3 and a R residue at +6
(QXXNXXR, SEQ ID NO:4); the second finger contains a R residue at -1, an A
residue at +3 and a R residue at +6 (RXXAXXR, SEQ ID NO:5); and the third finger
17

WO 2006/033859 PCT/US2005/032157
contains a Q residue at -1, a D residue at +3 and a R residue at +6 (QXXDXXR, SEQ
ID NO:6). Additional equivalents comprise any ZFP that binds to a sequence
comprising the target sequence GCTGTGGAA (SEQ ID NO:1).
An additional exemplary three-finger zinc finger protein engineered to bind
the target sequence GCTGTGGAA (SEQ ID NO:1), SBS2392/10, has the following
amino acid sequence:
KKKOHICHIOGCGKVYGOSSNLARHLRWHTGERPFMCTWSYCGKRF
TRSDALTRHKRTHTGEKKFACPECPKRFMOSCDLTRHIKTHQNK (SEQ ID
NO:7)The underlined amino acid residues in SEQ ID NO:7 correspond to residues -1
through +6 with respect to the start of the alpha-helical portion of a zinc finger and are
denoted the "recognition regions" because one or more of these residues participate in
sequence specificity of nucleic acid binding. Accordingly, proteins comprising the
same three recognition regions in a different polypeptide backbone sequence are
considered equivalents to SEQ ID NO:7, since they will have the same DNA-binding
specificity.
Thus, in certain embodiments, the three recognition regions (underlined in
SEQ ID NO:7 above) can be placed in any zinc finger backbone (see, e.g., U.S.
Patents 6,453,242 and 6,534,261) and the resulting protein can be used to regulate
transcription, e.g., to enhance protein production. Accordingly, engineered zinc
finger proteins having the following sequence can be used in the disclosed methods;
C-X2-4-C-X5-QSSNLAR-H-X3-5-H-X7-C-X2-4-C-X5-RSDALTR-H-X3-5-H-X7-C-X2-4-
C-X5-QSCDLTR-H-X3-5-H (SEQ ID NO:8).
Within the recognition region, residues -1, +3 and +6 are primarily
responsible for protein-nucleotide contacts. Accordingly, non-limiting examples of
additional equivalents include proteins comprising three zinc fingers wherein the first
finger contains a Q residue at -1, a N residue at +3 and a R residue at +6
(QXXNXXR, SEQ ID NO:4); the second finger contains a R residue at -1, an A
residue at +3 and a R residue at +6 (RXXAXXR, SEQ ID NO:5); and the third finger
contains a Q residue at -1, a D residue at +3 and a R residue at +6 (QXXDXXR, SEQ
ID NO:6). Thus, for example, proteins comprising SEQ ID NO:25 are considered
equivalents for use in the disclosed methods.
C-X2-4-C-X5-QXXNXXR-H-X3-5-H-X7-C-X2-4-C-X5-RXXAXXR-H-X3-5-H-
X7-C-X2-4-C-X5-QXXDXXR-H-X3-5-H (SEQ ID NO:25)
18

WO 2006/033859 PCT/US2005/032157
Additional equivalents comprise any ZFP that binds to a sequence comprising
the target sequence GCTGTGGAA (SEQ ID NO:1).
Correspondences between amino acids at the -1, +3 and +6 contact residues of
the recognition region of a zinc finger, and micleotides in a target site, have been
described. See, for example, U.S. Patent Nos. 6,007,988; 6,013,453 and 6,746,838;
as well as PCT Publications WO 96/06166; WO 98/53058; WO 98/53059 and WO
98/53060. Accordingly, also to be considered equivalents are three-finger zinc finger
proteins in which the first finger contains Q at -1; N at +3 and R, K, S or T at +6; the
second finger contains R at -1; A, S or V at +3 and R, K, S or T at +6; and the third
finger contains N, Q, H or T at -1; S, D, E, L, T, or V at +3 and R, K, S or T at +6.
Fusion proteins and encoding polynucleotides
As previously stated, the engineered zinc finger DNA-binding domains
disclosed herein can comprise a portion of a fusion protein, wherein the fusion protein
also contains one or more functional domains (e.g., transcriptional regulatory
domains), nuclear localization sequences, epitope tags, etc. For example, the amino
acid sequence of a fusion protein comprising a nuclear localization sequence (NLS), a
zinc finger binding domain (ZFP), the VP16 transcriptional activation domain (VP16)
and a FLAG epitope tag (FLAG), denoted 2392/00, is shown in Figure 2 (SEQ ID
NO: 10). The nucleotide sequence encoding this protein is shown in Figure 3 (SEQ ID
NO:11).
An additional exemplary protein (2392/10) has the amino acid sequence (SEQ
ID NO:12) shown in Figure 4. The sequence of a polynucleotide (SEQ ID NO:13)
encoding the 2392/10 protein is shown in Figure 5.
Functional Domains
Any DNA-binding domain can optionally be associated with one or more
"functional domains" or "regulatory domains" which facilitate, e.g., DNA processing
(e.g., DNA cleavage) or modulation of gene expression. The binding domain can be
covalently or non-covalently associated with one or more regulatory domains,
alternatively two or more regulatory domains, with the two or more domains being
two copies of the same domain, or two different domains. The regulatory domains
can be covalently linked to the binding domain, e.g., via an amino acid linker, as part
of a fusion protein. A DNA-binding domain can also be associated with a regulatory
19

WO 2006/033859 PCT/US2005/032157
domain via a non-covalent dimerization domain, e.g., a leucine zipper, a STAT
protein N terminal domain, or an FK506 binding protein (see, e.g., O'Shea, Science
254: 539 (1991), Barahmand-Pour etal, Curr. Top. Microbiol Immunol. 211:121-
128 (1996); Klemm et al, Annu. Rev. Immunol. 16:569-592 (1998); Klemm et al.,
Annu. Rev. Immunol. 16:569-592 (1998); Ho et al., Nature 382:822-826 (1996); and
Pomeranz et al, Biochem. 37:965 (1998)). The regulatory domain can be associated
with the binding domain at any suitable position, including the C- or N-terminus of
the binding domain.
Common regulatory domains include, e.g., effector domains from transcription
factors (activators, repressors, co-activators, co-repressors), silencers, nuclear
hormone receptors, oncogene transcription factors (e.g., myc, jun, fos, myb, max,
mad, rel, ets, bcl, myb, mos family members etc.); DNA repair enzymes and their
associated factors and modifiers; DNA rearrangement enzymes and their associated
factors and modifiers; chromatin associated proteins and their modifiers (e.g., kinases,
acetyiases and deacetylases); and DNA modifying enzymes (e.g., methyltransferases,
topoisomerases, helicases, ligases, kinases, phosphatases, polymerases,
endonucleases, integrases) and their associated factors and modifiers.
Transcription factor polypeptides from which one can obtain a regulatory
domain include those that are involved in regulated and basal transcription. Such
polypeptides include transcription factors, their effector domains, coactivators.,
silencers, nuclear hormone receptors (see, e.g., Goodrich et al., Cell 84:825-30 (1996)
for a review of proteins and nucleic acid elements involved in transcription;
transcription factors in general are reviewed in Barnes & Adcock, Clin. Exp. Allergy
25 Suppl. 2:46-9 (1995) and Roeder, Methods Enzymol. 273:165-71 (1996)).
Databases dedicated to transcription factors are known (see, e.g., Science 269:630
(1995) and TRANSFAC). Nuclear hormone receptor transcription factors are
described in, for example, Rosen et al, J. Med. Chem. 38:4855-74 (1995). The
C/EBP family of transcription factors are reviewed in Wedel et al., Immunobiology
193:171-85 (1995). Coactivators and co-repressors that mediate transcription
regulation by nuclear hormone receptors are reviewed in, for example, Meier, Eur. J.
Endocrinol. 134(2):158-9 (1996); Kaiser et al., Trends Biochem. Sci. 21:342-5
(1996); and Utley et al., Nature 394:498-502 (1998)). GATA transcription factors,
which are involved in regulation of hematopoiesis, are described in, for example,
Simon,Nat. Genet. 11:9-11 (1995); Weiss et al., Exp. Hematol 23:99-107. TATA
20

WO 2006/033859 PCT/US2005/032157
box binding protein (TBP) and its associated TAF polypeptides (which include
TAF30, TAF55, TAF80, TAF110, TAF150, and TAF250) are described in Goodrich
&Tjian, Curr. Opin. Cell Biol. 6:403-9 (1994) and Hurley, Curr. Opin. Struct. Biol.
6:69-75 (1996). The STAT family of transcription factors are reviewed in, for
example, Barahmand-Pour et al., Curr. Top. Microbiol. Immunol. 211:121-8 (1996).
Transcription factors involved in disease are reviewed in Aso et al., J. Clin. Invest.
97:1561-9 (1996).
In one embodiment, the KOX repression domain and/or the KRAB repression
domain from the human KOX-1 protein is used as a transcriptional repressor. Thiesen
et al., New Biologist 2:363-374 (1990); Margolin et al., PNAS 91:4509-4513 (1994);
Pengue et al., Nucl. Acids Res. 22:2908-2914 (1994); Witzgall et al, PNAS 91:4514-
4518 (1994). In another embodiment, KAP-1, a KRAB co-repressor, is used with
KRAB or KOX. Friedman et al., Genes Dev. 10:2067-2078 (1996). Alternatively,
KAP-1 can be used alone as a functional domain. Other preferred transcription
factors and transcription factor domains that act as transcriptional repressors include
MAD (see, e.g., Sommer et al., J. Biol. Chem. 273:6632-6642 (1998); Gupta et al.,
Oncogene 16:1149-1159 (1998); Queva et al., Oncogene 16:967-977 (1998); Larsson
et al., Oncogene 15:737-748 (1997); Laherty et al., Cell 89:349-356 (1997); and
Cultraro et al., Mol Cell. Biol. 17:2353-2359 (19977)); FKHR (forkhead in
rhapdosarcoma gene; Ginsberg et al., Cancer Res. 15:3542-3546 (1998); Epstein et
al, Mol. Cell Biol. 18:4118-4130 (1998)); EGR-1 (early growth response gene
product-1: Yan et al., PNAS 95:8298-8303 (1998); and Liu et al., Cancer Gene Ther.
5:3-28 (1998)); the ets2 repressor factor repressor domain (ERD; Sgouras et al.,
EMBOJ. 14:4781-4793 ((19095)); and the MAD smSIN3 interaction domain (SID;
Ayer et al., Mol. Cell. Biol. 16:5772-5781 (1996)).
In one embodiment, the HSV VP16 activation domain is used as a
transcriptional activator (see, e.g., Hagmann et al., J. Virol. 71:5952-5962 (1997)).
Other transcription factors from which activation domains can be obtained include
nuclear hormone receptors (see, e.g., Torchia et al., Curr. Opin. Cell. Biol. 10:373-
383 (1998)); the p65 subunit of nuclear factor kappa B (Bitko & Barik, J. Virol.
72:5610-5618 (1998) and Doyle & Hunt, Neuroreport 8:2937-2942 (1997)); and
EGR-1 (early growth response gene product-1; Yan et al., PNAS 95:8298-8303
(1998); and Liu et al., Cancer Gene Ther. 5:3-28 (1998)). An additional synthetic
21

WO 2006/033859 PCT/US2005/032157
activation domain is the VP64 activation domain (Seipel et al., EMBO J. 11:4961-
4968 (1996)).
Kinases, phosphatases, methylases, demethylases, acetylases, deacetylases,
and other proteins that modify polypeptides involved in gene regulation are also
useful as regulatory domains. Such modifiers are often involved in switching on or
off transcription mediated by, for example, hormones. Kinases involved in
transcription regulation are reviewed in Davis, Mol. Reprod. Dev. 42:459-67 (1995),
Jackson et al., Adv. Second Messenger Phosphoprotein Res. 28:279-86 (1993), and
Boulikas, Crit. Rev. Eukaryot. Gene Expr. 5:1-77 (1995), while phosphatases are
reviewed in, for example, Schonthal & Sernin, Cancer Biol. 6:239-48 (1995).
Nuclear tyrosine kinases are described in Wang, Trends Biochem. Sci. 19:373-6
(1994).
As described, useful domains can also be obtained from fixe gene products of
oncogenes (e.g., myc, jun, fos, myb, max, mad, rel, ets, bcl, myb, mos family
members) and their associated factors and modifiers. Oncogenes are described in, for
example, Cooper, Oncogenes, 2nd ed., The Jones and Bartlett Series in Biology,
Boston, MA, Jones and Bartlett Publishers, 1995. The ets transcription factors are
reviewed in Waslylke? et al., Eur. J. Biochem. 211:7-18 (1993) and Crepieux et al.,
Crit. Rev. Oncog. 5:615-38 (1994). Myc oncogenes are reviewed in, for example,
Ryan et al., Biochem. J. 314:713-21 (1996). The jun and fos transcription factors are
described in, for example, The Fos and Jun Families of Transcription Factors, Angel
& Herrlich, eds. (1994). The max oncogene is reviewed in Hurlin et al., Cold Spring
Harb. Symp. Quant. Biol. 59:109-16. The myb gene family is reviewed in Kanei-Ishii
et al., Curr. Top. Microbiol Immunol. 211:89-98 (1996). The mos family is reviewed
in Yew et al., Curr. Opin. Genet. Dev. 3:19-25 (1993).
Regulatory domains can also be obtained from DNA repair enzymes and their
associated factors and modifiers. These include, for example, nucleases (exo- and
endo-), recombinases, helicases, integrases, polymerases and single-stranded DNA-
binding proteins (SSBs). DNA repair systems are reviewed in, for example, Vos,
Curr. Opin. Cell Biol. 4:385-95 (1992); Sancar, Ann. Rev. Genet. 29:69-105 (1995);
Lehmann, Genet. Eng. 17:1-19 (1995); and Wood, Ann. Rev. Biochem. 65:135-67
(1996). DNA rearrangement enzymes and their associated factors and modifiers can
also be used as regulatory domains (see, e.g., Gangloff et al., Experientia 50:261-9
(1994); Sadowski, FASEB J. 7:760-7 (1993)).
22

WO 2006/033859 PCT/US2005/032157
Similarly, regulatory domains can be derived from DNA modifying enzymes
(e.g., DNA methyltransferases, topoisomerases, helicases, ligases, kinases,
phosphatases, polymerases) and their associated factors and modifiers. Helicases are
reviewed in Matson et al., Bioessays, 16:13-22 (1994), and methyltransferases are
described in Cheng, Curr. Opin. Struct. Biol. 5:4-10 (1995). Chromatin associated
proteins and their modifiers (e.g., kinases, acetylases and deacetylases), such as
histone deacetylase (Wolffe, Science 272:371-2 (1996)) are also useful functional
domains, In one embodiment, the regulatory domain is a DNA methyl transferase
that acts as a transcriptional repressor (see, e.g., Van den Wyngaert et al., FEBS Lett.
426:283-289 (1998); Flynn et al., J. Mol Biol. 279:101-116 (1998); Okano et al.,
Nucleic Acids Res. 26:2536-2540 (1998); and Zardo & Caiafa, J. Biol. Chem.
273:16517-16520 (1998)). In another embodiment, endonucleases such as Fokl
provide functional domains to catalyze targeted DNA cleavage, which facilitates
processes such as transcriptional repression and homologous recombination. See,
e.g., U. S. Patents 5,436,150; 5,792,640 and 6,265,196; U.S. Patent Application
Publication No. 2003/0232410 and WO 03/87341.
Factors that control chromatin and DNA structure, movement and localization
and their associated factors and modifiers; factors derived from microbes (e.g.,
prokaryotes, eukaryotes and virus) and factors that associate with or modify them can
also be used to obtain functional domains for the construction of chimeric proteins or
fusion molecules. In one embodiment, recombinases and integrases are used as
regulatory domains. In another embodiment, histone acetyltransferase is used as a
transcriptional activation domain (see, e.g., Jin & Scotto, Mol. Cell. Biol. 18:4377-
4384 (1998); Wolffe, Science 272:371-372 (1996); Taunton et al., Science 272:408-
411 (1996); and Hassig et al., PNAS 95:3519-3524 (1998)). In another embodiment,
histone deacetylase is used as a transcriptional repression domain (see, e.g., Jin &
Scotto, Mol Cell. Biol. 18:4377-4384 (1998); Syntichaki & Thireos, J. Biol Chem.
273:24414-24419 (1998); Sakaguchi et al., Genes Dev. 12:2831-2841 (1998); and
Martinez et al., J. Biol. Chem. 273:23781-23785 (1998)).
Another suitable repression domain is methyl binding domain protein 2B
(MBD-2B) (see, also Hendrich et al. (1999) Mamm Genome 10:906-912 for
description of MBD proteins). Another useful repression domain is that associated
with the v-ErbA protein. See, for example, Damm, et al. (1989) Nature 339:593-597;
Evans (1989) Int. J. Cancer Suppl. 4:26-28; Pain et al. (1990) New Biol. 2:284-294;
23

WO 2006/033859 PCT/US2005/032157
Sap et al. (1989) Nature 340:242-244; Zenke et al. (1988) Cell 52:107-119; and
Zenke et al. (1990) Cell 61:1035-1049. Additional exemplary repression domains
include, but are not limited to, thyroid hormone receptor (TR, see infra), SID, MBD1,
MBD2, MBD3, MBD4, MBD-like proteins, members of the DNMT family (e.g.,
DNMT1, DNMT3A, DNMT3B), Rb, MeCP1 and MeCP2. See, for example, Bird et
al. (1999) Cell 99:451-454; Tyler et al. (1999) Cell 99:443-446; Knoepfler et al.
(1999) Cell 99:447-450; and Robertson et al. (2000) Nature Genet. 25:338-342.
Additional exemplary repression domains include, but are not limited to, ROM2 and
AtHD2A. See, for example, Chern et al. (1996) Plant Cell 8:305-321; and Wu et al.
(2000) Plant J. 22:19-27.
Certain members of the nuclear hormone receptor (NHR) superfamily,
including, for example, thyroid hormone receptors (TRs) and retinoic acid receptors
(RARs) are among the most potent transcriptional regulators currently known. Zhang
et al., Annu. Rev. Physiol 62:439-466 (2000) and Sucov et al., Mol Neurobiol 10(2-
3):169-184 (1995). In the absence of their cognate ligand, these proteins bind with
high specificity and affinity to short stretches of DNA (e.g., 12-17 base pairs) within
regulatory loci (e.g., enhancers and promoters) and effect robust transcriptional
repression of adjacent genes. The potency of their regulatory action stems from the
concurrent use of two distinct functional pathways to drive gene silencing: (i) the
creation of a localized domain of repressive chromatin via the targeting of a complex
between the corepressor N-CoR and a histone deacetylase, HDAC3 (Guenther et al.,
Genes Dev 14:1048-1057 (2000); Urnov et al, EMBO J 19:4074-4090 (2000); Li et
al., EMBO J 19, 4342-4350 (2000) and Underhill et al., J. Biol. Chem. 275:40463-
40470 (2000)) and (ii) a chromatin-independent pathway (Urnov et al., supra) that
may involve direct interference with the function of the basal transcription machinery
(Fondell et al., Genes Dev 7(7B):1400-1410 (1993) and Fondell et al.,Mol Cell Biol
16:281-287 (1996).
In the presence of very low (e.g., nanomolar) concentrations of their ligand,
these receptors undergo a conformational change which leads to the release of
corepressors, recruitment of a different class of auxiliary molecules (e.g.,
coactivators) and potent transcriptional activation. Collingwood et al., J. Mol.
Endocrinol. 23(3):255-275 (1999).
The portion of the receptor protein responsible for transcriptional control (e.g.,
repression and activation) can be physically separated from the portion responsible for
24

WO 2006/033859 PCT/US2005/032157
DNA binding, and retains full functionality when tethered to other polyp eptides, for
example, other DNA-binding domains. Accordingly, a nuclear hormone receptor
transcription control domain can be fused to a DNA-binding domain (e.g., a zinc
finger protein) such that the transcriptional regulatory activity of the receptor can be
targeted to a chromosomal region of interest (e.g., a gene) by virtue of the DNA-
binding domain.
Moreover, the structure of TR and other nuclear hormone receptors can be
altered, either naturally or through recombinant techniques, such that it loses all
capacity to respond to hormone (thus losing its ability to drive transcriptional
activation), but retains the ability to effect transcriptional repression. This approach is
exemplified by the transcriptional regulatory properties of the oncoprotein v-ErbA.
The v-ErbA protein is one of the two proteins required for leukemic transformation of
immature red blood cell precursors in young chicks by the avian erythroblastosis
virus. TR is a major regulator of erythropoiesis (Beug et al., Biochim Biophys Acta
1288(3):M35-47 (1996); in particular, in its unliganded state, it represses genes
required for cell cycle arrest and the differentiated state. Thus, the administration of
thyroid hormone to immature erythroblasts leads to their rapid differentiation. The v-
ErbA oncoprotein is an extensively mutated version of TR; these mutations include:
(i) deletion of 12 amino-terminal amino acids; (ii) fusion to the gag oncoprotein; (iii)
several point mutations in the DNA binding domain that alter the DNA binding
specificity of the protein relative to its parent, TR, and impair its ability to
heterodimerize with the retinoid X receptor; (iv) multiple point mutations in the
ligand-binding domain of the protein that effectively eliminate the capacity to bind
thyroid hormone; and (v) a deletion of a carboxy-terminal stretch of amino acids that
is essential for transcriptional activation. Stunnenberg et al., Biochim Biophys Acta
1423(1):F15-33 (1999). As a consequence of these mutations, v-ErbA retains the
capacity to bind to naturally occurring TR target genes and is an effective
transcriptional repressor when bound (Urnov et al., supra; Sap et al., Nature
340:242-244(1989); and Ciana et al., EMBO J. 17(24):7382-7394 (1999). In
contrast to TR, however, v-ErbA is completely insensitive to thyroid hormone, and
thus maintains transcriptional repression in the presence of thyroid hormones or
retinoids.
Accordingly, in one aspect, v-ErbA or its functional fragments are used as a
repression domain. In additional embodiments, TR or its functional domains are used
25

WO 2006/033859 PCT/US2005/032157
as a repression domain in the absence of ligand and/or as an activation domain in the
presence of ligand (e.g., 3,5,3'-triiodo-L-thyronine or T3). Thus, TR can be used as a
switchable functional domain (i.e., a bifunctional domain); its activity (activation or
repression) being dependent upon the presence or absence (respectively) of ligand.
Additional exemplary repression domains are obtained from the DAX protein
and its functional fragments. Zazopoulos et al., Nature 390:311-315 (1997). In
particular, the C-terminal portion of DAX-1, including amino acids 245-470, has been
shown to possess repression activity. Altincicek et al., J. Biol. Chem. 275:7662-7667
(2000). A further exemplary repression domain is the RBP1 protein and its functional
fragments. Lai et al., Oncogene 18:2091-2100 (1999); Lai et al., Mol. Cell. Biol.
19:6632-6641 (1999); Lai et al., Mol. Cell. Biol. 21:2918-2932 (2001) and WO
01/04296. The full-length RBP1 polypeptide contains 1257 amino acids. Exemplary
functional fragments of RBP1 are a polypeptide comprising amino acids 1114-1257,
and a polypeptide comprising amino acids 243-452.
Members of the TIBG family of transcription factors contain three repression
domains known as Rl, R2 and R3. Repression by TIEG family proteins is achieved
at least in part through recruitment of mSIN3A histone deacetylases complexes.
Cook et al. (1999) J. Biol. Chem. 274:29,500-29,504; Zhang et al. (2001) Mol. Cell.
Biol. 21:5041-5049. Any or all of these repression domains (or their functional
fragments) can be fused alone, or in combination with additional repression domains
(or their functional fragments), to a DNA-binding domain to generate a targeted
exogenous repressor molecule.
Furthermore, the product of the human cytomegalovirus (HCMV) UL34 open
reading frame acts as a transcriptional repressor of certain HCMV genes, for example,
the US3 gene. LaPierre et al. (2001) J. Virol. 75:6062-6069. Accordingly, the UL34
gene product, or functional fragments thereof, can be used as a component of a fusion
polypeptide also comprising a zinc finger binding domain. Nucleic acids encoding
such fusions are also useful in the methods and compositions disclosed herein.
Yet another exemplary repression domain is the CDF-1 transcription factor
and/or its functional fragments. See, for example, WO 99/27092.
The Ikaros family of proteins are involved in the regulation of lymphocyte
development, at least in part by transcriptional repression. Accordingly, an Ikaros
family member (e.g., Ikaros, Aiolos) or a functional fragment thereof, can be used as
26

WO 2006/033859 PCT/US2005/032157
a repression domain. See, for example, Sabbattini et al. (2001) EMBO J. 20:2812-
2822.
The yeast Ashlp protein comprises a transcriptional repression domain.
Maxon et al. (2001) Proc. Natl. Acad. Sci. USA 98:1495-1500. Accordingly, the
Ashlp protein, its functional fragments, and homologues of Ashlp, such as those
found, for example, in, vertebrate, mammalian, and plant cells, can serve as a
repression domain for use in the methods and compositions disclosed herein.
Additional exemplary repression domains include those derived from histone
deacetylases (HDACs, e.g., Class I HDACs, Class II HDACs, SIR-2 homologues),
HDAC-interacting proteins (e.g., SIN3, SAP30, SAP15, NCoR, SMRT, RB, p107,
p130, RBAP46/48, MTA, Mi-2, Brgl, Brm), DNA-cytosine methyltransferases (e.g.,
Dnmt1, Dnmt3a, Dnmt3b), proteins that bind methylated DNA (e.g., MBD1, MBD2,
MBD3, MBD4, MeCP2, DMAP1), protein methyltransferases (e.g., lysine and
arginine methylases, SuVar homologues such as Suv39H1), polycomb-type repressors
(e.g., Bmi-1, eed1, RING1, RYBP, E2F6, Mel18, YY1 and CtBP), viral repressors
(e.g., adenovirus Elb 55K protein, cytomegalovirus UL34 protein, viral oncogenes
such as v-erbA), hormone receptors (e.g., Dax-1, estrogen receptor, thyroid hormone
receptor), and repression domains associated with naturally-occurring zinc finger
proteins (e.g., WT1, KAP1). Further exemplary repression domains include members
of the polycomb complex and their homologues, HPH1, HPH2, HPC2, NC2, groucho,
Eve, tramtrak, mHP1, SIP1, ZEB1, ZEB2, and Enxl/Ezh2. In all of these cases,
either the full-length protein or a functional fragment can be used as a repression
domain for fusion to a zinc finger binding domain. Furthermore, any homologues of
the aforementioned proteins can also be used as repression domains, as can proteins
(or their functional fragments) that interact with any of the aforementioned proteins.
Additional repression domains, and exemplary functional fragments, are as
follows. Hesl is a human homologue of the Drosophila hairy gene product and
comprises a functional fragment encompassing ammo acids 910-1014. In particular, a
WRPW (trp-arg-pro-trp) motif can act as a repression domain. Fisher et al. (1996)
Mol. Cell. Biol. 16:2670-2677.
The TLE1, TLE2 and TLE3 proteins are human homologues of the
Drosophila groucho gene product. Functional fragments of these proteins possessing
repression activity reside between amino acids 1-400. Fisher et al., supra.
27

WO 2006/033859 PCT/US2005/032157
The Tbx3 protein possesses a functional repression domain between amino
acids 524-721. He et al. (1999) Proc. Natl. Acad. Sci. USA 96:10,212-10,217. The
Tbx2 gene product is involved in repression of the p14/p16 genes and contains a
region between amino acids 504-702 that is homologous to the repression domain of
Tbx3; accordingly Tbx2 and/or this functional fragment can be used as a repression
domain. Carreira et al. (1998) Mol. Cell. Biol. 18:5,099-5,108.
The human Ezh2 protein is a homologue of Drosophila enhancer of zeste and
recruits the eed1 polycomb-type repressor. A region of the Ezh2 protein comprising
amino acids 1-193 can interact with eed1 and repress transcription; accordingly Ezh2
and/or this functional fragment can be used as a repression domain. Denisenko et al.
(1998) Mol. Cell. Biol. 18:5634-5642.
The RYBP protein is a corepressor that interacts with polycomb complex
members and with the YY1 transcription factor. A region of RYBP comprising
amino acids 42-208 has been identified as functional repression domain. Garcia et al.
(1999)EMBO J. 18:3404-3418.
The RING finger protein RING1A is a member of two different vertebrate
polycomb-type complexes, contains multiple binding sites for various components of
the polycomb complex, and possesses transcriptional repression activity.
Accordingly, RTNG1A or its functional fragments can serve as a repression domain.
Satjin et al. (1997) Mol. Cell. Biol. 17:4105-4113.
The Bmi-1 protein is a member of a vertebrate polycomb complex and is
involved in transcriptional silencing. It contains multiple binding sites for various
polycomb complex components. Accordingly, Bmi-1 and its functional fragments are
useful as repression domains. Gunster et al. (1997) Mol. Cell., Biol. 17:2326-2335;
Hemenway et al. (1998) Oncogene 16:2541-2547.
The E2F6 protein is a member of the mammalian Bmi-1 -containing polycomb
complex and is a transcriptional repressor that is capable or recruiting RYBP, Bmi-1
and RING1A. A functional fragment of E2F6 comprising amino acids 129-281 acts
as a transcriptional repression domain. Accordingly, E2F6 and its functional
fragments can be used as repression domains. Trimarchi et al. (2001) Proc. Natl.
Acad. Sci. USA 98:1519-1524.
The eed1 protein represses transcription at least in part through recruitment of
histone deacetylases (e.g., HDAC2). Repression activity resides in both the N- and C-
28

WO 2006/033859 PCT/US2005/032157
terminal regions of the protein. Accordingly, eed1 and its functional fragments can be
used as repression domains, van der Vlag et al. (1999) Nature Genet. 23:474-478.
The CTBP2 protein represses transcription at least in part through recruitment
of an HPC2-polycomb complex. Accordingly, CTBP2 and its functional fragments
are useful as repression domains. Richard et al. (1999) Mol. Cell. Biol. 19:777-787.
Neuron-restrictive silencer factors are proteins that repress expression of
neuron-specific genes. Accordingly, a NRSF or functional fragment thereof can serve
as a repression domain. See, for example, US Patent No. 6,270,990.
Additional repression domains include PLZF, BCL-6, BAZF, ZNF274, PRH,
TEL, TGIF, and G9A.
It will be clear to those of skill in the art that, in the formation of a fusion
protein (or a nucleic acid encoding same) between a DNA-binding domain and a
functional domain, either a repressor or a molecule that interacts with a repressor is
suitable as a functional domain. Essentially any molecule capable of recruiting a
repressive complex and/or repressive activity (such as, for example, histone
deacetylation) to the target gene is useful as a repression domain of a fusion protein.
Additional exemplary activation domains include, but are not limited to, p300,
CBP, PCAF, SRC1 PvALF, AtHD2A and ERF-2. See, for example, Robyr et al.
(2000) Mol. Endocrinol. 14:329-347; Collingwood et al. (1999) J. Mol. Endocrinol.
23:255-275; Leo.et al. (2000) Gene 245:1-11; Manteuffel-Cymborowska (1999)
Acta Biochim. Pol. 46:77-89; McKenna et al. (1999) J. Steroid Biochem. Mol. Biol.
69:3-12; Malik et al. (2000) Trends Biochem. Sci. 25:277-283; and Lemon et al.
(1999) Curr. Opin. Genet. Dev. 9:499-504. Additional exemplary activation domains
include, but are not limited to, OsGAI, HALF-1, C1, AP1, ARF-5, -6, -7, and -8,
CPRF1, CPRF4, MYC-RP/GP, and TRAB1. See, for example, Ogawa et al. (2000)
Gene 245:21-29; Okanami et al. (1996) Genes Cells 1:87-99; Goff et al. (1991)
Genes Dev. 5:298-309; Cho et al. (1999) Plant Mol. Biol. 40:419-429; Ulmason et
al. (1999) Proc. Natl. Acad. Sci. USA 96:5844-5849; Sprenger-Haussels et al. (2000)
Plant J. 22:1-8; Gong et al. (1999) Plant Mol. Biol. 41:33-44; and Hobo et al. (1999)
Proc. Natl. Acad. Sci. USA 96:15,348-15,353.
Additional transcriptional activation domains can be obtained from the
following proteins: ATF2, myc, GATA-1, GATA-3, NF-E2, Octl, CTF1, Spl, GR-
AF1, a zeste deletion, HSF-1, p53, myoD, and CARβ.
29

WO 2006/033859 PCT/US2005/032157
It will be clear to those of skill in the art that, in the formation of a fusion
protein (or a nucleic acid encoding same) between a DNA-binding domain and a
functional domain, either an activation domain or a molecule that interacts with an
activation domain is suitable as a functional domain. Essentially any molecule
capable of recruiting an activating complex and/or activating activity (such as, for
example, histone acetylation) to the target gene is useful as an activating domain of a
fusion protein.
Insulator domains, localization domains, and chromatin remodeling proteins
such as ISWI-containing domains and/or methyl binding domain proteins suitable for
use as functional domains in fusion molecules are described, for example, in co-
owned U.S. Patent Applications 2002/0115215 and 2003/0082552 and in co-owned
WO 02/44376.
In a further embodiment, a DNA-binding domain (e.g., a zinc finger domain)
is fused to a bifunctional domain (BFD). A bifunctional domain is a transcriptional
regulatory domain whose activity depends upon interaction of the BFD with a second
molecule. The second molecule can be any type of molecule capable of influencing
the functional properties of the BFD including, but not limited to, a compound, a
small molecule, a peptide, a protein, a polysaccharide or a nucleic acid. An
exemplary BFD is the ligand binding domain of the estrogen receptor (ER). In the
presence of estradiol, the ER ligand binding domain acts as a transcriptional activator;
while, in the absence of estradiol and the presence of tamoxifen or 4-hydroxy-
tamoxifen, it acts as a transcriptional repressor. Another example of a BFD is the
thyroid hormone receptor (TR) ligand binding domain which, in the absence of
ligand, acts as a transcriptionai repressor and in the presence of thyroid hormone (T3),
acts as a transcriptional activator. An additional BFD is the glucocorticoid receptor
(GR) ligand binding domain. In the presence of dexamethasone, this domain acts as a
transcriptional activator; while, in the presence of RU486, it acts as a transcriptional
repressor. An additional exemplary BFD is the ligand binding domain of the retinoic
acid receptor. In the presence of its ligand all-trans-retinoic acid, the retinoic acid
receptor recruits a number of co-activator complexes and activates transcription. In
the absence of ligand, the retinoic acid receptor is not capable of recruiting
transcriptional co-activators. Additional BFDs are known to those of skill in the art.
See, for example, US Patent Nos. 5,834,266 and 5,994,313 and PCT WO 99/10508.
30

WO 2006/033859 PCT/US2005/032157
Another class of functional domain derived from nuclear receptors are those
whose functional activity is regulated by a non-natural ligand. These are often
mutants or modified versions of naturally-occurring receptors and are sometimes
referred to as "switchable" domains. For example, certain mutants of the
progesterone receptor (PR) are unable to interact with their natural ligand, and are
therefore incapable of being transcriptionally activated by progesterone. Certain of
these mutants, however, can be activated by binding small molecules other than
progesterone (one example of which is the antiprogestin mifepristone). Such non-
natural but functionally competent ligands have been denoted anti-hormones. See,
e.g., U.S. Patents 5,364,791; 5,874,534; 5,935,934; Wang et al., (1994) Proc. Natl.
Acad. Sci. USA 91:8180-8184; Wang et al. (1997) Gene Ther. 4:432-441.
Accordingly, a fusion comprising a targeted DNA-binding domain (e.g., ZFP),
a functional domain, and a mutant PR ligand binding domain of this type can be used
for mifepristone-dependent activation or repression of an endogenous gene of choice,
by designing or selecting the DNA-binding domain such that it binds in or near the
gene of choice. Such fusions can further comprise a degron domain. If the fusion
contains an activation domain, mifepristone-dependent activation of gene expression
is obtained; if the fusion contains a repression domain, mifepristone-dependent
repression of gene expression is obtained. Additionally, polynucleotides encoding
such fusion proteins are provided, as are vectors comprising such polynucleotides and
cells comprising such polynucleotides and vectors. It will be clear to those of skill in
the art that modified or mutant versions of receptors other than PR can also be used as
switchable domains. See, for example, Tora et al. (1989) EMBO J. 8:1981-1986.
Host cells, vectors and promoters
In the practice of the disclosed methods, an engineered zinc finger protein, as
described above, is expressed in a cell and binds to a target site to regulate
transcription. A protein can be expressed in a cell, e.g., by delivering the protein to
the cell or by delivering a polynucleotide encoding the protein to a cell. If a DNA
molecule encoding the protein is delivered to a cell, it is transcribed into a mRNA
molecule which can be translated to produce the protein. Alternatively, a RNA
molecule can be delivered to a cell and is translated to generate the protein. Methods
for polynucleotide and polypeptide delivery to cells are known in the art and
exemplary methods are presented elsewhere in this disclosure. Expression of a
31

WO 2006/033859 PCT/US2005/032157
protein in a cell can also be accomplished by transfecting a cell with a nucleic acid
encoding the protein, and selecting a cell line in which the nucleic acid has stably
integrated into a chromosome or is otherwise stably and heritably maintained in the
cell. In these cases, expression of the protein encoded by the stably-maintained
sequences can be inducible (e.g., tetracycline- and doxycycline-regulated systems) or
constitutive.
The disclosed compositions and methods can be used for the transcriptional
regulation of any nucleotide sequence in any cell, including cultured cells, primary
cells, cells in an organism and cells removed from an organism which are then
returned to the organism after delivery of a protein or a polynucleotide encoding the
protein to a cell. In this regard, transcription of both endogenous and exogenous
sequences can be regulated. To regulate transcription of endogenous sequences, one
or more copies of SEQ ID NO: 1 can be inserted into or in the vicinity of the
endogenous sequence to be regulated. Exemplary methods for targeted insertion of
exogenous sequences into cellular genomes are disclosed in U.S. Patent Application
Publication No. 2003/0232410 (Dec. 18,2003), PCT Publication WO 03/87341 (Oct.
23, 2003) and co-owned U.S. Patent Application Serial No. 10/912,932, filed on
August 6, 2004, the disclosures of which are hereby incorporated by reference in their
entireties for all purposes.
As noted, the disclosed methods and compositions can also be used to regulate
transcription of exogenous sequences (e.g., a vector comprising a cDNA sequence
which is introduced into a cell). For example, a cDNA can be cloned into a vector
containing a promoter, e.g., a SRα promoter, such that transcription of the cDNA
sequences is controlled by the promoter. In these cases, the cDNA-containing vector
is introduced into a cell, and an engineered ZFP as disclosed herein (or a
polynucleotide encoding an engineered ZFP as disclosed herein) is introduced into the
same cell. Alternatively, the cDNA is introduced into a stable cell line containing an
integrated (or otherwise stably maintained) copy of a polynucleotide encoding an
engineered ZFP as disclosed herein. In another alternative, an engineered ZFP or a
polynucleotide encoding an engineered ZFP is introduced into a cell containing an
integrated (or otherwise stably maintained) copy of a polynucleotide sequence
comprising a cDNA.
In cells containing stably integrated sequences encoding an engineered ZFP, a
number of exogenous transcription units, each containing one or more target sites for
32

WO 2006/033859 PCT/US2005/032157
the ZFP, can be introduced (e.g., by transfection) to obtain coordinate regulation of
the exogenous transcription units. Also in such cells, one or more target sites can be
introduced upstream, within or adjacent to any number of endogenous transcription
units, to obtain coordinate regulation of multiple genes.
Suitable vectors for the propagation of nucleotide sequences that are
exogenous to a cell, and promoters for regulation of the transcription of such
exogenous nucleotide sequences, are known in the art. Exemplary promoters include
the SRα and CMV promoters. Regulation of transcription of sequences operatively
linked to a SRα promoter can be accomplished directly using the disclosed proteins,
since a copy of the target site SEQ ID NO:1 is present in the SRα promoter. Higher
levels of transcription (e.g., for overexpression of proteins) are achieved by placing
additional copies of SEQ ID NO:1 into the SRα promoter. Furthermore, modification
of a promoter other than SRα (e.g., the CMV promoter), by insertion of one or more
copies of SEQ ID NO:1, leads to expression levels that are higher than those obtained
with the unmodified promoter.
The disclosed methods and compositions can be used in any type of cell
including, but not limited to, prokaryotic cells, fungal cells, Archaeal cells, plant cells,
insect cells, animal cells, vertebrate cells, mammalian cells and human cells. Suitable
cell lines for protein expression are known to those of skill in the art and include, but
are not limited to COS, CHO (e.g., CHO-S, CHO-K1, CHO-DG44, CHO-DUXB11),
VERO, MDCK, WI38, V79, B14AF28-G3, BHK, HaK, NSO, SP2/0-Ag14, HeLa,
HEK293 (e.g., HEK293-F, HEK293-H, HEK293-T), perC6, insect cells such as
Spodoptera fugiperda (Sf), and fungal cells such as Saccharomyces, Pischia and
Schizosaccharomyces. Progeny, variants and derivatives of these cell lines can also
be used.
Exemplary promoters include SRa, CMV, phosphoglycerate kinase (PGK),
human ubiquitin C (UBC), elongation factor1α (EF-1α), herpes thymidine kinase
(TK), SV40 early and late promoters, human keratin-14 (K14) and Rous Sarcoma
virus (RSV).
In certain embodiments, two or more promoters can be regulated with the
same engineered ZFP, provided that one or more target sites for the ZFP is present in
or near each promoter. This is particularly useful for recombinant production of
antibodies, as sequences encoding a heavy chain can be placed under the
transcriptional control of a first promoter and sequences encoding a light chain can be
33

WO 2006/033859 PCT/US2005/032157
placed under the transcriptional control of a second promoter. The two promoters can
be the same, or they can be different promoters. In additional embodiments, two
copies of the same promoter, each containing a different number of ZFP target sites,
can be used to obtain differential expression of sequences operatively linked to each
promoter. Two different promoters, each containing a different number of target
sites, can also be used.
It will be clear to those of skill in the art that it is possible to design a ZFP to
bind to any predetermined target sequence; and therefore is possible to use one or
more copies of such a sequence, along with a ZFP the binds to the sequence, to
regulate transcription for e.g. overexpression.
Fusion nucleic acids and expression vectors
In certain embodiments, a fusion polypeptide is encoded by a fusion nucleic
acid. In such cases, the nucleic acid can be cloned into intermediate vectors for
transformation into prokaryotic or eukaryotic cells for replication and/or expression.
Intermediate vectors for storage or manipulation of the fusion nucleic acid or
production of fusion protein can be prokaryotic vectors, (e.g., plasmids), shuttle
vectors, insect vectors, or viral vectors for example. A fusion nucleic acid can also
cloned into an expression vector, for administration to a bacterial cell, fungal cell,
protozoal cell, plant cell, or animal cell, preferably a mammalian cell, more preferably
a human cell.
A nucleic acid encoding a fusion protein can be cloned into a vector for
transformation into prokaryotic or eukaryotic cells for replication and/or expression.
Vectors can be prokaryotic vectors, e.g., plasmids, or shuttle vectors, insect vectors,
or eukaryotic vectors. A nucleic acid encoding a ZFP can also be cloned into an
expression vector, for administration to a plant cell, animal cell, preferably a
mammalian cell or a human cell, fungal cell, bacterial cell, or protozoal cell.
To obtain expression of a cloned gene or nucleic acid, sequences encoding a
fusion protein are typically subcloned into an expression vector that contains a
promoter to direct transcription. Suitable bacterial and eukaryotic promoters are well
known in the art and described, e.g., in Sambrook et al., Molecular Cloning, A
Laboratory Manual (2nd ed. 1989; 3rd ed., 2001); Kriegler, Gene Transfer and
Expression: A Laboratory Manual (1990); and Current Protocols in Molecular
Biology (Ausubel et al., supra. Bacterial expression systems for expressing the ZFP
34

WO 2006/033859 PCT/US2005/032157
are available in, e.g., E. coli, Bacillus sp., and Salmonella (Palva et al., Gene 22:229-
235 (1983)). Kits for such expression systems are commercially available.
Eukaryotic expression systems for mammalian cells, yeast, and insect cells are well
known by those of skill in the art and are also commercially available.
The promoter used to direct expression of a protein-encoding nucleic acid
depends on the particular application. For example, a strong constitutive promoter is
typically used for expression and purification of protein. In contrast, when a protein
is administered in vivo for gene regulation, either a constitutive or an inducible
promoter is used, depending on the particular use of the protein. In addition, a
preferred promoter for administration of a protein can be a weak promoter, such as
HSV TK or a promoter having similar activity. The promoter typically can also
include elements that are responsive to transactivation, e.g., hypoxia response
elements, Gal4 response elements, lac repressor response element, and small molecule
control systems such as tet-regulated systems and the RU-486 system (see, e.g.,
Gossen & Bujard, PNAS 89:5547 (1992); Oligino et al., Gene Ther. 5:491-496
(1998); Wang et al., Gene Ther. 4:432-441 (1997); Neering et al., Blood 88:1147-
1155 (1996); and Rendahl et al., Nat. Biotechnol. 16:757-761 (1998)).
In addition to the promoter, the expression vector typically contains a
transcription unit or expression cassette that contains all the additional elements
required for the expression of the nucleic acid in host cells, either prokaryotic or
eukaryotic. A typical expression cassette thus contains a promoter operably linked,
e.g., to a nucleic acid sequence encoding the ZFP, and signals required, e.g., for
efficient polyadenylation of the transcript, transcriptional termination, ribosome
binding sites, or translation termination. Additional elements of the cassette may
include, e.g., enhancers, and heterologous splicing signals.
The particular expression vector used to transport the genetic information into
the cell is selected with regard to the intended use of the protein, e.g., expression in
plants, animals, bacteria, fungus, protozoa, etc. (see expression vectors described
below). Standard bacterial expression vectors include plasmids such as pBR322-
based plasmids, pSKF, pET23D, and commercially available fusion expression
systems such as GST and LacZ. An exemplary fusion protein is the maltose binding
protein, "MBP." Such fusion proteins facilitate purification of the protein. Epitope
tags, e.g., c-myc, hemagglutinin (HA) or FLAG, can also be added to recombinant
35

WO 2006/033859 PCT/US2005/032157
proteins to provide convenient methods of isolation, for monitoring expression, and
for monitoring cellular and subcellular localization.
Expression vectors containing regulatory elements from eukaryotic viruses are
often used in eukaryotic expression vectors, e.g., SV40 vectors, papilloma virus
vectors, and vectors derived from Epstein-Barr virus. Other exemplary eukaryotic
vectors include pMSG, pAV009/A+, pMTO10/A+, pMAMneo-5, baculovirus
pDSVE, and any other vector allowing expression of proteins under the direction of
the SV40 early promoter, SV40 late promoter, metallothionein promoter, murine
mammary tumor virus promoter, Rous sarcoma virus promoter, polyhedrin promoter,
or other promoters shown effective for expression in eukaryotic cells.
Some expression systems have markers for selection of stably transfected cell
lines such as thymidine kinase, hygromycin B phosphotransferase, and dihydrofolate
reductase. High yield expression systems are also suitable, such as using a
baculovirus vector in insect cells, with a protein coding sequence under the direction
of the polyhedrin promoter or other strong baculovirus promoters.
The elements that are typically included in expression vectors also include a
replicon that functions in E. coli or other prokaryotic bacteria, a gene encoding
antibiotic resistance to permit selection of bacteria that harbor recombinant plasmids,
and unique restriction sites in nonessential regions of the plasmid to allow insertion of
recombinant sequences.
Standard transfection methods are used to produce bacterial, mammalian,
yeast or insect cell lines that express large quantities of protein, which are then
purified using standard techniques (see, e.g., Colley et al., J. Biol. Chem. 264:17619-
17622 (1989); Guide to Protein Purification, in Methods in Enzymology, vol. 182
(Deutscher, ed., 1990)). Transformation of eukaryotic and prokaryotic cells are
performed according to standard techniques (see, e.g., Morrison, J. Bad. 132:349-351
(1977); Clark-Curtiss & Curtiss, Methods in Enzymology 101:347-362 (Wu et al., eds,
1983).
Any of the well known procedures for introducing foreign nucleotide
sequences into host cells may be used. These include the use of calcium phosphate
transfection, polybrene, protoplast fusion, electroporation, liposomes, microinjection,
naked DNA, plasmid vectors, viral vectors, both episomal and integrative, and any of
the other well known methods for introducing cloned genomic DNA, cDNA,
synthetic DNA or other foreign genetic material into a host cell (see, e.g., Sambrook
36

WO 2006/033859 PCT/US2005/032157
et al., supra). It is only necessary that the particular genetic engineering procedure
used be capable of successfully introducing at least one gene into the host cell capable
of expressing the protein of choice.
Nucleic acids encoding fusion proteins and delivery to cells
Conventional viral and non-viral based gene transfer methods can be used to
introduce nucleic acids encoding fusion proteins and/or engineered ZFPs in cells (e.g.,
mammalian cells) and target tissues. Such methods can also be used to administer
nucleic acids encoding fusion proteins to cells in vitro. In certain embodiments,
nucleic acids encoding fusion protiens are administered for in vivo or ex vivo gene
therapy uses. Non-viral vector delivery systems include DNA plasmids, naked
nucleic acid, and nucleic acid complexed with a delivery vehicle such as a liposome
or poloxamer. Viral vector delivery systems include DNA and RNA viruses, which
have either episomal or integrated genomes after delivery to the cell. For a review of
gene therapy procedures, see Anderson, Science 256:808-813 (1992); Nabel &
Feigner, TIBTECH 11:211-217 (1993); Mitani & Caskey, TIBTECH 11:162-166
(1993); Dillon, TIBTECH 11:167-175 (1993); Miller, Nature 357:455-460 (1992);
Van Brunt, Biotechnology 6(10):1149-1154 (1988); Vigne, Restorative Neurology
and Neuroscience 8:35-36 (1995); Kremer & Perricaudet, British Medical Bulletin
51(1):31-44 (1995); Haddada et al., in Current Topics in Microbiology and
Immunology Doerfler and Böhm (eds) (1995); and Yu et al., Gene Therapy 1:13-26
(1994).
Methods of non-viral delivery of nucleic acids encoding engineered ZFPs
include electroporation, lipofection, microinjection, biolistics, virosomes, liposomes,
immunoliposomes, polycation or lipid:nucleic acid conjugates, naked DNA, artificial
virions, and agent-enhanced uptake of DNA. Lipofection is described in e.g., US
5,049,386, US 4,946,787; and US 4,897,355) and lipofection reagents are sold
commercially (e.g., Transfectam™, Lipofectin™, Lipofectamine®). Cationic and
neutral lipids that are suitable for efficient receptor-recognition lipofection of
polynucleotides include those of Feigner, WO 91/17424, WO 91/16024. Delivery can
be to cells (ex vivo administration) or target tissues (in vivo administration).
The preparation of lipid:nucleic acid complexes, including targeted liposomes
such as immunolipid complexes, is well known to one of skill in the art (see, e.g.,
37

WO 2006/033859 PCT/US2005/032157
Crystal, Science 270:404-410 (1995); Blaese et al., Cancer Gene Ther. 2:291-297
(1995); Behr et al., Bioconjugate Chem. 5:382-389 (1994); Remy et al., Bioconjugate
Chem. 5:647-654 (1994); Gao et al., Gene Therapy 2:710-722 (1995); Ahmad et al.,
Cancer Res. 52:4817-4820 (1992); U.S. Pat Nos. 4,186,183,4,217,344, 4,235,871,
4,261,975, 4,485,054,4,501,728,4,774,085,4,837,028, and 4,946,787).
The use of RNA or DNA viral based systems for the delivery of nucleic acids
encoding fusion proteins and/or engineered ZFPs take advantage of highly evolved
processes for targeting a virus to specific cells in the body and trafficking the viral
payload to the nucleus. Viral vectors can be administered directly to patients (in vivo)
or they can be used to treat cells in vitro and the modified cells are administered to
patients (ex vivo). Conventional viral based systems for the delivery of ZFPs include,
but are not, limited to, retroviral, lentivirus, adenoviral, adeno-associated, vaccinia and
herpes simplex virus vectors for gene transfer. Integration in the host genome is
possible with the retrovirus, lentivirus, and adeno-associated virus gene transfer
methods, often resulting in long term expression of the inserted transgene.
Additionally, high transduction efficiencies have been observed in many different cell
types and target tissues.
The tropism of a retrovirus can be altered by incorporating foreign envelope
proteins, expanding the potential target population of target cells. Lentiviral vectors
are retroviral vectors that are able to transduce or infect non-dividing cells and
typically produce high viral titers. Selection of a retroviral gene transfer system
depends on the target tissue. Retroviral vectors are comprised of as-acting long
terminal repeats with packaging capacity for up to 6-10 kb of foreign sequence. The
minimum cis-acting LTRs are sufficient for replication and packaging of the vectors,
which are then used to integrate the therapeutic gene into the target cell to provide
permanent transgene expression. Widely used retroviral vectors include those based
upon murine leukemia virus (MuLV), gibbon ape leukemia virus (GaLV), Simian
Immunodeficiency virus (SIV), human immunodeficiency virus (HTV), and
combinations thereof (see, e.g., Buchscher et al., J. Virol. 66:2731-2739 (1992);
Johann et., J. Virol. 66:1635-1640 (1992); Sommerfelt et al., Virol. 176:58-59
(1990); Wilson et al., J. Virol. 63:2374-2378 (1989); Miller et al., J. Virol. 65:2220-
2224 (1991); PCT/US94/05700).
In applications in which transient expression of a fusion protein is preferred,
adenoviral based systems can be used. Adenoviral based vectors are capable of very
38

WO 2006/033859 PCT/US2005/032157
high transduction efficiency in many cell types and do not require cell division. With
such vectors, high titer and high levels of expression have been obtained. This vector
can be produced in large quantities in a relatively simple system. Adeno-associated
virus ("AAV") vectors are also used to transduce cells with target nucleic acids, e.g.,
in the in vitro production of nucleic acids and peptides, and for in vivo and ex vivo
gene therapy procedures (see, e.g., West et al., Virology 160:38-47 (1987); U.S.
Patent No. 4,797,368; WO 93/24641; Kotin, Human Gene Therapy 5:793-801 (1994);
Muzyczka, J. Clin. Invest. 94:1351 (1994). Construction of recombinant AAV
vectors are described in a number of publications, including U.S. Pat. No. 5,173,414;
Tratschin et al., Mol. Cell. Biol. 5:3251-3260 (1985); Tratschin, et al., Mol. Cell. Biol.
4:2072-2081 (1984); Hermonat & Muzyczka, PNAS 81:6466-6470 (1984); and
Samulski et al., J. Virol. 63:03822-3828 (1989).
At least six viral vector approaches are currently available for gene transfer in
clinical trials, which utilize approaches that involve complementation of defective
vectors by genes inserted into helper ceil lines to generate the transducing agent.
pLASN and MFG-S are examples of retroviral vectors that have been used in
clinical trials (Dunbar et al., Blood 85:3048-305 (1995); Kohn et al., Nat. Med.
1:1017-102 (1995); Malech et al., PNAS 94:22 12133-12138 (1997)). PA317/pLASN
was the first therapeutic vector used in a gene therapy trial. (Blaese et al., Science
270:475-480 (1995)). Transduction efficiencies of 50% or greater have been
observed for MFG-S packaged vectors. (Ellem et al., Immunol Immunother. 44(1):10-
20 (1997); Dranoff et al, Hum. Gene Ther. 1:111-2 (1997).
Recombinant adeno-associated virus vectors (rAAV) are a promising
alternative gene delivery systems based on the defective and nonpathogenic
parvovirus adeno-associated type 2 virus. All vectors are derived from a plasmid that
retains only the AAV 145 bp inverted terminal repeats flanking the transgene
expression cassette. Efficient gene transfer and stable transgene delivery due to
integration into the genomes of the transduced cell are key features for this vector
system. (Wagner et al., Lancet 351:9117 1702-3 (1998), Kearns et al, Gene Ther.
9:748-55 (1996)).
Replication-deficient recombinant adenoviral vectors (Ad) can be produced at
high titer and readily infect a number of different cell types. Most adenovirus vectors
are engineered such that a transgene replaces the Ad El a, Elb, and/or E3 genes;
subsequently the replication defective vector is propagated in human 293 cells that
39

WO 2006/033859 PCT/US2005/032157
supply deleted gene function in trans. Ad vectors can transduce multiple types of
tissues in vivo, including nondividing, differentiated cells such as those found in liver,
kidney and muscle. Conventional Ad vectors have a large carrying capacity. An
example of the use of an Ad vector in a clinical trial involved polynucleotide therapy
for antitumor immunization with intramuscular injection (Sterman et al., Hum. Gene
Ther. 7:1083-9 (1998)). Additional examples of the use of adenovirus vectors for
gene transfer in clinical trials include Rosenecker et al., Infection 24:1 5-10 (1996);
Sterman et al., Hum. Gene Ther. 9:7 1083-1089 (1998); Welsh et al., Hum. Gene
Ther. 2:205-18 (1995); Alvarez etal, Hum. Gene Ther. 5:597-613 (1997); Topf etal,
Gene Ther. 5:507-513 (1998); Sterman et al., Hum. Gene Ther. 7:1083-1089 (1998).
Packaging cells are used to form virus particles that are capable of infecting a
host cell. Such cells include 293 cells, which package adenovirus, and \r2 cells or
PA317 cells, which package retrovirus. Viral vectors used in gene therapy are usually
generated by a producer cell line that packages a nucleic acid vector into a viral
particle. The vectors typically contain the minimal virai sequences required for
packaging and subsequent integration into a host (if applicable), other viral sequences
being replaced by an expression cassette encoding the protein to be expressed. The
missing viral functions are supplied in trans by the packaging cell line. For example,
AAV vectors used in gene therapy typically only possess inverted terminal repeat
(ITR) sequences from the AAV genome which are required for packaging and
integration into the host genome. Viral DNA is packaged in a cell line, which
contains a helper plasmid encoding the other AAV genes, namely rep and cap, but
lacking ITR sequences. The cell line is also infected with adenovirus as a helper. The
helper virus promotes replication of the AAV vector and expression of AAV genes
from the helper plasmid. The helper plasmid is not packaged in significant amounts
due to a lack of ITR sequences. Contamination with adenovirus can be reduced by,
e.g., heat treatment to which adenovirus is more sensitive than AAV. Alternatively,
adenovirus helper functions can be provided on a plasmid.
In many gene therapy applications, it is desirable that the gene therapy vector
be delivered with a high degree of specificity to a particular tissue type. Accordingly,
a viral vector can be modified to have specificity for a given cell type by expressing a
ligand as a fusion protein with a viral coat protein on the outer surface of the virus.
The ligand is chosen to have affinity for a receptor known to be present on the cell
type of interest. For example, Han et al., Proc. Natl Acad. Sci USA 92:9747-9751
40

WO 2006/033859 PCT/US2005/032157
(1995), reported that Moloney murine leukemia virus can be modified to express
human heregulin fused to gp70, and the recombinant virus infects certain human
breast cancer cells expressing human epidermal growth factor receptor. This principle
can be extended to other virus-target cell pairs, in which the target cell expresses a
receptor and the virus expresses a fusion protein comprising a ligand for the cell-
surface receptor. For example, filamentous phage can be engineered to display
antibody fragments (e.g., Fab or Fv) having specific binding affinity for virtually any
chosen cellular receptor. Although the above description applies primarily to viral
vectors, the same principles can be applied to nonviral vectors. Such vectors can be
engineered to contain specific uptake sequences which favor uptake by specific target
cells.
Gene therapy vectors can be delivered in vivo by administration to an
individual patient, typically by systemic administration (e.g., intravenous,
intraperitoneal, intramuscular, subdermal, or intracranial infusion) or topical
application, as described below. Alternatively, vectors can be delivered to ceils ex
vivo, such as cells explanted from an individual patient (e.g., lymphocytes, bone
marrow aspirates, tissue biopsy) or universal donor hematopoietic stem cells,
followed by reimplantation of the cells into a patient, usually after selection for cells
which have incorporated the vector.
Ex vivo cell transfection for diagnostics, research, or for gene therapy (e.g., via
re-infusion of the transfected cells into the host organism) is well known to those of
skill in the art. In certain embodiments, cells are isolated from the subject organism,
transfected with a nucleic acid (gene or cDNA), and re-infused back into the subject
organism (e.g., patient). Various cell types suitable for ex vivo transfection are well
known to those of skill in the art (see, e.g., Freshney et al., Culture of Animal Cells, A
Manual of Basic Technique (3rd ed. 1994)) and the references cited therein for a
discussion of how to isolate and culture cells from patients).
In one embodiment, stem cells are used in ex vivo procedures for cell
transfection and gene therapy. The advantage to using stem cells is that they can be
differentiated into other cell types in vitro, or can be introduced into a mammal (such
as the donor of the cells) where they will engraft in the bone marrow. Methods for
differentiating CD34+ cells in vitro into clinically important immune cell types using
cytokines such a GM-CSF, IFN-y and TNF-a are known (see Inaba et al., J. Exp.
Med. 176:1693-1702(1992)).
41

WO 2006/033859 PCTAJS2005/032157
Stem cells are isolated for transduction and differentiation using known
methods. For example, stem cells are isolated from bone marrow cells by panning the
bone marrow cells with antibodies which bind unwanted cells, such as CD4+ and
CD8+ (T cells), CD45+ (panB cells), GR-1 (granulocytes), and lad (differentiated
antigen presenting cells) {see Inaba et al., J. Exp. Med. 176:1693-1702 (1992)).
Vectors (e.g., retro viruses, adenoviruses, liposomes, etc.) containing
therapeutic nucleic acids can also be administered directly to an organism for
transduction of cells in vivo. Alternatively, naked DNA can be administered.
Administration is by any of the routes normally used for introducing a molecule into
ultimate contact with blood or tissue cells including, but not limited to, injection,
infusion, topical application and electroporation. Suitable methods of administering
such nucleic acids are available and well known to those of skill in the art, and,
although more than one route can be used to administer a particular composition, a
particular route can often provide a more immediate and more effective reaction than
another route.
Pharmaceutically acceptable carriers are determined in part by the particular
composition being administered, as well as by the particular method used to
administer the composition. Accordingly, there is a wide variety of suitable
formulations of pharmaceutical compositions available, as described below {see, e.g.,
Remington's Pharmaceutical Sciences, 17th ed., 1989).
DNA constructs may be introduced into the genome of a desired plant host by
a variety of conventional techniques. For reviews of such techniques see, for
example, Weissbach & Weissbach Methods for Plant Molecular Biology (1988,
Academic Press, N.Y.) Section VIII, pp. 421-463; and Grierson & Corey, Plant
Molecular Biology (1988, 2d Ed.), Blackie, London, Ch. 7-9. For example, the DNA
construct may be introduced directly into the genomic DNA of the plant cell using
techniques such as electroporation and microinjection of plant cell protoplasts, or the
DNA constructs can be introduced directly to plant tissue using biolistic methods,
such as DNA particle bombardment (see, e.g., Klein et al (1987) Nature 327:70-73).
Alternatively, the DNA constructs may be combined with suitable T-DNA flanking
regions and introduced into a conventional Agrobacterium tumefaciens host vector.
Agrobacterium tumefaciens-mediated transformation techniques, including disarming
and use of binary vectors, are well described in the scientific literature. See, for
example Horsch et al (1984) Science 233:496-498, and Fraley et al (1983) Proc. Nat'l.
42

WO 2006/033859 PCT/US2005/032157
Acad. Sci. USA 80:4803. The virulence functions of the Agrobacterium tumefaciens
host will direct the insertion of the construct and adjacent marker into the plant cell
DNA when the cell is infected by the bacteria using binary T DNA vector (Bevan
(1984) Nuc. Acid Res. 12:8711-8721) or the co-cultivation procedure (Horsch et al
(1985) Science 227:1229-1231). Generally, the Agrobacterium transformation system
is used to engineer dicotyledonous plants (Bevan et al (1982) Ann. Rev. Genet
16:357-384; Rogers et al (1986) Methods Enzymol. 118:627-641). The
Agrobacterium transformation system may also be used to transform, as well as
transfer, DNA to monocotyledonous plants and plant cells. See Hernalsteen et al
(1984) EMBO J3:3039-3041; Hooykass-Van Slogteren et al (1984) Nature
311:763-764; Grimsley et al (1987) Nature 325:1677-179; Boulton et al (1989) Plant
Mol. Biol. 12:31-40.; and Gould et al (1991) Plant Physiol. 95:426-434.
Alternative gene transfer and transformation methods include, but are not
limited to, protoplast transformation through calcium-, polyethylene glycol (PEG)- or
electroporation-mediated uptake of naked DNA (see Paszkowski et al. (1984) EMBO
J3:2717-2722, Potrykus et al. (1985) Molec. Gen. Genet 199:169-177; Fromm et al.
(1985) Proc. Nat. Acad. Sci. USA 82:5824-5828; and Shimamoto (1989) Nature
338:274-276) and electroporation of plant tissues (D'Halluin et al. (1992) Plant Cell
4:1495-1505). Additional methods for plant cell transformation include
microinjection, silicon carbide mediated DNA uptake (Kaeppler et al. (1990) Plant
Cell Reporter 9:415-418), and microprojectile bombardment (see Klein et al. (1988)
Proc. Nat. Acad. Sci. USA 85:4305-4309; and Gordon-Kamm et al. (1990) Plant Cell
2:603-618).
Transformed plant cells which are produced by any of the above
transformation techniques can be cultured to regenerate a whole plant which
possesses the transformed genotype and thus the desired phenotype. Such
regeneration techniques rely on manipulation of certain phytohormones in a tissue
culture growth medium, typically relying on a biocide and/or herbicide marker which
has been introduced together with the desired nucleotide sequences. Plant
regeneration from cultured protoplasts is described in Evans, et al., "Protoplasts
Isolation and Culture" in Handbook of Plant Cell Culture, pp. 124-176, Macmillian
Publishing Company, New York, 1983; and Binding, Regeneration of Plants, Plant
Protoplasts, pp. 21-73, CRC Press, Boca Raton, 1985. Regeneration can also be
obtained from plant callus, explants, organs, pollens, embryos or parts thereof. Such
43

WO 2006/033859 PCT/US2005/032157
regeneration techniques are described generally in Klee et al (1987) Ann. Rev. of
Plant Phys. 38:467-486.
Nucleic acids introduced into a plant cell can be used to confer desired traits
on essentially any plant. A wide variety of plants and plant cell systems may be
engineered for the desired physiological and agronomic characteristics described
herein using the nucleic acid constructs of the present disclosure and the various
transformation methods mentioned above. In certain embodiments, target plants and
plant cells for engineering include, but are not limited to, those monocotyledonous
and dicotyledonous plants, such as crops including grain crops (e.g., wheat, maize,
rice, millet, barley), fruit crops (e.g., tomato, apple, pear, strawberry, orange), forage
crops (e.g., alfalfa), root vegetable crops (e.g., carrot, potato, sugar beets, yam), leafy
vegetable crops (e.g., lettuce, spinach); flowering plants (e.g., petunia, rose,
chrysanthemum), conifers and pine trees (e.g., pine fir, spruce); plants used in
phytoremediation (e.g., heavy metal accumulating plants); oil crops (e.g., sunflower,
rape seed) and plants used for experimental purposes (e.g., Arabidopsis). Thus, the
disclosed methods and compositions have use over a broad range of plants, including,
but not limited to, species from the genera Asparagus, Avena, Brassica, Citrus,
Citrullus, Capsicum, Cucurbita, Daucus, Glycine, Hordeum, Lactuca, Lycopersicon,
Malus, Manihot, Nicotiana, Oryza, Persea, Pisum, Pyrus, Prunus, Raphanus, Secale,
Solanum, Sorghum, Triticum, Vitis, Vigna, and Zea.
One of skill in the art will recognize that after the expression cassette is stably
incorporated in transgenic plants and confirmed to be operable, it can be introduced
into other plants by sexual crossing. Any of a number of standard breeding
techniques can be used, depending upon the species to be crossed.
A transformed plant cell, callus, tissue or plant may be identified and isolated
by selecting or screening the engineered plant material for traits encoded by the
marker genes present on the transforming DNA. For instance, selection may be
performed by growing the engineered plant material on media containing an
inhibitory amount of the antibiotic or herbicide to which the transforming gene
construct confers resistance. Further, transformed plants and plant cells may also be
identified by screening for the activities of any visible marker genes (e.g., the
P-glucuronidase, luciferase, B or C1 genes) that may be present on the recombinant
nucleic acid constructs. Such selection and screening methodologies are well known
to those skilled in the art.
44

WO 2006/033859 PCT/US2005/032157
Physical and biochemical methods also may be used to identify plant or plant
cell transformants containing inserted gene constructs. These methods include but are
not limited to: 1) Southern analysis or PCR amplification for detecting and
determining the structure of the recombinant DNA insert; 2) Northern blot, S1 RNase
protection, primer-extension or reverse transcriptase-PCR amplification for detecting
and examining RNA transcripts of the gene constructs; 3) enzymatic assays for
detecting enzyme or ribozyme activity, where such gene products are encoded by the
gene construct; 4) protein gel electrophoresis, Western blot techniques,
immunoprecipitation, or enzyme-linked immunoassays, where the gene construct
products are proteins. Additional techniques, such as in situ hybridization, enzyme
staining, and irnmunostaining, also may be used to detect the presence or expression
of the recombinant construct in specific plant organs and tissues. The methods for
doing all these assays are well known to those skilled in the art.
Effects of gene manipulation using the methods disclosed herein can be
observed by, for example, northern blots of the RNA (e.g., mRNA) isolated from the
tissues of interest. Typically, if the amount of mRNA has increased, it can be
assumed that the corresponding endogenous gene is being expressed at a greater rate
than before. Other methods of measuring gene activity can be used. Different types
of enzymatic assays can be used, depending on the substrate used and the method of
detecting the increase or decrease of a reaction product or by-product. In addition, the
levels of protein expressed can be measured immunochemically, i.e., ELISA, RIA,
EIA and other antibody based assays well known to those of skill in the art, such as by
electrophoretic detection assays (either with staining or western blotting). The
transgene may be selectively expressed in some tissues of the plant or at some
developmental stages, or the transgene may be expressed in substantially all plant
tissues, substantially along its entire life cycle. However, any combinatorial
expression mode is also applicable.
The present disclosure also encompasses seeds of the transgenic plants
described above wherein the seed has the transgene or gene construct. The present
disclosure further encompasses the progeny, clones, cell lines or cells of the
transgenic plants described above wherein said progeny, clone, cell line or cell has the
transgene or gene construct.
45

WO 2006/033859 PCTAJS2005/032157
Delivery vehicles
An important factor in the administration of polypeptide compounds, such as
fusion proteins, is ensuring that the polypeptide has the ability to traverse the plasma
membrane of a cell, or the membrane of an intra-cellular compartment such as the
nucleus. Cellular membranes are composed of lipid-protein bilayers that are freely
permeable to small, nonionic lipophilic compounds and are inherently impermeable to
polar compounds, macromolecules, and therapeutic or diagnostic agents. However,
proteins and other compounds such as liposomes have been described, which have the
ability to translocate polypeptides across a cell membrane.
For example, "membrane translocation polypeptides" have amphiphilic or
hydrophobic amino acid subsequences that have the ability to act as membrane-
translocating carriers. In one embodiment, homeodomain proteins have the ability to
translocate across cell membranes. The shortest internalizable peptide of a
homeodomain protein, Antennapedia, was found to be the third helix of the protein,
from amino acid position 43 to 58 (see, e.g., Prochiantz, Current Opinion in
Neurobiology 6:629-634 (1996)). Another subsequence, the h (hydrophobic) domain
of signal peptides, was found to have similar cell membrane translocation
characteristics (see, e.g., Lin et al., J. Biol Chem. 270:1 4255-14258 (1995)).
Examples of peptide sequences which can be linked to a protein, for
facilitating uptake of the protein into cells, include, but are not limited to: an 11 amino
acid peptide of the tat protein of HTV; a 20 residue peptide sequence which
corresponds to amino acids 84-103 of the pl6 protein (see Fahraeus et al., Current
Biology 6:84 (1996)); the third helix of the 60-amino acid long homeodomain of
Antennapedia (Derossi et al., J. Biol. Chem. 269:10444 (1994)); the h region of a
signal peptide such as the Kaposi fibroblast growth factor (K-FGF) h region (Lin et}
al., supra); or the VP22 translocation domain from HSV (Elliot & O'Hare, Cell
88:223-233 (1997)). Other suitable chemical moieties that provide enhanced cellular
uptake may also be chemically linked to ZFPs. Membrane translocation domains
(i.e., internalization domains) can also be selected from libraries of randomized
peptide sequences. See, for example, Yeh et al. (2003) Molecular Therapy 7(5):S461,
Abstract #1191.
Toxin molecules also have the ability to transport polypeptides across cell
membranes. Often, such molecules (called "binary toxins") are composed of at least
two parts: a translocation/binding domain or polypeptide and a separate toxin domain
46

WO 2006/033859 PCT/US2005/032157
or polypeptide. Typically, the translocation domain or polypeptide binds to a cellular
receptor, and then the toxin is transported into the cell. Several bacterial toxins,
including Clostridium perfringens iota toxin, diphtheria toxin (DT), Pseudomonas
exotoxin A (PE), pertussis toxin (PT), Bacillus anihracis toxin, and pertussis
adenylate cyclase (CYA), have been used to deliver peptides to the cell cytosol as
internal or amino-terminal fusions (Arora et al., J. Biol Chem., 268:3334-3341
(1993); Perelle et al., Infect. Immun., 61:5147-5156 (1993); Stenmark et al., J. Cell
Biol. 113:1025-1032 (1991); Donnelly et al., PNAS90:3530-3534 (1993); Carbonetti
et al., Abstr. Annu. Meet. Am. Soc. Microbiol. 95:295 (1995); Sebo et al., Infect.
Immun. 63:3851-3857 (1995); Klimpel et al., PNAS U.S.A. 89:10277-10281 (1992);
and Novak et al, J. Biol Chem. 267:17186-17193 1992)).
Such peptide sequences can be used to translocate ZFPs and other types of
fusion proteins across a cell membrane. Polypeptide sequences can be conveniently
fused to or derivatized with such translocation sequences. Typically, the translocation
sequence is provided as part of a fusion protein. Optionally, a linker can be used to
link the translocation sequence to the remainder of the fusion protein. Any suitable
linker can be used, e.g., a peptide linker. See supra.
A fusion protein can also be introduced into an animal cell, preferably a
mammalian cell, via a liposomes and liposome derivatives such as immunoliposomes.
The term "liposome" refers to vesicles comprised of one or more concentrically
ordered lipid bilayers, which encapsulate an aqueous phase. The aqueous phase
typically contains the compound to be delivered to the cell, e.g. a fusion protein
comprising a degron domain and a ZFP DNA-binding domain.
The liposome fuses with the plasma membrane, thereby releasing the protein
into the cytosol. Alternatively, the liposome is phagocytosed or taken up by the cell
in a transport vesicle. Once in the endosome or phagosome, the liposome either
degrades or fuses with the membrane of the transport vesicle and releases its contents.
In current methods of drug delivery via liposomes, the liposome ultimately
becomes permeable and releases the encapsulated compound (in this case, a fusion
protein) at the target tissue or cell. For systemic or tissue specific delivery, this can be
accomplished, for example, in a passive manner wherein the liposome bilayer
degrades over time through the action of various agents in the body. Alternatively,
active drug release involves using an agent to induce a permeability change in the
liposome vesicle. Liposome membranes can be constructed so that they become
47

WO 2006/033859 PCT/US2005/032157
destabilized when the environment becomes acidic near the liposome membrane (see,
e.g., PNAS 84:7851 (1987); Biochemistry 28:908 (1989)). When liposomes are
endocytosed by a target cell, for example, they become destabilized and release their
contents. This destabilization is termed fusogenesis.
Dioleoylphosphatidylethanolamine (DOPE) is the basis of many "fusogenic" systems.
Such liposomes typically comprise a protein and a lipid component, e.g., a
neutral and/or cationic lipid, optionally including a receptor-recognition molecule
such as an antibody that binds to a predetermined cell surface receptor or ligand (e.g.,
an antigen). A variety of methods are available for preparing liposomes as described
in, e.g., Szoka et al., Ann. Rev. Biophys. Bioeng. 9:467 (1980), U.S. Pat. Nos.
4,186,183, 4,217,344,4,235,871,4,261,975,4,485,054,4,501,728,4,774,085,
4,837,028, 4,235,871, 4,261,975, 4,485,054,4,501,728, 4,774,085,4,837,028,
4,946,787, PCT Publication No. WO 91\17424, Deamer & Bangham, Biochim.
Biophys. Acta 443:629-634 (1976); Fraley, et al., PNAS 76:3348-3352 (1979); Hope
et al., Biochim. Biophys. Acta 812:55-65 (1985); Mayer et al., Biochim. Biophys. Acta
858:161-168 (1986); Williams et al., PNAS 85:242-246 (1988); Liposomes (Ostro
(ed.), 1983, Chapter 1); Hope et al., Chem. Phys. Lip. 40:89 (1986); Gregoriadis,
Liposome Technology (1984) and Lasic, Liposomes: from Physics to Applications
(1993)). Suitable methods include, for example, sonication, extrusion, high
pressure/homogenization, microfluidization, detergent dialysis, calcium-induced
fusion of small liposome vesicles and ether-fusion methods, all of which are known to
those of skill in the art.
In certain, embodiments, it is desirable to target liposomes using targeting
moieties that are specific to a particular cell type, tissue, and the like. Targeting of
liposomes using a variety of targeting moieties (e.g., ligands, receptors, and
monoclonal antibodies) has been described. See, e.g., U.S. Patent Nos. 4,957,773 and
4,603,044.
Examples of targeting moieties include monoclonal antibodies specific to
antigens associated with neoplasms, such as prostate cancer specific antigen and
MAGE. Tumors can also be targeted by detecting gene products resulting from the
activation or over-expression of oncogenes, such as ras or c-erbB2. In addition, many
tumors express antigens normally expressed by fetal tissue, such as the
alphafetoprotein (AFP) and carcinoembryonic antigen (CEA). Sites of viral infection
can be targeted using various viral antigens such as hepatitis B core and surface
48

WO 2006/033859 PCT/US2005/032157
antigens (HBVc, HBVs) hepatitis C antigens, Epstein-Barr virus antigens, human
immunodeficiency type-1 virus (HIV1) and papilloma virus antigens. Inflammation
can be detected using molecules specifically recognized by surface molecules which
are expressed at sites of inflammation such as integrins (e.g., VCAM-1), selectin
receptors (e.g., ELAM-1) and the like.
Standard methods for coupling targeting agents to liposomes can be used.
These methods generally involve incorporation into liposomes of lipid components,
e.g., phosphatidylethanolamine, which can be activated for attachment of targeting
agents, or derivatized lipophilic compounds, such as lipid derivatized bleomycin.
Antibody targeted liposomes can be constructed using, for instance, liposomes which
incorporate protein A (see Renneisen et al., J. Biol. Chem., 265:16337-16342 (1990)
and Leonetti et al., PNAS 87:2448-2451 (1990).
Dosages
For therapeutic applications, the dose administered to a patient, in the context
of the present disclosure, should be sufficient to effect a beneficial therapeutic
response in the patient over time. In addition, particular dosage regimens can be
useful for determining phenotypic changes in an experimental setting, e.g., in
functional genomics studies, and in cell or animal models. The dose will be
determined by the efficacy and Kd of the particular ZFP employed, the nuclear
volume of the target cell, and the condition of the patient, as well as the body weight
or surface area of the patient to be treated. The size of the dose also will be
determined by the existence, nature, and extent of any adverse side-effects that
accompany the administration of a particular compound or vector in a particular
patient.
As an example, the maximum therapeutically effective dosage of ZFP for
approximately 99% binding to target sites is calculated to be in the range of less than
about 1.5x105 to 1.5x106 copies of the specific ZFP molecule per cell. The number of
ZFPs per cell for this level of binding is calculated as follows, using the volume of a
HeLa cell nucleus (approximately 1000 um3 or 10-12 L; Cell Biology, (Altaian &
Katz, eds. (1976)). As the HeLa nucleus is relatively large, this dosage number is
recalculated as needed using the volume of the target cell nucleus. This calculation
also does not take into account competition for ZFP binding by other sites. This
49

WO 2006/033859 PCT/US2005/032157
calculation also assumes that essentially all of the ZFP is localized to the nucleus. A
value of l00x Kd is used to calculate approximately 99% binding of to the target site,
and a value of 10x Kd is used to calculate approximately 90% binding of to the target
site. For this example, Kd = 25 nM
ZFP + target site complex
i.e., DNA + protein DNA:protein complex
Kd= [DNA] [protein]
[DNA:protein complex]
When 50% of ZFP is bound, Kd = [protein]
So when [protein] = 25 nM and the nucleus volume is 10-12 L
[protein] = (25x10-9 moles/L) (10-12 L/nucleus) (6x1023
molecules/mole)
= 15,000 molecules/nucleus for 50% binding
When 99% target is bound; 100x Kd = [protein]
100x Kd = [protein] = 2.5 uM
(2.5x10-6moles/L) (10-12L/nucleus) (6x1023 molecules/mole)
= about 1,500,000 molecules per nucleus for 99% binding of
target site.
The appropriate dose of an expression vector encoding a ZFP fusion protein
can also be calculated by taking into account the average rate of ZFP expression from
the promoter and the average rate of ZFP degradation in the cell. In certain
embodiments, a weak promoter such as a wild-type or mutant HSV TK promoter is
used, as described above. The dose of fusion protein in micrograms is calculated by
taking into account the molecular weight of the particular protein being employed.
In determining the effective amount of the protein to be administered in the
treatment or prophylaxis of disease, the physician evaluates circulating plasma levels
of the protein or nucleic acid encoding the protein, potential toxicities due to the
protein, progression of the disease, and the production of antibodies to the protein.
Administration can be accomplished via single or divided doses.
Pharmaceutical compositions and administration
Fusion proteins and expression vectors encoding such proteins can be
administered directly to the patient for targeted gene regulation, targeted DNA
50

WO 2006/033859 PCT/US2005/032157
cleavage and/or recombination, and for therapeutic or prophylactic applications, for
example, cancer, ischemia, diabetic retinopathy, macular degeneration, rheumatoid
arthritis, psoriasis, HIV infection, sickle cell anemia, Alzheimer's disease, muscular
dystrophy, neurodegenerative diseases, vascular disease, cystic fibrosis, stroke, and
the like. Examples of microorganisms that can be inhibited by ZFP gene therapy
include pathogenic bacteria, e.g., chlamydia, rickettsial bacteria, mycobacteria,
staphylococci, streptococci, pneumococci, meningococci and conococci, klebsiella,
proteus, serratia, pseudomonas, legionella, diphtheria, salmonella, bacilli, cholera,
tetanus, botulism, anthrax, plague, leptospirosis, and Lyme disease bacteria; infectious
fungus, e.g., Aspergillus, Candida species; protozoa such as sporozoa (e.g.,
Plasmodia), rhizopods (e.g., Entamoeba) and flagellates (Trypanosoma, Leishmania,
Trichomonas, Giardia, etc.);viral diseases, e.g., hepatitis (A, B, or C), herpes virus
(e.g., VZV, HSV-1, HSV-6, HSV-H, CMV, and EBV), HIV, Ebola, adenovirus,
influenza virus, flaviviruses, echovirus, rhinovirus, coxsackie viras, coronavirus,
respiratory syncytial virus, mumps virus, rotavirus, measles virus, rubella virus,
parvovirus, vaccinia virus, HTLV virus, dengue virus, papillomavirus, poliovirus,
rabies virus, and arboviral encephalitis viras, etc.
Administration of therapeutically effective amounts is by any of the routes
normally used for introducing a protein or nucleic acid into ultimate contact with the
tissue to be treated. The fusion proteins or encoding nucleic acids are administered in
any suitable manner, preferably with pharmaceutically acceptable carriers. Suitable
methods of administering such modulators are available and well known to those of
skill in the art, and, although more than one route can be used to administer a
particular composition, a particular route can often provide a more immediate and
more effective reaction than another route.
Pharmaceutically acceptable carriers are determined in part by the particular
composition being administered, as well as by the particular method used to
administer the composition. Accordingly, there is a wide variety of suitable
formulations of pharmaceutical compositions that are available (see, e.g., Remington's
Pharmaceutical Sciences, 17th ed. 1985)).
Fusion proteins or their encoding nucleic acids, alone or in combination with
other suitable components, can be made into aerosol formulations (i.e., they can be
"nebulized") to be administered via inhalation. Aerosol formulations can be placed
51

WO 2006/033859 PCT/US2005/032157
into pressurized acceptable propellants, such as dichlorodifluoromethane, propane,
nitrogen, and the like.
Formulations suitable for parenteral administration, such as, for example, by
intravenous, intramuscular, intradermal, and subcutaneous routes, include aqueous
and non-aqueous, isotonic sterile injection solutions, which can contain antioxidants,
buffers, bacteriostats, and solutes that render the formulation isotonic with the blood
of the intended recipient, and aqueous and non-aqueous sterile suspensions that can
include suspending agents, solubilizers, thickening agents, stabilizers, and
preservatives. The disclosed compositions can be administered, for example, by
intravenous infusion, orally, topically, intraperitoneally, intravesically or intrathecally.
The formulations of compounds can be presented in unit-dose or multi-dose sealed
containers, such as ampules and vials. Injection solutions and suspensions can be
prepared from sterile powders, granules, and tablets of the kind previously described.
Applications
The disclosed proteins can also be used in vitro. For example, the engineered
ZFP (or a transcription/translation lysate) can be incubated with a polynucleotide
(e.g., plasmid) e.g., in a coupled transcription/translation system for protein
production in vitro.
In additional embodiments, the target site for an engineered ZFP can be placed
on either side of a coding sequence, either endogenous or exogenous. In this
situation, binding of an engineered ZFP (optionally fused to, e.g., an insulator
domain; see e.g., WO 01/02553; WO 02/44376) to the target sites can protect the
coding sequence against the influence of e.g., position effects and
heterochromatinization.
The disclosed methods can be used in concert with existing methods (e.g.,
methotrexate-based selection for amplification of sequences adjacent to a
dihydrofolate reductase gene) to obtain increased levels of protein expression based
on both increased template number and higher levels of transcription.
In additional embodiments, a target sequence for an engineered ZFP can be
introduced adjacent to or overlapping a binding site for another transcriptional
regulatory molecule such that regulation can be further modulated by the engineered
ZFP. Engineered ZFPS fused to either transcriptional activation domains or
52

WO 2006/033859 PCT/US2005/032157
transcriptional repression domain can be used, as can ZFPs not fused to a functional
domain, which can sterically modulate binding of other regulatory molecules.
Inactivation of endogenous genes to facilitate protein production
In certain embodiments, the function of one or more cellular gene products is
inactivated to enhance the level and/or quality of proteins produced using the methods
and compositions disclosed herein. Gene function can be inactivated by, e.g.,
disruption of one or more alleles of the endogenous cellular gene that encodes the
gene product (e.g., by gene "knock-out") and/or by repressing transcription of an
endogenous cellular gene as described, for example, in co-owned U.S. Patent No.
6,534,261.
Disruption of an endogenous cellular gene can be accomplished by
mutagenesis (e.g., chemical or radiation-induced) or by insertion of exogenous DNA
sequences into the cellular genome, optionally followed by selection for the desired
mutant cells. Insertion of exogenous sequences can be random (e.g., following
retroviral infection or contact of a cell with a DNA molecule) or targeted (e.g., U.S.
Patent No. 5,614,396). Targeted insertion of exogenous sequences can also be
accomplished by targeted cleavage of genomic DNA in combination with introduction
of exogenous DNA. Co-owned PCT WO 2005/014791 and United States Provisional
Patent Application 60/702,394 (filed July 26,2005), the disclosures of which are
incorporated by reference, provide methods and compositions for homology-
dependent and homology independent methods, respectively, for targeted insertion of
exogenous sequences. In addition, PCT WO 2005/014791 also discloses methods for
targeted mutagenesis by targeted DNA cleavage followed by non-homologous end-
joining.
By way of example, one or more genes involved in the process of apoptosis
(or the function thereof) can be inactivated. Exemplary genes involved in apoptosis
are shown in Table 1.
53

WO 2006/033859 PCT/US2005/032157
Table 1
caspase1
caspase2
caspase3
caspase4
caspase5
caspase6
caspase7
caspase8
caspase9
caspase10
caspase11
caspase12
caspase13
caspase14
bax
bak
bik
APAF-1 (apoptosome associated factor 1)
c-jun
lactate dehydrogenase
cytochrome c
Genes whose products negatively regulate cell cycle progression can be
inactivated. Exemplary genes involved in regulation of the cell cycle are shown in
Table 2.
Table 2
p16
p19
Rb (retinoblastoma protein)
p53
p73
Telomerase
Because the bioactivity of certain proteins can be affected by their
glycosylation pattern, it may be desirable, in certain circumstances, to inactivate
certain genes whose products are involved in protein glycosylation. Exemplary genes
involved in glycosylation are shown in Table 3.
Table 3
a-1,6-fucosyltransferase
CMP-N-acetylneuraminic acid hydroxylase
CMP-sialic acid hydroxylase
Glucosamine-6-phosphate isomerase
54

WO 2006/033859 PCT/US2005/032157
Genes encoding receptors for infection (e.g., viral or bacterial receptors) can
be inactivated to enhance protein production. For example, the gene encoding the
minute virus of mice (MVM) receptor can be inactivated.
In certain cases, it may be desirable to integrate an exogenous DNA molecule
encoding a transgene and a selection marker (e.g., to select cells containing the
integrated transgene). If the selection marker is encoded by the genome of the cell
into which it is desired to integrate the exogenous DNA molecule, it can be useful to
inactivate the endogenous gene encoding that selection marker. Inactivation can be
partial (such that reduced levels of the marker are produced) or complete (such that no
marker is produced). Exemplary marker genes that can be so inactivated are shown in
Table 4.
Table 4
dihydrofolate reducatase
Glutamine synthetase
hypoxanthine phosphoribosyl transferase
Genes encoding proteases can be inactivated to enhance protein production.
Exemplary proteases are shown in Table 5.
Table 5
serine proteases
elastase
collagenase
plasminogen activator
For safety and compliance with regulatory requirements, genes encoding
prions (PRPs)and/or pseudo-prions can be inactivated. Pseudo-prions include, for
example, PRNP/PRP, Prion-like protein Doppel (PRND) and Shadow of Prion
(SPRN) See, for example, Liao et al. (1986) Science 233:364-367 (PRNP); Lu et al.
(2000) Biochemistry 39:13575-13583 (PRND) and Premzl et al. (2003) Gene 314:89-
102 (SPRN).
In cells which are being used for production of antibody molecules, it can be
useful to inactivate genes encoding proteins which are bound by the antibody or
antibodies being produced. For example, genes encoding the antigen(s) recognized
by the antibody or antibodies can be inactivated by any of the methods disclosed
herein.
55

WO 2006/033859 PCT/US2005/032157
Additional cellular processes which can affect protein yield and quality
include, for example, replication, transcription, RNA processing, translation, amino
acid biosynthesis, cellular metabolism, protein folding, protein degradation, protein
transport, stress responses and plasmid copy number control. It is to be understood
that either positive or negative regulation of a gene (including inactivation or gene
"knock-out") involved in any of the aforementioned processes can be used to enhance
the yield and/or quality of a protein expressed using the methods and compositions
disclosed herein.
EXAMPLES
The following examples are presented as illustrative of, but not limiting, the
claimed subject matter.
Example 1: Construction of a SRα promoter
The S V40 early promoter was generated by PCR from the plasmid pRL-SV40
(Promega) and included base pairs 80-449 from the pRL-SV40 sequence and
incorporated the HindIII site of the promoter at the 3'end. Bases 80-88 comprise the
9bp target site for the 2392/00 and 2932/10 ZFPs. A Bpul0I restriction site was
added to the PCR primer immediately upstream of the ZFP binding site to facilitate
subsequent cloning into the backbone vector, pcDNA3.1/zeo, from which the original
CMV promoter had been removed.
The R-U5 composite sequence was generated by PCR from the plasmid
pDrive01-NSE(r)-RU5 v02 (Invivogen) and included base pairs 1774-2066 from the
plasmid sequence, which encompass the HindIII restriction site within the 5' primer
binding site. An EcoRI restriction site was added to the 3' PCR primer to facilitate
cloning into the backbone vector.
The SV40 and RU5 sequences were linked at their respective HindHI sites to
form the SV40-RU5 fusion. The sequence of this hybrid promoter (SEQ ID NO:9) is
shown in Figure 1, with the ZFP target site underlined.
Example 2: Engineering zinc finger proteins to bind to a target site in the SRα promoter
To increase expression of proteins whose coding regions are under the control
of the SRα promoter, a three-finger zinc finger protein, SBS2392/00, was synthesized
56

WO 2006/033859 PCT/US2005/032157
(according to methods disclosed in co-owned U.S. Patents 6,453,242 and 6,534,261)
to bind a nine-nucleotide target sequence in the SRα promoter. The target site was the
sequence GCTGTGGAA (SEQ ID NO:1), located as shown in Figure 1. The
sequence of the protein is as follows, with the recognition regions of the zinc fingers
underlined:
KKKQHICHIQGCGKVYGORSNLVRHLRWHTGERPFMCTWSYCGKRFTRSDA
LSRHKRTHTGEKKFACPECPKRPMQSSDLRHIKTHQNK (SEQ ID NO:2).
Additional zinc finger domains capable of binding to SEQ ID NO.1 were
obtained using a two-hybrid selection system. See, e.g., U.S. Patent Application
Publication No. 2003/0044787 (Mar. 6,2003) and Joung et al. (2000) Proc. Nail.
Acad. Sci. USA 97:7382-7287. Their amino acid sequences are shown in Figure 6.
Nucleotide sequences encoding these zinc finger proteins were fused to
nucleotide sequences encoding a VP16 activation domain, a nuclear localization
signal (NLS) and a FLAG epitope tag to generate engineered transcription factors.
Exemplary poiynucleotide sequences are shown in Figures 3 and 5, with then-
encoded amino acid sequences shown in Figures 2 and 4, respectively.
Example 3: Enhancement of SRα promoter-driven expression of
immunoglobulin genes by an engineered zinc finger protein
A DG44 CHO cell line containing an integrated antibody expression construct
driven by the SRα promoter was transiently transfected with 20-250 ng of plasmid
encoding the VP16 activation domain (NVF), or ZFP 2392/00 (SEQ ID NO:2)linked
to VP16 (2392-VP16). Immunoglobulin kappa chain mRNA expression was
measured by real-time PCR (Taqman®). The results, shown in Figure 7, indicate that
SRα-driven transcription of kappa chain mRNA is increased 4-5-fold by the 2392/00
transcription factor.
In an experiment to measure the effect of the 2392/00-VP16 fusion on protein
levels, two different DG44 CHO cell-derived cell lines (Line A and Line B), that
stably express gamma heavy chains and kappa light chains, the expression of both of
which is driven by the SRα promoter, were transiently transfected with plasmid
encoding the VP16 activation domain (NVF) or ZFP 2392/00 linked to the VP16
activation domain (2392). Expression of secreted immunoglobulin G was measured
57

WO 2006/033859 PCT/US2005/032157
by ELISA. The results, shown in Figure 8, indicate that, in both cell lines, expression
of IgG was increased by the 2392/00 ZFP-VP16 fusion.
To determine whether the 2392/00-VP16 fusion protein can increase
expression in an optimized expression system, a cell line containing amplified gamma
heavy chain and kappa light chain cDNAs, obtained by methotrexate selection, was
used. Sequences encoding the 2392/00-VP16 fusion ZFP were introduced into cells
of this "high producer" line (ZFP) and the level of mRNA expression was compared
with the same cells in the absence of ZFP (NT). Figure 9 shows that optimized levels
of both gamma chain and kappa chain mRNAs can be increased a further three-fold
when the 2392/00-VP16 protein is introduced into these cells.
A cell line containing integrated cDNAs encoding the gamma heavy chain and
kappa light chain components of immunoglobulin G, both under the transcriptional
control of the SRα promoter, was transfected with a plasmid encoding the 2392/00-
VP16 fusion protein. Transfected cells were selected for stable ZFP expression.
Secreted IgG was measured by ELISA from a clonal iine stably expressing the
2392/00-VP16 fusion protein. The results, shown in Figure 10, reveal an
approximately 4-fold increase in secreted IgG in cells stably transfected with the
2392/00-VP16 protein, compared to control untransfected cells.
Example 4: Selection and properties of a stable cell line containing
integrated sequences encoding a 2392/10-VP16 fusion protein
Chinese hamster ovary (CHO) DG44 cells growing adherent on 6 well plates
were transfected with a plasmid encoding a 2392/10-VP16 fusion protein, as follows.
Cells were incubated with 1 ug DNA and 4ul Lipofectamine 2000 in lml serum-free
growth medium for 4hrs, then medium was aspirated and 2ml of regular growth
medium added. After 3 days, cells were split in various dilutions into 25 cm dishes
and subjected to Zeocin selection. When selection was complete, individual colonies
were harvested from the appropriate dilution plate, transferred into 24 well dishes and
grown up. Criteria for completion of selection were that individual cell colonies
appear and no new cell death occurs, while complete cell death is observed in non-
transfected DG44 cells that had been placed under selection at the same time. 22
clonal lines were analyzed for ZFP expression, cell growth properties and activation
of reporter constructs containing mulitmerized ZFP binding sites.
58

WO 2006/033859 PCT/US2005/032157
One of these lines was subcloned and genomic DNA was extracted from two
of the subclones. The DNA was digested completely with BamHI restriction
endonuclease, resolved on a 0.9% TAE agarose gel, transferred to Nytran+ membrane
and the membrane was probed with a 290bp radiolabelled DNA fragment encoding
the VP16 activation domain. A single band of approximately 12 kbp was detected in
the two ZFP-containing cell lines, suggesting a single copy of the DNA encoding the
fusion proteins is present. This band was not detected in DNA from wild type DG44
CHO cells.
Example 5: Construction of a SRα promoter containing multiple copies
of a target site for a ZFP-VP16 fusion protein
A promoter denoted "SRαZ6" was constructed by joining a DNA fragment
containing six copies of the target site for the 2392/00 and 2392/10 ZFPs (SEQ ID
NO:1) to an SRα promoter-containing DNA fragment. The Z6 promoter thus contains
7 copies of SEQ ID NO:1. These additional target sites confer a much greater level of
ZFP-mediated activation of this promoter. See infra.
The insert containing the additional six binding sites was inserted as a
BamHI/NdeI fragment after first inserting those restriction sites (shown in bold text
below) upstream of the SV40 portion of the SRα promoter. The seven iterations of
the target site are underlined. The target site downstream of the Ndel site is the site
originally present in the SV40 portion of the SRα promoter.
The sequence of that portion of the SRaZ6 promoter containing the ZFP target
sites is as follows:
BamHI
ggatccgagcbgbggaabgagagcbgbggaabgagagcbgbggaabgagagc
bgtggaatgagagctgtggaatgagagctgtggaatgacatatggcbgtggaatgtg
bgbcagbba NdeI
(SEQ ID NO:23)
Example 6: Construction of a CMV promoter containing multiple copies
of a target site for a ZFP-VP16 fusion protein
A promoter denoted "CMVz10" was constructed by inserting a DNA
fragment, comprising multiple copies of a target site (SEQ ID NO:1) for the 2392/00
59

WO 2006/033859 PCT/US2005/032157
and 2392/10 engineered ZFPs, into a MM restriction site immediately upstream of
the CMV promoter. The CMVz10 promoter contains 9 perfect iterations of the target
site plus a single 8/9 match in the tenth iteration (a C to T substitution). The sequence
of the CMVz10 promoter is shown below, with the MluI restriction sites shown in
bold and the target sites underlined. Note that, since this sequence was inserted in the
reverse orientation, the sequence shown reflects the complement of each binding site.
MluI
acgcgttcattccacagctctcattccacagctctcattccacagctctcat
tccacagctctcattccacagctctcattccacagctctcattccacagctctcatt
ccacagctctcattccacagctctcattccacagtcacgcgt
MluI
(SEQ ID NO:24)
Additional promoter constructs, which include different numbers of binding
sites and different orientations of the binding sites with respect to the promoter, have
also been constructed and tested.
Example 7: Enhanced protein production from a SRα promoter
containing multiple copies of a target site for a ZFP-VP16 fusion protein
A clonal isolate (2392/10-7) of the DG44 CHO cell line containing the stably
integrated 2392/10 ZFP (Example 4), and parent DG44 cells, were transfected with
two different antibody-expressing constructs. In one, heavy chain and the light chain
transcription units were each under the control of an SRα promoter (SRa); in the
other, both the heavy chain and light chain transcription units were under the control
of an SRα promoter containing an additional 8 target sites for the ZFP immediately
upstream of the promoter (SRa 2393BS). Three days after transfection, IgG secretion
was measured by ELISA.
The results are shown in Figure 11, and indicate that IgG expression level is
dependent upon the presence of the 2392/10 ZFP and that more copies of the 2392/10
target site result in higher IgG levels.
60

WO 2006/033859 PCT/US2005/032157
Example 8: Enhanced transcription from a CMV promoter containing
multiple copies of a target site for a ZFP-VP16 fusion protein
To test the effect of the 2392/10-VP16 fusion protein on the transcriptional
activity of the CMV promoter, a reporter construct, in which a CMV promoter was
operatively linked to sequences encoding green fluorescent protein (GFP) was
constructed. Variants of this reporter construct were then constructed, which
contained multiple copies of the target site for the 2392/10 ZFP upstream of the core
CMV promoter sequences. Promoters containing different numbers of target sites in
different orientations were tested by transfecting them into the 2392/10-7 cell line
(Example 4), followed by analysis of GFP mRNA levels by real-time PCR
(Taqman®). Representative results are shown in Figure 12. The results indicate that
more copies of the target site lead to higher steady-state mRNA levels, and that the
effect is independent of the orientation of the target sites with respect to the remainder
of the CMV promoter sequences. No effect of either number or orientation of target
sites is observed in a parental cell line that does not express the 2392/10 ZFP.
Example 9: Enhanced expression of erythropoietin, mediated by the
2392/10-VP16 fusion protein, from SRα and CMV promoters containing multiple
copies of the 2392/10 target site
Three different erythropoietin (Epo)-expressmg constructs were constructed.
In the first, Epo expression was controlled by a CMV promoter fused to a P-globin
intron (CMV). In the second, Epo expression was controlled by a CMV promoter
containing ten upstream copies of the 2392/10 binding site, fused to a P-globin intron
(CMVzlO, Example 6). In the third, Epo expression was controlled by a SRα
promoter with six additional copies of the 2392/10 binding site immediately upstream
of the promoter (SRαZ6, Example 5). The three constructs were transiently
transfected into either parental DG44 CHO cells or the ZFP-expressing 2392/10-7 cell
line. After 24 hours, Epo secretion was measured by ELISA.
The results, shown in Figure 13, confirm previous observations that, in the
absence of the 2392/10-VP16 ZFP, the CMV promoter is stronger than the SRα
promoter. However, activation of the SRαZ6 promoter by the 2393/10-VP16 protein
increases its activity such that it is comparable to that of the CMV promoter.
Moreover, insertion of 2392/10 target sites adjacent to the CMV promoter increases
61

WO 2006/033859 PCT/US2005/032157
its already strong activity an additional three-fold in 2392/10-VP16-expressing cells.
Thus, the activity of the strongest known naturally-occurring promoter has been
improved by the use of an engineered ZFP transcriptional activator.
All patents, patent applications and publications mentioned herein are hereby
incorporated by reference in their entirety.
Although disclosure has been provided in some detail by way of illustration
and example for the purposes of clarity of understanding, it will be apparent to those
skilled in the art that various changes and modifications can be practiced without
departing from the spirit or scope of the disclosure. Accordingly, the foregoing
descriptions and examples should not be construed as limiting.
62

We Claim :
1. An in vitro method for enhancing the level and/or quality of protein production from an
exogenous sequence in a cell the method comprising:
(a) providing a cell in which a first gene is inactivated; and
(b) expressing first and second fusion proteins in the cell, each of the first and second
fusion proteins comprising a zinc finger DNA-binding domain and a nuclease, wherein
the first and second fusion proteins bind to and cleave at a first site in a second gene,
wherein the first and second genes are inactivated;
(c) expressing an exogenous sequence encoding the protein in the cell in which
the first and second genes are inactivated;
thereby enhancing- the level and/or quality of protein production from the
exogenous sequence expressed in the cell.
2. The method as claimed in claim 1, wherein cleavage in the second gene is followed
by non-homologous end joining.
3. The method as claimed in claim 1. comprising expressing third and fourth
fusion proteins in the cell, each of the third and fourth fusion proteins comprising a zinc
finger protein and a nuclease, wherein the third and fourth fusion proteins bind to and
cleave at a second site in the second gene.
4. The method as claimed in claim 1. comprising contacting the cell with an
exogenous nucleic acid sequence;
thereby resulting in integration of the exogenous nucleic acid sequence into the
second gene.
5. The method as claimed in any of claims 1 to 4, wherein the first gene is box.

6. The method as claimed in any of claims 1 to 4, wherein the first gene is bac.
7. The method as claimed in claim 5, wherein the first gene is bax and the second gene is
bac.
8. The method as claimed in claim 6, wherein the first gene is bac and the second gene is
box.
9. The method as claimed in any of claims 1 to 4, wherein the first gene is glutamine
synthetase.
10. The method as claimed in any of claims 1 to 4, wherein the first gene is
dihydrofolate reductase.
11. The method as claimed in claim 9, wherein the first gene is giutamine synthetase
and the second gene is dihydrofolate reductase.
12. The method as claimed in claim 10, wherein the first gene is dihydrofolate
reductase and the second gene is glutamine synthetase.
] 3. The method as claimed in any of claims 1-12, wherein a third gene is inactivated in
the cell, the method comprising:
(a) providing a cell in which first and second genes are inactivated; and
(b) expressing fifth and sixth fusion proteins in the cell, each of the fifth and sixth
fusion proteins comprising a zinc finger DNA-binding domain and a nuclease, wherein
the fifth and sixth fusion proteins bind to and cleave at a first site in the third gene.
14. The method as claimed in claim 13, wherein cleavage in the third gene is followed
by non-homologous end joining.

15. The method as claimed in claim 13, comprising expressing seventh and
eighth fusion proteins in the cell, each of the seventh and eighth fusion proteins
comprising a zinc finger protein and a nuclease, wherein the seventh and eighth fusion
proteins bind to and cleave at a second site in the third gene.
16. The method as claimed in claim 13. comprising contacting the cell with an
exogenous nucleic acid sequence;
thereby resulting in integration of the exogenous nucleic acid sequence into the
third gene.
17. The method as claimed in any of claims 13-16, wherein the genes that are
inactivated are the bax, bac, and α-l,6-fucosyltransferase genes.

Disclosed herein are methods and compositions for enhanced protein production
and over expression from an exogenous sequence using engineered zinc finger proteins to
inactivate first and second genes in a cell including the exogenous sequence.

Documents:

01023-kolnp-2007-abstract.pdf

01023-kolnp-2007-claims.pdf

01023-kolnp-2007-correspondence others 1.1.pdf

01023-kolnp-2007-correspondence others.pdf

01023-kolnp-2007-description complete.pdf

01023-kolnp-2007-drawings.pdf

01023-kolnp-2007-form 1.pdf

01023-kolnp-2007-form 3.pdf

01023-kolnp-2007-form 5.pdf

01023-kolnp-2007-international publication.pdf

01023-kolnp-2007-international search report.pdf

01023-kolnp-2007-pct request form.pdf

01023-kolnp-2007-priority document 1.1.pdf

01023-kolnp-2007-sequence listing.pdf

1023-KOLNP-2007-(28-09-2011)-ABSTRACT.pdf

1023-KOLNP-2007-(28-09-2011)-AMANDED CLAIMS.pdf

1023-KOLNP-2007-(28-09-2011)-AMANDED PAGES OF SPECIFICATION.pdf

1023-KOLNP-2007-(28-09-2011)-CORRESPONDENCE.pdf

1023-KOLNP-2007-(28-09-2011)-DESCRIPTION (COMPLETE).pdf

1023-KOLNP-2007-(28-09-2011)-FORM 1.pdf

1023-KOLNP-2007-(28-09-2011)-FORM 13.pdf

1023-KOLNP-2007-(28-09-2011)-FORM 2.pdf

1023-KOLNP-2007-(28-09-2011)-OTHERS.pdf

1023-KOLNP-2007-ABSTRACT.pdf

1023-KOLNP-2007-AMANDED CLAIMS.pdf

1023-KOLNP-2007-ASSIGNMENT.pdf

1023-KOLNP-2007-CANCELLED PAGES.pdf

1023-KOLNP-2007-DESCRIPTION (COMPLETE).pdf

1023-KOLNP-2007-DRAWINGS.pdf

1023-KOLNP-2007-FORM 1-1.1.pdf

1023-KOLNP-2007-FORM 13.pdf

1023-kolnp-2007-form 18.pdf

1023-KOLNP-2007-FORM 2.pdf

1023-KOLNP-2007-FORM 3.pdf

1023-KOLNP-2007-FORM-27.pdf

1023-KOLNP-2007-PA.pdf

1023-KOLNP-2007-PETITION UNDER RULE 137-1.1.pdf

1023-KOLNP-2007-PETITION UNDER RULE 137.pdf

1023-KOLNP-2007-REPLY TO EXAMINATION REPORT.pdf

abstract-01023-kolnp-2007.jpg


Patent Number 250332
Indian Patent Application Number 1023/KOLNP/2007
PG Journal Number 52/2011
Publication Date 30-Dec-2011
Grant Date 26-Dec-2011
Date of Filing 22-Mar-2007
Name of Patentee SANGAMO BIOSCIENCES, INC.
Applicant Address POINT RICHMOND TECH CENTER, 501 CANAL BLVD., SUITE A100, RICHMOND, CA
Inventors:
# Inventor's Name Inventor's Address
1 LI XIAO-YONG 1014 MASONIC AVENUE, ALBANY, CA 94706
2 BARTSEVICH VICTOR 1009 KAINS AVENUE, APT. 5, ALBANY, CA 94706
3 COLLINGWOOD TREVOR 1111 CAMBRIDGE STREET, NOVATO, CA 94947
4 JAMIESON ANDREW 2528 SUTTER STREET, SAN FRANCISCO, CA 94115
PCT International Classification Number A61K 48/00
PCT International Application Number PCT/US2005/032157
PCT International Filing date 2005-09-08
PCT Conventions:
# PCT Application Number Date of Convention Priority Country
1 60/610,853 2004-09-16 U.S.A.
2 60/661,841 2005-03-15 U.S.A.