Title of Invention

A METHOD FOR REDUCING THE COMPLEXITE OF A SAMPLE THAT COMPRISES A MIXTURE OF PROTEINS

Abstract The present invention relates generally to methods for reducing the complexity of a sample. More specifically, the present invention relates to proteomics, the measurement of the protein levels in biological samples, and analysis of proteins in a sample using antibodies that recognize small epitopes.
Full Text A METHOD FOR REDUCING THE COMPLEXITY OF A SAMPLE THAT
COMPRISES A MIXTURE OF PROTEINS
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application claims the benefit of U.S. Provisional Application Nos. 60/496,154,
filed on August 18, 2003, and 60/511,720, filed on October 15, 2003, which are hereby
incorporated by reference in their entireties.
FIELD OF THE INVENTION
[0002] The present invention relates generally to methods for reducing the complexity of a
sample. More specifically, the present invention relates to proteomics, the measurement of the
protein levels in biological samples, and analysis of proteins in a sample using antibodies that
recognize small epitopes.
BACKGROUND OF THE INVENTION
[0003] Proteomics offers a more direct look at the biological functions of a cell or organism
than does genomics, the traditional focus for evaluation of gene activity. Proteomics involves the
qualitative and quantitative measurement of gene activity by detecting and quantitating
expression at the protein level, rather than at the messenger RNA level. Proteomics also involves
the study of non-genome encoded events including the post-translational modification of
proteins, protein degradation and protein byproducts, interactions between proteins, and the
location of proteins within the cell. The structure, function, or level of activity of the proteins
expressed by a cell are also of interest.
[0004] The study of gene expression at the protein level is important because many of the
most important cellular processes are regulated by the protein status of the cell, not by the status
of gene expression. Also, the protein content of a cell is highly relevant to drug discovery efforts
since most drugs are designed to be active against protein targets.

[0005] Current technologies for the analysis of protein mixtures, such as the intracellular
proteins of a cell or population of cells and the proteins secreted by the cell or population of cells
or biological fluids, are based on a variety of protein separation techniques followed by
identification and/or analysis of the separated proteins. The most popular method is based on
2D-gel electrophoresis followed by "in-gel" proteolytic digestion and mass spectroscopy.
Alternatively, Edman and related methods may be used for the sequencing. This 2D-gel
technique requires large sample sizes, is time consuming, and is currently limited in its ability to
reproducibly resolve a significant fraction of the proteins expressed by a human cell. Techniques
involving some large-format 2D-gels can produce gels which separate a larger number of
proteins than traditional 2D-gel techniques, but reproducibility is still poor and over 95% of the
spots cannot be sequenced due to limitations with respect to sensitivity of the available
sequencing techniques. The electrophoretic techniques are also plagued by a bias towards
proteins of high abundance.
[0006] Thus, there is a need for the ability to assay more completely proteins expressed by a
cell or a population of cells hi an organism or in a fluid comprising protein (such as serum,
plasma, lymph, and other biological fluids), including up to the total set of proteins expressed by
the cell or cells or found in the fluid comprising protein.
BRIEF SUMMARY OF THE INVENTION
[0007] In one aspect, the invention provides methods for reducing the complexity of a
sample, said methods comprising: (a) contacting a sample with one or more small epitope
antibody under conditions that permit binding; and (b) separating an antibody-protein complex,
whereby proteins comprising one or more epitope(s) bound by the one or more small epitope
antibody are isolated, separated, enriched and/or purified.
[0008] In another aspect, the invention provides methods comprising (a) contacting a sample
with one or more small epitope antibody under conditions that permit binding; (b) separating an
antibody-protein complex, whereby proteins comprising one or more epitope(s) bound by the
one or more small epitope antibody are isolated, separated, enriched and/or purified; and (c)
separating proteins from the antibody-protein complex.

[0009] In another aspect, the invention provides methods for reducing the complexity of a
sample, said methods comprising: separating a small epitope antibody-protein complex, whereby
proteins comprising an epitope bound by the small epitope antibody are enriched; wherein the
complex was generated by contacting a sample with the small epitope antibody.
[0010] In another aspect, the invention provides methods for reducing the complexity of a
sample, said methods comprising: (a) separating a small epitope antibody-protein complex,
whereby proteins comprising an epitope bound by the small epitope antibody are enriched;
wherein the complex was generated by contacting a sample with the small epitope antibody; and
(b) separating proteins from the antibody-protein complex.
[0011] In another aspect, the invention provides methods for reducing the complexity of a
sample, said methods comprising separating protein from a small epitope antibody-protein
complex, whereby protein comprising an epitope bound by the small epitope antibody is
enriched; wherein the small epitope antibody-protein complex is generated by (a) contacting a
sample with the small epitope antibody under conditions that permit binding, whereby the small
epitope antibody-protein complex is generated; and (b) separating an antibody-proteincomplex.
[0012] As is evident, one or more steps may be combined and/or performed sequentially
(often in any order, as long as the requisite product(s) are able to be formed), and, as is evident,
the invention includes various combinations of the steps described herein. It is also evident, and
is described herein, that the invention encompasses methods in which the initial, or first, step is
any of the steps described herein. Methods of the invention encompass embodiments in which
later, "downstream" steps are an initial step.
[0013] In some embodiments, the methods further comprise a step of treating the sample
with a protein cleaving agent, whereby polypeptide fragments are generated. In embodiments
involving a step of separating protein from the antibody-protein complex, the sample can be
treated with a protein cleaving agent prior to a step of contacting a sample with the at least one
small epitope antibody, and/or following a step of separating protein from the antibody-protein
complex. Methods for treatment with protein cleaving agents are well known in the art and
described herein. One or more protein cleaving agent may be used. The protein cleaving agent
may be an enzyme (such as chymotrypsin or trypsin) or a chemical agent (such as cyanogen
bromide).
[0014] Thus, in another aspect, the invention provides methods for reducing the complexity
of a sample, said methods comprising (a) contacting a sample with one or more small epitope

antibody under conditions that permit binding; (b) separating an antibody-protein complex,
whereby proteins comprising one or more epitope(s) bound by the one or more small epitope
antibody are enriched; (c) separating protein from protein-antibody complex; and (d) treating the
protein with a protein cleaving agent, whereby polypeptide fragments are generated.
[0015] In another aspect, the invention provides methods for reducing the complexity of a
sample, said methods comprising (a) contacting a sample with one or more small epitope
antibody under conditions that permit binding, to form an antibody-protein complex; and (b)
treating the antibody-protein complex with a protein cleaving agent to produce polypeptide
fragments.
[0016] In another aspect, the invention provides methods for reducing the complexity of a
protein sample, said methods comprising: (a) treating the sample with a protein cleaving agent,
whereby polypeptide fragments are generated; (b) contacting the polypeptide fragments with one
or more small epitope antibody under conditions that permit binding, whereby antibody-
polypeptide complexes are generated; and (c) separating the antibody-polypeptide complex,
whereby polypeptides comprising one or more epitope bound by the one or more small epitope
antibody are enriched.
[0017] In another aspect, the invention provides methods for reducing the complexity of a
sample, said method comprising: (a) incubating a reaction mixture, said reaction mixture
comprising: (i) a small epitope antibody; and (ii) a sample, wherein incubating is under
conditions permitting binding; and (b) separating an antibody-protein complex, whereby protein
is enriched.
[0018] In another aspect, the invention provides methods for reducing the complexity of a
sample, said method comprising: separating an antibody-protein complex, whereby protein is
enriched; wherein the antibody-protein complex is generated by incubating a reaction mixture,
said reaction mixture comprising: (a) a small epitope antibody; and (b) a sample, wherein
incubating is under conditions permitting binding.
[0019] In another aspect, the invention provides methods for reducing the complexity of a
sample, said method comprising: (a) incubating a reaction mixture, said reaction mixture
comprising: (i) a small epitope antibody; and (ii) a sample, wherein incubating is under
conditions permitting binding; (b) separating an antibody-protein complex; and (c) separating
protein from the protein-antibody complex, whereby protein is enriched.

[0020] In another aspect, the invention provides separating protein from a separated protein-
antibody complex, wherein the protein-antibody complex is generated by incubating a reaction
mixture, said reaction mixture comprising: (a) a small epitope antibody; and (b) a sample,
wherein incubating is under conditions permitting binding; and separation of a protein-antibody
complex.
[0021] In another aspect, the invention provides a method for reducing the complexity of a
sample that comprises a mixture of proteins, comprising separating a small epitope antibody-
protein complex, wherein proteins comprising an epitope bound by the small epitope antibody
are enriched. In some embodiments, the method further comprises separating protein from the
antibody-protein complex. In some embodiments, the small epitope antibody binds an epitope
consisting of about 3 to about 5 amino acids. In some embodiments, the sample is contacted
with a plurality of small epitope antibodies to form a plurality of small epitope antibody-protein
complexes. In some embodiments, the small epitope antibodies are detectably labeled. In some
embodiments, a plurality of small epitope antibodies is immobilized on a solid matrix. In some
embodiments, the sample is contacted with a plurality of small epitope antibodies in parallel. In
some embodiments, the sample is contacted with a plurality of small epitope antibodies serially.
In some embodiments, the sample is contacted with at least 100 small epitope antibodies. In
some embodiments, the method further comprises contacting protein separated from the
antibody-protein complex with a protein cleaving agent to form polypeptide fragments. In some
embodiments, the method further comprises contacting the small epitope antibody-protein
complex with a protein cleaving agent to form polypeptide fragments. In some embodiments,
the method further comprises contacting the sample with a protein cleaving agent to form
polypeptide fragments prior to formation of the small epitope antibody-protein complex,
optionally further comprising separating polypeptide fragments from the small epitope antibody-
protein complex. In one embodiment, the protein cleaving agent comprises a protease. In
another embodiment, the protein cleaving agent comprises a chemical agent.
[0022] In another aspect, the invention provides a method for reducing the complexity of a
sample that comprises a mixture of proteins, comprising (a) contacting the sample with at least
one small epitope antibody to form an antibody-protein complex; and (b) separating the
antibody-protein complex from unbound protein in the sample. In some embodiments, steps (a)
and (b) are performed sequentially. In some embodiments, steps (a) and (b) are performed
simultaneously. In some embodiments, the method further comprises separating protein from the

antibody-protein complex. In some embodiments, the small epitope antibody binds an epitope
consisting of about 3 to about 5 amino acids. In some embodiments, the at least one small
epitope antibody comprises at least about 100 small epitope antibodies.
[0023] In other aspects, the invention provides small epitope antibody-protein complexes,
proteins, and/or polypeptide fragments prepared using methods for reducing the complexity of a
sample described herein.
[0024] As is evident to one skilled in the art, aspects that refer to combining and incubating
the resultant mixture also encompass method embodiments which comprise incubating the
various mixtures (in various combinations and/or subcombinations) so that the desired products
are formed.
[0025] One, or more than one (such as about two, about three, about four, about five, about
ten, about twenty or more) small epitope antibod(ies) may be used in the methods of the
invention. In some embodiments, the sample is contacted with about 20, about 30, about 40,
about 50, about 75, about 100, about 125, about 150, about 200, about 300, about 400, about 500,
about 1000, or more small epitope antibodies. In some embodiments, the sample is contacted
with at least about 20, about 30, about 40, about 50, about 75, about 100, about 125, about 150,
about 200, about 300, about 400, about 500, about 1000, or more small epitope antibodies. In
some embodiments, the sample is contacted with less than about 100, about 95, about 90, about
85, about 80, about 75, about 70, about 65, about 60, about 55, about 50, about 45, about 40,
about 35, about 30, about 25, about 20, about 15, about 10, or fewer small epitope antibodies. In
some embodiments, the sample is contacted with at least about any of 10,20,30,40, 50,75,100,
125,150,200, 300,400 or 500 small epitope antibodies, with an upper limit of about any of 20,
30,40,50,75,100,125,150,200,300,400, 500, or 1000 small epitope antibodies.
[0026] The invention also provides methods using the protein prepared using any of the
methods described herein, for example, methods for characterizing a protein, methods of
expression profiling, methods of identifying proteins; methods for identifying protein
degradation products; methods for identifying change in post-translational modification, and
methods for determining the mass, the amount and/or identity of protein(s) in a sample. Methods
of genotyping (protein mutation detection), identifying splice variants, determining the presence
or absence of a protein of interest, expression profiling; methods for identifying protein
degradation products; methods for identifying change in post-translational modification, and
protein discovery are also encompassed by the methods of the invention.

[0027] Thus, in one aspect, the invention provides methods for characterizing a protein
comprising: (a) reducing the complexity of a sample using any of the methods described herein,
whereby proteins are enriched and/or purified; and (b) analyzing the proteins (interchangeably
termed "products").
[0028] In another aspect, the invention provides methods for characterizing protein
comprising analyzing protein; wherein the protein was prepared using any of the methods
described herein.
[0029] In some embodiments, the step of analyzing comprises determining amount of said
proteins, whereby the amount of protein(s) prepared, enriched and/or separated is quantified. In
some embodiments, the step of analyzing comprises identifying one or more of said proteins. In
some embodiments, the identity of the epitope(s) to which the small epitope antibody(ies) bind is
used to assist identification of the enriched proteins. In some embodiments, a protein is
identified using any one or more of the following characteristics: sequence; mass; m/z ratio (in
embodiments involving mass spectrometric analysis), and/or amino acid composition. In other
embodiments, the step of analyzing comprises determining the mass of one or more protein(s).
In some embodiments, the step of analyzing includes analysis for the detection of any alterations
in the protein, as compared to a reference protein which is identical (at least in part) to the
protein sequence other than the sequence alteration. Sequence alterations include mutations
(such as deletion, substitution, insertion and/or transversion of one or more amino acid), splice
variants, degradation products, and change in glycosylation.
[0030] In another aspect, the invention provides methods for characterizing a protein using
mass spectrometry, comprising: (a) reducing the complexity of a sample using any of the
methods described herein, whereby proteins are enriched and/or purified; and (b) analyzing the
proteins (interchangeably termed "products") which are isolated, purified, prepared and/or
separated using any of the methods herein, wherein the analyzing is by mass spectrometry.
[0031] In another aspect, the invention provides methods for characterizing protein
comprising analyzing protein using mass spectrometry; wherein the protein was prepared using
any of the methods described herein; wherein the analyzing is by mass spectrometry. In some
embodiments, quantity, mass, and/or identity of a protein is determined. In some embodiments,
the methods further comprise use of epitope identity information.
[0032] In some embodiments, mass spectrometric is matrix assisted laser
desorption/ionization ("MALDI") mass spectrometry; surface-enhanced laser

desorption/ionization ("SELDI"); and/or tandem mass spectrometry (e.g., MS/MS, MS/MS/MS,
ESI-MS/MS).
[0033] In another aspect, the invention provides methods for determining the identity of a
protein in a sample using mass spectrometry, said methods comprising: (a) reducing the
complexity of a sample using any of the methods described herein, whereby proteins are
enriched and/or purified; (b) analyzing the proteins (interchangeably termed "products"), wherein
the analyzing is by mass spectrometry; and (c) determining identity of enriched protein. In some
embodiments, the methods further comprise use of epitope identity information.
[0034] In another aspect, the invention provides methods for determining the identity of a
protein in a sample using mass spectrometry, said methods comprising determining the identity
of protein using mass spectrometry; wherein the protein is prepared using any of the methods for
reducing complexity of a sample described herein. In some embodiments, the methods further
comprise use of epitope identity information.
[0035] In another aspect, the invention provides methods for protein expression profiling,
wherein in the level of expression of one or more proteins is determined, wherein the protein is
prepared using any of the methods for reducing complexity of a sample described herein. In
some embodiments, the level of expression is determined using mass spectrometry. In some
embodiments, the invention provides methods for comparing the amounts of proteins in two or
more samples.
[0036] In another aspect, the invention provides methods for protein expression profiling,
wherein in the identity of one or more proteins is determined, wherein the protein is prepared
using any of the methods for reducing complexity of a sample described herein. In some
embodiments, protein identity is determined using mass spectrometry. In some embodiments,
the methods further comprise use of epitope identity information. In some embodiments, the
invention provides methods for comparing the identity of protein(s) in two or more samples.
[0037] In another aspect, the invention provides a method for determining the presence or
absence of a protein of interest in a sample, wherein the method comprises detecting the protein
of interest, of any, in an enriched protein fraction, wherein the enriched protein fraction is
prepared using any of the methods for reducing the complexity of a sample described herein, and
wherein detection of the protein of interest indicates presence of the protein in the sample. In
one embodiment, detection comprises mass spectrometry.

[0038] In another aspect, the invention provides a method for determining the amount of a
protein of interest in a sample, wherein the method comprises quantifying the amount of the
protein of interest in an enriched protein fraction, wherein the enriched protein fraction is
prepared using any of the methods for reducing the complexity of a sample described herein. In
one embodiment, quantification of the protein of interest comprises mass spectrometry.
[0039] In another aspect, the invention provides a method for identifying the protein in a
small epitope antibody-protein complex, wherein the small epitope antibody-protein complex is
prepared using any of the methods for reducing the complexity of a sample described herein. In
one embodiment, the identification comprises mass spectrometry.
[0040] In another aspect, the invention provides a method for identification of a biomarker,
wherein the method comprises comparing the proteins in two or more enriched protein fractions,
wherein each of the two or more enriched protein fractions is prepared from a sample using any
of the methods for reducing the complexity of a sample described herein. In some embodiments,
the two or more samples comprise samples from at least one individual who has a disease
condition and at least one individual who does not have the disease condition, and presence or
absence of the biomarker is indicative of the disease condition. In one embodiment, the
invention provides a method for determining presence or absence of a disease condition in an
individual, comprising determining the level of a biomarker in a sample from the individual,
wherein the biomarker is identified as described herein, and wherein the level of the biomarker is
indicative of the presence or absence of the disease condition. In some embodiments, the two or
more samples comprise samples from at least one individual who has received treatment for a
disease condition and at least one individual who has not received treatment for the disease
condition, and presence or absence of the biomarker is indicative of efficacy of the treatment. In
one embodiment, the invention provides a method for determining efficacy of treatment for a
disease condition in an individual, comprising detennining the level of a biomarker in a sample
from the individual, wherein the biomarker is identified as described herein, and wherein the
level of the biomarker is indicative of the efficacy of treatment. In some embodiments, the two
or more samples comprise samples from at least one individual who has been exposed to a toxin
or pathogen and at least one individual who has not been exposed to the toxin or pathogen, and
presence or absence of the biomarker is indicative of exposure of an individual to the toxin or
pathogen. In one embodiment, the invention provides a method for determining exposure of an
individual to a toxin or pathogen, comprising detennining the level of a biomarker in a sample

from the individual, wherein the biomarker is identified as described herein, and wherein the
level of the biomarker is indicative of exposure to the toxin or pathogen.
[0041] In another aspect, the invention provides compositions and kits comprising one or
more small epitope antibodies for use in any of the methods of the invention.
[0042] In some embodiments, the invention provides a composition comprising a plurality of
small epitope antibodies. In some embodiments, the plurality of small epitope antibodies binds
epitopes consisting of about 3 to about 5 amino acids. In some embodiments, the small epitope
antibodies are detectably labeled. In some embodiments, the plurality of small epitope
antibodies comprises at least about 100 small epitope antibodies.
[0043] In some embodiments, the invention provides a kit comprising a plurality of small
epitope antibodies. In some embodiments, the plurality of small epitope antibodies binds
epitopes consisting of about 3 to about 5 amino acids. In some embodiments, the small epitope
antibodies are detectably labeled. In some embodiments, the plurality of small epitope
antibodies comprises at least about 100 small epitope antibodies.
BRIEF DESCRIPTION OF THE DRAWINGS
[0044] FIGURE 1 shows the reaction pattern using mapping polypeptides spanning
sequences of immunization polypeptides for group 2 and group 5 mice, respectively.
[0045] FIGURE 2 shows the results of a secondary screen of positive antibodies in a phage
ELISA, as described in Example 2.
[0046] FIGURE 3 shows an SPR trace of a single chain antibody derived from phage
L50P1_15 against peptides 1, 6, 7, 8, and 9, as described in Example 2.
DETAILED DESCRIPTION OF THE INVENTION
[0047] The invention provides methods using one or more antibodies that bind (generally,
specifically bind) small epitopes, termed "small epitope antibodies", to fractionate a protein
mixture based on the presence and/or quantity of small epitopes within protein in the protein
mixture, whereby protein(s) comprising the small epitope are isolated, separated, prepared,
purified and/or enriched. Use of the methods of the invention thereby provides a means for
reducing the complexity of a protein mixture, facilitating subsequent use and/or characterization

of the enriched protein components of the sample. Insofar as the small epitope bound by the
antibody is known, binding by a small epitope antibody provides information relating to amino
acid content of protein(s) bound by the small epitope antibody. Small epitope antibodies are
further described herein.
[0048] As a general overview, the methods comprise: (a) contacting a sample with at least
one small epitope antibody under conditions that permit binding; and (b) separating an antibody-
protein complex from proteins that are not bound by the small epitope antibody(ies). Generally,
proteins comprising one or more epitope bound by the at least one small epitope antibody are
isolated, separated, enriched and/or purified. In some embodiments, the methods further
comprise: step (c) of separating protein from the antibody-protein complex. In some
embodiments, the methods further comprise a step of treating the sample with a protein cleaving
agent prior to step (a) of contacting a sample with the at least one small epitope antibody, or, in
embodiments involving separation of protein from the antibody-protein complex, after step (c) of
separating protein from the antibody-protein complex.
[0049] The methods of the invention are useful for fractionating samples comprising protein,
which is accomplished by the use of antibodies (termed "small epitope antibodies") that
recognize epitopes that are present in a multiplicity of proteins (such as, for example, an epitope
consisting of or consisting essentially of 3 linear amino acids, 4 linear amino acids, or 5 linear
amino acids). Small epitope antibodies suitable for use in the methods of the invention are
extensively described herein and exemplified in the Examples. By virtue of the specificity of the
small epitope antibodies, proteins (e.g., polypeptides) are separated, enriched and/or purified
depending on the presence and/or amount of the small epitope within the protein that is
recognized by the small epitope antibody(ies) used in the methods of the invention. Methods
using the protein prepared using the methods of the invention are further described herein. As is
evident, "reducing the complexity of a sample", as used herein, encompasses isolating, purifying,
separating, enriching and/or purifying proteins (e.g., polypeptides) from a sample. Accordingly,
the invention provides methods for purifying and/or enriching protein, methods for isolating
protein, methods for separating protein, methods for preparing protein for characterization,
methods for preparing protein for mass spectrometry analysis, methods for identifying protein
(such as one or a group of proteins), methods for discovering new protein, and methods for
quantification of protein in a sample.

[0050] In one aspect, the invention provides methods for reducing the complexity of a
sample, said methods comprising: (a) contacting a sample with one or more small epitope
antibody under conditions that permit binding; and (b) separating an antibody-protein complex,
whereby proteins comprising one or more epitope(s) bound by the one or more small epitope
antibody are isolated, separated, enriched and/or purified.
[0051] In another aspect, the invention further provides methods for purifying and/or
enriching protein; isolating protein; separating protein, preparing protein for characterization;
preparing protein for mass spectrometry analysis; identifying protein (such as one or more
protein, or a group of proteins); discovering a new protein; and/or quantification of protein in a
sample, wherein said methods comprising: (a) contacting a sample with one or more small
epitope antibody under conditions that permit binding; and (b) separating an antibody-protein
complex.
[0052] In another aspect, the invention also encompasses methods using the protein prepared
using any of the methods of the invention, for example, for characterizing a protein, methods of
expression profiling, methods of identifying proteins; methods for identifying protein
degradation products; methods for identifying change in post-translational modification, and
methods for detemiining the mass, the amount and/or identity of protein(s) in a sample. For
example, these methods can be applied in such areas as protein discovery, expression profiling,
drug discovery and diagnostics.
[0053] In another embodiment, mass spectrometry is used to characterize the protein
prepared using any of the methods of the invention. The protein fraction generated using a small
epitope antibody is particularly amenable to analysis using mass spectrometry because the
number of proteins (including protein variants) is reduced (as compared with the starting sample)
by use of the small epitope antibodies described herein. Insofar as the epitope present within the
protein is identified, the amino acid sequence or content of the epitope (termed "epitope
sequence" or "epitope amino acid content") provides further information useful for
characterizing and identifying the protein. Mass spectrometry methods have been used to
quantify and/or identify proteins. In some embodiments, mass spectrometry analysis generates a
polypeptide mass map. Using these results, polypeptide mass mapping may permit identification
of the corresponding protein. In other embodiments, mass spectrometry analysis is by tandem
mass spectrometer, and generates specific sequence information. Use of this information may
result in identification of the corresponding protein at the sequence level. In some embodiments,

protein is identified using a method comprising MS analysis of protein prepared using any of the
methods of the invention, in combination with epitope sequence or amino acid content
information.
[0054] One or more than one (such as about 2, about 5, about 7, about 10, about 20, about
30, about 50, about 100, or more) small epitope antibodies may be used in the methods of the
invention. In some embodiments, the sample is contacted with about 20, about 30, about 40,
about 50, about 75, about 100, about 125, about 150, about 200, about 300, about 400, about 500,
about 1000, or more small epitope antibodies. In some embodiments, the sample is contacted
with at least about 20, about 30, about 40, about 50, about 75, about 100, about 125, about 150,
about 200, about 300, about 400, about 500, about 1000, or more small epitope antibodies. In
some embodiments, the sample is contacted with less than about 100, about 95, about 90, about
85, about 80, about 75, about 70, about 65, about 60, about 55, about 50, about 45, about 40,
about 35, about 30, about 25, about 20, about 15, about 10, or fewer small epitope antibodies. In
some embodiments, the sample is contacted with at least about any of 10,20, 30,40, 50, 75,100,
125,150,200,300,400 or 500 small epitope antibodies, with an upper limit of about any of 20,
30, 40, 50, 75,100,125,150,200, 300,400,500, or 1000 small epitope antibodies.
[0055] In another aspect, the invention provides compositions and kits comprising one or
more small epitope antibody for use in any of the methods of the invention. In some
embodiments, the kits further comprise instructions for any of the methods described herein.
General Techniques
[0056] The practice of the present invention will employ, unless otherwise indicated,
conventional techniques of molecular biology (including recombinant techniques), microbiology,
cell biology, biochemistry and immunology, which are within the skill of the art. Such
techniques are explained fully in the literature, such as, Molecular Cloning: A Laboratory
Manual, second edition (Sambrook et al., 1989) Cold Spring Harbor Press; Oligonucleotide
Synthesis (M.J. Gait, ed., 1984); Methods in Molecular Biology, Humana Press; Cell Biology: A
Laboratory Notebook (J.E. Cellis, ed., 1998) Academic Press; Animal Cell Culture (R.I.
Freshney, ed., 1987); Introduction to Cell and Tissue Culture ( J.P. Mather and P.E. Roberts,
1998) Plenum Press; Cell and Tissue Culture: Laboratory Procedures (A. Doyle, J.B. Griffiths,
and D.G. Newell, eds., 1993-8) J. Wiley and Sons; Methods in Enzymology (Academic Press,

Inc.); Handbook of Experimental Immunology (D.M. Weir and C.C. Blackwell, eds.,)/ Gene
Transfer Vectors for Mammalian Cells (J.M. Miller and M.P. Calos, eds., 1987); Current
Protocols in Molecular Biology (F.M. Ausubel et al., eds., 1987); PCR: The Polymerase Chain
Reaction (Mullis et al., eds., 1994); Current Protocols in Immunology (J.E. Coligan et al., eds.,
1991); Short Protocols in Molecular Biology (Wiley and Sons, 1999); Immunobiology (C.A.
Janeway and P. Travers, 1997); Antibodies (P. Finch, 1997); Antibodies: a practical approach
(D. Catty., ed., IRL Press, 1988-1989); Monoclonal antibodies : a practical approach (P.
Shepherd and C. Dean, eds., Oxford University Press, 2000); Using antibodies: a laboratory
manual (E. Harlow and D. Lane (Cold Spring Harbor Laboratory Press, 1999); and The
Antibodies (M. Zanetti and J.D. Capra, eds., Harwood Academic Publishers, 1995).
Definitions
[0057] An "antibody" is an immunoglobulin molecule capable of specific binding to a target,
such as a carbohydrate, polynucleotide, lipid, polypeptide, etc., through at least one antigen
recognition site, located in the variable region of the immunoglobulin molecule. As used herein,
the term encompasses not only intact polyclonal or monoclonal antibodies, but also fragments
thereof (such as Fab, Fab', F(ab')2, Fv), single chain (ScFv), mutants thereof, fusion proteins
comprising an antibody portion, and any other modified configuration of the immunoglobulin
molecule that comprises an antigen recognition site of the required specificity. An antibody
includes an antibody of any class, such as IgG, IgA, or IgM (or sub-class thereof), and the
antibody need not be of any particular class. Depending on the antibody amino acid sequence of
the constant domain of its heavy chains, immunoglobulins can be assigned to different classes.
There are five major classes of immunoglobulins: IgA, IgD, IgE, IgG, and IgM, and several of
these may be further divided into subclasses (isotypes), e.g., IgGl, IgG2, IgG3, IgG4, IgAl and
IgA2. The heavy-chain constant domains that correspond to the different classes of
immunoglobulins are called alpha, delta, epsilon, gamma, and mu, respectively. The subunit
structures and three-dimensional configurations of different classes of immunoglobulins are well
known.
[0058] "Fv" is an antibody fragment that contains a complete antigen-recognition and -
binding site. In a two-chain Fv species, this region consists of a dimer of one heavy and one
light chain variable domain in tight, non-covalent association. In a single-chain Fv species, one

heavy and one light chain variable domain can be covalently linked by a flexible polypeptide
linker such that the light and heavy chains can associate in a dimeric structure analogous to that
in a two-chain Fv species. It is in this configuration that the three CDRs of each variable domain
interact to define an antigen-binding specificity on the surface of the VH-VL dimer. However,
even a single variable domain (or half of a Fv comprising only 3 CDRs specific for an antigen)
has the ability to recognize and bind antigen, although generally at a lower affinity than the
entire binding site.
[0059] The Fab fragment also contains the constant domain of the light chain and the first
constant domain (CHI) of the heavy chain. Fab' fragments differ from Fab fragments by the
addition of a few residues at the carboxy terminus of the heavy chain CHI domain including one
or more cysteines from the antibody hinge regions.
[0060] A "monoclonal antibody" refers to a homogeneous antibody population wherein the
monoclonal antibody is comprised of amino acids (naturally occurring and non-naturally
occurring) that are involved in the selective binding of an antigen. A population of monoclonal
antibodies (as opposed to polyclonal antibodies) are highly specific, in the sense that they are
directed against a single antigenic site. The term "monoclonal antibody" encompasses not only
intact monoclonal antibodies and full-length monoclonal antibodies, but also fragments thereof
(such as Fab, Fab', F(ab')2, Fv), single chain (ScFv), mutants thereof, fusion proteins comprising
an antibody portion, and any other modified configuration of the immunoglobulin molecule that
comprises an antigen recognition site of the required specificity and the ability to bind to an
antigen (see definition of antibody). It is not intended to be limited as regards to the source of
the antibody or the manner in which it is made (e.g., by hybridoma, phage selection, recombinant
expression, transgenic animals, etc.).
[0061] The terms "polypeptide", "oligopeptide", "peptide" and "protein" are used
interchangeably herein to refer to polymers of amino acids of any length. The polymer may be
linear or branched, it may comprise modified amino acids, and it may be interrupted by non-
amino acids. The terms also encompass an amino acid polymer that has been modified naturally
or by intervention; for example, disulfide bond formation, glycosylation, lipidation, acetylation,
phosphorylation, or any other manipulation or modification, such as conjugation with a labeling
component. Also included within the definition are, for example, polypeptides containing one or
more analogs of an amino acid (including, for example, unnatural amino acids, etc.), as well as
other modifications known in the art.

[0062] An epitope that "specifically binds" or "preferentially binds" (used interchangeably
herein) to an antibody is a term well understood in the art, and methods to determine such
specific or preferential binding are also well known in the art. A molecule is said to exhibit
"specific binding" or "preferential binding" if it reacts or associates more frequently, more
rapidly, with greater duration and/or with greater affinity with a particular cell or substance than
it does with alternative cells or substances. An antibody "specifically binds" or "preferentially
binds" to a target if it binds with greater affinity, avidity, more readily, and/or with greater
duration than it binds to other substances. For example, an antibody that specifically or
preferentially binds to an epitope is an antibody that binds this epitope with greater affinity,
avidity, more readily, and/or with greater duration than it binds to other epitopes. It is also
understood by reading this definition that, for example, an antibody (or moiety or epitope) that
specifically or preferentially binds to a first target may or may not specifically or preferentially
bind to a second target. As such, "specific binding" or "preferential binding" does not
necessarily require (although it can include) exclusive binding. Generally, but not necessarily,
reference to binding means preferential binding.
[0063] A "sample" encompasses a variety of sample types, including those obtained from an
individual. The definition encompasses blood and other liquid samples of biological origin, solid
tissue samples such as a biopsy specimen or tissue cultures or cells derived therefrom, and the
progeny thereof. A sample can be from a microorganism (e.g., bacteria, yeasts, viruses, viroids,
molds, fungi) plant, or animal, including mammals such as humans, rodents (such as mice and
rats), and monkeys (and other primates). A sample may comprise a single cell or more than a
single cell. The definition also includes samples that have been manipulated in any way after
their procurement, such as by treatment with reagents, solubilization, or enrichment for certain
components, such as proteins or polynucleotides. The term "sample" encompasses a clinical
sample, and also includes cells in culture, cell supernatants, cell lysates, serum, plasma,
biological fluid, human tissue propagated in animals, and tissue samples. Examples of a sample
include blood, plasma, serum, urine, stool, cerebrospinal fluid, synovial fluid, amniotic fluid,
saliva, lung lavage, semen, milk, nipple aspirate, prostatic fluid, mucous, and tears.
[0064] The "complexity" of a sample means the number of different protein species,
including number of different proteins as well as number of different protein variants (including
splice variants, polymorphisms, and protein degradation products).

[0065] "Detect" refers to identifying (determining) the presence, absence and/or amount of
the object or substance to be detected, and as described herein, detection may be qualitative
and/or quantitative.
[0066] As used herein, the singular form "a", "an", and "the" includes plural references
unless indicated otherwise. For example, "an" antibody includes one or more antibodies and "a
protein" means one or more proteins.
Methods of the invention
[0067] With respect to all methods described herein, reference to a small epitope antibody
also includes compositions comprising one or more of these antibodies. These compositions
may further comprise suitable excipients, such as pharmaceutically acceptable excipients, as well
as buffers and/or components to enhance stability, which are well known in the art.
Methods for reducing the complexity of a sample
[0068] The invention provides methods using one or more antibodies that bind (generally,
specifically bind) small epitopes, termed "small epitope antibodies", to fractionate a protein
mixture based on the presence or absence or amount of small epitopes within proteins within the
protein mixture, whereby a fraction comprising protein(s) comprising and enriched for the small
epitope is generated. As used herein, "enriched" refers to an increase in concentration and/or
purity of a protein or peptide in comparison with the concentration and/or purity of the protein or
peptide in the sample from which it was derived. Use of the methods of the invention thereby
provides a means for reducing the complexity of a protein mixture, facilitating subsequent use
and/or characterization of the enriched protein components of the sample. Insofar as the amino
acid sequence or composition of the small epitope bound by the antibody is known, binding by a
small epitope antibody provides information relating to amino acid sequence and/or content of
protein(s) bound by the small epitope antibody. As described herein, epitope identity
information {i.e., the amino acid content and/or sequence recognized by a small epitope
antibody) may be used in combination with other methods of the invention to, e.g., identify
proteins. Small epitope antibodies are further described herein.
[0069] The invention further provides methods for purifying and/or enriching protein;
isolating protein; separating protein, preparing protein for characterization (e.g., subsequent

analysis); preparing protein for mass spectrometry analysis; identifying protein; discovering new
protein; and/or quantification of protein in a sample.
[0070] As a general overview, the methods comprise: .(a) contacting a sample with at least
one small epitope antibody under conditions that permit binding; and (b) separating an antibody-
protein complex. In one embodiment, steps (a) and (b) occur sequentially. In another
embodiment, steps (a) and (b) occur simultaneously. Generally, proteins comprising one or more
epitope bound by the one or more small epitope antibody are isolated, separated, enriched and/or
purified (i.e., removed from the environment of the original sample). In some embodiments, the
methods further comprise: step (c) of separating protein from the antibody-protein complex. In
some embodiments, the methods further comprise treating the sample with a protein cleaving
agent. In one embodiment, the protein cleaving agent is added prior to step (a) of contacting a
sample with the at least one small epitope antibody. In another embodiment, the protein cleaving
agent is added after step (c) of separating protein from the antibody-protein complex.
[0071] The methods of the invention are useful for fractionating samples comprising protein
(such as polypeptides), which is accomplished by the use of antibodies (termed "small epitope
antibodies") that recognize epitopes that are present in a multiplicity of proteins (such an epitope
consisting of or consisting essentially of 3 linear amino acids, 4 linear amino acids, or 5 linear
amino acids). Small epitope antibodies suitable for use in the methods of the invention are
extensively described herein and exemplified in the Examples. By virtue of the specificity of the
small epitope antibodies, proteins or peptides (e.g., polypeptides) are separated, enriched and/or
purified depending on the presence and/or amount of the small epitope within the protein that is
recognized by the small epitope antibody(ies) used in the methods of the invention. As is
evident, "reducing the complexity of a sample", as used herein, encompasses isolating, purifying,
separating, enriching and/or purifying proteins or peptides (e.g., polypeptides) from a sample
(including removing the proteins or peptides from the environment of the sample).
[0072] Accordingly, in one aspect, the invention provides methods for reducing the
complexity of a sample, said methods comprising: (a) contacting a sample with one or more
small epitope antibody under conditions that permit binding; and (b) separating an antibody-
protein complex, whereby proteins comprising one or more epitope(s) bound by the one or more
small epitope antibody are isolated, separated, enriched and/or purified. In some embodiments,
the methods further comprise: step (c) of separating protein from the antibody-protein complex.

[0073] In one embodiment, the invention provides methods for reducing the complexity of a
protein sample, said methods comprising separating a small epitope antibody-protein complex,
whereby proteins comprising an epitope bound by the small epitope antibody are enriched;
wherein the complex was generated by contacting a sample with the small epitope antibody. In
another embodiment, the invention provides methods for reducing the complexity of a protein
sample, said methods comprising separating a plurality of small epitope antibody-protein
complexes, whereby proteins comprising epitopes bound by a plurality of small epitope
antibodies are enriched, and wherein the complexes were generated by contacting a sample with
the plurality of small epitope antibodies.
[0074] In another aspect, the invention provides methods for reducing the complexity of a
protein sample, said methods comprising separating protein from a small-epitope antibody-
protein complex, whereby protein comprising an epitope bound by the small epitope antibody is
enriched; wherein the small epitope antibody-protein complex is generated by contacting a
sample with the small epitope antibody under conditions that permit binding, whereby the small
epitope antibody-protein complex is generated; and separating an antibody-protein complex from
unbound proteins in the sample, if any. In one embodiment, the invention provides methods for
reducing the complexity of a protein sample, said methods comprising separating a plurality of
proteins from small epitope antibody-protein complexes, whereby protein comprising epitopes
bound by a plurality of small epitope antibodies is enriched, and wherein the small epitope
antibody-protein complexes are generated by contacting a sample with a plurality of small
epitope antibodies under conditions that permit binding to proteins in the sample, whereby small
epitope antibody-protein complexes are generated, and separating the antibody-protein
complexes from unbound proteins in the sample, if any.
[0075] As is evident, one or more steps may be combined and/or performed sequentially
(often in any order, as long as the requisite product(s) are able to be formed), and, as is evident,
the invention includes various combinations of the steps described herein. It is also evident, and
is described herein, that the invention encompasses methods in which the initial, or first, step is
any of the steps described herein. Methods of the invention encompass embodiments in which
later, "downstream" steps are an initial step.
[0076] In some embodiments, the methods further comprise a step of treating the sample
with a protein cleaving agent, whereby polypeptide fragments are generated. In embodiments
involving step (c) of separating protein from the antibody-protein complex, the sample can

treated with a protein cleaving agent prior to step (a) of contacting a sample with the at least one
small epitope antibody, and/or following step (c) of separating protein from the antibody-protein
complex. The protein cleaving agent may be an enzyme (such as chymotrypsin or trypsin) or a
chemical agent (such as cyanogen bromide). Protein cleaving agents and methods for treatment
with protein cleaving agents are well known in the art and further described herein.
[0077] In another aspect, the invention provides methods for reducing the complexity of a
sample, said methods comprising (a) contacting a sample with one or more small epitope
antibody under conditions that permit binding; (b) separating an antibody-protein complex,
whereby proteins comprising one or more epitope(s) bound by the one or more small epitope
antibody are enriched; (c) separating protein from protein-antibody complex; and (d) treating the
protein with a protein cleaving agent, whereby polypeptide fragments are generated.
[0078] In another aspect, the invention provides methods for reducing the complexity of a
sample, said methods comprising (a) contacting a sample with one or more small epitope
antibody under conditions that permit binding, thereby forming an antibody-protein complex;
and (b) treating the antibody-protein complex with a protein cleaving agent to produce
polypeptide fragments.
[0079] In another aspect, the invention provides methods for reducing the complexity of a
protein sample, said methods comprising: (a) treating the sample with a protein cleaving agent,
whereby polypeptide fragments are generated; (b) contacting the polypeptide fragments with one
or more small epitope antibody under conditions that permit binding, whereby antibody-
polypeptide complexes are generated; and (c) separating the antibody-polypeptide complex,
whereby polypeptides comprising one or more epitope bound by the one or more small epitope
antibody are enriched.
[0080] One, or more than one (such as about two, about three, about four, about five, about
ten, about twenty, about one hundred, or more) small epitope antibod(ies) may be used in the
methods of the invention. In some embodiments, the sample is contacted with about 20, about
30, about 40, about 50, about 75, about 100, about 125, about 150, about 200, about 300, about
400, about 500, about 1000, or more small epitope antibodies. In some embodiments, the sample
is contacted with at least about 20, about 30, about 40, about 50, about 75, about 100, about 125,
about 150, about 200, about 300, about 400, about 500, about 1000, or more small epitope
antibodies. In some embodiments, the sample is contacted with less than about 100, about 95,
about 90, about 85, about 80, about 75, about 70, about 65, about 60, about 55, about 50, about

45, about 40, about 35, about 30, about 25, about 20, about 15, about 10, or fewer small epitope
antibodies. In some embodiments, the sample is contacted with at least about any of 10, 20,30,
40, 50,75,100,125,150,200,300,400, or 500 small epitope antibodies, with an upper limit of
about any of 20, 30,40, 50, 75,100, 125,150, 200, 300,400, 500, or 1000 small epitope
antibodies.
[0081] It is understood that the sample may also be contacted with other protein binding
agents, including antibodies that are not small epitope antibodies, and other protein binding
agents. Such agents may be used simultaneously, sequentially, before or after treatment with
small epitope antibodies.
[0082] In some embodiments, the sample is treated with one or more antibodies that bind to
one or more proteins, preferably proteins that are known to be abundant in the sample, prior to or
simultaneously with the step of contacting the sample with one or more small epitope antibodies.
For example, in a serum sample, pretreatment may comprise antibodies that bind to albumin,
immunoglobulin, and/or other abundant proteins. In one embodiment, proteins in the sample are
cleaved with a protein cleaving agent prior to contact with the one or more antibodies that bind
to one or more known abundant proteins. In another embodiment, proteins in the sample are
cleaved with a protein cleaving agent after contact with the one or more antibodies that bind to
one or more known proteins, such as abundant proteins. In one embodiment, the bound
protein(s) (such as abundant protein(s)) are removed from the sample prior to contact with the
one or more small epitope antibodies. In one embodiment, the method comprises "debulking" of
a sample by treatment with one or more antibodies that bind to one or more known proteins in
the sample, such as abundant protein(s) (optionally followed by removal of bound proteins),
cleavage of proteins in the sample with a protein cleaving agent, and contact of cleaved proteins
with one or more small epitope antibodies. In another embodiment, the method comprises
treatment of the sample with a protein cleaving agent, debulking of the sample by treatment with
one or more antibodies that bind to one or more known proteins, such as abundant protein(s)
and/or cleaved polypeptide fragments in the sample (optionally followed by removal of the
bound protein(s) and/or polypeptide fragments), and contact of the remaining proteins and/or
cleaved polypeptide fragments with one or more small epitope antibodies. In another
embodiment, the method comprises debulking of the sample by treatment with one or more
antibodies that bind to one or more known proteins, such as abundant protein(s) (optionally
followed by removal of the bound protein(s)), contacting the sample with at least one small

epitope antibody to form an antibody-protein complex, and treatment of the antibody-protein
complex with a protein cleaving agent.
[0083] It is further understood that the protein components of the sample that remain
following treatment with small-epitope antibodies (i.e., the unbound components) may also be
suitable for use in the methods of the invention using protein generated using the methods of the
invention. Thus, in some embodiments, the methods using the protein generated using the
methods of the invention encompass use of this unbound protein fraction.
[0084] Methods and conditions for antibody binding and separation of antibody-protein
complexes are well known in the art and further described herein. Generally, the sample is
partially or wholly denatured when it is contacted with the small epitope antibody(ies), but
denaturation is not required in every embodiment. In some embodiment, step (a) of contacting a
sample with two or more antibodies is sequential (as when one antibody is contacted with the
sample, then removed, another antibody is contacted with the sample and removed, and so on).
In other embodiments, step (a) of contacting with two or more antibodies is in parallel, for
example, as when a group of antibodies are contacted with the sample simultaneously. In some
embodiments, several groups of two or more antibodies are serially contacted with the sample,
for example, group 1 is contacted and removed, group 2 is contacted and removed, and so on.
[0085] As noted in the definition, and as used herein, "sample" encompasses a variety of
sample types, including those obtained from an individual. In some embodiment, the sample
comprises blood, plasma, serum, urine, stool, cerebrospinal fluid, synovial fluid, amniotic fluid,
saliva, lung lavage, semen, milk, nipple aspirate, prostatic fluid, mucous, and tears. Suitable
samples for use in the methods of the invention are described further herein.
Methods using proteins isolated ('enriched') using the methods of the invention
[0086] The proteins isolated or enriched using the methods of the invention can be used for a
variety of purposes. For purposes of illustration, methods of characterizing proteins using the
proteins enriched and/or purified by the methods of the invention, are described. In some
embodiments, the proteins are characterized using mass spectrometry, whereby the proteins may
be quantified and/or identified. Methods of genotyping (protein mutation detection), identifying
splice variants, determining the presence or absence of a protein of interest, expression profiling;

methods for identifying protein degradation products; methods for identifying change inpost-
translational modification, and methods of protein discovery are also described.
[0087] For simplicity and convenience, reference is generally made to "protein(s)". It is
understood that reference to protein encompasses "polypeptides" (interchangeably termed
"polypeptide fragments"). As is evident from the discussion herein, in some embodiments, a
protein cleaving agent is used to generate polypeptide fragments.
Methods of characterizing a protein
[0088] The invention provides methods for characterizing (for example, detecting (presence
or absence) and/or quantifying) a protein of interest (generally, a polypeptide fragment). In some
embodiments, use of the methods of the invention generates one or more fractions of the sample,
each of which comprises fewer proteins than in the starting sample, facilitating subsequent
characterization of the protein comprised in the fraction. In particular, characterization using
mass spectrometry is expected to be enhanced, as further described herein.
[0089] Thus, the invention provides methods for characterizing a protein comprising: (a)
reducing the complexity of a sample using any of the methods described herein, whereby
proteins are enriched and/or purified; and (b) analyzing the proteins (interchangeably termed
"products") which are isolated by any one or of the methods described herein.
[0090] In another aspect, the invention provides methods for characterizing a protein
comprising: analyzing proteins (interchangeably termed "products"), wherein the protein is
prepared using any of the methods for reducing complexity of a sample described herein
(including: methods for purifying and/or enriching a protein, methods for isolating a protein,
methods for separating a protein, methods for preparing a protein fraction for characterization,
methods for preparing a protein fraction for mass spectrometry analysis, methods for identifying
a protein (such as one or more protein, or a group of proteins), methods for discovering a new
protein, and methods for quantification of protein in a sample.)
[0091] The step of analyzing can be performed by any method known in the art or described
herein. Methods for analyzing proteins are well known in the art, and include: sodium dodecyl
sulphate-polyacrylamide gel electrophoresis ("SDS-PAGE"), isoelectric focusing, separated by
such techniques as high pressure liquid chromatography, FPLC, thin layer chromatography,
affinity chromatography, gel-filtration chromatography, ion exchange chromatography, and other

standard biochemical analyses, immunodetection, protein sequencing, analysis with protein
arrays, mass spectrometry, and the like. Thus, the invention includes those further analytical
and/or quantification methods as applied to any of the products of the methods herein.
[0092] In some embodiments, the step of analyzing comprises determining amount of said
proteins, whereby the amount of protein(s) prepared, enriched and/or separated is quantified. It
is understood that the amount of enriched protein(s) may be determined using quantitative and/or
qualitative methods. Determining amount of protein product includes determining whether
product is present or absent.
[0093] In some embodiments, the step of analyzing comprises identifying one or more of
said proteins. Methods for identifying a protein are known in the art, and include:
immunodetection, protein sequencing, and the like. In some embodiments, essentially all of the
enriched proteins (purified or enriched from a sample) are identified. In some embodiments, the
identity of the epitope(s) to which the small epitope antibody(ies) bind is used to assist
identification of the enriched proteins. In some embodiments, a protein is identified using any
one or more of the following characteristics: sequence; mass; m/z ratio (in embodiments
involving mass spectrometric analysis), amino acid composition, and any other method that
provide sufficient information to identify a protein. As used herein, "identify" includes
identifying known (previously characterized proteins) as well as discovery of previously
unknown or uncharacterized proteins (including protein variants such as mutant proteins,
differentially modified proteins {e.g., varying carbohydrate content) and splice variants). In
some embodiments, a multiplicity, a large multiplicity or a very large multiplicity of proteins are
identified. In other embodiments, at least about 2, 3, 4, 5,10,20, 30,40, 50, 60,70, 80, 90,100,
500, or 1000 or more proteins are identified.
[0094] In other embodiments, the step of analyzing comprises determining the mass of one
or more protein(s).
[0095] In some embodiments, the step of analyzing includes analysis for the detection of any
alterations in the protein, as compared to a reference protein which is identical (at least in part) to
the protein sequence other than the sequence alteration. The sequence alterations may be
sequence alterations present in the genomic sequence or may be sequence alterations which are
not reflected in the genomic DNA sequences, for example, alterations due to post transcriptional
alterations, and/or mRNA processing, including splice variants, and/or post-translational
modifications, such as variation in amount of glycosylation, and protein degradation or by-

products. Sequence alterations include mutations (such as deletion, substitution, insertion
and/or transversion of one or more amino acid).
[0096] It is understood that the identity (sequence) of the epitope(s) to which the small
epitope antibody(ies) may be used in combination with any of the methods described herein to,
e.g., identify proteins.
Method of characterizing a protein using mass spectrometry
[0097] In some embodiments, mass spectrometry (MS) is used to characterize the proteins
isolated using the methods of the invention. Generally, in embodiments involving mass
spectrometric analysis, the sample will be treated with a protein cleaving agent (whereby
polypeptide fragments are generated), but treatment with a cleaving agent is not required in
every embodiment. In some embodiments, the sample is. treated with a protein cleaving agent
prior to contacting the sample with small epitope antibodies. In other embodiments, the sample
is treated with a protein cleaving agent following enrichment of a protein fraction by contacting
with a small epitope antibody, separation of antibody-protein complex, and separation of protein
from the protein-antibody complex. As noted herein, the protein (such as polypeptide fragments)
generated using the methods of the invention are particularly amenable to analysis using mass
spectrometry because use of the methods of the invention generates fractions of proteins that are
less complex than are the starting sample. Insofar as the epitope present within the protein is
known, e.g., the cognate epitope recognized by the small epitope antibody used to purify and/or
enrich the protein fraction comprising the protein, the amino acid sequence or content of the
epitope (termed "epitope sequence" or "epitope content") provides further information useful for
characterizing and identifying the protein.
[0098] Methods for mass spectrometric protein analysis are well known in the art and further
described herein. Mass spectrometry methods have been used to quantify and/or identify
proteins. (See, e.g., Li et al. (2000) Tibtech 18:151-160; Rowley et al. (2000) Methods 20: 383-
397; and Kuster and Mann (1998) Curr. Opin. Structural Biol. 8: 393-400.) Mass spectrometric
techniques have also been developed that permit at least partial de novo sequencing of isolated
proteins. Chait et al. (1993) Science 262:89-92; Keough et al.(1999) Proc. Natl. Acad. Sci. USA
96:7131-6; reviewed in Bergman (2000) EXS 88:133-44.

[0099] Polypeptide mass mapping provides a polypeptide mass fingerprint of the protein or
protein fraction under analysis, based on its amino acid composition. Polypeptide mass mapping
can be obtained using, for example, the MALDI-TOF platform, in which matrix-assisted laser
desorption/ionization (MALDI) is used to ionize polypeptides of interest while the time of flight
distribution of the ionized polypeptides provides mass to charge ratio specifications for each
polypeptide which can be used to query protein sequence databases. The polypeptide mass
fingerprints yielded comprise the amino acid composition based on mass and charge
determination. Using these results, a small set of polypeptide mass matches may provide
sufficient information for the identification of the corresponding protein.
[00100] In a second method for protein identification by MS, individual polypeptides in the
mixture are fragmented to generate sequence infonnation. Polypeptides are ionized by
electrospray (ESI) from the liquid phase, and then sprayed into a tandem mass spectrometer that
is capable of resolving polypeptides in a mixture, isolating polypeptides of interest and
dissociating individual polypeptide species into constituent amino- and carboxy-terminal-
containing fragments by predominantly disrupting polypeptide bonds (collision induced
dissociation). The resulting mass spectrum is comprised of the parent ion as well as two
overlapping mass ladders of ions derived from the amino- and carboxy-terminal containing
fragments. Because each member of a ladder differs in mass-to-charge ratio (termed "m/z") by 1
amino acid from its nearest mass neighbor in the series, a partial primary sequence can be
generated and used to query both protein and translated DNA sequence databases. This mass
spectrometry platform provides specific sequence information derived from several polypeptides,
which is often more useful for protein identification that a list of polypeptide masses that reflect
the amino acid composition of the polypeptide (as generated by other platforms, including
SELDI-TOF).
[00101] Mass spectrometry methods further permit quantification of proteins that are
analyzed, as further described below.
[00102] Other mass spectrometry methods are well known in the art, and include: matrix
assisted laser desorption/ionization ("MALDI") mass spectrometry; surface-enhanced laser
desorption/ionization ("SELDI"); Tandem mass spectrometry (e.g., MS/MS, MS/MS/MS, ESI-
MS/MS, etc.). In some embodiments, tandem mass spectrometry is carried out using a laser
desorption/ionization mass spectrophotometer .that is further coupled to a quadrupole time-of-
flight mass spectrometer QqTOF MS (see e.g., Krutchinsky et al, WO 99/38185). Methods such

as MALDI-QqTOFMS (Krutchinsky et al., WO 99/38185; Shevchenko et al. (2000) Anal. Chem.
72:2132-2141), ESI-QqTOF MS (Figeys et al. (1998) Rapid Comm'ns. Mass Spec. 12-1435-
144) and chip capillary electrophoresis (chip-CE)-QqTOF MS (Li et al. (2000) Anal. Chem. 72:
599-609) have been described previously. Mass spectrometers and techniques for using them in
methods of the invention are well known to those of skill in the art. A person skilled in the art
would understand that any of the components of a mass spectrometer (e.g., desorption source, .
mass analyzer, detect, etc.) can be combined with other suitable components described herein or
those known in the art. For additional information regarding mass spectrometers, see, e.g.,
Principles of Instrumental Analysis, 3rd ed., Skoog, Saunders College Publishing, Philadelphia,
1985; and Kirk-Othmer Encyclopedia of Chemical Technology, 4th ed. Vol. 15 (John Wiley &
Sons, New York 1995), pp. 1071-1094.
Data Analysis of Mass Spectra
[00103] The mass spectra data obtained using the mass spectrometry analysis can be used to
obtain information on the quantity and/or identity of the enriched protein products obtained using
the methods of the invention. Data generated by desorption and detection of polypeptides can be
analyzed using any suitable means {e.g., visually, by computer, etc). In one embodiment, data is
analyzed with the use of a programmable digital computer. The computer contains code that
receives as input, data on the strength of the signal at various molecular masses received from a
particular addressable location on the substrate. This data can indicate the number of products
detected, optionally including the strength of the signal of a peak value and the determined
molecular mass for each product detected.
[00104] Data analysis can include the steps of determining signal strength (e.g., height of
peaks) of a peak value (e.g., of a particular mass-to-charge value or range of values) detected and
removing "outliers" (data deviating from a predetermined statistical distribution). The observed
peaks can be normalized, a process whereby the height of each peak relative to some reference is
calculated. For example, a reference can be background noise generated by instrument and
chemicals (e.g., energy absorbing molecule) which is set as zero in the scale. Then the signal
strength detected for each polypeptide or other substances can be displayed in the form of
relative intensities in the scale desired (e.g., 100). Alternatively, a standard may be admitted
with the sample so that a peak from the standard can be used as a reference to calculate relative

intensities of the signals observed for each affinity tagged product detected. Software programs
such as the Biomarker Wizard program (Ciphergen Biosystems, Inc., Fremont, Calif.) can be
used to aid in analyzing mass spectra.
[00105] In some embodiments the amounts of one or more proteins present in a sample is
determined, in part, by executing an algorithm with a programmable digital computer. The
algorithm identifies at least one peak value in a first mass spectrum of a first sample and in a
second mass spectrum of a second sample. The algorithm then compares the signal strength of
the peak value of the first mass spectrum to the signal strength of the peak value of the second
mass spectrum. The relative signal strengths are an indication of the amount of the protein that is
present in the first and second samples. A standard containing a known amount of a protein can
be analyzed as the second sample to better quantify the amount of the protein present in the first
sample. In certain embodiments, the identities of the proteins in the first and second samples can
also be determined (see below).
[00106] The present invention also provides methods of determining the identity of a protein.
In certain embodiments, a programmable digital computer is used to access a database containing
one or more mass spectra. An algorithm is then executed with a programmable digital computer
to determine at least a first measure for each of the predicted mass spectra. The first measure is
an indication of the closeness-of-fit between a mass spectrum of the protein and each of the
plurality of predicted mass spectra.
[00107] The data of a mass spectrum can be used to identify the proteins by executing an
algorithm with a programmable digital computer that compares the MS data to records in a
database. Each molecule provides characteristic mass-spectrometric (MS) data (also referred to
as a mass spectral "signature" or "fingerprint") when analyzed by MS methods. This data can be
analyzed by comparing it to databases containing, inter alia, actual or theoretical MS data or
protein sequence information. Additionally, a protein may be cleaved into fragments for MS
analysis. Information obtained from the MS analysis of fragments is also compared to a database
to identify proteins {e.g., proteins) in the sample (see e.g., Yates (1998) J. Mass Spec. 33:1-19;
Yates et al., U.S. Pat. No. 5,538,897; Yates et al., U.S. Pat. No. 6,017,693; PCT Publication No.
WO 00/11208 and Gygi et al. (1999) Nat. Biotechnol 10:994-999). Software resources that
facilitate interpretation of mass spectra, especially protein mass spectra, and mining of public
domain sequence databases are now readily accessible on the Internet to facilitate protein
identification. Among these are Protein Prospector (http://prospector.ucsf/edu), PROWL

(http://prowl.rockefeller.edu), and the Mascot Search Engine (Matrix Science Ltd., London, UK,
www.matrixscience.com).
[00108] In certain embodiments, MS data and information obtained from that data are
compared to a database consisting of data and information relating to proteins. For example, the
database may consist of sequences of nucleotides or amino acids. The database may consist of
nucleotide or amino acid sequences of expressed sequence tags (ESTs). Alternatively, the
database may consist of sequences of genes at the nucleotide or amino acid level. The database
can include, without limitation, a collection of nucleotide sequences, amino acid sequences, or
translations of nucleotide sequences included in the genome of any species.
[00109] A database of information relating to proteins, e.g., sequences of nucleotides or amino
acids, is typically analyzed via a computer program or a search algorithm which is optionally
performed by a computer. Information from sequence databases is searched for best matches
with data and information obtained from the methods of the present invention (see e.g., Yates
(1998) J. Mass Spec. 33:1-19; Yates et al., U.S. Pat. No. 5,538,897; Yates et al., U.S. Pat. No.
6,017,693). Any appropriate algorithm or computer program useful for searching a database can
be used. Search algorithms and databases are constantly updated, and such updated versions will
be used in accordance with the present invention. Examples of programs or databases can be
found on the World Wide Web (WWW) at http://base-peak.wiley.com/, http://mac-mann6.embl-
heidelberg.de/MassSpec/Software.html, http://www.mann.embl-
heidelberg.de/Services/PeptideSearch/PeptideSearchIn-tro.html,
ftp://ftp.ebi.ac.uk/pub/databases/, and http://donatello.ucsf.ed- u. U.S. Pat. Nos. 5,632,041;
5,964,860; 5,706,498; and 5,701,256 also describe algorithms or methods for sequence
comparison. Other examples of databases include the Genpept database, the GenBank database
(described in Burks et al. (1990) Methods in Enzymology 183: 3-22, EMBL data library
(described in Kahn et al. (1990) Methods in Enzymology 183:23-31, the Protein Sequence
Database (described in Barker et al. (1990) Methods in Enzymology 183: 31-49, SWISS-PROT
(described in Bairoch et al. (1993) Nucleic Acids Res. 21: 3093-3096, and PIR-International
(described in (1993) Protein Seg. Data Anal. 5:67-192).
[00110] In some embodiments, the amino acid sequence of the epitope recognized by the
small epitope antibody (termed "epitope sequence") is used in conjunction with the database
search information and search algorithms to enhance identification of proteins. For example,
prior to or following analysis of MS data and information obtained from that data by comparison

to a database consisting of data and information relating to proteins, the amino acid sequence of
the epitope may be used to refine the data analysis. For example, a preliminary list of protein
identity candidates may be refined by excluding members from that list that do not include the
epitope sequence. In another example, a database may be compiled or theoretically generated of
all proteins comprising a given epitope sequence. This database may then be subjected to further
analysis using data analysis methods known in the art.
[00111] A database of information relating to proteins, e.g., sequences of nucleotides or amino
acids, is typically analyzed via a computer program or a search algorithm which is optionally
performed by a computer.
[00112] In a further embodiment, novel databases are generated for comparison to mass
spectrometrically determined MS data, e.g., mass or mass spectra of cleaved protein and
polypeptide fragments. For example, a theoretical database of all polypeptide fragments
comprising an epitope recognized by a small epitope antibody is generated. This database may
be used in conjunction with any of the data analysis tools and methods described herein.
[00113] In some embodiments, the mass of a polypeptide derived from a mass spectrum is
used to query a database for those masses of proteins or predicted proteins from nucleic acid
sequences that provide the closest fit. In this manner, an unknown protein can be rapidly
identified without an amino acid sequence. In other embodiments of the invention, the masses
provided from polypeptide fragments thereof can be compared to the predicted mass spectra of a
database of proteins or predicted proteins from a nucleic acid sequences that provide the closest
fit.
[00114] Sequences or simulated cleavage fragments from the sequence database that fall
within a desired range of similar sequence homologies to sequences generated from the MS data
of parent or fragment molecules are designated "matches" or "hits." In this manner, the identity
of the proteins or fragments thereof can be rapidly determined. The investigator can customize
or vary the range of acceptable sequence homology comparison values according to each
particular analysis.
[00115] It is understood that for convenience, reference is made to protein "identity". It is
understood that the methods described herein are equally applicable to the determination of
presence or absence of a mutation (such as an amino acid substitution, transversion, insertion or
deletion), and other protein variants, such as splice variants, degradation products, and/or
differential post-translational modification (for example, variation in glycosylation level).

[00116] In some embodiments, the presence or absence of a mutation is determined by
detection of a change in m/z ratio relative to a reference m/z ratio.
[00117] In some embodiments, level (or changes in level) of post-translational modification is
determined by comparing endoglycosylase-treated sample with a reference sample (e.g., a
sample that has not been treated with endoglycosylase), whereby level of post translational
modification is determined.
Expression profiling
[00118] The methods of the invention are suitable for use in determining the levels of
expression of one or more proteins in a sample. As described above, enriched and/or purified
protein fractions can be detected and/or quantified by various methods, as described herein
and/or known in the art. In some embodiments, protein fractions are analyzed (including
quantification and/or identification) using mass spectrometry. It is understood that amount of
protein product may be determined using quantitative and/or qualitative methods. Determining
amount of product includes determining whether product is present or absent. Thus, an
expression profile can includes information about presence or absence of one or more protein
sequences of interest. "Absent" or "absence" of product, and "lack of detection of product" as
used herein includes insignificant, or de minimus levels.
[00119] In some embodiments, the amounts of proteins in two or more samples are compared.
Typically, the samples have overlapping protein profiles. Using the methods of the present
invention, the amounts of the proteins can be compared to determine how the profiles differ in
the nature and amount of proteins that are present. These methods are useful for identifying a
change in the nature or amount of a protein that is indicative of a disease state (e.g., a disease
biomarker, PSA, BRCA1, etc.) or treatment efficacy, or toxic effects of an agent, or presence of
a pathogen (e.g., HIV, bacterial pathogens, viral pathogens, prions, etc), etc. These methods are
also useful for discovering proteins that are associated with disease states for drug discovery
purposes, diagnostic purposes, etc. In particular, it is useful to compare the protein profiles of
samples that are from different subjects or have been subjected to different conditions or
treatments.
[00120] For example, in certain embodiments, the first sample is an untreated control sample
and the second sample has been subjected to an agent or condition. Examples of agents include,

but are not limited to: a chemotherapeutic agent, ultraviolet light, a medical device {e.g., a stent
defibrillator), an exogenous gene, and a growth factor. Those of skill in the art will recognize
that there are many ways to introduce an exogenous gene into a cell (see, e.g., Ausubel et al.,
eds., (1994), supra). In other embodiments, the first sample is a diseased sample and the second
sample is a non-diseased sample. In addition, agents can take the form of candidate drugs. For
example, the proteins in a first sample treated with a candidate drug and can be compared to a
second sample which is a negative or positive control. The influence of the candidate drug on
the amount of a protein (e.g., a protein) present in the first and second sample can be an
indication of the candidate drugs efficacy or toxicity. Those of skill in the art will appreciate that
these methods can be adapted to analyze the effects of any agent on a disease state or amount of
a disease marker present in a sample. In one embodiment, the methods are used to identify
protein(s) that are associated with treatment with an agent (such as a candidate drug). Such
proteins may be, e.g., may be associated with efficacy of the agent, and thereby serve as a proxy
for a clinical endpoint.
Biomarkers
[00121] Biomarker protein (or proteins) can be identified using the expression profiling and
characterization methods described herein. A biomarker is a protein of interest, for which the
detection, monitoring, quantitation, and/or characterization is of interest. In some embodiments,
a biomarker is correlated with a specific condition or treatment, such as a disease or condition,
treatment with a drug (including efficacy of drug treatment and/or toxicity), treatment with a
medical device, and the like. In other embodiments, a biomarker is expressed in a tissue or cell
of interest (e.g., a tumor, an organ, etc.). As used herein, a biomarker protein may be a newly
identified protein or protein variant (such as a mutant protein, splice variant, a protein with
altered post-translational modification, etc.). In other embodiments, a biomarker is a tissue-
specific marker.
[00122] A biomarker can be used as a surrogate marker in diagnosis (including staging of
disease, in some embodiments), prognosis, evaluation and/or selection of therapies, monitoring
of disease progression, monitoring of efficacy of treatment, and/or treatment of disease. In some
embodiments, a biomarker is detected and/or quantified by any method known in the art, and/or

any method described herein, whereby expression of the biomarker (presence or absence of
biomarker, or differential expression of the biomarker) indicates the presence of a disorder or a
condition. In one embodiment, increase in level of a biomarker indicates the presence of a
disorder or condition. In another embodiment, decrease in level of a biomarker indicates the
presence of a disorder or condition. In some embodiments, biomarker expression is used to
evaluate the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical
trials, or to monitor the treatment of an individual subject. In some embodiments, the biomarker
serves as a proxy for a desired clinical endpoint. In other embodiments, the biomarker is
correlated with efficacy of an agent, as when biomarker expression is predictive of efficacy of
treatment with an agent (such as a drug). In one embodiment, increase in le\d of a biomarker
indicates efficacy or progress of treatment. In another embodiment, decrease in level of a
biomarker indicates efficacy or progress of treatment.
[00123] The biomarker can be used as a marker for toxicity, including, toxicity of an agent
such as a pharmaceutical, new drug candidate, cosmetic, or other chemical. In some
embodiments, detection of biomarker expression may also be used to monitor for environmental
exposure to an agent, such as a toxin or a pathogen. In one embodiment, increase in level of a
biomarker indicates toxicity or exposure to an agent. In another embodiment, decrease in level
of a biomarker indicates toxicity or exposure to an agent.
[00124] A biomarker can be used to screen a plurality or library of molecules and compounds
for specific binding affinity, including, for example, DNA molecules, RNA molecules, peptide
nucleic acids, polypeptides, mimetics, small molecules, and the like. In one embodiment, an
assay involves providing a plurality of molecules and/or compounds, combining a biomarker
with the plurality of molecules and/or compounds under conditions to allow specific binding, and
detecting specific binding to identify at least one molecule or compound which specifically binds
the biomarker.
[00125] Similarly, one or more biomarkers, or portions thereof, can be used to screen a
plurality or library of molecules and/or compounds in any of a variety of screening assays to
identify a ligand. Methods for screening are well known in the art. The assay can be used to
screen, for example, aptamers, DNA molecules, RNA molecules, peptide nucleic acids,
polypeptides, mimetics, proteins, antibodies, agonists, antagonists, immunoglobulins, inhibitors,
small molecules, pharmaceutical agents or drug compounds and the like, which specifically bind
the biomarker.

[00126] In another embodiment, one or more antibodies comprising an antigen binding site
that specifically binds a biomarker can be used for the detection of the biomarker (including in
vitro and in vivo detection). In another example, an antibody that specifically binds a biomarker
can be linked to an in vivo imaging reagent, such as, for example, 3H, mIn,1251, (see Esteban et
al. (1987) J. Nucl. Med 28.861-870), and used in an in vivo imaging application.
Compositions and kits
[00127] The invention also provides compositions for use in any of the methods described
herein, such as methods for reducing the complexity of a sample, methods for purifying and/or
enriching a protein or a plurality of proteins, methods for isolating and/or separating a protein or
a plurality of proteins, and/or methods for preparing a protein, a plurality of proteins, or a protein
fraction for characterization, methods for preparing a protein, a plurality of proteins, or a protein
fraction for mass spectrometry analysis, methods for identifying a protein or a plurality of
proteins, methods for discovering one or more new proteins, methods for detection and/or
quantification of a protein or a plurality of proteins in a sample, methods for characterizing a one
or more proteins, methods for expression profiling, methods for identifying protein degradation
products, methods for identifying change(s) in post-translational modification, and/or methods
for determining the mass, the amount and/or identity of protein(s) in a sample. The compositions
used in the methods of the invention may comprise one or more (such as about 2, about 3, about
4, about 5, about 7, about 10, about 15 or more) small epitope antibody(ies). In some
embodiments, the composition comprises less than about 100, about 95, about 90, about 85,
about 80, about 75, about 70, about 65, about 60, about 55, about 50, about 45, about 40, about
35, about 30, about 25, about 20, about 15, about 10, about 5, or fewer small epitope antibodies.
In some embodiments, the composition comprises at least about 20, about 30, about 40, about 50,
about 75, about 100, about 125, about 150, about 200, about 300, about 400, about 500, about
1000, or more small epitope antibodies. In some embodiments, the composition comprises about
20, about 30, about 40, about 50, about 75, about 100, about 125, about 150, about 200, about
300, about 400, about 500, about 1000, or more small epitope antibodies. In some embodiments,
the composition comprises at least about any of 10,20,30,40,50,75,100,125,150,200, 300,
400 or 500 small epitope antibodies, with an upper limit of about any of 20, 30,40, 50,75,100,
125,150,200,300,400,500 or 1000 small epitope antibodies.

[00128] The invention also provides kits for use in the instant methods. Kits of the invention
include one or more containers comprising one or more small epitope antibody(ies). In some
embodiments, the kit comprises less than about 100, about 95, about 90, about 85, about 80,
about 75, about 70, about 65, about 60, about 55, about 50, about 45, about 40, about 35, about
30, about 25, about 20, about 15, about 10, about 5, or fewer small epitope antibodies. In some
embodiments, the kit comprises at least about 20, about 30, about 40, about 50, about 75, about
100, about 125, about 150, about 200, about 300, about 400, about 500, about 1000, or more
small epitope antibodies. In some embodiments, the kit comprises about 20, about 30, about 40,
about 50, about 75, about 100, about 125, about 150, about 200, about 300, about 400, about 500,
about 1000, or more small epitope antibodies. In some embodiments, the kit comprises at least
about any of 10,20, 30,40,50, 75,100,125,150,200,300,400 or 500 small epitope antibodies
with an upper limit of about any of 20, 30,40, 50, 75,100, 125,150, 200, 300,400, 500 or 1000
small epitope antibodies. In some embodiments, the kit further comprises instructions for use in
accordance with any of the methods of the invention described herein, such as methods for
reducing the complexity of a sample, methods for purifying and/or enriching a protein or a
plurality of proteins, methods for isolating and/or separating a protein or a plurality of proteins,
and/or methods for preparing a protein, a plurality of proteins, or a protein fraction for
characterization, methods for preparing a protein, a plurality of proteins, or a protein fraction for
mass spectrometry analysis, methods for identifying a protein or a plurality of proteins, methods
for discovering one or more new proteins, methods for detection and/or quantification of a
protein or a plurality of proteins in a sample, methods for characterizing a one or more proteins,
methods for expression profiling,, methods for identifying protein degradation products, methods
for identifying change(s) in post-translational modification, and/or methods for determining the
mass, the amount and/or identity of protein(s) in a sample.
[00129] The invention also comprises any of the protein "products" (e.g., proteins enriched,
purified, isolated, prepared, separated, and/or fractionated using any of the methods of the
invention described herein. The invention also provides proteins or protein fragments
characterized (e.g., detected, identified, quantified, etc.) using any of the methods of the
invention described herein and compositions comprising such products. Such proteins comprise
a cognate small epitope that is recognized by the small epitope antibody (to which the protein
was bound). The invention also provides small epitope antibody-protein complexes or small
epitope antibody-protein fragment complexes (for methods wherein the proteins are contacted

with a protein cleaving agent prior to contact with the small epitope antibody(ies)) prepared or
isolated by any of the methods described herein. The invention also provides proteins or protein
fragments separated from a small epitope antibody-protein complex or small epitope antibody-
protein fragment complex according to any of the methods described herein, and/or protein
fragments prepared from proteins after separation from small epitope antibody(ies).
[00130] In another aspect, the invention includes compositions and/or kits comprising
intermediates (such as complexes, e.g., small epitope antibody-protein complex) produced by
any aspect of the methods of the invention. The invention also provides incubation mixtures
comprising protein-containing samples and small epitope antibodies and/or small epitope
antibody-protein complexes as described herein.
[00131] The kits of this invention are in suitable packaging. Suitable packaging includes, but
is not limited to, vials, bottles, jars, flexible packaging (e.g., sealed Mylar or plastic bags), and
the like. In some embodiments, the kit comprises a container and a label or package insert(s) on
or associated with the container. The label or package insert may indicate that the small epitope
antibody(ies) are useful for any of the methods described herein, e.g., method for reducing the
complexity of a sample, or method for identifying a protein, characterizing a protein, and/or
expression profiling. Instructions may be provided for practicing any of the methods described
herein.
Components and reaction mixtures useful in the methods of the invention
Small epitope antibody
[00132] The methods of the invention use a small epitope antibody. As used herein, a "small
epitope antibody" is an antibody that binds (generally specifically binds) a small peptide epitope.
By virtue of the epitope specificity, small epitope antibodies generally recognize a multiplicity of
proteins that comprise the small epitope to which the antibody binds. Insofar as the small
epitope bound by the antibody is known, binding by a small epitope antibody provides
information relating to amino acid content and/or sequence of protein(s) bound by the small
epitope antibody. Small epitope antibodies are described, for example, in co-pending U.S. Patent
Application No. 10/687,174. Small epitope antibodies and methods of making small epitope
antibodies are further discussed herein and exemplified in the Examples.

[00133] An antibody can encompass monoclonal antibodies, polyclonal antibodies, antibody
fragments {e.g., Fab, Fab', F(ab')2, Fv, Fc, etc.), chimeric antibodies, single chain (ScFv),
mutants thereof, fusion proteins comprising an antibody portion, and any other modified
configuration of the immunoglobulin molecule that comprises an antigen recognition site of the
required specificity. The antibodies may be murine, rat, human, or any other origin (including
humanized antibodies). Small epitope antibodies may be produced by a number of methods
known in the art, including, for example, production by a hybridoma, recombinant production, or
chemical synthesis.
[00134] Generally, a small epitope antibody binds a short, linear peptide epitope of 3, 4, or 5
sequential (consecutive) amino acids. Alternatively, in some embodiments, a small epitope
antibody binds a discontinuous amino acid sequence witliin a polypeptide. In some
embodiments, a small epitope antibody binds an epitope consisting of or consisting essentially of
about any of 2, 3,4, 5, 6, 7, 8, 9, or 10 amino acids. In some embodiments, a small epitope
antibody binds an epitope consisting of or consisting essentially of 2 to 10, 3 to 8, or 3 to 5
amino acids. In some embodiments, a small epitope antibody binds an epitope consisting of or
consisting essentially of less than about any of 10, 9, 8, 7, 6, 5, 4, or 3 amino acids. In some
embodiments, a population of small epitope antibodies binds epitopes consisting of or consisting
essentially of about 3 to about 5 amino acids. In some embodiments, a population of small •
epitope antibodies binds epitopes consisting of or consisting essentially of 2 to 10, 3 to 8, or 3 to
5 amino acids. In some embodiments, a population of small epitope antibodies binds epitopes
consisting of or consisting essentially of about any of 2,3,4,5,6, 7, 8,9, or 10 amino acids. In
some embodiments, a population of small epitope antibodies binds epitopes consisting of or
consisting essentially of less than about any of 10, 9, 8, 7, 6, 5, 4, or 3 amino acids. A population
of small epitope antibodies comprises a plurality of small epitope antibodies. In one
embodiment, the plurality of small epitope antibodies binds epitopes of the same number of
amino acids. In other embodiments, the plurality of small epitope antibodies binds epitopes of a
mixture of different numbers of amino acids. In any of the embodiments described herein, an
epitope may be a sequential or discontinuous sequence within a polypeptide, as described below.
In some embodiments, one or more small epitope antibody(ies) may be comprised within a
mixture of antibodies that comprises antibodies that bind to epitopes larger that the epitopes
recognized by the one or more small epitope antibody(ies).

[00135] In some embodiments, the small epitope antibody binds an epitope consisting of or
consisting essentially of 3 sequential amino acids (termed a 3mer), four sequential amino acids
(termed a 4mer), or five sequential amino acids (termed a 5mer). In other embodiments, the
small peptide antibody binds a small "discontinuous" or "degenerate" linear peptide sequence,
such as the linear peptide sequence YCxC, wherein x represents any of the 20 natural amino
acids (a degenerate linear sequence). In other embodiments, the small epitope antibody binds a
non-sequential (discontinuous) sequence within a polypeptide based on conformational
proximity of amino acids within the polypeptide to form the epitope (for example, a
conformational epitope formed by proximity of amino acid residues due to secondary structure
within a folded polypeptide). In still other embodiments, the small epitope antibody may bind an
epitope consisting of an amino acid sequence that is predicted to be antigenic, using methods
well known in the art for predicting antigenicity. Antibodies that bind small linear peptide
epitopes have been previously described, as shown in Table 2, below. In some embodiments,
the same antibody may bind a sequential sequence on one or more proteins and a discontinuous
sequence on one or more proteins.
[00136] Small epitope antibodies generally recognize a multiplicity of proteins that comprise
the small epitope to which the antibody binds. In some embodiments, the small epitope antibody
binds to an epitope present one or more times in about any of 0.1%, 0.5%, 1,2%, 3%, 4%, 5%,
10%, or more of proteins in a sample. In still other embodiments, the small epitope antibody
binds to an epitope present one or more times in about 0.1 % to 1 % of proteins in a sample. In
still other embodiments, the small epitope antibody binds to an epitope present one or more times
in approximately 1-5% of proteins in a sample. In still other embodiments, the small epitope
antibody binds to an epitope present one or more times in about 0.1% to 1 % of proteins in a
sample, wherein the small antibody epitope binds to a linear peptide epitope consisting of or
consisting essentially of 3 amino acids, 4 amino acids or 5 amino acids. In still other
embodiments, the small epitope antibody binds to an epitope present one or more times in about
1-5% of proteins in a sample, wherein the small antibody epitope binds to a linear peptide
epitope consisting of or consisting essentially of 3 amino acids, 4 amino acids or 5 amino acids.
In still other embodiments, the small epitope antibody binds to an epitope present one or more
times in about 5-7% or about 5-10% of proteins in a sample, wherein the small antibody epitope
binds to a linear peptide epitope consisting or consisting essentially of 3 amino acids, 4 amino
acids or 5 amino acids. In some embodiments, a plurality of small epitope antibodies

collectively bind to one or more epitopes present one or more times in about any of at least about
any of 0.1%, 0.5%, 1, 2%, 3%, 4%, 5%, 10%, or more of proteins in a sample. In some
embodiments, a plurality of small epitope antibodies binds to an epitope present one or more
times in about any of 0.1 to 1%, 1 to 5%, 5 to 7%, or 5 to 10% of proteins in a sample. Methods
for empirically assessing frequency of an epitope in a sample include: assessment using
biochemical approaches, such as binding of an antibody followed by analysis using, for example,
2D gels or mass spectrometry, and sequence based analysis, using, for example, amino acid or
nucleic acid sequence databases such as GenBank and SwissProt. Suitable databases are further
described herein.
[00137] In some embodiments, the epitope recognized by a small epitope antibody further
comprises a C-terminal amino acid recognized as a cleavage site by an endopeptidase. For
example, the epitope could comprise a C-terminal arginine and/or a lysine, which are each
recognized by trypsin as a cleavage site. Following endopeptidase digestion of a protein
mixture, the amino acid recognized by the endopeptidase is generally found at the C-terminus of
the target peptide; accordingly, an epitope encompassing such an amino acid will also be found
at the C-terminus of a target polypeptide, which may increase immunogeniciry, and increase the
binding energy associated with antibody-target peptide binding.
[00138] In some embodiments, the small epitope antibody binds its cognate epitope with an
affinity of a binding reaction of at least about 10"7 M, at least about 10"8 M, or at least about 10"9
M, or lower. Binding affinity may be measured by well-known methods in the art, including, for
example, by surface plasmon resonance (Malmborg and Borrebaeck (1995) J. Immunol. Methods
183(1):7-13; Lofas and Johnsson (1990) J. Chem. Soc. Chem. Commun. 1526. In some
embodiments, a binding interaction will discriminate over adventitious binding interactions in
the reaction by at least two-fold, at least five-fold, at least 10- to at least 100-fold or more.
[00139] One, or a population of more than one (such as about two, about three, about four,
about five, about ten, about twenty, about one hundred or more) small epitope antibod(ies) may
be used in the methods of the invention. Thus, in some embodiments, the methods comprise use
of at least one small epitope antibody. In other embodiments, the methods comprise use of at
least two small epitope antibodies. In still other embodiments, at least about 5, about 10, about
20, about 30, about 40, about 50, about 60, about 75, about 100, about 125, about 150, about 200
about, 300, about 400, about 500, about 750, about 1000, or more small epitope antibodies are
used in the methods of the invention. In some embodiments, the sample is contacted with less

than about 100, about 95, about 90, about 85, about 80, about 75, about 70, about 65, about 60,
about 55, about 50, about 45, about 40, about 35, about 30, about 25, about 20, about 15, about
10, about 5, or fewer small epitope antibodies. In some embodiments, the sample is contacted
with at least about 20, about 30, about 40, about 50, about 75, about 100, about 500, about 1000,
or more small epitope antibodies. In some embodiments, a sample is contacted with at least
about any of 5,10,20,30,40,50,60,75,100,125,150,200,300,400,500, or 750 small epitope
antibodies, with an upper limit of about any of 10, 20, 30, 40, 50, 75,100, 125,150,200,300,
400, 500, 750, or 1000 small epitope antibodies. It is understood that mixture of small epitope
antibodies and other protein binding agents (such as antibodies that are not small epitope
antibodies) may be used.
[00140] It is understood that the identity (sequence) of the epitope(s) to which the small
epitope antibody(ies) may be used in combination with any of the methods described herein to,
e.g., identify proteins. In some embodiments, the small epitope identity is known. In other
embodiments, the identity of the epitope is predictable using methods known in the art.
[00141] As discussed herein, antibodies may be contacted with the sample one at a time or in
groups of two or more antibodies. In some embodiments, contacting is serial (sequential or
iterative), e.g., a single antibody or group of antibodies is contacted with the sample and
separated, and a second antibody or group of antibodies is contacted with the sample and
separated. In other embodiments, contacting is in parallel, e.g., a group of antibodies is
contacted with the sample and separated. It is appreciated that contacting may be both in parallel
and serial, as when different groups of antibodies are serially contacted with a sample. Groups
of antibodies may be overlapping in composition (e.g., group 1= antibody A, B, C, D,; group 2 =
antibody B, C, D, E, etc.).
[00142] It is evident that the number of small epitope antibodies that are useful in the methods
of reducing complexity of a sample depends on the use, application, and/or subsequent analysis
contemplated for the protein prepared using one or more small epitope antibodies. In some
applications, such as detection of a protein(s) comprising a cognate epitope recognized by a
small antibody, a single small epitope antibody (or, in some embodiments, a small number of
small epitope antibodies) may be used to prepare, purify and/or enrich a fraction of protein(s)
that comprises the protein for which subsequent detection (or other analysis) is desired. Then,
the separated protein can be subjected to further analysis. In other embodiments, use of a set of
two or more small epitope antibodies may be useful. For example, in applications such as

protein discovery and, in some embodiments, expression profiling, it may be desirable to use a
multiplicity of small epitope antibodies, such that a large multiplicity of proteins (such as
essentially all protein in the starting sample) will be enriched and/or purified. Use of a
multiplicity of small epitope antibodies is also useful in application in which purification and/or
enrichment of new protein(s) or protein forms is desired (for example, because information
regarding target protein sequence is unknown). As an illustrative example relating to
embodiments involving fractionation of a multiplicity of proteins in a sample (such as essentially
all proteins in a sample) shown, knowledge of the sequence and/or the length of the cognate
amino acid epitope recognized by the small epitope antibody permits an estimate regarding the
expected frequency of the epitope(s) recognized by the small epitope antibody(ies) within the
protein sample. As shown in Table 1, there are a total of 8,000 (203), 160,000 (204) and
3,200,000 (205) random combinations for 3mer, 4mer and 5mer linear peptide sequences,
respectively. Considering 500 amino acids as an average length of protein, the probability that it
is detected by a single anti-3mer antibody is 0.0625, the probability increases to about 1 when 15
anti-3mer antibodies are used, and the probability increases to 6.25 when 100 anti-3mer
antibodies are used. Such calculations are routine. A small epitope antibody may also recognize
a degenerate linear epitope, for example a short peptide, such as YCxC, where x represents two
or more of the 20 standard amino acids.


[00143] Thus, it is understood that the number of small epitope antibodies useful in the
methods of the invention depends on various factors, including, for example, the use, application,
and/or subsequent analysis contemplated for the protein fraction bound by the small epitope
antibody(ies), complexity of the sample (in terms of number of expected or estimated or
previously determined proteins, including protein variants such as splice variants), average size
of the proteins in the sample, frequency that the cognate epitope is present or predicted to be
present in a sample, binding affinity and/or specificity of the small epitope antibody(ies);
knowledge of target protein(s), and stability of the small epitope antibody. Such factors are well
known in the art and are further discussed herein.
[00144] Antibodies that bind small linear peptide epitopes have been previously described, as
shown in Table. 2.


[00145] Methods of making small epitope antibodies are known in the art. In another aspect,
and as exemplified in the Examples, small epitope antibodies (e.g., human, humanized, mouse,
chimeric) may be made by using immunogens which express one or more small peptide epitopes,
such as a small linear peptide epitope consisting of or consisting essentially of 3,4, or 5 amino
acids.
[00146] Immunogens may be produced, for example, by chemical synthesis. Methods for
synthesizing polypeptides are well known in the art. In some embodiments, the polypeptide
immunogen is synthesized with a terminal cysteine to facilitate coupling to either KLH or BSA,

as is known in the art. The terminal cysteine can be incorporated at the amino terminus of the
polypeptide (which may minimize steric effects during immunization and screening), or at the
carboxy terminus. In other embodiments, the polypeptide immunogen is synthesized as a
multiple antigen polypeptide, or MAP.
[00147] The route and schedule of immunization of the host animal are generally in keeping
with established and conventional techniques for antibody stimulation and production, as further
described herein. General techniques for production of human and mouse antibodies are known
in the art and are described herein. Typically, the host animal is inoculated intraperitoneally with
an amount of immunogen, including as described herein.-
[00148] Hybridomas can be prepared from the lymphocytes and immortalized myeloma cells
using the general somatic cell hybridization technique of Kohler, B. and Milstein, C. (1975)
Nature 256:495-497 or as modified by Buck, D. W. et al, (1982) In Vitro, 18:377-381.
Available myeloma lines, including but not limited to X63-Ag8.653 and those from the Salk
Institute, Cell Distribution Center, San Diego, Calif., USA, may be used in the hybridization.
Generally, the technique involves fusing myeloma cells and lymphoid cells using a fusogen such
as polyethylene glycol, or by electrical means well known to those skilled in the art. After the
fusion, the cells are separated from the fusion medium and grown in a selective growth medium,
such as hypoxanthine-aminopterin-thymidine (HAT) medium, to eliminate unhybridized parent
cells. Any of the media described herein, supplemented with or without serum, can be used for
culturing hybridomas that secrete monoclonal antibodies. As another alternative to the cell'
fusion technique, EBV immortalized B cells may be used to produce the small epitope antibodies
of the subject invention. The hybridomas are expanded and subcloned, if desired, and
supernatants are assayed for anti-immunogen activity by conventional immunoassay procedures
{e.g., radioimmunoassay, enzyme immunoassay, or fluorescence immunoassay).
[00149] Hybridomas or progeny cells of the parent hybridomas that produce small epitope
antibodies (such as monoclonal antibodies) may be used as source of antibodies or derivatives
thereof, or a portion thereof.
[00150] Hybridomas that produce such antibodies may be grown in vitro or in vivo using
known procedures. The monoclonal antibodies may be isolated from the culture media or body
fluids, by conventional immunoglobulin purification procedures such as ammonium sulfate
precipitation, gel electrophoresis, dialysis, chromatography, and ultrafiltration, if desired.
Undesired activity if present, can be removed, for example, by running the preparation over

adsorbents made of the immunogen attached to a solid phase and eluting or releasing the desired
antibodies off the immunogen. Immunization of a host animal with a human or other species of
small epitope receptor, or a fragment of the human or other species of small epitope receptor, or
a human or other species of small epitope receptor or a fragment containing the target amino acid
sequence conjugated to a protein that is immunogenic in the species to be immunized, e.g.,
keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, or soybean trypsin inhibitor
using a bifunctional or derivatizing agent, for example maleimidobenzoyl sulfosuccinimide ester
(conjugation through cysteine residues), N-hydroxysuccinimide (through lysine residues),
glutaradehyde, succinic anhydride, SOC12, or R1N=C=NR, where R and Rl are different alkyl
groups can yield a population of antibodies (e.g., monoclonal antibodies).
[00151] If desired, the small epitope antibody (monoclonal or polyclonal) of interest may be
sequenced and the polynucleotide sequence may then be cloned into a vector for expression or
propagation. The sequence encoding the antibody of interest may be maintained in vector in a
host cell and the host cell can then be expanded and frozen for future use. In an alternative, the
polynucleotide sequence may be used for genetic manipulation to "humanize" the antibody or to
improve the affinity, or other characteristics of the antibody. For example, the constant-region
may be engineered to more resemble human constant regions to avoid immune response if the
antibody is used in clinical trials and treatments in humans. It may be desirable to genetically
manipulate the antibody sequence to obtain greater affinity to the small epitope and/or greater
and/or altered specificity to the small epitope. It will be apparent to one of skill in the art that
one or more polynucleotide changes can be made to the small epitope antibody and still maintain
its binding ability to the small epitope.
[00152] A number of "humanized" antibody molecules comprising an antigen-binding site
derived from a non-human immunoglobulin have been described, including chimeric antibodies
having rodent or modified rodent V regions and their associated complementarity determining
regions (CDRs) fused to human constant domains. See, for example, Winter et al. Nature
349:293-299 (1991), Lobuglio et al.(l989)Proc. Nat. Acad. Sci. USA 86:4220-4224, Shaw et al.
(1987) J Immunol. 138:4534-4538, and Brown et al. (1987) Cancer Res. 47:3577-3583. Other
references describe rodent CDRs grafted into a human supporting framework region (FR) prior
to fusion with an appropriate human antibody constant domain. See, for example, Riechmann et
al. (1988) Nature 332:323-327, Verhoeyen et al. Science (1988) 239:1534-1536, and Jones etal.
Nature (1986) 321:522-525. Another reference describes rodent CDRs supported by

recombinantly veneered rodent framework regions. See, for example, European Patent
Publication No. 519,596. These "humanized" molecules are designed to minimize unwanted
immunological response toward rodent anti-human antibody molecules which limits the duration
and effectiveness of therapeutic applications of those moieties in human recipients. For
example, the antibody constant region can be engineered such that it is immunologically inert
{e.g., does not trigger complement lysis). See, e.g. PCT/GB99/01441; UK Patent Application
No. 9809951.8. There are four general steps to humanize a monoclonal antibody. These are:
(1) determining the nucleotide and predicted amino acid sequence of the starting antibody light
and heavy variable domains (2) designing the humanized antibody, i.e., deciding which antibody
framework region to use during the humanizing process (3) the actual humanizing
methodologies/techniques and (4) the transfection and expression of the humanized antibody.
See, for example, U.S. Patent Nos. 4,816,567; 5,807,715; 5,866,692; 6,331,415; 5,530,101;
5,693,761; 5,693,762; 5,585,089; and 6,180,370. Other methods of humanizing antibodies that
may also be utilized are disclosed by Daugherty et ah, Nucl. Acids Res. (1991) 19:2471-2476 and
in U.S. Patent Nos. 6,180,377; 6,054,297; 5,997,867; 5,866,692; 6,210,671; 6,350,861; andPCT
Publication No. WO 01/27160.
[00153] In yet another alternative, fully human antibodies may be obtained by using
commercially available mice that have been engineered to express specific human
immunoglobulin proteins. Transgenic animals that are designed to produce a more desirable
(e.g., fully human antibodies) or more robust immune response may also be used for generation
of humanized or human antibodies. Examples of such technology are Xenomouse ™ from
Abgenix, Inc. (Fremont, CA) and HuMAb-Mouse® and TC Mouse™ from Medarex, Inc.
(Princeton, NJ).
[00154] In an alternative, antibodies may be made recombinantly and expressed using any
method known in the art. In another alternative, antibodies may be made recombinantly by
phage display technology. See, for example, U.S. Patent Nos. 5,565,332; 5,580,717; 5,733,743
and 6,265,150; and Winter etal.( 1994) Annu. Rev. Immunol. 12:433-455.
[00155] Alternatively, the phage display technology (McCafferty et al.(l990) Nature
348:552-553) can be used to produce human antibodies and antibody fragments in vitro, from
immunoglobulin variable (V) domain gene repertoires from unimmunized donors. For example,
existing antibody phage display libraries may be panned in parallel against a large collection of
synthetic polypeptides. According to this technique, antibody V domain genes are cloned in-

frame into either a major or minor coat protein gene of a filamentous bacteriophage, such as Ml 3
or fd, and displayed as functional antibody fragments on the surface of the phage particle.
Because the filamentous particle contains a single-stranded DNA copy of the phage genome,
selections based on the functional properties of the antibody also result in selection of the gene
encoding the antibody exhibiting those properties. Thus, the phage mimics some of the
properties of the B cell. Phage display can be performed in a variety of formats; for review see,
e.g., Johnson, Kevin S. and Chiswell, David J., Current Opinion in Structural Biology (1993) 3,
564-571. Several sources of V-gene segments can be used for phage display. Clackson et al.,
Nature (1991) 352:624-628 isolated a diverse array of anti-oxazolone antibodies from a small
random combinatorial library of V genes derived from the spleens of immunized mice. A
repertoire of V genes from unimmunized human donors can be constructed and antibodies to a
diverse array of antigens (including self-antigens) can be isolated essentially following the
techniques described by Mark et al (1991) J. Mol. Biol. 222:581-597, or Griffith et al (1993)
EMBO J. 12:725-734. In a natural immune response, antibody genes accumulate mutations at a
high rate (somatic hypermutation). Some of the changes introduced will confer higher affinity,
and B cells displaying high-affinity surface immunoglobulin are preferentially replicated and
differentiated during subsequent antigen challenge. This natural process can be mimicked by
employing the technique known as "chain shuffling." Marks, et al. (1992) Bio/Technol. 10:779-
783. In this method, the affinity of "primary" human antibodies obtained by phage display can
be improved by sequentially replacing the heavy and light chain V region genes with repertoires
of naturally occurring variants (repertoires) of V domain genes obtained from unimmunized
donors. This technique allows the production of antibodies and antibody fragments with
affinities in the pM-nM range. A strategy for making very large phage antibody repertoires (also
known as "the mother-of-all libraries") has been described by Waterhouse et al. (1993) Nucl.
Acids Res. 21:2265-2266. Gene shuffling can also be used to derive human antibodies from
rodent antibodies, where the human antibody has similar affinities and specificities to the starting
rodent antibody. According to this method, which is also referred to as "epitope imprinting", the
heavy or light chain V domain gene of rodent antibodies obtained by phage display technique is
replaced with a repertoire of human V domain genes, creating rodent-human chimeras. Selection
on antigen results in isolation of human variable regions capable of restoring a functional
antigen-binding site, i.e., the epitope governs (imprints) the choice of partner; When the process
is repeated in order to replace the remaining rodent V domain, a human antibody is obtained (see

PCT Publication No. WO 9306213, published April 1,1993). Unlike traditional humanization of
rodent antibodies by CDR grafting, this technique provides completely human antibodies, which
have no framework or CDR residues of rodent origin. It is apparent that although the above
discussion pertains to humanized antibodies, the general principles discussed are applicable to
customizing antibodies for use, for example, in dogs, cats, primates, equines and bovines.
[00156] Antibodies may be made recombinantly by first isolating the antibodies made from
host animals, obtaining the gene sequence, and using the gene sequence to express the antibody
recombinantly in host cells (e.g., CHO cells). Another method that may be employed is to
express the antibody sequence in plants (e.g., tobacco), transgenic milk, or in other organisms.
Methods for expressing antibodies recombinantly in plants or milk have been disclosed. See, for
example, Peeters et ah (2001) Vaccine 19:2756; Lonberg, N. and D. Huszar (1995) Int. Rev.
Immunol 13:65; and Pollock et ah (1999) J Immunol Methods 231:147. Methods for making
derivatives of antibodies, e.g., humanized, single chain, etc. are known in the art.
[00157] Immunoassays and flow cytometry sorting techniques such as fluorescence activated
cell sorting (FACS) can also be employed to isolate antibodies that are specific for the desired
small epitope.
[00158] The antibodies can be bound to many different carriers. Carriers can:be active and/or
inert. Examples of well-known carriers include polypropylene, polystyrene, polyethylene,
dextran, nylon, amylases, glass, natural and modified celluloses, polyacrylamides, agaroses and
magnetite. The nature of the carrier can be either soluble or insoluble for purposes of the
invention. Those skilled in the art will know of other suitable carriers for binding antibodies, or
will be able to ascertain such, using routine experimentation.
[00159] DNA encoding small epitope antibodies may be isolated and sequenced, as is known
in the art. Generally, the monoclonal antibody is readily isolated and sequenced using
conventional procedures (e.g., by using oligonucleotide probes that are capable of binding
specifically to genes encoding the heavy and light chains of the monoclonal antibodies). The
hybridoma cells serve as a preferred source of such cDNA. Once isolated, the DNA may be
placed into expression vectors, which are then transfected into host cells such as E. coli cells,
simian COS cells, Chinese hamster ovary (CHO) cells, or myeloma cells that do not otherwise
produce immunoglobulin protein, to obtain the synthesis of monoclonal antibodies in the
recombinant host cells. The DNA also may be modified, for example, by substituting the coding
sequence for human heavy and light chain constant domains in place of the homologous murine

sequences, Morrison etal (1984) Proc. Nat. Acad. Sci. 81: 6851, or by covalently joining to the
immunoglobulin coding sequence all or part of the coding sequence for a non-immunoglobulin
polypeptide. In that manner, "chimeric" or "hybrid" antibodies are prepared that have the
binding specificity of a small epitope antibody (such as a monoclonal antibody) herein.
[00160] Small epitope antibodies may be characterized using methods well-known in the art,
some of which are described in the Examples. For example, one method is to identify the
epitope to which it binds, including solving the crystal structure of an antibody-antigen complex,
competition assays, gene fragment expression assays, and synthetic polypeptide-based assays, as
described, for example, in Chapter 11 of Harlow and Lane, Using Antibodies, a Laboratory
Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1999. In an
additional example, epitope mapping can be used to determine the sequence to which a small
epitope antibody binds. Epitope mapping is commercially available from various sources, for
example, Pepscan Systems (Edelhertweg 15, 8219 PH Lelystad, The Netherlands). Polypeptides
of varying lengths (e.g., at least 4-6 amino acids long) can be isolated or synthesized (e.g.,
recombinantly) and used for binding assays with an anti-small epitope antibody. In another
example, the epitope to which the small epitope antibody binds can be determined in a
systematic screening by using overlapping polypeptides derived from the small epitope
extracellular sequence and determining binding by the small epitope antibody. Certain epitopes
can also be identified by using large libraries of random polypeptide sequences displayed on the
surface of phage particles (phage libraries), as is well known in the art.
[00161] Yet another method which can be used to characterize an anti-small epitope antibody
is to use competition assays with other antibodies known to bind to the same antigen, i.e., to
determine if the anti-small epitope antibody binds to the same epitope as other antibodies.
Competition assays are well known to those of skill in the art.
[00162] The small epitope antibodies useful in this invention may be linked to a labeling agent
(alternatively termed "label") such as a fluorescent molecule (such as a hapten or fluorescent
bead), a binding partner, a solid support, or other agents to facilitate separation that are known in
the art. Such agents are further described herein.
[00163] In some embodiments, one or more of the following considerations are used in the
design of small epitope antibodies (whether designed to be used singly or in a population) that
result in an epitope frequency with sufficient redundancy to yield optimal coverage of the
proteins present in a sample. In one embodiment, a group of small epitope antibodies designed

according to one or more of the following considerations is capable of binding to cognate
epitopes on at least about any of 10, 20, 30,40, 50, 60, 70, 80, 90, or 95% of the proteins in a
sample.
• Epitope size: Small epitope antibodies that are small enough to occur frequently in the
proteome but large enough to confer sufficient binding energy when recognized by their
cognate epitope. In some embodiments, the epitope size recognized by each antibody is
3,4, or 5'amino acids.
• Epitope abundance: In an embodiment, optimal epitope abundance enables each small
epitope antibody to bind to about 100 to about 150 serum-derived polypeptides of about
20 to about 100 amino acids in length. This abundance level matches the resolving
power of most mass spectrometers without requiring MS-MS and collision-induced
dissociation (CID). Epitopes of the appropriate abundance are preferable for
achievement of the desired MS resolution and sensitivity.
• Sampling redundancy: In some embodiments, a sufficiently large collection of small
epitope antibodies is used to permit binding to about 3 to about 5 epitopes per protein per
proteome of interest. This design feature provides for sampling redundancy to
accommodate the variability expected in both expression levels for different proteins and
binding efficiency for each antibody in the collection.
• Affinity: In some embodiments, the tightness of binding between small epitope
antibodies and their epitopes affects the sensitivity of protein profiling. In some
embodiments, each antibody in a collection binds with high enough affinity to ensure
that sufficient analyte is captured for MS analysis.
• Frequency of Binding: In some embodiments, frequency of binding of small epitope
antibodies is high so that peptides present within each bound peptide fraction contain a
common epitope. This provides sampling redundancy and facilitates bioinformatic
determination of peptide identity.
Contacting the sample with a small epitope antibody and separation of protein from a
protein-antibody complex
[00164] Methods and conditions for contacting an antibody with a protein in a sample are well
known in the art. Antibodies may be contacted with the sample one at a time or in groups of two

or more antibodies). In some embodiments, contacting is serial (sequential, or iterative), e.g., a
single antibody or group of antibodies is contacted with the sample; separated; and a second
antibody or group of antibodies is contacted with the sample, and separated, and so on). In other
embodiments, contacting is in parallel, e.g., a group of antibodies is contacted with the sample,
and separated. It is appreciated that contacting may be both in parallel and serial, as when
different groups of antibodies are serially contacted with a sample. Groups of antibodies may be
overlapping in composition (e.g., group 1= antibody A, B, C, D; group 2 = antibody B, C, D, E,
etc.) or different in composition. Contacting of an antibody with protein may occur with both
antibody and protein in a liquid medium or may occur with one component (antibody or protein)
bound or associated with a solid support and the other component in a liquid medium. In one
embodiment, a liquid (e.g., aqueous) protein containing sample is contacted with a small epitope
antibody that is bound or associated with a solid support.
[00165] In some embodiments involving parallel contacting, it is desirable for small epitope
antibodies to be individually separable, for example, by linking the antibody to detectable
distinct beads, use of individually separable binding partners, immobilization of antibody in, e.g.,
different wells of a multiwell plate, use of antibody arrays, and the like. Insofar as the small
epitope bound by the antibody is known, binding by a small epitope antibody provides
information relating to amino acid content and/or sequence of protein(s) bound by the small
epitope antibody. In embodiments wherein knowledge of the cognate small epitope is desired, it
may be convenient to individually separate the small antibodies (such that the protein bound by
each small epitope antibody is kept separate). However, individual separation or separability is
not required in every embodiment. For example, small epitope antibodies may be combined in
small pools of two or more antibodies that possess overlapping antibody composition, such as (1)
antibodies ABC; (2) antibodies CDE; (3) antibodies FGH, and (4) antibodies HIJ. Following
separation of antibody-protein complexes, and separation of antibody from antibody-protein
complexes, information regarding presence or absence of a particular small epitope may be
inferred based on membership in a particular group.
[00166] To facilitate separation of the antibody-protein complex from unbound protein in the
sample, the antibody may be linked to an agent that facilitates separation, such as a binding
partner (e.g., biotin, oligonucleotide, aptamer), a solid support (such as a bead or matrix,
including a microarray or multiwell plate); or any other agent known in the art. Linking may be
covalent or noncovalent, and may be direct or indirect. Methods for linking antibodies to such

agents are well known in the art. See, e.g. Kennedy et al. (1976) Clin. Chim. Acta 70:1-31, and
Schurs et al. (1977) Clin. Chim. Acta 81:1-40 (describing coupling techniques, including the
glutaraldehyde method, the periodate method, the dimaleimide method, the m-maleimidobenzyl-
N-hydroxy-succinimide ester method, all of which methods are incorporated by reference
herein).
[00167J Methods for separating an antibody-protein complex from a sample are known in the
art and include use of a capture agent that binds a binding partner (e.g., avidin to capture a
biotin-linked antibody; an oligonucleotide to capture an oligonucleotide linked to an antibody;
Physical separation may also be used, such as sedimentation, filtration, FACS (for example,
using beads that are labeled with a spectral signature), and magnetic separation (when the
antibody is linked to a matrix with magnetic properties, such as a magnetic bead).
[00168] Many binding partners are known in the art (e.g., a dinitrophenyl group, digoxigenin,
fluorophores, Oregon Green dyes, Alexa Fluor 488 (Molecular Probes), fluorescein, a dansyl
group, Marina Blue (Molecular Probes), tetramethylrhodamine, Texas Red (Molecular Probes),
BODIPY (4,4-difluoro-4-bora-3a,4a-diaza-s-indacene; U.S. Pat. No. 4,774,339) dyes, etc) that
can be used in the present invention. Antibodies that can be used as capture reagents can
specifically bind to binding agents are commercially available from vendors such as Molecular
Probes, Eugene, Oreg. These antibodies include antibodies that can specifically bind to a
dinitrophenyl group, a digoxigenin, a fluorophore, Oregon Green dyes, Alexa Fluor 488
(Molecular Probes), fluorescein, a dansyl group, Marina Blue (Molecular Probes),
tetrahmethylrhodamine, Texas Red (Molecular Probes), and a BODIPY dye (Molecular Probes).
Any suitable ligand and anti-ligand may also be used.
[00169] Oligonucleotides can be used as binding partner and capture reagents.
Oligonucleotides include nucleic acids such as DNA, RNA, and mixed RNA/DNA molecules.
The oligonucleotide that is used as the affinity label should be able to hybridize to the sequence
of the oligonucleotide present on the capture reagent. Those of skill in the art will recognize that
many different oligonucleotide sequences can be designed that will hybridize to each other.
Important considerations for designing such oligonucleotide pairs include the actual nucleotide
sequence, the length of the oligonucleotides, the hybridization conditions (e.g., temperature, salt
concentration, presence of organic chemicals, etc.) and the melting temperature of the
oligonucleotide.

[00170] Solid supports suitable for immobilizing (linking) antibodies or proteins from a
sample (and modifications to render solid supports suitable for immobilizing antibodies) are well
known in the art. Examples of a solid support include: a bead (including magnetized beads),
microwell plate, and a protein microarray (e.g., technology owned by Zyomyx, Inc. See, e.g. US
Patent No. 6,365,418). Thus, for example, CdSe-CdS core-shell nanocrystals enclosed in a silica
shell can be easily derivatized for coupling to a biological molecule. Bruchez et al (1998)
Science 281: 2013-2016. Similarly, highly fluorescent quantum dots (zinc sulfide-capped
cadmium selenide) have been covalently coupled to biomolecules for use in ultrasensitive
biological detection. Warren andNie (1998) Science 281: 2016-2018. Fluorescently labeled
beads are commercially available from Luminex and Quantum Dot.
[00171] The bound protein (or in some embodiments, polypeptide fragments) may be released
from the antibody-protein complex using conventional immunoaffinity elution conditions such as
acidic pH, ionic strength, detergents or combinations of the above. Generally, peptide or protein
is de-salted for subsequent fractionation, characterization, or other analysis.
a. Protein cleaving agent
[00172] In some embodiments, the methods of the invention further comprise treating the
sample with a protein cleaving agent, whereby polypeptide fragments are generated. In one
embodiment, the sample is contacted with a protein cleaving agent prior to contacting a sample
with at least one small epitope antibody. In another embodiment, protein is contacted with a
protein cleaving agent after separation of protein from an antibody-protein complex.
[00173] Protein cleaving agent treatment generates protein cleavage fragments (such as
polypeptides), which can facilitate subsequent mass spectral analysis of the amount of protein
and the identity of proteins in a sample(s). In particular, treatment with a protein cleaving agent
treatment can facilitate the analysis of proteins whose molecular masses exceed 25 kDa. Protein
cleaving reagent treatment also may facilitate accessibility and/or access of small epitope
antibodies to a cognate epitope. Protein cleaving agents are well known in the art, and are
further discussed herein. In some embodiments, one protein cleaving agent is used. In other
embodiments, more than one protein cleaving reagent is used. In some embodiments, more than
one type of protein cleaving agent is used with respect to a single sample (e.g., two or more types
of proteases, two or more types of chemical cleavage agents, or a combination of one or more

protease and one or more chemical cleavage agent). Conditions for treatment with a protein
cleaving agent are well known in the art.
[00174] In one embodiment, a protein cleaving agent is a protease Example of proteases that
can be used as protein cleaving agents, include, but are not limited to: chymotrypsin, trypsin
(arg, lys cleavage sequence), thermolysin (phe, leu, iso, val cleavage sequence), V8 protease,
Endoproteinase Glu-C, Endoproteinase Asp-N, Endoproteinase Lys-C, Endoproteinase Arg-C,
Endoproteinase Arg-N, Factor Xa protease, thrombin, enterokinase, V5 protease, and the tobacco
etch virus protease. Proteases useful in the methods of the invention can be genetically
engineered and/or chemically modified to prevent autolysis. It is appreciated that an enzymatic
protein cleaving agent (such as a protease) can be modified to facilitate removal of the protease
from the polypeptide cleavage products following polypeptide cleavage. Such modifications are
known in the art and include: (1) bead-bound (e.g., latex, silica or magnetic bead) protease, (2)
haptenated protease, (3) affinity depletion of the protease (with, for example, a bead-bound anti-
protease, or bead-bound non-cleavable substrate) and/or (4) size exclusion chromatography. The
activity of a protease can be inhibited, for example, by treating with heat, a protease inhibitor, a
metal chelator (e.g., EGTA, EDTA), etc.
[00175] In another embodiment, a protein cleaving agent is a chemical cleaving agent, such as
chemical substances and compounds that cleave polypeptides and peptide bonds. Nonlimiting
examples of chemical cleaving agents include cyanogen bromide (which cleaves at methionine
residues), hydroxylamine (which cleaves between an Asn and a Gly residue), and acid pH (which
can cleave an Asp-Pro bond) (see e.g., Ausubel et al., supra).
[00176] In still further embodiments, phosphatases (e.g., alkaline phosphatase, acid
phosphatase, protein serine phosphatase, protein tyrosine phosphatase, protein threonine
phosphatase, etc.), lipases, and other enzymes can be employed as protein cleaving agents.
Sample
[00177] As noted in the definition and as used herein, "sample" encompasses a variety of
sample types and/or origins, such as blood and other liquid samples of biological origin, solid
tissue samples such as a biopsy specimen or tissue cultures or cells derived therefrom, and the
progeny thereof. The definition also includes samples that have been manipulated in any way
after their procurement, such as by treatment with reagents, solubilization, or enrichment for

certain components, such as proteins or polynucleotides. The term "sample" encompasses a
clinical sample, and also includes cells in culture, cell supernatants, cell lysates, serum, plasma,
biological fluid, and tissue samples. A sample can be from a microorganism, e.g., bacteria,
yeasts, viruses, viroids, molds, fungi, plants, animals, including mammals such as humans. A
sample may comprise a single cell or more than a single cell. Examples of a sample include
blood, plasma, serum, urine, stool, cerebrospinal fluid, synovial fluid, amniotic fluid, saliva, lung
lavage, semen, milk, nipple aspirate, prostatic fluid, mucous, cheek swabs, and/or tears.
[00178] These samples can be prepared by methods known in the art such as lysing,
fractionation, purification, including affinity purification, FACS, laser capture microdissection
(LCM) or isopycnic centrifugation. In some embodiments, subcellular fractionation methods are
used to create enriched cellular or subcellular fractions, such as subcellular organelles including
nuclei, mitochondria, heavy and light membranes and cytoplasm.
[00179] Prior to contacting the sample with one or more small epitope antibodies, the sample
may be treated with agents capable of denaturing and/or solubilizing proteins, such as detergents
(ionic and non-ionic), chaotropes and/or reducing agent. Such agents are known in the art.
[00180] Under certain circumstances, it may be desirable to remove or minimize abundant
proteins present in a sample, for example, by targeted immunodepletion, or other methods known
in the art. Generally, such removal (or reduction) occurs prior to contacting the sample with one
or more small epitope antibodies (however, such reduction or removal can occur during or after
treatment with small epitope antibodies). Any suitable reagent may be used, including one or
more small epitope antibody(ies). In one embodiment, removal and/or reduction of one or more
sample components is effected by treating the sample with one or more small epitope antibodies.
[00181] In some embodiments, it may be desirable to treat the sample with a polysaccharide
cleaving agent, for example, to reduce, minimize, and or eliminate glycosylation of sample
protein. Removal of any carbohydrate moieties may be accomplished chemically or
enzymatically. Examples of polysaccharide cleaving agents include glycosidases,
endoglycosidases, exoglycosylases, and chemicals such as trifluoromethanesulfonic acid.
Endoglycosidases such as Endoglycosidase H (New England Biolabs, Beverly, Mass.), and Endo
Hf (New England Biolabs) are commercially available. These endoglycosidases cleave the
chitobiose core of high mannose and some hybrid oligosaccharides from N-linked glycoproteins.
Exoglycosidases are also commercially available from vendors such as New England Biolabs
and include, beta-N-Acetylhexosaminidase, alpha- 1-2-Fucosidase, alpha-1-3,4 Fucosidase alpha-

1-2,3 Mannosidase, alpha-1-6 Mannosidase, Neuraminidase, alpha-2-3 Neuraminidase, beta 1-3
Galactosidase, and alpha -N-Acetyl-galactosaminidase
[00182] The following Examples are provided to illustrate, but not limit, the invention.

EXAMPLES
Example 1: Preparation and characterization of small epitope antibodies
[00183] Five immunization polypeptides in the format of Multiple Antigenic Peptide (MAP)
were designed as shown in Table 3. These sequences in combination were also used to evaluate
cross-reactivity of the induced antibodies, by virtue of the inclusion in different MAPs of the
same sequence in differing locations. Each of the immunization polypeptides was used to
immunize 4 Balb/C mice using standard methods.

[00184] Notes to Table 3:
Polypeptide MAPI: HSLFHPEDTGQV: From PSA, amino acids #79-89. KKTTNV:
From Meningococcal Opa protein, containing KTT, a published 3mer antibody epitope
(Malorny, Morelli et al. 1998).
Polypeptide MAP2: Alternate sequences of MAPI.
Polypeptide MAP3: LTPKK: Motif 1 of PSA (Nagasaki, Watanabe et al. 1999).
KKTTNVLTVPTNIPG: From Meningococcal Opa protein, containing two published 3mer
antibody epitopes: KTT and NIP and one 4mer epitope: TNIP (Morelli, et al. (1997) Mol
Microbiol 25(6): 1047-64.

Polypeptide MAP4: LTPKK: From PSA, the same as in peptide MAP3.
LTQENQNRGTH: An immunogenic sequence of alpha-1-ACT selected by DNAStar computer
program. IYNQ: From Meningococcal Opa protein, containing a 2mer epitope IY and four
amino acids of a 5mer epitope, TIYNQ and of a 7mer epitope TPTIYNQ (Marelli, et al, id).
Polypeptide MAP5 TIYNTNIPG: From Meningococcal Opa protein (Marelli, et al,
id.). LTQENQNRGTH: The same as in peptide MAP4.
[00185] Two sets of screening polypeptides were designed: (1) 5 C-terminally biotinylated
with the same sequences as the immunization polypeptides (shown in Table 4); and (2) 43 lOmer
biotinylated polypeptides with sequences panning all five immunization polypeptides (shown in
Table 5).





[00186] After a standard period of immunization, immune serum was collected from each
mouse using standard methods, and tested using ELISA as follows:
ELISA plates (Corning 3369 or similar) were coated with 100µl/well or 50µl/well of
streptavidin (Sigma Catalog No. S4762 or similar, 5ug/ml in 50mM carbonate buffer, pH 9.6).
Plates were incubated at 4°C overnight or at room temperature for 2 hours. Following
incubation, plates were washed 3 times with PBS+0.05% Tween-20 (PBST buffer). Following
washing, plates were blocked with 250µl/well of PBST, and incubated at room temperature for 1
hour, or at 4°C overnight. PBST was removed, and 100µl/well or 50µl/well of a test
biotinylated polypeptide selected from Table 4, at a concentration of 5 |J.g/ml (diluted in PBS)
was added. Plates were incubated for about 30 to 60 min at room temperature. Following
incubation, plates were washed 3 times with PBST. Then, 100µl or 50µl/well of test serum {i.e.,
from test bleeds) was added, and the plates were incubated for one hour at room temperature, or
overnight at 4°C. To titer immunoreactivity, the serum was generally diluted prior to testing to

1:500,1:2000,1:8000, or 1:32000. Following incubation, plates were washed 3 times with
PBST. To detect antibody binding, a 1:10,000 dilution of goat anti-mouse IgG (and IgM)-HRP
conjugate (Jackson'Immuno order No. 115-036-071, or similar) was added to each well. Plates
were incubated at room temperature for another hour, then washed 5 times with PBST. HRP
substrate (Sigma Fast OPD,) was added and incubated in the dark at room temperature for 30-60
minutes. Plates were read at OD450 with a 96-well colorimetric detector if HRP reaction was
not stopped. Alternatively, HRP reaction was stopped with 1.25M sulfuric acid, and plates were
read at OD492.
[00187] 12 test bleeds from Groups 1, 2, and 3 mice were tested. No immune response was
observed from mice in groups 1 and 3, and these mice were not studied further. All 4 mice in
group 2 showed strong immune response to screening polypeptide Pep2-0 (>1:32,000). In
addition, immune sera from two of the four mice in group 2 (mice #2-1 and #2-4) showed cross-
reactivity with screening polypeptides designed for groups 1 and 3 due to the sequence
homology between MAP2 and MAP1/MAP3. These results were consistent with mice #2-1 and
#2-4 expressing antibodies mat recognize distinct and concise epitopes present within more than
one screening antigen used in the ELISA assays. A test of the #2-1 and #2-4 sera versus 23
lOmer biotinylated polypeptides that span sequences of all three immunization polypeptides for -
group 1,2 and 3 mice also demonstrated a broad cross-reactivity.
[00188] Eight test bleeds from groups 4-5 were tested by ELISA. Group 4 mice demonstrated
a modest response to their relevant screening polypeptide, Pep4-0, while exhibiting strong cross-
reactivity with Pep3-0, the screening polypeptide designed for group 3. Group 4 mice did not
show substantial cross-reactivity to Pep5-0 even though there is significant sequence identity
between Pep4-0 and Pep5-0. In contrast, 3 of 4 mice in group 5 (mice 5-2, 5-3,5-4) exhibited
robust immunoreactivity to both their screening polypeptide, Pep5-0, and to the related screening
polypeptide, Pep4-0. The sera from the responsive mice in group 5 did not demonstrate
substantial cross-reactivity to the Pep3-0, even though there is a 5 amino acid block of sequence
identity. A test of the #5-2 and 5-3 sera versus 23 lOmer biotinylated polypeptides that span
sequences of all three immunization polypeptides for group 4 and 5 mice demonstrated two
broad but distinctive reaction patterns with the mapping polypeptides spanning sequences of
immunization polypeptides for groups 4 and 5 mice.

[00189] Group 2, mice #1 and #4, and Group 5, mice #2 and #3, showed the best immune
responses, as summarized in Table 6 and Figure 1. These mice were selected for hybridoma
fusions.

[00190] The animals were sacrificed, the lymph nodes and spleens harvested, then B cell
hybridoma fusions using P3 mouse myeloma cell line as a fusion partner were generated using
standard methods. Fusions were plated and incubated for 11-14 days before screening.
[00191] In the first round of screening, hybridomas from group 2 and 5 mice were analyzed
by ELISA in 96 well plates, essentially as described above, using the corresponding screening
polypeptides, 2-0 and 5-0. Following several rounds of screening, 48 positive hybridoma lines
were identified and transferred to 24 well plates for expansion and additional characterization
including epitope mapping. Of the 48 positive lines, 33 were derived from the Group 2 animals
that received the MAP2 immunogen while the remaining 15 originated from the Group 5
animals. Most of the hybridoma lines (-94%) were the fusion products of B cells harvested from
the spleen. Thirteen of the 48 hybridoma lines expressed IgG, 25 expressed IgM, and the
remaining 10 hybridoma lines were expressing both IgG and IgM or were not expressing either
IgG or IgM and were therefore expressing either IgA or IgE.

[00192] In the second round of screening, hybridomas selected for expansion were re-tested
against the relevant screening polypeptide (either polypeptide 2-0 or polypeptide 5-0). 13 of the
48 hybridomas characterized after the 24 well expansion phase exhibited sequence specific
binding to the screening polypeptide 2-0. Other hybridomas bound non-specifically (i.e., bound
a variety of oligopeptide sequences), failed to bind (reflecting either a false positive or clonal
instability and loss during the transfer and subsequent propagation in 24 well plates) or bound
control wells containing BSA.
[00193] The 13 hybridomas that specifically bound to screening polypeptide 2-0 were epitope
mapped using ELISA as described above, using 3 different sets of 10mer C-terminal biotinylated
mapping polypeptides: polypeptides 1-1 to 1-5; 2-1 to 2-9; and 3-1 to 3-9 (see Table 5). 10 of
the 12 hybridoma lines exhibited maximum reactivity with a single mapping polypeptide, 2-1,
and that hybridomas 2.03 and 2.11 showed strong binding to different overlapping sets of
mapping polypeptides, polypeptides 2-1 through 2-3 and 2-7 through 2-9. Because these data
showed strong reactivity to a single mapping polypeptide for most hybridoma lines, we
considered the possibility that steric hindrance associated with immobilization of the mapping , .
polypeptides (specifically, biotin-avidin immobilization) was preventing antibody binding to the
epitope present within a cognate series of 10mers, thus potentially biasing the ELISA epitope
map results. Thus, we evaluated epitope specificity using a competitive binding assay.
[00194] Individual mapping polypeptides were evaluated for their ability to inhibit antibody
binding to the 2-0 screening polypeptide affixed to streptavidin-coated 96 well plates. In this
format, the 10mer mapping polypeptides were not tethered within the binding pocket of
streptavidin and consequently should not be sterically hindered from interacting with a reactive
antibody present within the set of 13 hybridomas. Competition experiments were performed
using standard methods using the 2-0 screening polypeptide affixed to streptavidin-coated 96
well plates and 10mer mapping polypeptide added to each well.
[00195] Using the competitive binding assay, the epitopes recognized by 10 of the 13
hybridomas were determined. Eight of the hybridomas were specific for the epitope PEDTG,
hybridoma 2.03 was specific for epitope DTG and hybridoma 2.11 recognized the epitope
KKTTN. Hybridoma 2.31 exhibited a complex inhibition pattern suggesting that this line is a
mixture of 2 or more specificities and should be subcloned to segregate the individual
reactivities. Finally, hybridomas 1.02 and 2.12 showed poor discrimination in the competitive
inhibition assay. The results of this analysis are summarized in Table 7.



[00196] The competitive binding assays were repeated twice, and it was confirmed that
hybridoma 2.11 recognized the epitope KTTN, not the epitope KKTTN as suggested in the
preliminary experiments. The epitope competitive binding assays confirmed the epitope
characterization described above for the other, hybridomas. The results of this updated analysis
are summarized in Table 8. Hybridomas 2.03 (also called DA001-2.03), 2.04 (DA001-2.04) and
2.11 (also called DA001 -2.11) are being prepared for deposit at the ATCC.



Example 2. Preparation of Small Epitope Antibodies
[00197] An approach to identify antibodies based on phage display antibody screening was
performed. Five peptide sequences used for the selection of positive antibodies are shown in
Table 9. These sequences in combination were also used to evaluate cross-reactivity of the
selected antibodies.

[00198] Note to Table 9: The letter 'X' denotes a mixture of the naturally-occurring L-amino
acids excluding cysteine, methionine, and tryptophan.
[00199] Positives were selected after six rounds of enrichment. The results of phage ELISA
screens against the five screening peptides is shown in Table 10. A total of 96 phage were
screened for P1; 48 were screened for polypeptides P6-P9. In all cases, positive phage were
identified above background.




[00200] In a secondary screen of positives identified in the primary screen, a phage ELISA
assay was done against all five polypeptides. Up to five positives were selected for the secondary
screen. Figure 2 shows the results of the most selective clones using this assay. All five positives
yielded significant signal to polypeptide above BSA, and the phage selected from PI
(L50P1-15), Pδ (L50P8-5), and P9 (L50P9_5) appear to show specificity in this semi-
quantitative assay.
[00201] The reactive antibody for L50P 1_15 was subcloned into a vector for bacterial
expression of single chain antibodies. The crude periplasmic preparation was analyzed using a
surface plasmon resonance (SPR) biosensor assay to monitor the formation of complex
association and the dissociation of the protein from immobilized peptides (Malmborg et al,
1995). Figure 3 shows the SPR profile of single chain antibody against the five polypeptides and
BSA. The antibody has the highest affinity for peptide 1, with an estimated Kd of 2xl0"8.
Example 3. Protein profiling and biomarker development
[00202] In one exemplary method for protein profiling, serums derived from healthy and
affected individuals for a particular disease of clinical interest are subjected to: (a) debulking of
the most abundant protein constituents; (b) deglycosylation of the less abundant proteins that
remain; (c) reduction and alkylation of cysteine residues present in the debulked proteome; (d)
digestion of the debulked proteome to completion; (e) fractionation of the resulting peptide
fragments with small epitope antibodies as described above; and (f) comparison of the
composition and relative abundance of peptide constituents from epitope enriched fractions
derived from healthy and affected patients to identify candidate biomarkers associated with a
specific disease.
[00203] Fractionation with small epitope antibodies is performed in parallel with a set of
approximately 100 small epitope antibodies of different specificities. Each antibody is chosen
based on a set of criteria including epitope size, epitope abundance in the serum proteome,
specificity, affinity, and sampling redundancy. The epitopes recognized by the antibodies are
predominantly 3mers, although some are 4mers or 5mers that satisfy the abundance criteria, with
each epitope occurring in 0.5-3% of the constituents of the serum proteome. Each antibody
recognizes its cognate epitope in a context-independent manner and with high affinity. The

complete set of small epitope antibodies used for fractionation provides 3-5 fold sampling
redundancy to accommodate the variability expected in both expression levels for different
proteins and capture efficiencies for each antibody in the set.
[00204] Mass spectroscopy is used to analyze the peptide composition and peptide constituent
expression levels for each small epitope antibody fraction. Biomarkers are identified that are
differentially expressed in healthy and diseased individuals. ELISA assays are developed that
can discriminate between healthy and affected individuals based on specific levels of identified
biomarkers present in plasma or serum.
[00205] Although the foregoing invention has been described in some detail by way of
illustration and examples for purposes of clarity of understanding, it will be apparent to those
skilled in the art that certain changes and modifications may be practiced without departing from
the spirit and scope of the invention. Therefore, the description should not be construed as
limiting the scope of the invention, which is delineated by the appended claims.
[00206] All publications, patents, and patent applications cited herein are hereby incorporated
by reference in their entireties for all purposes and to the same extent as if each individual
publication, patent, or patent application were specifically and individually indicated to be so
incorporated by reference.

WE CLAIM:
1. A method for reducing the complexity of a sample that comprises a mixture of
proteins, said method comprising separating a multiplicity of different proteins from said
mixture of proteins, wherein said multiplicity of proteins is bound to a small epitope antibody
in small epitope antibody-protein complexes, wherein the small epitope antibody binds an
epitope consisting of 3 to 5 amino acids, wherein the multiplicity of proteins bound by the
small epitope antibody is enriched; and separating said multiplicity of proteins from said
antibody-protein complexes.
2. A method as claimed in claim 1, comprising contacting the multiplicity of
proteins separated from the antibody-protein complexes with a protein cleaving agent to form
polypeptide fragments.
3. A method as claimed in claim 2, wherein the protein cleaving agent
comprises a protease.
4. A method as claimed in claim 2, wherein the protein cleaving agent
comprises a chemical agent, such as herein described.
5. A method as claimed in claim 1, wherein said mixture of proteins comprises
a mixture of polypeptide fragments produced by cleavage of proteins in said sample with a
protein cleaving agent
6. A method as claimed in claim 5, wherein the protein cleaving agent comprises
a protease.
7. A method as claimed in claim 5, wherein the protein cleaving agent
comprises a chemical agent, such as herein described.
8. Multiplicity of proteins prepared by the method as claimed in claim 1.
9. Polypeptide fragments prepared by the method as claimed in claim 2.

10. A method for reducing the complexity of a sample that comprises a mixture of
proteins, said method comprising separating a multiplicity of different proteins from said
mixture of proteins, wherein said multiplicity of proteins is bound to a plurality of different
small epitope antibodies in small epitope antibody-protein complexes, wherein each small
epitope antibody in said plurality of small epitope antibodies binds a different epitope
consisting of 3 to 5 amino acids, wherein said multiplicity of proteins bound by said plurality
of small epitope antibodies is enriched; and separating said multiplicity of proteins from the
antibody-protein complexes.
11. A method as claimed in claim 10, wherein the small epitope antibodies are
immobilized on a solid matrix.
12. A method as claimed in claim 10, wherein the small epitope antibodies are
detectably labeled.
13. A method as claimed in claim 10, wherein said plurality of different small
epitope antibodies comprises at least 100 different small epitope antibodies.
14. A method for reducing the complexity of a sample that comprises a mixture of
proteins, said method comprising:

(a) contacting the sample comprising a mixture of proteins with at least
one small epitope antibody to form small epitope antibody-protein complexes, wherein a
multiplicity of different proteins from said sample is bound to said small epitope antibody in
small epitope antibody-protein complexes, wherein the small epitope antibody binds an
epitope consisting of 3 to 5 amino acids;
(b) separating said small antibody-protein complexes from unbound
protein in the sample; and
(c) separating said multiplicity of proteins from said small antibody-
protein complexes.
15. A method as claimed in claim 14, wherein steps (a) and (b) are performed
sequentially.

16. A method as claimed in claim 14, wherein steps (a) and (b) are performed
simultaneously.
17. A method as claimed in claim 14, wherein the at least one small epitope
antibody comprises at least 100 small epitope antibodies.
18. A method for determining presence or absence of a protein of interest in a
sample, said method comprising detecting the protein of interest, if any, in an enriched
protein fraction, wherein the enriched protein fraction is prepared by the method of claim 1,
and wherein detection of the protein of interest indicates presence of the protein in the
sample.
19. A method as claimed in claim 18, wherein said detection comprises mass
spectrometry.
20. A method for determining the amount of a protein of interest in a sample, said
method comprising quantifying the amount of the protein of interest in an enriched protein
fraction, wherein the enriched protein fraction is prepared by the method as claimed in claim
1.
21. A method as claimed in claim 20, wherein said quantifying comprises mass
spectrometry.
22. A method for identifying a protein in small epitope antibody-protein
complexes, wherein the small epitope antibody-protein complexes are prepared as claimed
in claim 1.
23. A method as claimed in claim 22, wherein said identifying comprises mass
spectrometry.
24. A method for identification of a biomarker, said method comprising
comparing the proteins in the two or more enriched protein fractions, wherein each of the
enriched protein fractions is prepared from a sample by the method as claimed in claim 1.

25. A method as claimed in claim 24, wherein the two or more samples comprise
samples from at least one individual who has a disease condition and at least one individual
who does not have the disease condition, and wherein presence or absence of the biomarker
is indicative of the disease condition.
26. A kit for use in a method as claimed in any of claims 1-7 and 10-25,
comprising at least one small epitope antibody, wherein said at least one small epitope
antibody binds epitopes consisting of 3 to 5 amino acids.
27. A kit as claimed in claim 26, wherein said at least one small epitope antibody
is detectably labeled.
28. A kit as claimed in claim 26 or 27, wherein said at least one small epitope
antibody comprises a plurality of small epitope antibodies.
29. A kit as claimed in claim 28, wherein said plurality of small epitope
antibodies comprises at least 100 small epitope antibodies.
30. A method as claimed in claim 10, comprising contacting the proteins
separated from the antibody-protein complexes with a protein cleaving agent to form
polypeptide fragments.
31. A method as claimed in claim 30, wherein the protein cleaving agent
comprises a protease.
32. A method as claimed in claim 30, wherein the protein cleaving agent
comprises a chemical agent, such as herein described.
33. A method as claimed in claim 10, wherein said mixture of proteins comprises
a mixture of polypeptide fragments produced by cleavage of protein in said sample with a
protein cleaving agent.
34. A method as claimed in claim 33, wherein the protein cleaving agent
comprises a protease.

35. A method as claimed in claim 33, wherein the protein cleaving agent
comprises a chemical agent, such as herein described.
36. The method as claimed in any one of claims 1,10, and 14, wherein the epitope
to which said small epitope antibody or antibodies bind is present within the amino acid
sequence of each protein in said multiplicity of proteins.

The present invention relates generally to methods for reducing the complexity of a sample. More specifically, the
present invention relates to proteomics, the measurement of the protein levels in biological samples, and analysis of proteins in a
sample using antibodies that recognize small epitopes.

Documents:

501-KOLNP-2006-CORRESPONDENCE.pdf

501-KOLNP-2006-FORM 27.pdf

501-KOLNP-2006-FORM-27.pdf

501-kolnp-2006-granted-abstract.pdf

501-kolnp-2006-granted-assignment.pdf

501-kolnp-2006-granted-claims.pdf

501-kolnp-2006-granted-correspondence.pdf

501-kolnp-2006-granted-description (complete).pdf

501-kolnp-2006-granted-drawings.pdf

501-kolnp-2006-granted-examination report.pdf

501-kolnp-2006-granted-form 1.pdf

501-kolnp-2006-granted-form 13.pdf

501-kolnp-2006-granted-form 18.pdf

501-kolnp-2006-granted-form 3.pdf

501-kolnp-2006-granted-form 5.pdf

501-kolnp-2006-granted-gpa.pdf

501-kolnp-2006-granted-reply to examination report.pdf

501-kolnp-2006-granted-specification.pdf


Patent Number 228756
Indian Patent Application Number 501/KOLNP/2006
PG Journal Number 07/2009
Publication Date 13-Feb-2009
Grant Date 10-Feb-2009
Date of Filing 03-Mar-2006
Name of Patentee TETHYS BIOSCIENCE, INC.
Applicant Address 1900, POWELL STREET, SUITE 210, EMERYVILLE, CA
Inventors:
# Inventor's Name Inventor's Address
1 URDEA, MICHAEL S. 100, BUNCE MEADOW DRIVE, ALAMO, CA 94507
2 LANDES, GREGORY MICHAEL 1603, VETTA DRIVE, LIVERMORE, CA 94550
3 WENT, GREGORY T. 257, BUENA VISTA, MILL VALLEY, CA 94941
PCT International Classification Number G01N 33/68
PCT International Application Number PCT/US2004/027005
PCT International Filing date 2004-08-18
PCT Conventions:
# PCT Application Number Date of Convention Priority Country
1 60/511,720 2003-10-15 U.S.A.
2 60/496, 154 2003-08-18 U.S.A.