Title of Invention

AN ASPERGILLUS WHICH DO NOT PRODUCE PROTEASES

Abstract ABSTRACT " AN ASPERGILLUS WHICH DO NOT PRODUCE PROTEASES" 2287/MAS/96 An Aspergillus, wherein the areA gene by recombinant DNA technology has been modified in a way by which it cannot be expressed in a way providing for a functional AreA activator, and wherein the genes encoding for the extracellular proteases PepC and/or PepE have been inactivated in a manner whereby they are not expressed to produce functional proteases.
Full Text Title: Novel Microorganisms
The present invention relates to fungi, which do not produce proteases. The fungi of the invention are useful as hosts for the production of proteins susceptible to proteolytic degradation by the proteases usually produced, and the invention consequently encompasses processes for the production of proteins of interest in high yields by using the fungi of the invention. The invention also comprises methods for producing such fungi and DNA Constructs to be used in these methods.

Fungi, and especially filamentous fungi, are widely used commercially because of their ability to secrete remarkably high levels of proteins
Among the filamentous fungi species belonging to the genus Aspergillus have a long history of commercial use for the production of endogenous and lately also heterologous proteins.
One disadvantage with most microorganisms used for the production of proteins is the inherent production of proteases which may subject a protein product of interest to degradation due to proteolysis.
Various ways of avoiding this have been envisaged. Among other solutions it has been suggested to delete or disrupt the genes encoding the various proteases. Unfortunately, the fungi produce a high number of proteases making such a solution more or less unrealistic.
A need is therefore persisting for strains of filamentous fungi exhibiting no or very low levels of protease production.

For a number of years it has been known that the regulatory gene areA which mediates nitrogen metabolite repression in A. nidulans influences the production of extracellular proteases (Arst & Cove, molec. gen. Genet. 126,(1973) 111-141).
The areA gene from A. nidulans has been cloned (Caddick et al., EMBO Journal 5, (1986) 1087-1090) and various modifications made to it to evaluate functions of different regions in the activator protein encoded by this gene (Stankovitch et al. Mol. Microbiol. 7, (1993) 81-87). Furthermore the gene coding the corresponding function in A. Jumigatus apparently has been cloned recently (Hensel et al. 2nd European Conference on Fungal Genetics, April 28 to May 1,1994, Book of Abstracts, Ell).
From the literature a single use is also known of a strain of A. nidulans of genotype argB areA 1 as a host for the production of t-PA (Upshall et al. Biotechnology 5, (1987) 1301-1304). In this example only the argB genotype is used as a selection marker through its arginine prototrophy, while the areA genotype is simply a coincidence.
International Patent Publication No. WO 95/35385 discloses the deletion of the areA gene as a means for reducing the protease level in filamentous fungi.
Apart from the extracellular proteases, fungi also produce a number of intracellular proteases (also called endoplasmic).
Among these a serine protease of the subtilisin type produced by A. niger and designated PepC has been described, the gene expressing it cloned, and a deletion mutant described in EP 574 347 and in Frederick et al, Gene, 125 57-64 (1993)
A further such protease of the aspartic type designated PepE has been dis¬closed in Jarai et al, Gene, 145 171-178 (1994). the article discloses the cloning and characterisation of the pepE gene and speculates about the regulation of the pepE and pepC genes.
The present invention has as an object the alleviation of the need for protease free filamentous fungi.


The present invention consequently relates to fungi, wherein the areA,pepC, and/or pepE genes by recombinant DNA technology have been modified such that they cannot be expressed in a way providing for a functional AreA activator and functional PepC and/or PepE proteases.
The invention furthermore relates to methods for producing such fungi, obtained by deletion of the areA, pepC, and/or pepE genes.
This may be obtained through a method comprising
i) cloning of the areA,pepC, and/or pepE genes from a fungus of interest,
ii) producing DNA constructs each comprising one among the areA gene, the
pepC gene, and the pepE gene, wherein an internal part has been substituted,
deleted, or extra DNA has been inserted,
iii) transforming said fungus with the constructs, and
iv) isolating transformants which are areA',pepC, and/orpepK.
The information obtained from the above mentioned cloning of the areA, pepC, and/or pepE genes may also be used in connection with the well-known anti-sense technology, to construct an expression plasmid giving rise to synthesis of a RNA molecules complementary to the mRNA transcribed from the areA, pepC, and/or pepE genes, and to transform the fungus of interest therewith.
The invention furthermore relates to DNA constructs intended for use in the above mentioned methods.
Furthermore the invention relates to methods of producing a desired protein or gene product, especially secreted proteins, whereby a fungal host modified and optionally transformed with a DNA construct comprising at least a DNA sequence coding for the protein or gene product of interest, is cultivated in a suitable growth medium at appropriate conditions and the desired gene product is recovered and purified.
When working with the invention it was surprisingly found that the fungi of the invention produces such secreted proteins in a much improved yield.

It was also surprisingly found that the only nitrogen source capable of providing good growth of the A. oryzae areA' strains was glutamine.
The invention furthermore relates to protein products produced by the above methods.
Also the invention relates to a DNA sequence coding for the pepC gene from A. oryzae (SEQ ID No. 1) or functional alleles thereof.
The invention also covers a PepC protease from A. oryzae (SEQ ID No. 2), and processes for the production of the PepC protease comprising transforming a suitable host with a DNA construct comprising a DNA sequence coding for the PepC protease, selecting a transformant capable of producing said PepC protease, cultivating said transformant in an appropriate growth medium and recovering said PepC protease from said culture.
Furthermore the invention relates to a DNA sequence coding for the pepE gene from A. oryzae (SEQ ID No. 3) or functional alleles thereof.
Also, the invention relates to a PepE protease from A. oryzae (SEQ ID No. 4), and processes for the production of the PepE protease comprising transforming a suitable host with a DNA construct comprising a DNA sequence coding therefore, selecting a transformant capable of producing said PepE protease, cultivating said transformant in an appropriate growth medium and recovering said PepE protease from said culture.
According to these aspects said host is preferably a fungus, according to the invention, especially A. oryzae, and wherein said DNA construct provides for an extra copy of the gene encoding either said PepC or PepE protease.

The invention is described in further detail in the following parts of the specification with reference to the Examples and the drawing, wherein

Fig. 1 shows the steps involved in the construction of HowBlOl,
Fig. 2 shows the construction of pToC345,
Fig. 3 shows the steps involved in the construction of pToC315.
Fig. 4 diagrammatically shows a two step gene deletion of thepyrG gene.
Fig. 5 shows the construction of pJaL235,
Fig. 6 shows the steps involved in the construction of pJaL335,
Fig. 7 shows the steps involved in the construction of pJaL363,
Fig. 8 shows the steps involved in the construction of pJaLz,
Fig. 9 shows the construction of pSK5 and pSK9,
Figs. 10a and 10b show the steps involved in the construction of pToC243 and pToC266,
Fig. 11 shows the steps involved in the construction of pMT1606,
Fig. 12 shows the construction of pToC56,
Figs. 13a and 13b show the steps involved in the construction of pJaL368, and
Figs. 14a and 14b show the construction of pToC338.
In the present specification the following definitions are used:
The expression areAD means a strain in which the areA gene is deleted. Similar notations are used for strains, wherein \hepepC, and/or pepE genes are deleted.
The expression areA' means a strain which does not produce a functional AreA activator. The term "loss of function" is also often used for this. Similar notations used for strains, which do not produce functional PepC, and/or PepE protease(s).
The expression "anti-sense technology" describes methods such as disclosed in US Patent No. 5,190,931.

As indicated the present invention relates in its first aspect to fungi, wherein the areA gene by recombinant DNA technology has been modified in a way by which it cannot be expressed in a way providing for a functional AreA activator, and wherein the genes encoding for the extracellular proteases PepC and/or PepE has been inactivated in a manner whereby they are not expressed to produce functional proteases.
This object may specifically be obtained by deletion or disruption of the areA, pepC, and/or pepE genes.
The cloning of the areA, pepC, and/or pepE genes are described in the Examples.
AreA homologs from other fungi could be cloned either by cross hybridization with one of the already known genes or by complementation of areA mutants; e.g. A. nidulans areA-18 or the A. oryzae areA deleted strain described in this application.
Methods for deleting or disrupting a gene are specifically described in WO 90/00192 (Genencor).
Methods for substituting DNA in a gene are also generally known, and can be accomplished by substituting one or more continuous parts of the gene, but it may also be obtained by site directed mutagenesis generating a DNA sequence encoding a AreA activator variant that is not functional.
Another method by which such an object may be obtained is by using anti-sense technology.
The anti-sense technology and how to employ it is described in detail in the aforementioned US Patent No. 5,190,931 (University of New York).
A further method of obtaining said inactivation is by inserting extra DNA internally in the areA gene, thereby giving rise to the expression of a dysfunctional activator protein.
In connection with this method information provided by the cloning can be used to make DNA constructs that can be integrated into the areA gene, and even replace it with another gene, such as the pyrG gene.
7

A further method of avoiding the presence of the areA activator is by interfering with the regulation of the expression signals regulating the expression of the areA gene itself.
The principles described above apply equally to the/jepC, and/or pepE genes.
According to the invention the fungus preferably belongs to a genus selected from the group comprising Aspergillus, Trichoderma, Humicola, Candida, Acremonium, Fusarium, and Penicillium.
Among these genera species selected from the group comprising A. otyzae, A. niger, A. awamori, A. phoenicis, A. japonicus, A, foetidus, A. nidulans, T. reesei, T. hanianum, H. insolens, H. lanuginosa, F. graminearum, F. solani, P. chrysogenum, and others are preferred.
As indicated the invention also is meant to encompass the method for
producing the fungi of the first aspect of the invention, and wherein said inactivation has
been obtained by deletion of the areA,pepC, and/or pepE genes, which method comprises
i) cloning of the areA, pepC, and/or pepE genes from a fungus of interest,
ii) producing DNA constructs each comprising one among the areA gene, the
pepC gene, and/or the pepE gene, wherein an internal part has been
substituted, deleted, or extra DNA has been inserted,
iii) transforming said fungus with the constructs, and
iv) isolating transformants which are areA',pepC, and/or pepE.
Since it is believed that the maturation of the PepC protease is controlled by the PepE protease the invention also comprises a method for producing a fungus of the invention, wherein said inactivation has been obtained by deletion of the areA and pepE genes, which method comprises
i) cloning of the areA and pepE genes from a fungus of interest,
ii) producing DNA constructs each comprising one among the areA gene and the
pepE gene, wherein an internal part has been substituted, deleted, or extra
DNA has been inserted,
iii) transforming said fungus with the constructs, and
8

iv) isolating transformants which are areA\ andpepE.
Also included is the method for producing the fungi, wherein the inactivation
has been obtained by using anti-sense technology. Such a method comprising
i) construction of expression plasmids, each of which give rise to synthesis of an
RNA molecule complementary to the mRNA transcribed from the areA gene,
the pepC gene, and/or the pepE gene,
ii) transformation of die host fungus with said expression plasmids and a suitable
marker, either on separate plasmids or on the same plasmid,
iii) selection of transformants using said marker, and
iv) screening selected transformants for strains exhibiting a reduction in the
synthesis of the AreA, PepC, and/or PepE products.
A further aspect of the invention is meant to comprise DNA constructs for use in the above mentioned methods.
In respect of the former method said DNA constructs may comprise the areA, pepC, and/or pepE genes, wherein an interna) part has been substituted, deleted, or extra DNA has been inserted.
At least one of the DNA constructs may furthermore also comprise DNA sequences encoding a protein product of interest, such as those mentioned later.
In respect of the latter anti-sense method the DNA constructs may comprise inverted DNA sequence of the areA,pepC, and/or pepE genes connected to a functional promoter, whereby the mRNAs are at least partially complementary to mRNAs produced from the areA,pepC, and/or pepE genes.
A further aspect of the invention relates to a process for the production of a desired gene product, preferably a secreted gene product, whereby a iungus according to the invention is cultivated in a suitable growth medium at appropriate conditions and the desired gene product is recovered and purified.

In the case of a gene product expressed by a heterologous gene the DNA sequence coding for the desired gene product may be a part of the DNA construct used for producing said fungus.
Normally, however, a separate transformation of the fungus of the invention is performed in order to make the fungus capable of producing the desired product.
Methods for transforming fungi are well known in the art, cf. e.g. EP 0 184 438 A2 (Gist-Brocades N.V.) and EP publication No. 0 98 993 (Novo Nordisk A/S).
For indigenous products this is of course not necessary, but in order to increase the production it may be an advantage to provide for multiple copies of the gene encoding the protein of interest to be incorporated into the host.
The desired gene product is generally a peptide or protein, preferably an enzyme.
Among enzymes it is preferably selected from the group comprising proteases, such as trypsin and chymosin; lipases, cutinases, cellulases, xylanases, laccases, pectinases, etc.
Another type of desired gene product is generally a therapeutically active peptide or protein.
Among the therapeutically active peptide or protein the protein preferably is selected from the group comprising insulin, growth hormone, glucagon, somatostatin, interferons, PDGF, factor VII, factor Vm, urokinase, t-PA, CSF, lactoferrin, TPO etc.
A further aspect of the invention relates to the DNA sequences coding for the pepC gene from A. oryzae (SEQ ID No. 1), the pepE gene from A oryzae (SEQ ID No. 3)or functional alleles thereof. Also encompassed by the invention are the corresponding PepC and PepE proteases and their production, preferably by recombinant means.
In this aspect the invention relates to processes for the production of the PepC protease or PepE protease from A. oryzae comprising transforming a suitable host with a DNA construct comprising a DNA sequence encoding the protease of interest, selecting a transformant capable of producing the protease, cultivating the transformant

in an appropriate growth medium and recovering the PepC or PepE protease from the culture.
The host used in such a process is preferably a host according to the above mentioned aspects of the invention
In certain embodiments of the process for producing the PepC or PepE protease the host is A. oryzae. In that case it is preferred that the DNA construct comprising a DNA sequence coding for the protease, provides for an extra copy of the gene already present in the host.
The DNA construct comprising the DNA sequence encoding the protease will normally also comprise regulatory elements in order to provide for proper expression and processing of the protease in the host.
The invention is explained in further detail in the Examples given below. These should, however, not in any way be construed as limiting the scope of the invention as defined in the appended claims.
F.XA1VTPT.FS
Materials and Methods Strains
A. oryzae, IF04177:available from Institute for Fermentation, Osaka; 17-25 Juso
Hammachi 2-Chome Yodogawa-Ku, Osaka, Japan.
ToC9I3: The construction of this strain is described in the Examples.
areA: This gene codes for a regulatory protein controlling nitrogen catabolism.
pepC: This gene codes for a serine protease of the subtilisin type
pepE: This gene codes for an aspartic protease.

pyrG: This gene codes for orotidine-S'-phosphate decarboxylase, an enzyme
involved in the biosynthesis of uridine.
bar. This gene was originally isolated from Streptomyces hygroscopicus and codes
for phosphinothricin acetyltransferase. The enzyme modifies phosphinothricin
(=glufosinate) and thereby inactivates this compound which is toxic to
bacteria, fungi and plants.
Plasmids
pUC118: pS02: The construction of this plasmid is described in the Examples.
pJers4: A 2.0 kb subclone of pS02 in pUC118. pJers4 contains a functional A.
oryzaepyrG gene.
pS05: The construction of this plasmid from pS02 is described in the Examples.
pToC56: The construction of this plasmid is described in EP publication No. 0 98
993.
pToC68: The construction of this plasmid is described in WO 91/17243.
pToC90: A subclone of p3SR2, harboring the amdS gene from Aspergillus nidulans
as a 2.7 kb Xbal fragment [Corrick et al., GENE 1987 53 63-71], on a
pUC19 vector[Yannisch-Perron et al., GENE 1985 33 103-119], prepared
as described in WO 91/17243. pToC266: The construction of this plasmid is described in the Examples. pToC299: The construction of this plasmid is described in the Examples. pToC338: The construction of this plasmid is described in the Examples. pMT1606: The construction of this plasmid from pBPIT (B. Straubinger et al. Fungal
Genetics Newsletter 39(1992): 82-83) and p775 (EP publication No. 0 98
993) is described in the Examples.
p775: The construction of this plasmid is described in EP publication No. 0 98
993.

p777: The construction of this plasmid is described in EP publication No. 0 98
993. pHW470: The construction of this plasmid is described in the Examples.
Example 1

The A. oryzae pepE gene was cloned by cross-hybridization with the A. niger gene. A partial A. niger gene was obtained as a 700 bp PCR fragment from a PCR reaction with A. niger chromosomal DNA and pepE specific primers made according to thcpepE sequence published by G. Jarai et al, Gene 145 (1994) 171-178. The fragment was shown to contain pepE sequences by DNA sequencing. It hybridizes to A, oryzae chromosomal DNA under stringent conditions and Southern analysis showed that A. oryzae contains a single pepE like gene.
The pepE gene was deleted both by the gene replacement method and the two step gene replacement method (G. May in "Applied Molecular Genetics of Filamentous Fungi" (1992) pp. 1-25. Eds. J. R. Kinghorn and G. Turner; Blackie Academic and Professional). As marker was used the A. oryzaepyrG gene, the A. oryzae strain was apyrG- strain made by deletion of Xh&pyrG gene.

A cosmid library of Aspergillus oryzae was constructed essentially according to the instruction from the supplier (Stratagene) of the "SuperCosl cosmid vector kit".
Genomic DNA of A. oryzae IF04177 was prepared from protoplasts made by standard procedures (Christensen, T., et. al., Biotechnology 6 (1988) 1419-1422). After isolation of the protoplasts they were pelleted by centrifugation at 2500 rpm for 5 minutes in a Labofuge T (Heto), the pellet was suspended in 10 mM NaCl, 20 mM Tris-HCl (pH 8.0), 1 mM EDTA, 100 ug/ml proteinase K and 0.5% SDS as stated in the manual from the Supercos 1 cosmid vector kit and the rest of the DNA preparation was done according to the kit's instructions. The size of the genomic DNA was analysed by electrophoresis

using the CHEF-gel apparatus from Biorad. A 1% agarose gel was run for 20 hours at 200 volt with a 10-50 second pulse. The gel was stained by etidium bromide and photographed. The DNA was 50->100 kb in size. The DNA was partially restricted by Sau3A. The size of the restricted DNA was 20-50 kb determined by the same type of CHEF-gel analysis as above. The CsCl gradient banded SuperCosl vector was prepared according to the manual. Ligation and packaging was likewise done as described. After titration of the library all of the packaging mix from one ligation and packaging was transfected into the host cells XL 1-Blue MR and plated on 50 jig/ml ampicillin LB plates. Approximately 3800 colonies were obtained. Cosmid preparation from 10 colonies showed that they all had inserts of the expected size. The colonies were picked individually and inoculated in microtiter plate wells with 100 ul LB (100 ug/ml ampicillin) and incubated at 37°C over night. 100 ul of 50% glycerol was added to each well and the whole library was frozen at -80°C. A total of 3822 colonies were stored. This represents the A. oryzae genome apr. 4.4 times.
The individually frozen colonies in the library were inoculated onto LB-plates (100ng/ml ampiciUin) by using a multipin device with 6 times 8 pins fitting into half a microtiter dish. Plates were made containing colonies from all clones in the library. The plates were incubated at 37 C over night. Sterilized Whatman 540 filters cut to the size of a petri dish were placed upon the colonies which were incubated for two more hours at 37 C. The filters were transferred to LB plates containing 200|ag/ml of chloramphenicol and the plates were incubated over night at 37 C.The next day the filters were washed twice in 0.5 M NaOH for 5 minutes, then twice in 0.5 M Tris-HCl (pH=7.4) for 5 minutes and then twice in 2 x SSC for 5 minutes. The filters were wet with ethanol and air-dried.
The filters were hybridized with a 0.7 kb 32P labelled PCR fragment containing part of the pepE gene from A. niger. The PCR fragment was obtained by running PCR on chromosomal DNA from A. niger with two primer 700 bp apart in the DNA sequence. The hybridization was carried out for 16 hours at 65 C in 10 Denhart, 5 x SSC, 0.02 M EDTA, 1% SDS, 0.15 mg/ml polyA and 0.05 mg/ml yeast tRNA. After hybridization the filters were washed in 2 x SSC, 0.1% SDS at 45°C twice and placed on X-ray films. 5 colonies hybridized with the probe, 4 of them were subsequently shown to contain the A. oryzae pepE gene by Southern analysis of the isolated cosmid DNA, using

the same probe. Three of the cosmids were identical, thus two different cosmid clones containing pepE were isolated, they were called 7C7 and 33C1, names referring to their position in the stored library. Two overlapping fragments, a 4.3 kb EcoRI fragment (pToC299) and a 2.4 kb Hindlll (pToC301) fragment, were subcloned and partially sequenced. SEQ. ID No. 1 shows the DNA sequence and the deduced aa sequence for the protease. The gene shows strong homology to the A. niger gene.
The A. oryzae pyrG gene was cloned by cross hybridization with the A. niger pyrG gene (W. van Hartingsveldt et al., Mol. Gen. Genet 206:71-75 (1987)). A lambda library of partial SauIIlA digested A, oryzae IF04177 DNA was probed at low stringency with a 1 kb DNA fragment from the A. niger pyrG gene. A 3.8 kb Hindlll fragment from a positive clone was subcloned into a pUC118 vector. The resultant plasmid, pS02, was shown to contain ihepyrG gene by complementation of an A. niger pyrG' mutant.

A pyrG deletion plasmid, pS05, containing about 1 kb of pyrG flanking sequences on each end was constructed from the plasmid pS02. A. oryzae IF04177 was transformed with this construct and transformants were selected by resistance to 5-fluoro-orotic acid (FOA), a phenotype characteristic of pyrG mutants. One transformant, HowBlOl, was shown by Southern analysis to have the expected deletion at the pyrG locus. Being a pyrG mutant HowBlOl requires uridine for growth. HowBlOl can be transformed with the wtpyrG gene by selection for ability to grow without uridine.
The steps involved in the construction of HowBlOl are illustrated in Fig. 1.
A plasmid, pToC345, designed to replace the pepE gene with the pyrG gene, was constructed.

Two PCR reactions were run with pToC299 as template; the first primer set was :
19819 GAAGATCTGCGCGGATGTACATTGTAG
19821 TTAGTCAGAAATTCGTCCCG
The second was:
19820 CCCAAGCTTCATGCTCGACCAGGGCCTCCT
19818 GGTCTGTGTTAACCAAAGAAC
The appr. 800 bp fragment obtained with 19819/19821 was cut with BgllllHindlH and cloned together with the 1.1 kb fragment obtained with 19820/19818 and cut with HindlHIPstI into BglWPstl cut pIC19R (J. L. Marsh et al, Gene 32 (1984) 481-484). The resulting plasmid was cut at the unique Hindlll site, dephosphorylated and the 3.5 kb pyrG containing fragment from pJaL335 (described in Example 2) was inserted. The construction of pToC345 is illustrated in Figure 2.
HowBlOl was transformed with EcoRI cut pToC345 using standard procedures and transformants were selected by their ability to grow without the addition of uridine. 100 transformants were reisolated once through conidiospores. Spores were picked from single colonies on the reisolation plates and suspended in 100 ml of water with 0.01% Trition X-100. 1 ml spore suspension from each transformant and from IF04177, which was included as a control strain, were spotted on two Whatmann 540 filters placed on top of each their YPD plate. The plates were incubated at 30°C for 18 hours. The filters were removed from the plates and placed in 20% SDS for two hours at room temperature. They were then bakes for 3 minutes in a 600W micro-wawe oven. The filters were then washed for 5 minutes in 10%SDS, 2 times 5 minutes in 0.5M NaOH, 1.5M NaCl, one time 5 minutes in 0.5MTris-HCl pH=7.5, 1.5M NaCl and one time 5 minutes in 20xSSC and air dried. The two sets of filters were hybridized by standard procedures with each their 32p_iaDeUed probe. One set was hybridized with a 600bp BbuIIHindlll fragment from pToC299 containing the part of the pepE gene that was attempted to be deleted. The other set of filters was hybridized with a DNA

fragment from the A. oryzae tpi gene. Any gene present in one copy, but pepE could be used since this is a control of the amount of DNA bound to the filters.
After hybridization the filters were washed with O.lxSSC, 0.1% SDS at 65°C and the radioactivity bound to the filters were visualized by a Phospolmager. 13 of the transformants were picked for further analysis because they showed little hybridization to the pepE probe compared to the hybridization to the control probe. Chromosomal DNA was prepared by standard procedures and a Southern blot of the
EcoRI restricted DNA was hybridized with a 32p-iabelled l.lkb Bbul fragment from pToC299 containing the 3' part of XhepepE gene which was not to be deleted. In the wt strain a 4.3 kb fragment should hybridize to the probe, in a correct replacement strain the 4.3 kb fragment should be replaced by a 7.2kb fragment. Two of the transformants looked correct, one had no hybridizing bands at all and most had the wt band plus maybe one other band, indicating integration of the transforming DNA at a non¬homologous locus.
In order to isolate a pyrG~ derivative of the pepE deleted strain 10? conidiospores were spread on FOA containing plates and resistant colonies were selected. The FOA resistant colonies were reisolated, DNA was prepared and Southern analysis was performed to identify the strains in which the pyrG gene was lost via recombination between the repeat sequences flanking the gene in pToC345.

A plasmid, pToC315, designed for a two step gene deletion of \hepepE gene was constructed. A 1.6 kb EcoRI/Hindlll (the Hindlll site was blunt ended by treatment with the Klenow fragment of DNA polymerase) from pToC299 containing sequences upstream from the pepE gene was cloned together with a 1.4 kb Sall/Bbul (the Bbul fragment was blunt ended) containing the 3 'end of the pepE gene into the EcoRI/Sall cut vector pUC19. The resulting plasmid was cut at the unique Hindlll site in the pUC19 linker, dephosphorylated and the 1.8 kb pyrG containing fragment from pJets4 was inserted. The construction of pToC315 is illustrated in Fig. 3.

HowBlOl was transformed with pToC315 using standard procedures and transformants were selected by their ability to grow without the addition of uridine. After reisolation chromosomal DNA was prepared from 12 transformants, the DNA was cut with Asp718 and analysed by Southern analyses with a Bbul fragment from pToC301 containing part of ihepepE gene as a radioactive labelled probe. One transformant had the plasmid integrated in the endogenous pepE gene as revealed by the disappearance of the pepE specific Asp718 fragment, which had been replaced by two new bands as predicted if pToC315 had integrated as a single copy by homologous recombination at the pepE locus. The transformant was named ToC1089. 5x10 conidiae spores of ToC1089 were spread on plates containing 5-fluoro-orotic acid selecting for loss of the pyrG gene. This is the second step in a two step gene deletion, the pyrG gene can be lost by recombination with either of two pairs of identical sequences, one of which will result in the deletion of the pepE gene as well. The procedure is depicted in Fig. 4. The frequency of 5-fluoro-orotic acid resistance was approximately 10". The 5-fluoro-orotic acid resistant colonies were reisolated and a strain deleted for the pepE gene was identified by Southern analysis.
EXAMPLE 2
The A. oryzae pepC gene was cloned by cross-hybridization with the A. niger gene. The A. niger gene was obtained as a 1.1 kb PCR fragment from a PCR reaction with A. niger chromosomal DNA and pepC specific primers made according to the pepC sequence published by Frederick. G.D et al. Gene 125 (1993) 57-64. The fragment was shown to contain pepC sequences by DNA sequencing. It hybridizes to A. oryzae chromosomal DNA under stringent conditions and Southern analysis showed that A. oryzae contains a single pepC like gene.
The pepC gene was deleted by a two step gene replacement method (G. May in "Applied Molecular Genetics of Filamentous Fungi" (1992) pp. 1-25. Eds. J. R. Kinghorn and G. Turner; Blackie Academic and Professional). As marker was used the A.

otyzae pyrG gene, the A. oryzae strain was a pyrG strain made by deletion of the pyrG
gene.
From the published cDNA nucleotide sequence encoding A. niger pepC (Frederick G.D et al. Gene 125 (1993) 57-64) two oligonucleotides were designed so that the encoding part of the pepC gene where amplified in a PCR reaction. The primer #5258 where made so that the 3' end of the nucleotide sequence corresponds to the N-terminal part of the pepC gene (underline) and the 5'-end is for facilitating cloning (contains a BamHI restriction endonuclease site). The primer #5259
where made so that the 3' end of the nucleotide sequence corresponds to the C-terminal part of thepepC gene and the 5'-end is for facilitating cloning (contains a Xhol restriction endonuclease site).
Genomic DNA from A. niger was used as template in the PCR reaction. Amplification reaction were performed in 100 ul volumes containing 2.5 units Taq-polymerase, 100 ng of A, niger genomic DNA, 50 mM KC1, 10 raM Tris-HCl pH 8.0,1.5 mM MgCl2, 250 nM of each dNTP, and 100 pM of each of the two primers described above.
Amplification was carried out in a Perkin-Elmer Cetus DNA Termal 480, and consisted of one cycle of 3 minutes at 94°C, followed by 25 cycles of 1 minutes at 94°C, 30 seconds at 55°C, and 1 minutes at 72°C. The PCR reaction produces one DNA fragment of ca. 1.1 kb in length. This fragment were isolated by gel electrophoresis, purified, cloned into the vector pCR*^ (Invitrogen Corporation), and sequenced using standard methods known in the art of molecular biology. The resulting plasmid were called pJaL197.

Southern blot of genomic DNA from A. oryzae IF04177 where hybridized with the 1.1 kb J2P labelled EcoRI DNA fragment from pJaL197 clone containing the A, niger pepC gene. Genomic DNA was cut with the following restriction enzymes: EcoRI, BamHI, Xhol, and Hindlll. Hybridization was carried out for 16 hours at 65°C in 10 * Denhart, 5 * SSC, 0.02 M EDTA, 1% SDS, 0.15 mg/ml polyA and 0.05 mg/ml yeast tRNA. After hybridization the filters were washed in 2 x SSC, 0.1% SDS at 65°C twice and placed on X-ray films. The probe hybridized to a single size of fragment in each of the four digest, indicating that the pepC gene is present in a single copy in A. oryzae IF04177.
A partially library of A. oryzae genomic DNA was constructed containing BamHI fragments with a size of 4.5-5.5 kband ligated into the vector pIC19H. The above A. niger pepC gene clone was radiolabeled and used to probe the partial A. oryzae BamHI genomic library. Hybridization was carried out as described above. About 4000 E. coli colonies were screened and four positive colonies was obtained. The 4 clones was shown to be identical by restriction enzyme digestion. One of these clones called pJaL235 (Fig. 5), with an insert of 4.6 kb, was analyses further by restriction mapping and Southern blotting. This shows that the pepC gene is located in a 2.9 kb BamHIISall fragment. Sequencing of this 2.9 kb BamHIISall fragment revealed the presence of a long open reading frame of 495 amino acids interrupted by two introns with consensus sequences indicative of intron splicing. The sequence of the A. oryzae pepC gene is shown in SEQ ID No. 3.
By PCR, with the primer #7659
where the 3' end of the nucleotide sequence corresponds to position 7-26 in pS02 (underline) and the 5'-end is for facilitating cloning (contains a BgUI restriction endonuclease site), and the primer #7656


where the 3' end of the nucleotide sequence corresponds to position 385-407 in pS02 (underline) and the 5'-end is for facilitating cloning (contains a EcoRI and HindlH restriction endonuclease site), on the plasmid pS02 a 432 bp fragment was amplified. The fragment was digested with BgllJ and EcoRI and isolated by gel electrophoresis, purified, and cloned into the corresponding site in pS02, resulting in plasmid pJaL335 (The construction is outlined in Fig. 6.
Plasmid pJaL235 was digested with Pvul and treated with Klenow polymerase to make the ends blunt and then digested with Hindlll. The 2.6 kb fragment were isolated by gel electrophoresis, and purified. The 2.6 kb fragment was cloned into pUC12 digested with Sma/and Hindlll giving plasmid pJaL308.
Plasmid pJaL308 was digested with Smal and treated with bacterial alkaline phosphatase to remove the 5* phosphate groups according to the manufacturers instructions and phenol extracted and precipitated.
Plasmid pS02 was digested with Hindlll, and treated with Klenow polymerase to make the ends blunt. The 3.8 kb fragment encoding the A. oryzae pyrG gene were isolated by gel electrophoresis, and purified.
The two fragments are mixed together and ligated. After transformations of E. coli, the colonies carrying the correct plasmids are identified by restriction enzyme digestion of mini-plasmid preparations. The construction of pJaL363 is illustrated in Fig. 7.
Plasmid pJaL363 consist of pUC12 vector containing a fragment which carries the pepC gene flanked by an EcoRI site and an Hindlll and where the pepC is interrupted by an 3.8 kb DNA fragment encoding the A. oryzae pyrG gene.

Plasmid pJaL335 is digested with HindHI, and treated with Klenow polymerase to make the ends blunt. The 3.5 kb fragment encoding the A. otyzaepyrG gene is isolated by gel electrophoresis, and purified. The fragment is cloned into pJaL308 Smal restriction site. The construction of pJaLz is outlined in Fig. 8. The plasmid consist of pUC 12 vector containing a fragment which carries the pepC gene flanked by an EcoRl site and an HindHI and where the pepC is interrupted by an 3.5 kb DNA fragment encoding the A. oryzaepyrG gene.

15 ug of either one of the disruption plasmids is digested to completion by HindHI and EcoRl. The completeness of the digest is checked by running an aliquot on a gel and the remainder of the DNA is phenol extracted, precipitated and resuspended in 10 ufof sterile water.
The transformation of A. oryzae HowBlOI host strain is preformed by the protoplast method (Christensen et al. Biotechnology (1988) 6:1419-1422). Typically, A. oryzae mycelia is grown in a rich nutrient broth. The mycelia is separated from the broth by filtration. The enzyme preparation Novozyme® (Novo Nordisk) is added to the mycelia in osmotically stabilizing buffer such as 1.2 M MgSCU buffered to pH 5.0 with sodium phosphate. The suspension was incubated for 60 minutes at 37°C with agitation. The protoplast is filtered through mira-cloth to remove mycelial debris. The protoplast is harvested and washed twice with STC (1.2 M sorbitol, 10 mM CaCl2, 10 mM Tris-HCl pH 7.5). The protoplast is finally resuspended in 200-1000 jul STC.
For transformation 5 ug DNA is added to 100 u.1 protoplast suspension and then 200 ul PEG solution (60% PEG 4000, 10 mM CaCl2, 10 mM Tris-HCl pH 7.5) is added and the mixture is incubated for 20 minutes at room temperature. The protoplast is harvested and washed twice with 1 -2 M sorbitol. The protoplast is finally resuspended 200 ml 1.2 M sorbitol, plated on selective plates (minimal medium + 10 g/1 Bacto-Agar (Difco), and incubated at 37°C. After 3-4 days of growth at 37°C, stable transformants appear as vigorously growing and sporulating colonies.


' From the stable colonies individual spores is streaked on fresh minimal plates. Single colonies is selected and restreaked to give pure cultures. These are used to inoculate 10 ml of liquid YPM medium (1% yeast extract, 1% peptone, 2% maltose). After 18 hours at 30°C shaking at 180 rpm, the mycelia are harvested on filter paper. Mycelia is then transfer to an 2 ml eppendorf tube and freeze dried. After freeze drying DNA is prepared from the individual mycelia by grinding the mycelia to a fine powder with a pestle in the tube. This powder are resuspended in 0.5 ml of 50 mM EDTA pH 8.0, 0.2% SDS, 1 ul DEP by vortexing. These are incubate at 65°C for 20 minutes. After this is added 0.1 ml 5 M KAc pH 6.5, the solution is mixed and incubated on ice for 5 minutes. The cell debris is separated from the DNA solution by centrifugation at 20.000 rpm for 5 minutes. 0.4 ml supernatant are precipitated with 0.3 ml isopropanol and centrifiigated at 20.000 rpm for 10 minutes. The DNA pellet is redissolved in 100 ul of sterile TE buffer containing 0.1
3 |ig of each DNA is digested with EcoRI, fractionated by agarose gel electrophoresis, transferred to Immobilan-N membrane filters, and probe with the 1.5 kb 32P labelled Ncol DNA fragment from pJaL335 containing part of the pepC protease gene. Strains which carry a disruption of the pepC are easily recognized by that the wild type band on 3.6 kb is shifted to a 7.4 kb band in the transformant.
The disrupted A. oryzae pepC strain is made pyrG minus by selecting spontaneous mutant resistance to 5-fluoro-orotic acid, a phenotype characteristic of pyrG mutants. Being a pyrG mutant the strain requires uridine for growth. The strain can be transformed with the wtpyrG gene by selection for the ability to grow without uridine.
EXAMPLE 3

The areAA strain was constructed as follows. The areA gene from A. oryzae was cloned. A pyrG strain also deficient for either pepC or pepE or pepC plus pepE was

transformed with a plasmid carrying the pyrG gene inserted between DNA fragments upstream and downstream from the areA gene. The coding region for areA was not present on the plasmid. Transformants were selected for their ability to grow in the absence of uridine and in the presence of chlorate. This double selection selects both for a functional pyrG gene and for areA minus. Strains obtained by this selection procedure were finally screened by Southern analysis to identify those in which the chromosomal areA gene was substituted by the pyrG gene.

The A. oryzae areA gene was cloned by cross hybridization to the A. nidulans areA gene (B. Kudla et al., EMBO J. 9:1355-1364 (1990)). A genomic library of A. oryzae IF04177 was prepared by partial digestion of chromosomal DNA with SauIIIA and cloning of the obtained DNA fragments into the vector 1GEM-II (obtained from Promega). Cross hybridization of the library with the A. nidulans areA gene was performed in 40% formamide at 37°C. Hybridizing 1 clones were isolated and from these fragments were sub-cloned into the vector pBluescript SK+ (obtained from Stratagene) giving rise to the plasmids pSK5 and pSK9 illustrated in Fig. 9. The cloned gene was able to complement an A. nidulans areA mutant, proving that it is indeed the A oryzae areA homolog. 5643 bp of the clone was sequenced, and comparison of the sequences of the A. oryzae and the A. nidulans areA genes shows that they are highly homologous. The sequence of the A. oryzae areA gene is shown in SEQ ID No. 5.

In order to delete the areA gene from the A. oryzae chromosome the plasmid pToC266 was constructed. pToC266 contains a 2.1 kb DNA fragment originating upstream of the areA gene (isolated from pSK5) and a 1.4 kb DNA fragment originating downstream from the areA gene (isolated from pSK9). The two fragments are separated by approximately 3.2 kb in the genome, the coding region is situated in this part of the gene. The A. oryzae pyrG gene from pjers4 was inserted between the areA upstream and downstream DNA fragments. The construction of pToC266 is illustrated in Figs. 10a and 10b. pToC266 has a unique EcoRI site and was linearized by cutting with this restriction enzyme before used in transformations.
A pyrG strain also deficient for either pepC or pepE or pepC plus pepE is transformed with linearized pToC266. Transformants are selected on minimal plates (Cove Biochem. biophy. Acta (1966) 113 : 51-56) containing glutamine as the nitrogen source and glucose as the carbon source. Transformants are reisolated twice on the same type of plates, and then subjected to growth test on different nitrogen sources. Transformants growing well on glutamine but not on nitrate, ammonium or urea are expected to be deleted for areA. The deletion is confirmed by Southern analysis.
EXAMPLE 4

A plasmid containing the bar gene from Streptomyces hygroscopius (C. J. Thompson et. al, EMBO J. 6 : 2519-2523 (1987)) inserted after the A. oryzae TAKA-amylase promoter and followed by a fragment containing the transcriptional terminator and poEyadenylation signal from the A. niger gla gene was constructed.
The plasmid, pMT1606, can be used for selection of glufosinate resistant transformants of A. oryzae. pMT1606 was constructed by isolating the bar gene from the plasmid pBPIT (B. Straubinger et. al, Fungal Genetics Newsletter 39 : 82-83 (1992)) and

cloning it into the fungal expression plasmid p775 described in EP publication No. 0 098 993 Al. Fig. 11 illustrates the construction of pMT1606.
EXAMPLES

An A. oryzae areAkgepE&pepC strain is transformed with the plasmid pToC56 (Fig. 12), which is a fungal expression plasmid for the mammalian enzyme chymosin, by co-transformation with pMT1606. Construction of the plasmid pToC56 is described in EP publication No. 0 98 993.
Transformants are selected for growth on minimal medium containing 10 mM ammonium and 1 mg/ml glufosinate and screened for the presence of pToC56 by the ability to produce chymosin. The transformants are grown in shake flasks in minimal medium containing maltodextrin and glutamine for 4 days at 30°C.The content of chymosin in the supernatants were analysed by SDS-Page and Western blotting.
Example 6

Construction of an expression plasmid for pepC. Plasmid pJaL235 was digested with AatH and Nsil and treated with Klenow polymerase to make the ends blunt. The 1.7 kb fragment was isolated by gel electrophoresis, and purified. The 1.7 kb fragment was cloned into pIC19H digested with Smal giving pJaL365.
Plasmid pJaL365 was digested with BamHI and Xhol and the 1.7 kb fragment was isolated by gel electrophoresis, and purified. The 1.7 kb fragment was cloned into pToC68 digested with BamHI and Xhol giving pJaL368 (Figs. 13a and 13 b).

An A. oryzae strain is transformed with the plasmid pJaL368, which is a fungal expression plasmid for the protease PepC, by cotransformation with pToC90.
Transformants are selected for growth on minimal medium containing 10 mM acetamide and screened for the presence of pJaL368 by the ability to produce the protease PepC.
Example 7
Overevpression nf pc.pK
A plasmid called pToC338 carrying the pepE gene fused to the TAKA-amylase promoter from A. oryzae was constructed. Figs. 14a and 14b depicts the construction.
An EcoRllSall fragment from pToC299 containing most of the coding region and appr. 430 bp of the 3' untranslated region of pepE was cloned into EcoRllBamHl cut pUC19 together with a synthetic DNA fragment of the following sequence:
8681 GATCCACCATGAAG 8747 GTGGTACTTCAGCT
The resulting plasmid called pToC334 was cut with BamHIIEcoRI and a fragment containing the entire structural gene of pepE with a BamHI site fused immediately upstream of the start codon was isolated, approximately 430 bp of untranslated 3' sequence was also present in the fragment. The fragment was cloned into EcoRIISai7 cut pUC19 together with an approximately 1.1 kb SalllBamHI fragment from the plasmid p775 containing the TAKA-amylase promoter from A. oryzae. The resulting plasmid was named pToC338.
pToC338 was co-transformed into A, oryzae JaL125 (an A. oryzae alp minus strain described in Danish Patent Application No. 0354/96) with pToC90, which contains the A. nidulans acetamidase (amdS) gene, using standard procedures {e.g. as described in

EP 0 098 993 Al). Transformants were selected by their ability to use acetamide as the sole nitrogen source. 11 transformants were reisolated twice through conidiospores. The transformants were fermented for three days at 30°C inlO ml YPM (YP with 2% maltose) and the fermentation broth was analysed by SDS-page. One of the transformants produced a protein of the same size as the protein encoded by the pepE gene, protease activity measurements confirmed that the broth from that transformant contained a higher activity toward casein at pH=5.5 compared to the host strain JaL125. The protein was purified and N-terminal sequencing showed that it is indeed the protein encoded by the pepE gene. The N-terminal of the secreted protein was:
gly*-arg-his-asp-val-leu-val-asp-asn-phe-leu-asn-ala-gln-tyr-phe-ser-glu-ile-glu-ile-gly-
thr-pro-pro-gln-lys-phe-lys
*this residue could also be a lysine.
confirming the expression of the PepE protease.

SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT:
(A) NAME: Novo Nordisk A/S
(B) STREET: Novo AUe
(C) CITY: Bagsvaerd
(E) COUNTRY: DENMARK (ii) TITLE OF INVENTION: Novel Microorganisms
(Hi) NUMBER OF SEQUENCES: 12
(iv) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO)
(2) INFORMATION FOR SEQ ID NO: 1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2454 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus oryzae (ix) FEATURE:
(A) NAME/KEY: exon
(B)LOCATION:603..70l (ix) FEATURE:
(A) NAME/KEY: intron
(B) LOCATION:702..791 (ix) FEATURE:

(A) NAME/KEY: exon
(B) LOCATION:792..942 (ix) FEATURE:
(A) NAME/KEY: intron
(B)LOCATION:943..10GI (ix) FEATURE:
(A) NAME/KEY: exon
(B)LOCATION:1002..1656 (ix) FEATURE:
(A) NAME/KEY: intron
(B) LOCATION:1657..1?13 (ix) FEATURE:
(A) NAME/KEY: exon
(B)LOCATION:1714..2G01 (ix) FEATURE:
(A) NAME/KEY: CDS
(B)LOCATION:join(603-701, 792-942, 1002-1656, 1714-2001) (ix) FEATURE:
30

(A) NAME/KEY: mat_peptide
(B) LOCATION:603..2001
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
GCGCGGATGT ACATTGTAGG TACATGGATA GTATGTACTG TATATACCGT CATTTAGTAA 60
GGCAACTAAC TAACTTATCA GCCTAGCTCC CGAGACGGCC TTACATCATC CGCAGCGGCA 120
ATCAGCTCCA CACCCTTGGA TAGTGATAAG AGACACAAGG AGTTCTGAGT ATGGTATTAT 180
AAGTGCAGTG AGTTGGGATG AAACAGAGAG ATAGAGGGAA TACTCCTATT TATCAATGAA 240
CGTATACAGA CATACCCCAG CAGCGTTCCT GGCGGTATTG TAAAAGGGCC GTACCTTGGA 300
GATCAAGTGA TGAGACACCC GTGATGCAGG AACTCCACTT CAATCCAATG ACGCATCGAG 360
TTGCTCCCTG ATTGGTTGAT ACGCAGGTCG CTCCGCAACC GGTCCGCATC ACCTCACTTC 420
CCTCCCCCAG ACCTGGAGGT ACCTCTCCCG TCCTTCTCTC CCTCTCCATC CCATCATCTA 480
TCCCTCTCCA GACCCTGATT GTATTTCATC ATTCCTATCG TCCCATATTA ATAGAGTATT 540
GCTAGTTTTC TTTTGATTTC GTCTGTTGAG GTGCTGCTTT TTTGTCGCCG TTGTCGCCCA 600
CC ATG AAG TCG ACC TTG GTT ACG GCC TCT GTG CTG TTG GGC TGT GCT
647
Met Lys Ser Thr Leu Val Thr Ala Ser Val Leu Leu Gly Cys Ala
15 10 15
TCC GCC GAG GTT CAC AAG CTG AAG CTC AAC AAG GTG CCC GTG TCC GAG 695

Ser Ala Glu Val His Lys Leu Lys Leu Asn Lys Val Pro Val Ser Glu
20 25 30
CAA TTT GTGAGTAGAC CTTACTATTC CGGCCATGAA AATATTCATC TACC-
CATCTG 751
GlnPhe
AAAGCTTGTC GGGACGAATT TCTGACTAAA TCGTATCCAG AAC TTG CAC AAC ATC 806
Asn Leu His Asn He
35
GAC ACC CAT GTG CAG GCT CTC GGC CAG AAG TAC ATG GGA ATC CGT CCC 854
Asp Thr His Val Gin Ala Leu Gly Gin Lys Tyr Met Gly He Arg Pro
40 45 50
AAC ATC AAG CAA GAT CTT CTC AAT GAG AAC CCG ATT AAC GAT ATG GGA 902
Asn He Lys Gin Asp Leu Leu Asn Glu Asn Pro He Asn Asp Met Gly
55 60 65 70
CGT CAT GAT GTC CTT GTT GAC AAC TTC CTG AAT GCA CAA T 942
Arg His Asp Val Leu Val Asp Asn Phe Leu Asn Ala Gin
75 80
GTACGAAACC CTAGTAATAC TTGAAGGGGG GCTCCAACTT ACGCGTAGAT TCTCTAAAG AC 1003
Tyr
TTC TCC GAA ATC GAG ATC GGT ACT CCT CCA CAG AAG TTC AAG GTG
GTC 1051
Phe Ser Glu He Glu lie Gly Thr Pro Pro Gin Lys Phe Lys Val Val
85 90 95 100
CTT GAC ACT GGC AGC TCA AAC CTA TGG GTG CCC TCT TCG GAG TGT GGT 1099
Leu Asp Thr Gly Ser Ser Asn Leu Trp Val Pro Ser Ser Glu Cys Gly
105 110 115
TCT ATC GCC TGC TAT TTG CAT AAC AAG TAC GAC TCA TCC TCG TCC
TCC 1147
Ser He Ala Cys Tyr Leu His Asn Lys Tyr Asp Ser Ser Ser Ser Ser

120 125 130
ACG TAC CAG AAG AAT GGC AGC GAA TTT GCC ATC AAG TAC GGC TCT
GGT 1195
Thr Tyr Gin Lys Asn Gly Ser Glu Phe Ala He Lys Tyr Gly Ser Gly
135 140 145
AGC CTG AGT GGT TTT GTT TCT CAG GAT ACT CTC AAG ATC GGT GAC CTG 1243
Ser Leu Ser Gly Phe Val Ser Gin Asp Thr Leu Lys He Gly Asp Leu
150 155 160
AAG GTG AAG GAT CAG CTG TTC GCC GAG GCT ACT AGT GAG CCC GGC
CTT 1291
Lys Val Lys Asp Gin Leu Phe Ala Glu Ala Thr Ser Glu Pro Gly Leu
165 170 175 180
GCT TTT GCC TTT GGC CGC TTT GAT GGT ATC CTT GGG TTG GGA TTT GAC 1339
Ala Phe Ala Phe Gly Arg Phe Asp Gly He Leu Gly Leu Gly Phe Asp
185 190 195
ACA ATT TCC GTC AAC AAG ATT CCT CCA CCC TTC TAT AGC ATG CTC GAC 1387
Thr He Ser Val Asn Lys He Pro Pro Pro Phe Tyr Ser Met Leu Asp
200 205 210
CAG GGC CTC CTC GAC GAG CCA GTC TTT GCT TTC TAC CTT GGA GAC
ACT 1435
Gin Gly Leu Leu Asp Glu Pro Val Phe Ala Phe Tyr Leu Gly Asp Thr
215 220 225
AAC AAG GAA GGT GAT GAC TCC GTA GCG ACA TTC GGC GGT GTT GAC AAG 1483
Asn Lys Glu Gly Asp Asp Ser Val Ala Thr Phe Gly Gly Val Asp Lys
230 235 240
GAT CAC TAC ACC GGC GAG TTG GTC AAG ATT CCC CTT CGC CGC AAG
GCC 1531
Asp His Tyr Thr Gly Glu Leu Val Lys He Pro Leu Arg Arg Lys Ala
245 250 255 260
TAC TGG GAG GTT GAC CTT GAT GCT ATC GCC CTT GGC GAT AGC GTT GCT 1579

Tyr Tip Glu Val Asp Leu Asp Ala He Ala Leu Gly Asp Ser Val Ala
265 270 275
GAA CTC GAT AAC ACC GGT GTC ATT CTG GAT ACC GGC ACT TCC CTT ATC 1627
Glu Leu Asp Asn Thr Gly Val He Leu Asp Thr Gly Thr Ser Leu He
280 285 290
GCC TTG GCC ACC ACC CTT GCC GAG CTT AT GTAAGTCAAG
CCAGTGTACT 1676
Ala Leu Ala Thr Thr Leu Ala Glu Leu He
295 300
GTGCATGTCT GTCATACTCT TACTAACTAT TCTGAAG T AAC AAG GAA ATC GGT 1729
Asn Lys Glu lie Gly 305
GCC AAG AAG GGC TTC ACC GGC CAA TAC TCG GTT GAC TGT GAC AAG CGC 1777
Ala Lys Lys Gly Phe Thr Gly Gin Tyr Ser Val Asp Cys Asp Lys Arg
310 315 320
GAT TCC TTG CCT GAC CTC ACC TTC ACC CTG AGC GGA TAC AAC TTC ACC 1825
Asp Ser Leu Pro Asp Leu Thr Phe Thr Leu Ser Gly Tyr Asn Phe Thr
325 330 335
ATT GGT CCC TAC GAC TAC ACT CTT GAA GTC CAG GGA TCT TGC ATC
AGC 1873
He Gly Pro Tyr Asp Tyr Thr Leu Glu Val Gin Gly Ser Cys He Ser
340 345 350 355
GCC TTC ATG GGC ATG GAC TTC CCT GAA CCC GTT GGC CCC TTG GCC ATC 1921
Ala Phe Met Gly Met Asp Phe Pro Glu Pro Val Gly Pro Leu Ala He
360 365 370
CTG GGT GAC GCG TTC CTC AGG AAG TGG TAC AGT GTG TAC GAC CTC GCC 1969
Leu Gly Asp Ala Phe Leu Arg Lys Trp Tyr Ser Val Tyr Asp Leu Ala
375 380 385

AAC GGT GCT GTT GGC CTG GCC AAG GCT AAG TAACCAAGTA
ATCTACCATG 2019
Asn Gly Ala Val Gly Leu Ala Lys Ala Lys
390 395
CTATGTTCTT ATTGGTTGCT TGTGTATGTG AGACAATGGT ACATGATAGC CTGCCTCGGT 2079
AGTTGGTTGG CCTTTTTCTG TTACGGGAAA TCGGCAAAGC CTTGTTTTCG CTTATGACCT 2139
CTATCCTGTT TGTTATTGAT ATTTTGTGTG ACTCAGTGAG CCACTGGCTA TGCTCTAATG 2199
ACATTCATTG GATGCCGATA GTTCTATATA CATTGCGATT TTAACGCGTA TCTTTGATCT 2259
ATCGGTACAA TGATTCCCTA CTAAAGGTAG CCCAACTAGA CAACTATGCC TACGACCTCT 2319
CTACATTCTT CATAGCTCCG TGTGGAGTCC GTCTCATACA ACCTCGAGCA ACCTGCAGTT 2379
CTTTGGTTAA CACAGACCAC ACCTTAAAAC GGCACGATCC ATTCGAATAG ACAAGCCCTC 2439
TTAATATTTG AATTC 2454
(2) INFORMATION FOR SEQ ID NO: 2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 397 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Lys Ser Thr Leu Val Thr Ala Ser Val Leu Leu Gly Cys Ala Ser
15 10 15
Ala Glu Val His Lys Leu Lys Leu Asn Lys Val Pro Val Ser Glu Gin
20 25 30
Phe Asn Leu His Asn He Asp Thr His Val Gin Ala Leu Gly Gin Lys
35 40 45
Tyr Met Gly He Arg Pro Asn He Lys Gin Asp Leu Leu Asn Glu Asn
50 55 60
Pro He Asn Asp Met Gly Arg His Asp Val Leu Val Asp Asn Phe Leu
65 70 75 80
Asn Ala Gin Tyr Phe Ser Glu He Glu He Gly Thr Pro Pro Gin Lys
85 90 95
Phe Lys Val Val Leu Asp Thr Gly Ser Ser Asn Leu Trp Val Pro Ser
100 105 110
Ser Glu Cys Gly Ser He Ala Cys Tyr Leu His Asn Lys Tyr Asp Ser
115 120 125
Ser Ser Ser Ser Thr Tyr Gin Lys Asn Gly Ser Glu Phe Ala He Lys
130 135 140
Tyr Gly Ser Gly Ser Leu Ser Gly Phe Val Ser Gin Asp Thr Leu Lys
145 150 155 160
He Gly Asp Leu Lys Val Lys Asp Gin Leu Phe Ala Glu Ala Thr Ser
165 170 175
Glu Pro Gly Leu Ala Phe Ala Phe Gly Arg Phe Asp Gly He Leu Gly
180 185 190
Leu Gly Phe Asp Thr He Ser Val Asn Lys He Pro Pro Pro Phe Tyr
195 200 205
Ser Met Leu Asp Gin Gly Leu Leu Asp Glu Pro Val Phe Ala Phe Tyr
210 215 220
Leu Gly Asp Thr Asn Lys Glu Gly Asp Asp Ser Val Ala Thr Phe Gly
225 230 235 240

Gly Val Asp Lys Asp His Tyr Thr Gly Glu Leu Val Lys He Pro Leu
245 250 255
Arg Arg Lys Ala Tyr Trp Glu Val Asp Leu Asp Ala He Ala Leu Gly
260 265 270
Asp Ser Val Ala Glu Leu Asp Asn Thr Gly Val He Leu Asp Thr Gly
275 280 285
Thr Ser Leu He Ala Leu Ala Thr Thr Leu Ala Glu Leu He Asn Lys
290 295 300
Glu He Gly Ala Lys Lys Gly Phe Thr Gly Gin Tyr Ser Val Asp Cys
305 310 315 320
Asp Lys Arg Asp Ser Leu Pro Asp Leu Thr Phe Thr Leu Ser Gly Tyr
325 330 335
Asn Phe Thr He Gly Pro Tyr Asp Tyr Thr Leu Glu Val Gin Gly Ser
340 345 350
Cys He Ser Ala Phe Met Gly Met Asp Phe Pro Glu Pro Val Gly Pro
355 360 365
Leu Ala He Leu Gly Asp Ala Phe Leu Arg Lys Trp Tyr Ser Val Tyr
370 375 380
Asp Leu Ala Asn Gly Ala Val Gly Leu Ala Lys Ala Lys
385 390 395
(2) INFORMATION FOR SEQ ID NO: 3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3224 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus oryzae (ix) FEATURE:
(A) NAME/KEY: exon
(B)LOCATION:388..756 (ix) FEATURE:
(A) NAME/KEY: intron
(B)LOCATION:757..817 (ix) FEATURE:
(A) NAME/KEY: exon
(B)LOCATION:818..I753 (ix) FEATURE:
(A) NAME/KEY: intron
(B) LOCATION:1754..1814 (ix) FEATURE:

(A) NAME/KEY: exon
(B) LOCATION-.1815..1997 (ix) FEATURE:
(A) NAME/KEY: matjeptide (B)LOCATION:388..1997 (ix) FEATURE:
(A) NAME/KEY: CDS (B)LOCATION:join(388..756,818..1753,1815..1997)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
GGATCCATTA CCCTCTTACC GCCATACCCC AGGTCTTGCG ACCGCGCTAA TCGGGAGCGA 60

TCGACGCGCG GCACCTCCTC AGTAAAGCTG TGTCATCATT GTAAATTACC GTATCCCGGT 120
TGCATCATCC TCCGCTGCCC TTGCCTGCTT GGGGGATCGA CCTATTAAGC CCAGCTATCT 180
TACACCTGCT CCCATCCTCC TCTTCTCCAA CTCCTCATCC ATCCTTCCTC CTCCTTCTTC 240
CTTTTAACCC CCCCAACTCA GCATCGTTCC ATCCTCCCAT CTTTCCTTTC TTTTTACCTC 300
AAATCTCCAT CTGTATTCTT TCCTCTTAGA ACTCTTCCTT TCCCCCCTTC TGTACCTTGT 360
GTTTAGACGT CACTCTTGTT GCCCATC ATG AGA GGC ATC CTC GGC CTT
TCC 411
Met Arg Gly He Leu Gly Leu Ser
1 5
CTG CTG CCA CTA CTA GCA GCG GCC TCC CCC GTT GCT GTT GAC TCC ATC 459
Leu Leu Pro Leu Leu Ala Ala Ala Ser Pro Val Ala Val Asp Ser He
10 15 20
CAC AAC GGA GCG GCT CCC ATT CTT TCG GCC TCA AAT GCC AAA GAG GTT 507
His Asn Gly Ala Ala Pro He Leu Ser Ala Ser Asn Ala Lys GIu Val
25 30 35 40
CCA GAC TCT TAC ATT GTC GTC TTC AAG AAG CAT GTT TCC GCT GAA
ACG 555
Pro Asp Ser Tyr He Val Val Phe Lys Lys His Val Ser Ala Glu Thr
45 50 55
GCT GCT GCT CAT CAC ACC TGG GTG CAG GAC ATC CAC GAT TCG ATG ACT 603
Ala Ala Ala His His Thr Trp Val Gin Asp He His Asp Ser Met Thr
60 65 70
GGA CGC ATC GAC CTG AAG AAG CGC TCT CTT TTT GGT TTC AGT GAT GAC 651
Gly Arg He Asp Leu Lys Lys Arg Ser Leu Phe Gly Phe Ser Asp Asp
75 80 85

CTT TAC CTC GGT CTC AAG AAC ACC TTC GAT ATC GCC GGG TCC CTA GCG 699
Leu Tyr Leu Gly Leu Lys Asn Thr Phe Asp He Ala Gly Ser Leu Ala
90 95 100
GGC TAC TCC GGA CAT TTC CAT GAG GAT GTG ATC GAG CAG GTC CGG
AGA 747
Gly Tyr Ser Gly His Phe His Glu Asp Val He Glu Gin Val Arg Arg
105 110 115 120
CAT CCT GAT GTAGGTTCCC CCCTCGGCCC ACCCGTTTTT GTAGAGCCCT
796 His Pro Asp
TGGTCTAACT TGATTTTCAA G GTT GAA TAC ATC GAG AAA GAC ACC GAA GTC 847
Val Glu Tyr He Glu Lys Asp Thr Glu Val
125 130
CAC ACC ATG GAG GAG ACA ACC GAG AAG AAT GCT CCC TGG GGC TTG GCT 895
His Thr Met Glu Glu Thr Thr Glu Lys Asn Ala Pro Trp Gly Leu Ala
135 140 145
CGT ATC TCT CAC CGT GAC AGC CTC TCG TTC GGT ACC TTT AAC AAG
TAC 943
Arg He Ser His Arg Asp Ser Leu Ser Phe Gly Thr Phe Asn Lys Tyr
150 155 160 165
CTG TAT GCT TCG GAA GGC GGT GAG GGT GTC GAT GCT TAT ACT ATT GAC 991
Leu Tyr Ala Ser Glu Gly Gly Glu Gly Val Asp Ala Tyr Thr He Asp
170 175 180
ACT GGT ATC AAC ATT GAG CAT GTC GAT TTC GAG GAT CGA GCA CAC TGG 1039
Thr Gly He Asn He Glu His Val Asp Phe Glu Asp Arg Ala His Trp
185 190 195
GGA AAG ACC ATC CCT AGC AAT GAT GAG GAT GCG GAT GGC AAC GGA CAC 1087

Gly Lys Thr He Pro Ser Asn Asp Glu Asp Ala Asp Gly Asn Gly His
200 205 210
GGA ACT CAC TGC TCC GGA ACC ATT GCT GGT AAG AAG TAC GGT GTT GCC 1135
Gly Thr His Cys Ser Gly Thr He Ala Gly Lys Lys Tyr Gly Val Ala
215 220 225
AAG AAG GCC AAC ATC TAT GCC GTC AAG GTC TTG AGG TCC AGC GGT
TCT 1183
Lys Lys Ala Asn He Tyr Ala Val Lys Val Leu Arg Ser Ser Gly Ser
230 235 240 245
GGC ACT ATG TCC GAT GTC GTT CTG GGT GTC GAG TGG GCC GTC CAG TCC 1231
Gly Thr Met Ser Asp Val Val Leu Gly Val Glu Trp Ala Val Gin Ser
250 255 260
CAC CTC AAG AAG GCT AAG GAC GCC AAA GAT GCC AAG GTC AAG GGT TTC 1279
His Leu Lys Lys Ala Lys Asp Ala Lys Asp Ala Lys Val Lys Gly Phe
265 270 275
AAG GGC AGC GTT GCC AAC ATG AGT CTT GGT GGT GCC AAG TCC AGG ACC 1327
Lys Gly Ser Val Ala Asn Met Ser Leu Gly Gly Ala Lys Ser Arg Thr
280 285 290
CTT GAG GCT GCT GTC AAT GCT GGT GTT GAG GCT GGT CTT CAC TTC GCC 1375
Leu Glu Ala Ala Val Asn Ala Gly Val Glu Ala Gly Leu His Phe Ala
295 300 305
GTT GCT GCT GGT AAC GAC AAT GCC GAT GCC TGC AAC TAC TCC CCT
GCT 1423
Val Ala Ala Gly Asn Asp Asn Ala Asp Ala Cys Asn Tyr Ser Pro Ala
310 315 320 325
GCC GCT GAG AAT GCC ATC ACT GTC GGT GCC TCG ACC CTT CAG GAT GAG 1471
Ala Ala Glu Asn Ala He Thr Val Gly Ala Ser Thr Leu Gin Asp Glu
330 335 340

CGT GCT TAC TTC TCC AAC TAC GGA AAG TGC ACT GAC ATC TTT GCC CCG 1519
Arg Ala Tyr Phe Ser Asn Tyr Gly Lys Cys Thr Asp He Phe Ala Pro
345 350 355
GGT CCC AAC ATT CTT TCC ACC TGG ACT GGC AGC AAG CAC GCT GTC AAC 1567
Gly Pro Asn He Leu Ser Thr Tip Thr Gly Ser Lys His Ala Val Asn
360 365 370
ACC ATC TCT GGA ACC TCT ATG GCT TCT CCT CAC ATT GCT GGT CTG CTG 1615
Thr He Ser Gly Thr Ser Met Ala Ser Pro His He Ala Gly Leu Leu
375 380 385
GCC TAC TTC GTT TCT CTG CAG CCT GCT CAG GAC TCT GCT TTC GCT GTC
1663
Ala Tyr Phe Val Ser Leu Gin Pro Ala Gin Asp Ser Ala Phe Ala Val
390 395 400 405
GAT GAG CTT ACT CCT GCC AAG CTC AAG AAG GAT ATC ATC TCC ATC GCC 1711
Asp Glu Leu Thr Pro Ala Lys Leu Lys Lys Asp He He Ser He Ala
410 415 420
ACC CAG GGT GCC CTT ACT GAT ATC CCA TCT GAC ACC CCC AAC
1753
Thr Gin Gly Ala Leu Thr Asp He Pro Ser Asp Thr Pro Asn
425 430 435
GTAAGTTATA TTATCCATTT TGGTATAATG AAACAGAAAG TGGCTAACTG TTTTATTCTA 1813
G CTT CTC GCC TGG AAC GGC GGT GGT GCC GAC AAC TAC ACC CAG ATT
1859
Leu Leu Ala Trp Asn Gly Gly Gly Ala Asp Asn Tyr Thr Gin He
440 445 450
GTC GCC AAG GGT GGA TAC AAG GCC GGC AGT GAC AAC CTT AAG GAC CGC 1907
Val Ala Lys Gly Gly Tyr Lys Ala Gly Ser Asp Asn Leu Lys Asp Arg
455 460 465

TTT GAC GGA CTA GTC AAC AAG GCC GAG AAG TTG CTC GCT GAG GAG CTT 1955
Phe Asp Gly Leu Val Asn Lys Ala Glu Lys Leu Leu Ala Glu Glu Leu
470 475 480
GGA GCT ATT TAC AGT GAG ATC CAG GGT GCT GTT GTT GCA TAG 1997
Gly Ala He Tyr Ser Glu He Gin Gly Ala Val Val Ala *
485 490 495
ATGCAAGACA AGACTTGATT TAGAGTGACG TAACTAGTTT CGTTTATGGC AGGGTATGGG 2057
AATTGGCTAA CCGAACACTG GCGCTGGTAT TTGTTTTTGC TGCTGCTTTT TGGTAACACG 2117
GAGAAGCCGA TGCATTGACT GCATTGGGTA CATTATCCTG ACATGGTTTA CCTGGTCTTT 2177
CATTATTATT ATAGCATACA TGTCCACAAC AATCTTTGAC ATCCTATCTA GAGATACATG 2237
TGCTTGCTTT TAACAGACTG CCAAATCAAT TATGCGACTG TTCTGCACAG ATAATCGTGG 2297
CTTGGTTTGA AGGCTGCCAT AAAGTCTAAC GCTGGCTACC AATTAGGTAG GAGTGTCCCC 2357
TTCCTGCCAG GTTGCTCCAG TCGTAGAAGT AGACTGATAT ATTGAAGATT GCCCATATAC 2417
CATGGACGCT CGTCTTATTC TACATCATAT ATGTCACTCC TAGTGACCAT ATAGACATGC 2477
TAACCATTGC ACAACCCCCC ACAGGTTCAA TCCAACCCAT GACCCCCTCT CATCTTCTGT 2537
TGTATTTTCA GGTTCTAGAT TTGCATACAT ACTACCCATC ATCGGAAGAC GGGTGAGGAG 2597
GCAGATGACC CGACATTATA TTTATTAATT GCTTAGGATG TTTCAACAAC ATTAAAAGTA 2657

TATCAATAAG CTTTTCCAGT TTATATTTAC TACCTAAGAT TACGGCATAT AGTGTATTCT 2717
GTGTGCGTAA GAGGTCGCCC TTAAATGGAA ACAGTTCGCG GTTGGAGATA TATATTTGTA 2777
GTGTTCAGGC GGAACGAGTA AAAAAAAAAA AAAATGAGAA GCTGGTGATA TTAACTCCGA 2837
TGTTTATCTT ACATATACCA ATGGATGTAG TCTCATTATA ACGCTTTCTC TGTAGTTTGG 2897
TTGTCATAGA ACTGAATGAC AGGTAAGTGT GTATGTATGT ACAGTACGCA CGGGGGGCCA 2957
TGTGGTCAAC CACACCCAAT GGGCGGTCTT GTCACTTTCC GGACTGGAAA TGAAACGTTC 3017
CATGGAAGAA ATCTGGATGA TTACCTTGAG TACGAGAGAA CTATGGTTGC CGGTAATGGG 3077
TGATTGCCAC AATCATCAGT TCGGTTGAGG CGTTCAACAT CTACGGTACG TTCAGTCACA 3137
TGAATCTGGG AATTCGGGCC TGGTATGCTG GTTTTCGCAA GAGATCCACC CGGCGTGTGC 3197
CAGGTATGCT ACATTTTCTC AGTCGAC 3224
(2) INFORMATION FOR SEQ ID NO: 4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 496 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: A:
Met Arg Gly He Leu Gly Leu Ser Leu Leu Pro Leu Leu Ala Ala Ala
15 10 15
Ser Pro Val Ala Val Asp Ser He His Asn Gly Ala Ala Pro He Leu
20 25 30
Ser Ala Ser Asn Ala Lys Glu Val Pro Asp Ser Tyr He Val Val Phe
35 40 45
Lys Lys His Val Ser Ala Glu Thr Ala Ala Ala His His Thr Trp Val
50 55 60
Gin Asp He His Asp Ser Met Thr Gly Arg He Asp Leu Lys Lys Arg
65 70 75 80
Ser Leu Phe Gly Phe Ser Asp Asp Leu Tyr Leu Gly Leu Lys Asn Thr
85 90 95
Phe Asp He Ala Gly Ser Leu Ala Gly Tyr Ser Gly His Phe His Glu
100 105 110
Asp Val He Glu Gin Val Arg Arg His Pro Asp Val Glu Tyr He Glu
115 120 125
Lys Asp Thr Glu Val His Thr Met Glu Glu Thr Thr Glu Lys Asn Ala
130 135 140
Pro Trp Gly Leu Ala Arg He Ser His Arg Asp Ser Leu Ser Phe Gly
145 150 155 160
Thr Phe Asn Lys Tyr Leu Tyr Ala Ser Glu Gly Gly Glu Gly Val Asp
165 170 175
Ala Tyr Thr He Asp Thr Gly lie Asn He Glu His Val Asp Phe Glu
180 185 190
Asp Arg Ala His Trp Gly Lys Thr He Pro Ser Asn Asp Glu Asp Ala
195 200 205
Asp Gly Asn Gly His Gly Thr His Cys Ser Gly Thr He Ala Gly Lys
210 215 220

Lys Tyr Gly Val Ala Lys Lys Ala Asn He Tyr Ala Val Lys Val Leu
225 230 235 240
Arg Ser Ser Gly Ser Gly Thr Met Ser Asp Val Val Leu Gly Val Glu
245 250 255
Trp Ala Val Gin Ser His Leu Lys Lys Ala Lys Asp Ala Lys Asp Ala
260 265 270
Lys Val Lys Gly Phe Lys Gly Ser Val Ala Asn Met Ser Leu Gly Gly
275 280 285
Ala Lys Ser Arg Thr Leu Glu Ala Ala Val Asn Ala Gly Val Glu Ala
290 295 300
Gly Leu His Phe Ala Val Ala Ala Gly Asn Asp Asn Ala Asp Ala Cys
305 310 315 320
Asn Tyr Ser Pro Ala Ala Ala Glu Asn Ala He Thr Val Gly Ala Ser
325 330 335
Thr Leu Gin Asp Glu Arg Ala Tyr Phe Ser Asn Tyr Gly Lys Cys Thr
340 345 350
Asp He Phe Ala Pro Gly Pro Asn He Leu Ser Thr Trp Thr Gly Ser
355 360 365
Lys His Ala Val Asn Thr He Ser Gly Thr Ser Met Ala Ser Pro His
370 375 380
He Ala Gly Leu Leu Ala Tyr Phe Val Ser Leu Gin Pro Ala Gin Asp
385 390 395 400
Ser Ala Phe Ala Val Asp Glu Leu Thr Pro Ala Lys Leu Lys Lys Asp
405 410 415
He He Ser He Ala Thr Gin Gly Ala Leu Thr Asp He Pro Ser Asp
420 425 430
Thr Pro Asn Leu Leu Ala Trp Asn Gly Gly Gly Ala Asp Asn Tyr Thr
435 440 445
Gin He Val Ala Lys Gly Gly Tyr Lys Ala Gly Ser Asp Asn Leu Lys

450 455 460
Asp Arg Phe Asp Gly Leu Val Asn Lys Ala Glu Lys Leu Leu Ala Glu
465 470 475 480
Glu Leu Gly Ala He Tyr Ser Glu He Gin Gly Ala Val Val Ala *
485 490 495
(2) INFORMATION FOR SEQ ID NO: 5;
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 5643 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iii) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM; Aspergillus oryzae
(B) STRAIN: IF04177
(ix) FEATURE:
(A) NAME/KEY: intron
(B) LOCATION: 2701..2769 (ix) FEATURE:

(A) NAME/KEY; CDS
(B) LOCATION: join(2282..2700, 2770..4949)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
AAGCTTCGTC CTCGCATCTC GGCCGGGTGA GTAAGGTATG GTATTATTCA TGAAGGGATC 60

TCGTTGGTTA CCGTTGTCTA TCCCTAAACA AAGGATTCAA GAGAACAACT CGGAATGCTC 120
CCTCCGCTTA AACCCCTTGA CTCACTGATG GTGTATGTAC TATGGGTACG ACGTTCGGGA 180
TGTGGACTAC CAACCAGAGA GTGATTAGAG AGTCCGGGTT CTCAGTCCAT GATTTTTGCA 240
TCTTTGAAAC AGACGATGCG GAGCGGTCAT TGGCGGAGTT TACTCCCAAA TACGGCCGAA 300
CGGGGTACTT TAAGTGGAAT CTCCGATTTT GGATCTAAGC TCATGAAGGA AAAGTACTAC 360
TAATGCGTAC CTGTGCCTAA TGTTAGTGCT AGTTCGTCTG TTGCATTTTA CCCGTCGGTT 420
AAGACGAATG GATCCGTTCA GGTTTTAAAA TAACTATCTA TGAAATATTT TAGATTTCCC 480
GACATAGTGG TTGGGATGTC TCGATTAACA CTAGGTACAT CAGGCTCAAT TGATTTTGGT 540
TTTAACGAAA CATGATATAG GTCAGGGTCG TGGACCACCC TCCGCCAGGG ATCAGGGGAC 600
GGTTACATGC GAAGGATTCT GATTATATTC ATGATTATGT CAAGCCTTTT CTCTCGTGTG 660
AAGAGGAGCA GAGAATCCGT ACGGGTTTAA TTTAATTTAG CGCCCTGCAG CTTCGAGAAC 720
ATCCCCAGCA ACGTTAAAAA CCACGAGCTA AAATGGGTCG CCACCGGAAG CACTCGAGTC 780
GAGAGATCGG TCGGCTCAGT ATTCGTAATA CCTGCGTTCC AGACGGTTTT GGTCGTTGGT 840
TTCACTCAGG GAACTTAATT CCAGCGGGAC CCAATATAAT TTGAATGATT CATGATACAT 900
CCATTCGTTT GAACCGATCC TGCAAGAGTT CTGTCTGATT TGGTCAACAT AGTTTTCCTC 960

TGGGGGAGAC TGGGGAAGAG TCAACACAAT GGTCAGGGAG AGAAGAATGA AAGCTCTCGC 1020
AAGTGGATGA TCATGCTACG TACTGTAGGA ATAAAATTAA TTAATGCGAG GCTGCAAGTA 1080
TCCCTGCGCC GATTTTCTCT TCTTACGGCG GGAACCAAAA AATGTGACGC TGTGATTTTC 1140
TGGAAAAGGT AAGGATGTTT AGTTTCCCAG GATTATTACT GGTTCCGTAT GTGTATGTGT 1200
ATGGATATCA TTCCGTATGG ATACGCCCGT TTCCTCCGCC CAGAACCAGT CCGTCATCCA 1260
TCCTCCACTC TTTCTTCTCT TAGAGCCTTT CCACCTCTCT TCACTTTCTT TTTCTTTCCC 1320
CCCTCCCTCT TTGCTTTCCC TCTCCCAGTA TTATTCTTAT ATTATCGGTT TGACCGTCGC 1380
CTCAGTATCG GCCCCCCGTG AATCACTTTT CGTTTCTCTT GTATTTTACT TTCCTATCTG 1440
GGATTGCTCC TCGATTAGCA GCTCTACTTC ATTCGGCCAT GTGCGTCTAG AGGGTCTAGC 1500
CCCTCTCTCT CTTTGCACTG ACTGTCAGCC ATACCATAGT ATCATCCCGG AATTAAGAAA 1560
AAAAAAGAAA TTATTCTACC TCCGATCTGG ACAAATTATA ACCAGGAGAA AATCAAGCGA 1620
AAGAGGGGCA AAGGAGGAGA CACCATTAAA ACTGGGTCTG GTTTGATTCA TGACATACAT 1680
TCGTCGTCTT GAATTTCAAT AGGTACGGAC TGATGCATTC CACTCGAGCC TTTTTAGCTG 1740
CGTGTCCGTC TCCAATCGCA CTTCTTTTCT TATTTCCTTG TGGGATAAAT TGATTATTTA 1800

CCGTTTCGTT TTCTCTATAT TGCGGTGGTG GTGCGACCCA TCCAACTATT ATTATTATAA 1860
TTGGAATTTG ATTTGGATTT TGATTCCTGT GACGGATCTC AGACCAAGTG CCTAAACTAT 1920
AACTGACTTG GACCCCCTTC AGATCCTAGC TTCCCGATTC TTTTCCACCA CTGCTGCATC 1980
CTCTTCCTGC ACGCAGCGTT CGTTTAGGGC GGGTAGACTG GAATTTATTC CTTGCGCCAC 2040
GGACCAATCG CTCCCTCGAC GCTCTCATTC CTGCGTCGAG CTCTTTTTCC CTCGACTCTC 2100
ATTGCTTGCT GGGCTGGTTC TTGAACCTCT TCAATCGTCC TTATCTCTTT CCCCCCATCC 2160
GGCCTGTGAT TCCTATCTTT CCTTTTTTTC TTCCCTTTCT TGTTTGATCC CCCCTCCTCC 2220
CCGTCTTATC GCCTACTATC GTGATCCCCG CCCTTCCCAA TAAAGAGTAG GGCGTGTGAA 2280
C ATG TCC GGG TTA ACC CTC GGG CGA GGC CCT GGG GGC GTG CGA CCG
2326
Met Ser Gly Leu Thr Leu Gly Arg Gly Pro Gly Gly Val Arg Pro
15 10 15
ACT CAA ACC GCA ACT TTT ACC ACC CAC CAC CCG TCC GCC GAT GCT GAC 2374
Thr Gin Thr Ala Thr Phe Thr Thr His His Pro Ser Ala Asp Ala Asp
20 25 30
CGC TCC TCC AAC AAC CTC CCC CCT ACC TCC TCG CAG CTG TCC GAT GAC 2422
Arg Ser Ser Asn Asn Leu Pro Pro Thr Ser Ser Gin Leu Ser Asp Asp
35 40 45
TTT TCT TTC GGT TCC CCT CTG AGC CCC GCC GAC TCA CAG GCC CAT
GAC 2470
Phe Ser Phe Gly Ser Fro Leu Ser Pro Ala Asp Ser Gin Ala His ASD

50 55 60
GGC CTA CTT CAG GAC TCC CTC TTC CCT GAA TGG GGG TCT GGT GCG CCT 2518
Gly Leu Leu Gin Asp Ser Leu Phe Pro Glu Tip Gly Ser Gly Ala Pro
65 70 75
CGA CCC GGC ATT GAC AGT CCG GAT GAG ATG CAG AGG CAA GAT CCG CTA 2566
Arg Pro Gly He Asp Ser Pro Asp Glu Met Gin Arg Gin Asp Pro Leu
80 85 90 95
GCG ACT CAA ATA TGG AAG CTC TAT TCT AGG ACC AAG GCC CAG TTG CCC 2614
Ala Thr Gin He Trp Lys Leu Tyr Ser Arg Thr Lys Ala Gin Leu Pro
100 105 110
AAC CAG GAG CGT ATG GAA AAC CTG ACC TGG CGG ATG ATG GCG ATG AGT 2662
Asn Gin Glu Arg Met Glu Asn Leu Thr Trp Arg Met Met Ala Met Ser
115 120 125
TTG AAA CGT AAG GAG CGG GAA CGT GCT CAA CAG TCC AT GTAGGTGTTC 2710
Leu Lys Arg Lys Glu Arg Glu Arg Ala Gin Gin Ser Met
130 135 140
TCCCTCTGTA GAGGAACGGC TGGACCCGCT CATCATTAAT TrrrTTTTTG TCTGTGAAGG 2770
TTT CCT GCG AGA CGC GGT AGC GCT GGC CCC AGT GGT ATC GCT CAA CTG 2818
Phe Pro Ala Arg Arg Gly Ser Ala Gly Pro Ser Gly He Ala Gin Leu
145 150 155
CGC ATT TCC GAC CCG CCC GTT GCC ACC GGT AAC CCT CAG TCA ACC GAC 2866
Arg He Ser Asp Pro Pro Val Ala Thr Gly Asn Pro Gin Ser Thr Asp
160 165 170
CTG ACC GCC GAC CCT ATG AAC CTC GAC GAT TTC ATC GTG CCC TTC GAA 2914
Leu Thr Ala Asp Pro Met Asn Leu Asp Asp Phe He Val Pro Phe Glu
175 180 185

TCT CCT TCG GAC CAC CCC TCG CCC AGT GCC GTC AAG ATT TCC GAC TCC 2962
Ser Pro Ser Asp His Pro Ser Pro Ser Ala Val Lys He Ser Asp Ser
190 195 200
ACG GCG TCC GCG GCC ATT CCC ATC AAG TCC CGG AAA GAC CAG CTG
AGA 3010
Thr Ala Ser Ala Ala He Pro He Lys Ser Arg Lys Asp Gin Leu Arg
205 210 215 220
GAT TCT ACC CCG GTG CCG GCC TCG TTC CAC CAT CCG GCT CAG GAT
CAA 3058
Asp Ser Thr Pro Val Pro Ala Ser Phe His His Pro Ala Gin Asp Gin
225 230 235
CGG AAG AAC AGT GAA TTT GGC TAG GTC CCC CGT CGC GTG CGC AAG ACG 3106
Arg Lys Asn Ser Glu Phe Gly Tyr Val Pro Arg Arg Val Arg Lys Thr
240 245 250
AGT ATC GAC GAG CGT CAA TTT TTC TCA CTG CAG GTG CCG ACC CGA AAG 3154
Ser He Asp Glu Arg Gin Phe Phe Ser Leu Gin Val Pro Thr Arg Lys
255 260 265
CGA CCG GCC GAA TCC TCG CCC CAG GTA CCC CCC GTT TCC AAC TCG ATG 3202
Arg Pro Ala Glu Ser Ser Pro Gin Val Pro Pro Val Ser Asn Ser Met
270 275 280
TTG GCC CAC GAT CCG GAC CTC GCT TCC GGC GTG CCC GAT TAT GCC
TTG 3250
Leu Ala His Asp Pro Asp Leu Ala Ser Gly Val Pro Asp Tyr Ala Leu
285 290 295 300
GAC GCC CCG TCC TCG GCC TTT GGC TTC CAT CAG GGT AAC CAC CAT CCG 3298
Asp Ala Pro Ser Ser Ala Phe Gly Phe His Gin Gly Asn His His Pro
305 310 315
GTC AAT CAT CAC AAC CAC ACC TCC CCC GGG GCA CCG TTT GGC TTG GAT 3346

Val Asn His His Asn His Thr Ser Pro Gly Ala Pro Phe Gly Leu Asp
320 325 330
ACG TTC GGC CTG GGA GAT GAT CCA ATC TTG CCC TCC GCG GGC CCC TAC 3394
Thr Phe Gly Leu Gly Asp Asp Pro He Leu Pro Ser Ala Gly Pro Tyr
335 340 345
CAG TCG CAA TTC ACC TTC TCA CCC AGC GAG TCT CCG ATG GCC TCC GGT 3442
Gin Ser Gin Phe Thr Phe Ser Pro Ser Glu Ser Pro Met Ala Ser Gly
350 355 360
CAT CCG TTT GCG AAC CTC TAT TCG CAT ACC CCG GTG GCT TCG TCC
CTC 3490
His Pro Phe Ala Asn Leu Tyr Ser His Thr Pro Val Ala Ser Ser Leu
365 370 375 380
AAC TCG ACG GAT TTC TTC TCT CCA CCG CCA TCA GGC TAC CAG TCC ACG 3538
Asn Ser Thr Asp Phe Phe Ser Pro Pro Pro Ser Gly Tyr Gin Ser Thr
385 390 395
GCA TCC ACG CCG CAG CCC ACC TAC GAC GGG GAC CAT TCC GTT TAT TTC 3586
Ala Ser Thr Pro Gin Pro Thr Tyr Asp Gly Asp His Ser Val Tyr Phe
400 405 410
GAT ATG CCG TCG GGC GAC GCG CGC ACC CAG CGC CGC ATT CCG AAC
TAT 3634
Asp Met Pro Ser Gly Asp Ala Arg Thr Gin Arg Arg He Pro Asn Tyr
415 420 425
ATT TCG CAT CGG TCC AAC TTG TCT GCT TCG CTG CAG CCT CGG TAT ATG 3682
He Ser His Arg Ser Asn Leu Ser Ala Ser Leu Gin Pro Arg Tyr Met
430 435 440
TTC AAC CAG AAC AAC CAT GAA CAG GCC AGT TCG TCG ACG GTG CAT
TCG 3730
Phe Asn Gin Asn Asn His Glu Gin Ala Ser Ser Ser Thr Val His Ser
445 450 455 460

CCG AGC TAC CCC ATT CCC CAG CCG CAA CAT GTG GAC CCC ACT CAG GTG 3778
Pro Ser Tyr Pro He Pro Gin Pro Gin His Val Asp Pro Thr Gin Val
465 470 475
TTG AAC GCC ACC AAT TAC TCG ACC GGC AAC TCC CAC CAT ACC GGC GCC 3826
Leu Asn Ala Thr Asn Tyr Ser Thr Gly Asn Ser His His Thr Gly Ala
4S0 485 490
ATG TTT TCA TTT GGA GCC GAT TCA GAT AAC GAG GAT GAC GAT GGT CAT 3874
Met Phe Ser Phe Gly Ala Asp Ser Asp Asn GIu Asp Asp Asp Gly His
495 500 505
CAG CTG TCC GAG CGG GCT GGT CTG GCG ATG CCG ACT GAA TAT GGG GAC 3922
Gin Leu Ser Glu Arg Ala Gly Leu Ala Met Pro Thr GIu Tyr Gly Asp
510 515 520
GAG GAC GGG TTC TCG TCG GGC ATG CAG TGG GAT GGG CAG TTC CCG
GGC 3970
Glu Asp Gly Phe Ser Ser Gly Met Gin Trp Asp Gly Gin Phe Pro Gly
525 530 535 540
TCC TTC CAT TCG CTG CCG GGC TTT GGC CCT CAA CAT CGC AAG CAT GTT 4018
Ser Phe His Ser Leu Pro Gly Phe Gly Pro Gin His Arg Lys His Val
545 550 555
ACC ATC GGG TCC ACG GAC ATG ATG GAC ACC CCC GAG GAG TGG AAT CAC 4066
Thr lie Gly Ser Thr Asp Met Met Asp Thr Pro GIu Glu Trp Asn His
560 565 570
GGT GGC AGT TTG GGT CGG ACT CAT GGG TCG GTG GCT TCG GTC AGT
GAG 4114
Gly Gly Ser Leu Gly Arg Thr His Gly Ser Val Ala Ser Val Ser Glu
575 580 585
GTG CGC AAC CGA GAG CAG GAC CCT CGC CGG CAG AAG ATT GCC CGC
ACC 4162
Val Arg Asn Arg Glu Gin Asp Pro Arg Arg Gin Lys He Ala Arg Thr

590 595 600
ACG TCC ACC CCC AAT ACG GCC CAG CTG TTG CGC CAA AGC ATG CAC
TCT 4210
Thr Ser Thr Pro Asn Thr Ala Gin Leu Leu Arg Gin Ser Met His Ser
605 610 615 620
AAT AAC AAT ACG TCT CAT ACC TCC CCT AAT ACG CCG CCC GAG TCC GCC 4258
Asn Asn Asn Thr Ser His Thr Ser Pro Asn Thr Pro Pro Glu Ser Ala
625 630 635
CTG AGC AGC GCA GTT CCG TCC CGC CCG GCC AGT CCC GGG GGC AGC AAG 4306
Leu Ser Ser Ala Val Pro Ser Arg Pro Ala Ser Pro Gly Gly Ser Lys
640 645 650
AAC GGC GAC CAA GGC AGC AAC GGA CCG ACC ACC TGC ACG AAC TGC TTC 4354
Asn Gly Asp Gin Gly Ser Asn Gly Pro Thr Thr Cys Thr Asn Cys Phe
655 660 665
ACT CAA ACC ACT CCG CTG TGG CGT CGG AAC CCA GAG GGC CAG CCA CTG 4402
Thr Gin Thr Thr Pro Leu Trp Arg Arg Asn Pro Glu Gly Gin Pro Leu
670 675 680
TGC AAT GCC TGC GGG TTG TTT TTG AAA TTG CAC GGT GTC GTG CGC
CCT 4450
Cys Asn Ala Cys Gly Leu Phe Leu Lys Leu His Gly Val Val Arg Pro
685 690 695 700
CTG TCC CTG AAA ACG GAC GTT ATC AAA AAG CGC AAC CGT AGC AGT GCC 4498
Leu Ser Leu Lys Thr Asp Val He Lys Lys Arg Asn Arg Ser Ser Ala
705 710 715
AAC AGC TTG GCG GTT GGG ACC TCC CGT GCG TCG AAG AAG ACA GCC CGC 4546
Asn Ser Leu Ala Val Gly Thr Ser Arg Ala Ser Lys Lys Thr Ala Arg
720 725 730

AAG AAC TCG GTG CAG CAA GCA TCC GTC ACG ACT CCG ACA TCA AGC
CGC 4594
Lys Asn Ser Val Gin Gin Ala Ser Val Thr Thr Pro Thr Ser Ser Arg
735 740 745
GCT CAG AAT GGG ACT TCC TTC GAA TCC CCG CCC GCC GGC TTT AGT GCT 4642
Ala Gin Asn Gly Thr Ser Phe Glu Ser Pro Pro Ala Gly Phe Ser Ala
750 755 760
GCC GCG GGA CGG TCG AAT GGG GTG GTA CCC ATT GCC GCC GCT CCT
CCG 4690
Ala Ala Gly Arg Ser Asn Gly Val Val Pro He Ala Ala Ala Pro Pro
765 770 775 780
AAG GCA GCT CCC TCC GCA GCC GCC TCC CCT AGC ACG GGC CAG ACC CGC 4738
Lys Ala Ala Pro Ser Ala Ala Ala Ser Pro Ser Thr Gly Gin Thr Arg
785 790 795
AAC CCG ATC CAG GCT GCC CCG AAA CGT CAA CGA CGG CTG GAA AAG GCC 4786
Asn Pro He Gin Ala Ala Pro Lys Arg Gin Arg Arg Leu Glu Lys Ala
800 805 810
ACG GAG ATG GAA ACG GAC GAG GCT AAC AAG TCC GCG GGA GGC CGA TCC 4834
Thr Glu Met Glu Thr Asp Glu Ala Asn Lys Ser Ala Gly Gly Arg Ser
815 820 825
AAG GTG GTG CCT CTG GCA CCC GCC ATG CCA CCG GCA GCA GCC AAT CCG 4882
Lys Val Val Pro Leu Ala Pro Ala Met Pro Pro Ala Ala Ala Asn Pro
830 835 840
GCG AAC CAT AGT ATT GCC GGA GGC CAA GGG GCT AGT CAG GAA TGG
GAG 4930
Ala Asn His Ser He Ala Gly Gly Gin Gly Ala Ser Gin Glu Trp Glu
845 850 855 860
TGG TTG ACG ATG AGT CTGTAATGGC CGCGCTTACC TCTCTACTTC TCTACACTCG 4985 Trp Leu Thr Met Ser Leu

865
TTTCTTAATA TCTTTCTTGA ACCCCCCCTT ATATTTTCCC ACCGTTGATG CTACGCCATG 5045
ACCGATAGAG ATGATGAATA CTGCAACCAA TGGAATCTCG CTAGACGAGA GGTGTTAGAT 5105
GACGTGGCCC GCGATGCACT TAATGAGATA CGAGGAGGTG CAATGCGTTG GTTACGCTAG 5165
TTTAATGGTA ACATGACGAG GGATATTCGC TCTGTTATTT CGGGCTTTGA TCTGTTTCAG 5225
TCTGCGATTT AACAGCGACT GATCCTCTGC TGTGACAATA CACAGCTTGT CTTGTGGTTC 5285
TGTTGTGGCT TTCTGTTTGT TTGGCTGATT TGATTTATGC TTGATACAAT CGCGTCTGTC 5345
CGGACCCCGG CCTTTGTTTT GTTTTCAGTT CTGATTCTTC ACTGTTTCTG ATTCTCTTGT 5405
TCATGTTTTT GATTTGTTCA AGGCTTGGGG CCGGGCAGAA GTGCGCATCT CTGCTTTGTG 5465
TTTTCCGTCA CCGTGCATAG ACGCTGTATG TATATGCTAC AGCAAGATTC TACTTATCCA 5525
GTCTGAGCCT GTATTCATTG AAGTGTAGCC AGCTGTCGAA TGAGCTTTTT AACGATATTG 5585
TTTTGTTGAG TAGTCAACAA GTAGTATCTG TATATTCCGG AGTCTAAGTA AGACACTT 5643
(2) INFORMATION FOR SEQ ID NO: 6:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 866 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear

(ii) MOLECULE TYPE; protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
Met Ser Gly Leu Thr Leu Gly Arg Gly Pro Gly Gly Val Arg Pro Thr
15 10 15
Gin Thr Ala Thr Phe Thr Thr His His Pro Ser Ala Asp Ala Asp Arg
20 25 30
Ser Ser Asn Asn Leu Pro Pro Thr Ser Ser Gin Leu Ser Asp Asp Phe
35 40 45
Ser Phe Gly Ser Pro Leu Ser Pro Ala Asp Ser Gin Ala His Asp Gly
50 55 60
Leu Leu Gin Asp Ser Leu Phe Pro Glu Trp Gly Ser Gly Ala Pro Arg
65 70 75 80
Pro Gly He Asp Ser Pro Asp Glu Met Gin Arg Gin Asp Pro Leu Ala
85 90 95
Thr Gin He Trp Lys Leu Tyr Ser Arg Thr Lys Ala Gin Leu Pro Asn
100 105 110
Gin Glu Arg Met Glu Asn Leu Thr Trp Arg Met Met Ala Met Ser Leu
115 120 125
Lys Arg Lys Glu Arg Glu Arg Ala Gin Gin Ser Met Phe Pro Ala Arg
130 135 140
Arg Gly Ser Ala Gly Pro Ser Gly He Ala Gin Leu Arg He Ser Asp
145 150 155 160
Pro Pro Val Ala Thr Gly Asn Pro Gin Ser Thr Asp Leu Thr Ala Asp
165 170 175
Pro Met Asn Leu Asp Asp Phe He Val Pro Phe Glu Ser Pro Ser Asp
180 185 190
His Pro Ser Pro Ser Ala Val Lys He Ser Asp Ser Thr Ala Ser Ala
195 200 205

Ala Ite Pro He Lys Ser Arg Lys Asp Gin Leu Arg Asp Ser Thr Pro
210 215 220
Val Pro Ala Ser Phe His His Pro Ala Gin Asp Gin Arg Lys Asn Ser
225 230 235 240
Glu Phe Gly Tyr Val Pro Arg Arg Val Arg Lys Thr Ser He Asp Glu
245 250 255
Arg Gin Phe Phe Ser Leu Gin Val Pro Thr Arg Lys Arg Pro Ala Glu
260 265 270
Ser Ser Pro Gin Val Pro Pro Val Ser Asn Ser Met Leu Ala His Asp
275 280 285
Pro Asp Leu Ala Ser Gly Val Pro Asp Tyr Ala Leu Asp Ala Pro Ser
290 295 300
Ser Ala Phe Gly Phe His Gin Gly Asn His His Pro Val Asn His His
305 310 315 320
Asn His Thr Ser Pro Gly Ala Pro Phe Gly Leu Asp Thr Phe Gly Leu
325 330 335
Gly Asp Asp Pro He Leu Pro Ser Ala Gly Pro Tyr Gin Ser Gin Phe
340 345 350
Thr Phe Ser Pro Ser Glu Ser Pro Met Ala Ser Gly His Pro Phe Ala
355 360 365
Asn Leu Tyr Ser His Thr Pro Val Ala Ser Ser Leu Asn Ser Thr Asp
370 375 380
Phe Phe Ser Pro Pro Pro Ser Gly Tyr Gin Ser Thr Ala Ser Thr Pro
385 390 395 400
Gin Pro Thr Tyr Asp Gly Asp His Ser Val Tyr Phe Asp Met Pro Ser
405 410 415
Gly Asp Ala Arg Thr Gin Arg Arg He Pro Asn Tyr He Ser His Arg
420 425 430

Ser Asn Leu Ser Ala Ser Leu Gin Pro Arg Tyr Met Phe Asn Gin Asn
435 440 445
Asn His Glu Gin Ala Ser Ser Ser Thr Val His Ser Pro Ser Tyr Pro
450 455 460
He Pro Gin Pro Gin His Val Asp Pro Thr Gin Val Leu Asn Ala Thr
465 470 475 480
Asn Tyr Ser Thr Gly Asn Ser His His Thr Gly Ala Met Phe Ser Phe
485 490 495
Gly Ala Asp Ser Asp Asn Glu Asp Asp Asp Gly His Gin Leu Ser Glu
500 505 510
Arg Ala Gly Leu Ala Met Pro Thr Glu Tyr Gly Asp Glu Asp Gly Phe
515 520 525
Ser Ser Gly Met Gin Trp Asp Gly Gin Phe Pro Gly Ser Phe His Ser
530 535 540
Leu Pro Gly Phe Gly Pro Gin His Arg Lys His Val Thr lie Gly Ser
545 550 555 560
Thr Asp Met Met Asp Thr Pro Glu Glu Trp Asn His Gly Gly Ser Leu
565 570 575
Gly Arg Thr His Gly Ser Val Ala Ser Val Ser Glu Val Arg Asn Arg
580 585 590
Glu Gin Asp Pro Arg Arg Gin Lys He Ala Arg Thr Thr Ser Thr Pro
595 600 605
Asn Thr Ala Gin Leu Leu Arg Gin Ser Met His Ser Asn Asn Asn Thr
610 615 620
Ser His Thr Ser Pro Asn Thr Pro Pro Glu Ser Ala Leu Ser Ser Ala
625 630 635 640
Val Pro Ser Arg Pro Ala Ser Pro Gly Gly Ser Lys Asn Gly Asp Gin
645 650 655
Gly Ser Asn Gly Pro Thr Thr Cys Thr Asn Cys Phe Thr Gin Thr Thr
660 665 670

Pro Leu Tip Arg Arg Asn Pro Glu Gly Gin Pro Leu Cys Asn Ala Cys
675 680 685
Gly Leu Phe Leu Lys Leu His Gly Val Val Arg Pro Leu Ser Leu Lys
690 695 700
Thr Asp Val He Lys Lys Arg Asn Arg Ser Ser Ala Asn Ser Leu Ala
705 710 715 720
Val Gly Thr Ser Arg Ala Ser Lys Lys Thr Ala Arg Lys Asn Ser Val
725 730 735
Gin Gin Ala Ser Val Thr Thr Pro Thr Ser Ser Arg Ala Gin Asn Gly
740 745 750
Thr Ser Phe Glu Ser Pro Pro Ala Gly Phe Ser Ala Ala Ala Gly Arg
755 760 765
Ser Asn Gly Val Val Pro He Ala Ala Ala Pro Pro Lys Ala Ala Pro
770 775 780
Ser Ala Ala Ala Ser Pro Ser Thr Gly Gin Thr Arg Asn Pro He Gin
785 790 795 800
Ala Ala Pro Lys Arg Gin Arg Arg Leu Glu Lys Ala Thr Glu Met Glu
805 810 815
Thr Asp Glu Ala Asn Lys Ser Ala Gly Gly Arg Ser Lys Val Val Pro
820 825 830
Leu Ala Pro Ala Met Pro Pro Ala Ala Ala Asn Pro Ala Asn His Ser
835 840 845
He Ala Gly Gly Gin Gly Ala Ser Gin Glu Trp Glu Trp Leu Thr Met
850 855 860
Ser Leu 865
(2) INFORMATION FOR SEQ ID NO: 7:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA Primer 19819
(iii) HYPOTHETICAL; YES
(iii) ANTI-SENSE: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7
GAAGATCTGC GCGGATGTAC ATTGTAG 27
(2) INFORMATION FOR SEQ ID NO: 8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA Primer 19821
(iii) HYPOTHETICAL: YES
(iii) ANTI-SENSE: NO
ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 8
TTAGTCAGAA ATTCGTCCCG 20
(2) INFORMATION FOR SEQ ID NO: 9:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS; single
(D) TOPOLOGY: linear
62

(ii) MOLECULE TYPE: DNA Primer 19820
(iii) HYPOTHETICAL: YES
(iii) ANTI-SENSE: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9
CCCAAGCTTC ATGCTCGACC AGGGCCTCCT 30
(2) INFORMATION FOR SEQ ID NO: 10:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA Primer 19818
(iii) HYPOTHETICAL: YES
(iii) ANTI-SENSE: NO
[xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10
GGTCTGTGTT AACCAAAGAA C 21
(2) INFORMATION FOR SEQ ID NO: 11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA Primer 8681
(iii) HYPOTHETICAL: YES
(iii) ANTI-SENSE: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11
63

GATCCACCAT GAAG
(2) INFORMATION FOR SEQ ID NO: 12:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 nucleotides
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA Primer 8747
(Hi) HYPOTHETICAL: YES
(iii) ANTI-SENSE: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12
GTGGTACTTC AGCT


We claim:
1. An Aspergillus, wherein the areA gene by recombinant DNA technology has been modified in a way by which it cannot be expressed in a way providing for a functional AreA activator, and wherein the genes encoding for the extracellular proteases PepC and/or PepE have been inactivated in a manner whereby they are not expressed to produce functional proteases.
2. The Aspergillus as claimed in claim 1, wherein said inactivation has been obtained by deletion of all or parts of the areA, pepC, and/or pepE genes.
3. The Aspergillus as claimed in claim 1, wherein said inactivation has been obtained by deletion of all or parts of the areA, andpepE genes.
4. The Aspergillus as claimed in claim 1, wherein said inactivation has been obtained by interfering with the regulation of the expression signals regulating the expression of the areA,pepC, and/or pepE genes themselves.
5. The Aspergillus as claimed in claim 1, wherein said inactivation has been obtained by interfering with the regulation of the expression signals regulating the expression of the areA, andpepE genes themselves.
6. The Aspergillus as claimed in claim 1, wherein said inactivation has been obtained by using anti-sense technology.
7. The Aspergillus as claimed in claim 1, wherein said inactivation has been obtained by inserting extra DNA internally in the areA, pepC, and/or pepE genes.

8. The Aspergillus as claimed in claim 1, which belongs to a species selected from the group comprising A. oryzae, A. niger, A. awamori, A. phoenicis, A. japonicus, A. foetidus, A. nidulans, T. reesei, T. harzianum, H. insolens, H. lanuginosa, F. graminectrum, F. solani, P. chrysogenum, and others.
9. A method for producing the Aspergillus as claimed in claim 1, wherein said inactivation has been obtained by deletion of the areA, pepC, and/or pepE genes, which method comprises
i) cloning of the areA,pepC, and/or pepE genes from a Aspergillus of interest,
ii) producing DNA constructs each comprising one among the areA gene, the
pepC gene, and/or the pepE gene, wherein an internal part has been
substituted, deleted, or extra DNA has been inserted,
iii) transforming said Aspergillus with the constructs, and
iv) isolating transformants which are areA',pepC, and/or pepE.
10. A method for producing the Aspergillus as claimed in claim 1, wherein said
inactivation has been obtained by deletion of the areA and pepE genes, which method
comprises
i) cloning of the areA zndpepE genes from an Aspergillus of interest,
ii) producing DNA constructs each comprising one among the areA gene and
the pepE gene, wherein an internal part has been substituted, deleted, or
extra DNA has been inserted,
iii) transforming said Aspergillus with the constructs, and
iv) isolating transformants which are areA', andpepE.

11. A method for producing the Aspergillus as claimed in claim 1, wherein
said inactivation has been obtained by deletion of the areA and pepC genes, which
method comprises
i) cloning of the areA andpepC genes from an Aspergillus of interest,
ii) producing DNA constructs each comprising one among the areA gene and
the pepC gene, wherein an internal part has been substituted, deleted, or
extra DNA has been inserted,
iii) transforming said Aspergillus with the constructs, and
iv) isolating transformants which are areA', mdpepC.
12. A method for producing the Aspergillus as claimed in claim 1, wherein
said inactivation has been obtained by deletion of the pepC and pepE genes, which
method comprises
i) cloning of the pepC and pepE genes from an Aspergillus of interest,
ii) producing DNA constructs each comprising one among the pepC gene and
the pepE gene, wherein an internal part has been substituted, deleted, or
extra DNA has been inserted,
iii) transforming said Aspergillus with the constructs, and
iv) isolating transformants which arepepC, andpepE'.
13. A method for producing the Aspergillus as claimed in claim 1, wherein said
inactivation has been obtained by using anti-sense technology, which method comprises
i) construction of expression plasmids, each of which give rise to synthesis of
an RNA molecule complementary to the mRNA transcribed from the areA gene, the pepC gene, and/or the pepE gene,
ii) transformation of the host Aspergillus with said expression plasmids and a
suitable marker, either on separate plasmids or on the same plasmid,

iii) selection of transformants using said marker, and
iv) screening selected transformants for strains exhibiting a reduction in the
synthesis of the AreA, PepC, and/or PepE products.
14. A method for producing the Aspergillus as claimed in claim 1, wherein said
inactivation has been obtained by using anti-sense technology, which method comprises
i) construction of expression plasmids, each of which give rise to synthesis of
an RNA molecule complementary to the mRNA transcribed from the areA
gene and the pepE gene,
ii) transformation of the host Aspergillus with said expression plasmids and a
suitable marker, either on separate plasmids or on the same plasmid,
iii) selection of transformants using said marker, and
iv) screening selected transformants for strains exhibiting a reduction in the
synthesis of the AreA, PepC, and PepE products.
15. A process for the production of a desired gene product, whereby an Aspergillus as claimed in any of the claims 1 to 8 is cultivated in a suitable growth medium at appropriate conditions and the desired gene product is recovered and purified.
16. A process for the production of a desired gene product, whereby an the Aspergillus as claimed in any of the claims 1 to 8, which has been transformed to integrate a DNA sequence coding for the desired gene product into the genome of the Aspergillus in a functional manner, is cultivated in a suitable growth medium at appropriate conditions and the desired gene product is recovered and purified.
17. A process for producing a desired polypeptide comprising cultivating an Aspergillus in an appropriate growth medium and recovering said polypeptide from said

culture, said Aspergillus carrying a recombinant DNA construct capable of causing expression of said polypeptide or a precursor thereof in said Aspergillus, said Aspergillus further being characterized by producing lower amounts of functional AreA, PepC, and/or PepE than the wild-type of said Aspergillus.
18. A method as claimed in claim 17, wherein said Aspergillus has been modified to produce lower than wild-type amounts of AreA, PepC, and/or PepE by a process comprising transforming a parent of said Aspergillus with DNA constructs capable of causing reduced production of functional AreA, PepC, and/or PepE when integrated in the genome of said Aspergillus.
19. A method as claimed in claim 17, wherein said polypeptide is secreted to the extracellular medium by said fungus.
20. A method according to claim 17, wherein said Aspergillus produces higher amounts of said polypeptide than a similar Aspergillus where said similar Aspergillus produces AreA, PepC, and/or PepE in amounts similar to those produced by the wild-type of said Aspergillus, said similar Aspergillus being identical to said Aspergillus in all other respects.
21. The process as claimed in claim 13 or 14 to 20, wherein said gene product is a secreted protein.
22. The process as claimed in any of the claims 13 to 19, wherein said desired gene product is an industrial peptide or protein, preferably an enzyme.

23. The process as claimed in claim 22, wherein said enzyme is selected from the group comprising a protease, lipase, cutinase, cellulase, chymosin.
24. The process as claimed in any of the claims 14 to 21, wherein said desired gene product is a therapeutically active peptide or protein.
25. The process as claimed in claim 24, wherein said therapeutically active peptide or protein is selected from the group comprising insulin, growth hormone, glucagon, somatostatin, interferon, PDGF, factor VII, factor VIII, urokinase, tPA, EPO, orTPO.
26. A gene product produced in accordance with any of the processes 14 to 25.
27. A DNA sequence coding for the pepC gene from A. oryzae (SEQ ID No. 1) or functional alleles thereof.
28. A PepC protease from A. oryzae (SEQ ID No. 2).
29. A process for the production of the PepC protease as claimed in claim 28 comprising transforming a suitable host with a DNA construct comprising a DNA sequence as claimed in claim 27, selecting a transformant capable of producing said PepC protease, cultivating said transformant in an appropriate growth medium and recovering said PepC protease from said culture.
30. A DNA sequence coding for thepepE gene from A. oryzae (SEQ ID No. 3) or functional alleles thereof.

31. A PepE protease from A. oryzae (SEQ ID No. 4).
32. A process for the production of the PepE protease as claimed in claim 31 comprising transforming a suitable host with a DNA construct comprising a DNA sequence as claimed in claim 30, selecting a transformant capable of producing said PepE protease, cultivating said transformant in an appropriate growth medium and recovering said PepE protease from said culture.
33. The process as claimed in claim 29 or 32, wherein said host is aa Aspergillus as claimed in any of the claims 1 to8.
34. The process as claimed in claim 29, 32 or 33, wherein said host is A. oryzae, and wherein said DNA construct comprising a DNA sequence as claimed in claim 27 or 30, provides for an extra copy of the gene encoding said PepC or PepE protease.


Documents:

2287-mas-1996 abstract-duplicate.pdf

2287-mas-1996 abstract.pdf

2287-mas-1996 assignment.pdf

2287-mas-1996 claims-duplicate.pdf

2287-mas-1996 claims.pdf

2287-mas-1996 correspondence-others.pdf

2287-mas-1996 correspondence-po.pdf

2287-mas-1996 description (complete)-duplicate.pdf

2287-mas-1996 description (complete).pdf

2287-mas-1996 drawings.pdf

2287-mas-1996 form-1.pdf

2287-mas-1996 form-19.pdf

2287-mas-1996 form-26.pdf

2287-mas-1996 form-3.pdf

2287-mas-1996 form-4.pdf

2287-mas-1996 form-6.pdf

2287-mas-1996 petition.pdf


Patent Number 224830
Indian Patent Application Number 2287/MAS/1996
PG Journal Number 49/2008
Publication Date 05-Dec-2008
Grant Date 23-Oct-2008
Date of Filing 17-Dec-1996
Name of Patentee NOVOZYMES A/S
Applicant Address KROGSHOEJVEJ 36, DK-2880 BAGSVAERD,
Inventors:
# Inventor's Name Inventor's Address
1 TOVE CHRISTENSEN NOVO ALLE, DK-2880 BAGSVAERD,
2 JAN LEHMBECK NOVO ALLE, DK-2880 BAGSVAERD,
PCT International Classification Number C12N15/64
PCT International Application Number N/A
PCT International Filing date
PCT Conventions:
# PCT Application Number Date of Convention Priority Country
1 NA