Glucoamylase blends
1.A method for saccharifying a starch substrate comprising contacting the starch substrate with a glucoamylase consisting of Fv-GA (SEQ ID NO:1), at least 85% identity to the sequence of Fv-GA (SEQ ID NO:1), at least 85% identity to the catalytic domain of Fv-GA (SEQ ID NO:1), at least 85% identity to the linker and the catalytic domain of Fv-GA (SEQ ID NO:1), or at least 85% identity to an active fragment of Fv-GA (SEQ ID NO:1) having glucoamylase activity, and contacting the starch substrate with at least one additional glucoamylase.
2. The method of claim 1, wherein the additional glucoamylase comprises Afu-GA (SEQ ID NO:3) or An-GA (SEQ ID NO: 7).
3. The method of claim 1 or claim 2, wherein the additional glucoamylase is Afu-GA (SEQ ID NO:3) or a glucoamylase with at least 85% identity to SEQ ID NO: 3.
4. The method of claim 1 or claim 2, wherein the additional glucoamylase is An-GA (SEQ ID NO:7) or is a glucoamylase with at least 85% identity to SEQ ID NO: 7.
5. The method of any one of claims 1-4, wherein saccharifying the starch substrate results in a high glucose syrup.
6. The method of any one of claims 1-5, wherein the high glucose syrup comprises an amount of glucose selected from the list consisting of: at least 95.5% glucose, at least 95.6% glucose, at least 95.7% glucose, at least 95.8% glucose, at least 95.9% glucose, at least 96% glucose, at least 96.1% glucose, at least 96.2% glucose, at least 96.3% glucose, at least 96.4% glucose, at least 96.5% glucose, and at least 97% glucose.
7. The method of any one of claims 1-6, further comprising fermenting the high glucose syrup to an end product.
8. The method of claim 7, wherein saccharifying and fermenting are carried out as a Simultaneous Saccharification and Fermentation (SSF) process.
9. The method of claim 7 or claim 8, wherein the end product is an alcohol.
10. The method of claim 9, wherein the end product is ethanol.
11. The method of any one of claims 1-10, wherein the starch substrate is about 5% to 99%, 15% to 50%, or 40-99% Dry Solids (DS).
12. The method of any one of claims 1-11, wherein the dosage of each glucoamylase is in a range of about 0.01 to about 10 glucoamylase units (GAU) per gram of dry solid starch (dss).
13. The method of any one of claims 1-12, wherein the starch substrate is selected from the following: wheat, barley, corn, rye, rice, sorghum, bran, cassava, milo, millet, potato, sweet potato, tapioca, and any combination thereof.
14. The method of any one of claims 1-13, wherein the starch substrate comprises liquefied starch, gelatinized starch, or granular starch.
15. The method of any one of claims 1-14, further comprising adding a hexokinase, a xylanase, a glucose isomerase, a xylose isomerase, a phosphatase, a phytase, a pullulanase, a beta amylase, an alpha amylase, a protease, a cellulase, a hemicellulase, a lipase, a cutinase, a trehalase, an isoamylase, an oxidoreductase, an esterase, a transferase, a pectinase, a hydrolase, an alpha glucosidase, a beta glucosidase, or a combination thereof to the starch substrate.
16. The method of claim 15, wherein pullulanase is further added.
Background
Industrial fermentation mainly uses glucose as a raw material for the production of large amounts of proteins, enzymes, alcohols and other chemical end products. Typically, glucose is the product of starch processing, which is conventionally a two-step enzymatic process that catalyzes the breakdown of starch, involving liquefaction and saccharification.
During liquefaction, insoluble granular starch is slurried in water, heat gelatinized, and hydrolyzed by a thermostable a-amylase. During saccharification, the soluble dextrins produced in liquefaction are further hydrolyzed by glucoamylase to produce a high glucose syrup containing greater than 95% glucose.
Glucoamylases are exo-acting carbohydrases capable of hydrolyzing the linear and branched glucosidic linkages of starch (e.g., amylose and amylopectin) to produce fermentable sugars from starch (e.g., an enzyme-liquefying a starch substrate). These fermentable sugars, for example, low molecular weight sugars (such as glucose) can then be converted to fructose by other enzymes (e.g., glucose isomerase); carrying out crystallization; or used in fermentation to produce many end products (e.g., alcohols, monosodium glutamate, succinic acid, vitamins, amino acids, 1, 3-propanediol, and lactic acid).
In view of the core role that glucoamylases play in the production of glucose from starch, it would be advantageous in the art to provide new blends of Glucoamylases (GA) with improved properties for this conversion. Examples of improved properties of GA blends include, but are not limited to: producing sugars, syrups, and biochemicals (including but not limited to ethanol).
Disclosure of Invention
A method for saccharifying a starch substrate is provided, the method comprising contacting the starch substrate with a glucoamylase comprising or consisting of the amino acid sequence of fusarium verticillii glucoamylase (Fv-GA), and further contacting the starch substrate with at least one additional glucoamylase. Additional methods are provided for saccharifying and fermenting a starch substrate using a combination of Fv-GA and at least one additional glucoamylase to produce an end product, a biochemical end product, and a fermented beverage.
The embodiment of the invention comprises the following steps:
1.a method for saccharifying a starch substrate comprising contacting the starch substrate with a glucoamylase consisting of the sequence of Fv-GA (SEQ ID NO:1), having at least 85% identity to the catalytic domain of Fv-GA (SEQ ID NO:1), having at least 85% identity to a linker and the catalytic domain of Fv-GA (SEQ ID NO:1), or having at least 85% identity to an active fragment of Fv-GA (SEQ ID NO:1) having glucoamylase activity, and contacting the starch substrate with at least one additional glucoamylase.
In some embodiments of method 1, the first listed glucoamylase is 90%, 95%, 98%, or 99% identical to the sequence of Fv-GA (SEQ ID NO: 1).
2. The method as described in 1, wherein the additional glucoamylase comprises Afu-GA (SEQ ID NO:3) or An-GA (SEQ ID NO: 7).
3. The method of 1 or 2, wherein the additional glucoamylase is Afu-GA (SEQ ID NO:3) or a glucoamylase with at least 85% identity to SEQ ID NO: 3.
4. The method of 1 or 2, wherein the additional glucoamylase is An-GA (SEQ ID NO:7) or a glucoamylase with at least 85% identity to SEQ ID NO: 7.
5. The method of any one of claims 1-4, wherein saccharifying the starch substrate results in a high glucose syrup.
6. The method of any one of claims 1-5, wherein the high glucose syrup comprises an amount of glucose selected from the list consisting of: at least 95.5% glucose, at least 95.6% glucose, at least 95.7% glucose, at least 95.8% glucose, at least 95.9% glucose, at least 96% glucose, at least 96.1% glucose, at least 96.2% glucose, at least 96.3% glucose, at least 96.4% glucose, at least 96.5% glucose, and at least 97% glucose.
7. The method of any one of claims 1-6, further comprising fermenting the high glucose syrup to an end product.
8. The method of claim 7, wherein saccharifying and fermenting are carried out as a Simultaneous Saccharification and Fermentation (SSF) process.
9. The method of claim 7 or 8, wherein the end product is an alcohol.
10. The method of claim 9, wherein the end product is ethanol.
11. The method of any one of claims 1-10, wherein the starch substrate is about 5% to 99%, 15% to 50%, or 40% -99% Dry Solids (DS).
12. The method of any one of claims 1-11, wherein the dosage of each glucoamylase is in a range of about 0.01 to about 10 glucoamylase units (GAU) per gram of dry solid starch (dss).
13. The method of any one of claims 1-12, wherein the starch substrate is selected from the following: wheat, barley, corn, rye, rice, sorghum, bran, cassava, milo, millet, potato, sweet potato, tapioca, and any combination thereof.
14. The method of any one of claims 1-13, wherein the starch substrate comprises liquefied starch, gelatinized starch, or granular starch.
15. The method of any one of claims 1-14, further comprising adding a hexokinase, a xylanase, a glucose isomerase, a xylose isomerase, a phosphatase, a phytase, a pullulanase, a beta amylase, an alpha-amylase, a protease, a cellulase, a hemicellulase, a lipase, a cutinase, a trehalase, an isoamylase, an oxidoreductase, an esterase, a transferase, a pectinase, a hydrolase, an alpha-glucosidase, a beta-glucosidase, or a combination thereof to the starch substrate.
16. The method as described in 15, wherein pullulanase is further added.
17. A method for saccharifying and fermenting a starch substrate to produce an end product, comprising contacting the starch substrate with: a glucoamylase with at least 85% identity to Fv-GA (SEQ ID NO:1) and a glucoamylase with at least 85% identity to Tr-GA variant CS4(SEQ ID NO: 9).
In some embodiments of method 17, the first listed glucoamylase is 90%, 95%, 98%, or 99% identical to the sequence of Fv-GA (SEQ ID NO: 1). In some embodiments of method 17, the second listed glucoamylase has 90%, 95%, 98%, or 99% identity with the sequence of Tr-GA variant CS4(SEQ ID NO: 9).
18. The method of claim 17, wherein saccharifying and fermenting are performed as a Simultaneous Saccharification and Fermentation (SSF) process.
19. The method of claim 17 or 18, wherein the end product is an alcohol.
20. The method of any one of claims 17-19, wherein the end product is ethanol.
21. The method of claim 20, wherein the saccharified and fermented starch substrate results in a reduced DP4+ level and an increased final concentration of ethanol.
22. The method as described in 21, wherein the reduced level of DP4+ is lower than would be achieved using the same total glucoamylase dose of Fv-GA or Tr-GA variant CS4 under the same saccharification and fermentation conditions, respectively.
23. The method as described in 21, wherein the increased final concentration of ethanol is higher than would be achieved using Fv-GA or Tr-GA variant CS4, respectively, at the same total glucoamylase dose under the same saccharification and fermentation conditions.
24. A method for saccharifying and fermenting a starch substrate to produce a biochemical end product comprising contacting the starch substrate with a glucoamylase having at least 85% identity to Fv-GA (SEQ ID NO:1) and with a glucoamylase having at least 85% identity to Hg-GA (SEQ ID NO:5) under conditions suitable for biochemical fermentation.
In some embodiments of method 24, the first listed glucoamylase is 90%, 95%, 98%, or 99% identical to the sequence of Fv-GA (SEQ ID NO: 1). In some embodiments of method 24, the second listed glucoamylase has 90%, 95%, 98%, or 99% identity to the sequence of Hg-GA (SEQ ID NO: 5).
25. The method of claim 24, wherein the end product is a biochemical selected from the group consisting of: amino acids, organic acids, citric acid, lactic acid, succinic acid, monosodium glutamate, gluconic acid, sodium gluconate, calcium gluconate, potassium gluconate, glucono-delta-lactone, sodium erythorbate, omega-3 fatty acids, butanol, lysine, itaconic acid, 1, 3-propanediol, biodiesel, and isoprene.
26. A method of producing a fermented beverage, wherein the method comprises the steps of: contacting the mash and/or wort with a glucoamylase having at least 85% identity to Fv-GA (SEQ ID NO:1) and contacting the starch substrate with at least one additional glucoamylase.
In some embodiments of method 26, the first listed glucoamylase is 90%, 95%, 98%, or 99% identical to the sequence of Fv-GA (SEQ ID NO: 1).
Drawings
FIG. 1 shows the% glucose at the end of 48hr for the Afu-GA (80. mu.g/g ds), Fv-GA (80. mu.g/g ds), and Afu-GA + Fv-GA (40. mu.g/g ds + 40. mu.g/g ds) blends.
FIG. 2 is a graph based on the data in Table 6 showing the effect of glucoamylase ratio of Afu-GA and Fv-GA on final% glucose after 48hr of saccharification.
Figure 3 shows% glucose at 48hr incubation under two different pH conditions.
FIG. 4 is based on the data in Table 7For OPTIMAXTM4060VHP and% glucose at 72hr additionally blended with glucoamylase Afu-GA or Fv-GA.
FIG. 5 shows% w/v DP4+ at 54hr for CS4+ Fv-GA blends, CS4GA alone, and Fv-GA alone.
FIG. 6 shows% w/v ethanol at 54hr for CS4+ Fv-GA blend, CS4GA, and Fv-GA.
Detailed Description
The invention will now be described in detail by way of reference only, using the following definitions and examples. All patents and publications, including all sequences disclosed within such patents and publications, cited herein are expressly incorporated herein by reference.
Unless defined otherwise herein, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Singleton, et al, Dictionary of Microbiology and Molecular Biology,3rd Ed. [ Dictionary of Microbiology and Molecular Biology,3rd edition ], John Wiley and Sons, Ltd. [ John Willi-Giraffe, Inc. ], New York (2007), and Hale & Marham, The Harper Collins Dictionary of Biology [ Harper Collins Biology Dictionary ], Harper Perennial [ Huper permanent Press ], New York (1991) provide The skilled artisan with a general Dictionary of many of The terms used in The present invention. Although any methods and materials similar or equivalent to those described herein can also be used in the practice or testing of the present invention, the preferred methods and materials are described. Numerical ranges include the numbers defining the range. Unless otherwise indicated, nucleic acids are written from left to right in a 5 'to 3' orientation; amino acid sequences are written from left to right in the amino to carboxyl orientation. In connection with definitions and terminology in this field, the practitioner makes particular reference to Green and Sambrook, Molecular Cloning: Alaborory Manual (Fourth Edition) [ Molecular Cloning: a guide for the experiments (fourth edition), Cold Spring Harbor Laboratory Press 2012[ Cold Spring Harbor Laboratory Press 2012], and Ausubel FM et al, 1993. It is to be understood that this invention is not limited to the particular methodology, protocols, and reagents described herein as these may vary.
The headings provided herein are not limitations of the various aspects or embodiments of the invention which can be had by reference to the specification as a whole. Accordingly, the terms defined immediately below are more fully defined by reference to this specification as a whole.
All publications cited herein are expressly incorporated herein by reference for the purpose of describing and disclosing compositions and methods that might be used in connection with the invention.
I. Definition of
The term "amino acid sequence" is synonymous with the terms "polypeptide", "protein", and "peptide" and is used interchangeably. When such amino acid sequences exhibit activity, they may be referred to as "enzymes". Amino acid sequences represented in the standard amino-to-carboxyl terminal orientation (i.e., N → C) are used using the conventional single or three letter code for amino acid residues.
The term "nucleic acid" encompasses DNA, RNA, heteroduplexes, and synthetic molecules capable of encoding a polypeptide. The nucleic acid may be single-stranded or double-stranded, and may have chemical modifications. The terms "nucleic acid" and "polynucleotide" are used interchangeably. Since the genetic code is degenerate, more than one codon may be used to encode a particular amino acid, and the compositions and methods of the invention encompass nucleotide sequences that encode a particular amino acid sequence. Thus, the present invention contemplates every possible variant nucleotide sequence encoding a GA or amino acid variant thereof, all possible variant nucleotide sequences being possible in view of the degeneracy of the genetic code. Unless otherwise indicated, nucleic acid sequences are presented in a 5 '-to-3' orientation.
Nucleic acid and amino acid sequences referred to herein as "non-naturally occurring" refer to those not found in nature, i.e., are the products of human manipulation and/or synthesis.
As used herein, "glucoamylase" or "GA enzyme" or "GA polypeptide" is defined as an enzyme of the amyloglucosidase class (EC 3.2.1.3, glucoamylase, alpha-1, 4-D-glucan glucohydrolase). These are exo-acting enzymes that catalyze the release of D-glucose from the non-reducing ends of starch and related oligo-and polysaccharides. These enzymes are also capable of hydrolyzing alpha-1, 6 and alpha-1, 3 linkages, although alpha-1, 6 and alpha-1, 3 linkages typically hydrolyze at a much slower rate than alpha-1, 4 linkages.
As used herein, a "variant" of an enzyme, protein, polypeptide, nucleic acid, or polynucleotide means that the variant is derived from a parent polypeptide or parent nucleic acid (e.g., a native, wild-type, or otherwise defined parent polypeptide or nucleic acid) that includes at least one modification or alteration as compared to its parent. Thus, a variant may have several mutations compared to the parent, wherein "several" means from 1 to 10 mutations. For example, a variant having from 1 to 10 amino acid substitutions as compared to SEQ ID NO:1 may be referred to as an Fv-GA variant having several substitutions. The alterations/modifications may include substitutions of amino/nucleic acid residues in the parent for different amino/nucleic acid residues at one or more positions, deletion of an amino/nucleic acid residue (or series of amino/nucleic acid residues) in the parent at one or more positions, insertion of an amino/nucleic acid residue (or series of amino/nucleic acid residues) in the parent at one or more positions, truncation of amino-and/or carboxy-terminal amino acid sequences or 5 'and or 3' nucleic acid sequences, and any combination thereof. The variant Fv-GA enzymes according to some aspects of the invention retain amylolytic activity, but may have altered properties, e.g., improved properties, in some particular aspects. For example, a variant Fv-GA enzyme may have an altered pH optimum, improved thermostability, improved hydrolysis with respect to one or more substrates (e.g., DP1 (e.g., glucose), DP3 (e.g., maltotriose, panose), DP4 (e.g., maltotetraose), and/or amylopectin) or a combination thereof. In certain embodiments, a variant Fv-GA enzyme comprises an amino acid sequence having at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to SEQ ID No. 1, or an enzymatically active fragment thereof. Likewise, variants An-GA, variant Afu-GA, variant Hg-GA, variants of Tr-GA variant CS4, and variants of Tr-GA are contemplated having at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to their respective SEQ ID NOs, or are enzymatically active fragments thereof.
A "combined variant" is a variant that includes two or more mutations, e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30 or more substitutions, deletions, and/or insertions.
As used herein, a "parent" or "parent" polynucleotide, polypeptide, or enzyme sequence (e.g., a "parent Fv-GA enzyme"), or equivalent thereof, refers to a polynucleotide, polypeptide, or enzyme sequence that serves as a starting point or template for designing a variant polynucleotide, polypeptide, or enzyme. It is further noted that the words "parent" and "parental" are used interchangeably in context. As used herein, "parent Fv-GA enzyme" refers to a polypeptide in its mature form that includes an amino acid sequence having at least 60% identity to SEQ ID NO. 4, including amino acid sequences having amylolytic activity that are at least 60%, 70%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identity to SEQ ID NO. 1 (which provides the amino acid sequence of a wild-type GA from a mature form of Fusarium verticillium), or allelic variants or fragments thereof.
The term "wild-type" refers to a naturally occurring polypeptide or nucleic acid sequence, i.e., a polypeptide or nucleic acid sequence that does not include artificial variations.
The term "heterologous" when used in reference to a portion of a nucleic acid indicates that the nucleic acid comprises two or more subsequences that are not normally found in the same relationship to each other in nature. For example, the nucleic acid is typically recombinantly produced, having two or more sequences, e.g., from unrelated genes, arranged to produce a new functional nucleic acid, e.g., a promoter from one source and a coding region from another source. Similarly, a heterologous polypeptide will generally refer to two or more subsequences that are not found in the same relationship to each other in nature (e.g., a fusion polypeptide).
The term "recombinant" when used in reference to, for example, a cell, or nucleic acid, polypeptide or vector, indicates that the cell, nucleic acid, polypeptide or vector has been modified by the introduction of a heterologous nucleic acid or polypeptide or an alteration of a native nucleic acid or polypeptide, or that the cell is derived from a cell so modified. Thus, for example, these recombinant cells express genes that are not found within the native form of the cell (non-recombinant), or express native genes that are otherwise abnormally expressed, under expressed, or not expressed at all. "recombinant" compositions need to be manipulated by humans and thus exclude natural products.
As used herein, the term "isolated" or "purified" refers to a nucleic acid or polynucleotide that is removed from a naturally occurring environment. Typically, in an isolated or purified nucleic acid or polypeptide sample, one or more nucleic acids or polypeptides of interest are present in an increased absolute or relative concentration as compared to their naturally occurring environment. In some cases, the isolated or purified nucleic acid or polynucleotide is synthetically produced.
The term "enriched" when describing a component or material (e.g., a polypeptide or polynucleotide) in a composition, means that the component or material in the composition is present at a relatively increased concentration as compared to the starting composition from which the enriched composition was produced. For example, an enriched GA composition (or sample) is a GA composition (or sample) in which the relative or absolute concentration of GA is increased as compared to the initial fermentation product from the host organism.
As used herein, the term "promoter" refers to a nucleic acid sequence that functions to direct transcription of a downstream gene. Typically the promoter will be appropriate for the host cell in which the target gene is being expressed. Promoters, together with other transcriptional and translational regulatory nucleic acid sequences (also referred to as "control sequences"), are necessary for the expression of a given gene. Generally, transcriptional and translational regulatory sequences include, but are not limited to, promoter sequences, ribosome binding sites, transcriptional start and stop sequences, translational start and stop sequences, and enhancer or activator sequences. A "constitutive" promoter is a promoter that is active under most environmental and developmental conditions. An "inducible" promoter refers to a promoter that is active under environmental or developmental regulation. An example of an inducible promoter useful in the present invention is the Trichoderma reesei (hypocrea jecorina) cbh1 promoter, deposited in the Genbank under accession number D86235. In another aspect, the promoter is a cbh II or xylanase promoter from hypocrea jecorina. Examples of suitable promoters include promoters from: aspergillus awamori or Aspergillus niger glucoamylase gene (Nunberg, J.H. et al (1984) mol.cell.biol. [ molecular and cellular biology ].4, 2306-; a Rhizomucor miehei carboxyprotease gene; hypocrea jecorina glucoamylase I gene (Shoemaker, s.p. et al (1984) european patent application No. EPO0137280a 1); the A.nidulans trpC gene (Yelton, M. et al (1984) Proc. Natl. Acad. Sci. USA [ Proc. Sci. USA ]81, 1470-; the A.nidulans alcA Gene (Lockington, R.A. et al (1986) Gene [ Gene ]33, 137-149); the A.nidulans tpiA gene (McKnight, G.L. et al (1986) Cell [ Cell ]46, 143-147); the Aspergillus nidulans amdS gene (Hynes, M.J. et al (1983) mol.cell Biol. [ molecular and cellular biology ]3, 1430-1439); hypocrea jecorina xln1 gene; hypocrea jecorina cbh2 gene; hypocrea jecorina eg1 gene; hypocrea jecorina eg2 gene; hypocrea jecorina eg3 gene, and promoters from higher eukaryotes, such as the SV40 early promoter (Barclay, S.L. and E.Meller (1983) Molecular and Cellular Biology 3, 2117-2130).
A nucleic acid is "operably linked" to another nucleic acid sequence when the nucleic acid is placed in a functional relationship with the other nucleic acid sequence. For example, if a DNA encoding a secretory leader (i.e., a signal peptide) is expressed as a preprotein that participates in the secretion of a polypeptide, then the DNA encoding the secretory leader (i.e., the signal peptide) is operably linked to the DNA of the polypeptide; a promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to facilitate translation. Generally, "operably linked" means that the DNA sequences being linked are contiguous and, in the case of a secretory leader, contiguous and in reading phase. However, enhancers need not be contiguous. Ligation is achieved by ligation at convenient restriction sites. If such sites are not present, these synthetic oligonucleotide adaptors or linkers are used in accordance with conventional practice. The term "operably linked" promoter refers to a functional linkage between a nucleic acid expression control sequence (e.g., a promoter, or a plurality of transcription factor binding sites) and a second nucleic acid sequence, wherein the expression control sequence directs transcription of the nucleic acid sequence corresponding to the second sequence.
The terms "signal sequence", "signal peptide", "secretory sequence", "secretory peptide", "secretory signal sequence", "secretory signal peptide", and the like, refer to a peptide sequence that is a component of a larger polypeptide that is directed through the secretory pathway of the cell in which the larger polypeptide is synthesized, as well as nucleic acids encoding such peptides. Typically, the larger polypeptide (or protein) is typically cleaved to remove the secretory/signal peptide during transit through the secretory pathway, with the cleaved form of the polypeptide (i.e., the form without the signal/secretory peptide) generally being referred to herein as the "mature form" of the polypeptide. For example, SEQ ID NO:2 provides a signal peptide (i.e., full-length Fv-GA) for the amino acid sequence of GA from Fusarium verticillium (Fv-GA).
As used herein, the term "vector" refers to a nucleic acid construct designed for transfer between different host cells. "expression vector" refers to a vector that has the ability to integrate and express a heterologous DNA segment in a foreign cell. Many prokaryotic and eukaryotic expression vectors are commercially available. The choice of a suitable expression vector is within the knowledge of the person skilled in the art.
Thus, an "expression cassette" or "expression vector" is a nucleic acid construct, produced recombinantly or synthetically, with a series of specific nucleic acid elements that permit transcription of a particular nucleic acid in a target cell. The recombinant expression cassette may be integrated in a plasmid, chromosome, mitochondrial DNA, plastid DNA, virus, or nucleic acid fragment. Typically, the recombinant expression cassette portion of the expression vector includes, among other sequences, the nucleic acid sequence to be transcribed and a promoter.
As used herein, the term "plasmid" refers to a circular double-stranded (ds) DNA construct that forms an extrachromosomal self-replicating genetic element when present in many bacteria and some eukaryotes. Plasmids can be used for any of a number of different purposes, e.g., as cloning vectors, propagation vectors, expression vectors, and the like.
As used herein, the term "selectable marker" refers to a nucleotide sequence or polypeptide encoded thereby that is capable of being expressed in a cell, and wherein expression of the selectable marker in the cell confers the ability to distinguish from cells that do not express the selectable marker. In certain embodiments, the selectable marker allows cells expressing the selectable marker to grow in the presence of the corresponding selective agent or under the corresponding selective growth conditions. In other embodiments, the selectable marker allows cells expressing the selectable marker to be identified and/or isolated from cells not expressing the selectable marker by a physical characteristic (e.g., by differences in fluorescence, immunoreactivity, etc.).
Typically, a nucleic acid molecule encoding an Fv-GA or a variant Fv-GA will hybridize under moderate to high stringency conditions to the wild-type sequence provided herein as SEQ ID NO:2 (the native Fv-GA gene). However, in some cases, an Fv-GA encoding nucleotide sequence is employed that possesses a substantially different codon usage, and the enzyme encoded by the Fv-GA encoding nucleotide sequence has the same or substantially the same amino acid sequence as the native enzyme. For example, depending on the frequency of particular codons utilized by the host (commonly referred to as "codon optimization"), the coding sequence can be modified in order to facilitate faster expression of Fv-GA in a particular prokaryotic or eukaryotic expression system. For example, Te' o, et al (2000), describe the optimization of genes for expression in filamentous fungi. Sometimes, such nucleic acid sequences are referred to as "degenerate" or "degenerate sequences".
A nucleic acid sequence is said to "selectively hybridize" to a reference nucleic acid sequence if the two sequences specifically hybridize to each other under moderate to high stringency hybridization and wash conditions. Hybridization conditions are based on the melting temperature (Tm) of the nucleic acid binding complex or probe. For example, typically "maximum stringency" occurs at about Tm-5 ℃ (5 ℃ below the Tm of the probe); "high stringency" occurs at about 5 ℃ to 10 ℃ below Tm; "moderate" or "moderately stringent" occurs at about 10 ℃ to 20 ℃ below the Tm of the probe; and "low stringency" occurs at about 20 ℃ -25 ℃ below Tm. Functionally, maximum stringency conditions can be used to identify sequences that have stringent or near stringent identity to the hybridization probe; while high stringency conditions are used to identify sequences that have about 80% or more sequence identity with the probe.
Medium and high stringency hybridization conditions are well known in the art (see, e.g., Sambrook, et al, 1989, chapters 9 and 11, and in Ausubel, F.M., et al, 1993, expressly incorporated herein by reference). Examples of high stringency conditions include the following hybridizations: in 50% formamide, 5 XSSC, 5 Xdengue solution, 0.5% SDS and 100. mu.g/ml denatured vector DNA at about 42 ℃, followed by two washes in 2 XSSC and 0.5% SDS at room temperature and two more washes in another 0.1 XSSC and 0.5% SDS at 42 ℃.
As used herein, the terms "transformed," "stably transformed," and "transgenic" with respect to a cell refer to a cell having a non-native (heterologous) nucleic acid sequence integrated into its genome or as an episomal plasmid maintained through multiple generations.
As used herein, the term "expression" refers to the process of producing a polypeptide based on the nucleic acid sequence of a gene. This process typically involves both transcription and translation.
The term "introduced" in the context of inserting a nucleic acid sequence into a cell means "transfection", "transformation" or "transduction" and includes reference to the incorporation of a nucleic acid sequence into a eukaryotic or prokaryotic cell, where the nucleic acid sequence may be incorporated into the genome of the cell (e.g., chromosome, plasmid, plastid or mitochondrial DNA), converted into an autonomous replicon, or transiently expressed (e.g., transfected mRNA).
The term "host cell" refers to a cell that contains a vector and supports the replication, and/or transcription, and translation (expression) of an expression construct. Host cells for use in the present invention may be prokaryotic cells (e.g., E.coli) or eukaryotic cells (e.g., yeast, plant, insect, amphibian, or mammalian cells). In certain embodiments, the host cell is a filamentous fungus.
When an amino acid position (or residue) in a first polypeptide is considered "equivalent" to an amino acid position in a second, related polypeptide, this means that the amino acid position of the first polypeptide corresponds to that in the second, related polypeptide by (i) a primary sequence alignment (see sequence alignment and description of sequence identity below); (ii) structural sequence homology; or (iii) a location indicated by one or more of the similar functional characteristics. Thus, an amino acid position in a first GA enzyme (or variant thereof) can be identified as "equivalent" (or "homologous") to an amino acid position in a second GA enzyme (or even a plurality of different GA enzymes).
Primary sequence alignment: equivalent amino acid positions can be determined using primary amino acid sequence alignment methods, many of which are known in the art. For example, by aligning the primary amino acid sequences of two or more different GA enzymes, it is possible to assign an amino acid position number from one GA enzyme to be equivalent to the position number of another aligned GA enzyme. In this manner, the numbering system derived from the amino acid sequence of one GA enzyme (e.g., the Fv-GA enzyme represented in SEQ ID NO:1) can be used to identify equivalent (or homologous) amino acid residues in other GA enzymes.
Structural sequence homology: in addition to determining "equivalent" amino acid positions using primary sequence alignment methods, "equivalent" amino acid positions can also be defined by determining homology at the secondary and/or tertiary structure level. For example, for glucoamylase, whose tertiary structure has been determined by x-ray crystallography, equivalent residues can be defined as those below: wherein the coordinates of the atoms of two or more backbone atoms of a particular amino acid residue of the glucoamylase are within 0.13nm, preferably within 0.1nm, after alignment with Fv-GA (N to N, CA to CA, C to C, O to O). The alignment is achieved after the best model has been oriented and positioned to give the Fv-GA the maximum overlap of the atomic coordinates of the non-hydrogen protein atoms of the glucoamylase in question. The best model is the crystallographic model that gives the highest available resolution. In the case where two or more different models have equal resolution, the model is used when it has the lowest R-factor for the experimental diffraction data using the following equation.
Similar functional properties: equivalent amino acid residues in a first polypeptide that are functionally similar to specific residues of a second related polypeptide (e.g., a first glucoamylase and Fv-GA) are defined as those amino acids in the first polypeptide that adopt a conformation such that they alter, modify, or contribute to polypeptide structure, substrate binding, or catalysis in a defined manner and are due to the specific residues of the second related polypeptide. When the tertiary structure of a first polypeptide has been obtained by x-ray crystallography, amino acid residues of the first polypeptide that are functionally similar to a second polypeptide occupy similar positions to the extent that the atomic coordinates of at least two side chain atoms of a residue are located at 0.13nm of the corresponding side chain atom of the second polypeptide, although the backbone atom of a given residue may not meet the equivalence criterion based on the occupied homologous positions.
The terms "improved property" or "improved performance" or the like, relative to a variant enzyme (e.g., a GA variant), are defined herein as an improvement in a characteristic or activity associated with the variant enzyme as compared to its respective parent enzyme. Improved properties include, but are not limited to, improved thermostability, or altered temperature-dependent activity profile, improved activity or stability at a desired pH or pH range, improved substrate specificity, improved product specificity, and improved stability in the presence of chemicals or other components in a starch conversion process step, and the like. Improved performance may be determined using one or more specific assays, including but not limited to: (a) expression of (b) hydrolytic activity on a DP2 substrate, (c) hydrolytic activity on a DP7 substrate, (d) hydrolytic activity on a panose substrate, (e) hydrolytic activity on a amylopectin substrate, (f) hydrolytic activity on granular Corn Starch (CS), (g) thermostability, (h) glucose inhibition, (i) reverse activity, and (j) hydrolytic activity on a amylopectin substrate.
The term "improved thermostability" relative to a variant protein (e.g., a GA variant) is defined herein as a variant enzyme that exhibits retention of a greater portion of enzyme activity after incubation for a period of time at an elevated temperature relative to the parent enzyme. Such a variant may or may not exhibit an altered thermal activity profile relative to the parent. For example, a variant may have improved refolding ability after incubation at elevated temperature relative to the parent.
By "improved product specificity" is meant a variant enzyme that exhibits an altered product profile as compared to the parent enzyme, wherein the altered product profile of the variant is improved in a given application as compared to the parent. A "product profile" is defined herein as the chemical composition of the reaction product produced by the enzyme of interest.
By "improved substrate specificity" is meant a variant enzyme that targets a particular substrate (or class of substrates) hydrolyzed differently than the parent enzyme, such that the variant has improved performance in a given application as compared to the parent.
By "improved chemical stability" is meant that the variant enzyme exhibits retention of enzymatic activity after a period of incubation in the presence of one or more chemicals that reduce the enzymatic activity of the parent enzyme under the same conditions. Variants with improved chemical stability are able to catalyze reactions better in the presence of such chemicals than the parent enzyme.
"pH range" in reference to an enzyme refers to the range of pH values at which the enzyme exhibits catalytic activity.
The terms "pH stable" and "pH stability" in reference to an enzyme relate to the ability of the enzyme to retain activity for a predetermined period of time (e.g., 15min., 30min., 1 hour) at a pH value within a wide range.
As used herein, "starch" refers to any material consisting of a complex polysaccharide carbohydrate of plants consisting of a carbohydrate having the formula (C)6H10O5)x(where "X" can be any number) amylose and amylopectin. In particular, the term refers to any plant-based material, including but not limited to, cereals, grasses, tubers, and roots, and more specifically wheat, barley, corn, rye, rice, sorghum, bran, cassava, millet, potatoes, sweet potatoes, and tapioca. "granular starch" refers to uncooked (raw) starch that has not undergone gelatinization, where "starch gelatinization" refers to the solubilization of starch molecules to form a viscous suspension.
"Degree of Polymerization (DP)" refers to the number (n) of anhydroglucopyranose units in a given saccharide. Examples of DP1 are monosaccharides such as glucose and fructose. Examples of DP2 are disaccharides such as maltose and sucrose. DP7 represents a polymer with seven anhydroglucopyranose units.
As used herein, "hydrolysis of starch" and the like refers to the cleavage of glycosidic bonds by the addition of water molecules. Thus, enzymes with "starch hydrolyzing activity" catalyze the cleavage of glycosidic bonds by the addition of water molecules.
As used herein, "fermentable sugar" refers to a sugar that is capable of being metabolized under fermentation conditions. Typically, these sugars are glucose, maltose, and maltotriose (DP1, DP2, and DP 3).
As used herein, "total sugar content" refers to the total sugar content present in the starch composition.
"percent sequence identity" or grammatical equivalents means that a particular sequence has at least a certain percentage of amino acid residue identity to amino acid residues in a designated reference sequence using an alignment algorithm. An example of an algorithm suitable for determining sequence similarity is the BLAST algorithm, described in Altschul, et al, J.Mol.biol. [ J.M. 215: 403-. Software for performing BLAST analysis is publicly available through the National Center for Biotechnology Information (< www (dot) ncbi (dot) nlm (dot) nih (dot) gov >). This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence that either match or satisfy some positive threshold score T when aligned with a word of the same length in a database sequence. These initial neighborhood word hits serve as starting points to find longer HSPs containing them. Word hits are extended in both directions along each of the two sequences being compared until the cumulative alignment score can be increased. The extension of a word hit stops when the cumulative alignment score decreases by an amount X from its maximum achieved value, when the cumulative score is zero or below zero, or when the end of either sequence is reached. The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The BLAST program uses as defaults a wordlength (W) of 11, a BLOSUM62 scoring matrix (see Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA [ Proc. Natl. Acad. Sci. USA ]89:10915(1989)) for comparison (B) of 50, an expectation (E) of 10, M '5, N' -4, and a comparison of the two strands.
The BLAST algorithm then also performs a statistical analysis of the similarity between the two sequences (see, e.g., Karlin & Altschul, Proc. Nat' l. Acad. Sci. USA [ Proc. Natl. Acad. Sci. ]90:5873-5787 (1993)). One measure of similarity provided by the BLAST algorithm is the minimum sum probability (P (N)), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences will occur by chance. For example, an amino acid sequence is considered similar to a protease if the smallest sum probability in a comparison of the test amino acid sequence to the protease amino acid sequence is less than about 0.1, more preferably less than about 0.01, and most preferably less than about 0.001.
Alignment algorithms can also be employed to identify amino acid residues (or residues) in a first amino acid sequence (e.g., Fv-GA) that correspond to amino acid residues (or residues) in a second amino acid sequence (e.g., a homologous GA enzyme from another species, e.g., trichoderma reesei GA).
When problems arise with percent sequence identity or corresponding amino acid residues between two sequences, alignments using the CLUSTAL W algorithm with default parameters will dominate. See Thompson et al (1994) Nucleic Acids Res. [ Nucleic Acids research ]22: 4673-one 4680. The default parameters of the CLUSTAL W algorithm are as follows:
gap opening penalty: 10.0
Gap extension penalty: 0.05
Protein weight matrix: BLOSUM series
DNA weight matrix: IUB
Delayed divergence sequence%: 40
Vacancy separation distance: 8
DNA conversion weight: 0.50
List hydrophilic residues: GPSNDQEKR
Using a negative matrix: closing device
Switch special residue penalties: opening device
Switching hydrophilicity penalties: opening device
The end of handover gap separation penalty is off.
Molecular biology
The embodiments of the invention provide for the expression of a desired glucoamylase (or combination of glucoamylases) from a glucoamylase-encoding nucleic acid under the control of promoter functionality in a host cell of interest (e.g., a filamentous fungus). The present invention thus relies on a number of conventional techniques in the field of recombinant genetics. Basic literature disclosing examples of suitable recombinant genetic methods is noted above.
The present invention contemplates any method known in the art by which mutations can be introduced into a parent nucleic acid/polypeptide.
The present invention relates to the expression, purification, and/or isolation and use of variant GA enzymes. These enzymes can be prepared by recombinant methods using any of a number of glucoamylase genes known in the art (e.g., the Fusarium verticillium glucoamylase gene, including SEQ ID NO: 2). Any convenient method for introducing mutations may be employed, including site-directed mutagenesis. As described above, the mutation (or variation) comprises a substitution, addition, deletion, or truncation that will correspond to one or more amino acid changes in the expressed GA variant. Again, site-directed mutagenesis and other methods of incorporating amino acid changes in expressed proteins at the DNA level can be found in a number of references, such as Green and Sambrook et al, 2012 and Ausubel et al.
DNA encoding amino acid sequence variants of the parent GA are prepared by a variety of methods known in the art. These methods include, but are not limited to, preparation methods by site-directed (or oligonucleotide-mediated) mutagenesis, PCR mutagenesis, and cassette mutagenesis of earlier prepared DNA encoding the parent GA enzyme.
Site-directed mutagenesis is a method that can be employed in the preparation of substitutional variants. This technique is well known in the art (see, e.g., Carter et al, Nucleic Acids Res. [ Nucleic Acids research ]13:4431-4443(1985), and Kunkel et al, Proc. Natl. Acad. Sci. USA [ Proc. Natl. Acad. Sci. USA ]82:488 (1987)). Briefly, in performing site-directed mutagenesis of DNA, the starting DNA is altered by first hybridizing an oligonucleotide encoding the desired mutation to a single strand of such starting DNA. After hybridization, the entire second strand is synthesized using a DNA polymerase, using the hybridized oligonucleotides as primers, and the single strand of the starting DNA as template. Thus, an oligonucleotide encoding the desired mutation is incorporated into the resulting double-stranded DNA.
PCR mutagenesis is also suitable for making amino acid sequence variants of the parent GA. See Higuchi, in PCR Protocols [ PCR Protocols ], pp.177- & 183 (Academic Press, 1990); and Vallette et al, Nuc. acids Res. [ nucleic acid research ]17:723-733 (1989). In short, when a small amount of template DNA is used as starting material in PCR, primers that differ slightly in sequence from the corresponding regions in the template DNA can be used to generate a relatively large number of specific DNA fragments that differ from the template sequence only at the positions where the primers differ from the template.
Another method for preparing variants, cassette mutagenesis, is based on the technique described by Wells et al, Gene [ Gene ]34: 315-. The starting material is a plasmid (or other vector) containing the starting polypeptide DNA to be mutated. One or more codons are identified in the starting DNA to be mutated. Unique restriction endonuclease sites must be present on each side of one or more identified mutation sites. If such restriction sites are not present, they can be generated using the oligonucleotide-mediated mutagenesis methods described above, such that they are introduced at the appropriate location in the starting polypeptide DNA. Plasmid DNA was cleaved at these sites to linearize it. Double-stranded oligonucleotides encoding sequences of DNA between the restriction sites but containing the desired mutation or mutations are synthesized using standard procedures, wherein the two strands of the oligonucleotide are synthesized separately and then hybridized together using standard techniques. The double-stranded oligonucleotide is called a cassette. The cassette is designed to have 5 'and 3' ends compatible with the ends of the linearized plasmid so that it can be ligated directly to the plasmid. The plasmid now contains the mutated DNA sequence.
Alternatively or additionally, a desired amino acid sequence encoding a desired glucoamylase can be determined, and a nucleic acid sequence encoding such an amino acid sequence variant can be synthetically produced.
The desired glucoamylase or glucoamylases so prepared may be further modified, typically depending on the intended use of the glucoamylase. These modifications may involve further changes in the amino acid sequence, fusions to one or more heterologous polypeptides and/or covalent modifications.
GA and variant GA polypeptides and nucleic acids encoding same
Glucoamylases (GA) are produced by a number of bacterial, fungal, yeast, and plant strains. Glucoamylase from a number of fungi is secreted, for example, from Aspergillus (Svensson et al, Carlsberg Res. Commun. [ Carlsberg research Commission ]48: 529-; talaromyces (U.S. Pat. No. 4,247,637; U.S. Pat. No. 6,255,084; and U.S. Pat. No. 6,620,924); rhizopus (Ashikari et al, Agric. biol. chem. [ agricultural and Biochemical ]50:957- & 964 (1986); Ashikari et al, App. Microbio. Biotech. [ applied microbiology and Biotech. [ 32:129- & 133(1989) and U.S. Pat. No. 4,863,864); humicola (WO 05/052148 and U.S. patent No. 4,618,579); and strains of the genus Mucor (Houghton-Larsen et al, applied. Microbiol. Biotechnol. [ applied microorganisms and Biotechnology ]62:210-217 (2003)). Many genes encoding these enzymes have been cloned and expressed in yeast, fungal, and/or bacterial cells.
Commercially, glucoamylases are very important enzymes and have been used in a variety of applications requiring the hydrolysis of starch (e.g., for the production of glucose and other monosaccharides from starch). Glucoamylase is used to produce high fructose corn sweeteners, which account for over 50% of the U.S. sweetener market. Generally, glucoamylases may be, and are typically used with, alpha-amylases in starch hydrolysis processes to hydrolyze starch to dextrins, and then glucose. The glucose can be used directly; conversion to fructose by other enzymes (e.g., glucose isomerase); crystallizing; or for fermentation to produce a number of end products (e.g., ethanol, citric acid, succinic acid, ascorbic acid intermediates, glutamic acid, glycerol, 1, 3-propanediol, and lactic acid).
Glucoamylases can consist of up to three different domains, in all glucoamylases, a structurally conserved catalytic domain of about 450 residues, usually followed by a linker region consisting of between 30 and 80 residues linked to a Starch Binding Domain (SBD) (also known as a carbohydrate binding domain, or CBD) of about 100 residues. The structure of trichoderma reesei glucoamylase (TrGA) with all three regions intact was determined to be 1.8 angstrom resolution. See WO 2009/048488 and WO 2009/048487, incorporated herein by reference. This structure was aligned with the previously determined coordinates of the catalytic domain of glucoamylase from A.awamori strain X100 using the determined coordinates (Aleshin, A.E., Hoffman, C., Firsov, L.M., and Honzatko, R.B. referred crystal structures of glucoamylase from Aspergillus awamori var. X100.[ crystal structure of glucoamylase purification from A.awamori ] J.mol.biol. [ J.M. 238: 575. sup. (1994)). The structures of the catalytic domains of the two glucoamylases overlap very closely and it is possible to identify equivalent residues based on this structural overlap. It is further believed that all glucoamylases share a basic structure.
In view of the well-known structural and functional relationships of glucoamylases, glucoamylase variants with altered properties have been successfully made and characterized. Certain variants exhibit improved properties compared to the parent glucoamylase. Examples of improved properties include increased thermostability and increased specific activity. Methods for making and characterizing trichoderma reesei GA variants with altered properties have been described in WO2009/067218 (incorporated herein by reference).
Described herein are GA enzyme proteins (amino acids) and nucleotide sequences, including:
fusarium verticillium GA (Fv-GA) protein sequence (SEQ ID NO:1)
Fusarium verticillium GA (Fv-GA) DNA nucleotide sequence (SEQ ID NO:2)
Aspergillus fumigatus GA (Afu-GA) protein sequence (SEQ ID NO:3)
Aspergillus fumigatus GA (Afu-GA) nucleotide sequence (SEQ ID NO:4)
Humicola grisea GA (Hg-GA) protein sequence (SEQ ID NO:5)
Humicola grisea GA (Hg-GA) nucleotide sequence (SEQ ID NO:6)
Aspergillus niger GA (An-GA) protein sequence (SEQ ID NO:7)
Aspergillus niger GA (An-GA) nucleotide sequence (SEQ ID NO:8)
Trichoderma reesei GA (Tr-GA) variant CS4 mature protein sequence (SEQ ID NO:9)
Trichoderma reesei GA (Tr-GA) mature protein sequence (SEQ ID NO:10)
Trichoderma reesei GA (Tr-GA) parent protein sequence (SEQ ID NO:11)
Trichoderma reesei GA (Tr-GA) nucleotide sequence (SEQ ID NO:12)
Variant GA enzymes are also described herein. As described herein, these variant GA enzymes have one or more mutations with respect to a parent GA enzyme, wherein the parent GA enzyme has at least 60% (i.e., 60% or greater) amino acid sequence identity with SEQ ID No. 4, including at least 61%, 65%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, up to and including 100% amino acid sequence identity with SEQ ID No. 4. A variant GA enzyme (i.e., having one or more mutations) can have at least 60% (i.e., 60% or greater) amino acid sequence identity to SEQ ID No. 4, including at least 61%, 65%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to SEQ ID No. 4. In certain embodiments, the parent GA is selected from the following: trichoderma strains (e.g., Trichoderma reesei, Trichoderma longibrachiatum, Trichoderma brachiatum, Trichoderma asperellum, Trichoderma koningii, Trichoderma reesei, Trichoderma citrinoviride, and Trichoderma harzianum); aspergillus strains (e.g., Aspergillus aculeatus, Aspergillus niger, Aspergillus nidulans, Aspergillus kawachii, Aspergillus awamori, Aspergillus clavatus, Aspergillus terreus, Aspergillus fumigatus, and Aspergillus oryzae); talaromyces strains (e.g., Talaromyces emersonii, Talaromyces thermophilus, and Talaromyces duponti); trametes strains (e.g., trametes annulata); hypocrea strains (e.g., hypocrea gelatinosum (h.gelatinosa), hypocrea orientalis (h.orientalis), hypocrea virginosa (h.vinosa), hypocrea jecorina, hypocrea schweinitizi (h.schweinitizi), and hypocrea aurantiacus (h.citrina)); fusarium strains (e.g., Fusarium oxysporum, and Fusarium roseum); humicola strains (e.g., Humicola grisea, Humicola insolens, and Humicola lanuginosa); saccharomyces strains (e.g., Saccharomycopsis fibuligera); kebab thermophila; podosporium andersoni (Podospora anderina); or their respective aneuploid, sexual or holotype counterparts.
In some cases, the parent GA is a fusarium verticillium GA (Fv-GA). In addition, the variant GA enzyme has amylolytic activity (or is a variant GA fragment having amylolytic activity), wherein in certain embodiments, the variant GA has improved properties compared to the parent GA (as described in detail herein).
Some aspects of the invention provide a variant of a parent GA enzyme, wherein the variant has starch hydrolyzing activity, has at least 60% (e.g., at least 80%) sequence identity to SEQ ID NO:1, and has at least one improved property compared to the parent GA enzyme, the property selected from the group consisting of: (a) expression of (b) hydrolytic activity on a DP2 substrate, (c) hydrolytic activity on a DP7 substrate, (d) hydrolytic activity on a panose substrate, (e) hydrolytic activity on a amylopectin substrate, (f) hydrolytic activity on granular Corn Starch (CS), (g) thermostability, (h) glucose inhibition, (i) reverse activity, and (j) hydrolytic activity on a amylopectin substrate.
In certain embodiments, the GA variant has at least two, at least three, at least four, at least five, at least six, at least seven, at least 8, at least 9, or more improved properties (selected from the above list) compared to the parent GA enzyme. In certain embodiments, the variant GA enzyme comprises an amino acid mutation at one or more amino acid positions in Fv-GA (as represented in SEQ ID NO: 1). Since some of the parent GA enzymes according to aspects of the invention may not have the same amino acid as the wild-type Fv-GA, the amino acid positions corresponding to the above residues may also be designated by the position numbers alone.
The alignment of amino acid sequences used to determine homology can be determined by using a "sequence comparison algorithm". Optimal alignment of sequences for comparison can be performed, for example, by the local homology algorithm of Smith & Waterman, adv. Appl. Math. [ applied mathematical Advance ]2:482(1981), by the homology alignment algorithm of Needleman & Wunsch, J.mol.biol. [ journal of molecular biology ]48:443(1970), by the similarity search method of Pearson & Lipman, Proc.Nat' l Acad.Sci.USA [ Proc.Natl Acad.Sci.USA ]85:2444(1988), by Computer execution of these algorithms (GAP, BEFIT, FAST, and TFASTA in Wisconsin Genetics Software Package ], by the Computer Science Group [ Genetics Computer Group ], Science [ Dr., Madison, Wis Conn., Inc. [ Chemical Group of Chemical industries ], by the visual inspection of Chemical society, Inc., MOM, Ogawa, or MOM, Ogaku. See also the description of "percent sequence identity" provided in the definitions section above.
Expression of recombinant GA
Aspects of the invention include methods and compositions relating to generative nucleic acids encoding GA enzymes and GA enzyme variants, host cells comprising such nucleic acids, production of GA or GA variants by such host cells, and isolation, purification, and/or use of these GA variants.
Thus, embodiments of the invention provide a host cell that has been transduced, transformed, or transfected with an expression vector comprising a desired GA or GA variant-encoding nucleic acid sequence. For example, a filamentous fungal cell or yeast cell is transfected with an expression vector having a promoter or biologically active promoter fragment or one or more (e.g., a series) of enhancers that function in a host cell line, operably linked to a DNA fragment encoding a desired GA or GA variant, such that the desired GA or GA variant is expressed in the cell line.
A. Nucleic acid constructs/expression vectors.
The natural or synthetic polynucleotide fragment encoding the desired GA or GA variant may be incorporated into heterologous nucleic acid constructs or vectors that are capable of being introduced into, and replicated in, a host cell of interest (e.g., a filamentous fungal cell or a yeast cell). The vectors and methods disclosed herein are suitable for use in host cells for expression of a desired GA or GA variant. Any vector may be used so long as the vector satisfies the desired replication/expression characteristics (such characteristics are generally defined by the user) in the host cell or cells into which it is introduced. Many suitable vectors and promoters are known to those skilled in the art, some of which are commercially available. Cloning and expression vectors are also described in Sambrook et al, 1989, Ausubel FM et al, 1989, and Stratepan et al, 1981, each of which is expressly incorporated herein by reference. Suitable expression vectors for fungi are described in van den Hondel, c.a.m.j.j. et al (1991): bennett, J.W.And and Lasure, L.L, (ed.) More Gene Manipulations in Fungi ] Academic Press 396-428 is described. The appropriate DNA sequence can be inserted into a plasmid or vector (collectively referred to herein as a "vector") by a variety of procedures. Typically, the DNA sequence is inserted into the appropriate restriction endonuclease site or sites by standard procedures. Such procedures and related subcloning procedures are considered to be within the knowledge of those skilled in the art.
A recombinant host cell comprising coding sequences for a desired GA or GA variant can be produced by introducing a heterologous nucleic acid construct comprising the desired GA or GA variant coding sequence into a desired host cell (e.g., as described in further detail below). For example, the desired GA or GA variant coding sequence may be inserted into a suitable vector according to well-known recombinant techniques and used to transform a filamentous fungus capable of GA expression. As mentioned above, due to the inherent degeneracy of the genetic code, other nucleic acid sequences encoding substantially the same or functionally equivalent amino acid sequences can be used to clone and express the desired GA or GA variant. Thus, it is understood that such substitutions in the coding region are within the sequence variants encompassed by the present invention.
The invention also includes a recombinant nucleic acid construct comprising one or more desired GA or GA variant-encoding nucleic acid sequences as described above. These constructs include vectors (e.g., plasmids or viral vectors) into which the sequences of the invention are inserted in either a forward or reverse orientation.
The heterologous nucleic acid construct can include coding sequences for the desired GA or GA variant: (i) in isolated form; (ii) in combination with another coding sequence, such as a fusion polypeptide or signal peptide coding sequence, wherein the desired GA or GA variant coding sequence is the primary coding sequence; (iii) in combination with non-coding sequences, e.g., introns and control elements, such as promoter and terminator elements or 5 'and/or 3' untranslated regions, are effective for expressing a coding sequence in a suitable host; and/or (iv) in a vector or host environment, wherein the desired GA or GA variant coding sequence is a heterologous gene.
In one aspect of the invention, a heterologous nucleic acid construct is used to transfer a desired GA or GA variant encoding nucleic acid sequence into a host cell in vitro, for example into established filamentous fungal and yeast lines. Long-term production of the desired GA or GA variant can be achieved by generating a host cell with stable expression of the GA or GA variant. Thus, any method effective to produce stable transformants can be used in the practice of the present invention.
Suitable vectors are typically equipped with selectable marker-encoding nucleic acid sequences, insertion sites, and suitable control elements (e.g., promoter and termination sequences). The vector may include regulatory sequences, including, for example, non-coding sequences, such as introns and control elements, i.e., promoter and terminator elements or 5 'and/or 3' untranslated regions, effective for expressing the coding sequence in a host cell (and/or in a vector or host cell environment in which the modified soluble protein antigen coding sequence is not normally expressed), operably linked to the coding sequence. Many suitable vectors and promoters are known to those of skill in the art, many of which are commercially available and/or described in Sambrook, et al, (supra).
Examples of suitable promoters include both constitutive and inducible promoters, examples of both promoters include the CMV promoter, the SV40 early promoter, the RSV promoter, the EF-1 α promoter, promoters comprising the Tet Response Element (TRE) in the tet-on or tet-off system (ClonTech and BASF) as described, the β actin promoter, and the metallothionein promoter which can be upregulated by the addition of certain metal salts. Promoter sequences are DNA sequences which are recognized by the particular host cell used for expression purposes. The promoter sequence is operably linked to a DNA sequence encoding a GA or a variant GA polypeptide. Such linkage includes positioning the promoter relative to the start codon of the DNA sequence encoding the GA or variant GA polypeptide in the expression vector such that the promoter can drive transcription/translation of the GA or GA variant coding sequence. The promoter sequence comprises transcriptional and translational control sequences that mediate the expression of the GA or variant GA polypeptide. Examples include genes encoding glucoamylase, alpha-amylase, or alpha-glucosidase from Aspergillus niger, Aspergillus awamori, or Aspergillus oryzae; the aspergillus nidulans gpdA or trpC gene; neurospora crassa cbh1 or trp1 gene; aspergillus niger or Rhizomucor miehei aspartic proteinase coding gene; hypocrea jecorina cbh1, cbh2, egl1, egl2, or other cellulase-encoding gene promoters.
The choice of an appropriate selectable marker will depend on the host cell, and appropriate markers for different hosts are well known in the art. Typical selectable marker genes include argB from Aspergillus nidulans or hypocrea jecorina, amdS from Aspergillus nidulans, pyr4 from Neurospora crassa or hypocrea jecorina, pyrG from Aspergillus niger or Aspergillus nidulans. Additional examples of suitable selectable markers include, but are not limited to, trpc, trp1, oliC31, niaD, or leu2, which are included in heterologous nucleic acid constructs used to transform mutant strains, such as trp-, pyr-, leu-, and the like.
Such selectable markers confer upon the transformants the ability to utilize metabolites that are not normally metabolized by the filamentous fungus. For example, the amdS gene from hypocrea jecorina, which encodes the enzyme acetamidase, allows the transformant cells to grow on acetamide as a nitrogen source. The selectable marker (e.g., pyrG) can restore the ability of the auxotrophic mutant strain to grow on selective minimal medium, or the selectable marker (e.g., lic31) can confer on the transformant the ability to grow in the presence of an inhibitory drug or antibiotic.
The selectable marker coding sequence is cloned into any suitable plasmid using methods commonly used in the art. Examples of suitable plasmids include pUC18, pBR322, pRAX, and pUC 100. The pRAX plasmid contains the AMA1 sequence from aspergillus nidulans, which makes replication possible in aspergillus niger.
The practice of the present invention will employ, unless otherwise indicated, conventional techniques of molecular biology, microbiology, recombinant DNA, and immunology, which are within the skill of the art. These techniques are explained fully in the literature. See, e.g., Sambrook et al, 1989; freshney, 1987; ausubel et al, 1993; and Coligan et al, 1991.
B. Host cells and culture conditions for GA enzyme production
After the DNA sequence encoding the GA or GA variant has been cloned into the DNA construct, the DNA is used to transform a microorganism. The microorganism to be transformed may be selected from a variety of host cells for the purpose of expressing a GA or a variant GA according to the invention. The following sections are provided as examples of host cells/microorganisms and are not meant to limit the scope of host cells that may be employed in practicing aspects of the invention.
(i) Filamentous fungi
Aspects of the invention include filamentous fungi that are modified, selected, and cultured in a manner effective to produce the desired GA or GA variant production or expression, relative to the corresponding untransformed parent filamentous fungus.
Examples of species of parent filamentous fungi that may be treated and/or modified for desired glucoamylase expression include, but are not limited to, trichoderma, penicillium, humicola species (including humicola specialosa); aspergillus species (including Aspergillus niger), Chrysosporium species, myceliophthora species, Fusarium species, Hypocrea species, and Sclerotinia species.
Cells expressing the desired GA or GA variant are cultured under conditions typically used for culturing parental fungal lines. Generally, cells are cultured in standard media containing physiological salts and nutrients, as described, for example, in Pourquie, J.et al, Biochemistry and Genetics of Cellulose Degradation [ Biochemistry and Genetics of Cellulose Degradation ], eds, Aubert, J.P.et al, Academic Press [ Academic Press ], pages 71-86, 1988, and Ilmen, M.et al, appl.Environ.Microbiol [ applied and environmental microbiology ]63: 1298-. Standard culture conditions are known in the art, e.g., cultures are incubated at 28 ℃ in a shake culture or fermentor until the desired level of expression of the desired GA variant is achieved.
Culture conditions for a given filamentous fungus may be found, for example, in the scientific literature and/or from fungal sources such as the American Type Culture Collection (ATCC). After fungal growth has been established, the cells are exposed to conditions effective to cause or allow expression of the desired GA or GA variant.
In cases where the desired GA or GA variant coding sequence is under the control of an inducible promoter, an inducing agent (e.g., a sugar, a metal salt, or an antibiotic) is added to the culture medium at a concentration effective to induce expression of the desired GA or GA variant.
In one embodiment, the strain is an aspergillus niger strain, which is a useful strain for obtaining an overexpressed protein. For example, the A.niger variant A.awamori dgr246 is known to secrete increasing amounts of secreted cellulase (Goedegebauur et al, curr. Genet [ modern genetics ] (2002)41: 89-98). Other strains of A.niger variant A.awamori (e.g. GCDAP3, GCDAP4, and GAP3-4) are known (Ward et al, 1993, appl. Microbiol. Biotechnol. [ applied microbiology & biotechnology ]39: 738-743).
In another embodiment, the strain is a strain of trichoderma reesei, which is a useful strain for obtaining an over-expressed protein. For example, RL-P37 is known to secrete increasing amounts of cellulase enzyme as described by Sheir-Neiss, et al, appl. Microbiol. Biotechnol. [ applied microbiology and biotechnology ]20:46-53 (1984). Functional equivalents of RL-P37 include Trichoderma reesei strain RUT-C30(ATCC No. 56765) and strain QM9414(ATCC No. 26921). It is expected that these strains will also be useful in overexpressing GA or variant GA.
In the absence of potentially deleterious native glucoamylase activity, where it is desired to obtain a GA or GA variant, it is useful to obtain a host cell strain having a deletion of one or more glucoamylase genes prior to introduction of a DNA construct or plasmid comprising a DNA fragment encoding the desired GA or GA variant. Such strains may be prepared in any convenient manner, for example by the methods disclosed in U.S. Pat. No. 5,246,853 and WO 92/06209, the disclosures of which are incorporated herein by reference. By expressing the desired GA or GA variant in a host microorganism that lacks one or more glucoamylase genes (e.g., a glucoamylase gene endogenous to the host cell), identification and subsequent purification procedures are simplified, if desired.
Gene deletion can be accomplished by inserting the form of the desired gene to be deleted or disrupted into a plasmid by methods known in the art. The deletion plasmid is then cleaved at one or more appropriate restriction enzyme sites within the coding region of the desired gene, and the gene coding sequence, or a portion thereof, is replaced with a selectable marker. Flanking DNA sequences (e.g., about 0.5kb to about 2.0kb) from the locus of the gene to be deleted or disrupted can be retained on either side of the selectable marker gene. An appropriate deletion plasmid will typically have unique restriction enzyme sites present therein to enable the fragment comprising the deleted gene (including flanking DNA sequences) and the selectable marker gene to be removed as a single linear fragment.
In certain embodiments, more than one copy of DNA encoding a desired GA or GA variant may be present in the host strain to facilitate overexpression of the GA or GA variant. For example, the host cell may have multiple copies of the desired GA or GA variant integrated into the genome, or alternatively, comprise a plasmid vector capable of autonomous replication in the host organism.
(ii) Yeast
The invention also contemplates the use of yeast as a host cell for the production of the desired GA or GAs. Several other genes encoding hydrolases have been expressed in different strains of the yeast Saccharomyces cerevisiae. These include sequences encoding two endoglucanases from Trichoderma reesei (Penttila et al, 1987), two cellobiohydrolases (Penttila et al, 1988), and a beta-glucosidase (Cummings and Fowler,1996), a xylanase from Brevibacterium sp. (Li and Ljungdahl,1996), an alpha-amylase from wheat (Rothstein et al, 1987), and the like.
(iii) Others
It is further contemplated that in some embodiments, expression systems in host cells other than filamentous fungal cells or yeast cells may be employed, including insect cell or bacterial cell expression systems. Certain bacterial host cells may, for example, be bacterial host cells that are also ethanologenic organisms (e.g., engineered zymomonas mobilis) that are not only capable of expressing one or more enzymes/variants of interest, but are also capable of metabolizing certain monomers and other fermentable sugars for their conversion to ethanol. The choice of host cell may be determined by the wishes of the user of the GA or GA variant described herein, and is therefore not intended to be limiting in this regard.
C. The desired GA-encoding nucleic acid sequence is introduced into a host cell.
The invention further provides cells and cell compositions that have been genetically modified to comprise an exogenously provided nucleic acid sequence encoding a desired GA or GA variant. The parental cell or cell line may be genetically modified (e.g., transduced, transformed, or transfected) with a cloning vector or an expression vector. As further described above, the vector may be, for example, in the form of a plasmid, a viral particle, a phage, or the like.
The transformation methods of the invention may result in the stable integration of all or a portion of the transformation vector into the genome of the host cell. However, transformation of extrachromosomal transformation vectors that result in maintenance of autonomous replication is also contemplated.
Any well-known procedure for introducing an exogenous nucleotide sequence into a host cell can be used. These include the use of calcium phosphate transfection, polybrene, protoplast fusion, electroporation, biolistic methods, liposomes, microinjection, protoplast vectors, viral vectors, and any other well known method for introducing cloned genomic DNA, cDNA, synthetic DNA, or other foreign genetic material into a host cell (see, e.g., Sambrook et al, supra). In essence, the particular genetic engineering procedure used should be capable of successfully introducing a polynucleotide (e.g., an expression vector) into a host cell capable of expressing the desired GA or GA variant.
Many standard transfection methods can be used to generate trichoderma reesei cell lines that express large amounts of heterologous polypeptides. Some disclosed methods for introducing a DNA construct into a strain of trichoderma include: lorito, Hayes, DiPietro and Harman,1993, Curr. Genet. [ modern genetics ]24: 349-356; goldman, VanMettagu and Herrera-Estralla, 1990, Curr. Genet. [ modern genetics ]17: 169-; penttila, Nevalainen, Ratto, Salminen and Knowles,1987, Gene [ Gene ]6: 155-164; for aspergillus: yelton, Hamer and Timberlake,1984, Proc. Natl. Acad. Sci. USA [ Proc. Natl. Acad. Sci ]81: 1470-; for fusarium species: bajar, Podila and Kolattukudy,1991, Proc. Natl. Acad. Sci. USA [ Proc. Natl. Acad. Sci ]88: 8202-8212; for Streptomyces: hopwood et al, 1985, The John Innes Foundation [ John lnnestik ], novitz, uk, and for bacillus: brigidi, DeRossi, Bertarini, Riccaradi and Matteuzzi,1990, FEMS Microbiol. Lett. [ FEMS microbiology letters ]55: 135-. Examples of suitable transformation processes for Aspergillus species can be found in Campbell et al, Improved transformation efficiency of A.niger using homologous niaD genes for nitrate reductase [ Improved transformation efficiency of A.niger using homologous niaD genes for nitrate reductase ]. Curr. Genet [ modern genetics ].16: 53-56; 1989.
In addition, a heterologous nucleic acid construct comprising the desired glucoamylase-encoding nucleic acid sequence may be transcribed in vitro, and the resulting RNA introduced into the host cell by well-known methods (e.g., by injection).
D. Analysis for GA nucleic acid coding sequences and/or protein expression.
To assess the expression of a desired GA variant by a cell line that has been transformed with a nucleic acid construct encoding the desired GA variant, the assay may be performed at the protein level, RNA level, or by using a functional bioassay specifically for glucoamylase activity and/or production.
Typically, assays for analyzing the expression of the desired GA variant include, but are not limited to, northern blotting, dot blotting (DNA or RNA analysis), RT-PCR (reverse transcriptase polymerase chain reaction), or in situ hybridization (using an appropriately labeled probe (based on the nucleic acid coding sequence)) and conventional southern blotting and autoradiography.
Furthermore, the production and/or expression of a desired GA variant can be measured directly in a sample, e.g., by an assay for glucoamylase activity, expression, and/or production. Such assays are described, for example, in Becker et al, Biochem J. [ J. Biochem ] (2001)356:19-30, and Mitsuishi et al, FEBS [ Federation of European Biochem society ] (1990)275:135-138, each of which is expressly incorporated herein by reference. The ability of GA to hydrolyze isolated soluble and insoluble substrates can be measured using assays described in Srisodsuk et al, J.Biotech. [ J.Biotech ] (1997)57:49-57, and Nidetzky and Claeyssens, Biotech.Bioeng. [ Biotechnology and bioengineering ] (1994)44: 961-. Useful substrates for the assay of glucoamylase include soluble starch, amylopectin, DP7, p-nitrophenyl glucoside. In addition, protein expression can be assessed by immunological methods such as ELISA, competitive immunoassay, radioimmunoassay, western blot, indirect immunofluorescence assay, and the like. Some of these assays can be performed using commercially available reagents and/or kits designed to detect GA enzymes. Such immunoassays can be used to qualitatively and/or quantitatively assess the expression of a desired GA or GA variant. The details of these methods are known to those skilled in the art, and many reagents for practicing such methods are commercially available. In certain embodiments, an immunoreagent specific for a desired GA or variant GA enzyme other than its parent GA may be employed, e.g., an antibody specific for a GA substitution or fusion partner (e.g., an N-or C-terminal tag sequence, such as a hexa-histidine tag or a FLAG tag) of a GA or GA variant. Thus, aspects of the invention include the use of purified forms of the desired GA or GA variant to produce monoclonal or polyclonal antibodies specific for the expressed polypeptide for use in different immunoassays. (see, e.g., Hu et al, 1991).
Methods for enrichment, isolation, and/or purification of GA or GA variant polypeptides
Typically, the desired GA or GA variant polypeptide produced in the host cell culture is secreted into the culture medium (producing a culture supernatant comprising the GA or GA variant) and may be enriched, purified, or isolated, e.g., by removing undesired components from the cell culture medium. However, in some cases, the desired GA or GA variant polypeptide can be produced in a cellular form (e.g., cytoplasmic, periplasmic, or other form associated with the cell) that is recovered from the cell lysate/homogenate as needed. Harvesting the desired GA or GA variant polypeptide from the cell or cell supernatant in which the desired GA or GA variant polypeptide is produced using techniques routinely employed by those skilled in the art. Examples include, but are not limited to, filtration (e.g., ultrafiltration or microfiltration), centrifugation, density gradient fractionation (e.g., density gradient ultracentrifugation), affinity chromatography (Tilbeurgh et al, 1984), ion exchange chromatography (Goyal et al, 1991; Friess et al, 1983; Bhikhabhai et al, 1984; Ellouz et al, 1987) (including ion exchange), the use of materials with high resolution (Medve et al, 1998), hydrophobic interaction chromatography (Tomaz and Queiroz,1999), and two-phase partitioning (Brumbauer, et al, 1999).
Although enriched, isolated or purified GA or GA variant polypeptides are sometimes desirable, in some embodiments, the host cell expressing the GA or GA variant polypeptide is used directly in a process requiring glucoamylase activity. Thus, enrichment, isolation or purification of the desired GA or GA variant polypeptide is not always required to obtain a GA or GA variant polypeptide composition that finds use in a desired assay or process that requires or would benefit from glucoamylase activity. In one such example, a yeast cell expressing a GA or GA variant can be added directly to a fermentation process such that the yeast cell expresses the GA or variant GA directly into a fermentation broth, wherein its glucoamylase activity converts the non-fermentable substrate into fermentable sugars for the yeast cell, thereby directly converting into the desired product (e.g., into ethanol) (see, e.g., ilmen et al, High level section of cellobiohydrolases by Saccharomyces cerevisiae [ High level secretion of cellobiohydrolase by Saccharomyces cerevisiae ] Biotechnology for Biofuels [ Biotechnology of Biofuels ]2011,4: 30).
Composition VI
Compositions comprising one or more GAs or variant GAs as disclosed herein are contemplated. These GA and variant GA described herein can be used in enzyme compositions including, but not limited to, starch hydrolysis and saccharification compositions, detergent and detergent compositions (e.g., laundry detergents, dishwashing detergents, and hard surface cleaning compositions), alcohol, and/or biochemical fermentation compositions, and animal feed compositions. In addition, the variant glucoamylases can be used in baking applications, such as bread and cake production, brewing, healthcare, textile, food, environmental waste conversion processes, bio-pulp processing, and biomass conversion applications.
In some embodiments, an enzyme composition comprising one or more GAs or variant GAs encompassed by the present disclosure (e.g., obtained in or recovered and purified from the culture medium) will be used with any one or combination of alpha-amylase, protease, pullulanase, isoamylase, cellulase, hemicellulase, xylanase, cyclodextrin glycosyltransferase, lipase, transferase, phytase, laccase, oxidase, oxidoreductase, esterase, cutinase, granular starch hydrolase, and other glucoamylases.
In some representative compositions, one or more GA or variant GA as described herein will be combined with an alpha amylase, such as a fungal alpha amylase (e.g., derived from a trichoderma species) or a bacterial alpha amylase (e.g., derived from a bacillus species), including variants, chimeras, and hybrids thereof. In certain embodiments, the alpha amylase is an acid stable alpha amylase. In certain embodiments, the alpha amylase is a Granular Starch Hydrolyzing Enzyme (GSHE). Commercially available alpha amylases contemplated for use in the GA or variant GA compositions as described herein are available (e.g., from Danisco US Inc. or Novozymes, usa).
In other embodiments, one or more GA or variant GA as described herein may be combined with other GA (either variant or natural). In some embodiments, the GA of the present disclosure will be combined with: one or more GAs derived from a strain of Aspergillus or a variant thereof, e.g., Aspergillus oryzae, Aspergillus niger, Aspergillus kawachii, and Aspergillus awamori; a glucoamylase derived from a strain of humicola or a variant thereof; a glucoamylase derived from a strain of talaromyces or variants thereof, in particular talaromyces emersonii, talaromyces ankarum; a glucoamylase derived from a strain of athelia or variants thereof, in particular athelia zidoides (a.rolfsii); from the genus penicillium or variants thereof, in particular penicillium chrysogenum; penicillium oxalicum; trametes, Thermomyces, Neurospora, Phanerochaete, Ganoderma, neosartorula, Schizophyllum, Acremonium, Adinandra, Aureobasidium, Gomphostema, Chrysosporium, Coniocaeta, Disporotrichum, Fusarium, Gibberella, Bromophyllum, Pleurotus, Grifola, Mucor, myceliophthora, Sclerotium, Neurospora, Sporoxylum, pachycephalospora, Penicillium, Piperiplophora, Rhizomucor, Rhizopus, Schizophyllum, Tremella, Subulisporia, Scutellaria, Talaromyces, Thermomyces, Thielavia, trametes, Podospora, and Gluconobacter; and a glucoamylase derived from a strain of trichoderma or a variant thereof, particularly trichoderma reesei.
Utility of GA and GA variants
As detailed above, GA is a very important commercial enzyme used in a variety of applications that require hydrolysis of a starch substrate to fermentable sugars (e.g., glucose, maltose, maltotriose, etc.). GA is used in processes for producing high fructose corn sweeteners, which make up over 50% of the us sweetener market, as well as in processes for the direct production of glucose. Generally, glucoamylases may be, and are typically used with, alpha-amylases in starch hydrolysis processes to hydrolyze starch to dextrins, and then glucose. The glucose can be used directly; conversion to fructose by other enzymes (e.g., glucose isomerase); crystallizing; or for fermentation to produce a number of end products (e.g., ethanol, citric acid, succinic acid, ascorbic acid intermediates, glutamic acid, glycerol, 1, 3-propanediol, and lactic acid).
One or more GA or GA variants or combinations thereof may also be effective to directly hydrolyze starch for syrups and/or biochemicals (e.g., alcohols, organic acids, amino acids, other biochemicals, and biomaterials) wherein the reaction temperature is below the gelatinization temperature of the substrate.
In addition, one or more GA or GA variants or combinations thereof may be useful in a one-step enzymatic conversion process, Simultaneous Liquefaction and Saccharification (SLS), to produce high glucose syrups from gelatinized starch using a liquefying α -amylase and a saccharifying glucoamylase at conditions above the starch gelatinization temperature. In the disclosed method, starch undergoes gelatinization, liquefaction, and saccharification simultaneously during simultaneous liquefaction and saccharification. In contrast, granular starch maintains the crystal structure during incubation in the "uncooked" or direct-starch-to-glucose (DSTG) process. Thus, SLS significantly reduces the time required for starch dissolution compared to DSTG, since starch gelatinizes immediately at its operating temperature.
In view of the commercial importance of GA, it will be appreciated that the desired GA or GA variant-encoding nucleic acids, the desired GA or GA variant polypeptides, and compositions comprising the same find utility in a variety of applications. The improved property or properties of these GA or GA variants described herein can be exploited in many ways. For example, at high temperatures (e.g., at temperatures where the parent GA will perform poorly), a GA or GA variant with improved performance under conditions of heat stress may be used to increase the starch hydrolyzing activity in the process being performed, thereby allowing the user to reduce the total amount of GA employed (as compared to the parent GA used). Other improved properties of GA or GA variant polypeptides may be exploited, including GA or GA variants with altered pH optima, increased stability or activity at a particular pH, increased specific activity against a substrate, and/or high levels of expression in a host cell of interest.
A starch hydrolysis composition comprising the desired GA or GA variant as described herein has use in the production of ethanol. The ethanol from this process can be further used as an octane enhancer, or directly used as a fuel instead of gasoline, which is advantageous because ethanol as a fuel source is more environmentally friendly than petroleum-derived products. It is known that the use of ethanol will improve air quality and may reduce local ozone levels and reduce smoke. Furthermore, the utilization of ethanol instead of gasoline can be of strategic importance in mitigating the impact of sudden changes in non-renewable energy and petrochemical supplies.
Separate saccharification and fermentation are processes in which starch present in a feedstock (e.g., corn) is converted to glucose, and then an ethanologenic organism (e.g., a yeast strain) converts the glucose to ethanol. Simultaneous Saccharification and Fermentation (SSF) is a process in which starch present in a feedstock is converted to glucose, and simultaneously and in the same reactor, an ethanologen converts the glucose to ethanol. Thus, the GA or GA variant of the invention has utility in both processes for degrading starch-containing feedstock to produce ethanol.
In some embodiments, a GA or a GA variant as described herein is expressed in an ethanologenic organism, whereby the enzymatic activity of the variant GA expressed by the ethanologenic organism produces glucose, which can be converted to ethanol by the ethanologenic organism.
The GA or GA variant as described herein has use in the production of a host cell for the production of a biochemical product of interest. Thus, aspects of the disclosure include methods of producing a biochemical by obtaining host cells expressing a GA or GA variant as described herein, and culturing these host cells under conditions used to produce the biochemical of interest. It is contemplated that such conditions may include, for example, pH conditions of pH 3-8, and, for example, temperature conditions of 25 ℃ to 95 ℃. The host cell may include additional modifications to facilitate production of the desired biochemical, for example to express homologous or heterologous genes, additional variant genes, and/or to delete or otherwise inactivate expression of one or more endogenous genes. Biochemicals of interest include, but are not limited to, alcohols (ethanol, methanol, butanol, etc.) and other organic compounds, including volatile organic molecules (e.g., isoprene).
It should be noted that GA or GA variants with reduced thermostability have utility, for example, in areas where it is desirable to neutralize enzyme activity at lower temperatures so that other enzymes that may be present are not affected.
One aspect of the invention relates to the use of a GA or GA variant polypeptide according to the invention in the production of a fermented beverage, such as beer. Thus, aspects of the disclosure include methods of providing/producing a fermented beverage comprising the step of contacting a mash and/or wort with a GA or GA variant polypeptide as described herein. Another aspect relates to a method of providing a fermented beverage, the method comprising the steps of: (a) preparing a mash, (b) filtering the mash to obtain a wort, and (c) fermenting the wort to obtain a fermented beverage (such as beer), wherein the GA or GA variant polypeptide is added to: (i) mash in step (a), and/or (ii) wort in step (b), and/or (iii) wort in step (c).
According to yet another aspect, a fermented beverage (e.g., beer) is produced and provided by a method comprising the steps of: (1) contacting the mash and/or wort with a GA or GA variant polypeptide as described herein; and/or (2) (a) preparing a mash, (b) filtering the mash to obtain a wort, and (c) fermenting the wort to obtain a fermented beverage (such as beer), wherein the GA or GA variant polypeptide is added to: (i) the mash of step (a), and/or (ii) the wort of step (b), and/or (iii) the wort of step (c).
Aspects of the invention also include fermented beverages (such as beer) produced using GA or a GA variant as described above.
The term "beer" is meant to include any fermented wort produced by fermenting/brewing starch-containing plant material. Typically, beer is produced from malt or adjunct, or any combination of malt and adjunct, which is a starch-containing plant material.
As used herein, the term "malt" is understood to be any malted cereal grain, such as malted barley or wheat.
As used herein, the term "adjunct" refers to any starch and/or sugar containing plant material that is not malt, e.g., barley or wheat malt. As examples of auxiliary materials, mention may be made of materials which can be used as sources of starch, such as common corn grits, refined corn grits, brewer's milling yeast, rice, sorghum, refined corn starch, barley starch, hulled barley, wheat starch, baked cereals, cereal flakes, rye, oats, corn (maize), potatoes, manioc, cassava and syrups, such as corn syrup, sugar cane syrup, invert syrup, barley and/or wheat syrup, etc.
As used herein, the term "mash" refers to an aqueous slurry of any starch and/or sugar containing plant material such as flour (e.g., including crushed barley malt, crushed barley) and/or other adjuncts or combinations thereof, which is subsequently mixed with water for separation into wort and spent grain.
As used herein, the term "wort" refers to the unfermented liquid effluent (run-off) after extraction of the grist during mashing.
In another aspect, the invention relates to a method of preparing/producing a fermented beverage (e.g. beer) comprising mixing a polypeptide of the invention with malt or an adjunct.
Examples of beer produced according to the above uses and methods (i.e., wherein GA variants as described herein are used) include, but are not limited to, the following: whole malt beer, beer brewed in accordance with the "Reinheitsgegot" method, ale (IPA), lager, ale, low malt beer (Happoshu) (second beer), tertiary beer, dry beer, ale, low beer, low alcohol beer, low calorie beer, porter, bock beer, stout, malt liquor, non-alcohol beer, non-alcohol malt liquor, and the like, and includes alternative cereal and malt beverages, such as fruit flavored malt beverages, such as citrus-flavored, such as lemon, orange, lime or berry-flavored malt beverages, wine-flavored malt beverages, such as vodka, rum or agave-flavored malt liquors, or coffee-flavored malt beverages, such as coffee-flavored malt liquors, and the like.
From the above, it can be seen that GA or GA variant polypeptides (and nucleic acids encoding them) having improved properties compared to their parent GA enzyme have utility in improving any of a variety of assays and processes employing cellobiohydrolases.
Examples of the invention
The invention is described in further detail in the following examples, which are not intended to limit the scope of the invention as claimed in any way. The drawings are intended to be considered as forming a part of the specification and description of the invention. All references cited herein are specifically incorporated herein by reference in their entirety.
Example 1
I. Effect of enzyme concentration on comparison of Aspergillus niger-GA (An-GA) and Fusarium verticillium GA (Fv-GA) during saccharification
Under typical saccharification conditions, a series of experiments were conducted to determine whether the novel blend of glucoamylases (An-GA + Fv-GA) produced higher glucose yields than the final glucose yields of the glucoamylases (An-GA, Fv-GA) alone. Experiments were performed using 33.5% ds corn starch liquefact and adjusting the pH of the liquefact to pH 4.4 using dilute acid. The starch liquefact is placed in a water bath maintained at 60 ℃ and one or more enzymes are added. Only all treatments for Fv-GA had An AkAA acid stable alpha amylase of 0.32SSU, matching the acid stable alpha amylase present in the An-GA commercial product. The starch liquefact was kept at 60 ℃ for 48hr and samples were withdrawn throughout the incubation to analyze the sugar profile. Glucoamylase doses are given as GAU/g DS corn starch and/or μ g/g DS, and the sugar profiles at different intervals are given in tables 1, 2, and 3.
Table 1 shows the results for OPTIMAX at different dosesTML-400 (trade name for glucoamylase An-GA from DuPont Industrial Bioscience). It was observed that An-GA did not reach the end of 48hr at different doses>95.5% glucose.
Table 2 shows the sugar profile for Fv-GA at different doses. It was observed that at different doses, Fv-GA did not reach > 95.5% glucose at the end of 48 hr.
Table 3 shows the results for different blend ratios forAn-GA(OPTIMAXTMSugar profile of the blend of L-400) + Fv-GA, an unexpected increase in final glucose yield of greater than 95.5% was observed.
TABLE 1 different An-GA (OPTIMAX)TML-400) sugar distribution of dose
TABLE 2 sugar profiles for different Fv-GA (Fusarium-GA) doses
TABLE 3 different An-GA (OPTIMAX)TML-400) and Fv-GA (Fusarium verticillium-GA) dose blending ratio
The data shown in tables 1-3 supports the unexpected finding that blends of An-GA + Fv-GA perform better and produce > 95.5% glucose, whereas the use of An-GA and Fv-GA alone did not produce > 95.5% glucose. The enzyme blends found are beneficial to glucose syrup producers, as blending of An-GA + Fv-GA results in higher glucose yields.
Example 2
I. The effect of different glucoamylases (purified glucoamylases from Trichoderma reesei (Tr-GA), Humicola grisea (H-GA), Aspergillus fumigatus (Afu-GA), and Fusarium verticillium (Fv-GA)) with An-GA on glucose production during standard saccharification
Experiments were performed to determine the effect of different glucoamylase blends on glucose production when blended with An-GA. These experiments were performed using 33.5% ds starch liquefact and adjusting the pH of the liquefact to pH 4.4. The starch liquefact is placed in a water bath maintained at 60 ℃ and one or more enzymes are added. The starch liquefact was kept at 60 ℃ for 48hr and samples were withdrawn throughout the incubation to analyze the sugar profile (shown in table 4).
Table 4 shows the different glucoamylase and An-GA (OPTIMAX) additions at a dose of 10. mu.g/g ds (purified glucoamylase protein) for useTML-400) sugar profile of the treatment. Of all the glucoamylases tested, only Fv-GA was effective at 48hr when blended with An-GA, resulting in greater than 95.5% higher glucose production.
TABLE 4 An-GA (OPTIMAX) with different glucoamylasesTML-400) sugar distribution of the blend
Example 3
I. Effect of different glucoamylases (purified glucoamylases from Trichoderma reesei (Tr-GA), Humicola grisea (H-GA), Aspergillus niger (An-GA), and Fusarium verticillium (Fv-GA)) and Aspergillus fumigatus-GA (Afu-GA) on glucose production during standard saccharification
As already shown, a set of experiments was performed to determine if different glucoamylases show similar effect to Afu-GA when blended with An-GA (discussion in example 2). These experiments were performed using 33.5% ds starch liquefact and adjusting the pH of the liquefact to pH 4.4. The starch liquefact is placed in a water bath maintained at 60 ℃ and one or more enzymes are added. Only purified enzyme protein was used for these experiments, so all enzyme treatments had AkAA acid stable alpha amylase at 0.32SSU, matching the commercial An-GA product. The starch liquefact was kept at 60 ℃ for 48hr and samples were withdrawn throughout the incubation to analyze the sugar profile (shown in table 5).
Table 5 shows the glyco-profile of different glucoamylases for Afu-GA added at a dose of 10. mu.g/g ds. Similar to the observations made in example 2 using An-GA, in all glucoamylases tested, only Fv-GA was effective, increasing glucose to greater than 95.5% at 48hr when blended with Afu-GA. This observation shows that Fv-GA enhanced glucose production when blended with An-GA or Afu-GA, whereas other tested glucoamylases had no significant effect when blended with An-GA and Afu-GA.
TABLE 5 sugar distribution of Afu-GA (Aspergillus fumigatus-GA) blends with different glucoamylases
Example 4
Effect of Afu-GA and Fv-GA blends on DP1 (% glucose)
Under typical saccharification conditions, experiments were performed with Afu-GA, Fv-GA, and blends of Afu-GA + Fv-GA to determine the effect on% glucose produced. These experiments were performed using 33.5% ds starch liquefact and adjusting the pH of the liquefact to pH 4.4. The starch liquefact is placed in a water bath maintained at 60 ℃ and one or more enzymes are added. All enzymes treated AkAA acid stable alpha amylase with 0.32 SSU. The starch liquefact was kept at 60 ℃ for 48hr and samples were taken throughout the incubation to analyze the sugar profile (shown in figure 1).
FIG. 1 shows the% glucose at the end of 48hr for the Afu-GA (80. mu.g/g ds), Fv-GA (80. mu.g/g ds), and Afu-GA + Fv-GA (40. mu.g/g ds + 40. mu.g/g ds) blends. The data in the figure shows that the blend of Afu-GA + Fv-GA successfully achieved the highest% glucose at 48hr compared to when Afu-GA and Fv-GA were used alone at equal total glucoamylase protein doses.
Example 5
Effect of different blending ratios of Afu-GA and Fv-GA
Under typical saccharification conditions, experiments were conducted with different blending ratios of Afu-GA and Fv-GA to determine the optimal ratio of glucoamylase (protein) for maximum glucose production. These experiments were performed using 33.5% ds starch liquefact and adjusting the pH of the liquefact to pH 4.4. The starch liquefact is placed in a water bath maintained at 60 ℃ and one or more enzymes are added. Only purified enzyme protein was used in this set of experiments, so all enzymes treated AkAA acid stable alpha amylase with 0.32SSU to match the acid stable alpha amylase present in the An-GA commercial product. The starch liquefact was kept at 60 ℃ for 48hr and samples were withdrawn throughout the incubation to analyze the sugar profile (shown in table 6).
Table 6 shows the carbohydrate profile for the novel blends of Afu-GA and Fv-GA with different protein ratios. When Afu-GA was used alone at 80. mu.g/g ds, 95.5% glucose was not reached at 48 hr. But when the Afu-GA and Fv-GA were blended at different ratios (keeping the total protein constant at 80. mu.g/g ds), all blends reached > 95.5% glucose at 48 hr. Thus the blend of Afu-GA + Fv-GA at equivalent total glucoamylase protein resulted in the highest level of glucose production compared to Afu-GA itself.
TABLE 6 sugar distribution for Afu-GA and Fv-GA blends at different protein ratios
FIG. 2 is a graph based on the data in Table 6 showing the effect of glucoamylase ratio of Afu-GA and Fv-GA on final% glucose after 48hr of saccharification. It was found that the ideal blend to optimize glucose production was a moderate level of protein ratio, shown as 40+ 40. While all blends reached > 95.5% glucose at 48hr, it was unexpectedly found that for Afu-GA and FV-GA, the protein ratio shown as a moderate level of 40+40 reached > 96% glucose at 48 hr.
Example 6
I. Effect of different blending ratios of Afu-GA and Fv-GA under different pH conditions
A set of experiments was performed with blends of Afu-GA + Fv-GA and compared with the results for the commercial OPTIMAX4060 VHP. These experiments were performed using 33.5% ds starch liquefact and adjusting the pH of the liquefact to pH 4.4 and/or 5.5. The starch liquefact is placed in a water bath maintained at 60 ℃ and one or more enzymes are added. All enzyme treatments with non-commercially purified enzymes for AkAA acid stable alpha amylases with 0.32SSU matched this commercial product. The starch liquefact was kept at 60 ℃ for 48hr and samples were taken throughout the incubation to analyze the sugar profile (shown in figure 3).
Figure 3 shows% glucose at 48hr incubation under two different pH conditions. The commercial product, OPTIMAXTM4060VHP performs reasonably at pH 4.5 and is produced>95.5% glucose, but performed poorly at pH 5.5 and reached only 90.93% glucose at 48 hr. The OPTIMAXTM4060 the VHP comprises Aspergillus niger glucoamylase, acid stable alpha amylase, and pullulanase, which contribute to 95.5% glucose at pH 4.5, but at elevated pH (e.g., at elevated pH)>5.0) because pullulanase is inactivated at higher pH values. This deactivation results in a decrease in the performance of the OPTIMAX4060VHP product at elevated pH. The Afu-GA + Fv-GA blend maintains performance over a wide range of pH (pH 4.5 to 5.5) and consistently achieves performance>95.5% glucose.
Example 7
I. Effect of debranching enzyme, Pullulanase (PU) on a range of blending ratios of Afu-GA and Fv-GA.
Using a commercial product OPTIMAXTM4060VHP A set of experiments was performed as it already contains a pullulanase debranching enzyme that helps it to reach 95.5% glucose. Except for the standard 0.16GAU/g ds dose of OPTIMAXTM4060VHP, two different glucoamylases (Afu-GA and Fv-GA) are blended. Additional glucoamylases Afu-GA and Fv-GA were added at 10. mu.g/g ds.
Table 7 shows OPTIMAXTM4060VHP and sugar profile of treatments in which standard OPTIMAX was excludedTM4060VHP dose, additional glucoamylase Afu-GA or Fv-GA was added. Standard dose of OPTIMAX within 72hrTM4060VHP reaches 95.52% glucose. The addition of glucoamylase Afu-GA did not increase% glucose over 72hr of saccharification time and reached OPTIMAXTM4060VHP standard dose similar 95.6% glucose.
TABLE 7 OPTIMAXTM4060 sugar profile of VHP and additionally blended glucoamylase Afu-GA or Fv-GA.
FIG. 4 shows OPTIMAX based on the data in Table 7TM4060VHP and% glucose at 72hr additionally blended with glucoamylase Afu-GA or Fv-GA.
When 72hr later, each test condition was reached>OPTIMAX at 95% glucoseTMThe combination of 4060VHP + Fv-GA had an unexpectedly improved effect of increasing% glucose to 96.35% compared to OPTIMAX alone observedTM4060VHP standard dose or OPTIMAXTM4060VHP + Afu-GA treatment was significantly higher.
Example 8
Impact of fv-GA and additional GA blends on Simultaneous Saccharification and Fermentation (SSF) in corn for ethanol processes.
Corn liquefact is obtained from a commercial fuel alcohol producer. The corn liquefact has a specification of 32.84% ds and a pH of 3.5. The pH of the corn liquefact was adjusted to 4.5 with 4N sodium hydroxide. The experimental SSF process was performed simultaneously in 250ml Erlenmeyer flasks in multiple replicates, using 100g of liquefact in each flask. An appropriate amount of enzyme was added to the liquefied product together with dry yeast and 600ppm urea. The dried yeast was allowed to hydrate for about 10 minutes before being added to the erlenmeyer flask. The enzyme dosages for these tests were 1) the Tr-GA variant CS4 ("CS 4"), at 25 μ g g ds, and Fv-GA at 25 μ g ds; 2) CS4GA at 50. mu.g ds, and 3) Fv-GA at 50. mu.g ds. These flasks were covered with rubber stoppers and placed in a pressurized air flow incubator at 32 ℃ and shaken at 200rpm for 54 hr. Samples were withdrawn at regular intervals for ethanol measurements.
The ethanol data provided in table 8 shows that for the Fv-GA plus CS4GA blend, the ethanol ratio is higher, and the final ethanol concentration is also highest in the comparative treatments. Blends of GA added at equal total protein were better than GA addition alone at equal total protein loading. This shows the synergistic effect of Fv-GA with other GAs.
Similar effects were observed for DP4+ reduction, since the Fv-GA plus CS4 blend reached the lowest DP4+ compared to GA addition alone at equal total protein addition.
Lower DP4+ and higher ethanol are significant to commercial ethanol producers because this advantage explains the higher conversion of substrate to the desired end product (e.g., ethanol).
Table 8 for CS4+ Fv-GA blends, CS4GA, and Fv-GA,% w/v ethanol and% w/v DP4+ profiles show lower% w/v DP4+ and higher% w/v ethanol for this GA blend.
FIG. 5 shows% w/v DP4+ at 54hr for CS4+ Fv-GA blends, CS4GA alone, and Fv-GA alone. It is clear that the lower% w/v DP4+ is advantageously derived from GA blends. As shown in fig. 5, a lower DP4+ of about 0.3% w/v was produced for the GA blend compared to either CS4GA alone or Fv-GA. Under similar conditions, CS4GA alone reduced DP4+ to 1.03% w/v, while the novel GA blend reduced DP4+ to 0.74% w/v, which is a 25.2% reduction in DP4 +. Under similar conditions, use of the blend may result in a 1.03x reduction in DP4 +.
FIG. 6 shows% w/v ethanol at 54hr for CS4+ Fv-GA blend, CS4GA, and Fv-GA, with higher% w/v ethanol produced for the GA blend. As shown, the% w/v ethanol was about 0.2% higher for the GA blends compared to either CS4GA alone or Fv-GA. Under similar conditions, CS4GA alone achieved an ethanol content of 14.98% v/v, while the novel GA blend achieved an ethanol content of 15.26% v/v, which is an overall increase in ethanol yield of 1.86% over that achieved with CS4GA alone. Under similar conditions, use of the blend can result in an increase of 1.018x in ethanol production.
Example 9
I. Effect of Fv-GA and additional GA blends on saccharification under biochemical fermentation conditions at pH 7.0 and at 37 ℃.
Experiments were performed with Afu-GA, HGA, TrGA, Fv-GA, and HGA + Fv-GA blends to determine the effect of GA enzyme alone and GA blends on the final% glucose. These experiments were performed using 33.5% ds starch liquefact and adjusting the pH of the liquefact to pH 7.0. The starch liquefact is placed in a water bath maintained at 37 ℃ and one or more enzymes are added. The starch liquefact was kept at 37 ℃ for 72hr and samples were taken throughout the incubation to analyze the sugar profile.
The data in Table 9 show that Fv-GA added at 80. mu.g/g ds is much faster and reaches the highest final% glucose compared to other GAs (also added at 80. mu.g/g ds). Furthermore, Fv-GA added at 80. mu.g/g ds was slightly better than HGA added at 160. mu.g/g ds.
The data also show that the blend of Fv-GA added at 40. mu.g/g ds and HGA added at 160. mu.g/g ds is better than HGA added at 320. mu.g/g ds (Table 10). This shows a significant and unexpected synergy between Fv-GA and HGA. The blend of Fv-GA and HGA (total protein addition of 200. mu.g/g ds with Fv-GA added at 40. mu.g/g ds and HGA added at 160. mu.g/g ds) achieved a lower amount of DP4+ and the highest amount of% DP1 compared to HGA added even at 320. mu.g/g ds.
TABLE 9 sugar distribution of Afu-GA, Fv-GA, HGA, and TrGA at 80. mu.g/g ds protein dose during saccharification under biochemical fermentation conditions at 37 ℃ and pH 7.0
TABLE 10 sugar profiles of the Fv-GA + HGA blends and HGAs at higher doses compared to the higher doses of HGA during saccharification under biochemical fermentation conditions at 37 ℃ and pH 7.0 show that the Fv-GA plus HGA blends perform better.
Example 10
Effect of fv-GA and additional GA blends on the Direct Starch To Glucose (DSTG) Process
Experiments were performed with Afu-GA, Fv-GA, and blends of Afu-GA + Fv-GA to determine the effect of the enzymes and enzyme blends on the final% solubility and% glucose. Experiments were performed using 32% ds raw tapioca starch, and the pH of the tapioca starch slurry was maintained at pH 5.2. These enzymes were added as follows: 1) Afu-GA 2 at 80. mu.g/g ds) Afu-GA + at 40. mu.g/g ds plus Fv-GA at 40. mu.g/g ds, and 3) Fv-GA at 80. mu.g/g ds. All treatments also included the 10AAU alpha-amylase product SPEZYME(DuPont corporation). The slurry was mixed at room temperature to blend the added enzymes. For each treatment, 35g aliquots were weighed in 2 replicates into LABOMAT steel beakers. The metal beaker was mounted back to LABOMAT for incubation at 60 ℃ for 72hr with constant mixing at 60rpm clockwise and counter-clockwise. Samples were withdrawn throughout the incubation to analyze the sugar profile.
The data in Table 11 show that the Fv-GA plus Afu-GA blend reached a solubility of 96.51% in 72hr, and that addition of either Afu-G or Fv-GA alone reached solubilities of 93.05% and 94.89%, respectively. Also, for the Fv-GA and Afu-GA blends, the% DP1 and% DP4+ values were 96.51% and 0.71%, respectively. This shows that the Fv-GA plus Afu-GA blend unexpectedly reached the lowest amount of DP4+ and the highest amount of% DP1, compared to either Afu-GA or Fv-GA added alone.
Table 11 sugar profiles and% solubilities of AfuGA, Fv-GA + Afu-GA blends, and Fv-GA show that the Fv-GA plus AfuGA blends perform better than the addition of Afu-GA and Fv-GA alone during DSTG of tapioca starch at 60 ℃ and pH 5.2.
Sequence listing
SEQ ID NO:1
Fv-GA protein sequence:
MFTQILYGLTALSALQGQVTASPGGSSLDRFISKEADISIKGVLANIGADGKRAQGAAPGAVVASPSRTDPDYWYTWTRDSALTYKVLVERFIHGDKSLQRKIDEYVSAQAKLQGVTNPSGGPESGGLGEPKFHVNLTAFTGSWGRPQRDGPPLRATALTLYANWLVSHGDRSKAVNKVWPVIEKDLAYTVKFWNRTGYDLWEEVNGSSFFTLSASHRALVEGAALAKKLGKSCSDCATNAPRVLCFMQSFWTGSYIDSNINVNDGRKGLDANSILSSIHTFDPSSKCTDSTFQPCSSRALANHKEVVDSFRSIYGVNKNRGKGKAAAVGRYSEDVYYDGNPWYLATLAAAEQLYAAVYQWNKIGSITVDSVSLPFFSDLVPKVSKGTYRKNSKTYKAIIKAVTSYADGFVAVVQTYTPKDGSLAEQFDKSTGTPKSAVHLTWSYASFVGAAERRTGVVPPAWGESNANKVPAVCEAAPACDTTITFNVKNVDVTSDQKVYIVGGITQLSNWAPADGIALEESTSTKGLWTVKVKIPSDTSFEYKYIKKTSDGTVTWESDPNNSAATGSKCGSSSTINDEWR
SEQ ID NO:2
Fv-GA DNA nucleotide sequence:
ATGTTTACTCAGATCTTGTATGGCCTCACGGCCTTATCTGCCCTCCAAGGTCAGGTCACTGCATCACCAGGCGGTTCCAGCCTTGATCGCTTCATTTCCAAAGAGGCCGACATCTCCATCAAGGGCGTGCTTGCCAATATTGGCGCCGATGGCAAGCGAGCCCAAGGCGCTGCACCTGGCGCCGTTGTAGCAAGTCCATCGAGAACAGATCCAGACTGTAAGTCAATATTCTTGATAGTATCGCAAATGGATGACTGAGACGTTTTCTAGACTGGTACACATGGACCCGAGACTCCGCCTTGACATACAAAGTCCTTGTTGAGCGCTTCATTCACGGAGACAAGTCTCTCCAGCGCAAGATCGACGAATATGTCTCTGCCCAAGCCAAGCTCCAGGGCGTCACCAACCCATCTGGCGGCCCCGAGTCAGGCGGCCTTGGGGAACCCAAGTTTCACGTCAACCTCACAGCTTTCACAGGATCCTGGGGTCGTCCTCAACGAGATGGCCCTCCTCTGAGAGCTACTGCATTGACGCTGTACGCCAACTGGCTTGTTTCTCACGGCGACCGCTCCAAGGCCGTCAACAAGGTTTGGCCTGTCATTGAGAAGGATCTTGCATACACCGTCAAGTTCTGGAACAGAACCGGTTACGATCTTTGGGAGGAGGTTAACGGATCTTCGTTCTTCACCCTCTCTGCTTCACATCGTGCTCTGGTTGAGGGAGCTGCTCTTGCTAAGAAGCTTGGCAAGTCTTGCTCCGACTGCGCAACCAACGCCCCCCGTGTTCTCTGCTTCATGCAAAGCTTCTGGACCGGCAGCTACATCGACTCGAACATCAATGTCAACGATGGCCGCAAGGGTCTTGATGCCAACTCCATTCTGTCTTCTATTCACACCTTTGATCCTTCTTCGAAGTGCACAGACTCTACCTTCCAGCCTTGTTCTTCAAGGGCTCTTGCGAACCACAAGGAAGTAGTGGACTCTTTCCGCTCCATCTATGGTGTCAACAAAAACAGAGGTAAAGGTAAAGCTGCTGCTGTCGGTCGATACAGTGAGGATGTGTACTACGACGGTAACCCTTGGTACTTGGCTACTCTTGCTGCTGCCGAGCAACTGTACGCTGCTGTCTATCAATGGAACAAGATCGGTTCCATCACAGTTGATAGTGTGTCGCTCCCCTTTTTCAGTGACCTTGTACCAAAGGTTTCCAAGGGAACCTATCGCAAGAACAGCAAGACATACAAGGCTATTATCAAGGCTGTCACTTCATACGCTGATGGCTTTGTCGCCGTTGTGCAGACCTATACTCCCAAAGATGGCTCCCTCGCAGAGCAGTTCGATAAGTCTACTGGAACTCCCAAGTCAGCTGTTCACCTAACCTGGTCCTACGCCTCCTTTGTCGGTGCTGCCGAGCGTCGTACTGGCGTCGTTCCTCCAGCTTGGGGCGAGAGCAACGCCAACAAGGTGCCTGCTGTTTGCGAAGCAGCTCCAGCCTGCGACACCACCATCACGTTCAATGTGAAGAACGTTGATGTCACGTCGGACCAGAAGGTTTACATTGTTGGCGGGATCACTCAACTTTCCAACTGGGCCCCTGCTGACGGCATTGCGCTTGAGGAATCCACGAGCACCAAGGGCTTGTGGACTGTGAAGGTCAAGATTCCATCTGATACCAGCTTTGAGTATAAGTATATAAAGAAGACGAGTGATGGAACTGTTACATGGGAGAGTGACCCCAATAACAGTGCGGCGACGGGCAGCAAGTGCGGAAGCAGCAGTACCATCAACGATGAGTGGAGGTAA
SEQ ID NO:3
Afu-GA protein sequence:
MPRLSYALCALSLGHAAIAAPQLSARATGSLDSWLGTETTVALNGILANIGADGAYAKSAKPGIIIASPSTSEPDYYYTWTRDAALVTKVLVDLFRNGNLGLQKVITEYVNSQAYLQTVSNPSGGLASGGLAEPKYNVDMTAFTGAWGRPQRDGPALRATALIDFGNWLIDNGYSSYAVNNIWPIVRNDLSYVSQYWSQSGFDLWEEVNSMSFFTVAVQHRALVEGSTFAKRVGASCSWCDSQAPQILCYMQSFWTGSYINANTGGGRSGKDANTVLASIHTFDPEAGCDDTTFQPCSPRALANHKVYTDSFRSVYAINSGIPQGAAVSAGRYPEDVYYNGNPWFLTTLAAAEQLYDAIYQWKKIGSISITSTSLAFFKDIYSSAAVGTYASSTSTFTDIINAVKTYADGYVSIVQAHAMNNGSLSEQFDKSSGLSLSARDLTWSYAAFLTANMRRNGVVPAPWGAASANSVPSSCSMGSATGTYSTATATSWPSTLTSGSPGSTTTVGTTTSTTSGTAAETACATPTAVAVTFNEIATTTYGENVYIVGSISELGNWDTSKAVALSASKYTSSNNLWYVSVTLPAGTTFEYKYIRKESDGSIVWESDPNRSYTVPAACGVSTATENDTWQ
SEQ ID NO:4
Afu-GA nucleotide sequence:
atgcctcgcctttcctacgcgctctgtgcgctgtctctcgggcatgctgctattgcagctcctcagttatccgctcgtgctaccggcagcttggactcctggttgggtactgagaccaccgttgcgctcaatggtattctggccaacatcggtgccgacggtgcttatgcgaagagcgctaagcctggcataatcattgccagtccgagcaccagcgaaccagactgtgagaaccttcctgaactggccctgtccggcagtcattgacctcggtagactactatacctggacgagagatgctgctctcgtcacgaaagtcctggtcgacctcttccgcaacggcaacctgggtctgcagaaagtcattaccgaatacgtcaactctcaggcgtacttgcagaccgtgtctaatccgtcgggtggtcttgcgagcggaggtctcgcggagcctaagtacaacgtcgacatgacggcctttaccggagcctggggtcgtcctcagcgtgatggtccggctctgcgggccaccgccctcatcgactttggcaactggctgattgtatgttctccatacgagccccaggaagcgttgctgacgtctacaggacaacggctactccagctatgctgtcaacaacatctggcccattgtgcgcaacgacttgtcctacgtttctcagtactggagccagagtggctttggtgagtcccgactctctggaagtttacaacgtgcatcgattactgacaattgagattctacgtgacagatctctgggaagaagtcaactccatgtccttcttcaccgtcgctgtccagcaccgtgccctcgtggagggaagcacgttcgctaaacgggtgggagcgtcgtgctcgtggtgtgactcgcaggccccccagatcctctgctacatgcagagtttctggactggctcgtatatcaacgccaacaccggtggtggccggtccggcaaggatgccaacaccgtcctcgccagcatccataccttcgaccccgaagccggctgcgacgatactactttccagccctgctctcctcgggcccttgccaaccacaaggtgtacaccgattcgttccgctcggtctacgcgatcaactccggcatcccacagggcgctgccgtttccgctggccgctaccccgaggacgtctactacaacggcaacccttggttcctcaccaccctcgccgctgccgagcagctctacgacgctatctaccagtggaagaagatcggttccatcagcatcaccagcacctccctcgccttcttcaaggacatctacagctccgccgcggtcggcacctacgcctctagcacctccaccttcacggacatcatcaacgcggtcaagacctacgcagacggctacgtgagcatcgtccaggcacacgccatgaacaacggctccctttcggagcaattcgacaagtcctctgggctgtccctctccgcccgcgatctgacctggtcctacgccgctttcctcaccgccaacatgcgtcgtaacggcgtggtgcctgccccctggggcgccgcctccgccaactccgtcccctcgtcttgctccatgggctcggccacgggcacctacagcaccgcgacagccacctcctggcccagcacgctgaccagcggctcgccaggcagcaccaccaccgtgggcaccacgaccagtaccacctctggcaccgccgccgagaccgcctgtgcgacccctaccgccgtggccgtcacctttaacgagatcgccaccaccacctacggcgagaatgtttacattgttgggtccatctccgagctcgggaactgggataccagcaaagcagtggccctgagtgcgtccaagtatacctccagcaataacctctggtacgtgtccgtcaccctgccggctggcacgacattcgagtacaagtatatccgcaaggaaagcgatggctcgatcgtgtgggagagtgaccccaaccgctcgtatacggtgccggcagcttgtggagtgtctactgcgaccgagaatgatacttggcagtga
SEQ ID NO:5
Hg-GA protein sequence:
MHTFSKLLVLGSAVQSALGRPHGSSRLQERAAVDTFINTEKPIAWNKLLANIGPNGKAAPGAAAGVVIASPSRTDPPYFFTWTPDAALVLTGIIESLGHNYNTTLQTVIQNYVASQAKLQQVSNPSGTFADGSGLGEAKFNVDLTAFTGEWGRPQRDGPPLRAIALIQYAKWLIANGYKSTAKSVVWPVVKNDLAYTAQYWNETGFDLWEEVPGSSFFTIASSHRALTEGAYLAAQLDTECPPCTTVAPQVLCFQQAFWNSKGNYVVSTSTAGEYRSGKDANSILASIHNFDPEAGCDNLTFQPCSERALANHKAYVDSFRNLYAINKGIAQGKAVAVGRYSEDVYYNGNPWYLANFAAAEQLYDAIYVWNKQGSITVTSVSLPFFRDLVSSVSTGTYSKSSSTFTNIVNAVKAYADGFIEVAAKYTPSNGALAEQYDRNTGKPDSAADLTWSYSAFLSAIDRRAGLVPPSWRASVAKSQLPSTCSRIEVAGTYVAATSTSFPSKQTPNPSAAPSPSPYPTACADASEVYVTFNERVSTAWGETIKVVGNVPALGNWDTSKAVTLSASGYKSNDPLWSITVPIKATGSAVQYKYIKVGTNGKITWESDPNRSITLQTASSAGKCAAQTVNDSWR
SEQ ID NO:6
Hg-GA nucleotide sequence:
atgcataccttctccaagctcctcgtcctgggctctgccgtccagtctgccctcgggcggcctcacggctcttcgcgtctccaggaacgcgctgccgttgataccttcatcaacaccgagaagcccatcgcatggaacaagctgctcgccaacatcggccctaacggcaaagccgctcccggtgccgccgccggcgttgtgattgccagcccttccaggacggaccctccttgtacgtggtggcatggaatggacccaagagactgttttagatgaaagagagtttctgctaaccgccacacccagacttcttcacctggaccccggatgccgccctggtcctcaccggcatcatcgagtcccttggccacaactacaacaccaccctgcagaccgtcatccagaactacgtcgcgtcgcaggccaagctgcagcaggtctcgaacccctcgggaaccttcgccgacggctcgggtctcggtgaggccaagttcaatgtcgacctcactgccttcactggcgaatggggtcgccctcagagggacggcccgcccctgcgcgccatcgctctcatccagtacgccaagtggctgatcgccaacggctacaagagcacggccaagagcgtcgtctggcccgtcgtcaagaacgatctcgcctacacggcccagtactggaacgagaccggcttcgatctctgggaggaggtccccggcagctcgttctttaccatcgccagctctcacaggggtgagtcatttattgttcagtgttttctcattgaataattaccggaatgccactgacgccaaacagctctgactgagggtgcttacctcgccgctcagctcgacaccgagtgcccgccctgcacgaccgtcgcccctcaggttctgtgcttccagcaggccttctggaactccaagggcaactatgtcgtctcaacagtaagatccctacaccaacaaaaaaaatcgaaaaggaacgttagctgacccttctagtcaacggcgggcgagtatcgctccggcaaggacgccaactcgatcctggcgtccatccacaacttcgaccctgaggccggctgcgacaacctgaccttccagccctgcagcgagcgcgccctggccaaccacaaggcctatgtcgactcgttccgcaacctctacgccatcaacaagggcatcgcccagggcaaggccgttgccgtcggccgctactcggaggatgtctactacaacggcaacccgtggtacctggccaactttgccgccgccgagcagctctacgacgccatctacgtgtggaacaagcagggctccatcaccgtgacctcggtctccctgcccttcttccgcgaccttgtctcgtcggtcagcaccggcacctactccaagagcagctcgaccttcaccaacatcgtcaacgccgtcaaggcctacgccgacggcttcatcgaggtggcggccaagtacaccccgtccaacggcgcgctcgccgagcagtacgaccgcaacacgggcaagcccgactcggccgccgacctgacgtggtcgtactcggccttcctctcggccatcgaccgccgcgcgggtctcgtccccccgagctggcgggccagcgtggccaagagccagctgccgtccacctgctcgcgcatcgaggtcgccggcacctacgtcgccgccacgagcacctcgttcccgtccaagcagaccccgaacccctccgcggcgccctccccgtccccctacccgaccgcctgcgcggacgctagcgaggtgtacgtcaccttcaacgagcgcgtgtcgaccgcgtggggcgagaccatcaaggtggtgggcaacgtgccggcgctggggaactgggacacgtccaaggcggtgaccctgtcggccagcgggtacaagtcgaatgatcccctctggagcatcacggtgcccatcaaggcgacgggctcggccgtgcagtacaagtatatcaaggtcggcaccaacgggaagattacttgggagtcggaccccaacaggagcattaccctgcagacggcgtcgtctgcgggcaagtgcgccgcgcagacggtgaatgattcgtggcgttaa
SEQ ID NO:7
An-GA protein sequence: MSFRSLLALSGLVCTGLANVISKRATWDSWLSNEATVARTAILNNIGADGAWVSGADSGIVVASPSTDNPDYFYTWTRDSGLVLKTLVDLFRNGDTSLLSTIENYISAQAIVQGISNPSGDLSSGAGLGEPKFNVDETAYTGSWGRPQRDGPALRATAMIGFGQWLLDNGYTSTATDIVWPLVRNDLSYVAQYWNQTGYDLWEVNGSSFFTIAVQHRALVEGSAFATAVGSSCSWCDSQAPEILCYLQSFWTGSFILANFDSSRSAKDANTLLLGSIHTFDPEAACDDSTFQPCSPRALANHKEVVDSFRSIYTLNDGLSDSEAVAVGRYPEDTYYNGNPWFLCTLAAAEQLYDALYQWDKQGSLEVTDVSLDFFKALYSDATGTYSSSSSTYSSIVDAVKTFADGFVSIVETHAASNGSMSEQYDKSDGEQLSARDLTWSYAALLTANNRRNVVPSASWGETSASSVPGTCAATSAIGTYSSVTVTSWPSIVATGGTTTTATPTGSGSVTSTSKTTATASKTSTSTSSTSCTTPTAVAVTFDLTATTTYGENIYLVGSISQLGDWETSDGIALSADKYTSSDPLWYVTVTLPAGESFEYKFIRIESDDSVEWESDPNREYTVPQACGTSTATVTDTWR
SEQ ID NO:8
An-GA nucleotide sequence:
atgtcgttccgatctctactcgccctgagcggcctcgtctgcacagggttggcaaatgtgatttccaagcgcgcgacctgggattcatggttgagcaacgaagcgaccgtggctcgtactgccatcctgaataacatcggggcggacggtgcttgggtgtcgggcgcggactctggcattgtcgttgctagtcccagcacggataacccggactacttctacacctggactcgcgactctggtctcgtcctcaagaccctcgtcgatctcttccgaaatggagataccagtctcctctccaccattgagaactacatctccgcccaggcaattgtccagggtatcagtaacccctctggtgatctgtccagcggcgctggtctcggtgaacccaagttcaatgtcgatgagactgcctacactggttcttggggacggccgcagcgagatggtccggctctgagagcaactgctatgatcggcttcgggcaatggctgcttgacaatggctacaccagcaccgcaacggacattgtttggcccctcgttaggaacgacctgtcgtatgtggctcaatactggaaccagacaggatatgatctctgggaagtcaatggctcgtctttctttacgattgctgtgcaacaccgcgcccttgtcgaaggtagtgccttcgcgacggccgtcggctcgtcctgctcctggtgtgattctcaggcacccgaaattctctgctacctgcagtccttctggaccggcagcttcattctggccaacttcgatagcagccgttccgccaaggacgcaaacaccctcctgctgggaagcatccacacctttgatcctgaggccgcatgcgacgactccaccttccagccctgctccccgcgcgcgctcgccaaccacaaggaggttgtagactctttccgctcaatctataccctcaacgatggtctcagtgacagcgaggctgttgcggtgggtcggtaccctgaggacacgtactacaacggcaacccgtggttcctgtgcaccttggctgccgcagagcagttgtacgatgctctataccagtgggacaagcaggggtcgttggaggtcacagatgtgtcgctggacttcttcaaggcactgtacagcgatgctactggcacctactcttcgtccagttcgacttatagtagcattgtagatgccgtgaagactttcgccgatggcttcgtctctattgtggaaactcacgccgcaagcaacggctccatgtccgagcaatacgacaagtctgatggcgagcagctttccgctcgcgacctgacctggtcttatgctgctctgctgaccgccaacaaccgtcgtaacgtcgtgccttccgcttcttggggcgagacctctgccagcagcgtgcccggcacctgtgcggccacatctgccattggtacctacagcagtgtgactgtcacctcgtggccgagtatcgtggctactggcggcaccactacgacggctacccccactggatccggcagcgtgacctcgaccagcaagaccaccgcgactgctagcaagaccagcaccagtacgtcatcaacctcctgtaccactcccaccgccgtggctgtgactttcgatctgacagctaccaccacctacggcgagaacatctacctggtcggatcgatctctcagctgggtgactgggaaaccagcgacggcatagctctgagtgctgacaagtacacttccagcgacccgctctggtatgtcactgtgactctgccggctggtgagtcgtttgagtacaagtttatccgcattgagagcgatgactccgtggagtgggagagtgatcccaaccgagaatacaccgttcctcaggcgtgcggaacgtcgaccgcgacggtgactgacacctggcggtag
SEQ ID NO:9
Tr-GA variant CS4 mature protein sequence:
SVDDFISTETPIALNNLLCNVGPDGCRAFGTSAGAVIASPSTIDPDYYYMWTRDSALVFKNLIDRFTETYDAGLQRRIEQYITAQVTLQGLSNPSGSLADGSGLGEPKFELTLKPFTGNWGRPQRDGPALRAIALIGYSKWLINNNYQSTVSNVIWPIVRNDLNYVAQYWNQTGFDLWEEVNGSSFFTVANQHRALVEGATLAATLGQSGSAYSSVAPQVLCFLQRFWVSSGGYVDSNINTNEGRTGKDVNSVLTSIHTFDPNLGCDAGTFQPCSDKALSNLKVVVDSFRSIYGVNKGIPAGAAVAIGRYAEDVYYNGNPWYLATFAAAEQLYDAIYVWKKTGSITVTATSLAFFQELVPGVTAGTYSSSSSTFTNIINAVSTYADGFLSEAAKYVPADGSLAEQFDRNSGTPLSAVHLTWSYASFLTAAARRAGIVPPSWANSSASTIPSTCSGASVVGSYSRPTATSFPPSQTPKPGVPSGTPYTPLPCATPTSVAVTFHELVSTQFGHTVKVAGNAAALGNWSTSAAVALDAVNYRDNHPLWIGTVNLEAGDVVEYKYIIVGQDGSVTWESDPNHTYTVPAVACVTQVVKEDTWQS
SEQ ID NO:10
Tr-GA mature protein sequence:
SVDDFISTETPIALNNLLCNVGPDGCRAFGTSAGAVIASPSTIDPDYYYMWTRDSALVFKNLIDRFTETYDAGLQRRIEQYITAQVTLQGLSNPSGSLADGSGLGEPKFELTLKPFTGNWGRPQRDGPALRAIALIGYSKWLINNNYQSTVSNVIWPIVRNDLNYVAQYWNQTGFDLWEEVNGSSFFTVANQHRALVEGATLAATLGQSGSAYSSVAPQVLCFLQRFWVSSGGYVDSNINTNEGRTGKDVNSVLTSIHTFDPNLGCDAGTFQPCSDKALSNLKVVVDSFRSIYGVNKGIPAGAAVAIGRYAEDVYYNGNPWYLATFAAAEQLYDAIYVWKKTGSITVTATSLAFFQELVPGVTAGTYSSSSSTFTNIINAVSTYADGFLSEAAKYVPADGSLAEQFDRNSGTPLSALHLTWSYASFLTATARRAGIVPPSWANSSASTIPSTCSGASVVGSYSRPTATSFPPSQTPKPGVPSGTPYTPLPCATPTSVAVTFHELVSTQFGQTVKVAGNAAALGNWSTSAAVALDAVNYADNHPLWIGTVNLEAGDVVEYKYINVGQDGSVTWESDPNHTYTVPAVACVTQVVKEDTWQS
SEQ ID NO:11
Tr-GA parent protein sequence (632 amino acids):
SEQ ID NO:12
Tr-GA nucleotide sequence (1899 bp):
sequence listing
<110> Danisco USA
<120> glucoamylase blends
<130>
<160> 12
<170> PatentIn version 3.5
<210> 1
<211> 582
<212> PRT
<213> Fusarium verticillium
<400> 1
Met Phe Thr Gln Ile Leu Tyr Gly Leu Thr Ala Leu Ser Ala Leu Gln
1 5 10 15
Gly Gln Val Thr Ala Ser Pro Gly Gly Ser Ser Leu Asp Arg Phe Ile
20 25 30
Ser Lys Glu Ala Asp Ile Ser Ile Lys Gly Val Leu Ala Asn Ile Gly
35 40 45
Ala Asp Gly Lys Arg Ala Gln Gly Ala Ala Pro Gly Ala Val Val Ala
50 55 60
Ser Pro Ser Arg Thr Asp Pro Asp Tyr Trp Tyr Thr Trp Thr Arg Asp
65 70 75 80
Ser Ala Leu Thr Tyr Lys Val Leu Val Glu Arg Phe Ile His Gly Asp
85 90 95
Lys Ser Leu Gln Arg Lys Ile Asp Glu Tyr Val Ser Ala Gln Ala Lys
100 105 110
Leu Gln Gly Val Thr Asn Pro Ser Gly Gly Pro Glu Ser Gly Gly Leu
115 120 125
Gly Glu Pro Lys Phe His Val Asn Leu Thr Ala Phe Thr Gly Ser Trp
130 135 140
Gly Arg Pro Gln Arg Asp Gly Pro Pro Leu Arg Ala Thr Ala Leu Thr
145 150 155 160
Leu Tyr Ala Asn Trp Leu Val Ser His Gly Asp Arg Ser Lys Ala Val
165 170 175
Asn Lys Val Trp Pro Val Ile Glu Lys Asp Leu Ala Tyr Thr Val Lys
180 185 190
Phe Trp Asn Arg Thr Gly Tyr Asp Leu Trp Glu Glu Val Asn Gly Ser
195 200 205
Ser Phe Phe Thr Leu Ser Ala Ser His Arg Ala Leu Val Glu Gly Ala
210 215 220
Ala Leu Ala Lys Lys Leu Gly Lys Ser Cys Ser Asp Cys Ala Thr Asn
225 230 235 240
Ala Pro Arg Val Leu Cys Phe Met Gln Ser Phe Trp Thr Gly Ser Tyr
245 250 255
Ile Asp Ser Asn Ile Asn Val Asn Asp Gly Arg Lys Gly Leu Asp Ala
260 265 270
Asn Ser Ile Leu Ser Ser Ile His Thr Phe Asp Pro Ser Ser Lys Cys
275 280 285
Thr Asp Ser Thr Phe Gln Pro Cys Ser Ser Arg Ala Leu Ala Asn His
290 295 300
Lys Glu Val Val Asp Ser Phe Arg Ser Ile Tyr Gly Val Asn Lys Asn
305 310 315 320
Arg Gly Lys Gly Lys Ala Ala Ala Val Gly Arg Tyr Ser Glu Asp Val
325 330 335
Tyr Tyr Asp Gly Asn Pro Trp Tyr Leu Ala Thr Leu Ala Ala Ala Glu
340 345 350
Gln Leu Tyr Ala Ala Val Tyr Gln Trp Asn Lys Ile Gly Ser Ile Thr
355 360 365
Val Asp Ser Val Ser Leu Pro Phe Phe Ser Asp Leu Val Pro Lys Val
370 375 380
Ser Lys Gly Thr Tyr Arg Lys Asn Ser Lys Thr Tyr Lys Ala Ile Ile
385 390 395 400
Lys Ala Val Thr Ser Tyr Ala Asp Gly Phe Val Ala Val Val Gln Thr
405 410 415
Tyr Thr Pro Lys Asp Gly Ser Leu Ala Glu Gln Phe Asp Lys Ser Thr
420 425 430
Gly Thr Pro Lys Ser Ala Val His Leu Thr Trp Ser Tyr Ala Ser Phe
435 440 445
Val Gly Ala Ala Glu Arg Arg Thr Gly Val Val Pro Pro Ala Trp Gly
450 455 460
Glu Ser Asn Ala Asn Lys Val Pro Ala Val Cys Glu Ala Ala Pro Ala
465 470 475 480
Cys Asp Thr Thr Ile Thr Phe Asn Val Lys Asn Val Asp Val Thr Ser
485 490 495
Asp Gln Lys Val Tyr Ile Val Gly Gly Ile Thr Gln Leu Ser Asn Trp
500 505 510
Ala Pro Ala Asp Gly Ile Ala Leu Glu Glu Ser Thr Ser Thr Lys Gly
515 520 525
Leu Trp Thr Val Lys Val Lys Ile Pro Ser Asp Thr Ser Phe Glu Tyr
530 535 540
Lys Tyr Ile Lys Lys Thr Ser Asp Gly Thr Val Thr Trp Glu Ser Asp
545 550 555 560
Pro Asn Asn Ser Ala Ala Thr Gly Ser Lys Cys Gly Ser Ser Ser Thr
565 570 575
Ile Asn Asp Glu Trp Arg
580
<210> 2
<211> 1802
<212> DNA
<213> Fusarium verticillium
<400> 2
atgtttactc agatcttgta tggcctcacg gccttatctg ccctccaagg tcaggtcact 60
gcatcaccag gcggttccag ccttgatcgc ttcatttcca aagaggccga catctccatc 120
aagggcgtgc ttgccaatat tggcgccgat ggcaagcgag cccaaggcgc tgcacctggc 180
gccgttgtag caagtccatc gagaacagat ccagactgta agtcaatatt cttgatagta 240
tcgcaaatgg atgactgaga cgttttctag actggtacac atggacccga gactccgcct 300
tgacatacaa agtccttgtt gagcgcttca ttcacggaga caagtctctc cagcgcaaga 360
tcgacgaata tgtctctgcc caagccaagc tccagggcgt caccaaccca tctggcggcc 420
ccgagtcagg cggccttggg gaacccaagt ttcacgtcaa cctcacagct ttcacaggat 480
cctggggtcg tcctcaacga gatggccctc ctctgagagc tactgcattg acgctgtacg 540
ccaactggct tgtttctcac ggcgaccgct ccaaggccgt caacaaggtt tggcctgtca 600
ttgagaagga tcttgcatac accgtcaagt tctggaacag aaccggttac gatctttggg 660
aggaggttaa cggatcttcg ttcttcaccc tctctgcttc acatcgtgct ctggttgagg 720
gagctgctct tgctaagaag cttggcaagt cttgctccga ctgcgcaacc aacgcccccc 780
gtgttctctg cttcatgcaa agcttctgga ccggcagcta catcgactcg aacatcaatg 840
tcaacgatgg ccgcaagggt cttgatgcca actccattct gtcttctatt cacacctttg 900
atccttcttc gaagtgcaca gactctacct tccagccttg ttcttcaagg gctcttgcga 960
accacaagga agtagtggac tctttccgct ccatctatgg tgtcaacaaa aacagaggta 1020
aaggtaaagc tgctgctgtc ggtcgataca gtgaggatgt gtactacgac ggtaaccctt 1080
ggtacttggc tactcttgct gctgccgagc aactgtacgc tgctgtctat caatggaaca 1140
agatcggttc catcacagtt gatagtgtgt cgctcccctt tttcagtgac cttgtaccaa 1200
aggtttccaa gggaacctat cgcaagaaca gcaagacata caaggctatt atcaaggctg 1260
tcacttcata cgctgatggc tttgtcgccg ttgtgcagac ctatactccc aaagatggct 1320
ccctcgcaga gcagttcgat aagtctactg gaactcccaa gtcagctgtt cacctaacct 1380
ggtcctacgc ctcctttgtc ggtgctgccg agcgtcgtac tggcgtcgtt cctccagctt 1440
ggggcgagag caacgccaac aaggtgcctg ctgtttgcga agcagctcca gcctgcgaca 1500
ccaccatcac gttcaatgtg aagaacgttg atgtcacgtc ggaccagaag gtttacattg 1560
ttggcgggat cactcaactt tccaactggg cccctgctga cggcattgcg cttgaggaat 1620
ccacgagcac caagggcttg tggactgtga aggtcaagat tccatctgat accagctttg 1680
agtataagta tataaagaag acgagtgatg gaactgttac atgggagagt gaccccaata 1740
acagtgcggc gacgggcagc aagtgcggaa gcagcagtac catcaacgat gagtggaggt 1800
aa 1802
<210> 3
<211> 631
<212> PRT
<213> Aspergillus fumigatus
<400> 3
Met Pro Arg Leu Ser Tyr Ala Leu Cys Ala Leu Ser Leu Gly His Ala
1 5 10 15
Ala Ile Ala Ala Pro Gln Leu Ser Ala Arg Ala Thr Gly Ser Leu Asp
20 25 30
Ser Trp Leu Gly Thr Glu Thr Thr Val Ala Leu Asn Gly Ile Leu Ala
35 40 45
Asn Ile Gly Ala Asp Gly Ala Tyr Ala Lys Ser Ala Lys Pro Gly Ile
50 55 60
Ile Ile Ala Ser Pro Ser Thr Ser Glu Pro Asp Tyr Tyr Tyr Thr Trp
65 70 75 80
Thr Arg Asp Ala Ala Leu Val Thr Lys Val Leu Val Asp Leu Phe Arg
85 90 95
Asn Gly Asn Leu Gly Leu Gln Lys Val Ile Thr Glu Tyr Val Asn Ser
100 105 110
Gln Ala Tyr Leu Gln Thr Val Ser Asn Pro Ser Gly Gly Leu Ala Ser
115 120 125
Gly Gly Leu Ala Glu Pro Lys Tyr Asn Val Asp Met Thr Ala Phe Thr
130 135 140
Gly Ala Trp Gly Arg Pro Gln Arg Asp Gly Pro Ala Leu Arg Ala Thr
145 150 155 160
Ala Leu Ile Asp Phe Gly Asn Trp Leu Ile Asp Asn Gly Tyr Ser Ser
165 170 175
Tyr Ala Val Asn Asn Ile Trp Pro Ile Val Arg Asn Asp Leu Ser Tyr
180 185 190
Val Ser Gln Tyr Trp Ser Gln Ser Gly Phe Asp Leu Trp Glu Glu Val
195 200 205
Asn Ser Met Ser Phe Phe Thr Val Ala Val Gln His Arg Ala Leu Val
210 215 220
Glu Gly Ser Thr Phe Ala Lys Arg Val Gly Ala Ser Cys Ser Trp Cys
225 230 235 240
Asp Ser Gln Ala Pro Gln Ile Leu Cys Tyr Met Gln Ser Phe Trp Thr
245 250 255
Gly Ser Tyr Ile Asn Ala Asn Thr Gly Gly Gly Arg Ser Gly Lys Asp
260 265 270
Ala Asn Thr Val Leu Ala Ser Ile His Thr Phe Asp Pro Glu Ala Gly
275 280 285
Cys Asp Asp Thr Thr Phe Gln Pro Cys Ser Pro Arg Ala Leu Ala Asn
290 295 300
His Lys Val Tyr Thr Asp Ser Phe Arg Ser Val Tyr Ala Ile Asn Ser
305 310 315 320
Gly Ile Pro Gln Gly Ala Ala Val Ser Ala Gly Arg Tyr Pro Glu Asp
325 330 335
Val Tyr Tyr Asn Gly Asn Pro Trp Phe Leu Thr Thr Leu Ala Ala Ala
340 345 350
Glu Gln Leu Tyr Asp Ala Ile Tyr Gln Trp Lys Lys Ile Gly Ser Ile
355 360 365
Ser Ile Thr Ser Thr Ser Leu Ala Phe Phe Lys Asp Ile Tyr Ser Ser
370 375 380
Ala Ala Val Gly Thr Tyr Ala Ser Ser Thr Ser Thr Phe Thr Asp Ile
385 390 395 400
Ile Asn Ala Val Lys Thr Tyr Ala Asp Gly Tyr Val Ser Ile Val Gln
405 410 415
Ala His Ala Met Asn Asn Gly Ser Leu Ser Glu Gln Phe Asp Lys Ser
420 425 430
Ser Gly Leu Ser Leu Ser Ala Arg Asp Leu Thr Trp Ser Tyr Ala Ala
435 440 445
Phe Leu Thr Ala Asn Met Arg Arg Asn Gly Val Val Pro Ala Pro Trp
450 455 460
Gly Ala Ala Ser Ala Asn Ser Val Pro Ser Ser Cys Ser Met Gly Ser
465 470 475 480
Ala Thr Gly Thr Tyr Ser Thr Ala Thr Ala Thr Ser Trp Pro Ser Thr
485 490 495
Leu Thr Ser Gly Ser Pro Gly Ser Thr Thr Thr Val Gly Thr Thr Thr
500 505 510
Ser Thr Thr Ser Gly Thr Ala Ala Glu Thr Ala Cys Ala Thr Pro Thr
515 520 525
Ala Val Ala Val Thr Phe Asn Glu Ile Ala Thr Thr Thr Tyr Gly Glu
530 535 540
Asn Val Tyr Ile Val Gly Ser Ile Ser Glu Leu Gly Asn Trp Asp Thr
545 550 555 560
Ser Lys Ala Val Ala Leu Ser Ala Ser Lys Tyr Thr Ser Ser Asn Asn
565 570 575
Leu Trp Tyr Val Ser Val Thr Leu Pro Ala Gly Thr Thr Phe Glu Tyr
580 585 590
Lys Tyr Ile Arg Lys Glu Ser Asp Gly Ser Ile Val Trp Glu Ser Asp
595 600 605
Pro Asn Arg Ser Tyr Thr Val Pro Ala Ala Cys Gly Val Ser Thr Ala
610 615 620
Thr Glu Asn Asp Thr Trp Gln
625 630
<210> 4
<211> 2060
<212> DNA
<213> Aspergillus fumigatus
<400> 4
atgcctcgcc tttcctacgc gctctgtgcg ctgtctctcg ggcatgctgc tattgcagct 60
cctcagttat ccgctcgtgc taccggcagc ttggactcct ggttgggtac tgagaccacc 120
gttgcgctca atggtattct ggccaacatc ggtgccgacg gtgcttatgc gaagagcgct 180
aagcctggca taatcattgc cagtccgagc accagcgaac cagactgtga gaaccttcct 240
gaactggccc tgtccggcag tcattgacct cggtagacta ctatacctgg acgagagatg 300
ctgctctcgt cacgaaagtc ctggtcgacc tcttccgcaa cggcaacctg ggtctgcaga 360
aagtcattac cgaatacgtc aactctcagg cgtacttgca gaccgtgtct aatccgtcgg 420
gtggtcttgc gagcggaggt ctcgcggagc ctaagtacaa cgtcgacatg acggccttta 480
ccggagcctg gggtcgtcct cagcgtgatg gtccggctct gcgggccacc gccctcatcg 540
actttggcaa ctggctgatt gtatgttctc catacgagcc ccaggaagcg ttgctgacgt 600
ctacaggaca acggctactc cagctatgct gtcaacaaca tctggcccat tgtgcgcaac 660
gacttgtcct acgtttctca gtactggagc cagagtggct ttggtgagtc ccgactctct 720
ggaagtttac aacgtgcatc gattactgac aattgagatt ctacgtgaca gatctctggg 780
aagaagtcaa ctccatgtcc ttcttcaccg tcgctgtcca gcaccgtgcc ctcgtggagg 840
gaagcacgtt cgctaaacgg gtgggagcgt cgtgctcgtg gtgtgactcg caggcccccc 900
agatcctctg ctacatgcag agtttctgga ctggctcgta tatcaacgcc aacaccggtg 960
gtggccggtc cggcaaggat gccaacaccg tcctcgccag catccatacc ttcgaccccg 1020
aagccggctg cgacgatact actttccagc cctgctctcc tcgggccctt gccaaccaca 1080
aggtgtacac cgattcgttc cgctcggtct acgcgatcaa ctccggcatc ccacagggcg 1140
ctgccgtttc cgctggccgc taccccgagg acgtctacta caacggcaac ccttggttcc 1200
tcaccaccct cgccgctgcc gagcagctct acgacgctat ctaccagtgg aagaagatcg 1260
gttccatcag catcaccagc acctccctcg ccttcttcaa ggacatctac agctccgccg 1320
cggtcggcac ctacgcctct agcacctcca ccttcacgga catcatcaac gcggtcaaga 1380
cctacgcaga cggctacgtg agcatcgtcc aggcacacgc catgaacaac ggctcccttt 1440
cggagcaatt cgacaagtcc tctgggctgt ccctctccgc ccgcgatctg acctggtcct 1500
acgccgcttt cctcaccgcc aacatgcgtc gtaacggcgt ggtgcctgcc ccctggggcg 1560
ccgcctccgc caactccgtc ccctcgtctt gctccatggg ctcggccacg ggcacctaca 1620
gcaccgcgac agccacctcc tggcccagca cgctgaccag cggctcgcca ggcagcacca 1680
ccaccgtggg caccacgacc agtaccacct ctggcaccgc cgccgagacc gcctgtgcga 1740
cccctaccgc cgtggccgtc acctttaacg agatcgccac caccacctac ggcgagaatg 1800
tttacattgt tgggtccatc tccgagctcg ggaactggga taccagcaaa gcagtggccc 1860
tgagtgcgtc caagtatacc tccagcaata acctctggta cgtgtccgtc accctgccgg 1920
ctggcacgac attcgagtac aagtatatcc gcaaggaaag cgatggctcg atcgtgtggg 1980
agagtgaccc caaccgctcg tatacggtgc cggcagcttg tggagtgtct actgcgaccg 2040
agaatgatac ttggcagtga 2060
<210> 5
<211> 634
<212> PRT
<213> Humicola grisea
<400> 5
Met His Thr Phe Ser Lys Leu Leu Val Leu Gly Ser Ala Val Gln Ser
1 5 10 15
Ala Leu Gly Arg Pro His Gly Ser Ser Arg Leu Gln Glu Arg Ala Ala
20 25 30
Val Asp Thr Phe Ile Asn Thr Glu Lys Pro Ile Ala Trp Asn Lys Leu
35 40 45
Leu Ala Asn Ile Gly Pro Asn Gly Lys Ala Ala Pro Gly Ala Ala Ala
50 55 60
Gly Val Val Ile Ala Ser Pro Ser Arg Thr Asp Pro Pro Tyr Phe Phe
65 70 75 80
Thr Trp Thr Pro Asp Ala Ala Leu Val Leu Thr Gly Ile Ile Glu Ser
85 90 95
Leu Gly His Asn Tyr Asn Thr Thr Leu Gln Thr Val Ile Gln Asn Tyr
100 105 110
Val Ala Ser Gln Ala Lys Leu Gln Gln Val Ser Asn Pro Ser Gly Thr
115 120 125
Phe Ala Asp Gly Ser Gly Leu Gly Glu Ala Lys Phe Asn Val Asp Leu
130 135 140
Thr Ala Phe Thr Gly Glu Trp Gly Arg Pro Gln Arg Asp Gly Pro Pro
145 150 155 160
Leu Arg Ala Ile Ala Leu Ile Gln Tyr Ala Lys Trp Leu Ile Ala Asn
165 170 175
Gly Tyr Lys Ser Thr Ala Lys Ser Val Val Trp Pro Val Val Lys Asn
180 185 190
Asp Leu Ala Tyr Thr Ala Gln Tyr Trp Asn Glu Thr Gly Phe Asp Leu
195 200 205
Trp Glu Glu Val Pro Gly Ser Ser Phe Phe Thr Ile Ala Ser Ser His
210 215 220
Arg Ala Leu Thr Glu Gly Ala Tyr Leu Ala Ala Gln Leu Asp Thr Glu
225 230 235 240
Cys Pro Pro Cys Thr Thr Val Ala Pro Gln Val Leu Cys Phe Gln Gln
245 250 255
Ala Phe Trp Asn Ser Lys Gly Asn Tyr Val Val Ser Thr Ser Thr Ala
260 265 270
Gly Glu Tyr Arg Ser Gly Lys Asp Ala Asn Ser Ile Leu Ala Ser Ile
275 280 285
His Asn Phe Asp Pro Glu Ala Gly Cys Asp Asn Leu Thr Phe Gln Pro
290 295 300
Cys Ser Glu Arg Ala Leu Ala Asn His Lys Ala Tyr Val Asp Ser Phe
305 310 315 320
Arg Asn Leu Tyr Ala Ile Asn Lys Gly Ile Ala Gln Gly Lys Ala Val
325 330 335
Ala Val Gly Arg Tyr Ser Glu Asp Val Tyr Tyr Asn Gly Asn Pro Trp
340 345 350
Tyr Leu Ala Asn Phe Ala Ala Ala Glu Gln Leu Tyr Asp Ala Ile Tyr
355 360 365
Val Trp Asn Lys Gln Gly Ser Ile Thr Val Thr Ser Val Ser Leu Pro
370 375 380
Phe Phe Arg Asp Leu Val Ser Ser Val Ser Thr Gly Thr Tyr Ser Lys
385 390 395 400
Ser Ser Ser Thr Phe Thr Asn Ile Val Asn Ala Val Lys Ala Tyr Ala
405 410 415
Asp Gly Phe Ile Glu Val Ala Ala Lys Tyr Thr Pro Ser Asn Gly Ala
420 425 430
Leu Ala Glu Gln Tyr Asp Arg Asn Thr Gly Lys Pro Asp Ser Ala Ala
435 440 445
Asp Leu Thr Trp Ser Tyr Ser Ala Phe Leu Ser Ala Ile Asp Arg Arg
450 455 460
Ala Gly Leu Val Pro Pro Ser Trp Arg Ala Ser Val Ala Lys Ser Gln
465 470 475 480
Leu Pro Ser Thr Cys Ser Arg Ile Glu Val Ala Gly Thr Tyr Val Ala
485 490 495
Ala Thr Ser Thr Ser Phe Pro Ser Lys Gln Thr Pro Asn Pro Ser Ala
500 505 510
Ala Pro Ser Pro Ser Pro Tyr Pro Thr Ala Cys Ala Asp Ala Ser Glu
515 520 525
Val Tyr Val Thr Phe Asn Glu Arg Val Ser Thr Ala Trp Gly Glu Thr
530 535 540
Ile Lys Val Val Gly Asn Val Pro Ala Leu Gly Asn Trp Asp Thr Ser
545 550 555 560
Lys Ala Val Thr Leu Ser Ala Ser Gly Tyr Lys Ser Asn Asp Pro Leu
565 570 575
Trp Ser Ile Thr Val Pro Ile Lys Ala Thr Gly Ser Ala Val Gln Tyr
580 585 590
Lys Tyr Ile Lys Val Gly Thr Asn Gly Lys Ile Thr Trp Glu Ser Asp
595 600 605
Pro Asn Arg Ser Ile Thr Leu Gln Thr Ala Ser Ser Ala Gly Lys Cys
610 615 620
Ala Ala Gln Thr Val Asn Asp Ser Trp Arg
625 630
<210> 6
<211> 2102
<212> DNA
<213> Humicola grisea
<400> 6
atgcatacct tctccaagct cctcgtcctg ggctctgccg tccagtctgc cctcgggcgg 60
cctcacggct cttcgcgtct ccaggaacgc gctgccgttg ataccttcat caacaccgag 120
aagcccatcg catggaacaa gctgctcgcc aacatcggcc ctaacggcaa agccgctccc 180
ggtgccgccg ccggcgttgt gattgccagc ccttccagga cggaccctcc ttgtacgtgg 240
tggcatggaa tggacccaag agactgtttt agatgaaaga gagtttctgc taaccgccac 300
acccagactt cttcacctgg accccggatg ccgccctggt cctcaccggc atcatcgagt 360
cccttggcca caactacaac accaccctgc agaccgtcat ccagaactac gtcgcgtcgc 420
aggccaagct gcagcaggtc tcgaacccct cgggaacctt cgccgacggc tcgggtctcg 480
gtgaggccaa gttcaatgtc gacctcactg ccttcactgg cgaatggggt cgccctcaga 540
gggacggccc gcccctgcgc gccatcgctc tcatccagta cgccaagtgg ctgatcgcca 600
acggctacaa gagcacggcc aagagcgtcg tctggcccgt cgtcaagaac gatctcgcct 660
acacggccca gtactggaac gagaccggct tcgatctctg ggaggaggtc cccggcagct 720
cgttctttac catcgccagc tctcacaggg gtgagtcatt tattgttcag tgttttctca 780
ttgaataatt accggaatgc cactgacgcc aaacagctct gactgagggt gcttacctcg 840
ccgctcagct cgacaccgag tgcccgccct gcacgaccgt cgcccctcag gttctgtgct 900
tccagcaggc cttctggaac tccaagggca actatgtcgt ctcaacagta agatccctac 960
accaacaaaa aaaatcgaaa aggaacgtta gctgaccctt ctagtcaacg gcgggcgagt 1020
atcgctccgg caaggacgcc aactcgatcc tggcgtccat ccacaacttc gaccctgagg 1080
ccggctgcga caacctgacc ttccagccct gcagcgagcg cgccctggcc aaccacaagg 1140
cctatgtcga ctcgttccgc aacctctacg ccatcaacaa gggcatcgcc cagggcaagg 1200
ccgttgccgt cggccgctac tcggaggatg tctactacaa cggcaacccg tggtacctgg 1260
ccaactttgc cgccgccgag cagctctacg acgccatcta cgtgtggaac aagcagggct 1320
ccatcaccgt gacctcggtc tccctgccct tcttccgcga ccttgtctcg tcggtcagca 1380
ccggcaccta ctccaagagc agctcgacct tcaccaacat cgtcaacgcc gtcaaggcct 1440
acgccgacgg cttcatcgag gtggcggcca agtacacccc gtccaacggc gcgctcgccg 1500
agcagtacga ccgcaacacg ggcaagcccg actcggccgc cgacctgacg tggtcgtact 1560
cggccttcct ctcggccatc gaccgccgcg cgggtctcgt ccccccgagc tggcgggcca 1620
gcgtggccaa gagccagctg ccgtccacct gctcgcgcat cgaggtcgcc ggcacctacg 1680
tcgccgccac gagcacctcg ttcccgtcca agcagacccc gaacccctcc gcggcgccct 1740
ccccgtcccc ctacccgacc gcctgcgcgg acgctagcga ggtgtacgtc accttcaacg 1800
agcgcgtgtc gaccgcgtgg ggcgagacca tcaaggtggt gggcaacgtg ccggcgctgg 1860
ggaactggga cacgtccaag gcggtgaccc tgtcggccag cgggtacaag tcgaatgatc 1920
ccctctggag catcacggtg cccatcaagg cgacgggctc ggccgtgcag tacaagtata 1980
tcaaggtcgg caccaacggg aagattactt gggagtcgga ccccaacagg agcattaccc 2040
tgcagacggc gtcgtctgcg ggcaagtgcg ccgcgcagac ggtgaatgat tcgtggcgtt 2100
aa 2102
<210> 7
<211> 639
<212> PRT
<213> Aspergillus niger
<400> 7
Met Ser Phe Arg Ser Leu Leu Ala Leu Ser Gly Leu Val Cys Thr Gly
1 5 10 15
Leu Ala Asn Val Ile Ser Lys Arg Ala Thr Trp Asp Ser Trp Leu Ser
20 25 30
Asn Glu Ala Thr Val Ala Arg Thr Ala Ile Leu Asn Asn Ile Gly Ala
35 40 45
Asp Gly Ala Trp Val Ser Gly Ala Asp Ser Gly Ile Val Val Ala Ser
50 55 60
Pro Ser Thr Asp Asn Pro Asp Tyr Phe Tyr Thr Trp Thr Arg Asp Ser
65 70 75 80
Gly Leu Val Leu Lys Thr Leu Val Asp Leu Phe Arg Asn Gly Asp Thr
85 90 95
Ser Leu Leu Ser Thr Ile Glu Asn Tyr Ile Ser Ala Gln Ala Ile Val
100 105 110
Gln Gly Ile Ser Asn Pro Ser Gly Asp Leu Ser Ser Gly Ala Gly Leu
115 120 125
Gly Glu Pro Lys Phe Asn Val Asp Glu Thr Ala Tyr Thr Gly Ser Trp
130 135 140
Gly Arg Pro Gln Arg Asp Gly Pro Ala Leu Arg Ala Thr Ala Met Ile
145 150 155 160
Gly Phe Gly Gln Trp Leu Leu Asp Asn Gly Tyr Thr Ser Thr Ala Thr
165 170 175
Asp Ile Val Trp Pro Leu Val Arg Asn Asp Leu Ser Tyr Val Ala Gln
180 185 190
Tyr Trp Asn Gln Thr Gly Tyr Asp Leu Trp Glu Val Asn Gly Ser Ser
195 200 205
Phe Phe Thr Ile Ala Val Gln His Arg Ala Leu Val Glu Gly Ser Ala
210 215 220
Phe Ala Thr Ala Val Gly Ser Ser Cys Ser Trp Cys Asp Ser Gln Ala
225 230 235 240
Pro Glu Ile Leu Cys Tyr Leu Gln Ser Phe Trp Thr Gly Ser Phe Ile
245 250 255
Leu Ala Asn Phe Asp Ser Ser Arg Ser Ala Lys Asp Ala Asn Thr Leu
260 265 270
Leu Leu Gly Ser Ile His Thr Phe Asp Pro Glu Ala Ala Cys Asp Asp
275 280 285
Ser Thr Phe Gln Pro Cys Ser Pro Arg Ala Leu Ala Asn His Lys Glu
290 295 300
Val Val Asp Ser Phe Arg Ser Ile Tyr Thr Leu Asn Asp Gly Leu Ser
305 310 315 320
Asp Ser Glu Ala Val Ala Val Gly Arg Tyr Pro Glu Asp Thr Tyr Tyr
325 330 335
Asn Gly Asn Pro Trp Phe Leu Cys Thr Leu Ala Ala Ala Glu Gln Leu
340 345 350
Tyr Asp Ala Leu Tyr Gln Trp Asp Lys Gln Gly Ser Leu Glu Val Thr
355 360 365
Asp Val Ser Leu Asp Phe Phe Lys Ala Leu Tyr Ser Asp Ala Thr Gly
370 375 380
Thr Tyr Ser Ser Ser Ser Ser Thr Tyr Ser Ser Ile Val Asp Ala Val
385 390 395 400
Lys Thr Phe Ala Asp Gly Phe Val Ser Ile Val Glu Thr His Ala Ala
405 410 415
Ser Asn Gly Ser Met Ser Glu Gln Tyr Asp Lys Ser Asp Gly Glu Gln
420 425 430
Leu Ser Ala Arg Asp Leu Thr Trp Ser Tyr Ala Ala Leu Leu Thr Ala
435 440 445
Asn Asn Arg Arg Asn Val Val Pro Ser Ala Ser Trp Gly Glu Thr Ser
450 455 460
Ala Ser Ser Val Pro Gly Thr Cys Ala Ala Thr Ser Ala Ile Gly Thr
465 470 475 480
Tyr Ser Ser Val Thr Val Thr Ser Trp Pro Ser Ile Val Ala Thr Gly
485 490 495
Gly Thr Thr Thr Thr Ala Thr Pro Thr Gly Ser Gly Ser Val Thr Ser
500 505 510
Thr Ser Lys Thr Thr Ala Thr Ala Ser Lys Thr Ser Thr Ser Thr Ser
515 520 525
Ser Thr Ser Cys Thr Thr Pro Thr Ala Val Ala Val Thr Phe Asp Leu
530 535 540
Thr Ala Thr Thr Thr Tyr Gly Glu Asn Ile Tyr Leu Val Gly Ser Ile
545 550 555 560
Ser Gln Leu Gly Asp Trp Glu Thr Ser Asp Gly Ile Ala Leu Ser Ala
565 570 575
Asp Lys Tyr Thr Ser Ser Asp Pro Leu Trp Tyr Val Thr Val Thr Leu
580 585 590
Pro Ala Gly Glu Ser Phe Glu Tyr Lys Phe Ile Arg Ile Glu Ser Asp
595 600 605
Asp Ser Val Glu Trp Glu Ser Asp Pro Asn Arg Glu Tyr Thr Val Pro
610 615 620
Gln Ala Cys Gly Thr Ser Thr Ala Thr Val Thr Asp Thr Trp Arg
625 630 635
<210> 8
<211> 1920
<212> DNA
<213> Aspergillus niger
<400> 8
atgtcgttcc gatctctact cgccctgagc ggcctcgtct gcacagggtt ggcaaatgtg 60
atttccaagc gcgcgacctg ggattcatgg ttgagcaacg aagcgaccgt ggctcgtact 120
gccatcctga ataacatcgg ggcggacggt gcttgggtgt cgggcgcgga ctctggcatt 180
gtcgttgcta gtcccagcac ggataacccg gactacttct acacctggac tcgcgactct 240
ggtctcgtcc tcaagaccct cgtcgatctc ttccgaaatg gagataccag tctcctctcc 300
accattgaga actacatctc cgcccaggca attgtccagg gtatcagtaa cccctctggt 360
gatctgtcca gcggcgctgg tctcggtgaa cccaagttca atgtcgatga gactgcctac 420
actggttctt ggggacggcc gcagcgagat ggtccggctc tgagagcaac tgctatgatc 480
ggcttcgggc aatggctgct tgacaatggc tacaccagca ccgcaacgga cattgtttgg 540
cccctcgtta ggaacgacct gtcgtatgtg gctcaatact ggaaccagac aggatatgat 600
ctctgggaag tcaatggctc gtctttcttt acgattgctg tgcaacaccg cgcccttgtc 660
gaaggtagtg ccttcgcgac ggccgtcggc tcgtcctgct cctggtgtga ttctcaggca 720
cccgaaattc tctgctacct gcagtccttc tggaccggca gcttcattct ggccaacttc 780
gatagcagcc gttccgccaa ggacgcaaac accctcctgc tgggaagcat ccacaccttt 840
gatcctgagg ccgcatgcga cgactccacc ttccagccct gctccccgcg cgcgctcgcc 900
aaccacaagg aggttgtaga ctctttccgc tcaatctata ccctcaacga tggtctcagt 960
gacagcgagg ctgttgcggt gggtcggtac cctgaggaca cgtactacaa cggcaacccg 1020
tggttcctgt gcaccttggc tgccgcagag cagttgtacg atgctctata ccagtgggac 1080
aagcaggggt cgttggaggt cacagatgtg tcgctggact tcttcaaggc actgtacagc 1140
gatgctactg gcacctactc ttcgtccagt tcgacttata gtagcattgt agatgccgtg 1200
aagactttcg ccgatggctt cgtctctatt gtggaaactc acgccgcaag caacggctcc 1260
atgtccgagc aatacgacaa gtctgatggc gagcagcttt ccgctcgcga cctgacctgg 1320
tcttatgctg ctctgctgac cgccaacaac cgtcgtaacg tcgtgccttc cgcttcttgg 1380
ggcgagacct ctgccagcag cgtgcccggc acctgtgcgg ccacatctgc cattggtacc 1440
tacagcagtg tgactgtcac ctcgtggccg agtatcgtgg ctactggcgg caccactacg 1500
acggctaccc ccactggatc cggcagcgtg acctcgacca gcaagaccac cgcgactgct 1560
agcaagacca gcaccagtac gtcatcaacc tcctgtacca ctcccaccgc cgtggctgtg 1620
actttcgatc tgacagctac caccacctac ggcgagaaca tctacctggt cggatcgatc 1680
tctcagctgg gtgactggga aaccagcgac ggcatagctc tgagtgctga caagtacact 1740
tccagcgacc cgctctggta tgtcactgtg actctgccgg ctggtgagtc gtttgagtac 1800
aagtttatcc gcattgagag cgatgactcc gtggagtggg agagtgatcc caaccgagaa 1860
tacaccgttc ctcaggcgtg cggaacgtcg accgcgacgg tgactgacac ctggcggtag 1920
<210> 9
<211> 599
<212> PRT
<213> Trichoderma reesei
<400> 9
Ser Val Asp Asp Phe Ile Ser Thr Glu Thr Pro Ile Ala Leu Asn Asn
1 5 10 15
Leu Leu Cys Asn Val Gly Pro Asp Gly Cys Arg Ala Phe Gly Thr Ser
20 25 30
Ala Gly Ala Val Ile Ala Ser Pro Ser Thr Ile Asp Pro Asp Tyr Tyr
35 40 45
Tyr Met Trp Thr Arg Asp Ser Ala Leu Val Phe Lys Asn Leu Ile Asp
50 55 60
Arg Phe Thr Glu Thr Tyr Asp Ala Gly Leu Gln Arg Arg Ile Glu Gln
65 70 75 80
Tyr Ile Thr Ala Gln Val Thr Leu Gln Gly Leu Ser Asn Pro Ser Gly
85 90 95
Ser Leu Ala Asp Gly Ser Gly Leu Gly Glu Pro Lys Phe Glu Leu Thr
100 105 110
Leu Lys Pro Phe Thr Gly Asn Trp Gly Arg Pro Gln Arg Asp Gly Pro
115 120 125
Ala Leu Arg Ala Ile Ala Leu Ile Gly Tyr Ser Lys Trp Leu Ile Asn
130 135 140
Asn Asn Tyr Gln Ser Thr Val Ser Asn Val Ile Trp Pro Ile Val Arg
145 150 155 160
Asn Asp Leu Asn Tyr Val Ala Gln Tyr Trp Asn Gln Thr Gly Phe Asp
165 170 175
Leu Trp Glu Glu Val Asn Gly Ser Ser Phe Phe Thr Val Ala Asn Gln
180 185 190
His Arg Ala Leu Val Glu Gly Ala Thr Leu Ala Ala Thr Leu Gly Gln
195 200 205
Ser Gly Ser Ala Tyr Ser Ser Val Ala Pro Gln Val Leu Cys Phe Leu
210 215 220
Gln Arg Phe Trp Val Ser Ser Gly Gly Tyr Val Asp Ser Asn Ile Asn
225 230 235 240
Thr Asn Glu Gly Arg Thr Gly Lys Asp Val Asn Ser Val Leu Thr Ser
245 250 255
Ile His Thr Phe Asp Pro Asn Leu Gly Cys Asp Ala Gly Thr Phe Gln
260 265 270
Pro Cys Ser Asp Lys Ala Leu Ser Asn Leu Lys Val Val Val Asp Ser
275 280 285
Phe Arg Ser Ile Tyr Gly Val Asn Lys Gly Ile Pro Ala Gly Ala Ala
290 295 300
Val Ala Ile Gly Arg Tyr Ala Glu Asp Val Tyr Tyr Asn Gly Asn Pro
305 310 315 320
Trp Tyr Leu Ala Thr Phe Ala Ala Ala Glu Gln Leu Tyr Asp Ala Ile
325 330 335
Tyr Val Trp Lys Lys Thr Gly Ser Ile Thr Val Thr Ala Thr Ser Leu
340 345 350
Ala Phe Phe Gln Glu Leu Val Pro Gly Val Thr Ala Gly Thr Tyr Ser
355 360 365
Ser Ser Ser Ser Thr Phe Thr Asn Ile Ile Asn Ala Val Ser Thr Tyr
370 375 380
Ala Asp Gly Phe Leu Ser Glu Ala Ala Lys Tyr Val Pro Ala Asp Gly
385 390 395 400
Ser Leu Ala Glu Gln Phe Asp Arg Asn Ser Gly Thr Pro Leu Ser Ala
405 410 415
Val His Leu Thr Trp Ser Tyr Ala Ser Phe Leu Thr Ala Ala Ala Arg
420 425 430
Arg Ala Gly Ile Val Pro Pro Ser Trp Ala Asn Ser Ser Ala Ser Thr
435 440 445
Ile Pro Ser Thr Cys Ser Gly Ala Ser Val Val Gly Ser Tyr Ser Arg
450 455 460
Pro Thr Ala Thr Ser Phe Pro Pro Ser Gln Thr Pro Lys Pro Gly Val
465 470 475 480
Pro Ser Gly Thr Pro Tyr Thr Pro Leu Pro Cys Ala Thr Pro Thr Ser
485 490 495
Val Ala Val Thr Phe His Glu Leu Val Ser Thr Gln Phe Gly His Thr
500 505 510
Val Lys Val Ala Gly Asn Ala Ala Ala Leu Gly Asn Trp Ser Thr Ser
515 520 525
Ala Ala Val Ala Leu Asp Ala Val Asn Tyr Arg Asp Asn His Pro Leu
530 535 540
Trp Ile Gly Thr Val Asn Leu Glu Ala Gly Asp Val Val Glu Tyr Lys
545 550 555 560
Tyr Ile Ile Val Gly Gln Asp Gly Ser Val Thr Trp Glu Ser Asp Pro
565 570 575
Asn His Thr Tyr Thr Val Pro Ala Val Ala Cys Val Thr Gln Val Val
580 585 590
Lys Glu Asp Thr Trp Gln Ser
595
<210> 10
<211> 599
<212> PRT
<213> Trichoderma reesei
<400> 10
Ser Val Asp Asp Phe Ile Ser Thr Glu Thr Pro Ile Ala Leu Asn Asn
1 5 10 15
Leu Leu Cys Asn Val Gly Pro Asp Gly Cys Arg Ala Phe Gly Thr Ser
20 25 30
Ala Gly Ala Val Ile Ala Ser Pro Ser Thr Ile Asp Pro Asp Tyr Tyr
35 40 45
Tyr Met Trp Thr Arg Asp Ser Ala Leu Val Phe Lys Asn Leu Ile Asp
50 55 60
Arg Phe Thr Glu Thr Tyr Asp Ala Gly Leu Gln Arg Arg Ile Glu Gln
65 70 75 80
Tyr Ile Thr Ala Gln Val Thr Leu Gln Gly Leu Ser Asn Pro Ser Gly
85 90 95
Ser Leu Ala Asp Gly Ser Gly Leu Gly Glu Pro Lys Phe Glu Leu Thr
100 105 110
Leu Lys Pro Phe Thr Gly Asn Trp Gly Arg Pro Gln Arg Asp Gly Pro
115 120 125
Ala Leu Arg Ala Ile Ala Leu Ile Gly Tyr Ser Lys Trp Leu Ile Asn
130 135 140
Asn Asn Tyr Gln Ser Thr Val Ser Asn Val Ile Trp Pro Ile Val Arg
145 150 155 160
Asn Asp Leu Asn Tyr Val Ala Gln Tyr Trp Asn Gln Thr Gly Phe Asp
165 170 175
Leu Trp Glu Glu Val Asn Gly Ser Ser Phe Phe Thr Val Ala Asn Gln
180 185 190
His Arg Ala Leu Val Glu Gly Ala Thr Leu Ala Ala Thr Leu Gly Gln
195 200 205
Ser Gly Ser Ala Tyr Ser Ser Val Ala Pro Gln Val Leu Cys Phe Leu
210 215 220
Gln Arg Phe Trp Val Ser Ser Gly Gly Tyr Val Asp Ser Asn Ile Asn
225 230 235 240
Thr Asn Glu Gly Arg Thr Gly Lys Asp Val Asn Ser Val Leu Thr Ser
245 250 255
Ile His Thr Phe Asp Pro Asn Leu Gly Cys Asp Ala Gly Thr Phe Gln
260 265 270
Pro Cys Ser Asp Lys Ala Leu Ser Asn Leu Lys Val Val Val Asp Ser
275 280 285
Phe Arg Ser Ile Tyr Gly Val Asn Lys Gly Ile Pro Ala Gly Ala Ala
290 295 300
Val Ala Ile Gly Arg Tyr Ala Glu Asp Val Tyr Tyr Asn Gly Asn Pro
305 310 315 320
Trp Tyr Leu Ala Thr Phe Ala Ala Ala Glu Gln Leu Tyr Asp Ala Ile
325 330 335
Tyr Val Trp Lys Lys Thr Gly Ser Ile Thr Val Thr Ala Thr Ser Leu
340 345 350
Ala Phe Phe Gln Glu Leu Val Pro Gly Val Thr Ala Gly Thr Tyr Ser
355 360 365
Ser Ser Ser Ser Thr Phe Thr Asn Ile Ile Asn Ala Val Ser Thr Tyr
370 375 380
Ala Asp Gly Phe Leu Ser Glu Ala Ala Lys Tyr Val Pro Ala Asp Gly
385 390 395 400
Ser Leu Ala Glu Gln Phe Asp Arg Asn Ser Gly Thr Pro Leu Ser Ala
405 410 415
Leu His Leu Thr Trp Ser Tyr Ala Ser Phe Leu Thr Ala Thr Ala Arg
420 425 430
Arg Ala Gly Ile Val Pro Pro Ser Trp Ala Asn Ser Ser Ala Ser Thr
435 440 445
Ile Pro Ser Thr Cys Ser Gly Ala Ser Val Val Gly Ser Tyr Ser Arg
450 455 460
Pro Thr Ala Thr Ser Phe Pro Pro Ser Gln Thr Pro Lys Pro Gly Val
465 470 475 480
Pro Ser Gly Thr Pro Tyr Thr Pro Leu Pro Cys Ala Thr Pro Thr Ser
485 490 495
Val Ala Val Thr Phe His Glu Leu Val Ser Thr Gln Phe Gly Gln Thr
500 505 510
Val Lys Val Ala Gly Asn Ala Ala Ala Leu Gly Asn Trp Ser Thr Ser
515 520 525
Ala Ala Val Ala Leu Asp Ala Val Asn Tyr Ala Asp Asn His Pro Leu
530 535 540
Trp Ile Gly Thr Val Asn Leu Glu Ala Gly Asp Val Val Glu Tyr Lys
545 550 555 560
Tyr Ile Asn Val Gly Gln Asp Gly Ser Val Thr Trp Glu Ser Asp Pro
565 570 575
Asn His Thr Tyr Thr Val Pro Ala Val Ala Cys Val Thr Gln Val Val
580 585 590
Lys Glu Asp Thr Trp Gln Ser
595
<210> 11
<211> 632
<212> PRT
<213> Trichoderma reesei
<400> 11
Met His Val Leu Ser Thr Ala Val Leu Leu Gly Ser Val Ala Val Gln
1 5 10 15
Lys Val Leu Gly Arg Pro Gly Ser Ser Gly Leu Ser Asp Val Thr Lys
20 25 30
Arg Ser Val Asp Asp Phe Ile Ser Thr Glu Thr Pro Ile Ala Leu Asn
35 40 45
Asn Leu Leu Cys Asn Val Gly Pro Asp Gly Cys Arg Ala Phe Gly Thr
50 55 60
Ser Ala Gly Ala Val Ile Ala Ser Pro Ser Thr Ile Asp Pro Asp Tyr
65 70 75 80
Tyr Tyr Met Trp Thr Arg Asp Ser Ala Leu Val Phe Lys Asn Leu Ile
85 90 95
Asp Arg Phe Thr Glu Thr Tyr Asp Ala Gly Leu Gln Arg Arg Ile Glu
100 105 110
Gln Tyr Ile Thr Ala Gln Val Thr Leu Gln Gly Leu Ser Asn Pro Ser
115 120 125
Gly Ser Leu Ala Asp Gly Ser Gly Leu Gly Glu Pro Lys Phe Glu Leu
130 135 140
Thr Leu Lys Pro Phe Thr Gly Asn Trp Gly Arg Pro Gln Arg Asp Gly
145 150 155 160
Pro Ala Leu Arg Ala Ile Ala Leu Ile Gly Tyr Ser Lys Trp Leu Ile
165 170 175
Asn Asn Asn Tyr Gln Ser Thr Val Ser Asn Val Ile Trp Pro Ile Val
180 185 190
Arg Asn Asp Leu Asn Tyr Val Ala Gln Tyr Trp Asn Gln Thr Gly Phe
195 200 205
Asp Leu Trp Glu Glu Val Asn Gly Ser Ser Phe Phe Thr Val Ala Asn
210 215 220
Gln His Arg Ala Leu Val Glu Gly Ala Thr Leu Ala Ala Thr Leu Gly
225 230 235 240
Gln Ser Gly Ser Ala Tyr Ser Ser Val Ala Pro Gln Val Leu Cys Phe
245 250 255
Leu Gln Arg Phe Trp Val Ser Ser Gly Gly Tyr Val Asp Ser Asn Ile
260 265 270
Asn Thr Asn Glu Gly Arg Thr Gly Lys Asp Val Asn Ser Val Leu Thr
275 280 285
Ser Ile His Thr Phe Asp Pro Asn Leu Gly Cys Asp Ala Gly Thr Phe
290 295 300
Gln Pro Cys Ser Asp Lys Ala Leu Ser Asn Leu Lys Val Val Val Asp
305 310 315 320
Ser Phe Arg Ser Ile Tyr Gly Val Asn Lys Gly Ile Pro Ala Gly Ala
325 330 335
Ala Val Ala Ile Gly Arg Tyr Ala Glu Asp Val Tyr Tyr Asn Gly Asn
340 345 350
Pro Trp Tyr Leu Ala Thr Phe Ala Ala Ala Glu Gln Leu Tyr Asp Ala
355 360 365
Ile Tyr Val Trp Lys Lys Thr Gly Ser Ile Thr Val Thr Ala Thr Ser
370 375 380
Leu Ala Phe Phe Gln Glu Leu Val Pro Gly Val Thr Ala Gly Thr Tyr
385 390 395 400
Ser Ser Ser Ser Ser Thr Phe Thr Asn Ile Ile Asn Ala Val Ser Thr
405 410 415
Tyr Ala Asp Gly Phe Leu Ser Glu Ala Ala Lys Tyr Val Pro Ala Asp
420 425 430
Gly Ser Leu Ala Glu Gln Phe Asp Arg Asn Ser Gly Thr Pro Leu Ser
435 440 445
Ala Leu His Leu Thr Trp Ser Tyr Ala Ser Phe Leu Thr Ala Thr Ala
450 455 460
Arg Arg Ala Gly Ile Val Pro Pro Ser Trp Ala Asn Ser Ser Ala Ser
465 470 475 480
Thr Ile Pro Ser Thr Cys Ser Gly Ala Ser Val Val Gly Ser Tyr Ser
485 490 495
Arg Pro Thr Ala Thr Ser Phe Pro Pro Ser Gln Thr Pro Lys Pro Gly
500 505 510
Val Pro Ser Gly Thr Pro Tyr Thr Pro Leu Pro Cys Ala Thr Pro Thr
515 520 525
Ser Val Ala Val Thr Phe His Glu Leu Val Ser Thr Gln Phe Gly Gln
530 535 540
Thr Val Lys Val Ala Gly Asn Ala Ala Ala Leu Gly Asn Trp Ser Thr
545 550 555 560
Ser Ala Ala Val Ala Leu Asp Ala Val Asn Tyr Ala Asp Asn His Pro
565 570 575
Leu Trp Ile Gly Thr Val Asn Leu Glu Ala Gly Asp Val Val Glu Tyr
580 585 590
Lys Tyr Ile Asn Val Gly Gln Asp Gly Ser Val Thr Trp Glu Ser Asp
595 600 605
Pro Asn His Thr Tyr Thr Val Pro Ala Val Ala Cys Val Thr Gln Val
610 615 620
Val Lys Glu Asp Thr Trp Gln Ser
625 630
<210> 12
<211> 1899
<212> DNA
<213> Trichoderma reesei
<400> 12
atgcacgtcc tgtcgactgc ggtgctgctc ggctccgttg ccgttcaaaa ggtcctggga 60
agaccaggat caagcggtct gtccgacgtc accaagaggt ctgttgacga cttcatcagc 120
accgagacgc ctattgcact gaacaatctt ctttgcaatg ttggtcctga tggatgccgt 180
gcattcggca catcagctgg tgcggtgatt gcatctccca gcacaattga cccggactac 240
tattacatgt ggacgcgaga tagcgctctt gtcttcaaga acctcatcga ccgcttcacc 300
gaaacgtacg atgcgggcct gcagcgccgc atcgagcagt acattactgc ccaggtcact 360
ctccagggcc tctctaaccc ctcgggctcc ctcgcggacg gctctggtct cggcgagccc 420
aagtttgagt tgaccctgaa gcctttcacc ggcaactggg gtcgaccgca gcgggatggc 480
ccagctctgc gagccattgc cttgattgga tactcaaagt ggctcatcaa caacaactat 540
cagtcgactg tgtccaacgt catctggcct attgtgcgca acgacctcaa ctatgttgcc 600
cagtactgga accaaaccgg ctttgacctc tgggaagaag tcaatgggag ctcattcttt 660
actgttgcca accagcaccg agcacttgtc gagggcgcca ctcttgctgc cactcttggc 720
cagtcgggaa gcgcttattc atctgttgct ccccaggttt tgtgctttct ccaacgattc 780
tgggtgtcgt ctggtggata cgtcgactcc aacatcaaca ccaacgaggg caggactggc 840
aaggatgtca actccgtcct gacttccatc cacaccttcg atcccaacct tggctgtgac 900
gcaggcacct tccagccatg cagtgacaaa gcgctctcca acctcaaggt tgttgtcgac 960
tccttccgct ccatctacgg cgtgaacaag ggcattcctg ccggtgctgc cgtcgccatt 1020
ggccggtatg cagaggatgt gtactacaac ggcaaccctt ggtatcttgc tacatttgct 1080
gctgccgagc agctgtacga tgccatctac gtctggaaga agacgggctc catcacggtg 1140
accgccacct ccctggcctt cttccaggag cttgttcctg gcgtgacggc cgggacctac 1200
tccagcagct cttcgacctt taccaacatc atcaacgccg tctcgacata cgccgatggc 1260
ttcctcagcg aggctgccaa gtacgtcccc gccgacggtt cgctggccga gcagtttgac 1320
cgcaacagcg gcactccgct gtctgcgctt cacctgacgt ggtcgtacgc ctcgttcttg 1380
acagccacgg cccgtcgggc tggcatcgtg cccccctcgt gggccaacag cagcgctagc 1440
acgatcccct cgacgtgctc cggcgcgtcc gtggtcggat cctactcgcg tcccaccgcc 1500
acgtcattcc ctccgtcgca gacgcccaag cctggcgtgc cttccggtac tccctacacg 1560
cccctgccct gcgcgacccc aacctccgtg gccgtcacct tccacgagct cgtgtcgaca 1620
cagtttggcc agacggtcaa ggtggcgggc aacgccgcgg ccctgggcaa ctggagcacg 1680
agcgccgccg tggctctgga cgccgtcaac tatgccgata accaccccct gtggattggg 1740
acggtcaacc tcgaggctgg agacgtcgtg gagtacaagt acatcaatgt gggccaagat 1800
ggctccgtga cctgggagag tgatcccaac cacacttaca cggttcctgc ggtggcttgt 1860
gtgacgcagg ttgtcaagga ggacacctgg cagtcgtaa 1899