WO2008124048A2 - Multizymes and their use in making polyunsaturated fatty acids - Google Patents

Multizymes and their use in making polyunsaturated fatty acids Download PDF

Info

Publication number
WO2008124048A2
WO2008124048A2 PCT/US2008/004377 US2008004377W WO2008124048A2 WO 2008124048 A2 WO2008124048 A2 WO 2008124048A2 US 2008004377 W US2008004377 W US 2008004377W WO 2008124048 A2 WO2008124048 A2 WO 2008124048A2
Authority
WO
WIPO (PCT)
Prior art keywords
seq
delta
elongase
desaturase
polypeptide
Prior art date
Application number
PCT/US2008/004377
Other languages
French (fr)
Other versions
WO2008124048A3 (en
WO2008124048A8 (en
Inventor
Howard G. Damude
Anthony J. Kinney
Kevin G. Ripp
Quinn Qun Zhu
Original Assignee
E. I. Du Pont De Nemours And Company
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by E. I. Du Pont De Nemours And Company filed Critical E. I. Du Pont De Nemours And Company
Priority to EP08727277.9A priority Critical patent/EP2129777B1/en
Priority to MX2009010574A priority patent/MX2009010574A/en
Priority to RSP-2009/0413A priority patent/RS20090413A/en
Priority to JP2010502136A priority patent/JP2010523113A/en
Priority to AU2008236723A priority patent/AU2008236723B2/en
Priority to CA002679988A priority patent/CA2679988A1/en
Priority to CN200880018608XA priority patent/CN101765658B/en
Priority to BRPI0808577A priority patent/BRPI0808577A2/en
Priority to UAA200910197A priority patent/UA103595C2/en
Priority to DK08727277.9T priority patent/DK2129777T3/en
Priority to ES08727277.9T priority patent/ES2559312T3/en
Priority to RU2009140397/10A priority patent/RU2517608C2/en
Publication of WO2008124048A2 publication Critical patent/WO2008124048A2/en
Publication of WO2008124048A3 publication Critical patent/WO2008124048A3/en
Publication of WO2008124048A8 publication Critical patent/WO2008124048A8/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P7/00Preparation of oxygen-containing organic compounds
    • C12P7/64Fats; Fatty oils; Ester-type waxes; Higher fatty acids, i.e. having at least seven carbon atoms in an unbroken chain bound to a carboxyl group; Oxidised oils or fats
    • C12P7/6409Fatty acids
    • C12P7/6427Polyunsaturated fatty acids [PUFA], i.e. having two or more double bonds in their backbone
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/0004Oxidoreductases (1.)
    • C12N9/0071Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/62DNA sequences coding for fusion proteins
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8242Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
    • C12N15/8243Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
    • C12N15/8247Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine involving modified lipid metabolism, e.g. seed oil composition
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/93Ligases (6)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y114/00Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14)
    • C12Y114/19Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14) with oxidation of a pair of donors resulting in the reduction of molecular oxygen to two molecules of water (1.14.19)
    • C12Y114/19001Stearoyl-CoA 9-desaturase (1.14.19.1), i.e. DELTA9-desaturase

Definitions

  • This invention is in the field of biotechnology. More specifically, this invention pertains to polynucleotide sequences encoding multizymes and their use in the synthesis of long-chain polyunsaturated fatty acids (PUFAs).
  • PUFAs long-chain polyunsaturated fatty acids
  • PUFAs are important biological components of healthy cells and are recognized as: "essential" fatty acids that cannot be synthesized de novo in mammals and instead must be obtained either in the diet or derived by further elongation and desaturation of linoleic acid (LA; 18:2 omega-6) or ⁇ -linolenic acid (ALA; 18:3 omega-3); constituents of plasma membranes of cells, where they may be found in such forms as phospholipids or triacylglycerols; necessary for proper development (particularly in the developing infant brain) and for tissue formation and repair; and, precursors to several biologically active eicosanoids of importance in mammals (e.g., prostacyclins, eicosanoids, leukotrienes, prostaglandins).
  • LA linoleic acid
  • ALA ⁇ -linolenic acid
  • constituents of plasma membranes of cells where they may be found in such forms as phospholipids or triacylglycerols; necessary for proper development
  • omega-3 PUFAs a high intake of long-chain omega-3 PUFAs produces cardiovascular protective effects (Dyerberg et al., Amer. J. Clin. Nutr. 28:958-966 (1975); Dyerberg et al., Lancet. 2(8081):117- 119 (1978); Shimokawa, H., World Rev. Nutr. Diet 88:100-108 (2001); von Schacky et al., World Rev. Nutr. Diet 88:90-99 (2001)).
  • Numerous other studies document wide-ranging health benefits conferred by administration of omega-3 and/or omega- 6 PUFAs against a variety of symptoms and diseases (e.g., asthma, psoriasis, eczema, diabetes, cancer).
  • arachidonic acid (ARA; 20:4 omega-6), eicosapentaenoic acid (EPA; 20:5 omega-3), and docosahexaenoic acid (DHA; 22:6 omega-3) all require expression of either the delta-9 elongase/delta-8 desaturase pathway (which operates in some organisms, such as euglenoid species and which is characterized by the production of eicosadienoic acid (EDA; 20:2 omega-6) and/or eicosatrienoic acid (ETrA; 20:3 omega-3)) or the delta-6 desaturase/delta-6 elongase pathway (which is predominantly found in algae, mosses, fungi, nematodes and humans and which is characterized by the production of gamma-linolenic acid (GLA; 18:3 omega-6) and/or stearidonic acid (STA; 18:4
  • the delta-8 desaturase enzymes identified thus far have the ability to convert both EDA to dihomo gamma-linolenic acid (DGLA (also known as HGLA); 20:3, n-6) and ETrA to eicosatetraenoic acid (ETA; 20:4, n-3).
  • DGLA dihomo gamma-linolenic acid
  • ETA eicosatetraenoic acid
  • ARA and EPA are subsequently synthesized from DGLA and ETA, respectively, following reaction with a delta-5 desaturase.
  • DHA synthesis requires the subsequent expression of an additional C20/22 elongase and a delta-4 desaturase.
  • AAN75707 Thraustochytrium sp. (GenBank Accession No. CAD42496; U.S. Patent 7,087,432); Schizochytrium aggregatum (SEQ ID NO:28; PCT Publication No. WO 2002/090493); Pavlova lutheri (GenBank Accession No. AAQ98793); and lsochrysis galbana (SEQ ID NO:30; GenBank Accession No. AAV33631 ; Pereira et al., Biochem. J., 384(2): 357-366 (2004); PCT Publication No. WO 2002/090493)].
  • the present invention concerns a multizyme comprising a single polypeptide having at least two independent and separable enzymatic activities.
  • the enzymatic activities of the multizyme can be selected from the group consisting of fatty acid elongases, fatty acid desaturases, acyl transferases, acyl CoA synthases, and thioesterases. More specifically, the enzymatic activities can comprise at least one fatty acid elongase linked to at least one fatty acid desaturase.
  • the multizyme can comprise a first enzymatic activity linked to a second enzymatic activity and said link is selected from the group consisting of a polypeptide bond, SEQ ID NO:198 (EgDHAsyni linker), SEQ ID NO:198 (EgDHAsyni linker), SEQ ID NO:198 (EgDHAsyni linker), SEQ ID NO:198 (EgDHAsyni linker), SEQ ID NO:198 (EgDHAsyni linker), SEQ ID NO:198 (EgDHAsyni linker), SEQ ID NO:198 (EgDHAsyni linker), SEQ ID NO:198 (EgDHAsyni linker), SEQ ID NO:198 (EgDHAsyni linker), SEQ ID NO:198 (EgDHAsyni linker), SEQ ID NO:198 (EgDHAsyni linker), SEQ ID NO:198 (EgDHAsyn
  • the invention concerns an isolated polynucleotide encoding a DHA synthase comprising:
  • nucleotide sequence encoding a polypeptide having DHA synthase activity, wherein the polypeptide has at least 80% amino acid identity, based on the Clustal V method of alignment, when compared to an amino acid sequence as set forth in SEQ ID NO:12, SEQ ID NO:22, SEQ ID NO:95, SEQ ID NO:96, or SEQ ID NO:97;
  • nucleotide sequence encoding a polypeptide having DHA synthase activity wherein the nucleotide sequence has at least 80% sequence identity, based on the BLASTN method of alignment, when compared to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:205, SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, or SEQ ID NO:410;
  • nucleotide sequence encoding a polypeptide having DHA synthase activity, wherein the nucleotide sequence hybridizes under stringent conditions to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:205, SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, or SEQ ID NO:410; or
  • the invention concerns the polynucleotide encoding a polypeptide having DHA synthase activity wherein the nucleotide sequence comprises SEQ ID NO:11 , SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, or SEQ ID NO:410.
  • the invention concerns the polypeptide of the invention having DHA synthase activity, wherein the amino acid sequence of the polypeptide comprises SEQ ID NO:12, SEQ ID NO:22, SEQ ID NO:95, SEQ ID NO:96, or SEQ ID NO:97.
  • the invention concerns an isolated polynucleotide encoding a C20 elongase comprising:
  • nucleotide sequence encoding a polypeptide having C20 elongase activity wherein the nucleotide sequence has at least 80% sequence identity, based on the BLASTN method of alignment, when compared to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:183, SEQ ID NO:188, SEQ ID NO:201 (EgDHAsyni C20 elongase domain), SEQ ID NO:206 (EgDHAsyni * C20 elongase domain), SEQ ID NO:203 (EgDHAsyn2 C20 elongase domain), SEQ ID NO:227 (EaDHAsyni C20 elongase domain), SEQ ID NO:228 (EaDHAsyn2 C20 elongase domain), SEQ ID NO:
  • nucleotide sequence encoding a polypeptide having C20 elongase activity, wherein the nucleotide sequence hybridizes under stringent conditions to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:183, SEQ ID NO:188, SEQ ID NO:201 (EgDHAsyni C20 elongase domain), SEQ ID NO:206
  • EgDHAsyni * C20 elongase domain SEQ ID NO:203 (EgDHAsyn2 C20 elongase domain), SEQ ID NO:227 (EaDHAsyni C20 elongase domain), SEQ ID NO:228 (EaDHAsyn2 C20 elongase domain), SEQ ID NO:229 (EaDHAsyn3 C20 elongase domain) or SEQ ID NO:230 (EaDHAsyn4 C20 elongase domain); or (d) a complement of the nucleotide sequence of (a), (b) or (c), wherein the complement and the nucleotide sequence consist of the same number of nucleotides and are 100% complementary.
  • the invention concerns an isolated polynucleotide encoding a delta-4 desaturase comprising: (a) a nucleotide sequence encoding a polypeptide having delta-4 desaturase activity, wherein the polypeptide has at least 80% amino acid identity, based on the Clustal V method of alignment, when compared to an amino acid sequence as set forth in SEQ ID NO:12, SEQ ID NO:22, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:193, SEQ ID NO:215, SEQ ID NO:217, SEQ ID NO:221 , SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , SEQ ID NO:246, SEQ ID NO:247, SEQ ID NO:248, SEQ ID NO:249, SEQ ID NO:382, SEQ ID NO:384, SEQ ID NO:386, SEQ ID NO:388, SEQ ID NO:404, SEQ ID NO:
  • nucleotide sequence encoding a polypeptide having delta-4 desaturase activity wherein the nucleotide sequence has at least 80% sequence identity, based on the BLASTN method of alignment, when compared to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:192, SEQ ID NO:214, SEQ ID No:216, SEQ ID NO:220, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:242, SEQ ID NO:243, SEQ ID NO:244, SEQ ID NO:245, SEQ ID NO:381 , SEQ ID NO:383, SEQ ID NO:385, SEQ ID NO:387, SEQ ID NO:403, SEQ ID NO:405, or SEQ ID NO:407; (c) a nucleotide sequence en
  • the invention concerns an isolated polynucleotide encoding a DHA synthase, said polynucleotide comprising the sequence set forth in any of SEQ ID NO:11 , SEQ ID NO:205, SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, or SEQ ID NO:410.
  • the invention concerns an isolated polynucleotide encoding a C20 elongase, said isolated polynucleotide encoding a C20 elongase, said polynucleotide comprising the sequence set forth in any of SEQ ID NO: 183, SEQ ID NO:188, SEQ ID NO:201 (EgDHAsyni C20 elongase domain, SEQ ID NO:206 (EgDHAsyni * C20 elongase domain), SEQ ID NO:203 (EgDHAsyn2 C20 elongase domain), SEQ ID NO:227 (EaDHAsyni C20 elongase domain), SEQ ID NO:228 (EaDHAsyn2 C20 elongase domain), SEQ ID NO:229 (EaDHAsyn3 C20 elongase domain) or SEQ ID NO:230 (EaDHA
  • the invention concerns an isolated polynucleotide encoding a delta-4 desaturase, said polynucleotide comprising the sequence set forth in SEQ ID NO:192, SEQ ID NO:214, SEQ ID NO:220, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:242, SEQ ID NO:243, SEQ ID NO:244, SEQ ID NO:245, SEQ ID NO:381 , SEQ ID NO:383, SEQ ID NO:385, SEQ ID No:387, SEQ ID NO:403, SEQ ID NO:405, or SEQ ID NO:407.
  • the invention concerns a recombinant construct comprising any of the isolated polynucleotides of the invention operably linked to at least one regulatory sequence.
  • the invention concerns a host cell comprising in its genome the recombinant construct of the invention. More particularly, the host cell is a recombinant microbial host cell comprising a multizyme of the invention, wherein the first enzymatic activity is a delta-9 elongase and the second enzymatic activity is a delta-8 desaturase. In another aspect, the first enzymatic activity is a C20 elongase, and the second enzymatic activity is a delta-4 desaturase. In a fourteenth embodiment, the invention concerns a transformed Yarrowia sp. comprising the recombinant construct of the invention.
  • the invention concerns a method for transforming a cell, comprising transforming a cell with the recombinant construct of the invention and selecting those cells transformed with said recombinant construct.
  • the invention concerns a method for producing a transformed plant comprising transforming a plant cell with any of the polynucleotides of the invention and regenerating a plant from the transformed plant cell.
  • the invention concerns a method for producing yeast comprising transforming a yeast cell with any of the polynucleotides of the invention and growing yeast from the transformed yeast cell.
  • the invention concerns a plant comprising in its genome the recombinant construct of the invention. Also of interest are seeds obtained from such plants, oil obtained from such seeds, food or feed incorporating such oil, and a beverage incorporating the oil of the invention.
  • the invention concerns an isolated nucleic acid molecule which encodes a C20 elongase as set forth in SEQ ID NO: 183 wherein at least 147 codons are codon-optimized for expression in Yarrowia sp.
  • the invention concerns an isolated nucleic acid molecule which encodes a C20 elongase as set forth in SEQ ID NO: 188 wherein at least 134 codons are codon-optimized for expression in Yarrowia sp.
  • the invention concerns an isolated nucleic acid molecule which encodes a delta-4 desaturase enzyme as set forth in SEQ ID NO: 192 wherein at least 285 codons are codon-optimized for expression in Yarrowia sp.
  • the invention concerns a method for making a multizyme which comprises: (a) linking a first polypeptide with at least a second polypeptide wherein each polypeptide has an independent and separable enzymatic activity; and
  • the invention concerns a method for altering the fatty acid profile of an oilseed plant comprising: a) transforming an oilseed plant cell with the recombinant construct of the invention; b) regenerating a plant from the transformed oilseed plant cell step (a), wherein the plant has an altered fatty acid profile.
  • the invention concerns an isolated polynucleotide encoding a DGLA synthase comprising:
  • nucleotide sequence encoding a polypeptide having DGLA synthase activity, wherein the polypeptide is set forth in SEQ ID NO:441 , SEQ ID NO:454, SEQ ID NO:461 , SEQ ID NO:464, SEQ ID NO:471 , SEQ ID NO:515, SEQ ID NO:516, SEQ ID NO:517, SEQ ID NO:518, or SEQ ID NO:519;
  • nucleotide sequence encoding a polypeptide having DGLA synthase activity wherein the nucleotide sequence is set forth in SEQ ID NO:440, SEQ ID NO:446, SEQ ID NO:453, SEQ ID NO:460, SEQ ID NO:463, SEQ ID NO:470, SEQ ID NO:492, SEQ ID NO:493, SEQ ID NO:494, SEQ ID NO:495, or SEQ ID NO:496;
  • nucleotide sequence encoding a polypeptide having DGLA synthase activity, wherein the nucleotide sequence hybridizes under stringent conditions to a nucleotide sequence as set forth in SEQ ID NO:440, SEQ ID NO:446, SEQ ID NO:453, SEQ ID NO:460, SEQ ID NO:463, SEQ ID NO:470, SEQ ID NO:492, SEQ ID NO:493, SEQ ID NO:494, SEQ ID NO:495, or SEQ ID NO:496; or (d) a complement of the nucleotide sequence of (a), (b) or (c), wherein the complement and the nucleotide sequence consist of the same number of nucleotides and are 100% complementary.
  • the invention concerns a method for converting linoleic acid to dihomo gamma-linolenic acid comprising: a) providing a recombinant microbial host cell comprising: i) a DGLA synthase comprising:
  • the invention concerns a method for the conversion of ⁇ -linolenic acid to eicosatrienoic acid comprising: a) providing a recombinant microbial host cell comprising: i) a DGLA synthase comprising: 1) at least one polypeptide encoding a delta-9 elongase;
  • the invention concerns a method for the conversion of eicosapentaenoic acid to docosahexaenoic acid comprising: a) providing a recombinant microbial host cell comprising: i) a DHA synthase comprising: 1) at least one polypeptide encoding a C20 elongase;
  • the invention concerns a method for the conversion of arachidonic acid to docosapentaenoic acid comprising: a) providing a recombinant microbial host cell comprising: i) a DHA synthase comprising:
  • the invention concerns a method for the identification of a polypeptide having improved delta-4 desaturase activity comprising: a) providing a wild-type delta-4 desaturase polypeptide isolated from Euglena anabena having a base-line delta-4 desaturase activity; b) truncating the wild-type polypeptide of (a) by from about 1 to about 200 amino acids to create a truncated mutant polypeptide having delta-4 desaturase activity that is increased as compared with the base-line delta-4 desaturase activity.
  • the invention concerns a microbial host cell which produces a polyunsaturated fatty acid and expresses polypeptides encoding enzymes in the following sequential pathway:
  • polypeptides comprise at least one multizyme, a fusion comprising a fusion between at least one contiguous enzyme pair.
  • FIG. 1 is a representative omega-3 and omega-6 fatty acid pathway providing for the conversion of myristic acid through various intermediates to DHA.
  • FIG. 2 shows a Clustal W alignment between a portion of the coding sequence of EgDHAsyn2 (SEQ ID NO:21), the cDNA sequence of the Euglena gracilis delta-4 desaturase (SEQ ID NO:23) (NCBI Accession No. AY278558 (Gl 33466345), locus AY278558, Meyer et al., ⁇ /oc ⁇ em/sfry 42(32):9779-9788 (2003)), and the coding sequence of the Euglena gracilis delta-4 desaturase (SEQ ID NO:24) (Meyer et al., supra).
  • FIGs. 3A and 3B show a Clustal W alignment between the amino acid sequence of EgDHAsyni (SEQ ID NO:12), EgDHAsyn2 (SEQ ID NO:22), and EgC20elo1 (SEQ ID NO:6).
  • FIGs. 4A and 4B show the Clustal W alignment of the N-terminus of EgDHAsyni (SEQ ID NO: 12) and the N-terminus of EgDHAsyn2 (SEQ ID NO:22) with EgC20elo1 (SEQ ID NO:6), Pavlova sp. CCMP459 C20-PUFA EIo (SEQ ID NO:2), Ostreococcus tauri PUFA elongase 2 (SEQ ID NO:25) (NCBI Accession No. AAV67798 (Gl 55852396), locus AAV67798, CDS AY591336; Meyer et al., J. Lipid Res.
  • FIGs. 5A , 5B, 5C and 5D show the Clustal W alignment of the C-terminus of
  • EgDHAsyni EgDHAsyn1_CT; amino acids 253-793 of SEQ ID NO:12; the N- terminus of EgDHAsyni is not shown and is indicated by ""
  • EgDHAsyn2_CT C-terminus of EgDHAsyn2
  • EgDHAsyn2_CT C-terminus of EgDHAsyn2
  • Euglena gracilis delta-4 fatty acid desaturase SEQ ID NO: 13
  • Thraustochytrium aureum delta-4 desaturase SEQ ID NO:27
  • AAN75707 (GI 25956288), locus AAN75707, CDS AF391543), Schizochytrium aggregatum delta-4 desaturase (SEQ ID NO:28) (PCT Publication No. WO 2002/090493), Thalassiosira pseudonana delta-4 desaturase (SEQ ID NO:29) (NCBI Accession No. AAX14506 (Gl 60173017), locus AAX14506, CDS AY817156; Tonon et al., FEBS J. 272 (13):3401-3412 (2005)), and lsochrysis galbana delta-4 desaturase (SEQ ID NO:30) (NCBI Accession No.
  • AAV33631 (Gl 54307110), locus AAV33631 , CDS AY630574; Pereira et al., Biochem. J., 384(2):357-366 (2004) and PCT Publication No. WO 2002/090493).
  • FIG. 6 shows an alignment of interior fragments of EgDHAsyni (EgDHAsyn1_ NCT; amino acids 253-365 of SEQ ID NO:12) and EgDHAsyn2 (EgDHAsyn2_NCT; amino acids 253-365 of SEQ ID NO:22) spanning both the C20 elongase region and the delta-4 desaturase domain (based on homology) with the C-termini of C20 elongases (EgC20elo1_CT, amino acids 246-298 of SEQ ID NO:6; PavC20elo_CT, amino acids 240-277 of SEQ ID NO:2; OtPUFAelo2_CT, amino acids 256-300 of SEQ ID NO:25; TpPUFAelo2_CT, amino acids 279-358 of SEQ ID NO:26) and the N-termini of delta-4 desaturases (EgD4_NT, amino acids 1-116 of SEQ ID NO: 13;
  • FIG. 7 provides plasmid maps for the following: (A) pY115 (see also SEQ ID NO:
  • FIG. 8 provides plasmid maps for the following: (A) pY132 (see also SEQ ID NO:40); (B) pY161 (see also SEQ ID NO:41); (C) pY164 (see also SEQ ID NO:42); and (D) pY141 (see also SEQ ID NO:49).
  • FIG. 9 provides plasmid maps for the following: (A) pY143 (see also SEQ ID NO:
  • FIG. 10 provides plasmid maps for the following: (A) pY152 (see also SEQ ID NO:67); (B) pY157 (see also SEQ ID NO:69); (C) pY153 (see also SEQ ID NO:72); and (D) pY151 (see also SEQ ID NO:76).
  • FIG. 11 is a map of pY160 (see also SEQ ID NO:77).
  • FIG. 12 shows a chromatogram of the lipid profile of a Euglena anabaena cell extract as described in the Examples.
  • FIGs. 13A, 13B and 13C show a Clustal W alignment of the amino acid sequences for EaDHAsyni (SEQ ID NO:95), EaDHAsyn2 (SEQ ID NO:96), EaDHAsyn3 (SEQ ID NO:97), and EaDHAsyn4 (SEQ ID NO:98).
  • FIG. 14 provides plasmid maps for the following: (A) pY165 (see also SEQ ID NO:99); (B) pY166 (see also SEQ ID NO:100); (C) pY167 (see also SEQ ID NO: 101); and (D) pY168 (see also SEQ ID NO: 102).
  • FIG. 15 provides plasmid maps for the following: (A) pKR1061 (see also SEQ ID NO:111); (B) pKR973 (see also SEQ ID NO: 128); (C) pKR1064 (see also SEQ ID NO:132); and (D) pKR1133 (see also SEQ ID NO:145).
  • FIG. 16 provides plasmid maps for the following: (A) pKR1105 (see also SEQ ID NO:
  • FIG. 17 is a map of KS373 (see also SEQ ID NO: 179).
  • FIG. 18 shows the fatty acid profiles, calculated % elongation, and calculated % desaturation for the clones (except pBY-EgC20elo1) shown in Table 24.
  • FIG. 19 shows the fatty acid profiles, calculated % elongation, and calculated % desaturation for feeding EPA to a vector only control, pY141, pY143, pY149, pY156, pY157, and pY160.
  • FIG. 20 shows the fatty acid profiles, calculated % elongation, and calculated % desaturation for feeding DPA to a vector only control, pY141 , pY150, pY151, pY152, pY153, pY156, pY157, and pY160.
  • FIG. 21 shows a schematic of the relative domain structure for each construct described in Table 25.
  • FIG. 22 shows the fatty acid profiles, calculated % elongation, and calculated % desaturation for feeding EPA, ARA, and DPA to Yarrowia cells transformed with pY141 (EgDHAsyni ; SEQ ID NO:49) and to a vector only control.
  • FIG. 23 shows the fatty acid profiles for individual embryos from a representative event in somatic soybean embryos transformed with soybean expression vectors pKR973 and pKR1064 (see Table 26).
  • FIG. 24 shows the fatty acid profiles from the five best elongation events in soybean embryos transformed with soybean expression vector KS373.
  • FIG. 25 summarizes BLASTP and percent identity values for EgC20elo1 (Example 3), EgDHAsyni (Example 4), and EgDHAsyn2 (Example 5).
  • FIG. 26 shows the fatty acid profiles from feeding soybean embryos with EPA. The soybean embryos were selected from the best C20/delta-5 elongase and delta-4 desaturase activities in soybean embryos transformed with soybean expression vector pKR1105.
  • FIG. 27 shows a chromatogram of the lipid profile of a Euglena gracilis cell extract as described in the Examples.
  • FIG. 28 is a map of pKR1183 (see also SEQ ID NO:266).
  • FIG. 29 summarizes the Euglena anabaena DHA synthase domain sequences.
  • FIG. 30 is a map of pKR1253 (see also SEQ ID NO:270).
  • FIG. 31 is a map of pKR1255 (see also SEQ ID NO:275).
  • FIG. 32 is a map of pKR1189 (see also SEQ ID NO:285).
  • FIG. 33 is a map of pKR1229 (see also SEQ ID NO:296).
  • FIG. 34 is a map of pKR1249 (see also SEQ ID NO:297).
  • FIG. 35 is a map of pKR1322 (see also SEQ ID NO:314).
  • FIG. 36 shows the fatty acid profiles for five events transformed with pKR1189 that have the lowest average ALA content (average of 5 soybean somatic embryos analyzed) along with an event (2148-3-8-1) having a fatty acid profile typical of wild type embryos for this experiment.
  • Fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA, and ALA.
  • Fatty acid compositions are expressed as a weight percent (wt. %) of total fatty acids.
  • FIG. 37 shows the fatty acid profiles for five events transformed with pKR1183 that have the highest average DGLA content (average of 5 soybean somatic embryos analyzed).
  • Fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA, ALA, EDA, ERA, DGLA, and ETA. Fatty acid compositions are expressed as a weight percent (wt. %) of total fatty acids.
  • FIG. 38 shows the average fatty acid profiles (Average of 10 soybean somatic embryos) for 20 events transformed with pKR1249 and pKR1253 that have the highest ARA.
  • Fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA, ALA, EDA, SCI, DGLA, ARA, ERA, JUN, ETA, and EPA.
  • Fatty acid compositions are expressed as a weight percent (wt. %) of total fatty acids.
  • Fatty acids listed as "others” include: 18:2 (5,9), 18:3 (5,9,12), STA, 20:0, 20:1(11), 20:2 (7,11) or 20:2 (8,11), and DPA.
  • FIG. 39 shows the actual fatty acid profiles for each soybean somatic embryo from one event (AFS 5416-8-1-1) having an average ARA content of 17.0% and an average EPA content of 1.5%.
  • Fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA, ALA, EDA, SCI, DGLA, ARA, ERA, JUN, ETA, and EPA. Fatty acid compositions are expressed as a weight percent (wt. %) of total fatty acids. Fatty acids listed as "others" include: 18:2 (5,9), 18:3 (5,9,12), STA, 20:0, 20:1(11), 20:2 (7,11) or 20:2 (8,11), and DPA.
  • FIG. 40 shows the average fatty acid profiles (Average of 9 or 10 soybean somatic embryos) for 20 events transformed with pKR1249 and pKR1255 that have the highest ARA.
  • Fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA, ALA, EDA, SCI, DGLA, ARA, ERA, JUN, ETA, and EPA; fatty acid compositions are expressed as a weight percent (wt. %) of total fatty acids.
  • Fatty acids listed as “others” include: 18:2 (5,9), 18:3 (5,9,12), STA, 20:0, 20:1(11), 20:2 (7,11) or 20:2 (8,11), and DPA.
  • FIG. 41 shows the fatty acid profiles from feeding embryos with EPA. The soybean embryos were selected from the events with the best C20/delta-5 elongase and delta-4 desaturase activities in soybean embryos transformed with soybean expression vector pKR1134. Fatty acids in FIG.
  • Fatty acid compositions listed in FIG. 41 are expressed as a weight percent (wt. %) of total fatty acids.
  • FIG. 42 shows the fatty acid profiles from feeding soybean embryos with EPA.
  • the soybean embryos were selected from the events with the best C20/delta-5 elongase and delta-4 desaturase activities from the 20 new events analyzed for soy transformed with pKR1105.
  • Fatty acids in FIG. 42 are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA, ALA, EPA, 22:0 (docosanoic acid), DPA, 24:0 (tetracosanoic acid), DHA, and 24:1 (nevonic acid).
  • Fatty acid compositions listed in FIG. 42 are expressed as a weight percent (wt. %) of total fatty acids.
  • FIG. 43 shows a graph depicting the relative activities of events transformed with either pKR1105 (C20 elongase and delta-4 desaturase expressed individually) or pKR1134 (C20 elongase and delta-4 desaturase expressed as a fusion), when the soybean embryos were fed EPA.
  • FIG. 44 diagrams the development of Yarrowia lipolytics strain Y4305U3.
  • FIG. 45 provides plasmid maps for the following: (A) pZKLeuN-29E3 and (B) pY116.
  • FIG. 46 provides plasmid maps for the following: (A) pKO2UF8289 and (B) pZKSL-555R.
  • FIG. 47 provides plasmid maps for the following: (A) pZP3-Pa777U and (B) pY117.
  • FIG. 48 provides plasmid maps for the following: (A) pZP2-2988 and (B) pZKUE3S.
  • FIG. 49 provides plasmid maps for the following: (A) pZKL2-5U89GC and (B) pZKL1-2SP98C.
  • FIG. 50 provides plasmid maps for the following: (A) pZKUM and (B) pZKD2- 5U89A2.
  • FIG. 51 A diagrams the development of Yarrowia lipolytica strain Y4184U.
  • FIG. 51 B provides a plasmid map for pEgC20ES.
  • FIG. 52 provides plasmid maps for the following: (A) pZUFmEgC20ES and
  • FIG. 52C is a schematic drawing showing overlap of the 3' region of the EaC20E domain (SEQ ID NO:231) with the 5 1 region of the EaD4 domain (SEQ ID NO:246) within EaDHAsyni (SEQ ID NO:95).
  • FIG. 53A shows an alignment between the N-termini of EaD4S (SEQ ID NO:193), EaD4S-1 (SEQ ID NO:382), EaD4S-2 (SEQ ID NO:384), and EaD4S-3 (SEQ ID NO:386).
  • FIG. 53B shows an alignment between the N-termini of EgD4S (SEQ ID NO:388), EgD4S-1 (SEQ ID NO:404), EgD4S-2 (SEQ ID NO:406), and EgD4S-3 (SEQ ID NO:408).
  • FIG. 54 provides plasmid maps for the following: (A) pZKLY-G204, (B) pEgC20ES-K, (C) pYNTGUS1-CNP, and (D) pZKLY.
  • FIG. 55 provides plasmid maps for the following: (A) pZUFmG9G8fu and (B) pZUFmG9A8.
  • FIG. 56 is a map of pKR1014.
  • FIG. 57 is a map of pKR1152.
  • FIG. 58 is a map of pKR1151.
  • FIG. 59 is a map of pKR1150.
  • FIG. 60 is a map of pKR1199.
  • FIG. 61 is a map of pKR1200.
  • FIG. 62 is a map of pKR1184.
  • FIG. 63 is a map of pKR1321.
  • FIG. 64 is a map of pKR1326.
  • fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA, ALA, EDA, ERA, DGLA, and ETA, and fatty acid compositions are expressed as a weight percent (wt. %) of total fatty acids.
  • elongation activity is expressed as % delta-9 elongation of C18 fatty acids (C18 % delta-9 elong), calculated according to the following formula: ([product]/[substrate + product]) * 100.
  • the combined percent elongation for LA and ALA is determined as: ([DGLA + ETA + EDA + ERA]/[LA + ALA + DGLA + ETA + EDA + ERA])*100.
  • the combined percent desaturation for EDA and ERA is shown as "C20 % delta-8 desat", determined as: ([DGLA + ETA]/[DGLA + ETA + EDA + ERA]) * 100, and is also referred to as the overall % desaturation.
  • FIG. 65 shows the fatty acid profiles for the five events transformed with pKR1014 that have the highest average DGLA content (average of 5 soybean somatic embryos analyzed).
  • FIG. 66 shows the fatty acid profiles for the five events transformed with pKR1152 that have the highest average DGLA content (average of 5 soybean somatic embryos analyzed).
  • FIG. 67 shows the fatty acid profiles for the five events transformed with pKR1151 that have the highest average DGLA content (average of 5 soybean somatic embryos analyzed).
  • FIG. 68 shows the fatty acid profiles for the five events transformed with pKR1150 that have the highest average DGLA content (average of 5 soybean somatic embryos analyzed).
  • FIG. 69 shows the fatty acid profiles for the five events transformed with pKR1199 that have the highest average DGLA content (average of 5 soybean somatic embryos analyzed).
  • FIG. 70 shows the fatty acid profiles for the five events transformed with pKR1200 that have the highest average DGLA content (average of 5 soybean somatic embryos analyzed).
  • FIG. 71 shows the fatty acid profiles for the five events transformed with pKR1184 that have the highest average DGLA content (average of 5 soybean somatic embryos analyzed).
  • FIG. 72 shows a comparison of individually expressed delta-9 elongases with delta-8 desaturases versus the equivalent delta-9 elongase-delta-8 desaturase fusion.
  • Each data point represents the average %DGLA or %EDA for 5-6 embryos (as a % of total fatty acids) for all events analyzed, and Avg. %DGLA is plotted vs. Avg. % EDA.
  • EgTpom represents EgD9e co-expressed with TpomD ⁇ (pKR1014)
  • EgTpomfus represents the EgD9e/TpomD8 fusion (pKR1199).
  • EgEa represents EgD9e co-expressed with EaD8 (pKR1152), and EgEafus represents the EgD9e/EaD8 fusion (pKR1200).
  • EaTpom represents EaD9e co-expressed with TpomD ⁇ (pKR1151), and EaTpomfus represents the EaD9e/TpomD8 fusion (pKR1183).
  • EaEa represents EaD9e co- expressed with EaD8 (pKR1150) and EaEafus represents the EaD9e/EaD8 fusion (pKR1200).
  • FIG. 73 shows the fatty acid profiles for the five events transformed with pKR1322 (Experiment MSE2274) that have the highest average ARA and EPA content (average of the 5 embryos analyzed)
  • Fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), 18:2 (5,9), LA, ALA, EDA, ERA, SCI, DGLA, JUN (also called JUP), ETA, ARA and EPA.
  • Fatty acid compositions are expressed as a weight percent (wt. %) of total fatty acids.
  • Elongation activity is expressed as % delta-9 elongation of C18 fatty acids (%Elo), calculated according to the following formula: ([product]/[substrate + product]) * 100. More specifically, the combined percent elongation for LA and ALA is determined as: ([DGLA + ETA + EDA + ERA + EPA + ARA]/[LA + ALA + DGLA + ETA + EDA + ERA + EPA +
  • the combined percent delta-8 desaturation for EDA and ERA is shown as “%D8", determined as: ([DGLA + ETA + EPA + ARA]/[DGLA + ETA + EDA + ERA + EPA + ARA]) * 100. This is also referred to as the overall % delta-8 desaturation.
  • the combined percent delta-5 desaturation for DGLA and ETA is shown as "%D5", determined as: ([EPA + ARA]/[DGLA + ETA + EPA + ARA]) * 100. This is also referred to as the overall % delta-5 desaturation.
  • FIG. 74 shows the fatty acid profiles for the five events transformed with pKR1326 (Experiment MSE2275) that have the highest average DGLA and ETA content (average of the 5 embryos analyzed).
  • Fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA, ALA, EDA, ERA, DGLA and DGLA and ETA.
  • Fatty acid compositions are expressed as a weight percent (wt. %) of total fatty acids.
  • Elongation activity is expressed as % delta-9 elongation of C18 fatty acids (C18 % delta-9 elong), calculated according to the following formula: ([product]/[substrate + product])*100.
  • the combined percent elongation for LA and ALA is determined as: ([DGLA + ETA + EDA + ERA]/[LA + ALA + DGLA + ETA + EDA + ERA])MOO.
  • the combined percent desaturation for EDA and ERA is shown as "C20 % delta-8 desat", determined as: ([DGLA + ETA]/[DGLA + ETA + EDA + ERA]) * 100. This is also referred to as the overall % desaturation.
  • Sequences Listing contains one letter codes for nucleotide sequence characters and the single and three letter codes for amino acids as defined in the IUPAC-IUB standards described in Nucleic Acids Research 13:3021-3030 (1985) and in the Biochemical Journal 219(2):345-373 (1984).
  • SEQ ID NOs: 1-519 are primers, ORFs encoding genes, proteins (or portions thereof), or plasmids, as identified in Table 2.
  • the present invention relates to multizymes, such as DHA synthase. These are useful for, inter alia, the manipulation of biochemical pathways for the production of healthful PUFAs and more specifically for the production of docosahexaenoic acid (DHA).
  • DHA docosahexaenoic acid
  • the subject invention finds many applications.
  • PUFAs, or derivatives thereof, made by the methodology disclosed herein can be used as dietary substitutes, or supplements, particularly infant formulas, for patients undergoing intravenous feeding or for preventing or treating malnutrition.
  • the purified PUFAs (or derivatives thereof) may be incorporated into cooking oils, fats, or margarines formulated so that in normal use the recipient would receive the desired amount for dietary supplementation.
  • the PUFAs may also be incorporated into infant formulas, nutritional supplements, or other food products and may find use as anti-inflammatory or cholesterol lowering agents.
  • the compositions may be used for pharmaceutical use (human or veterinary).
  • the PUFAs are generally administered orally but can be administered by any route by which they may be successfully absorbed, e.g., parenterally (e.g., subcutaneously, intramuscularly or intravenously), rectally, vaginally, or topically (e.g., as a skin ointment or lotion).
  • Supplementation of humans or animals with PUFAs produced by recombinant means can result in increased levels of the added PUFAs, as well as their metabolic progeny.
  • treatment with EPA can result not only in increased levels of EPA, but also downstream products of EPA such as eicosanoids (i.e., prostaglandins, leukotrienes, thromboxanes).
  • eicosanoids i.e., prostaglandins, leukotrienes, thromboxanes.
  • Complex regulatory mechanisms can make it desirable to combine various PUFAs, or add different conjugates of PUFAs, in order to prevent, control, or overcome such mechanisms to achieve the desired levels of specific PUFAs in an individual.
  • ORF Open reading frame
  • PCR Polymerase chain reaction
  • ATCC American Type Culture Collection
  • PUFA Polyunsaturated fatty acid(s)
  • PUFA Polyunsaturated fatty acid(s)
  • TAGs Triacylglycerols
  • down-regulate or down-regulation refer to a reduction or decrease in the level of expression of a gene or polynucleotide.
  • multizyme refers to a single polypeptide having at least two independent and separable enzymatic activities.
  • the multizyme comprises a first enzymatic activity linked to a second enzymatic activity.
  • fusion protein is used interchangeably with the term “multizyme”.
  • a “fusion protein” refers to a single polypeptide having at least two independent and separable enzymatic activities.
  • fusion gene refers to a polynucleotide or gene that encodes a multizyme.
  • a fusion gene can be constructed by linking at least two DNA fragments, wherein each DNA fragment encodes for an independent and separate enzyme activity.
  • An example of a fusion gene is described herein below in Example 38, in which the Hybrid 1 -HGLA Synthase fusion gene was constructed by linking the Euglena anabaena delta-9 elongase (EaD9Elo1 ; SEQ ID NO:252) and the
  • Tetruetreptia pomquetensis CCMP1491 delta-8 desaturase (TpomD8; SEQ ID NO: 162) using the Euglena gracilis DHA synthase 1 proline-rich linker. (EgDHAsynHink; SEQ ID NO:197).
  • domain or “functional domain” is a discrete, continuous part or subsequence of a polypeptide that can be associated with a function (e.g. enzymatic activity).
  • domain includes but is not limited to fatty acid biosynthetic enzymes and portions of fatty acid biosynthetic enzymes that retain enzymatic activity.
  • DHA synthase is an example of a multizyme. Specifically, a DHA synthase comprises a C20 elongase linked to a delta-4 desaturase using any of the linkers described herein. Another example of a multizyme is a single polypeptide comprising a delta-9 elongase linked to a delta-8 desaturase as discussed below.
  • link refers to joining or bonding at least two polypeptides having independent and separable enzyme activities.
  • linker refers to the bond or link between two or more polypeptides each having independent and separable enzymatic activities
  • the link used to form a multizyme is minimally comprised of a single polypeptide bond.
  • the link may be comprised of one amino acid residue, such as proline, or a polypeptide. If the link is a polypeptide, it may be desirable for the link to have at least one proline amino acid residue.
  • linker An example of a linker is shown in SEQ ID NO: 198 (the EgDHAsyni proline- rich linker).
  • fatty acids refers to long-chain aliphatic acids (alkanoic acids) of varying chain lengths, from about Ci 2 to C 22 (although both longer and shorter chain-length acids are known). The predominant chain lengths are between Ci ⁇ and C 22 .
  • Fatty acids are described herein by a simple notation system of "X:Y", wherein X is the total number of carbon (C) atoms in the particular fatty acid and Y is the number of double bonds.
  • the number following the fatty acid designation indicates the position of the double bond from the carboxyl end of the fatty acid with the "c" affix for the c/s-configu ration of the double bond (e.g., palmitic acid (16:0), stearic acid (18:0), oleic acid (18:1 , 9c), petroselinic acid (18:1 , 6c), LA (18:2, 9c,12c), GLA (18:3, 6c,9c,12c) and ALA (18:3, 9c,12c,15c)).
  • 18:1 , 18:2 and 18:3 refer to oleic, LA and ALA fatty acids, respectively. If not specifically written as otherwise, double bonds are assumed to be of the cis configuration. For instance, the double bonds in 18:2 (9,12) would be assumed to be in the cis configuration.
  • a metabolic, or biosynthetic, pathway in a biochemical sense, can be regarded as a series of chemical reactions occurring within a cell, catalyzed by enzymes, to achieve either the formation of a metabolic product to be used or stored by the cell, or the initiation of another metabolic pathway (then called a flux generating step). Many of these pathways are elaborate, and involve a step by step modification of the initial substance to shape it into a product having the exact chemical structure desired.
  • PUFA biosynthetic pathway refers to a metabolic process that converts oleic acid to LA, EDA, GLA, DGLA, ARA, DTA, DPAn-6, ALA, STA, ETrA, ETA, EPA, DPA and DHA. This process is well described in the literature (e.g., see PCT Publication No. WO 2006/052870). Simplistically, this process involves elongation of the carbon chain through the addition of carbon atoms and desaturation of the molecule through the addition of double bonds, via a series of special desaturation and elongation enzymes (i.e., "PUFA biosynthetic pathway enzymes") present in the endoplasmic reticulum membrane.
  • PUFA biosynthetic pathway enzymes a series of special desaturation and elongation enzymes
  • PUFA biosynthetic pathway enzyme refers to any of the following enzymes (and genes which encode said enzymes) associated with the biosynthesis of a PUFA, including: a delta-4 desaturase, a delta-5 desaturase, a delta-6 desaturase, a delta- 12 desaturase, a delta-15 desaturase, a delta-17 desaturase, a delta-9 desaturase, a delta-8 desaturase, a delta-9 elongase, a C14/1 6 elongase, a C16/18 elongase, a C1 8 / 20 elongase, a C 20 / 22 elongase, a DHA synthase and/or a multizyme of the instant invention.
  • omega-3/omega-6 fatty acid biosynthetic pathway refers to a set of genes which, when expressed under the appropriate conditions encode enzymes that catalyze the production of either or both omega-3 and omega-6 fatty acids.
  • genes involved in the omega-3/omega-6 fatty acid biosynthetic pathway encode PUFA biosynthetic pathway enzymes.
  • FIG. 1 A representative pathway is illustrated in FIG. 1 , providing for the conversion of myhstic acid through various intermediates to DHA, which demonstrates how both omega-3 and omega-6 fatty acids may be produced from a common source. The pathway is naturally divided into two portions where one portion will generate omega-3 fatty acids and the other portion, omega-6 fatty acids.
  • omega-3/omega-6 fatty acid biosynthetic pathway means that some (or all) of the genes in the pathway express active enzymes, resulting in in vivo catalysis or substrate conversion. It should be understood that “omega-3/omega-6 fatty acid biosynthetic pathway” or “functional omega-3/omega-6 fatty acid biosynthetic pathway” does not imply that all the PUFA biosynthetic pathway enzyme genes are required, as a number of fatty acid products will only require the expression of a subset of the genes of this pathway.
  • delta-6 desaturase/ delta-6 elongase pathway refers to a PUFA biosynthetic pathway that minimally includes at least one delta-6 desaturase and at least one Ci 8/2 0 elongase, thereby enabling biosynthesis of DGLA and/or ETA from LA and ALA, respectively, with GLA and/or STA as intermediate fatty acids.
  • ARA, DTA, DPAn-6, EPA, DPA, and DHA may also be synthesized.
  • delta-9 elongase/delta-8 desaturase pathway refers to a PUFA biosynthetic pathway that minimally comprises at least one delta-9 elongase and at least one delta-8 desaturase, thereby enabling biosynthesis of DGLA and/or ETA from LA and ALA, respectively, with EDA and/or ETrA as intermediate fatty acids
  • ARA, DTA 1 DPAn-6, EPA, DPA and DHA may also be synthesized. This pathway may be advantageous in some embodiments, as the biosynthesis of GLA and/or STA is excluded.
  • intermediate fatty acid refers to any fatty acid produced in a fatty acid metabolic pathway that can be further converted to an intended product fatty acid in this pathway by the action of other metabolic pathway enzymes.
  • EDA, ETrA, DGLA, ETA and ARA can be produced and are considered “intermediate fatty acids” since these fatty acids can be further converted to EPA via action of other metabolic pathway enzymes.
  • by-product fatty acid refers to any fatty acid produced in a fatty acid metabolic pathway that is not the intended fatty acid product of the pathway nor an "intermediate fatty acid” of the pathway.
  • sciadonic acid (SCI) and juniperonic acid (JUP) also can be produced by the action of a delta-5 desaturase on either EDA or ETrA, respectively. They are considered to be "by-product fatty acids” since neither can be further converted to EPA by the action of other metabolic pathway enzymes.
  • triacylglycerol refers to neutral lipids composed of three fatty acyl residues esterified to a glycerol molecule (and such terms will be used interchangeably throughout the present disclosure herein).
  • oils can contain long-chain PUFAs, as well as shorter saturated and unsaturated fatty acids and longer chain saturated fatty acids.
  • oil biosynthesis generically refers to the synthesis of TAGs in the cell.
  • Percent (%) PUFAs in the total lipid and oil fractions refers to the percent of PUFAs relative to the total fatty acids in those fractions.
  • total lipid fraction or “lipid fraction” both refer to the sum of all lipids (i.e., neutral and polar) within an oleaginous organism, thus including those lipids that are located in the phosphatidylcholine (PC) fraction, phosphatidylethanolamine (PE) fraction and triacylglycerol (TAG or oil) fraction.
  • PC phosphatidylcholine
  • PE phosphatidylethanolamine
  • TAG or oil triacylglycerol
  • conversion efficiency and “percent substrate conversion” refer to the efficiency by which a particular enzyme (e.g., a desaturase) can convert substrate to product.
  • the conversion efficiency is measured according to the following formula: ([product]/[substrate + product]) * 100, where 'product' includes the immediate product and all products in the pathway derived from it.
  • “Desaturase” is a polypeptide that can desaturate, i.e., introduce a double bond, in one or more fatty acids to produce a fatty acid or precursor of interest.
  • omega-reference system throughout the specification to refer to specific fatty acids, it is more convenient to indicate the activity of a desaturase by counting from the carboxyl end of the substrate using the delta-system.
  • delta-8 desaturases will desaturate a fatty acid between the eighth and ninth carbon atom numbered from the carboxyl-terminal end of the molecule and can, for example, catalyze the conversion of EDA to DGLA and/or ETrA to ETA.
  • fatty acid desaturases include, for example: (1) delta-5 desaturases that catalyze the conversion of DGLA to ARA and/or ETA to EPA; (2) delta-6 desaturases that catalyze the conversion of LA to GLA and/or ALA to STA; (3) delta- 4 desaturases that catalyze the conversion of DPA to DHA and/or DTA to DPAn-6; (4) delta-12 desaturases that catalyze the conversion of oleic acid to LA; (5) delta- 15 desaturases that catalyze the conversion of LA to ALA and/or GLA to STA; (6) delta-17 desaturases that catalyze the conversion of ARA to EPA and/or DGLA to ETA; and (7) delta-9 desaturases that catalyze the conversion of palmitic acid to palmitoleic acid (16:1) and/or stearic acid to oleic acid (18:1).
  • delta-15 and delta-17 desaturases are also occasionally referred to as “omega-3 desaturases”, “w-3 desaturases”, and/or “n-3 desaturases”, based on their ability to convert omega-6 fatty acids into their omega-3 counterparts (e.g., conversion of LA into ALA and ARA into EPA, respectively).
  • omega-3 desaturases w-3 desaturases
  • n-3 desaturases based on their ability to convert omega-6 fatty acids into their omega-3 counterparts (e.g., conversion of LA into ALA and ARA into EPA, respectively).
  • it is most desirable to empirically determine the specificity of a particular fatty acid desaturase by transforming a suitable host with the gene for the fatty acid desaturase and determining its effect on the fatty acid profile of the host.
  • delta-4 desaturase refers to an enzyme that will desaturate a fatty acid between the fourth and fifth carbon atom numbered from the carboxyl-terminal end of the molecule and that can, for example, catalyze the conversion of DPA to DHA and/or DTA to DPAn-6.
  • EgDHAsyni refers to a DHA synthase enzyme (SEQ ID NO:12) isolated from Euglena gracilis, encoded by SEQ ID NO:11 herein.
  • EgDHAsyn2 refers to a DHA synthase enzyme (SEQ ID NO:22) isolated from Euglena gracilis, encoded by SEQ ID NO:21 herein.
  • EaDHAsyni refers to a DHA synthase enzyme (SEQ ID NO:95) isolated from Euglena anabaena, encoded by SEQ ID NO:91 herein.
  • EaDHAsyn2 refers to a DHA synthase enzyme (SEQ ID NO:96) isolated from Euglena anabaena, encoded by SEQ ID NO:92 herein.
  • EaDHAsyn3 refers to a DHA synthase enzyme (SEQ ID NO:97) isolated from Euglena anabaena, encoded by SEQ ID NO:93 herein.
  • EaDHAsyn4 refers to an enzyme (SEQ ID NO:98) isolated from Euglena anabaena, encoded by SEQ ID NO:94 herein.
  • elongase system refers to a suite of four enzymes that are responsible for elongation of a fatty acid carbon chain to produce a fatty acid that is two carbons longer than the fatty acid substrate that the elongase system acts upon. More specifically, the process of elongation occurs in association with fatty acid synthase, whereby CoA is the acyl carrier (Lassner et al., Plant Cell 8:281-292 (1996)).
  • malonyl-CoA is condensed with a long-chain acyl-CoA to yield carbon dioxide (CO 2 ) and a ⁇ -ketoacyl-CoA (where the acyl moiety has been elongated by two carbon atoms).
  • Subsequent reactions include reduction to ⁇ - hydroxyacyl-CoA, dehydration to an enoyl-CoA and a second reduction to yield the elongated acyl-CoA.
  • Examples of reactions catalyzed by elongase systems are the conversion of GLA to DGLA, STA to ETA, LA to EDA, ALA to ETrA and EPA to DPA.
  • an enzyme catalyzing the first condensation reaction i.e., conversion of malonyl-CoA and long-chain acyl-CoA to ⁇ -ketoacyl- CoA
  • an enzyme catalyzing the first condensation reaction i.e., conversion of malonyl-CoA and long-chain acyl-CoA to ⁇ -ketoacyl- CoA
  • the substrate selectivity of elongases is somewhat broad but segregated by both chain length and the degree of unsaturation. Accordingly, elongases can have different specificities. For example, a C-
  • C 16/18 elongase will utilize a C 16 substrate (e.g., palmitate); a C-18/20 elongase will utilize a C 18 substrate (e.g., GLA, STA); and a C20/22 elongase will utilize a C 2 o substrate (e.g., ARA, EPA).
  • a "delta-9 elongase" is able to catalyze the conversion of LA to EDA and/or ALA to ETrA.
  • elongases have broad specificity and thus a single enzyme may be capable of catalyzing several elongase reactions.
  • a delta-9 elongase may also act as a C 16 / 18 elongase, C 18 /2o elongase and/or C20/22 elongase and may have alternate, but not preferred, specificities for delta-5 and delta-6 fatty acids such as EPA and/or GLA, respectively.
  • C20 elongase refers to an enzyme which utilizes a
  • C20 substrate such as EPA or ARA, for example.
  • C20/delta-5 elongase refers to an enzyme that utilizes a C20 substrate with a delta-5 double bond.
  • EgD9elo or “EgD9e” refers to a delta-9 elongase isolated from Euglena gracilis (see SEQ ID NO:112; also see U.S.
  • nucleic acid means a polynucleotide and includes a single or double-stranded polymer of deoxyribonucleotide or ribonucleotide bases. Nucleic acids may also include fragments and modified nucleotides. Thus, the terms “polynucleotide”, “nucleic acid sequence”, “nucleotide sequence” or “nucleic acid fragment” are used interchangeably and refer to a polymer of RNA or DNA that is single- or double-stranded, optionally containing synthetic, non-natural or altered nucleotide bases.
  • Nucleotides are referred to by their single letter designation as follows: “A” for adenylate or deoxyadenylate (for RNA or DNA, respectively), “C” for cytidylate or deoxycytidylate, “G” for guanylate or deoxyguanylate, “U” for uridylate, “T” for deoxythymidylate, “R” for purines (A or G), “Y” for pyrimidines (C or T), "K” for G or T, “H” for A or C or T, “I” for inosine, and “N” for any nucleotide.
  • fragment that is functionally equivalent and “functionally equivalent subfragment” are used interchangeably herein. These terms refer to a portion or subsequence of an isolated nucleic acid fragment in which the ability to alter gene expression or produce a certain phenotype is retained whether or not the fragment or subfragment encodes an active enzyme.
  • the fragment or subfragment can be used in the design of chimeric genes to produce the desired phenotype in a transformed plant. Chimeric genes can be designed for use in suppression by linking a nucleic acid fragment or subfragment thereof, whether or not it encodes an active enzyme, in the sense or antisense orientation relative to a plant promoter sequence.
  • conserved domain or "motif means a set of amino acids conserved at specific positions along an aligned sequence of evolutionarily related proteins. While amino acids at other positions can vary between homologous proteins, amino acids that are highly conserved at specific positions indicate amino acids that are essential in the structure, the stability, or the activity of a protein.
  • homologous amino acids that are highly conserved at specific positions indicate amino acids that are essential in the structure, the stability, or the activity of a protein.
  • homology “homologous”, “substantially similar” and “corresponding substantially” are used interchangeably herein. They refer to nucleic acid fragments wherein changes in one or more nucleotide bases do not affect the ability of the nucleic acid fragment to mediate gene expression or produce a certain phenotype.
  • nucleic acid fragments of the instant invention also refer to modifications of the nucleic acid fragments of the instant invention such as deletion or insertion of one or more nucleotides that do not substantially alter the functional properties of the resulting nucleic acid fragment relative to the initial, unmodified fragment. It is therefore understood, as those skilled in the art will appreciate, that the invention encompasses more than the specific exemplary sequences.
  • substantially similar nucleic acid sequences encompassed by this invention are also defined by their ability to hybridize (under moderately stringent conditions, e.g., 0.5X SSC, 0.1 % SDS, 60 0 C) with the sequences exemplified herein, or to any portion of the nucleotide sequences disclosed herein and which are functionally equivalent to any of the nucleic acid sequences disclosed herein.
  • Stringency conditions can be adjusted to screen for moderately similar fragments, such as homologous sequences from distantly related organisms, to highly similar fragments, such as genes that duplicate functional enzymes from closely related organisms. Post-hybridization washes determine stringency conditions.
  • sequences include reference to hybridization, under stringent hybridization conditions, of a nucleic acid sequence to a specified nucleic acid target sequence to a detectably greater degree (e.g., at least 2-fold over background) than its hybridization to non-target nucleic acid sequences and to the substantial exclusion of non-target nucleic acids.
  • Selectively hybridizing sequences typically have about at least 80% sequence identity, or 90% sequence identity, up to and including 100% sequence identity (i.e., fully complementary) with each other.
  • stringent conditions or “stringent hybridization conditions” includes reference to conditions under which a probe will selectively hybridize to its target sequence. Stringent conditions are sequence-dependent and will be different in different circumstances.
  • target sequences can be identified which are 100% complementary to the probe (homologous probing).
  • stringency conditions can be adjusted to allow some mismatching in sequences so that lower degrees of similarity are detected (heterologous probing).
  • a probe is less than about 1000 nucleotides in length, optionally less than 500 nucleotides in length.
  • stringent conditions will be those in which the salt concentration is less than about 1.5 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30 0 C for short probes (e.g., 10 to 50 nucleotides) and at least about 60 0 C for long probes (e.g., greater than 50 nucleotides).
  • Stringent conditions may also be achieved with the addition of destabilizing agents such as formamide.
  • Exemplary moderate stringency conditions include hybridization in 40 to 45% formamide, 1 M NaCI, 1 % SDS at 37 0 C, and a wash in 0.5X to 1X SSC at 55 to 60 0 C.
  • Exemplary high stringency conditions include hybridization in 50% formamide, 1 M NaCI, 1% SDS at 37 0 C, and a wash in 0.1X SSC at 60 to 65 0 C.
  • the T m is the temperature (under defined ionic strength and pH) at which 50% of a complementary target sequence hybridizes to a perfectly matched probe.
  • T m is reduced by about 1°C for each 1% of mismatching; thus, T m , hybridization and/or wash conditions can be adjusted to hybridize to sequences of the desired identity. For example, if sequences with >90% identity are sought, the T m can be decreased 10 0 C. Generally, stringent conditions are selected to be about 5 0 C lower than the thermal melting point (T m ) for the specific sequence and its complement at a defined ionic strength and pH.
  • Sequence identity or “identity” in the context of nucleic acid or polypeptide sequences refers to the nucleic acid bases or amino acid residues in two sequences that are the same when aligned for maximum correspondence over a specified comparison window.
  • percentage of sequence identity refers to the value determined by comparing two optimally aligned sequences over a comparison window, wherein the portion of the polynucleotide or polypeptide sequence in the comparison window may comprise additions or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences.
  • the percentage is calculated by determining the number of positions at which the identical nucleic acid base or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison and multiplying the results by 100 to yield the percentage of sequence identity.
  • percent sequence identities include, but are not limited to, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95%, or any integer percentage from 50% to 100%. These identities can be determined using any of the programs described herein.
  • Sequence alignments and percent identity or similarity calculations may be determined using a variety of comparison methods designed to detect homologous sequences including, but not limited to, the MegAlignTM program of the LASERGENE bioinformatics computing suite (DNASTAR Inc., Madison, Wl).
  • sequence analysis software is used for analysis, that the results of the analysis will be based on the "default values" of the program referenced, unless otherwise specified.
  • default values will mean any set of values or parameters that originally load with the software when first initialized.
  • Clustal V method of alignment corresponds to the alignment method labeled Clustal V (described by Higgins and Sharp, CABIOS. 5:151-153 (1989); Higgins, D. G. et al., Comput. Appl. Biosci. 8:189-191 (1992)) and found in the
  • MegAlignTM program of the LASERGENE bioinformatics computing suite (DNASTAR Inc., Madison, Wl).
  • BLASTN method of alignment is an algorithm provided by the National Center for Biotechnology Information (NCBI) to compare nucleotide sequences using default parameters.
  • polypeptides from other species, wherein such polypeptides have the same or similar function or activity.
  • percent identities include, but are not limited to, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95%, or any integer percentage from 50% to 100%.
  • any integer amino acid identity from 50% to 100% may be useful in describing the present invention, such as 51 %, 52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%. Also, of interest is any full-length or partial complement of this isolated nucleotide fragment.
  • Gene refers to a nucleic acid fragment that expresses a specific protein and can include either the coding region alone or the coding region in addition to the regulatory sequences preceding (5' non-coding sequences) and following (3' non- coding sequences) the coding sequence.
  • “Native gene” refers to a gene as found in nature with its own regulatory sequences.
  • “Chimeric gene” refers to any gene that is not a native gene, comprising regulatory and coding sequences that are not found together in nature. Accordingly, a chimeric gene may comprise regulatory sequences and coding sequences that are derived from different sources, or regulatory sequences and coding sequences derived from the same source, but arranged in a manner different than that found in nature.
  • Endogenous gene refers to a native gene in its natural location in the genome of an organism.
  • a “foreign” gene refers to a gene not normally found in the host organism, but that is introduced into the host organism by gene transfer.
  • Foreign genes can comprise native genes inserted into a non-native organism, or chimeric genes.
  • a “transgene” is a gene that has been introduced into the genome by a transformation procedure.
  • the term “genome” as it applies to plant cells encompasses not only chromosomal DNA found within the nucleus, but organelle DNA found within subcellular components (e.g., mitochondrial, plastid) of the cell.
  • a “codon-optimized gene” is a gene having its frequency of codon usage designed to mimic the frequency of preferred codon usage of the host cell.
  • An “allele” is one of several alternative forms of a gene occupying a given locus on a chromosome. When all the alleles present at a given locus on a chromosome are the same that plant is homozygous at that locus. If the alleles present at a given locus on a chromosome differ that plant is heterozygous at that locus.
  • Coding sequence refers to a DNA sequence that codes for a specific amino acid sequence.
  • Regulatory sequences refer to nucleotide sequences located upstream (5' non-coding sequences), within, or downstream (3' non-coding sequences) of a coding sequence, and which influence the transcription, RNA processing or stability, or translation of the associated coding sequence. Regulatory sequences may include, but are not limited to: promoters, translation leader sequences, introns, polyadenylation recognition sequences, RNA processing sites, effector binding sites and stem-loop structures.
  • Promoter refers to a DNA sequence capable of controlling the expression of a coding sequence or functional RNA. The promoter sequence consists of proximal and more distal upstream elements, the latter elements often referred to as enhancers.
  • an “enhancer” is a DNA sequence that can stimulate promoter activity, and may be an innate element of the promoter or a heterologous element inserted to enhance the level or tissue-specificity of a promoter. Promoters may be derived in their entirety from a native gene, or be composed of different elements derived from different promoters found in nature, or even comprise synthetic DNA segments. It is understood by those skilled in the art that different promoters may direct the expression of a gene in different tissues or cell types, or at different stages of development, or in response to different environmental conditions. It is further recognized that since in most cases the exact boundaries of regulatory sequences have not been completely defined, DNA fragments of some variation may have identical promoter activity.
  • Promoters that cause a gene to be expressed in most cell types at most times are commonly referred to as "constitutive promoters". New promoters of various types useful in plant cells are constantly being discovered; numerous examples may be found in the compilation by Okamuro, J. K., and Goldberg, R. B. Biochemistry of Plants 15:1-82 (1989).
  • Translation leader sequence refers to a polynucleotide sequence located between the promoter sequence of a gene and the coding sequence.
  • the translation leader sequence is present in the fully processed mRNA upstream of the translation start sequence.
  • the translation leader sequence may affect processing of the primary transcript to mRNA, mRNA stability or translation efficiency. Examples of translation leader sequences have been described (Turner, R. and Foster, G. D., MoI. Biotechnol. 3:225-236 (1995)).
  • “3' non-coding sequences”, “transcription terminator” or “termination sequences” refer to DNA sequences located downstream of a coding sequence, including polyadenylation recognition sequences and other sequences encoding regulatory signals capable of affecting mRNA processing or gene expression.
  • RNA transcript refers to the product resulting from RNA polymerase- catalyzed transcription of a DNA sequence.
  • the primary transcript When the RNA transcript is a perfect complementary copy of the DNA sequence, it is referred to as the primary transcript.
  • An RNA transcript is referred to as the mature RNA when it is an RNA sequence derived from post-transcriptional processing of the primary transcript.
  • RNA refers to the RNA that is without introns and that can be translated into protein by the cell.
  • cDNA refers to a DNA that is complementary to, and synthesized from, an mRNA template using the enzyme reverse transcriptase. The cDNA can be single-stranded or converted into double-stranded form using the Klenow fragment of DNA polymerase I.
  • Sense RNA refers to RNA transcript that includes the mRNA and can be translated into protein within a cell or in vitro.
  • Antisense RNA refers to an RNA transcript that is complementary to all or part of a target primary transcript or mRNA, and that blocks or reduces the expression of a target gene (U.S. Patent No. 5,107,065).
  • the complementarity of an antisense RNA may be with any part of the specific gene transcript, i.e., at the 5' non-coding sequence, 3' non-coding sequence, introns, or the coding sequence.
  • “Functional RNA” refers to antisense RNA, ribozyme RNA, or other RNA that may not be translated but yet has an effect on cellular processes.
  • RNA transcripts and are meant to define the antisense RNA of the message.
  • operably linked refers to the association of nucleic acid sequences on a single nucleic acid fragment so that the function of one is affected by the other.
  • a promoter is operably linked with a coding sequence when it is capable of affecting the expression of that coding sequence (i.e., the coding sequence is under the transcriptional control of the promoter). Coding sequences can be operably linked to regulatory sequences in a sense or antisense orientation.
  • the complementary RNA regions of the invention can be operably linked, either directly or indirectly, 5' to the target mRNA, or 3 1 to the target mRNA, or within the target mRNA, or a first complementary region is 5' and its complement is 3' to the target mRNA.
  • PCR or “polymerase chain reaction” is a technique for the synthesis of large quantities of specific DNA segments and consists of a series of repetitive cycles (Perkin Elmer Cetus Instruments, Norwalk, CT). Typically, the double-stranded DNA is heat denatured, the two primers complementary to the 3' boundaries of the target segment are annealed at low temperature and then extended at an intermediate temperature. One set of these three consecutive steps is referred to as a "cycle”.
  • cycle One set of these three consecutive steps.
  • the term “recombinant” refers to an artificial combination of two otherwise separated segments of sequence, e.g., by chemical synthesis or by the manipulation of isolated segments of nucleic acids by genetic engineering techniques.
  • a "plasmid” or “vector” is an extra chromosomal element often carrying genes that are not part of the central metabolism of the cell, and usually in the form of circular double-stranded DNA fragments. Such elements may be autonomously replicating sequences, genome integrating sequences, phage or nucleotide sequences, linear or circular, of a single- or double-stranded DNA or RNA, derived from any source, in which a number of nucleotide sequences have been joined or recombined into a unique construction which is capable of introducing an expression cassette(s) into a cell.
  • “Expression cassette” refers to a fragment of DNA containing a foreign gene and having elements in addition to the foreign gene that allow for enhanced expression of that gene in a foreign host.
  • Transformation cassette refers to a fragment of DNA containing a foreign gene and having elements in addition to the foreign gene that facilitate transformation of a particular host cell.
  • a recombinant construct comprises an artificial combination of nucleic acid fragments, e.g., regulatory and coding sequences that are not found together in nature.
  • a recombinant construct may comprise regulatory sequences and coding sequences that are derived from different sources, or regulatory sequences and coding sequences derived from the same source, but arranged in a manner different than that found in nature.
  • Such a construct may be used by itself or may be used in conjunction with a vector. If a vector is used, then the choice of vector is dependent upon the method that will be used to transform host cells as is well known to those skilled in the art.
  • a plasmid vector can be used.
  • the skilled artisan is well aware of the genetic elements that must be present on the vector in order to successfully transform, select and propagate host cells comprising any of the isolated nucleic acid fragments of the invention.
  • the skilled artisan will also recognize that different independent transformation events will result in different levels and patterns of expression (Jones et al., EMBO J. 4:2411-2418 (1985); De Almeida et al., MoI. Gen. Genetics 218:78-86 (1989)), and thus that multiple events must be screened in order to obtain lines displaying the desired expression level and pattern.
  • Such screening may be accomplished by Southern analysis of DNA, Northern analysis of mRNA expression, immunoblotting analysis of protein expression, or phenotypic analysis, among others.
  • expression refers to the production of a functional end-product (e.g., an mRNA or a protein [either precursor or mature]).
  • introduced means providing a nucleic acid (e.g., expression construct) or protein into a cell. Introduced includes reference to the incorporation of a nucleic acid into a eukaryotic or prokaryotic cell where the nucleic acid may be incorporated into the genome of the cell, and includes reference to the transient provision of a nucleic acid or protein to the cell. Introduced includes reference to stable or transient transformation methods, as well as sexually crossing.
  • nucleic acid fragment in the context of inserting a nucleic acid fragment (e.g., a recombinant construct/expression construct) into a cell, means “transfection” or “transformation” or “transduction” and includes reference to the incorporation of a nucleic acid fragment into a eukaryotic or prokaryotic cell where the nucleic acid fragment may be incorporated into the genome of the cell (e.g., chromosome, plasmid, plastid or mitochondrial DNA), converted into an autonomous replicon, or transiently expressed (e.g., transfected mRNA).
  • a nucleic acid fragment e.g., a recombinant construct/expression construct
  • transduction includes reference to the incorporation of a nucleic acid fragment into a eukaryotic or prokaryotic cell where the nucleic acid fragment may be incorporated into the genome of the cell (e.g., chromosome, plasmid, plastid or mitochondrial DNA),
  • “Mature” protein refers to a post-translationally processed polypeptide (i.e., one from which any pre- or propeptides present in the primary translation product have been removed).
  • "Precursor” protein refers to the primary product of translation of mRNA (i.e., with pre- and propeptides still present). Pre- and propeptides may be but are not limited to intracellular localization signals.
  • “Stable transformation” refers to the transfer of a nucleic acid fragment into a genome of a host organism, including both nuclear and organellar genomes, resulting in genetically stable inheritance.
  • “transient transformation” refers to the transfer of a nucleic acid fragment into the nucleus, or DNA-containing organelle, of a host organism resulting in gene expression without integration or stable inheritance.
  • Host organisms containing the transformed nucleic acid fragments are referred to as "transgenic" organisms.
  • transgenic refers to a plant or a cell which comprises within its genome a heterologous polynucleotide.
  • the heterologous polynucleotide is stably integrated within the genome such that the polynucleotide is passed on to successive generations.
  • the heterologous polynucleotide may be integrated into the genome alone or as part of an expression construct.
  • Transgenic is used herein to include any cell, cell line, callus, tissue, plant part or plant, the genotype of which has been altered by the presence of heterologous nucleic acid including those transgenics initially so altered as well as those created by sexual crosses or asexual propagation from the initial transgenic.
  • transgenic does not encompass the alteration of the genome (chromosomal or extra-chromosomal) by conventional plant breeding methods or by naturally occurring events such as random cross-fertilization, non-recombinant viral infection, non-recombinant bacterial transformation, non-recombinant transposition, or spontaneous mutation.
  • Antisense inhibition refers to the production of antisense RNA transcripts capable of suppressing the expression of the target protein.
  • Co-suppression refers to the production of sense RNA transcripts capable of suppressing the expression of identical or substantially similar foreign or endogenous genes (U.S. Patent No. 5,231 ,020). Co-suppression constructs in plants previously have been designed by focusing on overexpression of a nucleic acid sequence having homology to an endogenous mRNA, in the sense orientation, which results in the reduction of all RNA having homology to the overexpressed sequence (Vaucheret et al., Plant J. 16:651-659 (1998); Gura, Nature 404:804-808 (2000)).
  • oilseed plants refer to those organisms that tend to store their energy source in the form of lipid (Weete, In: Fungal Lipid Biochemistry, 2 nd Ed., Plenum, 1980).
  • oilseed plants include, but are not limited to: soybean (Glycine and Soja sp.), flax (Linum sp.), rapeseed (Brassica sp.), maize, cotton, safflower (Carthamus sp.) and sunflower (Helianthus sp.).
  • oleaginous microorganisms the cellular oil or TAG content generally follows a sigmoid curve, wherein the concentration of lipid increases until it reaches a maximum at the late logarithmic or early stationary growth phase and then gradually decreases during the late stationary and death phases (Yongmanitchai and Ward, Appl. Environ. Microbiol. 57:419-25 (1991)).
  • oleaginous yeast refers to those microorganisms classified as yeasts that make oil. It is not uncommon for oleaginous microorganisms to accumulate in excess of about 25% of their dry cell weight as oil.
  • oleaginous yeast examples include, but are no means limited to, the following genera: Yarrowia, Candida, Rhodotorula, Rhodosporidium, Cryptococcus, Trichosporon and Lipomyces.
  • biomass refers specifically to spent or used yeast cellular material resulting from the fermentation of a recombinant production host producing PUFAs in commercially significant amounts, wherein the preferred production host is a recombinant strain of the oleaginous yeast, Yarrowia lipolytica.
  • the biomass may be in the form of whole cells, whole cell lysates, homogenized cells, partially hydrolyzed cellular material, and/or partially purified cellular material (e.g., microbially produced oil).
  • Euglenophyceae refers to a group of unicellular colorless or photosynthetic flagellates ("euglenoids”) found living in freshwater, marine, soil, and parasitic environments.
  • the class is characterized by solitary unicells, wherein most are free-swimming and have two flagella (one of which may be nonemergent) arising from an anterior invagination known as a reservoir.
  • Photosynthetic euglenoids contain one to many grass-green chloroplasts, which vary from minute disks to expanded plates or ribbons. Colorless euglenoids depend on osmotrophy or phagotrophy for nutrient assimilation. About 1000 species have been described and classified into about 40 genera and 6 orders.
  • Examples of Euglenophyceae include, but are by no means limited to, the following genera: Euglena, Eutreptiella and Tetruetreptia.
  • plant refers to whole plants, plant organs, plant tissues, seeds, plant cells, seeds and progeny of the same. Plant cells include, without limitation, cells from seeds, suspension cultures, embryos, meristematic regions, callus tissue, leaves, roots, shoots, gametophytes, sporophytes, pollen and microspores. "Progeny” comprises any subsequent generation of a plant.
  • Palmitate is the precursor of longer-chain saturated and unsaturated fatty acid derivates, which are formed through the action of elongases and desaturases (FIG. 1).
  • TAGs (the primary storage unit for fatty acids) are formed by a series of reactions that involve: (1) the esterification of one molecule of acyl-CoA to glycerol- 3-phosphate via an acyltransferase to produce lysophosphatidic acid; (2) the esterification of a second molecule of acyl-CoA via an acyltransferase to yield 1,2- diacylglycerol phosphate (commonly identified as phosphatidic acid); (3) removal of a phosphate by phosphatidic acid phosphatase to yield 1,2-diacylglycerol (DAG); and (4) the addition of a third fatty acid by the action of an acyltransferase to form TAG.
  • a wide spectrum of fatty acids can be incorporated into TAGs, including saturated and unsaturated fatty acids and short-chain and long-chain fatty acids.
  • omega-6 fatty acids are formed as follows: (1) LA is converted to EDA by a delta-9 elongase; (2) EDA is converted to DGLA by a delta-8 desaturase; (3) DGLA is converted to ARA by a delta-5 desaturase; (4) ARA is converted to DTA by a C20/22 elongase; and, (5) DTA is converted to DPAn-6 by a delta-4 desaturase.
  • delta-9 elongase/delta-8 desaturase pathway can use ALA as substrate to produce long chain omega-3 fatty acids as follows: (1) LA is converted to ALA, the first of the omega-3 fatty acids, by a delta-15 desaturase; (2) ALA is converted to ETrA by a delta-9 elongase; (3) ETrA is converted to ETA by a delta-8 desaturase; (4) ETA is converted to EPA by a delta-5 desaturase; (5) EPA is converted to DPA by a C20/22 elongase; and (6) DPA is converted to DHA by a delta-4 desaturase.
  • omega-6 fatty acids may be converted to omega-3 fatty acids; for example, ETA and EPA are produced from DGLA and ARA, respectively, by delta-17 desaturase activity.
  • omega-3/omega-6 fatty acids utilize a delta-6 desaturase and C18/ 20 elongase (also known as delta-6 elongase, the terms can be used interchangeably) (i.e., the "delta-6 desaturase/delta-6 elongase pathway"). More specifically, LA and ALA may be converted to GLA and STA, respectively, by a delta-6 desaturase; then, a C1 8 /20 elongase converts GLA to DGLA and/or STA to ETA.
  • delta-6 desaturase and C18/ 20 elongase also known as delta-6 elongase, the terms can be used interchangeably
  • LA and ALA may be converted to GLA and STA, respectively, by a delta-6 desaturase; then, a C1 8 /20 elongase converts GLA to DGLA and/or STA to ETA.
  • omega-3/omega-6 fatty acids will depend on the host cell (and its native PUFA profile and/or desaturase/elongase profile), the availability of substrate, and the desired end product(s).
  • expression of the delta-9 elongase/delta-8 desaturase pathway may be preferred in some embodiments, as opposed to expression of the delta-6 desaturase/delta-6 elongase pathway, since PUFAs produced via the former pathway are devoid of GLA.
  • Useful desaturase and elongase sequences may be derived from any source, e.g., isolated from a natural source (from bacteria, algae, fungi, plants, animals, etc.), produced via a semi-synthetic route or synthesized de novo.
  • the particular source of the desaturase and elongase genes introduced into the host is not critical, considerations for choosing a specific polypeptide having desaturase or elongase activity include: (1 ) the substrate specificity of the polypeptide; (2) whether the polypeptide or a component thereof is a rate-limiting enzyme; (3) whether the desaturase or elongase is essential for synthesis of a desired PUFA; (4) co-factors required by the polypeptide; and/or, (5) whether the polypeptide is modified after its production (e.g., by a kinase or a prenyltransferase).
  • the expressed polypeptide preferably has parameters compatible with the biochemical environment of its location in the host cell (see U.S. Patent 7,238,482 for additional details).
  • each enzyme will also be useful to consider the conversion efficiency of each particular desaturase and/or elongase. More specifically, since each enzyme rarely functions with 100% efficiency to convert substrate to product, the final lipid profile of unpurified oils produced in a host cell will typically be a mixture of various PUFAs consisting of the desired omega-3/omega-6 fatty acid, as well as various upstream intermediary PUFAs. Thus, each enzyme's conversion efficiency is also a variable to consider when optimizing biosynthesis of a desired fatty acid.
  • candidate genes having the appropriate desaturase and elongase activities can be identified according to publicly available literature (e.g., GenBank), the patent literature, and experimental analysis of organisms having the ability to produce PUFAs. These genes will be suitable for introduction into a specific host organism, to enable or enhance the organism's synthesis of PUFAs.
  • the present invention concerns a multizyme comprising a single polypeptide having at least two independent and separable enzymatic activities
  • suitable enzymatic activities include elongases, fatty acid desaturases, transferases, acyl CoA synthases and thioesterases.
  • suitable fatty acid desaturases include, but are not limited to: delta-4 desaturase, delta-5 desaturase, delta-6 desaturase, delta-8 desaturase, delta-9 desaturase, delta-12 desaturase, delta-15 desaturase, and/or delta-17 desaturase.
  • suitable elongases include, but are not limited to: delta-9 elongase, C14/16 elongase, Ci 6 /i8 elongase, C18/20 elongase, and/or C20/22 elongase.
  • Suitable transferases include but are not limited to acyl transferases such as glycerol-3-phosphate O-acyltransferase (also called glycerol - phosphate acyl transferase or glycerol -3-phosphate acyl transferase; GPAT), 2- acylglycerol O-acyltransferase, 1-acylglycerol-3-phosphate O-acyltransferase (also called 1-acylglycerol-phosphate acyltransferase or lyso-phosphatidic acid acyltransferase; AGPAT or LPAAT or LPAT), 2-acylglycerol-3-phosphate O- acyltransferase, 1-acylglycerophosphocholine O-acyltransferase (also called lyso- lecithin acyltransferase or lyso-phosphatidylcholine acyltransferase; A
  • acyl CoA synthetase includes but is not limited to Iong-chain-fatty-acid-CoA ligase (also called acyl-activating enzyme or acyl-CoA synthetase).
  • thioesterase includes but is not limited to oleoyl- [acyl-carrier-protein] hydrolase (also called acyl-[acyl-carrier-protein] hydrolase, acyl-ACP-hydrolase or acyl-ACP-thioesterase).
  • the instant multizyme should have enzymatic activities comprising at least one fatty acid elongase linked to at least one fatty acid desaturase.
  • the link used to form the multizyme is minimally comprised of a single polypeptide bond.
  • the link may be comprised of one amino acid residue, such as proline, or a polypeptide. It may be desirable that if the link is a polypeptide then it has at least one proline amino acid residue.
  • the multizyme of the invention comprises a first enzymatic activity linked to a second enzymatic activity and the link is selected from the group consisting of a polypeptide bond, SEQ ID NO:198 (EgDHAsyni linker), SEQ ID NO:200 (EgDHAsyn2 linker), SEQ ID NO:235 (EaDHAsyni linker), SEQ ID NO:472, SEQ ID NO:504, and modified Yarrowia lipolytics linkers (SEQ ID NOs:438 and 445).
  • a method for making a multizyme which comprises:
  • step (b) evaluating the product of step (a) for the independent and separable enzymatic activities.
  • the enzymatic activities are selected from the group consisting of fatty acid elongases, fatty acid desaturases, acyl transferases, acyl CoA synthases and thioeste rases.
  • the enzymatic activities comprise at least one fatty acid elongase linked to at least one fatty acid desaturase.
  • DHA synthases comprising both C20 elongase activity and delta-4 desaturase activity
  • DGLA synthases comprising both delta-9 elongase and delta-8 desaturase activity
  • the invention relates to any multizyme that is made using a linker derived from the sequences of the invention.
  • Preferred multizymes are those that combine various genes of the PUFA biosynthetic pathway.
  • nucleotide sequences encoding DHA synthases have been isolated from Euglena gracilis and Euglena anabaena, as summarized below in Table 4.
  • the instant EgDHAsyni , EgDHAsyn2, EaDHAsyni , EaDHAsyn2 and EaDHAsyn3 DHA synthase sequences can be codon-optimized for expression in a particular host organism. As is well known in the art, this can be a useful means to further optimize the expression of the enzyme in the alternate host, since use of host-preferred codons can substantially enhance the expression of the foreign gene encoding the polypeptide.
  • EgDHAsyni for example, was codon- optimized for expression in Yarrowia lipolytica (example 54), thereby yielding EgDHAsyniS (as taught in U.S.
  • Patent 7,238,482 and U.S. Patent 7,125,672 One skilled in the art would be able to use the teachings herein to create various other codon-optimized DHA synthase proteins suitable for optimal expression in alternate hosts, based on the wildtype EgDHAsyni , EgDHAsyn2, EaDHAsyni , EaDHAsyn2 and/or EaDHAsyn3 sequences described above in Table 4. Accordingly, the instant invention relates to any codon-optimized DHA synthase protein that is derived from a wildtype sequence of the instant invention.
  • the present invention concerns an isolated polynucleotide encoding a DHA synthase comprising:
  • nucleotide sequence encoding a polypeptide having DHA synthase activity, wherein the polypeptide has at least 80% amino acid identity, based on the Clustal V method of alignment, when compared to an amino acid sequence as set forth in SEQ ID NO:12, SEQ ID NO:22, SEQ ID NO:95, SEQ ID NO:96, or SEQ ID NO:97;
  • nucleotide sequence encoding a polypeptide having DHA synthase activity wherein the nucleotide sequence has at least 80% sequence identity, based on the BLASTN method of alignment, when compared to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:205, SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, or SEQ ID NO:410;
  • nucleotide sequence encoding a polypeptide having DHA synthase activity, wherein the nucleotide sequence hybridizes under stringent conditions to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:205, or SEQ ID NO:410; or
  • this invention concerns an isolated polynucleotide comprising:
  • nucleotide sequence encoding a polypeptide having DHA synthase activity, wherein the polypeptide has at least 80% amino acid identity, based on the Clustal V method of alignment, when compared to an amino acid sequence as set forth in SEQ ID NO:12, SEQ ID NO:22, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, or SEQ ID NO:411 ;
  • nucleotide sequence encoding a polypeptide having DHA synthase activity wherein the nucleotide sequence has at least 80% sequence identity, based on the BLASTN method of alignment, when compared to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:205, SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93 or SEQ ID NO:410;
  • nucleotide sequence encoding a polypeptide having DHA synthase activity, wherein the nucleotide sequence hybridizes under stringent conditions to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:205, SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93 or SEQ ID NO:410; or
  • an isolated polynucleotide encoding a DHA synthase comprises the sequence set forth in any of SEQ ID NO:11 , SEQ ID NO:205, SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, or SEQ ID NO:410. Identification and Isolation of Homologs Any of the instant DHA synthase sequences (i.e., EgDHAsyni , EgDHAsyn2,
  • EaDHAsyni , EaDHAsyn2 and EaDHAsyn3) or portions thereof may be used to search for DHA synthase homologs in the same or other bacterial, algal, fungal, euglenoid or plant species using sequence analysis software.
  • sequence analysis software matches similar sequences by assigning degrees of homology to various substitutions, deletions, and other modifications.
  • any of the instant DHA synthase sequences or portions thereof may also be employed as hybridization reagents for the identification of DHA synthase homologs.
  • the basic components of a nucleic acid hybridization test include a probe, a sample suspected of containing the gene or gene fragment of interest and a specific hybridization method.
  • Probes of the present invention are typically single-stranded nucleic acid sequences that are complementary to the nucleic acid sequences to be detected. Probes are "hybridizable" to the nucleic acid sequence to be detected. Although the probe length can vary from 5 bases to tens of thousands of bases, typically a probe length of about 15 bases to about 30 bases is suitable.
  • probe molecule Only part of the probe molecule needs to be complementary to the nucleic acid sequence to be detected. In addition, the complementarity between the probe and the target sequence need not be perfect. Hybridization does occur between imperfectly complementary molecules with the result that a certain fraction of the bases in the hybridized region are not paired with the proper complementary base.
  • Hybridization methods are well defined. Typically the probe and sample must be mixed under conditions that will permit nucleic acid hybridization. This involves contacting the probe and sample in the presence of an inorganic or organic salt under the proper concentration and temperature conditions. The probe and sample nucleic acids must be in contact for a long enough time that any possible hybridization between the probe and sample nucleic acid may occur. The concentration of probe or target in the mixture will determine the time necessary for hybridization to occur. The higher the probe or target concentration, the shorter the hybridization incubation time needed.
  • a chaotropic agent may be added (e.g., guanidinium chloride, guanidinium thiocyanate, sodium thiocyanate, lithium tetrachloroacetate, sodium perchlorate, rubidium tetrachloroacetate, potassium iodide, cesium trifluoroacetate).
  • a chaotropic agent e.g., guanidinium chloride, guanidinium thiocyanate, sodium thiocyanate, lithium tetrachloroacetate, sodium perchlorate, rubidium tetrachloroacetate, potassium iodide, cesium trifluoroacetate.
  • formamide to the hybridization mixture, typically 30-50% (v/v).
  • hybridization solutions can be employed. Typically, these comprise from about 20 to 60% volume, preferably 30%, of a polar organic solvent.
  • a common hybridization solution employs about 30-50% v/v formamide, about 0.15 to 1 M sodium chloride, about 0.05 to 0.1 M buffers (e.g., sodium citrate, Tris-HCI, PIPES or HEPES (pH range about 6-9)), about 0.05 to 0.2% detergent (e.g., sodium dodecylsulfate), or between 0.5-20 mM EDTA, FICOLL (Pharmacia Inc.) (about 300-500 kdal), polyvinylpyrrolidone (about 250-500 kdal), and serum albumin.
  • buffers e.g., sodium citrate, Tris-HCI, PIPES or HEPES (pH range about 6-9)
  • detergent e.g., sodium dodecylsulfate
  • FICOLL Fracia Inc.
  • unlabeled carrier nucleic acids from about 0.1 to 5 mg/mL, fragmented nucleic DNA (e.g., calf thymus or salmon sperm DNA, or yeast RNA), and optionally from about 0.5 to 2% wt/vol glycine.
  • fragmented nucleic DNA e.g., calf thymus or salmon sperm DNA, or yeast RNA
  • optionally from about 0.5 to 2% wt/vol glycine optionally from about 0.5 to 2% wt/vol glycine.
  • volume exclusion agents that include a variety of polar water-soluble or swellable agents (e.g., polyethylene glycol), anionic polymers (e.g., polyacrylate or polymethylacrylate) and anionic saccharidic polymers (e.g., dextran sulfate).
  • Nucleic acid hybridization is adaptable to a variety of assay formats. One of the most suitable is the sandwich assay format. The sandwich assay is particularly adaptable to hybridization under non-denaturing conditions.
  • a primary component of a sandwich-type assay is a solid support. The solid support has adsorbed to it or covalently coupled to it an immobilized nucleic acid probe that is unlabeled and complementary to one portion of the sequence.
  • any of the DHA synthase nucleic acid fragments described herein may be used to isolate genes encoding homologous proteins from the same or other bacterial, algal, fungal, euglenoid or plant species. Isolation of homologous genes using sequence-dependent protocols is well known in the art. Examples of sequence-dependent protocols include, but are not limited to: (1) methods of nucleic acid hybridization; (2) methods of DNA and RNA amplification, as exemplified by various uses of nucleic acid amplification technologies [e.g., polymerase chain reaction (PCR), Mullis et al., U.S.
  • PCR polymerase chain reaction
  • Patent 4,683,202 ligase chain reaction (LCR), Tabor et al., Proc. Acad. Sci. USA 82:1074 (1985); or strand displacement amplification (SDA), Walker et al., Proc. Natl. Acad. Sci. U.S.A., 89:392 (1992)]; and (3) methods of library construction and screening by complementation.
  • LCR ligase chain reaction
  • SDA strand displacement amplification
  • genes encoding similar proteins or polypeptides to a multizyme or an individual domain thereof could be isolated directly by using all or a portion of the instant nucleic acid fragments as DNA hybridization probes to screen libraries from e.g., any desired yeast or fungus using methodology well known to those skilled in the art (wherein those organisms producing DTA, DPAn-6, DPA and/or DHA would be preferred).
  • Specific oligonucleotide probes based upon the instant nucleic acid sequences can be designed and synthesized by methods known in the art (Maniatis, supra).
  • the entire sequences can be used directly to synthesize DNA probes by methods known to the skilled artisan (e.g., random primers DNA labeling, nick translation or end-labeling techniques), or RNA probes using available in vitro transcription systems.
  • specific primers can be designed and used to amplify a part of (or full-length of) the instant sequences.
  • the resulting amplification products can be labeled directly during amplification reactions or labeled after amplification reactions, and used as probes to isolate full-length DNA fragments under conditions of appropriate stringency.
  • the primers typically have different sequences and are not complementary to each other. Depending on the desired test conditions, the sequences of the primers should be designed to provide for both efficient and faithful replication of the target nucleic acid.
  • Methods of PCR primer design are common and well known in the art (Thein and Wallace, "The use of oligonucleotide as specific hybridization probes in the Diagnosis of Genetic Disorders", in Human Genetic Diseases: A Practical Approach, K. E. Davis Ed., (1986) pp 33-50, IRL: Herndon, VA; and Rychlik, W., In Methods in Molecular Biology. White, B. A. Ed., (1993) Vol. 15, pp 31-39, PCR Protocols: Current Methods and Applications. Humania: Totowa, NJ).
  • PCR may also be performed on a library of cloned nucleic acid fragments wherein the sequence of one primer is derived from the instant nucleic acid fragments, and the sequence of the other primer takes advantage of the presence of the polyadenylic acid tracts to the 3 1 end of the mRNA precursor encoding eukaryotic genes.
  • the second primer sequence may be based upon sequences derived from the cloning vector. For example, the skilled artisan can follow the RACE protocol (Frohman et al., Proc. Natl Acad.
  • any of the enzymes may be modified.
  • in vitro mutagenesis and selection, chemical mutagenesis, "gene shuffling" methods or other means can be employed to obtain mutations of naturally occurring genes.
  • multizymes may be synthesized by domain swapping, wherein a functional domain from any enzyme may be exchanged with or added to a functional domain in an alternate enzyme to thereby result in a novel protein.
  • nucleotide sequences encoding C20 elongases have been isolated from Euglena gracilis and Euglena anabaena, as summarized below in Table 5.
  • the instant invention concerns an isolated polynucleotide encoding a C20 elongase comprising:
  • nucleotide sequence encoding a polypeptide having C20 elongase activity, wherein the polypeptide has at least 80% amino acid identity, based on the Clustal V method of alignment, when compared to an amino acid sequence as set forth in SEQ ID NO:12, SEQ ID NO:22, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97; SEQ ID NO:202, SEQ ID NO:204, SEQ ID NO:231 , SEQ ID NO:232, or SEQ ID NO:233; (b) a nucleotide sequence encoding a polypeptide having C20 elongase activity wherein the nucleotide sequence has at least 80% sequence identity, based on the BLASTN method of alignment, when compared to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, SEQ
  • nucleotide sequence encoding a polypeptide having C20 elongase activity, wherein the nucleotide sequence hybridizes under stringent conditions to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:183, SEQ ID NO:188, SEQ ID NO:201 , SEQ ID NO:206, SEQ ID NO:203, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230; or
  • an isolated polynucleotide encoding a C20 elongase comprises the sequence set forth in any of SEQ ID NO:183, SEQ ID NO:188, SEQ ID NO:201 , SEQ ID NO:206, SEQ ID NO:203, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, or SEQ ID NO:230. Sequence Identification of Novel Delta-4 Desaturases
  • nucleotide sequences encoding delta-4 desaturases have been isolated from Euglena gracilis and Euglena anabaena, as summarized below in Table 6.
  • the delta-4 desaturase domain 1 does not include the proline-rich linker of the DHA synthase from which it was derived.
  • the delta-4 desaturase domain 2 does include the proline-rich linker of the DHA synthase from which it was derived.
  • the instant delta-4 desaturase domain sequences can be codon-optimized for expression in a particular host organism.
  • the Euglena anabaena delta-4 desaturase domain of EaDHAsyn2 was codon- optimized for expression in Yarrowia lipolytica.
  • the Euglena gracilis delta-4 desaturase domain of EgDHAsyni was also codon-optimized for expression in Yarrowia lipolyticaOne skilled in the art would be able to use the teachings herein to create various other codon-optimized delta-4 desaturase proteins suitable for optimal expression in alternate hosts, based on the wildtype delta-4 desaturase domain sequences of EgDHAsyni , EgDHAsyn2, EaDHAsyni , EaDHAsyn2 and/or EaDHAsyn3 as described above in Table 6.
  • the instant invention relates to any codon-optimized delta-4 desaturase protein that is derived from a wildtype sequence of the instant invention.
  • EaD4S-3 (SEQ ID NO:386), EaD4S-2 (SEQ ID NO:384), EaD4S-1 (SEQ ID NO:382), EgD4S-3 (SEQ ID NO:408), EgD4S-2 (SEQ ID NO:406) and EgD4S-1 (SEQ ID NO:404).
  • EaD4S-3 (SEQ ID NO:386), EaD4S-2 (SEQ ID NO:384), EaD4S-1 (SEQ ID NO:382), EgD4S-3 (SEQ ID NO:408), EgD4S-2 (SEQ ID NO:406) and EgD4S-1 (SEQ ID NO:404).
  • the instant invention further concerns an isolated polynucleotide encoding a delta-4 desaturase comprising:
  • SEQ ID NO:243 SEQ ID NO:244, SEQ ID NO:245, SEQ ID NO:192, SEQ ID NO:381 , SEQ ID NO:383, SEQ ID NO:385, SEQ ID NO:387, SEQ ID NO:403, SEQ ID NO:405 or SEQ ID NO:407;
  • nucleotide sequence encoding a polypeptide having delta-4 desaturase activity, wherein the nucleotide sequence hybridizes under stringent conditions to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:214, SEQ ID NO:216, SEQ ID NO:220, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:242, SEQ ID NO:243, SEQ ID NO:244, SEQ ID NO:245, SEQ ID NO:192, SEQ ID NO:381 , SEQ ID NO:383, SEQ ID NO:385, SEQ ID NO:387, SEQ ID NO:403, SEQ ID NO:405 or SEQ ID NO:407; or
  • an isolated polynucleotide encoding a delta-4 desaturase comprises the sequence set forth in any of SEQ ID NO:214, SEQ ID NO:220, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:242, SEQ ID NO:243, SEQ ID NO:244, SEQ ID NO:245, SEQ ID NO:192, SEQ ID NO:381 , SEQ ID NO:383, SEQ ID NO:385, SEQ ID NO:387, SEQ ID NO:403, SEQ ID NO:405, or SEQ ID NO:407.
  • the invention also provides a new method for deriving a delta-4 desaturase having higher activity than the wildtype sequence, the method comprising: a) providing a wild-type delta-4 desaturase polypeptide isolated from Euglena anabena having a base-line delta-4 desaturase activity; and b) truncating the wild-type polypeptide of (a) by about 1 to about 200 amino acids (a) to create a truncated mutant polypeptide having delta-4 desaturase activity that is increased as compared with the baseline delta-4 desaturase activity.
  • “Baseline” activity as used in this context is defined as the activity of the wildtype enzyme measured either in vivo or in vitro according to standard enzymatic protocols as described herein.
  • any of the enzymes e.g., multizymes, DHA synthases, C20 elongases, delta-4 desaturases, and/or any homologs
  • any of the enzymes may be modified to generate new and/or improved PUFA biosynthetic pathway enzymes.
  • in vitro mutagenesis and selection, chemical mutagenesis, "gene shuffling" methods or other means can be employed to obtain mutations of naturally occurring genes.
  • multizymes may be synthesized by domain swapping, wherein a functional domain from any enzyme may be exchanged with or added to a functional domain in an alternate enzyme to thereby result in a novel protein.
  • EaDHAsyn2 and EaDHAsyn3 or other mutant enzymes, codon-optimized enzymes or homologs thereof), under the control of the appropriate promoters will result in increased production of DTA, DPAn-6, DPA and/or DHA in the transformed host organism, respectively.
  • the present invention encompasses a method for the direct production of PUFAs comprising exposing a fatty acid substrate (i.e., EPA or DPA) to the DHA synthase enzymes described herein (e.g., EgDHAsyni , EgDHAsyn2, EaDHAsyni , EaDHAsyn2 and EaDHAsyn3), such that the substrate is converted to the desired fatty acid product (i.e., DHA).
  • a fatty acid substrate i.e., EPA or DPA
  • DHA synthase enzymes described herein e.g., EgDHAsyni , EgDHAsyn2, EaDHAsyni , EaDHAsyn2 and EaDHAsyn3
  • the present invention concerns a method for transforming a host cell such that the host cell comprises in its genome a recombinant construct of the invention.
  • suitable host cells include, but are not limited to, plants and yeast.
  • the plant cells are obtained from an oilseed plant such as soybean and the like and yeast cells are obtained from oleaginous yeast such as Yarrowia sp.
  • Also within the scope of this invention is a method for producing a transformed plant or yeast comprising transforming a plant cell or a yeast cell with any of the polynucleotides of the invention and regenerating a plant from the transformed plant cell or growing the transformed yeast cells. More specifically, it is an object of the present invention to provide a method for the production of DPAn-6 or DHA in a host cell (e.g., plants, oleaginous yeast), wherein the host cell comprises:
  • the present invention concerns a method for the production of DTA or DPA in a host cell (e.g., plants, oleaginous yeast), wherein the host cell comprises:
  • an isolated nucleotide molecule encoding a polypeptide having C20 elongase activity, wherein the polypeptide has at least 80% amino acid identity, based on the Clustal V method of alignment, when compared to an amino acid sequence as set forth in SEQ ID NO: 12, SEQ ID NO:22, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:202, SEQ ID NO:204, SEQ ID NO:231 , SEQ ID NO:232, or SEQ ID NO:233; and, (ii) a source of ARA or EPA; wherein the host cell is grown under conditions such that the polypeptide having C20 elongase activity is expressed and the ARA is converted to DTA and/or the EPA is converted to DPA, and wherein the DTA or DPA is optionally recovered.
  • the invention provides a method for the production of DPAn-6 or DHA, wherein the host cell comprises:
  • a source of DTA or DPA wherein the host cell is grown under conditions such that the polypeptide having delta-4 desaturase activity is expressed and the DTA is converted to DPAn-6 and/or the DPA is converted to DHA, and wherein the DPAn-6 or DHA is optionally recovered.
  • the source of the substrate(s) ARA, DTA, EPA or DPA used in any of the methods above may be produced by the host either naturally or transgenically, or may be provided exogenously.
  • Linking individual domains to form a multizyme could lead to a decrease in intermediate fatty acids. For instance, linking a C20 elongase with a delta-4 desaturase in a multizyme, such as DHA synthase, may lead to a decrease in the intermediate fatty acid DPA during production of DHA. Similarly, linking a delta-9 elongase with a delta-8 desaturase using the EgDHAsyni linker to form a multizyme as described herein may lead to the production of DGLA and ETA with a decrease in EDA and ERA intermediates.
  • each multizyme gene including DHA synthase and their corresponding enzyme products described herein can be used indirectly for the production of various omega-6 and omega-3 PUFAs, including e.g., DTA, DPAn-6, DGLA, ETA, ARA, EPA, DPA and/or DHA (FIG. 1 ; see U.S. Patent 7,238,482).
  • DTA DPAn-6
  • DGLA DGLA
  • ETA ETA
  • ARA EPA
  • DPA and/or DHA DHA synthases described herein (i.e., EgDHAsyni , EgDHAsyn2, EaDHAsyni ,
  • EaDHAsyn2 and EaDHAsyn3, or other mutant enzymes, codon-optimized enzymes or homologs thereof) may be expressed in conjunction with additional genes encoding enzymes of the PUFA biosynthetic pathway (e.g., delta-6 desaturases, C1 8/20 elongases, delta-17 desaturases, delta-8 desaturases, delta-15 desaturases, delta-9 desaturases, delta-12 desaturases, C14/1 6 elongases, Ci 6 /i ⁇ elongases, delta-9 elongases, delta-5 desaturases, delta-4 desaturases, C2 0 /22 elongases, DHA synthases) to result in higher levels of production of longer-chain omega-3/omega-6 fatty acids (e.g., ARA, DTA, DPAn-6, EPA, DPA and/or DHA).
  • omega-6 desaturases e.g., ARA, DTA, DPAn-6, EPA, DPA and/
  • genes included within a particular expression cassette will depend on the host cell (and its PUFA profile and/or desaturase/elongase profile), the availability of substrate and the desired end product(s).
  • by-product fatty acids could be decreased by linking individual pathway enzymes together with a linker to form a multizyme.
  • SCI sciadonic acid
  • JUP juniperonic acid
  • the presence of sciadonic acid (SCI) and/or juniperonic acid (JUP) might be considered by-product fatty acids of a delta-6 desaturase/delta-6 elongase pathway or delta-9-elongase/delta-8 desaturase pathway.
  • delta-6 elongase may elongate fatty acids other than the intended fatty acid.
  • delta-6 elongases generally convert GLA to DGLA but some delta-6 elongases may also convert unintended substrates such as LA or ALA to EDA or ETrA, respectively.
  • EDA and ETrA would be considered "by-product fatty acids”.
  • Addition of a delta-8 desaturase to a delta-6 desaturase/delta-6 elongase pathway may provide a means to convert the "by-product fatty acids" EDA and ETrA back into the "intermediate fatty acids" DGLA and ETA, respectively.
  • this invention concerns a recombinant construct comprising any one of the isolated polynucleotides of the invention operably linked to at least one regulatory sequence suitable for expression in a host cell such as a plant.
  • a promoter is a DNA sequence that directs cellular machinery of a plant to produce RNA from the contiguous coding sequence downstream (3 1 ) of the promoter. The promoter region influences the rate, developmental stage, and cell type in which the RNA transcript of the gene is made. The RNA transcript is processed to produce mRNA which serves as a template for translation of the RNA sequence into the amino acid sequence of the encoded polypeptide.
  • the 5' non- translated leader sequence is a region of the mRNA upstream of the protein coding region that may play a role in initiation and translation of the mRNA.
  • the 3' transcription termination/polyadenylation signal is a non-translated region downstream of the protein coding region that functions in the plant cell to cause termination of the RNA transcript and the addition of polyadenylate nucleotides to the 3 1 end of the RNA.
  • the origin of the promoter chosen to drive expression of the multizyme coding sequence is not important as long as it has sufficient transcriptional activity to accomplish the invention by expressing translatable mRNA for the desired nucleic acid fragments in the desired host tissue at the right time.
  • Either heterologous or non-heterologous (i.e., endogenous) promoters can be used to practice the invention.
  • suitable promoters in plants include, but are not limited to: the alpha prime subunit of beta conglycinin promoter, the Kunitz trypsin inhibitor 3 promoter, the annexin promoter, the glycinin Gy1 promoter, the beta subunit of beta conglycinin promoter, the P34/Gly Bd m 3OK promoter, the albumin promoter, the Leg A1 promoter and the Leg A2 promoter.
  • the annexin, or P34, promoter is described in PCT Publication No. WO 2004/071178 (published August 26, 2004).
  • the level of activity of the annexin promoter is comparable to that of many known strong promoters, such as: (1) the CaMV 35S promoter (Atanassova et al., Plant MoI. Biol. 37:275-285 (1998); Battraw and Hall, Plant MoI. Biol. 15:527-538 (1990); Holtorf et al., Plant MoI. Biol.
  • the annexin promoter is most active in developing seeds at early stages (before 10 days after pollination) and is largely quiescent in later stages.
  • the expression profile of the annexin promoter is different from that of many seed-specific promoters, e.g., seed storage protein promoters, which often provide highest activity in later stages of development (Chen et al., Dev. Genet. 10:112-122 (1989); Ellerstrom et al., Plant MoI. Biol. 32:1019-1027 (1996); Keddie et al., Plant MoI. Biol.
  • the annexin promoter has a more conventional expression profile but remains distinct from other known seed specific promoters. Thus, the annexin promoter will be a very attractive candidate when overexpression, or suppression, of a gene in embryos is desired at an early developing stage. For example, it may be desirable to overexpress a gene regulating early embryo development or a gene involved in the metabolism prior to seed maturation.
  • the promoter is then operably linked in a sense orientation using conventional means well known to those skilled in the art.
  • Standard recombinant DNA and molecular cloning techniques used herein are well known in the art and are described more fully in Sambrook, J. et al., In
  • a fusion gene can be constructed by linking at least two DNA fragments in frame so as not to introduce a stop codon (in-frame fusion). The resulting fusion gene will be such that each DNA fragment encodes for at least one independent and separable enzymatic activity.
  • the recombinant construct may then be introduced into a plant cell of choice by methods well known to those of ordinary skill in the art (e.g., transfection, transformation and electroporation).
  • Oilseed plant cells are the preferred plant cells.
  • the transformed plant cell is then cultured and regenerated under suitable conditions permitting expression of the long-chain PUFA which is then optionally recovered and purified.
  • the recombinant constructs of the invention may be introduced into one plant cell; or, alternatively, each construct may be introduced into separate plant cells.
  • Expression in a plant cell may be accomplished in a transient or stable fashion as is described above.
  • the desired long-chain PUFAs can be expressed in seed. Also within the scope of this invention are seeds or plant parts obtained from such transformed plants.
  • Plant parts include differentiated and undifferentiated tissues including, but not limited to the following: roots, stems, shoots, leaves, pollen, seeds, tumor tissue and various forms of cells and culture (e.g., single cells, protoplasts, embryos and callus tissue).
  • the plant tissue may be in plant or in a plant organ, tissue or cell culture.
  • plant organ refers to plant tissue or a group of tissues that constitute a morphologically and functionally distinct part of a plant.
  • gene refers to the following: (1) the entire complement of genetic material (genes and non-coding sequences) that is present in each cell of an organism, or virus or organelle; and/or (2) a complete set of chromosomes inherited as a (haploid) unit from one parent.
  • this invention also concerns a method for transforming a cell, comprising transforming a cell with the recombinant construct of the invention and selecting those cells transformed with the recombinant constructs described in the claims.
  • Also of interest is a method for producing a transformed plant comprising transforming a plant cell with the polynucleotides of the instant invention and regenerating a plant from the transformed plant cell.
  • Transgenic embryos and seeds are similarly regenerated.
  • the resulting transgenic rooted shoots are thereafter planted in an appropriate plant growth medium such as soil.
  • the regenerated plants are self-pollinated to provide homozygous transgenic plants. Otherwise, pollen obtained from the regenerated plants is crossed to seed-grown plants of agronomically important lines. Conversely, pollen from plants of these important lines is used to pollinate regenerated plants.
  • a transgenic plant of the present invention containing a desired polypeptide is cultivated using methods well known to one skilled in the art.
  • oilseed plants include, but are not limited to: soybean, Brassica species, sunflower, maize, cotton, flax and safflower.
  • Examples of PUFAs having at least twenty carbon atoms and four or more carbon-carbon double bonds include, but are not limited to, omega-3 fatty acids such as EPA, DPA, and DHA. Seeds obtained from such plants are also within the scope of this invention as well as oil obtained from such seeds.
  • the present invention also concerns a method for altering the fatty acid profile of an oilseed plant comprising: a) transforming an oilseed plant cell with the recombinant construct of claim of the invention; and b) regenerating a plant from the transformed oilseed plant cell step (a), wherein the plant has an altered fatty acid profile.
  • Microbial Expression Systems, Cassettes and Vectors The DHA synthase genes and gene products described herein (i.e.,
  • EgDHAsyni , EgDHAsyn2, EaDHAsyni , EaDHAsyn2 and EaDHAsyn3, or other mutant enzymes, codon-optimized enzymes or homologs thereof) may also be produced in heterologous microbial host cells, particularly in the cells of oleaginous yeasts (e.g., Yarrowia lipolytica).
  • Microbial expression systems and expression vectors containing regulatory sequences that direct high level expression of foreign proteins are well known to those skilled in the art. Any of these could be used to construct chimeric genes for production of any of the gene products of the instant sequences. These chimeric genes could then be introduced into appropriate microorganisms via transformation to provide high-level expression of the encoded enzymes.
  • Vectors useful for the transformation of suitable microbial host cells are well known in the art. The specific choice of sequences present in the construct is dependent upon the desired expression products (supra), the nature of the host cell and the proposed means of separating transformed cells versus non-transformed cells.
  • the vector contains at least one expression cassette, a selectable marker and sequences allowing autonomous replication or chromosomal integration.
  • Suitable expression cassettes comprise a region 5' of the gene that controls transcriptional initiation (e.g., a promoter), the gene coding sequence, and a region 3 1 of the DNA fragment that controls transcriptional termination (i.e., a terminator). It is most preferred when both control regions are derived from genes from the transformed microbial host cell, although it is to be understood that such control regions need not be derived from the genes native to the specific species chosen as a production host.
  • Initiation control regions or promoters which are useful to drive expression of the instant multizymes, such as DHA synthase or individual domain ORFs, in the desired microbial host cell are numerous and familiar to those skilled in the art. Virtually any promoter capable of directing expression of these genes in the selected host cell is suitable for the present invention. Expression in a microbial host cell can be accomplished in a transient or stable fashion. Transient expression can be accomplished by inducing the activity of a regulatable promoter operably linked to the gene of interest. Stable expression can be achieved by the use of a constitutive promoter operably linked to the gene of interest.
  • transcriptional and translational regions functional in yeast cells are provided, particularly from the host species (e.g., see U.S. Patent 7,238,482 and PCT Publication No. WO 2006/052870 for preferred transcriptional initiation regulatory regions for use in Yarrowia lipolytica). Any one of a number of regulatory sequences can be used, depending upon whether constitutive or induced transcription is desired, the efficiency of the promoter in expressing the ORF of interest, the ease of construction and the like.
  • nucleotide sequences surrounding the translational initiation codon 'ATG' have been found to affect expression in yeast cells. If the desired polypeptide is poorly expressed in yeast, the nucleotide sequences of exogenous genes can be modified to include an efficient yeast translation initiation sequence to obtain optimal gene expression. For expression in yeast, this can be done by site-directed mutagenesis of an inefficiently expressed gene by fusing it in-frame to an endogenous yeast gene, preferably a highly expressed gene. Alternatively, one can determine the consensus translation initiation sequence in the host and engineer this sequence into heterologous genes for their optimal expression in the host of interest.
  • the termination region can be derived from the 3' region of the gene from which the initiation region was obtained or from a different gene.
  • a large number of termination regions are known and function satisfactorily in a variety of hosts (when utilized both in the same and different genera and species from where they were derived).
  • the termination region usually is selected more as a matter of convenience rather than because of any particular property.
  • Termination control regions may also be derived from various genes native to the preferred hosts.
  • the 3'-region can also be synthetic, as one of skill in the art can utilize available information to design and synthesize a 3'-region sequence that functions as a transcription terminator.
  • a termination site may be unnecessary; however, it is most preferred if included.
  • some of the molecular features that have been manipulated to control gene expression include: the nature of the relevant transcriptional promoter and terminator sequences; the number of copies of the cloned gene; whether the gene is plasmid-borne or integrated into the genome of the host cell; the final cellular location of the synthesized foreign protein; the efficiency of translation and correct folding of the protein in the host organism; the intrinsic stability of the mRNA and protein of the cloned gene within the host cell; and the codon usage within the cloned gene, such that its frequency approaches the frequency of preferred codon usage of the host cell.
  • Each type of modification is encompassed in the present invention, as means to further optimize expression of the DHA synthases described herein. Transformation Of Microbial Host Cells
  • a cassette that is suitable for expression in an appropriate host cell e.g., a chimeric gene comprising a promoter, ORF and terminator
  • a cassette that is suitable for expression in an appropriate host cell is placed in a plasmid vector capable of autonomous replication in a host cell, or is directly integrated into the genome of the host cell. Integration of expression cassettes can occur randomly within the host genome or can be targeted through the use of constructs containing regions of homology with the host genome sufficient to target recombination within the host locus. Where constructs are targeted to an endogenous locus, all or some of the transcriptional and translational regulatory regions can be provided by the endogenous locus.
  • each vector has a different means of selection and should lack homology to the other construct(s) to maintain stable expression and prevent reassortment of elements among constructs. Judicious choice of regulatory regions, selection means and method of propagation of the introduced construct(s) can be experimentally determined so that all introduced genes are expressed at the necessary levels to provide for synthesis of the desired products. Constructs comprising the gene(s) of interest may be introduced into a microbial host cell by any standard technique.
  • transformation e.g., lithium acetate transformation [Methods in Enzymology, 194:186-187 (1991)]
  • protoplast fusion e.g., biolistic impact, electroporation, microinjection, or any other method that introduces the gene(s) of interest into the host cell.
  • a host cell that has been manipulated by any method to take up a DNA sequence (e.g., an expression cassette) will be referred to as "transformed” or “recombinant” herein.
  • the transformed host will have at least one copy of the expression construct and may have two or more, depending upon whether the expression cassette is integrated into the genome or is present on an extrachromosomal element having multiple copy numbers.
  • the transformed host cell can be identified by various selection techniques, as described in U.S. Patents 7,238,482 and 7,259,255 and PCT Publication No. WO 2006/052870. Following transformation, substrates suitable for the instant DHA synthases
  • PUFA enzymes that are co-expressed within the host cell may be produced by the host either naturally or transgenically, or may be provided exogenously.
  • Preferred Microbial Hosts for Recombinant Expression may be produced by the host either naturally or transgenically, or may be provided exogenously.
  • Microbial host cells for expression of the instant genes and nucleic acid fragments may include hosts that grow on a variety of feedstocks, including simple or complex carbohydrates, fatty acids, organic acids, oils and alcohols, and/or hydrocarbons over a wide range of temperature and pH values. Based on the needs of the Applicants' Assignee, the genes described in the instant invention will be expressed in an oleaginous yeast (and in particular Yarrowia lipolytica); however, it is contemplated that because transcription, translation and the protein biosynthetic apparatus are highly conserved, any bacteria, yeast, algae, euglenoid and/or fungus will be a suitable microbial host for expression of the present nucleic acid fragments.
  • Preferred microbial hosts are oleaginous organisms, such as oleaginous yeasts. These organisms are naturally capable of oil synthesis and accumulation, wherein the oil can comprise greater than about 25% of the cellular dry weight, more preferably greater than about 30% of the cellular dry weight, and most preferably greater than about 40% of the cellular dry weight.
  • Genera typically identified as oleaginous yeast include, but are not limited to: Yarrowia, Candida, Rhodotorula, Rhodosporidium, Cryptococcus, Trichosporon and Lipomyces. More specifically, illustrative oil-synthesizing yeasts include: Rhodosporidium toruloides, Lipomyces starkeyii, L.
  • Candida revkaufi C. pulcherrima, C. tropicalis, C. utilis, Trichosporon pullans, T. cutaneum, Rhodotorula glutinus, R. graminis, and Yarrowia lipolytica (formerly classified as Candida lipolytica).
  • oleaginous yeast Yarrowia lipolytica Most preferred is the oleaginous yeast Yarrowia lipolytica; and, in a further embodiment, most preferred are the Y. lipolytica strains designated as ATCC #20362, ATCC #8862, ATCC #18944, ATCC #76982 and/or LGAM S(7)1 (Papanikolaou S., and Aggelis G., Bioresour. Technol. 82(1):43-9 (2002)).
  • Y. lipolytica Historically, various strains of Y. lipolytica have been used for the manufacture and production of: isocitrate lyase; lipases; polyhydroxyalkanoates; citric acid; erythritol; 2-oxoglutaric acid; gamma-decalactone; gamma-dodecalatone; and pyruvic acid.
  • Specific teachings applicable for transformation of oleaginous yeasts include U.S. Patent 4,880,741 and U.S. Patent 5,071 ,764 and Chen, D. C. et al. (Appl. Microbiol. Biotechnol., 48(2):232-235 (1997)).
  • Yarrowia lipolytics Detailed means for the synthesis and transformation of expression vectors comprising C20 elongases and delta-4 desaturases in oleaginous yeast (i.e., Yarrowia lipolytics) are provided in PCT Publication No. WO 2006/052871.
  • the preferred method of expressing genes in Yarrowia lipolytica is by integration of linear DNA into the genome of the host. Integration into multiple locations within the genome can be particularly useful when high level expression of genes are desired [e.g., in the Ura3 locus (GenBank Accession No. AJ306421), the Leu2 gene locus (GenBank Accession No. AF260230), the Lys5 gene locus (GenBank Accession No.
  • the Aco2 gene locus (GenBank Accession No. AJ001300), the Pox3 gene locus (Pox3: GenBank Accession No. XP_503244; or, Aco3: GenBank Accession No. AJ001301), the delta-12 desaturase gene locus (U.S. Patent 7,214,491), the Lip1 gene locus (GenBank Accession No. Z50020), the Lip2 gene locus (GenBank Accession No. AJ012632), and/or the Pex10 gene locus (GenBank Accession No. CAG81606)].
  • Termination regions useful in the disclosure herein for Yarrowia expression vectors include, for example: ⁇ 100 bp of the 3' region of the Yarrowia lipolytica extracellular protease (XPR; GenBank Accession No. M17741); the acyl-CoA oxidase (Aco3: GenBank Accession No. AJ001301 and No. CAA04661 ; Pox3: GenBank Accession No. XP_503244) terminators; the Pex20 (GenBank Accession No. AF054613) terminator; the Pex16 (GenBank Accession No. U75433) terminator; the Lip1 (GenBank Accession No. Z50020) terminator; the Lip2 (GenBank Accession No. AJ012632) terminator; and the 3-oxoacyl-CoA thiolase (OCT; GenBank Accession No. X69988) terminator.
  • XPR GenBank Accession No. M17741
  • Preferred selection methods for use in Yarrowia lipolytica are resistance to kanamycin, hygromycin, and the amino glycoside G418, as well as the ability to grow on media lacking uracil, leucine, lysine, tryptophan or histidine.
  • 5-fluoroorotic acid (5-fluorouracil-6-carboxylic acid monohydrate; "5- FOA”) is used for selection of yeast Ura ⁇ mutants.
  • the compound is toxic to yeast cells that possess a functioning URA3 gene encoding orotidine 5'-monophosphate decarboxylase (OMP decarboxylase); thus, based on this toxicity, 5-FOA is especially useful for the selection and identification of Ura ' mutant yeast strains (Bartel, P. L. and Fields, S., Yeast 2-Hybrid System, Oxford University: New York, v. 7, pp 109-147, 1997; see also PCT Publication No. WO 2006/052870 for 5-FOA use in Yarrowia). More specifically, one can first knockout the native Ura3 gene to produce a strain having a Ura- phenotype, wherein selection occurs based on 5- FOA resistance.
  • OMP decarboxylase orotidine 5'-monophosphate decarboxylase
  • a cluster of multiple chimeric genes and a new Ura3 gene can be integrated into a different locus of the Yarrowia genome to produce a new strain having a Ura+ phenotype.
  • Subsequent integration produces a new Ura3- strain (again identified using 5-FOA selection), when the introduced Ura3 gene is knocked out.
  • the Ura3 gene in combination with 5-FOA selection
  • microbial hosts include oleaginous bacteria, algae, euglenoids, and other fungi; and, within this broad group of microbial hosts, of particular interest are microorganisms that synthesize omega-3/omega-6 fatty acids (or those that can be genetically engineered for this purpose [e.g., other yeast such as Saccharomyces cerevisiae]).
  • transformation of Mortierella alpina which is commercially used for production of ARA
  • any of the instant DHA synthase genes under the control of inducible or regulated promoters could yield a transformant organism capable of synthesizing increased quantities of PUFAs.
  • the method of transformation of M. alpina is described by Mackenzie et al. (Appl. Environ. Microbiol., 66:4655 (2000)).
  • methods for transformation of Thraustochytriales microorganisms are disclosed in U.S. 7,001 ,772. Substrate feeding may be required.
  • multiple transformants must be screened in order to obtain a strain displaying the desired expression level and pattern.
  • screening may be accomplished by Southern analysis of DNA blots (Southern, J. MoI. Biol., 98:503 (1975)), Northern analysis of mRNA expression (Kroczek, J. Chromatogr. Biomed. Appl., 618(1-2):133-145 (1993)), Western and/or Elisa analyses of protein expression, phenotypic analysis, or GC analysis of the PUFA products.
  • the oleaginous yeast will be genetically engineered to express multiple enzymes necessary for long-chain PUFA biosynthesis (thereby enabling production of e.g., ARA, EPA, DPA and DHA), in addition to the multizymes described herein.
  • the at least one additional recombinant DNA construct encode a DGLA synthase, such that the multizyme has both delta-9 elongase activity and delta-8 desaturase activity.
  • the delta-9 elongase can be isolated or derived from lsochrysis galbana (GenBank Accession No. AF390174; lgD9e or lgD9eS) or the delta-9 elongase can be isolated or derived from Euglena gracilis or Euglena anabaena.
  • biochemical pathways competing with the omega-3 and/or omega-6 fatty acid biosynthetic pathways for energy or carbon, or native PUFA biosynthetic pathway enzymes that interfere with production of a particular PUFA end-product may be eliminated by gene disruption or down-regulated by other means (e.g., antisense mRNA).
  • the present invention provides methods whereby genes encoding key enzymes in the PUFA biosynthetic pathway are introduced into oleaginous yeasts for the production of omega-3 and/or omega-6 fatty acids. It will be particularly useful to express the instant DHA synthase genes in oleaginous yeasts that do not naturally possess omega-3 and/or omega-6 fatty acid biosynthetic pathways and coordinate the expression of these genes, to maximize production of preferred PUFA products using various means for metabolic engineering of the host organism.
  • the transformed host cell is grown under conditions that optimize expression of chimeric genes and produce the greatest and most economical yield of desired PUFAs.
  • media conditions that may be optimized include the type and amount of carbon source, the type and amount of nitrogen source, the carbon-to- nitrogen ratio, the amount of different mineral ions, the oxygen level, growth temperature, pH, length of the biomass production phase, length of the oil accumulation phase and the time and method of cell harvest.
  • Yarrowia lipolytics are generally grown in complex media (e.g., yeast extract-peptone-dextrose broth (YPD)) or a defined minimal media that lacks a component necessary for growth and thereby forces selection of the desired expression cassettes (e.g., Yeast Nitrogen Base (DIFCO Laboratories, Detroit, Ml)).
  • complex media e.g., yeast extract-peptone-dextrose broth (YPD)
  • a defined minimal media that lacks a component necessary for growth and thereby forces selection of the desired expression cassettes (e.g., Yeast Nitrogen Base (DIFCO Laboratories, Detroit, Ml)).
  • Fermentation media in the present invention must contain a suitable carbon source. Suitable carbon sources are taught in U.S. Patent 7,238,482. Although it is contemplated that the source of carbon utilized in the present invention may encompass a wide variety of carbon-containing sources, preferred carbon sources are sugars, glycerol, and/or fatty acids. Most preferred is glucose and/or fatty acids containing between 10-22 carbons. Nitrogen may be supplied from an inorganic (e.g., (NH4)2SO4) or organic
  • the fermentation media must also contain suitable minerals, salts, cofactors, buffers, vitamins and other components known to those skilled in the art suitable for the growth of the oleaginous host and promotion of the enzymatic pathways necessary for PUFA production.
  • metal ions e.g., Fe +2 , Cu +2 , Mn +2 , Co +2 , Zn +2 , Mg +2
  • metal ions e.g., Fe +2 , Cu +2 , Mn +2 , Co +2 , Zn +2 , Mg +2
  • Preferred growth media in the present invention are common commercially prepared media, such as Yeast Nitrogen Base (DIFCO Laboratories, Detroit, Ml). Other defined or synthetic growth media may also be used and the appropriate medium for growth of the transformant host cells will be known by one skilled in the art of microbiology or fermentation science.
  • a suitable pH range for the fermentation is typically between about pH 4.0 to pH 8.0, wherein pH 5.5 to pH 7.5 is preferred as the range for the initial growth conditions.
  • the fermentation may be conducted under aerobic or anaerobic conditions, wherein microaerobic conditions are preferred.
  • PUFAs may be found in the host microorganisms and plants as free fatty acids or in esterified forms such as acylglycerols, phospholipids, sulfolipids, or glycolipids, and may be extracted from the host cells through a variety of means well-known in the art.
  • One review of extraction techniques, quality analysis, and acceptability standards for yeast lipids is that of Z. Jacobs (Critical Reviews in Biotechnology, 12(5/6) :463-491 (1992)).
  • a brief review of downstream processing is also available by A. Singh and O. Ward (Adv. Appl. Microbiol., 45:271-312 (1997)).
  • means for the purification of PUFAs may include extraction (e.g., U.S.
  • organic solvents e.g., using carbon dioxide
  • saponification e.g., using carbon dioxide
  • physical means such as presses, or combinations thereof.
  • U.S. Patent 7,238,482 U.S. Patent 7,238,482 for additional details.
  • Methods of isolating seed oils are well known in the art: (Young et al., Processing of Fats and Oils, In The Lipid Handbook, Gunstone et al., eds., Chapter 5 pp 253-257; Chapman & Hall: London (1994)).
  • soybean oil is produced using a series of steps involving the extraction and purification of an edible oil product from the oil-bearing seed. Soybean oils and soybean byproducts are produced using the generalized steps shown in Table 7.
  • soybean seeds are cleaned, tempered, dehulled, and flaked, thereby increasing the efficiency of oil extraction.
  • Oil extraction is usually accomplished by solvent (e.g., hexane) extraction but can also be achieved by a combination of physical pressure and/or solvent extraction.
  • the resulting oil is called crude oil.
  • the crude oil may be degummed by hydrating phospholipids and other polar and neutral lipid complexes that facilitate their separation from the nonhydrating, triglyceride fraction (soybean oil).
  • the resulting lecithin gums may be further processed to make commercially important lecithin products used in a variety of food and industrial products as emulsification and release (i.e., antisticking) agents.
  • Degummed oil may be further refined for the removal of impurities (primarily free fatty acids, pigments and residual gums). Refining is accomplished by the addition of a caustic agent that reacts with free fatty acid to form soap and hydrates phosphatides and proteins in the crude oil. Water is used to wash out traces of soap formed during refining. The soapstock byproduct may be used directly in animal feeds or acidulated to recover the free fatty acids. Color is removed through adsorption with a bleaching earth that removes most of the chlorophyll and carotenoid compounds. The refined oil can be hydrogenated, thereby resulting in fats with various melting properties and textures.
  • impurities primarily free fatty acids, pigments and residual gums.
  • Winterization may be used to remove stearine from the hydrogenated oil through crystallization under carefully controlled cooling conditions.
  • Deodorization (principally via steam distillation under vacuum) is the last step and is designed to remove compounds which impart odor or flavor to the oil.
  • Other valuable byproducts such as tocopherols and sterols may be removed during the deodorization process.
  • Deodorized distillate containing these byproducts may be sold for production of natural vitamin E and other high-value pharmaceutical products.
  • Refined, bleached, (hydrogenated, fractionated) and deodorized oils and fats may be packaged and sold directly or further processed into more specialized products.
  • soybean seed processing soybean oil production, and byproduct utilization can be found in Erickson, Practical Handbook of Soybean Processing and Utilization, The American Oil Chemists' Society and United Soybean Board (1995).
  • Soybean oil is liquid at room temperature because it is relatively low in saturated fatty acids when compared with oils such as coconut, palm, palm kernel, and cocoa butter.
  • Plant and microbial oils containing PUFAs that have been refined and/or purified can be hydrogenated, thereby resulting in fats with various melting properties and textures.
  • Many processed fats including spreads, confectionary fats, hard butters, margarines, baking shortenings, etc.
  • Hydrogenation is a chemical reaction in which hydrogen is added to the unsaturated fatty acid double bonds with the aid of a catalyst such as nickel.
  • a catalyst such as nickel.
  • high oleic soybean oil contains unsaturated oleic, linoleic, and linolenic fatty acids, and each of these can be hydrogenated. Hydrogenation has two primary effects. First, the oxidative stability of the oil is increased as a result of the reduction of the unsaturated fatty acid content. Second, the physical properties of the oil are changed because the fatty acid modifications increase the melting point resulting in a semi-liquid or solid fat at room temperature.
  • Hydrogenated oils have become somewhat controversial due to the presence of frans-fatty acid isomers that result from the hydrogenation process. Ingestion of large amounts of frans-isomers has been linked with detrimental health effects including increased ratios of low density to high density lipoproteins in the blood plasma and increased risk of coronary heart disease.
  • omega-3 and/or omega-6 fatty acids particularly e.g., ALA, GLA, ARA, EPA, DPA and DHA.
  • omega-3 and/or omega-6 fatty acids particularly e.g., ALA, GLA, ARA, EPA, DPA and DHA.
  • the PUFA-comprising plant/seed oils, altered seeds, and microbial biomass and/or oils of the invention will function in food and feed products to impart the health benefits of current formulations.
  • the oils of the invention are believed to function similarly to other oils in food applications from a physical standpoint (for example, partially hydrogenated oils such as soybean oil are widely used as ingredients for soft spreads, margarine and shortenings for baking and frying).
  • Plant/seed oils, altered seeds, and microbial biomass and/or oils containing omega-3 and/or omega-6 fatty acids will be suitable for use in a variety of food and feed products including, but not limited to: food analogs, meat products, cereal products, baked foods, snack foods and dairy products.
  • the present plant/seed oils, altered seeds, and microbial biomass and/or oils may be used in formulations to impart health benefit in medical foods including medical nutritionals, dietary supplements, infant formula as well as pharmaceutical products.
  • medical nutritionals including medical nutritionals, dietary supplements, infant formula as well as pharmaceutical products.
  • One of skill in the art of food processing and food formulation will understand how the amount and composition of the plant and microbial oils may be added to the food or feed product. Such an amount will be referred to herein as an "effective" amount and will depend on the food or feed product, the diet that the product is intended to supplement or the medical condition that the medical food or medical nutritional is intended to correct or treat.
  • Food analogs can be made using processes well known to those skilled in the art. There can be mentioned meat analogs, cheese analogs, milk analogs and the like. Meat analogs made from soybeans contain soy protein or tofu and other ingredients mixed together to simulate various kinds of meats. These meat alternatives are sold as frozen, canned or dried foods. Usually, they can be used the same way as the foods they replace. Meat alternatives made from soybeans are excellent sources of protein, iron and B vitamins. Examples of meat analogs include, but are not limited to: ham analogs, sausage analogs, bacon analogs, and the like.
  • Food analogs can be classified as imitation or substitutes depending on their functional and compositional characteristics. For example, an imitation cheese need only resemble the cheese it is designed to replace. However, a product can generally be called a substitute cheese only if it is nutritionally equivalent to the cheese it is replacing and meets the minimum compositional requirements for that cheese. Thus, substitute cheese will often have higher protein levels than imitation cheeses and be fortified with vitamins and minerals.
  • Milk analogs or nondairy food products include, but are not limited to, imitation milks and nondairy frozen desserts (e.g., those made from soybeans and/or soy protein products).
  • Meat products encompass a broad variety of products.
  • “meat” includes "red meats” produced from cattle, hogs and sheep.
  • poultry items which include chickens, turkeys, geese, guineas, ducks and the fish and shellfish.
  • seasoned and processed meat products fresh, cured and fried, and cured and cooked. Sausages and hot dogs are examples of processed meat products.
  • the term "meat products” as used herein includes, but is not limited to, processed meat products.
  • a cereal food product is a food product derived from the processing of a cereal grain.
  • a cereal grain includes any plant from the grass family that yields an edible grain (seed). The most popular grains are barley, corn, millet, oats, quinoa, rice, rye, sorghum, triticale, wheat and wild rice. Examples of a cereal food product include, but are not limited to: whole grain, crushed grain, grits, flour, bran, germ, breakfast cereals, extruded foods, pastas, and the like.
  • a baked goods product comprises any of the cereal food products mentioned above and has been baked or processed in a manner comparable to baking (i.e., to dry or harden by subjecting to heat).
  • Examples of a baked good product include, but are not limited to: bread, cakes, doughnuts, bars, pastas, bread crumbs, baked snacks, mini-biscuits, mini-crackers, mini-cookies, and mini-pretzels.
  • oils of the invention can be used as an ingredient.
  • a snack food product comprises any of the above or below described food products.
  • a fried food product comprises any of the above or below described food products that has been fried.
  • a health food product is any food product that imparts a health benefit. Many oilseed-derived food products may be considered as health foods.
  • a beverage can be in a liquid or in a dry powdered form.
  • non-carbonated drinks such as fruit juices, fresh, frozen, canned or concentrate; flavored or plain milk drinks, etc.
  • non-carbonated drinks such as fruit juices, fresh, frozen, canned or concentrate
  • flavored or plain milk drinks etc.
  • infant and infant nutritional formulas are well known in the art and commercially available
  • infant formulas are liquids or reconstituted powders fed to infants and young children.
  • "Infant formula” is defined herein as an enteral nutritional product which can be substituted for human breast milk in feeding infants and typically is composed of a desired percentage of fat mixed with desired percentages of carbohydrates and proteins in an aqueous solution (e.g., see U.S. Patent No.
  • a dairy product is a product derived from milk.
  • a milk analog or nondairy product is derived from a source other than milk, for example, soymilk as was discussed above. These products include, but are not limited to: whole milk, skim milk, fermented milk products such as yogurt or sour milk, cream, butter, condensed milk, dehydrated milk, coffee whitener, coffee creamer, ice cream, cheese, etc.
  • Additional food products into which the PUFA-containing oils of the invention could be included are, for example, chewing gums, confections and frostings, gelatins and puddings, hard and soft candies, jams and jellies, white granulated sugar, sugar substitutes, sweet sauces, toppings and syrups, and dry-blended powder mixes.
  • a health food product is any food product that imparts a health benefit and includes functional foods, medical foods, medical nutritionals and dietary supplements. Additionally, the plant/seed oils, altered seeds and microbial oils of the invention may be used in standard pharmaceutical compositions (e.g., the long- chain PUFA containing oils could readily be incorporated into the any of the above mentioned food products, to thereby produce a functional or medical food). More concentrated formulations comprising PUFAs include capsules, powders, tablets, softgels, gelcaps, liquid concentrates and emulsions which can be used as a dietary supplement in humans or animals other than humans.
  • Animal feeds are generically defined herein as products intended for use as feed or for mixing in feed for animals other than humans.
  • the plant/seed oils, altered seeds and microbial oils of the invention can be used as an ingredient in various animal feeds.
  • Pet food products are those products intended to be fed to a pet (e.g., dog, cat, bird, reptile, and rodent). These products can include the cereal and health food products above, as well as meat and meat byproducts, soy protein products, grass and hay products (e.g., alfalfa, timothy, oat or brome grass, vegetables). Ruminant and poultry food products are those wherein the product is intended to be fed to an animal (e.g., turkeys, chickens, cattle, and swine).
  • an animal e.g., turkeys, chickens, cattle, and swine
  • these products can include cereal and health food products, soy protein products, meat and meat byproducts, and grass and hay products as listed above.
  • Aquacultural food products are those products intended to be used in aquafarming, i.e., which concerns the propagation, cultivation, or farming of aquatic organisms and/or animals in fresh or marine waters.
  • Yarrowia lipolytica strains with ATCC Accession Nos. #20362, #76982 and #90812 were purchased from the American Type Culture Collection (Rockville, MD). Yarrowia lipolytica strains were typically grown at 28-30 0 C in several media, according to the recipes shown below. Agar plates were prepared as required by addition of 20 g/L agar to each liquid media, according to standard methodology.
  • YPD agar medium 10 g of yeast extract [Difco], 20 g of Bacto peptone [Difco]; and 20 g of glucose.
  • MM Basic Minimal Media
  • MMU MMU
  • MMU+SU Minimal Media + Uracil + Sulfonylurea (per liter): Prepare MMU media as above and add 280 mg sulfonylurea.
  • Minimal Media + Leucine (MM+leucine or MMLeu) (per liter): Prepare MM media as above and add 0.1 g leucine.
  • Minimal Media + Leucine + Uracil (MMLeuUra) (per liter): Prepare MM media as above and add 0.1 g leucine, 0.1 g uracil and 0.1 g uridine.
  • Minimal Media + Leucine + Lysine (MMLeuLvs) (per liter): Prepare MM media as above and add 0.1 g lysine, 0.1 g leucine.
  • MM + 5-FOA Minimal Media + 5-Fluoroorotic Acid (MM + 5-FOA) (per liter): 2O g glucose, 6.7 g Yeast Nitrogen base, 75 mg uracil, 75 mg uridine and appropriate amount of FOA (Zymo Research Corp., Orange, CA), based on FOA activity testing against a range of concentrations from 100 mg/L to
  • High Glucose Media (per liter): 80 glucose, 2.58 g KH 2 PO 4 and 5.36 g K 2 HPO 4 , pH 7.5 (do not need to adjust).
  • Transformation of Yarrowia lipolytics was performed according to the method of Chen, D. C. et al. (Appl. Microbiol. Biotechnol. 48(2):232-235 (1997)), unless otherwise noted. Briefly, Yarrowia was streaked onto a YPD plate and grown at 30 0 C for approximately 18 h. Several large loopfuls of cells were scraped from the plate and resuspended in 1 mL of transformation buffer, comprising: 2.25 mL of 50% PEG, average MW 3350; 0.125 mL of 2 M lithium acetate, pH 6.0; 0.125 mL of 2 M DTT; and (optionally) 50 ⁇ g sheared salmon sperm DNA.
  • lipids were extracted as described in Bligh, E. G. & Dyer, W. J. (Can. J. Biochem. Physiol.
  • Fatty acid methyl esters were prepared by transesterification of the lipid extract with sodium methoxide (Roughan, G. and Nishida I., Arch Biochem Biophys. 276(1 ):38-46 (1990)) and subsequently analyzed with a Hewlett-Packard 6890 GC fitted with a 30 m X 0.25 mm (i.d.) HP-INNOWAX (Hewlett-Packard) column. The oven temperature was from 170 0 C (25 min hold) to 185 0 C at 3.5 °C/min.
  • Yarrowia culture (3 ml_) was harvested, washed once in distilled water, and dried under vacuum in a Speed-Vac for 5-10 min.
  • Sodium methoxide (100 ⁇ l_ of 1 %) was added to the sample, which was then vortexed and rocked for 20 min. After adding 3 drops of 1 M NaCI and 400 ⁇ l_ hexane, the sample was vortexed and spun. The upper layer was removed and analyzed by GC as described above. Construction Of Yarrowia lipolvtica Strain Y4305U3:
  • Y. lipolytica strain Y4305U3 was used as the host in Examples 52, 53 and 54, infra. The following description is a summary of the construction of strain Y4305U3, derived from Yarrowia lipolytica ATCC #20362. Strain Y4305U3 is capable of producing about 53.2% EPA relative to the total lipids via expression of a delta-9 elongase/ delta-8 desaturase pathway (FIG. 44).
  • strain Y4305U3 required the construction of strain Y2224 (a FOA resistant mutant from an autonomous mutation of the Ura3 gene of wildtype Yarrowia strain ATCC #20362), strain Y4001 (producing 17% EDA with a Leu- phenotype), strain Y4001 U1 (producing 17% EDA with a Leu- and Ura- phenotype), strain Y4036 (producing 18% DGLA with a Leu- phenotype), strain Y4036U (producing 18% DGLA with a Leu- and Ura- phenotype), strain Y4070 (producing 12% ARA with a Ura- phenotype), strain Y4086 (producing 14% EPA), strain
  • Strain Y2224 was isolated in the following manner: Yarrowia lipolytica ATCC #20362 cells from a YPD agar plate were streaked onto a MM plate (75 mg/L each of uracil and uridine, 6.7 g/L YNB with ammonia sulfate, without amino acids, and 20 g/L glucose) containing 250 mg/L 5- FOA (Zymo Research). Plates were incubated at 28 0 C, and four of the resulting colonies were patched separately onto MM plates containing 200 mg/mL 5-FOA and MM plates lacking uracil and uridine. This was done to confirm uracil Ura3 auxotrophy.
  • Strain Y4001 was created via integration of construct pZKLeuN-29E3 (FIG. 45A). This construct, comprising four chimeric genes (i.e., a delta-12 desaturase, a C-i ⁇ m elongase, and two delta-9 elongases), was integrated into the Leu2 loci of strain Y2224 to thereby enable production of EDA.
  • construct pZKLeuN-29E3 FIG. 45A
  • Plasmid pZKLeuN-29E3 was digested with Asc ⁇ /Sph ⁇ and then used for transformation of Y. lipolytica strain Y2224 (i.e., ATCC #20362 Ura3-) according to the General Methods.
  • the transformed cells were plated onto MMLeu media plates, and plates were maintained at 30 ° C for 2 to 3 days. The colonies were picked and streaked onto MM and MMLeu selection plates. The colonies that could grow on MMLeu plates but not on MM plates were selected as Leu- strains. Single colonies of Leu- strains were used to inoculate liquid MMLeu, and the liquid cultures were shaken at 250 rpm/min for 2 days at 30 C.
  • the cells were collected by centrifugation, and lipids were extracted. Fatty acid methyl esters were prepared by trans-esterification and subsequently analyzed with a Hewlett-Packard 6890 GC. GC analyses showed the presence of EDA in the transformants containing the 4 chimeric genes of pZKLeuN-29E3, but not in the Yarrowia Y2224 control strain. Most of the selected 36 Leu- strains produced about 12 to 16.9% EDA of total lipids. Three strains, designated as strains Y4001 , Y4002, and Y4003, produced about 17.4%, 17%, and 17.5% EDA of total lipids, respectively.
  • Strain Y4001U (Leu-, Ura-) was created via temporary expression of the Cre recombinase enzyme in plasmid pY116 (FIG. 45B) within strain Y4001 to produce a Leu- and Ura- phenotype.
  • Construct pY116 contained the following components:
  • Plasmid pY116 was used for transformation of freshly grown Y4001 cells according to the General Methods. The transformed cells were plated onto MMLeuUra plates containing 280 ⁇ g/mL sulfonylurea (chlorimuron ethyl, E. I. duPont de Nemours & Co., Inc., Wilmington, DE), and plates were maintained at 30
  • Construct pKO2UF8289 (FIG. 46A; SEQ ID NO:324) was generated to integrate four chimeric genes (comprising a delta-12 desaturase, one delta-9 elongase, and two mutant delta-8 desaturases) into the delta-12 loci of strain Y4001 U1 , to thereby enable production of DGLA.
  • Construct pKO2UF8289 contained the following components:
  • the pKO2UF8289 plasmid was digested with Asc ⁇ /Sph ⁇ and then used for transformation of strain Y4001 U1 according to the General Methods.
  • the transformed cells were plated onto MMLeu plates, and plates were maintained at 30 C for 2 to 3 days. The colonies were picked and streaked onto MMLeu selection plates at 30 C for 2 days. These cells were then used to inoculate liquid MMLeu media, and liquid cultures were shaken at 250 rpm/min for 2 days at 30 ° C. The cells were collected by centrifugation, and lipids were extracted. Fatty acid methyl esters were prepared by trans-estehfication and subsequently analyzed with a Hewlett-Packard 6890 GC.
  • GC analyses showed the presence of DGLA in the transformants containing the 4 chimeric genes of pKO2UF8289, but not in the parent Y4001 U1 strain. Most of the selected 96 strains produced between 7% and 13% DGLA of total lipids. Six strains, designated as Y4034, Y4035, Y4036, Y4037, Y4038, and Y4039, produced about 15%, 13.8%, 18.2%, 13.1%, 15.6%, and 13.9% DGLA of total lipids, respectively.
  • Construct pY116 (FIG. 45B; SEQ ID NO:323) was utilized to temporarily express a Cre recombinase enzyme in strain Y4036. This released the LoxP sandwiched Ura3 gene from the genome. Plasmid pY116 was used to transform strain Y4036 according to the General
  • the pZKSL-555R plasmid was digested with AscUSphl and then used for transformation of strain Y4036U according to the General Methods.
  • the transformed cells were plated onto MMLeuLys plates, and plates were maintained at 30 ° C for 2 to 3 days. Single colonies were then re-streaked onto MMLeuLys plates, and the resulting colonies were used to inoculate liquid MMLeuLys. Liquid cultures were then shaken at 250 rpm/min for 2 days at 30 ° C. The cells were collected by centrifugation, and lipids were extracted. Fatty acid methyl esters were prepared by trans-esterification and subsequently analyzed with a Hewlett-Packard 6890 GC.
  • GC analyses showed the presence of ARA in the transformants containing the 3 chimeric genes of pZKSL-555R, but not in the parent Y4036U strain. Most of the selected 96 strains produced -10% ARA of total lipids. Four strains, designated as Y4068, Y4069, Y4070, and Y4071 , produced about 11.7%, 11.8%, 11.9% and 11.7% ARA of total lipids, respectively. Further analyses showed that the three chimeric genes of pZKSL-555R were not integrated into the Lys5 site in the Y4068, Y4069, Y4070 and Y4071 strains. All strains possessed a Lys+ phenotype.
  • strain Y4070 with respect to wildtype Yarrowia lipolytica ATCC #20362, was Ura-, unknown 1-, unknown 3-, Leu+, Lys+, GPD::FmD12::Pex20, YAT1 ::FmD12::OCT, YAT1 ::ME3S::Pex16, GPAT::EgD9e::Lip2, EXP1 ::EgD9eS::Lip1 , FBAINm::EgD9eS::Lip2, FBAINm::EgD8M::Pex20, EXP1::EgD8M::Pex16, FBAIN::EgD5::Aco, EXP1 ::EgD5S::Pex20, YAT1 :RD5S::OCT.
  • pZP3-Pa777U (FIG. 47A; SEQ ID NO:338) was generated to integrate three delta-17 desaturase genes into the Pox3 loci (GenBank Accession No. AJ001301) of strain Y4070, to thereby enable production of EPA.
  • the pZP3- Pa777U plasmid contained the following components:
  • the pZP3-Pa777U plasmid was digested with Asc ⁇ /Sph ⁇ and then used for transformation of strain Y4070 according to the General Methods.
  • the transformed cells were plated onto MM plates, and plates were maintained at 30 C for 2 to 3 days. Single colonies were then re-streaked onto MM plates, and the resulting colonies were used to inoculate liquid MMLeuLys. Liquid cultures were shaken at 250 rpm/min for 2 days at 30 C. The cells were collected by centrifugation, and lipids were extracted. Fatty acid methyl esters were prepared by trans-esterification and subsequently analyzed with a Hewlett-Packard 6890 GC.
  • GC analyses showed the presence of EPA in the transformants containing the 3 chimeric genes of pZP3-Pa777U, but not in the parent Y4070 strain. Most of the selected 96 strains produced 10-13% EPA of total lipids. Two strains, designated as Y4085 and Y4086, produced about 14.2% and 13.8% EPA of total lipids, respectively.
  • strain Y4086 with respect to wildtype Yarrowia lipolytica ATCC #20362, was Ura3+, Leu+, Lys+, unknown 1-, unknown 2-, YALI0F24167g-, GPD::FmD12::Pex20, YAT1 ::FmD12::OCT, YAT1 ::ME3S::Pex16, GPAT::EgD9e::Lip2, EXP1 ::EgD9eS::Lip1 , FBAINm::EgD9eS::Lip2, FBAINm::EgD8M::Pex20, EXP1 ::EgD8M::Pex16, FBAIN::EgD5::Aco, EXP1 ::EgD5S::Pex20, YAT1 ::RD5S::OCT, YAT1 ::PaD17
  • Strain Y4086U1 was created via temporary expression of the Cre recombinase enzyme in construct pY117 (FIG. 47B; SEQ ID NO:343) within strain Y4086 to produce a Ura- phenotype. This released the LoxP sandwiched Ura3 gene from the genome.
  • the mutated Yarrowia AHAS enzyme in plasmid pY117 conferred SU R , which was used as a positive screening marker.
  • Plasmid pY117 was derived from plasmid pY116 (described supra, and in U.S. Patent Application No. 11/635258) by inserting the mutant AHAS gene flanked by Pacl-Swal sites into Pacl-Swal digested pY116, thereby replacing the LEU selectable marker with the sulfonylurea marker. Construct pY117 thereby contained the following components:
  • Plasmid pY117 was used to transform strain Y4086 according to the General Methods. Following transformation, the cells were plated onto MMU+SU (280 ⁇ g/mL sulfonylurea; also known as chlorimuron ethyl, E. I. duPont de Nemours & Co., Inc., Wilmington, DE) plates, and plates were maintained at 30 ° C for 2 to 3 days. The individual SU R colonies grown on MMU+SU plates were picked and streaked into YPD liquid media, and liquid cultures were shaken at 250 rpm/min for 1 day at 30 C to cure the pY117 plasmid. Cells from the grown cultures were streaked onto MMU plates.
  • MMU+SU 280 ⁇ g/mL sulfonylurea; also known as chlorimuron ethyl, E. I. duPont de Nemours & Co., Inc., Wilmington, DE
  • Construct pZP2-2988 (FIG. 48A; SEQ ID NO:345) was generated to integrate one delta-12 desaturase gene, two delta-8 desaturase genes, and one delta-9 elongase gene into the Pox2 loci (GenBank Accession No. AJ001300) of strain Y4086U1 , to thereby enable higher level production of EPA.
  • the pZP2-2988 plasmid contained the following components:
  • the pZP2-2988 plasmid was digested with Asc ⁇ /Sph ⁇ and then used for transformation of strain Y4086U1 according to the General Methods.
  • the transformed cells were plated onto MM plates, and plates were maintained at 30 C for 2 to 3 days. Single colonies were re-streaked onto MM plates, and the resulting colonies were used to inoculate liquid MMLeuLys. Liquid cultures were shaken at 250 rpm/min for 2 days at 30 C.
  • the cells were collected by centrifugation, resuspended in HGM, and then shaken at 250 rpm/min for 5 days.
  • the cells were collected by centrifugation, and lipids were extracted.
  • Fatty acid methyl esters were prepared by trans-esterification and subsequently analyzed with a Hewlett-Packard 6890 GC.
  • strain Y4128 with respect to wildtype Yarrowia lipolytics ATCC #20362, was: YALI0F24167g-, PexW-, unknown 1-, unknown 2-, GPD::FmD12::Pex20, YAT1 ::FmD12::0CT, GPM/FBAIN::FmD12S::OCT, YAT1::ME3S::Pex16, GPAT::EgD9e::Lip2, EXP1 ::EgD9eS::Lip1, FBAINm::EgD9eS::Lip2, FBA::EgD9eS::Pex20, FBAINm::EgD8M::Pex20, EXP1 ::EgD8M::Pex16, GPDIN::EgD8M::Lip1 , YAT1 :EgD8M::Aco,
  • Plasmid pZKUE3S contained the following components: TABLE 15
  • Plasmid pZKUE3S was digested with Sph ⁇ /Pacl and then used to transform strain Y4128 according to the General Methods. Following transformation, cells were plated onto MM + 5-FOA selection plates, and plates were maintained at 30 C for 2 to 3 days.
  • the discrepancy in the % EPA quantified in Y4128 (37.6%) versus Y4128U (average 13.8%) is based on differing growth conditions. Specifically, the former culture was analyzed following two days of growth in liquid culture, while the latter culture was analyzed after growth on an agar plate. The Applicants have observed a 2-3 fold increase in % EPA, when comparing results from agar plates to those in liquid culture. Thus, although results are not directly comparable, both Y4128 and Y4128U strains demonstrate high production of EPA.
  • Construct pZKL2-5U89GC (FIG. 49A; SEQ ID NO:348) was generated to integrate one delta-9 elongase gene, one delta-8 desaturase gene, one delta-5 desaturase gene, and one Yarrowia lipolytics diacylglycerol cholinephosphotransferase gene (CPT1) into the Lip2 loci (GenBank Accession No. AJ012632) of strain Y4128U3 to thereby enable higher level production of EPA.
  • CPT1 Yarrowia lipolytics diacylglycerol cholinephosphotransferase gene
  • the pZKL2-5U89GC plasmid contained the following components: TABLE 16 Description of Plasmid pZKL2-5U89GC (SEQ ID NO:348) EgD5S: codon-optimized delta-5 desaturase (SEQ ID NO:332), derived from Euglena gracilis (Patent Publication US 2007-0292924-A1);
  • Aco Aco terminator sequence from Yarrowia Aco gene (GenBank Accession No. AJ001300)
  • the pZKL2-5U89GC plasmid was digested with Asc ⁇ /Sph ⁇ and then used for transformation of strain Y4128U3 according to the General Methods.
  • the transformed cells were plated onto MM plates, and plates were maintained at 30 ° C for 3 to 4 days. Single colonies were re-streaked onto MM plates, and the resulting colonies were then used to inoculate liquid MM.
  • Liquid cultures were shaken at 250 rpm/min for 2 days at 30 C.
  • the cells were collected by centrifugation, resuspended in HGM, and then shaken at 250 rpm/min for 5 days.
  • the cells were collected by centrifugation, and lipids were extracted.
  • Fatty acid methyl esters were prepared by trans-esterification and subsequently analyzed with a Hewlett-Packard 6890 GC.
  • GC analyses showed that most of the selected 96 strains produced 32-39.9% EPA of total lipids.
  • the final genotype of each strain was: YALI0C18711g-, Pex10-, YALI0F24167g-, unknown 1-, unknown 3-, GPD::FmD12::Pex20, YAT1 ::FmD12::OCT, GPM/FBAIN::FmD12S::OCT, YAT1 ::ME3S::Pex16, EXP1 ::ME3S::Pex20, GPAT::EgD9e::Lip2, EXP1 ::EgD9eS::Lip1 , FBAINm::EgD9eS::Lip2, FBA::EgD9eS::Pex20, GPD::EgD9eS::Lip2, FBAINm::EgD8M::Pex20, FBAIN::EgD8M::Pex20, FBAIN::EgD8M::Pex20
  • pZKL1-2SP98C (FIG. 49B; SEQ ID NO:352) was generated to integrate one delta-9 elongase gene, one delta-8 desaturase gene, one delta-12 desaturase gene, and one Yarrowia lipolytica diacylglycerol cholinephosphotransferase gene (CPT1) into the Lip1 loci (GenBank Accession No. Z50020) of strain Y4217U2, to thereby enable higher level production of EPA.
  • the pZKL1-2SP98C plasmid contained the following components:
  • the pZKL1-2SP98C plasmid was digested with Asc ⁇ /Sph ⁇ and then used for transformation of strain Y4217U2 according to the General Methods.
  • the transformed cells were plated onto MM plates, and plates were maintained at 30 ° C for 3 to 4 days. Single colonies were re-streaked onto MM plates, and the resulting colonies were then used to inoculate liquid MM.
  • the liquid cultures were then shaken at 250 rpm/min for 2 days at 30 C.
  • the cells were collected by centrifugation, resuspended in HGM, and then shaken at 250 rpm/min for 5 days.
  • the cells were collected by centrifugation, and lipids were extracted.
  • Fatty acid methyl esters were prepared by trans-esterification and subsequently analyzed with a Hewlett-Packard 6890 GC.
  • GC analyses showed that most of the selected 72 strains produced 40-44% EPA of total lipids.
  • Six strains designated as Y4259, Y4260, Y4261 , Y4262, Y4263, and Y4264, produced about 46.5%, 44.5%, 44.5%, 44.8%, 44.5%, and 44.3% EPA of total lipids, respectively.
  • strain Y4259 with respect to wild type Yarrowia lipolytics ATCC #20362 was: YALI0C18711g-, Pex10-, YALI0F24167g-, unknown 1- , unknown 3-, unknown 8-, GPD::FmD12::Pex20, YAT1 ::FmD12::OCT, GPM/FBAIN::FmD12S::OCT, EXP1 ::FmD12S::Aco, YAT1 ::ME3S::Pex16, EXP1 ::ME3S::Pex20 (2 copies), GPAT::EgD9e::Lip2, EXP1 ::EgD9eS::Lip1 , FBAINm::EgD9eS::l_ip2, FBA::EgD9eS::Pex20, GPD::EgD9eS::Lip2, Y
  • construct pZKUM (FIG. 5OA; SEQ ID NO:353) was used to integrate a Ura3 mutant gene into the Ura3 gene of strain Y4259.
  • the plasmid pZKUM contained the following components:
  • pZKD2-5U89A2 (FIG. 5OB; SEQ ID NO:355) was generated to integrate one delta-9 elongase gene, one delta-5 desaturase gene, one delta-8 desaturase gene, and one delta-12 desaturase gene into the diacylglycerol acyltransferase (DGAT2) loci of strain Y4259U2, to thereby enable higher level production of EPA.
  • the pZKD2-5U89A2 plasmid contained the following components:
  • the pZKD2-5U89A2 plasmid was digested with Asc ⁇ /Sph ⁇ and then used for transformation of strain Y4259U2 according to the General Methods.
  • the transformed cells were plated onto MM plates, and plates were maintained at 3O C for 3 to 4 days. Single colonies were re-streaked onto MM plates, and the resulting colonies were used to inoculate liquid MM.
  • Liquid cultures were shaken at 250 rpm/min for 2 days at 30 C.
  • the cells were collected by centrifugation, resuspended in HGM, and then shaken at 250 rpm/min for 5 days.
  • the cells were collected by centrifugation, and lipids were extracted.
  • Fatty acid methyl esters were prepared by trans-esterification and subsequently analyzed with a Hewlett-Packard 6890 GC.
  • GC analyses showed that most of the selected 96 strains produced 40-46% EPA of total lipids.
  • the complete lipid profile of Y4305 is as follows: 16:0 (2.8%), 16:1 (0.7%), 18:0 (1.3%), 18:1 (4.9%), 18:2 (17.6%), ALA (2.3%), EDA (3.4%), DGLA (2.0%), ARA (0.6%), ETA (1.7%), and EPA (53.2%).
  • the total lipid % dry cell weight (dew) was 27.5.
  • strain Y4305 with respect to wild type Yarrowia lipolytica ATCC #20362 was SCP2- (YALI0E01298g), YALI0C18711g-, Pex10-, YALI0F24167g-, unknown 1-, unknown 3-, unknown 8-, GPD::FmD12::Pex20, YAT1 ::FmD12::OCT, GPM/FBAIN::FmD12S::OCT, EXP1 ::FmD12S::Aco, YAT1 ::FmD12S::Lip2, YAT1 ::ME3S::Pex16, EXP1 ::ME3S::Pex20 (3 copies), GPAT::EgD9e::Lip2, EXP1 ::EgD9eS::Lip1 , FBAINm::EgD9eS::Lip2, FBA::Eg
  • Y. lipolytica strain Y4184U was used as the host in Examples 32, 33, 34 and
  • strain Y4184U was derived from Y. lipolytica ATCC #20362, and is capable of producing about 31% EPA relative to the total lipids via expression of a delta-9 elongase/ delta-8 desaturase pathway.
  • strain Y4184U required the construction of strain Y2224, strain Y4001 , strain Y4001 U, strain Y4036, strain Y4036U and strain Y4069 (supra).
  • strain Y4184U (diagrammed in FIG. 51A) required construction of strain Y4084 (producing 14% EPA), strain Y4084U1 (Um-), strain
  • strain Y4084 (Um-), strain Y4158U1 (Um-), and strain 4184 (producing 30.7% EPA).
  • strain Y4158, strain Y4158U1 , strain Y4184, and strain Y4184U was as described during construction of strain Y4305, supra.
  • construct pZP3-Pa777U (FIG. 47A; SEQ ID NO:338) was utilized to integrate three delta-17 desaturase genes into the Pox3 loci (GenBank Accession No. AJ001301 ) of strain Y4069, thereby resulting in isolation of strain Y4084
  • Strain Y4084U1 was created via temporary expression of the Cre recombinase enzyme in construct pY117 (FIG. 47B; SEQ ID NO:343) within strain Y4084 to produce a Ura- phenotype.
  • Construct pZP2-2988 FIG. 48A; SEQ ID NO:
  • Y4127 (producing 18% EPA). Yarrowia lipolytica strain Y4127 was deposited with the American Type Culture Collection on November 29, 2007 and bears the designation ATCC PTA-8802.
  • Strain Y4127U2 was created by disrupting the Ura3 gene in strain Y4127 via construct pZKUE3S (FIG. 48B; SEQ ID NO:351), comprising a chimeric EXP1 ::ME3S::Pex20 gene targeted for the Ura3 gene.
  • Construct pZKL1-2SP98C (FIG. 49B; SEQ ID NO:352) was utilized to integrate one delta-9 elongase gene, one delta-8 desaturase gene, one delta-12 desaturase gene, and one Yarrowia lipolytica diacylglycerol cholinephosphotransferase gene (CPT1) into the Lip1 loci (GenBank Accession No.
  • strain Y4127U2 thereby resulting in isolation of strain Y4158 (producing 25% EPA).
  • a Ura- derivative i.e., strain Y4158U1 was then created, via transformation with construct pZKUE3S (FIG. 48B; SEQ ID NO:351), comprising a chimeric EXP1 ::ME3S::Pex20 gene targeted for the Ura3 gene.
  • construct pZKL2-5U89GC FIG. 49A; SEQ ID NO:348) was utilized to integrate one delta-9 elongase gene, one delta-8 desaturase gene, one delta-5 desaturase gene, and one Yarrowia lipolytica CPT1 into the Lip2 loci (GenBank
  • the complete lipid profile of strain Y4184 is as follows: 16:0 (3.1%), 16:1 (1.5%), 18:0 (1.8%), 18:1 (8.7%), 18:2 (31.5%), ALA (4.9%), EDA (5.6%), DGLA (2.9%), ARA (0.6%), ETA (2.4%), and EPA (28.9%).
  • the total lipid % dry cell weight (dew) was 23.9.
  • strain Y4184 with respect to wildtype Yarrowia lipolytica ATCC #20362 was unknown 1-, unknown 2-, unknown 4-, unknown 5-, unknown 6-, unknown 7-, YAT1::ME3S::Pex16, EXP1 ::ME3S::Pex20 (2 copies), GPAT::EgD9e::Lip2, FBAINm::EgD9eS::Lip2, EXP1 ::EgD9eS::Lip1 , FBA::EgD9eS::Pex20, YAT1 ::EgD9eS::Lip2, GPD::EgD9eS::Lip2, GPDIN::EgD8M::Lip1 , YAT1 ::EgD8M::Aco, EXP1 ::EgD8M::Pex16, FBAINm::EgD8M::P
  • Euglena gracilis was obtained from Dr. Richard Triemer's lab at Michigan State University (East Lansing, Ml). From 10 mL of actively growing culture, a 1 mL aliquot was transferred into 250 mL of Euglena gracilis (Eg) Medium in a 500 mL glass bottle. Eg medium was made by combining 1 g of sodium acetate, 1 g of beef extract (Cat. No. U 126-01 , Difco Laboratories, Detroit, Ml), 2 g of Bacto® tryptone (0123-17-3, Difco Laboratories), and 2 g of Bacto® yeast extract (Cat. No. 0127-17- 9, Difco Laboratories) in 970 mL of water.
  • TMSH trimethylsulfonium hydroxide
  • Fatty acid methyl esters (5 ⁇ L injected from hexane layer) were separated and quantified using a Hewlett-Packard 6890 Gas Chromatograph fitted with an Omegawax 320 fused silica capillary column (Supelco Inc., Cat. No. 24152). The oven temperature was programmed to hold at 220 0 C for 2.7 min, increase to 240 0 C at 20 0 C /min, and then hold for an additional 2.3 min. Carrier gas was supplied by a Whatman hydrogen generator. Retention times were compared to those for methyl esters of standards commercially available (Nu-Chek Prep, Inc. Cat. No. U-99-A), and the resulting chromatogram is shown in FIG. 27.
  • RNA STAT-60TM reagent TEL-TEST, Inc., Friendswood, TX
  • RNA STAT-60TM reagent TEL-TEST, Inc., Friendswood, TX
  • the mRNA was isolated from 1 mg of total RNA using the mRNA Purification Kit (Amersham Biosciences, Piscataway, NJ) following the manufacturer's protocol provided. In this way, 85 ⁇ g of mRNA were obtained.
  • a cDNA library was generated using the CloneminerTM cDNA Library Construction Kit (Cat. No.18249-029, Invitrogen Corporation, Carlsbad, CA) and following the manufacturer's protocol provided (Version B, 25-0608).
  • cDNA was synthesized from 3.2 ⁇ g of mRNA (described above) using the Biotin-atfB2-Oligo(dT) primer. After synthesis of the first and second strand, the atiB ⁇ adapter was added; ligation was performed; and the cDNA was size fractionated using column chromatography.
  • DNA from fractions 7 and 8 were concentrated, recombined into pDONRTM222, and transformed into E. coli ElectroMAXTM DH10BTM T1 Phage- Resistant cells (Invitrogen Corporation).
  • the Euglena gracilis library was named eeg1c.
  • clones first were recovered from archived glycerol cultures grown/frozen in 384-well freezing media plates. Using an automatic QPix colony picker (Genetix), cells were picked and then used to inoculate 96-well deep-well plates containing LB + 50 ⁇ g/mL kanamycin.
  • Plasmids After growing 20 h at 37 0 C, cells were pelleted by centrifugation and stored at -20 0 C. Plasmids then were isolated on an Eppendorf 5Prime robot, using a modified 96-well format alkaline lysis miniprep method (Eppendorf PerfectPrep). Briefly, a filter and vacuum manifold were used to facilitate removal of cellular debris after acetate precipitation. Plasmid DNA was then bound on a second filter plate directly from the filtrate, washed, dried, and eluted.
  • Plasmids were end-sequenced in 384-well plates, using vector-primed M13F Universal primer (SEQ ID NO:1) and the ABI BigDye version 3 Prism sequencing kit.
  • SEQ ID NO:1 vector-primed M13F Universal primer
  • ABI BigDye version 3 Prism sequencing kit 100-200 ng of template and 6.4 pmol of primer were used, and the following reaction conditions were repeated 25 times: 96 0 C for 10 sec, 50 0 C for 5 sec and 60 0 C for 4 min. After ethanol-based cleanup, cycle sequencing reaction products were resolved and detected on Perkin-Elmer ABI 3700 automated sequencers.
  • C20-PUFA Elongating Enzyme Homologs from Euglena gracilis cDNA Library eegic cDNA clones encoding C20-PUFA elongating enzyme homologs were identified by conducting BLAST (Basic Local Alignment Search Tool; Altschul et al., J. Mot. Biol.
  • the BLASTX search using the nucleotide sequences from clone eeg1c.pkOO5.p14.f revealed similarity of the protein encoded by the cDNA to the C20-PUFA EIo from Pavlova sp. CCMP459 (SEQ ID NO:2) (NCBI Accession No. AAV33630 (Gl 54307108), locus AAV33630, CDS AY630573; Pereira et al., Biochem. J. 384:357-366 (2004)).
  • SEQ ID NO:3 5' end of cDNA insert.
  • FIS Full insert sequencing
  • the amino acid sequence set forth in SEQ ID NO:6 was evaluated by BLASTP, yielding a pLog value of 61.22 (E value of 6e-62) versus the Pavlova sp. CCMP459 C20-PUFA EIo (SEQ ID NO:2).
  • the amino acid sequence set forth in SEQ ID NO:6 is 45.1% identical to the Pavlova sp. CCMP459 C20-PUFA EIo sequence (SEQ ID NO:2) using the Jotun Hein method. Sequence percent identity calculations performed by the Jotun Hein method (Hein, J. J., Meth. Enz.
  • EgDHAsyni Example 4, infra
  • EgDHAsyn2 Example 5, infra
  • the BLASTX search using the nucleotide sequences from clone eeg1c.pkO16.e6.f (also called pKR1049) revealed similarity of the protein encoded by the cDNA to the C20-PUFA EIo from Pavlova sp. CCMP459 (SEQ ID NO:2)
  • SEQ ID NO: 12 The amino acid sequence set forth in SEQ ID NO: 12 was evaluated by BLASTP as described in Example 3. Interestingly, SEQ ID NO:12 was found to be similar to both C20-PUFA EIo and delta-4 fatty acid desaturase.
  • the N-terminus of SEQ ID NO:12 (from approximately amino acids 16-268) yields a pLog value of 60.30 (E value of 5e-61 ; 124/258 identical amino acids; 48% identity) versus the Pavlova sp. CCMP459 C20-PUFA EIo (SEQ ID NO:2).
  • SEQ ID NO: 12 From approximately amino acids 253-793 yields an E value of 0.0 (535/541 identical amino acids; 98% identity), versus the delta-4 fatty acid desaturase from Euglena gracilis (SEQ ID NO: 13) (NCBI Accession No. AAQ19605 (Gl 33466346), locus AAQ19605, CDS AY278558; Meyer et al., Biochemistry 42(32): 9779-9788 (2003)).
  • EgDHAsyni Euglena gracilis DHA synthase 1
  • the amino acid sequence of EgDHAsyni (SEQ ID NO:12) is 47.8% identical to the C20-PUFA EIo from Pavlova sp. CCMP459 (SEQ ID NO:2) and 98.9% identical to the delta-4 fatty acid desaturase from Euglena gracilis (SEQ ID NO: 13), using the Jotun Hein method as described in Example 3.
  • EgDHAsyni (SEQ ID NO: 12) is 41.2% identical to the C20-PUFA EIo from Pavlova sp. CCMP459 (SEQ ID NO:2) and 98.9% identical to the delta-4 fatty acid desaturase from Euglena gracilis (SEQ ID NO: 13), using the Clustal V method as described in Example 3.
  • FIG 27 summarizes BLASTP and percent identity values for EgDHAsyni (Example 4), EgC20elo1 (Example 3, supra) and EgDHAsyn2 (Example 5, infra).
  • EXAMPLE 5 summarizes BLASTP and percent identity values for EgDHAsyni (Example 4), EgC20elo1 (Example 3, supra) and EgDHAsyn2 (Example 5, infra).
  • Biodyne B 0.45 ⁇ m membrane (Cat. No. 60207, Pall Corporation, Pensacola, FL) was trimmed to approximately 22 cm x 22 cm, and the membrane was carefully laid on top of the agar to avoid air bubbles. After incubation for 2 min at room temperature, the membrane was marked for orientation, lifted off with tweezers, and placed colony-side up on filter paper soaked with 0.5 M sodium hydroxide and 1.5 M sodium chloride. After denaturation for 4 min, the sodium hydroxide was neutralized by placing the membrane on filter paper soaked with 0.5 M Tris-HCL (pH 7.5) and 1.5 M sodium chloride for 4 min. This step was repeated, and the membrane was rinsed briefly in 2X SSC buffer (2OX SSC is 3M sodium chloride, 0.3 M sodium citrate; pH 7.0) and air dried on filter paper.
  • Hybridization 2X SSC buffer
  • Hybridization solution contained 6X SSPE (2OX SSPE is 3 M sodium chloride, 0.2 M sodium phosphate, 20 mM EDTA; pH 7.4), 5X Denhardt's reagent (100X Denhardt's reagent is 2%(w/v) Ficoll, 2% (w/v) polyvinylpyrrolidone, 2% (w/v) acetylated bovine serum albumin), 0.5% sodium dodecyl sulfate (SDS), 100 ⁇ g/mL sheared salmon sperm DNA, and 5% dextran sulfate.
  • 6X SSPE 2OX SSPE is 3 M sodium chloride, 0.2 M sodium phosphate, 20 mM EDTA; pH 7.4
  • 5X Denhardt's reagent 100X Denhardt's reagent is 2%(w/v) Ficoll, 2% (w/v) polyvinylpyrrolidone, 2% (w/v
  • a DNA probe was made using an agarose gel purified NcoUNott DNA fragment, containing EgDHAsyni*, from pY141 (described in Example 10 herein) labeled with P 32 dCTP using the Rad Prime DNA Labeling System (Cat. No. 18428- 011 , Invitrogen, Carlsbad, CA), following the manufacturer's instructions. Unincorporated P 32 dCTP was separated using a NICK column (Cat. No. 17-0855- 02, Amersham Biosciences, Piscataway, NJ), following the manufacturer's instructions. The probe was denatured for 5 min at 100 0 C and placed on ice for 3 min; then, half was added to the hybridization solution.
  • the membrane was hybridized with the probe overnight at 65 0 C with gentle shaking and then washed the following day twice with 2X SSC containing 0.5% SDS (5 min each) and twice with 0.2X SSC containing 0.1% SDS (15 min each). After washing, hyperfilm (Cat. No. RPN30K, Amersham Biosciences) was exposed to the membrane overnight at -80 0 C.
  • the plasmid from eeg1c-1 may also be referred to as pLF116.
  • the individual positive clone was grown at 37 0 C in LB + 50 ⁇ g/mL kanamycin liquid media, and plasmid was purified using the QIAprep® Spin Miniprep Kit (Qiagen Inc., Valencia, CA) following the manufacturer's protocol.
  • the plasmid insert was sequenced as described in Example 2, with the ABI BigDye version 3 Prism sequencing kit using vector-primed M13F Universal primer (SEQ ID NO:1), vector-primed M13rev primer (SEQ ID NO:14), and the poly(A) tail-primed WobbleT oligonucleotides.
  • the WobbleT primer is an equimolar mix of 21 mer poly(T)A, poly(T)C, and poly(T)G, used to sequence the 3' end of cDNA clones. Based on initial sequence data, additional internal fragment sequence was obtained in a similar way using oligonucleotides oEUGel4-1 (SEQ ID NO: 15), EgEloD4Mut-5 (SEQ ID NO:16), oEUGel4-2 (SEQ ID NO:17), EgDHAsyn ⁇ 1 (SEQ ID NO: 18), and EgDHAsyn3' (SEQ ID NO: 19).
  • SEQ ID NO:22 The amino acid sequence set forth in SEQ ID NO:22 was evaluated by BLASTP as described in Example 3. As was the case for EgDHAsyni , SEQ ID NO:22 was also found to be similar to both C20-PUFA EIo and delta-4 fatty acid desaturase.
  • the N-terminus of SEQ ID NO:22 (from approximately amino acids 41- 268) yields a pLog value of 61.0 (E value of 1e-61 ; 118/231 identical amino acids; 51 % identity) versus the Pavlova sp. CCMP459 C20-PUFA EIo (SEQ ID NO:2).
  • SEQ ID NO:22 From approximately amino acids 253-793 yields an E value of 0.0 (541/541 identical amino acids; 100% identity), versus the amino acid sequence of delta-4 fatty acid desaturase from Euglena gracilis (SEQ ID NO: 13).
  • BLAST scores and probabilities indicate that the instant nucleic acid fragment (SEQ ID NO:21) encodes an entire Euglena gracilis C20-PUFA Elo/delta-4 fatty acid desaturase fusion gene, hereby named Euglena gracilis DHA synthase 2 (EgDHAsyn2).
  • the amino acid sequence of EgDHAsyn2 (SEQ ID NO:22) is 48.2% identical to the C20-PUFA EIo from Pavlova sp. CCMP459 (SEQ ID NO:2) and 100% identical to the delta-4 fatty acid desaturase from Euglena gracilis (SEQ ID NO:13), using the Jotun Hein method as described in Example 3.
  • the amino acid sequence of EgDHAsyn2 (SEQ ID NO:22) is 41.2% identical to the C20-PUFA EIo from Pavlova sp. CCMP459 (SEQ ID NO:2) and 100% identical to the delta-4 fatty acid desaturase from Euglena gracilis (SEQ ID NO:13), using the Clustal V method as described in Example 3.
  • FIG. 25 summarizes BLASTP and percent identity values for EgDHAsyn2 (Example 5), EgC20elo1 (Example 3, supra) and EgDHAsyni (Example 4, supra).
  • EgDHAsvni and EgDHAsvn2 Given the 100% amino acid identity between the C-terminus of EgDHAsyn2
  • the alignment is shown in FIG. 2.
  • the Euglena gracilis delta-4 desaturase coding sequence is named EgD4_CDS (SEQ ID NO:24); the Euglena gracilis delta-4 desaturase cDNA sequence is named EgD4_cDNA (SEQ ID NO:23); and the Euglena gracilis DHA synthase 2 coding sequence is named EgDHAsyn2_CDS (SEQ ID NO:21).
  • FIG. 2 illustrates that the sequences are highly divergent from the start of the Euglena gracilis delta-4 desaturase cDNA to 83 bp upstream of the coding sequence (CDS) start site.
  • nucleotide sequences for EgD4_cDNA and EgDHAsyn2_CDS are identical from 83 bp upstream of the CDS start site of the Euglena gracilis delta-4 desaturase cDNA sequence (SEQ ID NO:23), which is equivalent to nucleotide 674 of the EgDHAsyn2_CDS (SEQ ID NO:21), through to the end of the sequences.
  • a Not ⁇ site can be found in the Euglena gracilis cDNA sequence (nucleotides 656-663 of SEQ ID NO:23), and since Not ⁇ linkers were used in the original cloning of the Euglena gracilis delta-4 desaturase cDNA (see Meyer et al., supra), it is likely that what was cloned was an incomplete, not full- length, transcript for EgDHAsyn2.
  • EgDHAsyni The amino acid sequence EgDHAsyni (SEQ ID NO:12) was compared to EgDHAsyn2 (SEQ ID NO:22) and EgC20elo1 (SEQ ID NO:6) using the Clustal W method as described above, and the alignment is shown in FIGs. 3A and 3B.
  • EgC20elo1 Compared to EgDHAsyni and EgDHAsyn2, EgC20elo1 has a deletion of 7 amino acids (i.e., A L D L A [V/l] L) and 2 other amino acid substitutions (i.e., W47R, T48I; based on numbering for EgDHAsyni) at the N-terminus. After amino acid 289 of EgC20elo1 , the sequences are very different when compared to the DHA synthases. EgDHAsyni and EgDHAsyn2 have an additional 498 amino acids at their C-terminal ends with homology to delta-4 fatty acid desaturases, while EgC20elo1 ends after only 9 additional amino acids.
  • EgDHAsyni SEQ ID NO:12
  • EgDHAsyn2 SEQ ID NO:22
  • the amino acid sequences of EgDHAsyni SEQ ID NO:12
  • EgDHAsyn2 SEQ ID NO:22
  • the last four differences occur in the delta-4 desaturase domain.
  • FIGs. 4A and 4B show the Clustal W alignment of the N-terminus of EgDHAsyni (SEQ ID NO:12) and the N-terminus of EgDHAsyn2 (SEQ ID NO:22) with EgC20elo1 (SEQ ID NO:6), Pavlova sp. CCMP459 C20-PUFA EIo (SEQ ID NO:2), Ostreococcus tauri PUFA elongase 2 (SEQ ID NO:25) (NCBI Accession No. AAV67798 (Gl 55852396), locus AAV67798, CDS AY591336; Meyer et al., J. Lipid Res.
  • Thalassiosira pseudonana PUFA elongase 2 (SEQ ID NO:26) (NCBI Accession No. AAV67800 (Gl 55852441), locus AAV67800, CDS AY591338; Meyer et al., J. Lipid Res., supra).
  • the Pavlova, Ostreococcus, and Thalassiosira proteins are labeled as PavC20elo, OtPUFAelo2, and TpPUFAelo2, respectively.
  • FIGs. 5A, 5B 1 5C, and 5D show the Clustal W alignment of the C-terminus of EgDHAsyni (EgDHAsyn1_CT; amino acids 253-793 of SEQ ID NO:12; the N- terminus of EgDHAsyni is not shown and is indicated by "!) and the C-terminus of EgDHAsyn2 (EgDHAsyn2_CT; amino acids 253-793 of SEQ ID NO:22, the N- terminus of EgDHAsyn2 is not shown and is indicated by "") with Euglena gracilis delta-4 fatty acid desaturase (SEQ ID NO: 13), Thraustochytrium aureum delta-4 desaturase (SEQ ID NO:27) (NCBI Accession No.
  • AAN75707 (GI 25956288), locus AAN75707, CDS AF391543), Schizochytrium aggregatum delta-4 desaturase (SEQ ID NO:28) (PCT Publication No. WO 2002/090493), Thalassiosira pseudonana delta-4 desaturase (SEQ ID NO:29) (NCBI Accession No. AAX14506 (Gl 60173017), locus AAX14506, CDS AY817156; Tonon et al., FEBS J. 272 (13):3401-3412 (2005)), and lsochrysis galbana delta-4 desaturase (SEQ ID NO:30) (NCBI Accession No.
  • AAV33631 (Gl 54307110), locus AAV33631 , CDS AY630574; Pereira et al., Biochem. J. 384(2), :357-366 (2004) and PCT Publication No. WO 2002/090493).
  • the Euglena, Thraustochytrium, Thalassiosira, and lsochrysis proteins are labeled as EgD4, TaD4, TpD4, and lgD4, respectively.
  • FIG. 6 shows an alignment of interior fragments of EgDHAsyni (labeled as "EgDHAsyn1_ NCT.pro”; amino acids 253-365 of SEQ ID NO:12) and EgDHAsyn2 (labeled as “EgDHAsyn2_NCT.pro”; amino acids 253-365 of SEQ ID NO:22), spanning both the C20 elongase region and the delta-4 desaturase domain (based on homology), with the C-termini of C20 elongases (EgC20elo1_CT.pro, amino acids 246-298 of SEQ ID NO:6; PavC20elo_CT.pro, amino acids 240-277 of SEQ ID NO:2; OtPUFAelo2_CT.pro, amino acids 256-300 of SEQ ID NO:25; TpPUFAelo2_CT.pro, amino acids 279-358 of SEQ ID NO:26) and the N-termini of delta
  • VLFXXFYXXXY (SEQ ID NO:180)
  • VLFXXFYXXXY (SEQ ID NO:180)
  • both EgDHAsyni and EgDHAsyn2 contain a proline-rich region (labeled "Proline-rich linker" in FIG. 6), which may act as a linker between the C20 elongase and delta-4 desaturase domains.
  • the linker may play a role in keeping the C20 elongase and delta-4 desaturase domains in the proper structural orientation to allow efficient conversion of EPA to DHA.
  • the proline-rich linker is shown in FIG. 6 as extending from P304 to V321 (based on numbering for EgDHAsyni), the NG repeat region is also somewhat proline-rich and may also play a role in this linker function.
  • nucleotide and corresponding amino acid sequences for the proline-rich linker of EgDHAsyni are set forth in SEQ ID NO:197 and SEQ ID NO: 198, respectively.
  • nucleotide and corresponding amino acid sequences for the proline-rich linker of EgDHAsyn2, as defined in FIG. 6, are set forth in SEQ ID NO:199 and SEQ ID NO:200, respectively.
  • the nucleotide and corresponding amino acid sequences for the EgDHAsyni C20 elongase domain from EgDHAsyni are set forth in SEQ ID NO:201 and SEQ ID NO:202, respectively.
  • the nucleotide and corresponding amino acid sequences for the EgDHAsyn2 C20 elongase domain are set forth in SEQ ID NO:203 and SEQ ID NO:204, respectively.
  • Plasmid pY5-30 contains the following: a Yarrowia autonomous replication sequence (ARS18); a CoIEI plasmid origin of replication; an ampicillin-resistance gene (AmpR), for selection in E. coli; a Yarrowia LEU2 gene, for selection in Yarrowia; and a chimeric TEF::GUS::XPR gene.
  • ARS18 Yarrowia autonomous replication sequence
  • AmpR ampicillin-resistance gene
  • AmpR ampicillin-resistance gene
  • Plasmid pDMW263 (SEQ ID NO:31) was created from pY5- 30, by replacing the TEF promoter with the Yarrowia lipolytica FBAINm promoter (U.S. Patent 7,202,356), using techniques well known to one skilled in the art. Briefly, the FBAIN promoter is located in the 5' upstream untranslated region in front of the 'ATG' translation initiation codon of the fructose-bisphosphate aldolase enzyme (E. C. 4.1.2.13), encoded by the fba1 gene. This promoter is necessary for expression and includes a portion of 5' coding region that has an intron.
  • the modified promoter, FBAINm has a 52 bp deletion between the ATG translation initiation codon and the intron of the FBAIN promoter (thereby including only 22 amino acids of the N-terminus) and a new translation consensus motif after the intron.
  • Table 20 summarizes the components of pDMW263 (SEQ ID NO:31 ; also described in PCT Publication No. WO 2007/061845).
  • Plasmid pY115 (SEQ ID NO:33; FIG. 7A) was digested with Nco ⁇ /Not ⁇ , and the resulting DNA ends were filled using Klenow. After filling to form blunt ends, the DNA fragments were treated with calf intestinal alkaline phosphatase and separated using agarose gel electrophoresis.
  • the 6989 bp fragment containing the Yarrowia lipolytica FBAINm promoter was excised from the agarose gel and purified using the QIAquick® Gel Extraction Kit (Qiagen Inc., Valencia, CA), following the manufacturer's protocol.
  • the purified 6989 bp fragment was ligated with cassette rfA using the Gateway Vector Conversion System (Cat. No. 11823-029, Invitrogen Corporation), following the manufacturer's protocol, to form Yarrowia lipolytica Gateway® destination vector pBY1 (SEQ ID NO:34; FIG. 7B).
  • the filled ⁇ /col site provides an ATG start for translation initiation.
  • genes transferred to this expression vector are expressed as fusion proteins and must be in the correct frame after Gateway® cloning. Also, 5' untranslated sequence results in additional amino acids being added to the N- terminus of the resulting protein. For this reason, a second Gateway® destination vector was made which had the vector-derived ATG start codon removed, thus allowing for translational start from the gene inserted.
  • the FBAINm promoter was amplified from plasmid pY115 (SEQ ID NO:33), using PCR with oligonucleotide primers oYFBAI (SEQ ID NO:35) and 0YFBAI-6 (SEQ ID NO:36).
  • Primer oYFBAI (SEQ ID NO:35) was designed to introduce a BgIW site at the 5' end of the promoter
  • primer 0YFBAI-6 SEQ ID NO:36
  • the resulting PCR fragment was digested with BgIW and Not ⁇ and cloned into the BglW/Not ⁇ fragment of pY115, containing the vector backbone, to form pY158 (SEQ ID NO:37).
  • Plasmid pY158 (SEQ ID NO:37) was digested with Not ⁇ , and the resulting DNA ends were filled. After filling to form blunt ends, the DNA fragments were treated with calf intestinal alkaline phosphatase and separated using agarose gel electrophoresis.
  • the 6992 bp fragment containing the Yarrowia lipolytica FBAINm promoter was excised from the agarose gel and purified using the QIAquick® Gel Extraction Kit (Qiagen Inc., Valencia, CA), following the manufacturer's protocol.
  • the purified 6992 bp fragment was ligated with cassette rfA using the Gateway Vector Conversion System (Cat. No. 11823-029, Invitrogen Corporation), following the manufacturer's protocol, to form Yarrowia lipolytica Gateway® destination vector pY159 (SEQ ID NO:38; FIG. 7C).
  • Plasmid was purified from clones eeg1c.pkOO5.p14.f (Example 3), eeg1c.pkO16.e6.f (Example 4), and eeg1c-1 (Example 5) using the QIAprep® Spin Miniprep Kit (Qiagen Inc., Valencia, CA), following the manufacturer's protocol.
  • the cDNA inserts from eeg1c.pkOO1.p14.f (comprising EgC20elo1) and eeg1c.pkO16.e6.f (comprising EgDHAsyni) were transferred to pBY1 (SEQ ID NO:34; FIG. 7B) to form pBY- EgC20elo1 (SEQ ID NO:39, FIG. 7D) and pY132 (SEQ ID NO:40; FIG. 8A), respectively.
  • the cDNA insert from eeg1c-1 (comprising EgDHAsyn2) was not transferred to pBY1 , because it would have resulted in the wrong translation frame being expressed.
  • the cDNA inserts from eeg1c.pkO16.e6.f and eeg1c-1 were transferred to pY159 (SEQ ID NO:38; Example 8) to form pY161 (SEQ ID NO:41 , FIG. 8B) and pY164 (SEQ ID NO:42; FIG. 8C), respectively.
  • EgDHAsyni was amplified from clone eeg1c.pkOO1.e6.f with oligonucleotide primers EgEPAEIoDom-5 (SEQ ID NO:43) and oEUG el4-3 (SEQ ID NO:44), using the PhusionTM High-Fidelity DNA Polymerase (Cat. No. F553S, Finnzymes Oy, Finland) following the manufacturer's protocol.
  • the resulting DNA fragment was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pKR1062 (SEQ ID NO:45).
  • An internal ⁇ /col site at nucleotides 619-624 was removed from EgDHAsyni in pKR1062 using the Quickchange® Site Directed Mutagenesis kit (Cat. No. 200518, Stratagene, La JoIIa, CA), with oligonucleotides EgEloD4Mut-5 (SEQ ID NO:46) and EgEloD4Mut-3 (SEQ ID NO:47), following the manufacturer's protocol. After extensive sequencing, a clone with the ⁇ /col site removed (i.e., a ccatgg to ccttgg mutation) and no further nucleotide changes made was chosen for further study.
  • This clone was designated pLF115-7 (SEQ ID NO:48).
  • the nucleotide sequence for EgDHAsyni having the ⁇ /col site removed (EgDHAsyni * ) is set forth in SEQ ID NO:205.
  • the corresponding amino acid sequence is identical to SEQ ID NO:12. Construction Of Plasmid pY141. Expressing EgDHAsvni*: The Nco ⁇ /Not ⁇
  • DNA fragment from pLF115-7 (SEQ ID NO:48), containing EgDHAsyni (SEQ ID NO:205; without the internal ⁇ /col site; at nt 621 of the EgDHAsyni CDS; ccatgg to ccttgg), was cloned into the ⁇ /col/ ⁇ /ofl DNA fragment from pY115, containing the Yarrowia lipolytics FBAINm promoter, to produce pY141 (SEQ ID NO:49; FIG. 8D).
  • plasmid pY141 contains the full length EgDHAsyni* gene (labeled as ⁇ gDHAsyni(-Ncol)" in FIG.), under control of the Yarrowia lipolytica FBAINm promoter (PCT Publication No. WO 2005/049805; U.S. Patent 7,202,356; labeled as "Fba1+lntron” in FIG.), and the Pex20 terminator sequence from Yarrowia Pex20 gene (GenBank Accession No. AF054613).
  • E ⁇ DHAsvn1-C20EloDom1 The nucleotide sequence for the EgDHAsyni * C20 elongase domain (EgDHAsyn1C20EloDom1) in pY141 is set forth in SEQ ID NO:206 (identical to SEQ ID NO:201 but ⁇ /col site removed). The corresponding amino acid sequence is identical to SEQ ID NO:202.
  • EgDHAsyn1C20EloDom1 (SEQ ID NO:206) was amplified from pLF115- 7 with oligonucleotide primers EgEPAEIoDom-5 (SEQ ID NO:43) and EgDPAEIoDom-3 (SEQ ID NO:50) using the PhusionTM High-Fidelity DNA
  • EgDHAsyn1C20EloDom1 (without the internal ⁇ /col site), was cloned into the NcoUNoti DNA fragment from pY115, containing the Yarrowia lipolytica FBAINm promoter, to produce pY143 (SEQ ID NO:52; FIG. 9A).
  • Plasmid pY143 contains the N-terminal domain of EgDHAsyni* (EgDHAsyn1C20EloDom1) and does not include the proline-hch linker or delta-4 desaturase domain.
  • EgDHAsvni- C20EloDom2Linker The EgDHAsyni * C20 elongase domain (SEQ ID NO:206) and proline-hch linker (SEQ ID NO:197), were amplified from pLF115-7 (SEQ ID NO:48) with oligonucleotide primers EgEPAEIoDom-5 (SEQ ID NO:43) and oEUGsyn6-2 (SEQ ID NO:53) using the PhusionTM High-Fidelity DNA Polymerase (Cat. No. F553S, Finnzymes Oy, Finland ) following the manufacturer's protocol.
  • PhusionTM High-Fidelity DNA Polymerase Cat. No. F553S, Finnzymes Oy, Finland
  • the resulting DNA fragment was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pKR1071 (SEQ ID NO:54).
  • the ⁇ /col/Ec/132ll DNA fragment from pKR1071 was cloned into the Nco ⁇ /Not ⁇ DNA fragment from pY115 (where the Noti site had been filled in), containing the Yarrowia lipolytica FBAINm promoter, to produce pY149 (SEQ ID NO:54).
  • Plasmid pY149 contains the EgDHAsyn1C20EloDom1/proline- rich linker fusion gene (i.e., EgDHAsyn1C20EloDom2Linker; SEQ ID NO:207), but does not contain the delta-4 desaturase domain.
  • the amino acid sequence of EgDHAsyn1C20EloDom2Linker is set forth in SEQ ID NO:208.
  • an additional 4 amino acids i.e., SCRT
  • Novel C20 Elongase/Delta-4 Desaturase Fusion Proteins In order to synthesize novel C20 elongase/delta-4 desaturase fusion proteins, a unique Sbft site was added to the 3' end of the C20 elongase domain of EgDHAsyni* after the proline-rich linker region (EgDHAsyn1C20EloDom3Linker).
  • EgDHAsyn1C20EloDom3 was amplified from pLF115-7 (SEQ ID NO:48) with oligonucleotide primers EgEPAEIoDom-5 (SEQ ID NO:43) and oEUGsyn6-3 (SEQ ID NO:56) using the PhusionTM High-Fidelity DNA Polymerase (Cat. No. F553S, Finnzymes Oy 1 Finland) following the manufacturer's protocol.
  • the resulting DNA fragment was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pKR1091 (SEQ ID NO:57).
  • EgDHAsyn1C20EloDom3Linker was cloned into the NcoUNoft DNA fragment from pY115 (where the Not ⁇ was filled to form a blunt end), containing the Yarrowia lipolytica FBAINm promoter, to produce pY155 (SEQ ID NO:58).
  • a unique Sbft site was added to the 5' end of various delta-4 desaturases.
  • the Sbfl site is located after the ATG start site of each coding sequence and resulted in the addition and/or replacement of a few amino acids at the N-terminus of the delta-4 desaturase coded for by the genes.
  • the resulting DNA fragment which contains the lgD4 CDS and is identical to SEQ ID NO:209 except that an Sbft site was added at the 5' end after the start codon (lgD4*; SEQ ID NO:210), was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pKR1067 (SEQ ID NO:61).
  • the amino acid sequence for lgD4 * from pKR1067 is set forth in SEQ ID NO:211 and is identical to that to lgD4 (SEQ ID NO:30) except that the first 4 amino acids (i.e., MCNA) have been changed to MALQ due to the addition of the Sbfl site in the nucleotide sequence.
  • the Nco ⁇ /Not ⁇ DNA fragment from pKR1067 (SEQ ID NO:61), containing lgD4 * was cloned into the Nco ⁇ /Not ⁇ DNA fragment from pY115, containing the Yarrowia lipolytica FBAINm promoter, to produce pY150 (SEQ ID NO:62; FIG. 9C).
  • SEQ ID NO:62 The amino acid sequence for lgD4 * from pKR1067 is set forth in SEQ ID NO:211 and is identical to that to lgD4 (SEQ ID NO:30) except that the first 4 amino acids (i.e.,
  • lgD4 * is labeled as "Ig d4 DS". In this way, lgD4 * could be expressed alone in Yarrowia.
  • EgDHAsyn1C20EloDom3Linker and lgD4*, separated by the proline-rich linker region (called EgDHAsyn1C20EloDom3-lgD4; SEQ ID NO:212).
  • the amino acid sequence for EgDHAsyn1C20EloDom3-lgD4 is set forth in SEQ ID NO:213.
  • the Nco ⁇ /Not ⁇ DNA fragment from pKR1097 (SEQ ID NO:63), containing the EgDHAsyn1C20EloDom3-lgD4, was cloned into the Nco ⁇ /Not ⁇ DNA fragment from pY115, containing the Yarrowia lipolytica FBAINm promoter, to produce pY156 (SEQ ID NO:64; FIG. 9D).
  • the EgDHAsyn1C20EloDom3-lgD4 is labeled as "EGel-IGd4".
  • EqDHAsvn1-D4Dom1* A region of the C-terminus of EgDHAsyni* (SEQ ID NO:205) containing the delta-4 desaturase domain (EgDHAsyn1 D4Dom1 ; SEQ ID NO:214; corresponding amino acid sequence for EgDHAsyni D4Dom1 is set forth in SEQ ID NO:215), starting just after the end of the proline-rich linker region, was amplified from pLF115-7 (as described in Example 10) with oligonucleotides oEGslne6-1 (SEQ ID NO:65) and oEUGel4-3 (SEQ ID NO:44) using the PhusionTM High-Fidelity DNA Polymerase (Cat.
  • Oligonucleotide oEGslne ⁇ - 1 (SEQ ID NO:65) introduced an ATG start codon at the 5' end of the PCR product followed by an Sbfi site.
  • the resulting DNA fragment was cloned into the pCR- Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pKR1069 (SEQ ID NO:66).
  • EgDHAsyn1 D4Dom1 The new CDS and amino acid sequences containing EgDHAsyn1 D4Dom1 from pKR1069 (i.e., EgDHAsyn1 D4Dom1*) are set forth in SEQ ID NO:216 and SEQ ID NO:217, respectively.
  • the amino acid sequence for EgDHAsyn1 D4Dom1 * (SEQ ID NO:217) is identical to that of EgDHAsyn1 D4Dom1 (SEQ ID NO:215), except that the first 2 amino acids (i.e., SG) have been changed to MAL due to the addition of the Sbf ⁇ site in the nucleotide sequence.
  • the ⁇ /col/ ⁇ /ofl DNA fragment from pKR1069 (SEQ ID NO:66), containing the EgDHAsyni D4Dom1*, was cloned into the ⁇ /col/ ⁇ /ofl DNA fragment from pY115, containing the Yarrowia lipolytics FBAINm promoter, to produce pY152 (SEQ ID NO:67; FIG. 10A).
  • the EgDHAsyn1 D4Dom1 * is labeled as "EUG d4 (fus test)". In this way, the EgDHAsyn1 D4Dom1* could be expressed alone in Yarrowia.
  • EgDHAsyn1C20EloDom the EgDHAsyn1 D4Dom1 * , separated by the proline- rich linker region (called EgDHAsyn1C20EloDom3-EgD4Dom1 ; SEQ ID NO:218).
  • the amino acid sequence of EgDHAsyn1 C20EloDom3-EgD4Dom1 (SEQ ID NO:219) is almost identical to EgDHAsyni except one amino acid (i.e., G323L based on numbering for EgDHAsyni) was changed due to the Sbft cloning site and fusion junction.
  • the ⁇ /col/ ⁇ /ofl DNA fragment from pKR1099 (SEQ ID NO:68), containing the EgDHAsyni C20EloDom3-EgD4Dom1 , was cloned into the NcoUNoti DNA fragment from pY115, containing the Yarrowia lipolytica FBAINm promoter, to produce pY157 (SEQ ID NO:69; FIG. 10B).
  • the EgDHAsyn1C20EloDom3-EgD4Dom1 is labeled as "EGel-EGd4 fus". Construction Of Plasmid pY153.
  • EqDHAsvn1*D4Dom2 A region of the C-terminus of EgDHAsyni containing the delta-4 desaturase domain and some of the C20 elongase domain (EgDHAsyn1 D4Dom2; SEQ ID NO:220; corresponding amino acid sequence for EgDHAsyn1 D4Dom2 is set forth in SEQ ID NO:221), which corresponds to the amino acid sequence identified as EgD4 (SEQ ID NO: 13; Meyer et al., Biochemistry 42(32):9779-9788 (2003)), was amplified from pLF115-7 (described in Example 10) with oligonucleotides oEUGel4-4 (SEQ ID NO:70) and oEUGel4-3 (SEQ ID NO:44) using the PhusionTM High-Fidelity DNA Polymerase (Cat.
  • the PaVNotl DNA fragment from pKR1073 (SEQ ID NO:71), containing the EgDHAsyn1 D4Dom2, was cloned into the NcoUNott DNA fragment from pY115, containing the Yarrowia lipolytica FBAINm promoter, to produce pY153 (SEQ ID NO:72; FIG. 10C).
  • the EgDHAsyn1 D4Dom2 is labeled as EUG d4 (HZ). In this way, the EgDHAsynD4Dom2 could be expressed alone in Yarrowia.
  • the resulting DNA fragment which contains the SaD4 CDS and is identical to SEQ ID NO:222, except that an Sbf ⁇ site was added at the 5' end after the start codon (SaD4 * ; SEQ ID NO:223), was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pKR1068 (SEQ ID NO:75).
  • the amino sequence for SaD4 * from pKR1068 is set forth in SEQ ID NO:224 and is identical to that to SaD4 (SEQ ID NO:28) except that the first 3 amino acids (i.e., MTV) have been changed to MALQ due to the addition of the Sbft site in the nucleotide sequence.
  • the NcoUNott DNA fragment from pKR1068 (SEQ ID NO:75) (partial digest to avoid internal ⁇ /col site), containing the SaD4*, was cloned into the NcoUNoti DNA fragment from pY115, containing the Yarrowia lipolytica FBAINm promoter, to produce pY151 (SEQ ID NO:76; FIG. 10D).
  • the SaD4 * is labeled as "RSA d4 DS". In this way, the SaD4* could be expressed alone in Yarrowia.
  • Eu ⁇ lena anabaena Growth Conditions Lipid Profile and mRNA Isolation Euglena anabaena was obtained from Dr. Richard Triemer's lab at Michigan State University (East Lansing, Ml). Approximately 2 mL of culture were removed for lipid analysis and centrifuged at 1 ,800 x g for 5 min. The pellet was washed once with water and re-centrifuged. The resulting pellet was dried for 5 min under vacuum, resuspended in 100 ⁇ L of trimethylsulfonium hydroxide (TMSH), and incubated at room temperature for 15 min with shaking. After this, 0.5 mL of hexane were added, and the vials were incubated for 15 min at room temperature with shaking.
  • TMSH trimethylsulfonium hydroxide
  • Fatty acid methyl esters (5 ⁇ L injected from hexane layer) were separated and quantified using a Hewlett-Packard 6890 Gas Chromatograph fitted with an Omegawax 320 fused silica capillary column (Supelco Inc., Cat. No. 24152). The oven temperature was programmed to hold at 170 0 C for 1.0 min, increase to 240 0 C at 5 0 C /min, and then hold for an additional 1.0 min. Carrier gas was supplied by a Whatman hydrogen generator. Retention times were compared to those for methyl esters of standards commercially available (Nu-Chek Prep, Inc. Cat. No. U- 99-A), and the resulting chromatogram is shown in FIG. 12.
  • Euglena anabaena would be a good source for long-chain PUFA biosynthetic genes such as, but not limited to, C20 elongases, delta-4 desaturases, and/or DHA synthases.
  • the culture (25 ml_) was transferred to 100 ml_ of AF-6 medium in a 500 ml_ glass bottle, and the culture was grown for 1 month as described above. After this time, two 50 mL aliquots were transferred into two separate 500 ml_ glass bottles containing 250 mL of AF-6 medium, and the cultures were grown for two months as described above (giving a total of ⁇ 600 mL of culture). After this, the cultures were pelleted by centrifugation at 1 ,800 x g for 10 min, washed once with water, and re-centrifuged.
  • cDNA was synthesized from 5.12 ⁇ g of mRNA (Example 12) using the Biotin-atfB2-Oligo(dT) primer. After synthesis of the first and second strand, the affB1 adapter was added; ligation was performed; and the cDNA was size fractionated using column chromatography. DNA from fractions were concentrated, recombined into pDONRTM222, and transformed into E. coli
  • ElectroMAXTM DM 0BTM T1 Phage-Resistant cells (Invitrogen Corporation).
  • the Euglena anabaena library was named eug1c.
  • Approximately 17,000 clones of cDNA library eugic were plated onto 3 large square (24 cm x 24 cm) petri plates (Corning, Corning, NY), each containing LB + 50 ⁇ g/mL kanamycin agar media. Cells were grown, transferred to Biodyne B membrane, and hybridized with a labeled Nco ⁇ /Not ⁇ DNA fragment, containing EgDHAsyni * , from pY141 , exactly as described in Example 5. In this way, 11 positive clones were identified (designated as eug1c-1 to eug1c-11).
  • the positive clones were grown, and DNA was purified and sequenced as described in Example 2 using vector-primed M13F Universal primer (SEQ ID NO:1), vector-primed M13-28Rev primer (SEQ ID NO:14), and the poly(A) tail-primed WobbleT oligonucleotides.
  • EaDHAsyn ⁇ 1 SEQ ID NO:78
  • EaDHAsyn5'2 SEQ ID NO:79
  • EaDHAsyn5'3 SEQ ID NO:80
  • EaDHAsyn5'4 SEQ ID NO:81
  • EaDHAsyn3' SEQ ID NO:82
  • EaDHAsyn3'2 (SEQ ID NO:83), EaDHAsyn3'3 (SEQ ID NO:84), EaDHAsyn3'4 (SEQ ID NO:85), and EaDHAsyn3'5 (SEQ ID NO:86).
  • Sequences were aligned and compared using SequencherTM (Version 4.2, Gene Codes Corporation, Ann Arbor, Ml), and in this way, the clones could be categorized into one of four distinct groups based on insert sequence (identified as EaDHAsyni to EaDHAsyn4).
  • Representative clones containing the cDNA for each class of sequence were chosen for further study, and sequences for each representative plasmid (i.e., pLF117-1 , pl_F117-2, pLF117-3 and pLF117-4) are shown as SEQ ID NO: 87, SEQ ID NO:88, SEQ ID NO:89, and SEQ ID NO:90, respectively.
  • the sequence of pLF117-1 shown by a string of NNNN's represents a region of the polyA tail which was not sequenced.
  • the coding sequences for EaDHAsyni , EaDHAsyn2, EaDHAsyn3, and EaDHAsyn4 are shown as SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, and SEQ ID NO:94, respectively.
  • the corresponding amino acid sequences for EaDHAsyni , EaDHAsyn2, EaDHAsyn3, and EaDHAsyn4 are shown as SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, and SEQ ID NO:98, respectively.
  • EaDHAsyni SEQ ID NO:95
  • EaDHAsyn2 SEQ ID NO:96
  • EaDHAsyn3 SEQ ID NO:97
  • EaDHAsyn4 SEQ ID NO:98
  • EaDHAsyni SEQ ID NO:95
  • EaDHAsyn2 SEQ ID NO:96
  • EaDHAsyn3 SEQ ID NO:97
  • EaDHAsyn4 SEQ ID NO:98
  • EaDHAsyni (SEQ ID NO:95), EaDHAsyn2 (SEQ ID NO:96), EaDHAsyn3 (SEQ ID NO:97), and EaDHAsyn4 (SEQ ID NO:98) yielded E values of 0.0 (378/538 identical amino acids; 70% identity), 0.0 (378/538 identical amino acids; 70% identity), 0.0 (379/538 identical amino acids; 70% identity), and 0.0 (368/522 identical amino acids; 70% identity), respectively, versus the amino acid sequence of delta-4 fatty acid desaturase from Euglena gracilis (SEQ ID NO: 13).
  • EaDHAsyni SEQ ID NO:95
  • EaDHAsyn2 SEQ ID NO:96
  • EaDHAsyn3 SEQ ID NO:97
  • EaDHAsyn4 SEQ ID NO:98
  • the C-terminus of the resulting amino acid sequence for EaDHAsyn4 (approximately last 35 amino acids) is highly divergent and smaller than the other three EaDHAsyn proteins.
  • EaDHAsyn2 (SEQ ID NO:96), EaDHAsyn3 (SEQ ID NO:97), and EaDHAsyn4 (SEQ ID NO:98) were 70% (558/791), 70% (558/791), 70% (559/791) and 70% (548/775) identical, respectively.
  • EgDHAsyni SEQ ID NO: 12
  • EgDHAsyn2 SEQ ID NO:22
  • all four EaDHAsyn sequences have a proline-rich linker region (from approximately P300 to T332 based on numbering for EaDHAsyn 1).
  • the linker appears to be slightly longer than that for EgDHAsyni (SEQ ID NO: 12) or EgDHAsyn2 (SEQ ID NO:22).
  • EaDHAsyn sequences also lack the NG repeat motif found upstream of the proline-rich motif of EgDH Asyni and EgDHAsyn2; but, this region, as was the case for EgDHAsyni and EgDHAsyn2, is also slightly proline-rich in all four EaDHAsyn sequences and may play a role in the linker function.
  • the nucleotide sequences for the C20 elongase domains of EaDHAsyni , EaDHAsyn2, EaDHAsyn3, and EaDHAsyn4 are set forth in SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, and SEQ ID NO:230, respectively.
  • the amino acid sequences for the C20 elongase domains of EaDHAsyni , EaDHAsyn2, and EaDHAsyn3 are set forth in SEQ ID NO:231 , SEQ ID NO:232, and SEQ ID NO:233, respectively.
  • the amino acid sequence of the C20 elongase domain of EaDHAsyn4 is identical to that for EaDHAsyni .
  • the nucleotide and amino acid sequences for the proline-rich linker of EaDHAsyni are set forth in SEQ ID NO:234 and SEQ ID NO:235, respectively.
  • the nucleotide and amino acid sequences for the proline-rich linkers of EaDHAsyn2, EaDHAsyn3, and EaDHAsyn4 are identical to that for EaDHAsyni .
  • the nucleotide sequences for the delta-4 desaturase domain 1 of each of EaDHAsyni , EaDHAsyn2, and EaDHAsyn4 are set forth in SEQ ID NO:236, SEQ ID NO:237, and SEQ ID NO:238, respectively.
  • the amino acid sequences for the delta-4 desaturase domains of EaDHAsyni , EaDHAsyn2, and EaDHAsyn4 are set forth in SEQ ID NO:239, SEQ ID NO:240, and SEQ ID NO:241 , respectively.
  • the nucleotide and amino acid sequence of the delta-4 desaturase domain 1 of EaDHAsyn3 is identical to that of EaDHAsyni .
  • the nucleotide sequences for the delta-4 desaturase domain 2 of EaDHAsyni , EaDHAsyn2, EaDHAsyn3, and EaDHAsyn4, including the proline-rich linker and a portion of the 3' end of the C20 elongase domain, are set forth in SEQ ID NO:242, SEQ ID NO:243, SEQ ID NO:244, and SEQ ID NO:245, respectively.
  • EaDHAsyni The amino acid sequences for the delta-4 desaturase domains of EaDHAsyni , EaDHAsyn2, EaDHAsyn3, and EaDHAsyn4 are set forth in SEQ ID NO:246, SEQ ID NO:247, SEQ ID NO:248, and SEQ ID NO:249, respectively.
  • FIG. 29 summarizes the Euglena anabaena DHA synthase domain sequences.
  • the cDNA inserts from pLF117-1 (SEQ ID NO:87), pLF117-2 (SEQ ID NO:88), pLF117-3 (SEQ ID NO:89), and pLF117-4 (SEQ ID NO:90) were transferred to pY159 (SEQ ID NO:38; Example 8) to form pY165 (SEQ ID NO:99, FIG. 14A) 1 pY166 (SEQ ID NO:100; FIG. 14B), pY167 (SEQ ID NO:101 ; FIG. 14C), and pY168 (SEQ ID NO:102; FIG.
  • each plasmid contains the full length EaDHAsyn gene, under control of the Yarrowia lipolytica FBAINm promoter (PCT Publication No. WO 2005/049805; U.S. Patent 7,202,356; labeled as "Yar Fba1 Pro+lntron” in FIG.), and the Pex20 terminator sequence from Yarrowia Pex20 gene (GenBank Accession No. AF054613).
  • Euplena gracilis DHA Synthase 1 Euplena gracilis DHA Synthase 1
  • SdD17 The present Example describes construction of a soybean vector for co- expression of EgDHAsyni (SEQ ID NO:12) with SdD17 and a hygromycin phosphotransferase selectable marker (hpt).
  • EgDHAsyni was amplified from pKR1049 (clone eeg1c.pkO16.e6.f) with oligonucleotide primers oEGel2-1 (SEQ ID NO:103) and oEUG el4-3 (SEQ ID NO:44), using the PhusionTM High-Fidelity DNA Polymerase (Cat. No. F553S, Finnzymes Oy, Finland) following the manufacturer's protocol.
  • the resulting DNA fragment was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pKR1055 (SEQ ID NO: 104).
  • a starting plasmid pKR72 (ATCC Accession No. PTA-6019; SEQ ID NO: 105, 7085 bp sequence), a derivative of pKS123 which was previously described in PCT Publication No. WO 2002/008269 (the contents of which are hereby incorporated by reference), contains the hygromycin B phosphotransferase gene (HPT) (Gritz, L. and Davies, J., Gene 25:179-188 (1983)), flanked by the T7 promoter and transcription terminator (T7prom/HPT/T7term cassette), and a bacterial origin of replication (ori) for selection and replication in bacteria (e.g., E. coli).
  • HPT hygromycin B phosphotransferase gene
  • T7prom/HPT/T7term cassette flanked by the T7 promoter and transcription terminator
  • ori bacterial origin of replication
  • pKR72 also contains HPT, flanked by the 35S promoter (Odell
  • pKR72 also contains a Not ⁇ restriction site, flanked by the promoter for the ⁇ ' subunit of ⁇ -conglycinin (Beachy et al., EMBO J. 4:3047-3053 (1985)) and the 3' transcription termination region of the phaseolin gene (Doyle et al., J. Biol. Chem.
  • EgDHAsyni was released from pKR1055 (SEQ ID NO: 104) by digestion with Nott and was cloned into the ⁇ /ofl site of plasmid pKR179 (SEQ ID NO: 108) to produce pKR1057 (SEQ ID NO: 109).
  • the Sbfl fragment of pKR1057 (SEQ ID NO:109), containing the ⁇ con/EgDHAsyn1/Phas3' cassette was cloned into the Sbfl site of pKR328 (SEQ ID NO: 110; which is described in PCT Publication No.
  • FIG. 15A A schematic depiction of pKR1061 is shown in FIG. 15A.
  • Soybean Expression Vector pKR973 For Co-Expression of the Paylova lutheri Delta-8 Desaturase (PavD8) With the Euglena gracilis Delta-9 Elongase (EgD9elo) and the Mortierella alpina Delta-5 Desaturase (MaD5)
  • Euplena gracilis delta-9 elongase (EgD9elo):
  • BB-1562 the contents of which are hereby incorporated by reference
  • oligonucleotide primers oEugEL1-1 (SEQ ID NO:113) and oEugEL1-2 (SEQ ID NO:114) using the VentR® DNA Polymerase (Cat. No. M0254S, New England Biolabs Inc., Beverly, MA) following the manufacturer's protocol.
  • the resulting DNA fragment was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pKR906 (SEQ ID NO:115). Plasmid pKR906 was digested with Not ⁇ , and the fragment containing the
  • Euglena gracilis delta-9 elongase was cloned into plasmid pKR132 (SEQ ID NO: 116; which is described in PCT Publication No. WO 2004/071467) to produce pKR953 (SEQ ID NO:117).
  • Mortierella alpina delta-5 desaturase (MaD5): Vector pKR287 (SEQ ID NO:118; which is described in PCT Publication No.
  • WO 2004/071467 published August 26, 2004; the contents of which are hereby incorporated by reference
  • flanked by the soybean glycinin Gy1 promoter and the pea leguminA2 3' termination region (Gy1/MaD5/legA2 cassette).
  • Vector pKR287 was digested with Sbf ⁇ /Bsi ⁇ N ⁇ , and the fragment containing the Gy1/MaD5/legA2 cassette was cloned into the Sbft/BsN ⁇ l ⁇ fragment of pKR277 (SEQ ID NO: 120; which is described in PCT Publication No. WO 2004/071467, the contents of which are hereby incorporated by reference) to produce pK952 (SEQ ID NO:121).
  • Vector pKR457 (SEQ ID NO: 122), which was previously described in PCT Publication No. WO 2005/047479 (the contents of which are hereby incorporated by reference), contains a Not ⁇ site flanked by the Kunitz soybean Trypsin Inhibitor (KTi) promoter (Jofuku et al., Plant Cell 1 :1079-1093 (1989)) and the KTi 3' termination region, the isolation of which is described in U.S. Patent No. 6,372,965, followed by the soy albumin transcription terminator, which was previously described in PCT Publication No. WO 2004/071467 (Kti/ ⁇ /ofl/Kti3'Salb3' cassette).
  • KTi Kunitz soybean Trypsin Inhibitor
  • Pavlova lutheri was obtained from the Culture of Marine Phytoplankton (CCMP, West Boothbay Harbor, ME) and grown in 250 ml_ flasks containing 50 mL of F/2-Si medium (made using F/2 Family Medium Kit-KIT20F2 and Filtered Seqwater-SEA2 from CCMP) at 26 0 C with shaking at 150 rpm. Cultures were transferred to new medium on a weekly basis using a 1 :4 (old culture:new medium) dilution. Cultures from 28 flasks (1400 mL) were combined, and cells were pelleted by centrifugation at 1 ,800 x g for 10 min, washed once with water, and re-centrifuged.
  • cDNA was synthesized from 224 ng of mRNA using the SuperscriptTM First- Strand Synthesis System for RT-PCR Kit (InvitrogenTM Life Technologies, Carlsbad, CA) with the provided oligo(dT) primer, according to the manufacturer's protocol.
  • the Pavlova lutheri delta-8 desaturase (PavD8; SEQ ID NO:124; which is described in U.S. Patent Application No. 11/737772 (filed April 20, 2007; Attorney Docket No.
  • cDNA (2 ⁇ L) from the reaction described above was combined with 50 pmol of PvDES5'Not-1 (SEQ ID NO:125), 50 pmol of PvDES3'Not-1 (SEQ ID NO:126), 1 ⁇ L of PCR nucleotide mix (10 mM, Promega, Madison, Wl), 5 ⁇ L of 10X PCR buffer (Invitrogen Corporation), 1.5 ⁇ L of MgCl2 (50 mM, Invitrogen Corporation), 0.5 ⁇ L of Taq polymerase (Invitrogen Corporation), and water to 50 ⁇ L.
  • the reaction conditions were 94 0 C for 3 min followed by 35 cycles of 94 0 C for 45 sec, 55 0 C for 45 sec, and 72 0 C for 1 min.
  • the PCR was finished at 72 0 C for 7 min, and then held at 4 0 C.
  • the PCR reaction was analyzed by agarose gel electrophoresis on 5 ⁇ L, and a DNA band with molecular weight around 1.3 kb was observed.
  • the remaining product was separated by agarose gel electrophoresis, and the DNA was purified using the ZymocleanTM Gel DNA Recovery Kit (Zymo Research, Orange, CA), following the manufacturer's protocol.
  • Plasmid pKR953 (SEQ ID NO:117) was digested with Pst ⁇ , and the fragment containing the Euglena gracilis delta-9 elongase was cloned into the Sbft site of pKR970 (SEQ ID NO:127) to produce pKR973 (SEQ ID NO:128, FIG. 15B).
  • the Pavlova lutheri delta-8 desaturase could be co-expressed with the Mortierella alpina delta-5 desaturase and the Euglena gracilis delta-9 elongase behind strong, seed-specific promoters.
  • Soybean Expression Vector pKR1064 For Co-Expression of the Eu ⁇ lena gracilis DHA Synthase 1 (EgDHAsvnP With the Saprolepnia diclina Delta-17 Desaturase (Sd D 17)
  • the present Example describes construction of a soybean vector for co- expression of EgDHAsyni with SdD17 and the acetolactate synthase (ALS) selectable marker.
  • Soybean Expression Vector pKR1133 For Co-Expression of the Euplena gracilis DHA Synthase 1 (EqDHAsvnP With the Eu ⁇ lena gracilis Delta-9
  • Elongase EqD9elo
  • Mortierella alpina Delta-5 Desaturase MaD5
  • the glycinin Gy1 promoter was PCR amplified from pZBL119 (SEQ ID NO: 1
  • the Pstt/Nott fragment of plasmid pSGIy32 (SEQ ID NO:136), containing the Gy1 promoter, was cloned into the Pst ⁇ /Not ⁇ fragment from plasmid pKR142 (SEQ ID NO:137; which is described in PCT Publication No. WO 2004/071467 and the contents of which are hereby incorporated by reference), containing the leguminA2 3' transcription termination region, an ampicillin resistance gene, and bacterial oh, to produce pKR264 (SEQ ID NO: 138).
  • vector pKR264 contains a ⁇ /ofl site flanked by the promoter for the glycinin Gy1 gene and the leguminA2 3' transcription termination region (Gy1/ ⁇ /of//legA2 cassette).
  • EgDHAsyni was released from pKR1055 (SEQ ID NO: 104; Example 15) by digestion with Nott and was cloned into the ⁇ /ofl site of plasmid pKR264 (SEQ ID NO:138), to produce pKR1128 (SEQ ID NO:139).
  • Vector pKR606 (SEQ ID NO: 141) was digested with 8s/WI and after filling to blunt the ends, the fragment containing the Gy1/MaD5/legA2 cassette was cloned into the filled ⁇ / ⁇ /oMI site of pKR277 (SEQ ID NO: 120) to produce pKR804 (SEQ ID NO:142).
  • the Bs/WI fragment from pKR1128 (SEQ ID NO: 139), containing the Gy1/EgDHAsyn1/legA2 cassette, was cloned into the Ss/WI site of pKR804 (SEQ ID NO:142) to produce pKR1130 (SEQ ID NO:143).
  • Plasmid pKR953 (SEQ ID NO:117) was digested with ⁇ s ⁇ /VI; ends were blunted by filling; and pKR953 was then digested with BamH ⁇ .
  • Plasmid pKR1131 (SEQ ID NO: 144) was digested with Pst ⁇ and the fragment containing the Euglena gracilis delta-9 elongase was cloned into the Sbft site of pKR1130 (SEQ ID NO:143) to produce pKR1133 (SEQ ID NO:145, FIG. 15D).
  • the Euglena gracilis DHA synthase 1 could be co-expressed with the Mortierella alpina delta-5 desaturase and the Euglena gracilis delta-9 elongase behind strong, seed-specific promoters.
  • Soybean Expression Vector pKR1105 For Co-Expression of the Eu ⁇ lena gracilis DHA Synthase 1 C20 Elongase Domain (EqDHAsvn1C20EloDom1) with the Schizochyt ⁇ um apprepatum Delta-4 Desaturase (SaD4)
  • the ⁇ con/ ⁇ /of//Phas cassette was PCR amplified from pKS123 (SEQ ID NO:146; which is described in PCT Publication No. WO 2004/071467 and the contents of which are hereby incorporated by reference) using primers oKti ⁇ (SEQ ID NO:147) and oKti ⁇ (SEQ ID NO:148).
  • the resulting PCR fragment was digested with Bsi ⁇ N ⁇ and cloned into the Bs ⁇ /l site of pKR124 (SEQ ID NO:149; which is described in PCT Publication No. WO 2004/071467 and the contents of which are hereby incorporated by reference), containing the bacterial origin of replication and selection, to produce plasmid pKR193 (SEQ ID NO: 150).
  • EgDHAsyn1C20Elodom1 was released from pHD16 (SEQ ID NO:51 ; Example 10) by digestion with Not ⁇ and was cloned into the Not ⁇ site of plasmid pKR193 (SEQ ID NO: 150) to produce pKR1103 (SEQ ID NO: 151).
  • Vector pKR300 (SEQ ID NO:153; which is described in PCT Publication No.
  • WO 2004/071467 contains the Schizochytrium aggregatum delta-4 desaturase (SaD4), which is described in U.S. Patent No. 7,045,683 and PCT Publication No WO 02/090493, the contents of which are hereby incorporated by reference), flanked by the ⁇ /ofl restriction sites.
  • SaD4 Schizochytrium aggregatum delta-4 desaturase
  • the Asc ⁇ site present within the SaD4 was removed without affecting the corresponding amino acid sequence to produce a new sequence (SEQ ID NO: 154) which remains flanked by the ⁇ /ofl sites.
  • the ⁇ /ofl fragment (SEQ ID NO: 154) was cloned into the ⁇ /ofl site of plasmid pKR457 (SEQ ID NO: 122; Example 16) to produce pKR1102 (SEQ ID NO: 155).
  • Plasmid pKR1102 (SEQ ID NO: 155) was digested with Pst ⁇ , and the fragment containing the SaD4 was cloned into the Sbft site of pKR1104 (SEQ ID NO:152) to produce pKR1105 (SEQ ID NO:156; FIG. 16A).
  • the Euglena gracilis DHA synthase 1 C20 elongase domain could be co-expressed with the Schizochytrium aggregatum delta-4 desaturase behind strong, seed-specific promoters.
  • Soybean Expression Vector pKR1134 For Expression of the Euglena gracilis DHA Synthase 1 C20 Elongase Doma ⁇ n/Schizochvthum apprepatum Delta-4 Desaturase Fusion (EgDHAsvn1C20EloDom3-SaD4)
  • EgDHAsyn1C20EloDom3 was amplified from pKR1091 with oligonucleotide primers EgEPAEIoDom-5 (SEQ ID NO:43) and oEUGsyn6-4 (SEQ ID NO: 157) using the PhusionTM High-Fidelity DNA Polymerase (Cat. No. F553S, Finnzymes Oy, Finland) following the manufacturer's protocol.
  • the resulting DNA fragment was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pKR1107 (SEQ ID NO:158).
  • Plasmid pKR1107 (SEQ ID NO: 158) was digested with ⁇ /ofl, and the fragment containing the EgDHAsyn1C20EloDom3 was religated to form pKR1112 (SEQ ID NO:159).
  • the Xba ⁇ /Pst ⁇ DNA fragment from pKR1112 (SEQ ID NO:159), containing EgDHAsyn1C20EloDom3, was cloned into the Xba ⁇ /Sbft DNA fragment from pKR1068 (SEQ ID NO:75; Example 11), containing the SaD4, to produce pKR1115 (SEQ ID NO: 160).
  • the EgDHAsyn1C20Elodom3-SaD4 was re-created without an internal Sbft site but codes for an identical amino acid sequence as that described in Example 11.
  • EgDHAsyn1C20Elodom3-SaD4 was released from pKR1115 (SEQ ID NO: 160) by digestion with Not ⁇ and was cloned into the Not ⁇ site of plasmid pKR1104 (SEQ ID NO:152), containing an ALS selectable marker, to produce pKR1134 (SEQ ID NO:161 ; FIG. 16B).
  • EXAMPLE 21 EXAMPLE 21
  • Saprolegnia diclina Delta-17 Desaturase (SdD17)
  • the present Example describes construction of a soybean vector for co- expression of TpomD8 with SdD17 and a hygromycin phosphotransferase selectable marker (hpt).
  • Tetruetreptia pomquetensis CCMP1491 cells (from 1 liter of culture) were purchased from the Provasoli-Guillard National Center for Culture of Marine Phytoplakton (CCMP) (Bigelow Laboratory for Ocean Sciences, West Boothbay Harbor, Maine). Total RNA was isolated using the trizol reagent (Invitrogen, Carlsbad, CA), according to the manufacturer's protocol. The cell pellet was resuspended in 0.75 mL of trizol reagent, mixed with 0.5 mL of 0.5 mm glass beads, and homogenized in a Biospec mini beadbeater (Bartlesville, OK) at the highest setting for 3 min.
  • Trizol reagent Invitrogen, Carlsbad, CA
  • the cell pellet was resuspended in 0.75 mL of trizol reagent, mixed with 0.5 mL of 0.5 mm glass beads, and homogenized in a Biospec mini beadbeater (Bartlesville, OK) at the highest
  • the mixture was centrifuged in an Eppendorf centrifuge for 30 sec at 14,000 rpm to remove debri and glass beads. Supernatant was extracted with 150 ⁇ L of 24:1 chloroform :isoamy alcohol. The upper aqueous phase was used for RNA isolation.
  • RNA isolation the aqueous phase was mixed with 0.375 mL of isopropyl alcohol and allowed to incubate at room temperature for 5 min. Precipitated RNA was collected by centrifugation at 8,000 rpm and kept at 4 0 C for 5 min. The pellet was washed once with 0.7 mL of 80% ethanol and air dried. Thus, 95 ⁇ g of total RNA were obtained from Tetruetreptia pomquetensis CCMP1491.
  • Total RNA (0.95 ⁇ g of total RNA in 1 ⁇ L) was used as template to synthesize double stranded cDNA.
  • the CreatorTM SMARTTM cDNA Library Construction Kit from BD Bioscience Clontech (Palo Alto, CA) was used.
  • Total RNA (1 ⁇ L) was mixed with 1 ⁇ L of SMART IV oligonucleotide (SEQ ID NO:181) 1 ⁇ L of the Adaptor Primer from Invitrogen 3'-RACE kit (SEQ ID NO:182), and 2 ⁇ L of water. The mixture was heated to 75 0 C for 5 min and then cooled on ice for 5 min.
  • SEQ ID NO:162 which is described in U.S. Patent Application No. 11/876,115 (filed October 22, 2007; Attorney Docket No. BB-1574) the contents of which are hereby incorporated by reference
  • TpomNot-5 SEQ ID NO:163
  • TpomNot-3 SEQ ID NO:164
  • Tetruetreptia pomquetensis CCMP1491 cDNA (1 ⁇ L) was combined with 50 pmol of TpomNot-5 (SEQ ID NO: 163), 50 pmol of TpomNot-3 (SEQ ID NO: 164), 1 ⁇ L of PCR nucleotide mix (10 mM, Promega, Madison, Wl), 5 ⁇ L of 10X PCR buffer (Invitrogen Corporation), 1.5 ⁇ L of MgC ⁇ (50 mM, Invitrogen Corporation), 0.5 ⁇ L of Taq polymerase (Invitrogen Corporation) and water to 50 ⁇ L.
  • the reaction conditions were 94 0 C for 3 min followed by 35 cycles of 94 0 C for 45 sec, 55 0 C for 45 sec and 72 0 C for 1 min.
  • the PCR was finished at 72 0 C for 7 min and then held at 4 0 C.
  • 5 ⁇ L of the PCR reaction were analyzed by agarose gel electrophoresis, and a DNA band with molecular weight around 1.3 kb was observed.
  • the remaining product was separated by agarose gel electrophoresis, and the DNA was purified using the ZymocleanTM Gel DNA Recovery Kit (Zymo
  • Example 15 to produce pKR1002 (SEQ ID NO:166).
  • TPomD8 was released from pLF114-10 (SEQ ID NO: 165; Example 21) by digestion with ⁇ /ofl and was cloned into the Not ⁇ site of plasmid pKR264 (SEQ ID NO:138; Example 18) to produce pKR1127 (SEQ ID NO:168).
  • the SsM/l fragment from pKR1127 (SEQ ID NO: 168), containing the Gy1/TPomD8/legA2 cassette, was cloned into the Bs/WI site of pKR804 (SEQ ID NO:142; Example 18) to produce pKR1129 (SEQ ID NO:169).
  • Plasmid pKR1131 (SEQ ID NO: 144; Example 18) was digested with Pst ⁇ , and the fragment containing the Euglena gracilis delta-9 elongase was cloned into the Sbf ⁇ site of pKR1129 (SEQ ID NO:169) to produce pKR1132 (SEQ ID NO:170, FIG. 16D).
  • tbeTetruetreptia pomquetensis delta-8 desaturase could be co- expressed with the Mortierella alpina delta-5 desaturase and the Euglena gracilis delta-9 elongase behind strong, seed-specific promoters.
  • Soybean Expression Vector KS373 For Expression of a Euglena gracilis Delta-9 Elonqase/Etvg/ena gracilis DHA Synthase 1 Linker/Pav7oi/a lutheri Delta-8 Desaturase Fusion (EqD9elo-EqDHAsvn1 Link-PavD8)
  • EgD9elo was amplified with oligonucleotides MWG507 (SEQ ID NO: 172) and MWG509 (SEQ ID NO:173), using the PhusionTM High-Fidelity DNA Polymerase (Cat. No. F553S, Finnzymes Oy, Finland), following the manufacturer's protocol.
  • EgDHAsyni Link (SEQ ID NO: 171) was amplified in a similar way with oligonucleotides MWG510 (SEQ ID NO:174) and MWG511 (SEQ ID NO:175). The two resulting PCR products were combined and re-amplifed using MWG507 (SEQ ID NO:172) and MWG511 (SEQ ID NO: 175) to form EgD9elo-EgDHAsyn1 Link. The sequence of the EgD9elo- EgDHAsyni Link is shown in SEQ ID NO:176.
  • EgD9elo-EgDHAsyn1 Link does not contain an in-frame stop codon upstream of the ⁇ /ofl site at the 3' end, and therefore, a DNA fragment cloned into the ⁇ /ofl site can give rise to an in-frame fusion with the EgD9elo-EgDHAsyn1 Link.
  • Plasmid KS366 (SEQ ID NO: 177) contains unique ⁇ /col and ⁇ /ofl restriction sites, flanked by the promoter for the ⁇ ' subunit of ⁇ -conglycinin (Beachy et al., EMBO J. 4:3047-3053 (1985)) and the 3' transcription termination region of the phaseolin gene (Doyle et al., J. Biol. Chem. 261 :9228-9238 (1986)).
  • the Bcon/NcolNotl/PhasS' cassette in KS366 is identical to that found in pKR72 (SEQ ID NO:105), except that the flanking Hind ⁇ sites were replaced by BamH ⁇ sites.
  • the Bcon/NcolNotl/PhasS' cassette of KS366 was cloned into the BamH ⁇ site of pBluescript Il SK(+) vector (Stratagene).
  • DHA Synthases C20 Elongase Domains, Delta-4 Desaturase Domains.
  • synthetic fusions between an elongase domain and a desaturase domain separated by a suitable linker region could be made and expressed.
  • a synthetic fusion between a C20 elongase (or C20 elongase domain from a DHA synthase) and a suitable delta-4 desaturase (or delta-4 desaturase domain from a DHA synthase) could be made and expressed.
  • other elongases or desaturases could be used such as, but not limited to, the synthetic fusion described herein between a delta-9 elongase and delta-8 desaturase separated by a linker from a DHA synthase (i.e., Example 23).
  • PCT Publication Nos. WO 2004/071467 and WO 2004/071178 describe the isolation of a number of promoter and transcription terminator sequences for use in embryo-specific expression in soybean.
  • PCT Publication Nos. WO 2004/071467, WO 2005/047479 and WO 2006/012325 describe the synthesis of multiple promoter/gene/terminator cassette combinations by ligating individual promoters, genes, and transcription terminators together in unique combinations.
  • a Not ⁇ site flanked by the suitable promoter such as those listed in, but not limited to, Table 21
  • a transcription terminator such as those listed in, but not limited to, Table 22
  • Not ⁇ sites can be added to a gene of interest such as those listed in, but not limited to, Table 23 using PCR amplification with oligonucleotides designed to introduce ⁇ /ofl sites at the 5' and 3' ends of the gene.
  • the resulting PCR product is then digested with ⁇ /ofl and cloned into a suitable promoter/ ⁇ /ofl/terminator cassette.
  • PCT Publication Nos. WO 2004/071467, WO 2005/047479 and WO 2006/012325 describe the further linking together of individual gene cassettes in unique combinations, along with suitable selectable marker cassettes, in order to obtain the desired phenotypic expression. Although this is done mainly using different restriction enzymes sites, one skilled in the art can appreciate that a number of techniques can be utilized to achieve the desired promoter/gene/transcription terminator combination. In so doing, any combination of embryo-specific promoter/gene/transcription terminator cassettes can be achieved. One skilled in the art can also appreciate that these cassettes can be located on individual DNA fragments or on multiple fragments where co-expression of genes is the outcome of co-transformation of multiple DNA fragments.
  • Soybean embryogenic suspension cultures (cv. Jack) are maintained in 35 ml_ liquid medium SB196 (infra) on a rotary shaker, 150 rpm, 26 0 C with cool white fluorescent lights on 16:8 hr day/night photoperiod at light intensity of 60-85 ⁇ E/m2/s. Cultures are subcultured every 7 days to two weeks by inoculating approximately 35 mg of tissue into 35 ml_ of fresh liquid SB196 (the preferred subculture interval is every 7 days).
  • Soybean embryogenic suspension cultures are transformed with the soybean expression plasmids by the method of particle gun bombardment (Klein et al., Nature 327:70 (1987)) using a DuPont Biolistic PDS1000/HE instrument (helium retrofit) for all transformations.
  • Soybean cultures are initiated twice each month with 5-7 days between each initiation. Pods with immature seeds from available soybean plants are picked 45- 55 days after planting. Seeds are removed from the pods and placed into a sterilized magenta box. The soybean seeds are sterilized by shaking them for 15 min in a 5% Clorox solution with 1 drop of Ivory soap (i.e., 95 ml_ of autoclaved distilled water plus 5 ml_ Clorox and 1 drop of soap, mixed well). Seeds are rinsed using 2 1 -liter bottles of sterile distilled water and those less than 4 mm are placed on individual microscope slides. The small end of the seed is cut and the cotyledons are pressed out of the seed coat.
  • Ivory soap i.e., 95 ml_ of autoclaved distilled water plus 5 ml_ Clorox and 1 drop of soap, mixed well.
  • Either an intact plasmid or a DNA plasmid fragment containing the genes of interest and the selectable marker gene are used for bombardment. Fragments from soybean expression plasmids, the construction of which is described herein, are obtained by gel isolation of digested plasmids. In each case, 100 ⁇ g of plasmid DNA is used in 0.5 ml_ of the specific enzyme mix described below.
  • Plasmids are digested with Asc ⁇ (100 units) in NEBuffer 4 (20 mM Tris-acetate, 10 mM magnesium acetate, 50 mM potassium acetate, 1 mM dithiothreitol, pH 7.9), 100 ⁇ g/mL BSA, and 5 mM beta-mercaptoethanol at 37 0 C for 1.5 hr.
  • the resulting DNA fragments are separated by gel electrophoresis on 1% SeaPlaque GTG agarose (BioWhitaker Molecular Applications), and the DNA fragments containing gene cassettes are cut from the agarose gel.
  • DNA is purified from the agarose using the GELase digesting enzyme following the manufacturer's protocol.
  • a 50 ⁇ l_ aliquot of sterile distilled water containing 3 mg of gold particles (3 mg gold) is added to 30 ⁇ L of a 10 ng/ ⁇ L DNA solution (either intact plasmid or DNA fragment prepared as described herein), 25 ⁇ L 5M CaCI 2 , and 20 ⁇ L of 0.1 M spermidine.
  • the mixture is shaken 3 min on level 3 of a vortex shaker and spun for 10 sec in a bench microfuge. The supernatant is removed, followed by a wash with 400 ⁇ l_ 100% ethanol and another brief centrifugation. The 400 ul ethanol is removed, and the pellet is resuspended in 40 ⁇ l_ of 100% ethanol.
  • Five ⁇ L of DNA suspension is dispensed to each flying disk of the Biolistic PDS1000/HE instrument disk. Each 5 ⁇ L aliquot contains approximately 0.375 mg gold per bombardment (e.g., per disk).
  • the protocol is identical except for a few minor changes (i.e., 1 mg of gold particles is added to 5 ⁇ L of a 1 ⁇ g/ ⁇ L DNA solution; 50 ⁇ L of a 2.5M CaCfe is used; and the pellet is ultimately resuspended in 85 ⁇ L of 100% ethanol thus providing 0.058 mg of gold particles per bombardment).
  • Approximately 150-200 mg of seven day old embryogenic suspension cultures is placed in an empty, sterile 60 x 15 mm petri dish, and the dish is covered with plastic mesh.
  • the chamber is evacuated to a vacuum of 27-28 inches of mercury, and tissue is bombarded one or two shots per plate with membrane rupture pressure set at 1100 PSI.
  • Tissue is placed approximately 3.5 inches from the retaining /stopping screen.
  • Model system transformation conditions are identical except 100-150 mg of embryogenic tissue is used; rupture pressure is set at 650 PSI; and tissue is placed approximately 2.5 inches from the retaining screen.
  • Transformed embryos are selected either using hygromycin (when the hygromycin B phosphotransferase (HPT) gene is used as the selectable marker) or chlorsulfuron (when the acetolactate synthase (ALS) gene is used as the selectable marker).
  • HPT hygromycin B phosphotransferase
  • ALS acetolactate synthase
  • the tissue is placed into fresh SB196 media and cultured as described above.
  • the SB196 is exchanged with fresh SB196 containing either 30 mg/L hygromycin or 100 ng/mL chlorsulfuron, depending on the selectable marker used.
  • the selection media is refreshed weekly.
  • green, transformed tissue is observed growing from untransformed, necrotic embryogenic clusters.
  • Transformed embryogenic clusters are cultured for four-six weeks in multiwell plates at 26 0 C in SB196 under cool white fluorescent (Phillips cool white Econowatt F40/CW/RS/EW) and Agro (Phillips F40 Agro) bulbs (40 watt) on a 16:8 hr photoperiod with light intensity of 90-120 ⁇ E/m 2 s. After this time, embryo clusters are removed to a solid agar media, SB166, for one-two weeks and then subcultured to SB103 medium for 3-4 weeks to mature embryos. After maturation on plates in SB103, individual embryos are removed from the clusters, dried, and screened for alterations in their fatty acid compositions as described supra.
  • embryos are matured in soybean histodifferentiation and maturation liquid medium (SHaM liquid media; Schmidt et al., Cell Biology and Morphogenesis 24:393 (2005)), using a modified procedure. Briefly, after 4 weeks of selection in SB196, as described above, embryo clusters are removed to 35 ml_ of SB228 (SHaM liquid media) in a 250 ml_ Erlenmeyer flask. Tissue is maintained in SHaM liquid media on a rotary shaker at 130 rpm and 26 0 C, with cool white fluorescent lights on a 16:8 hr day/night photoperiod at a light intensity of 60-85 ⁇ E/m2/s for 2 weeks as embryos matured. Embryos grown for 2 weeks in SHaM liquid media are equivalent in size and fatty acid content to embryos cultured on SB166/SB103 for 5-8 weeks.
  • 2,4-D Stock Obtain premade from Phytotech Cat. No. D 295 - concentration 1 mg/mL B5 Vitamins Stock (per 100 m ⁇ Store aliquots at -20 0 C 10 g myo-inositol 100 mg nicotinic acid 100 mg pyridoxine HCI
  • Sorbitol 30 g Adjust volume to 900 mL pH 5.8 Autoclave
  • Bottle(s) should be wrapped in foil to omit light. Autoclave
  • the tissue is divided between 2 flasks with fresh SB196 media and cultured as described in Example 25.
  • the SB196 is exchanged with fresh SB196 containing selection agent of 100 ng/mL chlorsulfuron (chlorsulfuron stock is 1 mg/mL in 0.01 N ammonium hydroxide).
  • the selection media is refreshed weekly.
  • green, transformed tissue may be observed growing from untransformed, necrotic embryogenic clusters. Isolated, green tissue is removed and inoculated into multiwell plates containing SB196, and embryos are matured as described in Example 25.
  • Embryos are matured as described in Example 25. After subculturing on medium SB103 for 3 weeks, individual embryos can be removed from the clusters and screened for alterations in their fatty acid compositions as described herein. It should be noted that any detectable phenotype, resulting from the expression of the genes of interest, could be screened at this stage. This would include, but not be limited to, alterations in fatty acid profile, protein profile and content, carbohydrate content, growth rate, viability, or the ability to develop normally into a soybean plant.
  • Matured individual embryos are desiccated by placing them into an empty, small petri dish (35 x 10 mm) for approximately 4 to 7 days. The plates are sealed with fiber tape (creating a small humidity chamber). Desiccated embryos are planted into SB71-4 medium where they are left to germinate under the same culture conditions described above. Germinated plantlets are removed from germination medium and rinsed thoroughly with water and then are planted in Redi- Earth in 24-cell pack tray, covered with clear plastic dome. After 2 weeks the dome is removed, and plants are hardened off for a further week. If plantlets look hardy, they are transplanted to 10" pot of Redi-Earth with up to 3 plantlets per pot. After 10 to 16 weeks, mature seeds are harvested, chipped, and analyzed for fatty acids.
  • the fatty acid profile for Yarrowia expressing pBY-EgC20elo1 showed no elongation of EPA to DPA.
  • the fatty acid profiles, calculated % elongation and calculated % desaturation for the remaining clones are shown in FIG. 18.
  • Percent C20 elongation (% C20 Elong) was calculated by dividing the sum of the weight percent (wt. %) for DPA and DHA by the sum of the wt. % for EPA, DPA and DHA and multiplying by 100 to express as a %.
  • percent delta-4 desaturation % D4 Desat
  • Averages are indicated by Ave. followed by appropriate header.
  • EaDHAsyn4 functioned as both C20 elongases (elongating EPA to DPA) and as delta-4 desaturases (desaturating DPA to DHA) in Yarrowia.
  • EaDHAsyn4 which contained a substantially different amino acid sequence at the C-terminus due to a frameshift in the nucleotide sequence, had considerably lower elongation function and no desaturase activity was detected.
  • EgDHAsyni in pY132 consistently resulted in higher activity in Yarrowia when compared to the other EgDHAsyni constructs, likely due to the fact that EgDHAsyni was expressed as an in-frame fusion between some vector sequence, the 5' UTR of EgDHAsyni and the EgDHAsyni coding sequence.
  • the resulting fusion created may lead to enhanced activity because of enhanced expression in Yarrowia or because of an inherent increase in activity to the enzyme itself.
  • EgDHAsyni * When only the coding sequence of EgDHAsyni * is expressed (i.e., with no 5'UTR; see pY141), the activity is higher than when the 5'UTR is present but not translated as a fusion (i.e., see pY161). This observation is likely due to a decrease in expression of EgDHAsyni due to the presence of the 5'UTR.
  • EgDHAsyni* (pY141 ; SEQ ID NO:49; see FIG. 20) fed DPA.
  • the lgD4 has no delta-4 desaturase activity when expressed alone (pY150; SEQ ID NO:62; see FIG. 20) or as a fusion (pY156; SEQ ID NO:64; see FIGs. 19 and 20) and even causes an approximately 50% decrease in elongation activity when fused to the EgDHAsyn1C20 elongase domain (pY156; SEQ ID NO:64; see FIG. 19), possibly due to incorrect folding.
  • the SaD4 expressed alone (pY151 ; SEQ ID NO:76; see FIG.
  • Yarrowia cells transformed with pY141 EgDHAsyni * ; SEQ ID NO:49
  • a vector only control and fatty acid profiles were analyzed as described in Example 27.
  • % for docosatetraenoic acid [DTA; 22:4 (7,10,13,16)] and omega-6 docosapentaenoic acid [DPAn-6; 22:5(4,7,10,13,16)] by the sum of the wt. % for ARA, DTA and DPAn- 6 and multiplying by 100 to express as a %.
  • percent delta-4 desaturation (% D4 Desat) when fed ARA was calculated by dividing the wt. % for DPAn-6 by the sum of the wt. % for DTA and DPAn-6 and multiplying by 100 to express as a %.
  • EgDHAsyni * elongates both ARA and EPA although it has a slight preference (approximately 40% more active) for EPA.
  • the elongation product of ARA i.e., DTA
  • DTA is also desaturated in the delta-4 position by EgDHAsyni to produce DPAn-6 and the activity is approximately 40% higher for DTA than DPA.
  • Mature somatic soybean embryos are a good model for zygotic embryos. While in the globular embryo state in liquid culture, somatic soybean embryos contain very low amounts of triacylglycerol or storage proteins typical of maturing, zygotic soybean embryos. At this developmental stage, the ratio of total triacylglyceride to total polar lipid (phospholipids and glycolipid) is about 1 :4, as is typical of zygotic soybean embryos at the developmental stage from which the somatic embryo culture was initiated. At the globular stage as well, the mRNAs for the prominent seed proteins, ⁇ '-subunit of ⁇ -conglycinin, kunitz trypsin inhibitor 3, and seed lectin are essentially absent.
  • Soybean embryogenic suspension cultures (cv. Jack) were transformed with the Asc ⁇ fragments of pKR973 and pKR1064 (fragments containing the expression cassettes), as described for production in Example 25 and as summarized in Table 26.
  • a subset of soybean embryos generated from each event were harvested and picked into glass GC vials and fatty acid methyl esters were prepared by transesterification.
  • 50 ⁇ l_ of trimethylsulfonium hydroxide (TMSH) and 0.5 ml_ of hexane were added to the embryos in glass vials and incubated for 30 min at room temperature while shaking.
  • Fatty acid methyl esters (5 ⁇ l_ injected from hexane layer) were separated and quantified using a Hewlett-Packard 6890 Gas Chromatograph fitted with an Omegawax 320 fused silica capillary column (Cat. No. 24152, Supelco Inc.).
  • the oven temperature was programmed to hold at 220 0 C for 2.6 min, increase to 240 0C at 20 °C/min and then hold for an additional 2.4 min.
  • Carrier gas was supplied by a Whatman hydrogen generator. Retention times were compared to those for methyl esters of standards commercially available (Nu-Chek Prep, Inc.). Events having good phenotype were re-analyzed by GC using identical conditions except the oven temperature held at 150 0 C for 1 min and then increased to 240 0 C at 5 0 C.
  • the fatty acid profiles for individual embryos from a representative event are shown in FIG. 23.
  • Fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), 18:2(LA), GLA, 18:3 (ALA), EDA, DGLA, ARA, ERA, JUN, EPA, 22:3(10,13,16) (docosatrienoic acid), DTA, DPA and DHA; and, fatty acid compositions listed in FIG. 23 are expressed as a weight percent (wt. %) of total fatty acids.
  • the activity of EgDHAsyni is expressed as percent C20 elongation (% C20
  • Elong and/or percent delta-4 desaturation calculated according to the following formula: ([product]/[substrate + product])*100. More specifically, the percent elongation for EPA is shown as % C20 Elong, determined as: ([DPA + DHA]/[EPA + DPA + DHA])*100. The percent delta-4 desaturation for DPA is shown as % D4 Desat, determined as ([DHA]/[DPA + DHA]) * 100. Other fatty acids that may be elongated or desaturated were not included in this calculation.
  • DGLA is also elongated by the EgDHAsyni as a significant amount of the fatty acid 22:3(10,13,16) was made.
  • the fatty acid was identified as 22:3(10,13,16) because it was found to have a mass for 22:3 by GC- MS and had an MS profile that agrees with that for 22:3(10,13,16).
  • KS373 (SEQ ID NO:179; FIG. 17) and KS120 (which is described in PCT Publication No. WO 2004/071467 and the contents of which are hereby incorporated by reference) as described for the model system in Example 25.
  • KS120 contains the hygromycin selection.
  • KS373, produced in Example 23, enabled expression of a fusion protein comprising the Euglena gracilis delta-9 elongase and the Pavlova lutheri delta-8 desaturase, wherein the two domains were linked with Euglena gracilis DHA Synthase 1 Linker (i.e., EgDHAsyni Link).
  • the fatty acid profiles for five individual embryos from 31 events were obtained as described in Example 30. Results from the five best elongation events are shown in FIG. 24. Fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), 18:2 (LA), GLA, 18:3 (ALA), EDA, DGLA, ERA and ETA; and, fatty acid compositions listed in FIG. 24 are expressed as a weight percent (wt. %) of total fatty acids.
  • EgD9elo-EgDHAsyn1 Link-PavD8 is expressed as percent delta-9 elongation (% D9 Elong) and/or percent delta-8 desaturation (% D8 Desat), calculated according to the following formula: ([product]/[substrate + product])*100.
  • the percent delta-9 elongation is shown as % D9 Elong, determined as: ([EDA + ERA + DGLA + ETA]/[LA + ALA + EDA + ERA + DGLA + ETA]) * 100.
  • the percent delta-8 desaturation is shown as % D8 Desat, determined as ([DGLA + ETA]/[EDA + ERA + DGLA + ETA]) * 100.
  • the best % D9 Elong event had an average elongation of 22.1 % with an average % D8 Desat of 92.7%. Elongation is slightly lower than that seen when the delta-9 elongase is expressed alone in soybean embryos although this might be due to the small numbers of events looked at. In contrast, desaturation is considerably higher when the PavD ⁇ is fused with the EgD9elo and EgDHAsynHink than when the PavD ⁇ is expressed alone in soybean embryos, reaching almost 100% conversion in some events. This enhanced conversion by the delta-8 desaturase might be due to increased efficiency or flux, perhaps due to substrate channeling.
  • EgC20ES From Eualena gracilis in Yarrowia lipolytics
  • the codon usage of the C20 elongase domain of EgDHAsyni (EgDHAsyn1C20EloDom1) of Euglena gracilis was optimized for expression in Yarrowia lipolytica, in a manner similar to that described in PCT Publication No. WO 2004/101753 and U.S. Patent 7,125,672.
  • a codon-optimized C20 elongase gene (designated "EgC20ES” and having the nucleotide sequence as set forth in SEQ ID NO: 183 and the amino acid sequence as set forth in SEQ ID NO:184) was designed, based on the coding sequence of the C20 elongase domain of EgDHAsyni (SEQ ID NO:201), according to the Yarrowia codon usage pattern (PCT Publication No. WO 2004/101753), the consensus sequence around the 'ATG' translation initiation codon, and the general rules of RNA stability (Guhaniyogi, G. and J. Brewer, Gene, 265(1-2):11-23 (2001)).
  • Plasmid pZuFmEgC20ES contained the following components:
  • Plasmid pZuFmEgC20ES was transformed into Yarrowia lipolytica strain Y4184U4, as described in the General Methods. The transformants were selected on MM plates. After 2 days growth at 30 ° C, 10 transformants grown on the MM plates were picked and re-streaked onto fresh MM plates. Once grown, 10 strains were individually inoculated into 3 ml_ liquid MM at 30 C and shaken at 250 rpm/min for 2 days. The cells were collected by centrifugation; lipids were extracted; and fatty acid methyl esters were prepared by trans-esterification and subsequently analyzed with a Hewlett-Packard 6890 GC.
  • EaC20ES (designated “EaC20ES” and having the nucleotide sequence as set forth in SEQ ID NO: 188 and the amino acid sequence as set forth in SEQ ID NO: 189) was designed, based on the coding sequence of the C20 elongase domain of EaDHAsyn2 (SEQ ID NO:92), according to the Yarrowia codon usage pattern (PCT Publication No. WO 2004/101753), the consensus sequence around the 'ATG' translation initiation codon, and the general rules of RNA stability (Guhaniyogi, G. and J. Brewer, Gene, 265(1-2):11-23 (2001)).
  • SEQ ID NO:189 is 100% identical in sequence to amino acids 1-299 of SEQ ID NO:96.
  • the designed EaC20ES gene (SEQ ID NO:188) was synthesized by GenScript Corporation (Piscataway, NJ) and was cloned into pUC57 (GenBank Accession No. Y14837) to generate pEaC20ES (SEQ ID NO:190).
  • Plasmid pZuFmEaC20ES (SEQ ID NO:361) was identical in construction to that of plasmid pZuFmEgC20ES (SEQ ID NO:360; FIG. 52A), with the exception that EaC20ES (SEQ ID NO: 188) was used in place of EgC20ES (SEQ ID NO:183).
  • Plasmid pZuFmEaC20ES (SEQ ID NO:361) was transformed into Yarrowia lipolytica strain Y4184U4, as described in the General Methods. The transformants were selected on MM plates. After 2 days growth at 30 ° C, 20 transformants grown on the MM plates were picked and re-streaked onto fresh MM plates. Once grown, 20 strains were individually inoculated into 3 ml_ liquid MM at 30 ° C and shaken at 250 rpm/min for 2 days. The cells were collected by centrifugation; lipids were extracted; and fatty acid methyl esters were prepared by trans-esterification and subsequently analyzed with a Hewlett-Packard 6890 GC.

Abstract

Isolated nucleic acid fragments and recombinant constructs comprising such fragments encoding multizymes (i.e., single polypeptides having at least two independent and separable enzymatic activities) along with a method of making long-chain polyunsaturated fatty acids (PUFAs) using these multizymes in plants and oleaginous yeast are disclosed.

Description

TITLE MULTIZYMES AND THEIR USE IN MAKING POLYUNSATURATED FATTY ACIDS
This application claims the benefit of U.S. Provisional Application No. 60/909790, filed April 3, 2007, and U.S. Provisional Application No. 61/027898, filed February 12, 2008, the disclosures of which are hereby incorporated in their entirety.
FIELD OF THE INVENTION
This invention is in the field of biotechnology. More specifically, this invention pertains to polynucleotide sequences encoding multizymes and their use in the synthesis of long-chain polyunsaturated fatty acids (PUFAs).
BACKGROUND OF THE INVENTION
The importance of PUFAs is undisputed. For example, certain PUFAs are important biological components of healthy cells and are recognized as: "essential" fatty acids that cannot be synthesized de novo in mammals and instead must be obtained either in the diet or derived by further elongation and desaturation of linoleic acid (LA; 18:2 omega-6) or α-linolenic acid (ALA; 18:3 omega-3); constituents of plasma membranes of cells, where they may be found in such forms as phospholipids or triacylglycerols; necessary for proper development (particularly in the developing infant brain) and for tissue formation and repair; and, precursors to several biologically active eicosanoids of importance in mammals (e.g., prostacyclins, eicosanoids, leukotrienes, prostaglandins). Additionally, a high intake of long-chain omega-3 PUFAs produces cardiovascular protective effects (Dyerberg et al., Amer. J. Clin. Nutr. 28:958-966 (1975); Dyerberg et al., Lancet. 2(8081):117- 119 (1978); Shimokawa, H., World Rev. Nutr. Diet 88:100-108 (2001); von Schacky et al., World Rev. Nutr. Diet 88:90-99 (2001)). Numerous other studies document wide-ranging health benefits conferred by administration of omega-3 and/or omega- 6 PUFAs against a variety of symptoms and diseases (e.g., asthma, psoriasis, eczema, diabetes, cancer).
Today, a variety of different hosts including plants, algae, fungi, and yeast are being investigated as means for commercial PUFA production via numerous divergent efforts. Although the natural PUFA-producing abilities of the host organisms are sometimes essential to a given methodology, genetic engineering has also proven that the natural abilities of some hosts (even those natively limited to LA and ALA fatty acid production) can be substantially altered to result in high- level production of various long-chain omega-3/omega-6 PUFAs. Whether this effect is the result of natural abilities or recombinant technology, arachidonic acid (ARA; 20:4 omega-6), eicosapentaenoic acid (EPA; 20:5 omega-3), and docosahexaenoic acid (DHA; 22:6 omega-3) all require expression of either the delta-9 elongase/delta-8 desaturase pathway (which operates in some organisms, such as euglenoid species and which is characterized by the production of eicosadienoic acid (EDA; 20:2 omega-6) and/or eicosatrienoic acid (ETrA; 20:3 omega-3)) or the delta-6 desaturase/delta-6 elongase pathway (which is predominantly found in algae, mosses, fungi, nematodes and humans and which is characterized by the production of gamma-linolenic acid (GLA; 18:3 omega-6) and/or stearidonic acid (STA; 18:4 omega-3)) (FIG. 1). A delta-6 elongase is also known as a C-] 8/20 elongase.
The delta-8 desaturase enzymes identified thus far have the ability to convert both EDA to dihomo gamma-linolenic acid (DGLA (also known as HGLA); 20:3, n-6) and ETrA to eicosatetraenoic acid (ETA; 20:4, n-3). ARA and EPA are subsequently synthesized from DGLA and ETA, respectively, following reaction with a delta-5 desaturase. DHA synthesis, however, requires the subsequent expression of an additional C20/22 elongase and a delta-4 desaturase. Most C20722 elongases identified so far have the primary ability to convert EPA to DPA, with secondary activity in converting arachidonic acid (ARA; 20:4 omega-6) to docosatetraenoic acid (DTA; 22:4 omega-6), while most delta-4 desaturase enzymes identified so far have the primary ability to convert DPA to DHA, with secondary activity in converting docosatetraenoic acid (DTA; 22:4 omega -6) to ω-6 docosapentaenoic acid (DPAn-6; 22:5 omega-6).
Based on the role C20/22 elongase and delta-4 desaturase enzymes play in the synthesis of DHA, there has been considerable effort to identify and characterize these enzymes from various sources. As such, numerous C20/22 elongases have been disclosed in both the open literature and the patent literature (e.g., Pavlova sp. CCMP459 (GenBank Accession No. AAV33630), Ostreococcus tauri (GenBank
Accession No. AAV67798) and Thalassiosira pseudonana (GenBank Accession No. AAV67800)). Similarly, the following delta-4 desaturases have been disclosed: Euglena gracilis (SEQ ID NO: 13; GenBank Accession No. AAQ19605; Meyer et al., Biochemistry, 42(32):9779-9788 (2003)); Thalassiosira pseudonana (SEQ ID NO:29; GenBank Accession No. AAX14506; Tonon et al., FEBS J., 272(13):3401- 3412 (2005)); Thraustochytrium aureum (SEQ ID NO:27; GenBank Accession No. AAN75707); Thraustochytrium sp. (GenBank Accession No. CAD42496; U.S. Patent 7,087,432); Schizochytrium aggregatum (SEQ ID NO:28; PCT Publication No. WO 2002/090493); Pavlova lutheri (GenBank Accession No. AAQ98793); and lsochrysis galbana (SEQ ID NO:30; GenBank Accession No. AAV33631 ; Pereira et al., Biochem. J., 384(2): 357-366 (2004); PCT Publication No. WO 2002/090493)]. Applicants' Assignee has a number of patent applications concerning the production of PUFAs in oleaginous yeasts (i.e., Yarrowia lipolytica), including, for example: U.S. Patents No. 7,238,482 and No. 7,125,672; U.S. Application No. 11/265,761 (filed November 2, 2005); U.S. Application No. 11/264,784 (filed November 1 , 2005); U.S. Application No. 11/264,737 (filed November 1 , 2005). Relatedly, PCT Publication No. WO 2004/071467 (published August 26,
2004) concerns the production of PUFAs in plants, while PCT Publication No. WO 2004/071178 (published August 26, 2004) concerns annexin promoters and their use in expression of transgenes in plants. Both are Applicants' Assignee's copending applications. SUMMARY OF THE INVENTION
The present invention concerns a multizyme comprising a single polypeptide having at least two independent and separable enzymatic activities.
In a second embodiment the enzymatic activities of the multizyme can be selected from the group consisting of fatty acid elongases, fatty acid desaturases, acyl transferases, acyl CoA synthases, and thioesterases. More specifically, the enzymatic activities can comprise at least one fatty acid elongase linked to at least one fatty acid desaturase.
In a third embodiment the multizyme can comprise a first enzymatic activity linked to a second enzymatic activity and said link is selected from the group consisting of a polypeptide bond, SEQ ID NO:198 (EgDHAsyni linker), SEQ ID
NO:200 (EgDHAsyn2 linker), SEQ ID NO:235 (EaDHAsyni linker), SEQ ID NO:438, SEQ ID NO:445, SEQ ID NO:472, and SEQ ID NO:504. In a fourth embodiment, the invention concerns an isolated polynucleotide encoding a DHA synthase comprising:
(a) a nucleotide sequence encoding a polypeptide having DHA synthase activity, wherein the polypeptide has at least 80% amino acid identity, based on the Clustal V method of alignment, when compared to an amino acid sequence as set forth in SEQ ID NO:12, SEQ ID NO:22, SEQ ID NO:95, SEQ ID NO:96, or SEQ ID NO:97;
(b) a nucleotide sequence encoding a polypeptide having DHA synthase activity wherein the nucleotide sequence has at least 80% sequence identity, based on the BLASTN method of alignment, when compared to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:205, SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, or SEQ ID NO:410;
(c) a nucleotide sequence encoding a polypeptide having DHA synthase activity, wherein the nucleotide sequence hybridizes under stringent conditions to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:205, SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, or SEQ ID NO:410; or
(d) a complement of the nucleotide sequence of (a), (b) or (c), wherein the complement and the nucleotide sequence consist of the same number of nucleotides and are 100% complementary.
In a fifth embodiment, the invention concerns the polynucleotide encoding a polypeptide having DHA synthase activity wherein the nucleotide sequence comprises SEQ ID NO:11 , SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, or SEQ ID NO:410. In a sixth embodiment, the invention concerns the polypeptide of the invention having DHA synthase activity, wherein the amino acid sequence of the polypeptide comprises SEQ ID NO:12, SEQ ID NO:22, SEQ ID NO:95, SEQ ID NO:96, or SEQ ID NO:97.
In a seventh embodiment, the invention concerns an isolated polynucleotide encoding a C20 elongase comprising:
(a) a nucleotide sequence encoding a polypeptide having C20 elongase activity, wherein the polypeptide has at least 80% amino acid identity, based on the Clustal V method of alignment, when compared to an amino acid sequence as set forth in SEQ ID NO:12, SEQ ID NO:22, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:202 (EgDHAsyni C20 elongase domain), SEQ ID NO:204 (EgDHAsyn2 C20 elongase domain), SEQ ID NO:231 (EaDHAsyni C20 elongase domain), SEQ ID NO:232 (EaDHAsyn2 C20 elongase domain) or SEQ ID NO:233 (EaDHAsyn3 C20 elongase domain);
(b) a nucleotide sequence encoding a polypeptide having C20 elongase activity wherein the nucleotide sequence has at least 80% sequence identity, based on the BLASTN method of alignment, when compared to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:183, SEQ ID NO:188, SEQ ID NO:201 (EgDHAsyni C20 elongase domain), SEQ ID NO:206 (EgDHAsyni* C20 elongase domain), SEQ ID NO:203 (EgDHAsyn2 C20 elongase domain), SEQ ID NO:227 (EaDHAsyni C20 elongase domain), SEQ ID NO:228 (EaDHAsyn2 C20 elongase domain), SEQ ID NO:229 (EaDHAsyn3 C20 elongase domain) or SEQ ID NO:230 (EaDHAsyn4 C20 elongase domain);
(c) a nucleotide sequence encoding a polypeptide having C20 elongase activity, wherein the nucleotide sequence hybridizes under stringent conditions to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:183, SEQ ID NO:188, SEQ ID NO:201 (EgDHAsyni C20 elongase domain), SEQ ID NO:206
(EgDHAsyni* C20 elongase domain), SEQ ID NO:203 (EgDHAsyn2 C20 elongase domain), SEQ ID NO:227 (EaDHAsyni C20 elongase domain), SEQ ID NO:228 (EaDHAsyn2 C20 elongase domain), SEQ ID NO:229 (EaDHAsyn3 C20 elongase domain) or SEQ ID NO:230 (EaDHAsyn4 C20 elongase domain); or (d) a complement of the nucleotide sequence of (a), (b) or (c), wherein the complement and the nucleotide sequence consist of the same number of nucleotides and are 100% complementary.
In an eighth embodiment, the invention concerns an isolated polynucleotide encoding a delta-4 desaturase comprising: (a) a nucleotide sequence encoding a polypeptide having delta-4 desaturase activity, wherein the polypeptide has at least 80% amino acid identity, based on the Clustal V method of alignment, when compared to an amino acid sequence as set forth in SEQ ID NO:12, SEQ ID NO:22, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:193, SEQ ID NO:215, SEQ ID NO:217, SEQ ID NO:221 , SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , SEQ ID NO:246, SEQ ID NO:247, SEQ ID NO:248, SEQ ID NO:249, SEQ ID NO:382, SEQ ID NO:384, SEQ ID NO:386, SEQ ID NO:388, SEQ ID NO:404, SEQ ID NO:406, or SEQ ID NO:408;
(b) a nucleotide sequence encoding a polypeptide having delta-4 desaturase activity wherein the nucleotide sequence has at least 80% sequence identity, based on the BLASTN method of alignment, when compared to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:192, SEQ ID NO:214, SEQ ID No:216, SEQ ID NO:220, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:242, SEQ ID NO:243, SEQ ID NO:244, SEQ ID NO:245, SEQ ID NO:381 , SEQ ID NO:383, SEQ ID NO:385, SEQ ID NO:387, SEQ ID NO:403, SEQ ID NO:405, or SEQ ID NO:407; (c) a nucleotide sequence encoding a polypeptide having delta-4 desaturase activity, wherein the nucleotide sequence hybridizes under stringent conditions to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:214, SEQ ID NO:216, SEQ ID NO:220, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:242, SEQ ID NO:243, SEQ ID NO:244, SEQ ID NO:245, SEQ ID NO: 381 , SEQ ID NO:383, SEQ ID NO:385, SEQ ID NO:387, SEQ ID NO:403, SEQ ID NO:405, or SEQ ID NO:407; or
(d) a complement of the nucleotide sequence of (a), (b) or (c), wherein the complement and the nucleotide sequence consist of the same number of nucleotides and are 100% complementary.
In a ninth embodiment, the invention concerns an isolated polynucleotide encoding a DHA synthase, said polynucleotide comprising the sequence set forth in any of SEQ ID NO:11 , SEQ ID NO:205, SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, or SEQ ID NO:410. In a tenth embodiment, the invention concerns an isolated polynucleotide encoding a C20 elongase, said isolated polynucleotide encoding a C20 elongase, said polynucleotide comprising the sequence set forth in any of SEQ ID NO: 183, SEQ ID NO:188, SEQ ID NO:201 (EgDHAsyni C20 elongase domain, SEQ ID NO:206 (EgDHAsyni* C20 elongase domain), SEQ ID NO:203 (EgDHAsyn2 C20 elongase domain), SEQ ID NO:227 (EaDHAsyni C20 elongase domain), SEQ ID NO:228 (EaDHAsyn2 C20 elongase domain), SEQ ID NO:229 (EaDHAsyn3 C20 elongase domain) or SEQ ID NO:230 (EaDHAsyn4 C20 elongase domain). In an eleventh embodiment, the invention concerns an isolated polynucleotide encoding a delta-4 desaturase, said polynucleotide comprising the sequence set forth in SEQ ID NO:192, SEQ ID NO:214, SEQ ID NO:220, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:242, SEQ ID NO:243, SEQ ID NO:244, SEQ ID NO:245, SEQ ID NO:381 , SEQ ID NO:383, SEQ ID NO:385, SEQ ID No:387, SEQ ID NO:403, SEQ ID NO:405, or SEQ ID NO:407.
In a twelfth embodiment, the invention concerns a recombinant construct comprising any of the isolated polynucleotides of the invention operably linked to at least one regulatory sequence.
In a thirteenth embodiment, the invention concerns a host cell comprising in its genome the recombinant construct of the invention. More particularly, the host cell is a recombinant microbial host cell comprising a multizyme of the invention, wherein the first enzymatic activity is a delta-9 elongase and the second enzymatic activity is a delta-8 desaturase. In another aspect, the first enzymatic activity is a C20 elongase, and the second enzymatic activity is a delta-4 desaturase. In a fourteenth embodiment, the invention concerns a transformed Yarrowia sp. comprising the recombinant construct of the invention.
In a fifteenth embodiment, the invention concerns a method for transforming a cell, comprising transforming a cell with the recombinant construct of the invention and selecting those cells transformed with said recombinant construct. In a sixteenth embodiment, the invention concerns a method for producing a transformed plant comprising transforming a plant cell with any of the polynucleotides of the invention and regenerating a plant from the transformed plant cell.
In a seventeenth embodiment, the invention concerns a method for producing yeast comprising transforming a yeast cell with any of the polynucleotides of the invention and growing yeast from the transformed yeast cell.
In an eighteenth embodiment, the invention concerns a plant comprising in its genome the recombinant construct of the invention. Also of interest are seeds obtained from such plants, oil obtained from such seeds, food or feed incorporating such oil, and a beverage incorporating the oil of the invention.
In a nineteenth embodiment, the invention concerns an isolated nucleic acid molecule which encodes a C20 elongase as set forth in SEQ ID NO: 183 wherein at least 147 codons are codon-optimized for expression in Yarrowia sp.
In a twentieth embodiment, the invention concerns an isolated nucleic acid molecule which encodes a C20 elongase as set forth in SEQ ID NO: 188 wherein at least 134 codons are codon-optimized for expression in Yarrowia sp.
In a twenty-first embodiment, the invention concerns an isolated nucleic acid molecule which encodes a delta-4 desaturase enzyme as set forth in SEQ ID NO: 192 wherein at least 285 codons are codon-optimized for expression in Yarrowia sp.
In a twenty-second embodiment, the invention concerns a method for making a multizyme which comprises: (a) linking a first polypeptide with at least a second polypeptide wherein each polypeptide has an independent and separable enzymatic activity; and
(b) evaluating the product of step (a) for the independent and separable enzymatic activities. In a twenty-third embodiment, the invention concerns a method for altering the fatty acid profile of an oilseed plant comprising: a) transforming an oilseed plant cell with the recombinant construct of the invention; b) regenerating a plant from the transformed oilseed plant cell step (a), wherein the plant has an altered fatty acid profile.
In a twenty-fourth embodiment, the invention concerns an isolated polynucleotide encoding a DGLA synthase comprising:
(a) a nucleotide sequence encoding a polypeptide having DGLA synthase activity, wherein the polypeptide is set forth in SEQ ID NO:441 , SEQ ID NO:454, SEQ ID NO:461 , SEQ ID NO:464, SEQ ID NO:471 , SEQ ID NO:515, SEQ ID NO:516, SEQ ID NO:517, SEQ ID NO:518, or SEQ ID NO:519;
(b) a nucleotide sequence encoding a polypeptide having DGLA synthase activity wherein the nucleotide sequence is set forth in SEQ ID NO:440, SEQ ID NO:446, SEQ ID NO:453, SEQ ID NO:460, SEQ ID NO:463, SEQ ID NO:470, SEQ ID NO:492, SEQ ID NO:493, SEQ ID NO:494, SEQ ID NO:495, or SEQ ID NO:496;
(c) a nucleotide sequence encoding a polypeptide having DGLA synthase activity, wherein the nucleotide sequence hybridizes under stringent conditions to a nucleotide sequence as set forth in SEQ ID NO:440, SEQ ID NO:446, SEQ ID NO:453, SEQ ID NO:460, SEQ ID NO:463, SEQ ID NO:470, SEQ ID NO:492, SEQ ID NO:493, SEQ ID NO:494, SEQ ID NO:495, or SEQ ID NO:496; or (d) a complement of the nucleotide sequence of (a), (b) or (c), wherein the complement and the nucleotide sequence consist of the same number of nucleotides and are 100% complementary.
In a twenty-fifth embodiment, the invention concerns a method for converting linoleic acid to dihomo gamma-linolenic acid comprising: a) providing a recombinant microbial host cell comprising: i) a DGLA synthase comprising:
1) at least one polypeptide encoding a delta-9 elongase;
2) at least one polypeptide encoding a delta-8 desaturase; and 3) a polypeptide linker; wherein the linker is interposed between the delta-9 elongase and the delta-8 desaturase; and ii) a source of linoleic acid; and b) growing the host cell of (a) under conditions whereby dihomo gamma-linolenic acid is produced.
In a twenty-sixth embodiment, the invention concerns a method for the conversion of α-linolenic acid to eicosatrienoic acid comprising: a) providing a recombinant microbial host cell comprising: i) a DGLA synthase comprising: 1) at least one polypeptide encoding a delta-9 elongase;
2) at least one polypeptide encoding a delta-8 desaturase; and
3) a polypeptide linker; wherein the linker is interposed between the delta-9 elongase and the delta-8 desaturase; and ii) a source of α-linolenic acid; and b) growing the host cell of (a) under conditions whereby eicosatrienoic acid is produced.
In a twenty-seventh embodiment, the invention concerns a method for the conversion of eicosapentaenoic acid to docosahexaenoic acid comprising: a) providing a recombinant microbial host cell comprising: i) a DHA synthase comprising: 1) at least one polypeptide encoding a C20 elongase;
2) at least one polypeptide encoding a delta-4 desaturase; and
3) a polypeptide linker; wherein the linker is interposed between the C20 elongase and the delta-4 desaturase; and ii) a source of eicosapentaenoic acid; and b) growing the host cell of (a) under conditions whereby docosahexaenoic acid is produced.
In a twenty-eighth embodiment, the invention concerns a method for the conversion of arachidonic acid to docosapentaenoic acid comprising: a) providing a recombinant microbial host cell comprising: i) a DHA synthase comprising:
1) at least one polypeptide encoding a C20 elongase;
2) at least one polypeptide encoding a delta-4 desaturase; and
3) a polypeptide linker; wherein the linker is interposed between the C20 elongase and the delta-4 desaturase; and ii) a source of arachidonic acid; and b) growing the host cell of (a) under conditions whereby docosapentaenoic acid is produced. In a twenty-ninth embodiment, the invention concerns a method for the identification of a polypeptide having improved delta-4 desaturase activity comprising: a) providing a wild-type delta-4 desaturase polypeptide isolated from Euglena anabena having a base-line delta-4 desaturase activity; b) truncating the wild-type polypeptide of (a) by from about 1 to about 200 amino acids to create a truncated mutant polypeptide having delta-4 desaturase activity that is increased as compared with the base-line delta-4 desaturase activity. In a thirtieth embodiment, the invention concerns a microbial host cell which produces a polyunsaturated fatty acid and expresses polypeptides encoding enzymes in the following sequential pathway:
1) a delta-9 desaturase,
2) a delta-12 desaturase, 3) a delta-9 elongase,
4) a delta-8 desaturase,
5) a delta-5 desaturase,
6) a delta-17 desaturase,
7) a C20/22 elongase, and 8) a delta-4 desaturase; wherein the polypeptides comprise at least one multizyme, a fusion comprising a fusion between at least one contiguous enzyme pair.
BIOLOGICAL DEPOSITS
The following biological materials have been deposited with the American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, VA
20110-2209, and bear the following designations, Accession Numbers and dates of deposit (Table 1).
TABLE 1 ATCC Deposit
Figure imgf000013_0001
BRIEF DESCRIPTION OF THE DRAWINGS AND SEQUENCE LISTINGS
The invention can be more fully understood from the following detailed description and the accompanying drawings and Sequence Listing, which form a part of this application.
FIG. 1 is a representative omega-3 and omega-6 fatty acid pathway providing for the conversion of myristic acid through various intermediates to DHA.
FIG. 2 shows a Clustal W alignment between a portion of the coding sequence of EgDHAsyn2 (SEQ ID NO:21), the cDNA sequence of the Euglena gracilis delta-4 desaturase (SEQ ID NO:23) (NCBI Accession No. AY278558 (Gl 33466345), locus AY278558, Meyer et al., β/ocΛem/sfry 42(32):9779-9788 (2003)), and the coding sequence of the Euglena gracilis delta-4 desaturase (SEQ ID NO:24) (Meyer et al., supra).
FIGs. 3A and 3B show a Clustal W alignment between the amino acid sequence of EgDHAsyni (SEQ ID NO:12), EgDHAsyn2 (SEQ ID NO:22), and EgC20elo1 (SEQ ID NO:6).
FIGs. 4A and 4B show the Clustal W alignment of the N-terminus of EgDHAsyni (SEQ ID NO: 12) and the N-terminus of EgDHAsyn2 (SEQ ID NO:22) with EgC20elo1 (SEQ ID NO:6), Pavlova sp. CCMP459 C20-PUFA EIo (SEQ ID NO:2), Ostreococcus tauri PUFA elongase 2 (SEQ ID NO:25) (NCBI Accession No. AAV67798 (Gl 55852396), locus AAV67798, CDS AY591336; Meyer et al., J. Lipid Res. 45(10): 1899-1909 (2004)), and Thalassiosira pseudonana PUFA elongase 2 (SEQ ID NO:26) (NCBI Accession No. AAV67800 (Gl 55852441), locus AAV67800, CDS AY591338; Meyer et al., J. Lipid Res. 45(10): 1899-1909 (2004)). FIGs. 5A , 5B, 5C and 5D show the Clustal W alignment of the C-terminus of
EgDHAsyni (EgDHAsyn1_CT; amino acids 253-793 of SEQ ID NO:12; the N- terminus of EgDHAsyni is not shown and is indicated by "...") and the C-terminus of EgDHAsyn2 (EgDHAsyn2_CT; amino acids 253-793 of SEQ ID NO:22, the N- terminus of EgDHAsyn2 is not shown and is indicated by "...") with Euglena gracilis delta-4 fatty acid desaturase (SEQ ID NO: 13), Thraustochytrium aureum delta-4 desaturase (SEQ ID NO:27) (NCBI Accession No. AAN75707(GI 25956288), locus AAN75707, CDS AF391543), Schizochytrium aggregatum delta-4 desaturase (SEQ ID NO:28) (PCT Publication No. WO 2002/090493), Thalassiosira pseudonana delta-4 desaturase (SEQ ID NO:29) (NCBI Accession No. AAX14506 (Gl 60173017), locus AAX14506, CDS AY817156; Tonon et al., FEBS J. 272 (13):3401-3412 (2005)), and lsochrysis galbana delta-4 desaturase (SEQ ID NO:30) (NCBI Accession No. AAV33631 (Gl 54307110), locus AAV33631 , CDS AY630574; Pereira et al., Biochem. J., 384(2):357-366 (2004) and PCT Publication No. WO 2002/090493).
FIG. 6 shows an alignment of interior fragments of EgDHAsyni (EgDHAsyn1_ NCT; amino acids 253-365 of SEQ ID NO:12) and EgDHAsyn2 (EgDHAsyn2_NCT; amino acids 253-365 of SEQ ID NO:22) spanning both the C20 elongase region and the delta-4 desaturase domain (based on homology) with the C-termini of C20 elongases (EgC20elo1_CT, amino acids 246-298 of SEQ ID NO:6; PavC20elo_CT, amino acids 240-277 of SEQ ID NO:2; OtPUFAelo2_CT, amino acids 256-300 of SEQ ID NO:25; TpPUFAelo2_CT, amino acids 279-358 of SEQ ID NO:26) and the N-termini of delta-4 desaturases (EgD4_NT, amino acids 1-116 of SEQ ID NO: 13; TaD4_NT, amino acids 1-47 of SEQ ID NO:27; SaD4_NT, amino acids 1-47 of SEQ ID NO:28; TpD4_NT, amino acids 1-82 of SEQ ID NO:29; lgD4_NT, amino acids 1- 43 of SEQ ID NO:30).
FIG. 7 provides plasmid maps for the following: (A) pY115 (see also SEQ ID
NO:33); (B) Yarrowia lipolytica Gateway® destination vector pBY1 (see also SEQ ID NO:34); (C) Yarrowia lipolytica Gateway® destination vector pY159 (see also SEQ ID NO:38); and (D) pBY-EgC20elo1 (see also SEQ ID NO:39).
FIG. 8 provides plasmid maps for the following: (A) pY132 (see also SEQ ID NO:40); (B) pY161 (see also SEQ ID NO:41); (C) pY164 (see also SEQ ID NO:42); and (D) pY141 (see also SEQ ID NO:49). FIG. 9 provides plasmid maps for the following: (A) pY143 (see also SEQ ID
NO:52); (B) pY149 (see also SEQ ID NO:55); (C) pY150 (see also SEQ ID NO:62); and (D) pY156 (see also SEQ ID NO:64).
FIG. 10 provides plasmid maps for the following: (A) pY152 (see also SEQ ID NO:67); (B) pY157 (see also SEQ ID NO:69); (C) pY153 (see also SEQ ID NO:72); and (D) pY151 (see also SEQ ID NO:76).
FIG. 11 is a map of pY160 (see also SEQ ID NO:77).
FIG. 12 shows a chromatogram of the lipid profile of a Euglena anabaena cell extract as described in the Examples. FIGs. 13A, 13B and 13C show a Clustal W alignment of the amino acid sequences for EaDHAsyni (SEQ ID NO:95), EaDHAsyn2 (SEQ ID NO:96), EaDHAsyn3 (SEQ ID NO:97), and EaDHAsyn4 (SEQ ID NO:98).
FIG. 14 provides plasmid maps for the following: (A) pY165 (see also SEQ ID NO:99); (B) pY166 (see also SEQ ID NO:100); (C) pY167 (see also SEQ ID NO: 101); and (D) pY168 (see also SEQ ID NO: 102).
FIG. 15 provides plasmid maps for the following: (A) pKR1061 (see also SEQ ID NO:111); (B) pKR973 (see also SEQ ID NO: 128); (C) pKR1064 (see also SEQ ID NO:132); and (D) pKR1133 (see also SEQ ID NO:145). FIG. 16 provides plasmid maps for the following: (A) pKR1105 (see also SEQ
ID NO:156); (B) pKR1134 (see also SEQ ID NO:161); (C) pKR1095 (see also SEQ ID NO:167); and (D) pKR1132 (see also SEQ ID NO:170.
FIG. 17 is a map of KS373 (see also SEQ ID NO: 179).
FIG. 18 shows the fatty acid profiles, calculated % elongation, and calculated % desaturation for the clones (except pBY-EgC20elo1) shown in Table 24.
FIG. 19 shows the fatty acid profiles, calculated % elongation, and calculated % desaturation for feeding EPA to a vector only control, pY141, pY143, pY149, pY156, pY157, and pY160.
FIG. 20 shows the fatty acid profiles, calculated % elongation, and calculated % desaturation for feeding DPA to a vector only control, pY141 , pY150, pY151, pY152, pY153, pY156, pY157, and pY160.
FIG. 21 shows a schematic of the relative domain structure for each construct described in Table 25.
FIG. 22 shows the fatty acid profiles, calculated % elongation, and calculated % desaturation for feeding EPA, ARA, and DPA to Yarrowia cells transformed with pY141 (EgDHAsyni ; SEQ ID NO:49) and to a vector only control.
FIG. 23 shows the fatty acid profiles for individual embryos from a representative event in somatic soybean embryos transformed with soybean expression vectors pKR973 and pKR1064 (see Table 26). FIG. 24 shows the fatty acid profiles from the five best elongation events in soybean embryos transformed with soybean expression vector KS373.
FIG. 25 summarizes BLASTP and percent identity values for EgC20elo1 (Example 3), EgDHAsyni (Example 4), and EgDHAsyn2 (Example 5). FIG. 26 shows the fatty acid profiles from feeding soybean embryos with EPA. The soybean embryos were selected from the best C20/delta-5 elongase and delta-4 desaturase activities in soybean embryos transformed with soybean expression vector pKR1105. FIG. 27 shows a chromatogram of the lipid profile of a Euglena gracilis cell extract as described in the Examples.
FIG. 28 is a map of pKR1183 (see also SEQ ID NO:266). FIG. 29 summarizes the Euglena anabaena DHA synthase domain sequences. FIG. 30 is a map of pKR1253 (see also SEQ ID NO:270).
FIG. 31 is a map of pKR1255 (see also SEQ ID NO:275). FIG. 32 is a map of pKR1189 (see also SEQ ID NO:285). FIG. 33 is a map of pKR1229 (see also SEQ ID NO:296). FIG. 34 is a map of pKR1249 (see also SEQ ID NO:297). FIG. 35 is a map of pKR1322 (see also SEQ ID NO:314).
FIG. 36 shows the fatty acid profiles for five events transformed with pKR1189 that have the lowest average ALA content (average of 5 soybean somatic embryos analyzed) along with an event (2148-3-8-1) having a fatty acid profile typical of wild type embryos for this experiment. Fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA, and ALA. Fatty acid compositions are expressed as a weight percent (wt. %) of total fatty acids. FIG. 37 shows the fatty acid profiles for five events transformed with pKR1183 that have the highest average DGLA content (average of 5 soybean somatic embryos analyzed). Fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA, ALA, EDA, ERA, DGLA, and ETA. Fatty acid compositions are expressed as a weight percent (wt. %) of total fatty acids. FIG. 38 shows the average fatty acid profiles (Average of 10 soybean somatic embryos) for 20 events transformed with pKR1249 and pKR1253 that have the highest ARA. Fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA, ALA, EDA, SCI, DGLA, ARA, ERA, JUN, ETA, and EPA. Fatty acid compositions are expressed as a weight percent (wt. %) of total fatty acids. Fatty acids listed as "others" include: 18:2 (5,9), 18:3 (5,9,12), STA, 20:0, 20:1(11), 20:2 (7,11) or 20:2 (8,11), and DPA. FIG. 39 shows the actual fatty acid profiles for each soybean somatic embryo from one event (AFS 5416-8-1-1) having an average ARA content of 17.0% and an average EPA content of 1.5%. Fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA, ALA, EDA, SCI, DGLA, ARA, ERA, JUN, ETA, and EPA. Fatty acid compositions are expressed as a weight percent (wt. %) of total fatty acids. Fatty acids listed as "others" include: 18:2 (5,9), 18:3 (5,9,12), STA, 20:0, 20:1(11), 20:2 (7,11) or 20:2 (8,11), and DPA.
FIG. 40 shows the average fatty acid profiles (Average of 9 or 10 soybean somatic embryos) for 20 events transformed with pKR1249 and pKR1255 that have the highest ARA. Fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA, ALA, EDA, SCI, DGLA, ARA, ERA, JUN, ETA, and EPA; fatty acid compositions are expressed as a weight percent (wt. %) of total fatty acids. Fatty acids listed as "others" include: 18:2 (5,9), 18:3 (5,9,12), STA, 20:0, 20:1(11), 20:2 (7,11) or 20:2 (8,11), and DPA. FIG. 41 shows the fatty acid profiles from feeding embryos with EPA. The soybean embryos were selected from the events with the best C20/delta-5 elongase and delta-4 desaturase activities in soybean embryos transformed with soybean expression vector pKR1134. Fatty acids in FIG. 41 are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA, ALA, EPA, 22:0 (docosanoic acid), DPA, 24:0 (tetracosanoic acid), DHA, and 24:1 (nevonic acid). Fatty acid compositions listed in FIG. 41 are expressed as a weight percent (wt. %) of total fatty acids.
FIG. 42 shows the fatty acid profiles from feeding soybean embryos with EPA. The soybean embryos were selected from the events with the best C20/delta-5 elongase and delta-4 desaturase activities from the 20 new events analyzed for soy transformed with pKR1105. Fatty acids in FIG. 42 are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA, ALA, EPA, 22:0 (docosanoic acid), DPA, 24:0 (tetracosanoic acid), DHA, and 24:1 (nevonic acid). Fatty acid compositions listed in FIG. 42 are expressed as a weight percent (wt. %) of total fatty acids.
FIG. 43 shows a graph depicting the relative activities of events transformed with either pKR1105 (C20 elongase and delta-4 desaturase expressed individually) or pKR1134 (C20 elongase and delta-4 desaturase expressed as a fusion), when the soybean embryos were fed EPA.
FIG. 44 diagrams the development of Yarrowia lipolytics strain Y4305U3. FIG. 45 provides plasmid maps for the following: (A) pZKLeuN-29E3 and (B) pY116.
FIG. 46 provides plasmid maps for the following: (A) pKO2UF8289 and (B) pZKSL-555R. FIG. 47 provides plasmid maps for the following: (A) pZP3-Pa777U and (B) pY117.
FIG. 48 provides plasmid maps for the following: (A) pZP2-2988 and (B) pZKUE3S.
FIG. 49 provides plasmid maps for the following: (A) pZKL2-5U89GC and (B) pZKL1-2SP98C.
FIG. 50 provides plasmid maps for the following: (A) pZKUM and (B) pZKD2- 5U89A2.
FIG. 51 A diagrams the development of Yarrowia lipolytica strain Y4184U. FIG. 51 B provides a plasmid map for pEgC20ES. FIG. 52 provides plasmid maps for the following: (A) pZUFmEgC20ES and
(B) pZKL4-220EA4. FIG. 52C is a schematic drawing showing overlap of the 3' region of the EaC20E domain (SEQ ID NO:231) with the 51 region of the EaD4 domain (SEQ ID NO:246) within EaDHAsyni (SEQ ID NO:95).
FIG. 53A shows an alignment between the N-termini of EaD4S (SEQ ID NO:193), EaD4S-1 (SEQ ID NO:382), EaD4S-2 (SEQ ID NO:384), and EaD4S-3 (SEQ ID NO:386). FIG. 53B shows an alignment between the N-termini of EgD4S (SEQ ID NO:388), EgD4S-1 (SEQ ID NO:404), EgD4S-2 (SEQ ID NO:406), and EgD4S-3 (SEQ ID NO:408).
FIG. 54 provides plasmid maps for the following: (A) pZKLY-G204, (B) pEgC20ES-K, (C) pYNTGUS1-CNP, and (D) pZKLY.
FIG. 55 provides plasmid maps for the following: (A) pZUFmG9G8fu and (B) pZUFmG9A8.
FIG. 56 is a map of pKR1014.
FIG. 57 is a map of pKR1152. FIG. 58 is a map of pKR1151.
FIG. 59 is a map of pKR1150.
FIG. 60 is a map of pKR1199.
FIG. 61 is a map of pKR1200. FIG. 62 is a map of pKR1184.
FIG. 63 is a map of pKR1321.
FIG. 64 is a map of pKR1326.
For FIGs. 65-71 , fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA, ALA, EDA, ERA, DGLA, and ETA, and fatty acid compositions are expressed as a weight percent (wt. %) of total fatty acids. In addition, elongation activity is expressed as % delta-9 elongation of C18 fatty acids (C18 % delta-9 elong), calculated according to the following formula: ([product]/[substrate + product])*100. More specifically, the combined percent elongation for LA and ALA is determined as: ([DGLA + ETA + EDA + ERA]/[LA + ALA + DGLA + ETA + EDA + ERA])*100. The combined percent desaturation for EDA and ERA is shown as "C20 % delta-8 desat", determined as: ([DGLA + ETA]/[DGLA + ETA + EDA + ERA])*100, and is also referred to as the overall % desaturation. FIG. 65 shows the fatty acid profiles for the five events transformed with pKR1014 that have the highest average DGLA content (average of 5 soybean somatic embryos analyzed).
FIG. 66 shows the fatty acid profiles for the five events transformed with pKR1152 that have the highest average DGLA content (average of 5 soybean somatic embryos analyzed).
FIG. 67 shows the fatty acid profiles for the five events transformed with pKR1151 that have the highest average DGLA content (average of 5 soybean somatic embryos analyzed).
FIG. 68 shows the fatty acid profiles for the five events transformed with pKR1150 that have the highest average DGLA content (average of 5 soybean somatic embryos analyzed).
FIG. 69 shows the fatty acid profiles for the five events transformed with pKR1199 that have the highest average DGLA content (average of 5 soybean somatic embryos analyzed). FIG. 70 shows the fatty acid profiles for the five events transformed with pKR1200 that have the highest average DGLA content (average of 5 soybean somatic embryos analyzed). FIG. 71 shows the fatty acid profiles for the five events transformed with pKR1184 that have the highest average DGLA content (average of 5 soybean somatic embryos analyzed).
FIG. 72 shows a comparison of individually expressed delta-9 elongases with delta-8 desaturases versus the equivalent delta-9 elongase-delta-8 desaturase fusion. Each data point represents the average %DGLA or %EDA for 5-6 embryos (as a % of total fatty acids) for all events analyzed, and Avg. %DGLA is plotted vs. Avg. % EDA. In (A), EgTpom represents EgD9e co-expressed with TpomDδ (pKR1014), and EgTpomfus represents the EgD9e/TpomD8 fusion (pKR1199). In (B), EgEa represents EgD9e co-expressed with EaD8 (pKR1152), and EgEafus represents the EgD9e/EaD8 fusion (pKR1200). In (C), EaTpom represents EaD9e co-expressed with TpomDδ (pKR1151), and EaTpomfus represents the EaD9e/TpomD8 fusion (pKR1183). In FIG. (D), EaEa represents EaD9e co- expressed with EaD8 (pKR1150) and EaEafus represents the EaD9e/EaD8 fusion (pKR1200).
FIG. 73 shows the fatty acid profiles for the five events transformed with pKR1322 (Experiment MSE2274) that have the highest average ARA and EPA content (average of the 5 embryos analyzed) Fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), 18:2 (5,9), LA, ALA, EDA, ERA, SCI, DGLA, JUN (also called JUP), ETA, ARA and EPA. Fatty acid compositions are expressed as a weight percent (wt. %) of total fatty acids. Elongation activity is expressed as % delta-9 elongation of C18 fatty acids (%Elo), calculated according to the following formula: ([product]/[substrate + product])*100. More specifically, the combined percent elongation for LA and ALA is determined as: ([DGLA + ETA + EDA + ERA + EPA + ARA]/[LA + ALA + DGLA + ETA + EDA + ERA + EPA +
ARA])*100. The combined percent delta-8 desaturation for EDA and ERA is shown as "%D8", determined as: ([DGLA + ETA + EPA + ARA]/[DGLA + ETA + EDA + ERA + EPA + ARA])*100. This is also referred to as the overall % delta-8 desaturation. The combined percent delta-5 desaturation for DGLA and ETA is shown as "%D5", determined as: ([EPA + ARA]/[DGLA + ETA + EPA + ARA])*100. This is also referred to as the overall % delta-5 desaturation.
FIG. 74 shows the fatty acid profiles for the five events transformed with pKR1326 (Experiment MSE2275) that have the highest average DGLA and ETA content (average of the 5 embryos analyzed). Fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA, ALA, EDA, ERA, DGLA and DGLA and ETA. Fatty acid compositions are expressed as a weight percent (wt. %) of total fatty acids. Elongation activity is expressed as % delta-9 elongation of C18 fatty acids (C18 % delta-9 elong), calculated according to the following formula: ([product]/[substrate + product])*100. More specifically, the combined percent elongation for LA and ALA is determined as: ([DGLA + ETA + EDA + ERA]/[LA + ALA + DGLA + ETA + EDA + ERA])MOO. The combined percent desaturation for EDA and ERA is shown as "C20 % delta-8 desat", determined as: ([DGLA + ETA]/[DGLA + ETA + EDA + ERA])*100. This is also referred to as the overall % desaturation.
The sequence descriptions summarize the Sequences Listing attached hereto. The Sequence Listing contains one letter codes for nucleotide sequence characters and the single and three letter codes for amino acids as defined in the IUPAC-IUB standards described in Nucleic Acids Research 13:3021-3030 (1985) and in the Biochemical Journal 219(2):345-373 (1984).
SEQ ID NOs: 1-519 are primers, ORFs encoding genes, proteins (or portions thereof), or plasmids, as identified in Table 2.
TABLE 2 Summary Of Nucleic Acid And Protein SEQ ID Numbers
Figure imgf000022_0001
Figure imgf000023_0001
Figure imgf000024_0001
Figure imgf000025_0001
Figure imgf000026_0001
Figure imgf000027_0001
Figure imgf000028_0001
Figure imgf000029_0001
Figure imgf000030_0001
Figure imgf000031_0001
Figure imgf000032_0001
DETAILED DESCRIPTION OF THE INVENTION
The disclosure of each reference set forth herein is hereby incorporated by reference in its entirety.
The present invention relates to multizymes, such as DHA synthase. These are useful for, inter alia, the manipulation of biochemical pathways for the production of healthful PUFAs and more specifically for the production of docosahexaenoic acid (DHA). Thus, the subject invention finds many applications. PUFAs, or derivatives thereof, made by the methodology disclosed herein can be used as dietary substitutes, or supplements, particularly infant formulas, for patients undergoing intravenous feeding or for preventing or treating malnutrition. Alternatively, the purified PUFAs (or derivatives thereof) may be incorporated into cooking oils, fats, or margarines formulated so that in normal use the recipient would receive the desired amount for dietary supplementation. The PUFAs may also be incorporated into infant formulas, nutritional supplements, or other food products and may find use as anti-inflammatory or cholesterol lowering agents. Optionally, the compositions may be used for pharmaceutical use (human or veterinary). In this case, the PUFAs are generally administered orally but can be administered by any route by which they may be successfully absorbed, e.g., parenterally (e.g., subcutaneously, intramuscularly or intravenously), rectally, vaginally, or topically (e.g., as a skin ointment or lotion).
Supplementation of humans or animals with PUFAs produced by recombinant means can result in increased levels of the added PUFAs, as well as their metabolic progeny. For example, treatment with EPA can result not only in increased levels of EPA, but also downstream products of EPA such as eicosanoids (i.e., prostaglandins, leukotrienes, thromboxanes). Complex regulatory mechanisms can make it desirable to combine various PUFAs, or add different conjugates of PUFAs, in order to prevent, control, or overcome such mechanisms to achieve the desired levels of specific PUFAs in an individual. Definitions As used herein and in the appended claims, the singular forms "a", "an", and
"the" include plural reference unless the context clearly dictates otherwise. Thus, for example, reference to "a plant" includes a plurality of such plants, reference to "a cell" includes one or more cells and equivalents thereof known to those skilled in the art, and so forth. The term "invention" or "present invention" as used herein is not meant to be limiting to any one specific embodiment of the invention but applies generally to any and all embodiments of the invention as described in the claims and specification.
In the context of this disclosure, a number of terms and abbreviations are used. The following definitions are provided. "Open reading frame" is abbreviated ORF.
"Polymerase chain reaction" is abbreviated PCR. "American Type Culture Collection" is abbreviated ATCC. "Polyunsaturated fatty acid(s)" is abbreviated PUFA(s). "Triacylglycerols" are abbreviated TAGs.
The terms "down-regulate or down-regulation", as used herein, refer to a reduction or decrease in the level of expression of a gene or polynucleotide.
The term "multizyme" refers to a single polypeptide having at least two independent and separable enzymatic activities. Preferably, the multizyme comprises a first enzymatic activity linked to a second enzymatic activity.
The term "fusion protein" is used interchangeably with the term "multizyme". Thus, a "fusion protein" refers to a single polypeptide having at least two independent and separable enzymatic activities. The term "fusion gene" refers to a polynucleotide or gene that encodes a multizyme. A fusion gene can be constructed by linking at least two DNA fragments, wherein each DNA fragment encodes for an independent and separate enzyme activity. An example of a fusion gene is described herein below in Example 38, in which the Hybrid 1 -HGLA Synthase fusion gene was constructed by linking the Euglena anabaena delta-9 elongase (EaD9Elo1 ; SEQ ID NO:252) and the
Tetruetreptia pomquetensis CCMP1491 delta-8 desaturase (TpomD8; SEQ ID NO: 162) using the Euglena gracilis DHA synthase 1 proline-rich linker. (EgDHAsynHink; SEQ ID NO:197).
A "domain" or "functional domain" is a discrete, continuous part or subsequence of a polypeptide that can be associated with a function (e.g. enzymatic activity). As used herein, the term "domain" includes but is not limited to fatty acid biosynthetic enzymes and portions of fatty acid biosynthetic enzymes that retain enzymatic activity.
"DHA synthase" is an example of a multizyme. Specifically, a DHA synthase comprises a C20 elongase linked to a delta-4 desaturase using any of the linkers described herein. Another example of a multizyme is a single polypeptide comprising a delta-9 elongase linked to a delta-8 desaturase as discussed below. The term "link" refers to joining or bonding at least two polypeptides having independent and separable enzyme activities. The term "linker" refers to the bond or link between two or more polypeptides each having independent and separable enzymatic activities
The link used to form a multizyme is minimally comprised of a single polypeptide bond. In another aspect, the link may be comprised of one amino acid residue, such as proline, or a polypeptide. If the link is a polypeptide, it may be desirable for the link to have at least one proline amino acid residue.
An example of a linker is shown in SEQ ID NO: 198 (the EgDHAsyni proline- rich linker). The term "fatty acids" refers to long-chain aliphatic acids (alkanoic acids) of varying chain lengths, from about Ci2 to C22 (although both longer and shorter chain-length acids are known). The predominant chain lengths are between Ciβ and C22. Additional details concerning the differentiation between "saturated fatty acids" versus "unsaturated fatty acids", "monounsaturated fatty acids" versus "polyunsaturated fatty acids" (or "PUFAs"), and "omega-6 fatty acids" (ω-6 or n-6) versus "omega-3 fatty acids" (omega-3 or π-3) are provided in U.S. Patent 7,238,482.
Fatty acids are described herein by a simple notation system of "X:Y", wherein X is the total number of carbon (C) atoms in the particular fatty acid and Y is the number of double bonds. The number following the fatty acid designation indicates the position of the double bond from the carboxyl end of the fatty acid with the "c" affix for the c/s-configu ration of the double bond (e.g., palmitic acid (16:0), stearic acid (18:0), oleic acid (18:1 , 9c), petroselinic acid (18:1 , 6c), LA (18:2, 9c,12c), GLA (18:3, 6c,9c,12c) and ALA (18:3, 9c,12c,15c)). Unless otherwise specified, 18:1 , 18:2 and 18:3 refer to oleic, LA and ALA fatty acids, respectively. If not specifically written as otherwise, double bonds are assumed to be of the cis configuration. For instance, the double bonds in 18:2 (9,12) would be assumed to be in the cis configuration.
Nomenclature used to describe PUFAs in the present disclosure is shown below in Table 3. In the column titled "Shorthand Notation", the omega-reference system is used to indicate the number of carbons, the number of double bonds and the position of the double bond closest to the omega carbon, counting from the omega carbon (which is numbered 1 for this purpose). The remainder of the table summarizes the common names of omega-3 and omega-6 fatty acids and their precursors, the abbreviations that will be used throughout the remainder of the specification, and each compounds' chemical name. TABLE 3
Figure imgf000036_0001
A metabolic, or biosynthetic, pathway, in a biochemical sense, can be regarded as a series of chemical reactions occurring within a cell, catalyzed by enzymes, to achieve either the formation of a metabolic product to be used or stored by the cell, or the initiation of another metabolic pathway (then called a flux generating step). Many of these pathways are elaborate, and involve a step by step modification of the initial substance to shape it into a product having the exact chemical structure desired.
The term "PUFA biosynthetic pathway" refers to a metabolic process that converts oleic acid to LA, EDA, GLA, DGLA, ARA, DTA, DPAn-6, ALA, STA, ETrA, ETA, EPA, DPA and DHA. This process is well described in the literature (e.g., see PCT Publication No. WO 2006/052870). Simplistically, this process involves elongation of the carbon chain through the addition of carbon atoms and desaturation of the molecule through the addition of double bonds, via a series of special desaturation and elongation enzymes (i.e., "PUFA biosynthetic pathway enzymes") present in the endoplasmic reticulum membrane. More specifically, "PUFA biosynthetic pathway enzyme" refers to any of the following enzymes (and genes which encode said enzymes) associated with the biosynthesis of a PUFA, including: a delta-4 desaturase, a delta-5 desaturase, a delta-6 desaturase, a delta- 12 desaturase, a delta-15 desaturase, a delta-17 desaturase, a delta-9 desaturase, a delta-8 desaturase, a delta-9 elongase, a C14/16 elongase, a C16/18 elongase, a C18/20 elongase, a C20/22 elongase, a DHA synthase and/or a multizyme of the instant invention.
The term "omega-3/omega-6 fatty acid biosynthetic pathway" refers to a set of genes which, when expressed under the appropriate conditions encode enzymes that catalyze the production of either or both omega-3 and omega-6 fatty acids. Typically the genes involved in the omega-3/omega-6 fatty acid biosynthetic pathway encode PUFA biosynthetic pathway enzymes. A representative pathway is illustrated in FIG. 1 , providing for the conversion of myhstic acid through various intermediates to DHA, which demonstrates how both omega-3 and omega-6 fatty acids may be produced from a common source. The pathway is naturally divided into two portions where one portion will generate omega-3 fatty acids and the other portion, omega-6 fatty acids. The term "functional" as used herein in context with the omega-3/omega-6 fatty acid biosynthetic pathway means that some (or all) of the genes in the pathway express active enzymes, resulting in in vivo catalysis or substrate conversion. It should be understood that "omega-3/omega-6 fatty acid biosynthetic pathway" or "functional omega-3/omega-6 fatty acid biosynthetic pathway" does not imply that all the PUFA biosynthetic pathway enzyme genes are required, as a number of fatty acid products will only require the expression of a subset of the genes of this pathway.
The term "delta-6 desaturase/ delta-6 elongase pathway" refers to a PUFA biosynthetic pathway that minimally includes at least one delta-6 desaturase and at least one Ci 8/20 elongase, thereby enabling biosynthesis of DGLA and/or ETA from LA and ALA, respectively, with GLA and/or STA as intermediate fatty acids. With expression of other desaturases and elongases, ARA, DTA, DPAn-6, EPA, DPA, and DHA may also be synthesized. The term "delta-9 elongase/delta-8 desaturase pathway" refers to a PUFA biosynthetic pathway that minimally comprises at least one delta-9 elongase and at least one delta-8 desaturase, thereby enabling biosynthesis of DGLA and/or ETA from LA and ALA, respectively, with EDA and/or ETrA as intermediate fatty acids With expression of other desaturases and elongases, ARA, DTA1 DPAn-6, EPA, DPA and DHA may also be synthesized. This pathway may be advantageous in some embodiments, as the biosynthesis of GLA and/or STA is excluded.
The term "intermediate fatty acid" refers to any fatty acid produced in a fatty acid metabolic pathway that can be further converted to an intended product fatty acid in this pathway by the action of other metabolic pathway enzymes. For instance, when EPA is produced using the delta-9 elongase/delta-8 desaturase pathway, EDA, ETrA, DGLA, ETA and ARA can be produced and are considered "intermediate fatty acids" since these fatty acids can be further converted to EPA via action of other metabolic pathway enzymes.
The term "by-product fatty acid" refers to any fatty acid produced in a fatty acid metabolic pathway that is not the intended fatty acid product of the pathway nor an "intermediate fatty acid" of the pathway. For instance, when EPA is produced using the delta-9 elongase/delta-8 desaturase pathway, sciadonic acid (SCI) and juniperonic acid (JUP) also can be produced by the action of a delta-5 desaturase on either EDA or ETrA, respectively. They are considered to be "by-product fatty acids" since neither can be further converted to EPA by the action of other metabolic pathway enzymes.
The terms "triacylglycerol", "oil" and "TAGs" refer to neutral lipids composed of three fatty acyl residues esterified to a glycerol molecule (and such terms will be used interchangeably throughout the present disclosure herein). Such oils can contain long-chain PUFAs, as well as shorter saturated and unsaturated fatty acids and longer chain saturated fatty acids. Thus, "oil biosynthesis" generically refers to the synthesis of TAGs in the cell. "Percent (%) PUFAs in the total lipid and oil fractions" refers to the percent of PUFAs relative to the total fatty acids in those fractions. The term "total lipid fraction" or "lipid fraction" both refer to the sum of all lipids (i.e., neutral and polar) within an oleaginous organism, thus including those lipids that are located in the phosphatidylcholine (PC) fraction, phosphatidylethanolamine (PE) fraction and triacylglycerol (TAG or oil) fraction. However, the terms "lipid" and "oil" will be used interchangeably throughout the specification.
The terms "conversion efficiency" and "percent substrate conversion" refer to the efficiency by which a particular enzyme (e.g., a desaturase) can convert substrate to product. The conversion efficiency is measured according to the following formula: ([product]/[substrate + product])*100, where 'product' includes the immediate product and all products in the pathway derived from it.
"Desaturase" is a polypeptide that can desaturate, i.e., introduce a double bond, in one or more fatty acids to produce a fatty acid or precursor of interest. Despite use of the omega-reference system throughout the specification to refer to specific fatty acids, it is more convenient to indicate the activity of a desaturase by counting from the carboxyl end of the substrate using the delta-system. For example, delta-8 desaturases will desaturate a fatty acid between the eighth and ninth carbon atom numbered from the carboxyl-terminal end of the molecule and can, for example, catalyze the conversion of EDA to DGLA and/or ETrA to ETA. Other useful fatty acid desaturases include, for example: (1) delta-5 desaturases that catalyze the conversion of DGLA to ARA and/or ETA to EPA; (2) delta-6 desaturases that catalyze the conversion of LA to GLA and/or ALA to STA; (3) delta- 4 desaturases that catalyze the conversion of DPA to DHA and/or DTA to DPAn-6; (4) delta-12 desaturases that catalyze the conversion of oleic acid to LA; (5) delta- 15 desaturases that catalyze the conversion of LA to ALA and/or GLA to STA; (6) delta-17 desaturases that catalyze the conversion of ARA to EPA and/or DGLA to ETA; and (7) delta-9 desaturases that catalyze the conversion of palmitic acid to palmitoleic acid (16:1) and/or stearic acid to oleic acid (18:1). In the art, delta-15 and delta-17 desaturases are also occasionally referred to as "omega-3 desaturases", "w-3 desaturases", and/or "n-3 desaturases", based on their ability to convert omega-6 fatty acids into their omega-3 counterparts (e.g., conversion of LA into ALA and ARA into EPA, respectively). In some embodiments, it is most desirable to empirically determine the specificity of a particular fatty acid desaturase by transforming a suitable host with the gene for the fatty acid desaturase and determining its effect on the fatty acid profile of the host.
The term "delta-4 desaturase" refers to an enzyme that will desaturate a fatty acid between the fourth and fifth carbon atom numbered from the carboxyl-terminal end of the molecule and that can, for example, catalyze the conversion of DPA to DHA and/or DTA to DPAn-6. For the purposes herein, the term "EgDHAsyni" refers to a DHA synthase enzyme (SEQ ID NO:12) isolated from Euglena gracilis, encoded by SEQ ID NO:11 herein. The term "EgDHAsyn2" refers to a DHA synthase enzyme (SEQ ID NO:22) isolated from Euglena gracilis, encoded by SEQ ID NO:21 herein. The term "EaDHAsyni" refers to a DHA synthase enzyme (SEQ ID NO:95) isolated from Euglena anabaena, encoded by SEQ ID NO:91 herein. The term "EaDHAsyn2" refers to a DHA synthase enzyme (SEQ ID NO:96) isolated from Euglena anabaena, encoded by SEQ ID NO:92 herein. The term "EaDHAsyn3" refers to a DHA synthase enzyme (SEQ ID NO:97) isolated from Euglena anabaena, encoded by SEQ ID NO:93 herein. The term "EaDHAsyn4" refers to an enzyme (SEQ ID NO:98) isolated from Euglena anabaena, encoded by SEQ ID NO:94 herein.
The term "elongase system" refers to a suite of four enzymes that are responsible for elongation of a fatty acid carbon chain to produce a fatty acid that is two carbons longer than the fatty acid substrate that the elongase system acts upon. More specifically, the process of elongation occurs in association with fatty acid synthase, whereby CoA is the acyl carrier (Lassner et al., Plant Cell 8:281-292 (1996)). In the first step, which has been found to be both substrate-specific and also rate-limiting, malonyl-CoA is condensed with a long-chain acyl-CoA to yield carbon dioxide (CO2) and a β-ketoacyl-CoA (where the acyl moiety has been elongated by two carbon atoms). Subsequent reactions include reduction to β- hydroxyacyl-CoA, dehydration to an enoyl-CoA and a second reduction to yield the elongated acyl-CoA. Examples of reactions catalyzed by elongase systems are the conversion of GLA to DGLA, STA to ETA, LA to EDA, ALA to ETrA and EPA to DPA.
For the purposes herein, an enzyme catalyzing the first condensation reaction (i.e., conversion of malonyl-CoA and long-chain acyl-CoA to β-ketoacyl- CoA) will be referred to generically as an "elongase". In general, the substrate selectivity of elongases is somewhat broad but segregated by both chain length and the degree of unsaturation. Accordingly, elongases can have different specificities. For example, a C-|4/16 elongase will utilize a C-14 substrate (e.g., myristic acid); a
C16/18 elongase will utilize a C16 substrate (e.g., palmitate); a C-18/20 elongase will utilize a C18 substrate (e.g., GLA, STA); and a C20/22 elongase will utilize a C2o substrate (e.g., ARA, EPA). Similarly, a "delta-9 elongase" is able to catalyze the conversion of LA to EDA and/or ALA to ETrA.
It is important to note that some elongases have broad specificity and thus a single enzyme may be capable of catalyzing several elongase reactions. Thus, for example, a delta-9 elongase may also act as a C16/18 elongase, C18/2o elongase and/or C20/22 elongase and may have alternate, but not preferred, specificities for delta-5 and delta-6 fatty acids such as EPA and/or GLA, respectively.
The term "C20 elongase" as used herein refers to an enzyme which utilizes a
C20 substrate such as EPA or ARA, for example. The term "C20/delta-5 elongase" refers to an enzyme that utilizes a C20 substrate with a delta-5 double bond.
Similarly for the purposes herein, the term "EgD9elo" or "EgD9e" refers to a delta-9 elongase isolated from Euglena gracilis (see SEQ ID NO:112; also see U.S.
Application No. 11/601 ,563 (filed November 16, 2006, which published as US-2007-
0118929-A1 on May 24, 2007)). As used herein, "nucleic acid" means a polynucleotide and includes a single or double-stranded polymer of deoxyribonucleotide or ribonucleotide bases. Nucleic acids may also include fragments and modified nucleotides. Thus, the terms "polynucleotide", "nucleic acid sequence", "nucleotide sequence" or "nucleic acid fragment" are used interchangeably and refer to a polymer of RNA or DNA that is single- or double-stranded, optionally containing synthetic, non-natural or altered nucleotide bases. Nucleotides (usually found in their 5'-monophosphate form) are referred to by their single letter designation as follows: "A" for adenylate or deoxyadenylate (for RNA or DNA, respectively), "C" for cytidylate or deoxycytidylate, "G" for guanylate or deoxyguanylate, "U" for uridylate, "T" for deoxythymidylate, "R" for purines (A or G), "Y" for pyrimidines (C or T), "K" for G or T, "H" for A or C or T, "I" for inosine, and "N" for any nucleotide. The terms "subfragment that is functionally equivalent" and "functionally equivalent subfragment" are used interchangeably herein. These terms refer to a portion or subsequence of an isolated nucleic acid fragment in which the ability to alter gene expression or produce a certain phenotype is retained whether or not the fragment or subfragment encodes an active enzyme. For example, the fragment or subfragment can be used in the design of chimeric genes to produce the desired phenotype in a transformed plant. Chimeric genes can be designed for use in suppression by linking a nucleic acid fragment or subfragment thereof, whether or not it encodes an active enzyme, in the sense or antisense orientation relative to a plant promoter sequence. The term "conserved domain" or "motif means a set of amino acids conserved at specific positions along an aligned sequence of evolutionarily related proteins. While amino acids at other positions can vary between homologous proteins, amino acids that are highly conserved at specific positions indicate amino acids that are essential in the structure, the stability, or the activity of a protein. The terms "homology", "homologous", "substantially similar" and "corresponding substantially" are used interchangeably herein. They refer to nucleic acid fragments wherein changes in one or more nucleotide bases do not affect the ability of the nucleic acid fragment to mediate gene expression or produce a certain phenotype. These terms also refer to modifications of the nucleic acid fragments of the instant invention such as deletion or insertion of one or more nucleotides that do not substantially alter the functional properties of the resulting nucleic acid fragment relative to the initial, unmodified fragment. It is therefore understood, as those skilled in the art will appreciate, that the invention encompasses more than the specific exemplary sequences.
Moreover, the skilled artisan recognizes that substantially similar nucleic acid sequences encompassed by this invention are also defined by their ability to hybridize (under moderately stringent conditions, e.g., 0.5X SSC, 0.1 % SDS, 60 0C) with the sequences exemplified herein, or to any portion of the nucleotide sequences disclosed herein and which are functionally equivalent to any of the nucleic acid sequences disclosed herein. Stringency conditions can be adjusted to screen for moderately similar fragments, such as homologous sequences from distantly related organisms, to highly similar fragments, such as genes that duplicate functional enzymes from closely related organisms. Post-hybridization washes determine stringency conditions.
The term "selectively hybridizes" includes reference to hybridization, under stringent hybridization conditions, of a nucleic acid sequence to a specified nucleic acid target sequence to a detectably greater degree (e.g., at least 2-fold over background) than its hybridization to non-target nucleic acid sequences and to the substantial exclusion of non-target nucleic acids. Selectively hybridizing sequences typically have about at least 80% sequence identity, or 90% sequence identity, up to and including 100% sequence identity (i.e., fully complementary) with each other. The term "stringent conditions" or "stringent hybridization conditions" includes reference to conditions under which a probe will selectively hybridize to its target sequence. Stringent conditions are sequence-dependent and will be different in different circumstances. By controlling the stringency of the hybridization and/or washing conditions, target sequences can be identified which are 100% complementary to the probe (homologous probing). Alternatively, stringency conditions can be adjusted to allow some mismatching in sequences so that lower degrees of similarity are detected (heterologous probing). Generally, a probe is less than about 1000 nucleotides in length, optionally less than 500 nucleotides in length. Typically, stringent conditions will be those in which the salt concentration is less than about 1.5 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30 0C for short probes (e.g., 10 to 50 nucleotides) and at least about 60 0C for long probes (e.g., greater than 50 nucleotides). Stringent conditions may also be achieved with the addition of destabilizing agents such as formamide. Exemplary low stringency conditions include hybridization with a buffer solution of 30 to 35% formamide, 1 M NaCI1 1% SDS (sodium dodecyl sulphate) at 37 0C, and a wash in 1 X to 2X SSC (2OX SSC = 3.0 M NaCI/0.3 M trisodium citrate) at 50 to 55 0C. Exemplary moderate stringency conditions include hybridization in 40 to 45% formamide, 1 M NaCI, 1 % SDS at 37 0C, and a wash in 0.5X to 1X SSC at 55 to 60 0C. Exemplary high stringency conditions include hybridization in 50% formamide, 1 M NaCI, 1% SDS at 37 0C, and a wash in 0.1X SSC at 60 to 65 0C.
Specificity is typically the function of post-hybridization washes, the critical factors being the ionic strength and temperature of the final wash solution. For
DNA-DNA hybrids, the Tm can be approximated from the equation of Meinkoth et al., Anal. Biochem. 138:267-284 (1984): Tm = 81.5 0C + 16.6 (log M) + 0.41 (%GC) - 0.61 (% form) - 500/L; where M is the molarity of monovalent cations, %GC is the percentage of guanosine and cytosine nucleotides in the DNA, % form is the percentage of formamide in the hybridization solution, and L is the length of the hybrid in base pairs. The Tm is the temperature (under defined ionic strength and pH) at which 50% of a complementary target sequence hybridizes to a perfectly matched probe. Tm is reduced by about 1°C for each 1% of mismatching; thus, Tm, hybridization and/or wash conditions can be adjusted to hybridize to sequences of the desired identity. For example, if sequences with >90% identity are sought, the Tm can be decreased 10 0C. Generally, stringent conditions are selected to be about 5 0C lower than the thermal melting point (Tm) for the specific sequence and its complement at a defined ionic strength and pH. However, severely stringent conditions can utilize a hybridization and/or wash at 1 , 2, 3, or 4 0C lower than the thermal melting point (Tm); moderately stringent conditions can utilize a hybridization and/or wash at 6, 7, 8, 9, or 10 0C lower than the thermal melting point (Tm); low stringency conditions can utilize a hybridization and/or wash at 11 , 12, 13, 14, 15, or 20 0C lower than the thermal melting point (Tm). Using the equation, hybridization and wash compositions, and desired Tm, those of ordinary skill will understand that variations in the stringency of hybridization and/or wash solutions are inherently described. If the desired degree of mismatching results in a Tm of less than 45 0C (aqueous solution) or 32 0C (formamide solution) it is preferred to increase the SSC concentration so that a higher temperature can be used. An extensive guide to the hybridization of nucleic acids is found in Tijssen, Laboratory Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic Acid Probes, Part I, Chapter 2 "Overview of principles of hybridization and the strategy of nucleic acid probe assays", Elsevier, New York (1993); and Current Protocols in Molecular Biology, Chapter 2, Ausubel et al., Eds., Greene Publishing and Wiley-lnterscience, New York (1995). Hybridization and/or wash conditions can be applied for at least 10, 30, 60, 90, 120, or 240 minutes.
"Sequence identity" or "identity" in the context of nucleic acid or polypeptide sequences refers to the nucleic acid bases or amino acid residues in two sequences that are the same when aligned for maximum correspondence over a specified comparison window.
Thus, "percentage of sequence identity" refers to the value determined by comparing two optimally aligned sequences over a comparison window, wherein the portion of the polynucleotide or polypeptide sequence in the comparison window may comprise additions or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical nucleic acid base or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison and multiplying the results by 100 to yield the percentage of sequence identity. Useful examples of percent sequence identities include, but are not limited to, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95%, or any integer percentage from 50% to 100%. These identities can be determined using any of the programs described herein.
Sequence alignments and percent identity or similarity calculations may be determined using a variety of comparison methods designed to detect homologous sequences including, but not limited to, the MegAlign™ program of the LASERGENE bioinformatics computing suite (DNASTAR Inc., Madison, Wl). Within the context of this application it will be understood that where sequence analysis software is used for analysis, that the results of the analysis will be based on the "default values" of the program referenced, unless otherwise specified. As used herein "default values" will mean any set of values or parameters that originally load with the software when first initialized.
The "Clustal V method of alignment" corresponds to the alignment method labeled Clustal V (described by Higgins and Sharp, CABIOS. 5:151-153 (1989); Higgins, D. G. et al., Comput. Appl. Biosci. 8:189-191 (1992)) and found in the
MegAlign™ program of the LASERGENE bioinformatics computing suite (DNASTAR Inc., Madison, Wl). For multiple alignments, the default values correspond to GAP PENALTY=IO and GAP LENGTH PENALTY=IO. Default parameters for pairwise alignments and calculation of percent identity of protein sequences using the Clustal V method are KTUPLE=I , GAP PENALTY=3, WINDOW=5 and DIAGONALS
SAVED=5. For nucleic acids these parameters are KTUPLE=2, GAP PENALTY=5, WINDOW=4 and DIAGONALS SAVED=4. After alignment of the sequences using the Clustal V program, it is possible to obtain a "percent identity" by viewing the "sequence distances" table in the same program. The "Clustal W method of alignment" corresponds to the alignment method labeled Clustal W (described by Higgins and Sharp, supra; Higgins, D. G. et al., supra) and found in the MegAlign™ v6.1 program of the LASERGENE bioinformatics computing suite (DNASTAR Inc., Madison, Wl). Default parameters for multiple alignment correspond to GAP PENALTY=IO, GAP LENGTH PENALTY=0.2, Delay Divergen Seqs(%)=30, DNA Transition Weight=0.5, Protein Weight Matrix=Gonnet Series, DNA Weight Matrix=IUB. After alignment of the sequences using the Clustal W program, it is possible to obtain a "percent identity" by viewing the "sequence distances" table in the same program.
"BLASTN method of alignment" is an algorithm provided by the National Center for Biotechnology Information (NCBI) to compare nucleotide sequences using default parameters.
It is well understood by one skilled in the art that many levels of sequence identity are useful in identifying polypeptides, from other species, wherein such polypeptides have the same or similar function or activity. Useful examples of percent identities include, but are not limited to, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95%, or any integer percentage from 50% to 100%. Indeed, any integer amino acid identity from 50% to 100% may be useful in describing the present invention, such as 51 %, 52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%. Also, of interest is any full-length or partial complement of this isolated nucleotide fragment. "Gene" refers to a nucleic acid fragment that expresses a specific protein and can include either the coding region alone or the coding region in addition to the regulatory sequences preceding (5' non-coding sequences) and following (3' non- coding sequences) the coding sequence. "Native gene" refers to a gene as found in nature with its own regulatory sequences. "Chimeric gene" refers to any gene that is not a native gene, comprising regulatory and coding sequences that are not found together in nature. Accordingly, a chimeric gene may comprise regulatory sequences and coding sequences that are derived from different sources, or regulatory sequences and coding sequences derived from the same source, but arranged in a manner different than that found in nature. "Endogenous gene" refers to a native gene in its natural location in the genome of an organism. A "foreign" gene refers to a gene not normally found in the host organism, but that is introduced into the host organism by gene transfer. Foreign genes can comprise native genes inserted into a non-native organism, or chimeric genes. A "transgene" is a gene that has been introduced into the genome by a transformation procedure. The term "genome" as it applies to plant cells encompasses not only chromosomal DNA found within the nucleus, but organelle DNA found within subcellular components (e.g., mitochondrial, plastid) of the cell.
A "codon-optimized gene" is a gene having its frequency of codon usage designed to mimic the frequency of preferred codon usage of the host cell. An "allele" is one of several alternative forms of a gene occupying a given locus on a chromosome. When all the alleles present at a given locus on a chromosome are the same that plant is homozygous at that locus. If the alleles present at a given locus on a chromosome differ that plant is heterozygous at that locus. "Coding sequence" refers to a DNA sequence that codes for a specific amino acid sequence. "Regulatory sequences" refer to nucleotide sequences located upstream (5' non-coding sequences), within, or downstream (3' non-coding sequences) of a coding sequence, and which influence the transcription, RNA processing or stability, or translation of the associated coding sequence. Regulatory sequences may include, but are not limited to: promoters, translation leader sequences, introns, polyadenylation recognition sequences, RNA processing sites, effector binding sites and stem-loop structures. "Promoter" refers to a DNA sequence capable of controlling the expression of a coding sequence or functional RNA. The promoter sequence consists of proximal and more distal upstream elements, the latter elements often referred to as enhancers. Accordingly, an "enhancer" is a DNA sequence that can stimulate promoter activity, and may be an innate element of the promoter or a heterologous element inserted to enhance the level or tissue-specificity of a promoter. Promoters may be derived in their entirety from a native gene, or be composed of different elements derived from different promoters found in nature, or even comprise synthetic DNA segments. It is understood by those skilled in the art that different promoters may direct the expression of a gene in different tissues or cell types, or at different stages of development, or in response to different environmental conditions. It is further recognized that since in most cases the exact boundaries of regulatory sequences have not been completely defined, DNA fragments of some variation may have identical promoter activity. Promoters that cause a gene to be expressed in most cell types at most times are commonly referred to as "constitutive promoters". New promoters of various types useful in plant cells are constantly being discovered; numerous examples may be found in the compilation by Okamuro, J. K., and Goldberg, R. B. Biochemistry of Plants 15:1-82 (1989).
"Translation leader sequence" refers to a polynucleotide sequence located between the promoter sequence of a gene and the coding sequence. The translation leader sequence is present in the fully processed mRNA upstream of the translation start sequence. The translation leader sequence may affect processing of the primary transcript to mRNA, mRNA stability or translation efficiency. Examples of translation leader sequences have been described (Turner, R. and Foster, G. D., MoI. Biotechnol. 3:225-236 (1995)). "3' non-coding sequences", "transcription terminator" or "termination sequences" refer to DNA sequences located downstream of a coding sequence, including polyadenylation recognition sequences and other sequences encoding regulatory signals capable of affecting mRNA processing or gene expression. The polyadenylation signal is usually characterized by affecting the addition of polyadenylic acid tracts to the 3' end of the mRNA precursor. The use of different 3' non-coding sequences is exemplified by Ingelbrecht, I. L., et al. Plant Cell 1 :671-680 (1989). "RNA transcript" refers to the product resulting from RNA polymerase- catalyzed transcription of a DNA sequence. When the RNA transcript is a perfect complementary copy of the DNA sequence, it is referred to as the primary transcript. An RNA transcript is referred to as the mature RNA when it is an RNA sequence derived from post-transcriptional processing of the primary transcript. "Messenger RNA" or "mRNA" refers to the RNA that is without introns and that can be translated into protein by the cell. "cDNA" refers to a DNA that is complementary to, and synthesized from, an mRNA template using the enzyme reverse transcriptase. The cDNA can be single-stranded or converted into double-stranded form using the Klenow fragment of DNA polymerase I. "Sense" RNA refers to RNA transcript that includes the mRNA and can be translated into protein within a cell or in vitro.
"Antisense RNA" refers to an RNA transcript that is complementary to all or part of a target primary transcript or mRNA, and that blocks or reduces the expression of a target gene (U.S. Patent No. 5,107,065). The complementarity of an antisense RNA may be with any part of the specific gene transcript, i.e., at the 5' non-coding sequence, 3' non-coding sequence, introns, or the coding sequence. "Functional RNA" refers to antisense RNA, ribozyme RNA, or other RNA that may not be translated but yet has an effect on cellular processes. The terms "complement" and "reverse complement" are used interchangeably herein with respect to mRNA transcripts, and are meant to define the antisense RNA of the message. The term "operably linked" refers to the association of nucleic acid sequences on a single nucleic acid fragment so that the function of one is affected by the other. For example, a promoter is operably linked with a coding sequence when it is capable of affecting the expression of that coding sequence (i.e., the coding sequence is under the transcriptional control of the promoter). Coding sequences can be operably linked to regulatory sequences in a sense or antisense orientation. In another example, the complementary RNA regions of the invention can be operably linked, either directly or indirectly, 5' to the target mRNA, or 31 to the target mRNA, or within the target mRNA, or a first complementary region is 5' and its complement is 3' to the target mRNA.
Standard recombinant DNA and molecular cloning techniques used herein are well known in the art and are described more fully in Sambrook, J., Fritsch, E. F. and Maniatis, T. Molecular Cloning: A Laboratory Manual; Cold Spring Harbor Laboratory: Cold Spring Harbor, NY (1989). Transformation methods are well known to those skilled in the art and are described infra.
"PCR" or "polymerase chain reaction" is a technique for the synthesis of large quantities of specific DNA segments and consists of a series of repetitive cycles (Perkin Elmer Cetus Instruments, Norwalk, CT). Typically, the double-stranded DNA is heat denatured, the two primers complementary to the 3' boundaries of the target segment are annealed at low temperature and then extended at an intermediate temperature. One set of these three consecutive steps is referred to as a "cycle". The term "recombinant" refers to an artificial combination of two otherwise separated segments of sequence, e.g., by chemical synthesis or by the manipulation of isolated segments of nucleic acids by genetic engineering techniques.
A "plasmid" or "vector" is an extra chromosomal element often carrying genes that are not part of the central metabolism of the cell, and usually in the form of circular double-stranded DNA fragments. Such elements may be autonomously replicating sequences, genome integrating sequences, phage or nucleotide sequences, linear or circular, of a single- or double-stranded DNA or RNA, derived from any source, in which a number of nucleotide sequences have been joined or recombined into a unique construction which is capable of introducing an expression cassette(s) into a cell. "Expression cassette" refers to a fragment of DNA containing a foreign gene and having elements in addition to the foreign gene that allow for enhanced expression of that gene in a foreign host. "Transformation cassette" refers to a fragment of DNA containing a foreign gene and having elements in addition to the foreign gene that facilitate transformation of a particular host cell.
The terms "recombinant construct", "expression construct", "chimeric construct", "construct", and "recombinant DNA construct" are used interchangeably herein. A recombinant construct comprises an artificial combination of nucleic acid fragments, e.g., regulatory and coding sequences that are not found together in nature. For example, a recombinant construct may comprise regulatory sequences and coding sequences that are derived from different sources, or regulatory sequences and coding sequences derived from the same source, but arranged in a manner different than that found in nature. Such a construct may be used by itself or may be used in conjunction with a vector. If a vector is used, then the choice of vector is dependent upon the method that will be used to transform host cells as is well known to those skilled in the art. For example, a plasmid vector can be used. The skilled artisan is well aware of the genetic elements that must be present on the vector in order to successfully transform, select and propagate host cells comprising any of the isolated nucleic acid fragments of the invention. The skilled artisan will also recognize that different independent transformation events will result in different levels and patterns of expression (Jones et al., EMBO J. 4:2411-2418 (1985); De Almeida et al., MoI. Gen. Genetics 218:78-86 (1989)), and thus that multiple events must be screened in order to obtain lines displaying the desired expression level and pattern. Such screening may be accomplished by Southern analysis of DNA, Northern analysis of mRNA expression, immunoblotting analysis of protein expression, or phenotypic analysis, among others.
The term "expression", as used herein, refers to the production of a functional end-product (e.g., an mRNA or a protein [either precursor or mature]).
The term "introduced" means providing a nucleic acid (e.g., expression construct) or protein into a cell. Introduced includes reference to the incorporation of a nucleic acid into a eukaryotic or prokaryotic cell where the nucleic acid may be incorporated into the genome of the cell, and includes reference to the transient provision of a nucleic acid or protein to the cell. Introduced includes reference to stable or transient transformation methods, as well as sexually crossing. Thus, "introduced" in the context of inserting a nucleic acid fragment (e.g., a recombinant construct/expression construct) into a cell, means "transfection" or "transformation" or "transduction" and includes reference to the incorporation of a nucleic acid fragment into a eukaryotic or prokaryotic cell where the nucleic acid fragment may be incorporated into the genome of the cell (e.g., chromosome, plasmid, plastid or mitochondrial DNA), converted into an autonomous replicon, or transiently expressed (e.g., transfected mRNA). "Mature" protein refers to a post-translationally processed polypeptide (i.e., one from which any pre- or propeptides present in the primary translation product have been removed). "Precursor" protein refers to the primary product of translation of mRNA (i.e., with pre- and propeptides still present). Pre- and propeptides may be but are not limited to intracellular localization signals.
"Stable transformation" refers to the transfer of a nucleic acid fragment into a genome of a host organism, including both nuclear and organellar genomes, resulting in genetically stable inheritance. In contrast, "transient transformation" refers to the transfer of a nucleic acid fragment into the nucleus, or DNA-containing organelle, of a host organism resulting in gene expression without integration or stable inheritance. Host organisms containing the transformed nucleic acid fragments are referred to as "transgenic" organisms.
As used herein, "transgenic" refers to a plant or a cell which comprises within its genome a heterologous polynucleotide. Preferably, the heterologous polynucleotide is stably integrated within the genome such that the polynucleotide is passed on to successive generations. The heterologous polynucleotide may be integrated into the genome alone or as part of an expression construct. Transgenic is used herein to include any cell, cell line, callus, tissue, plant part or plant, the genotype of which has been altered by the presence of heterologous nucleic acid including those transgenics initially so altered as well as those created by sexual crosses or asexual propagation from the initial transgenic. The term "transgenic" as used herein does not encompass the alteration of the genome (chromosomal or extra-chromosomal) by conventional plant breeding methods or by naturally occurring events such as random cross-fertilization, non-recombinant viral infection, non-recombinant bacterial transformation, non-recombinant transposition, or spontaneous mutation.
"Antisense inhibition" refers to the production of antisense RNA transcripts capable of suppressing the expression of the target protein. "Co-suppression" refers to the production of sense RNA transcripts capable of suppressing the expression of identical or substantially similar foreign or endogenous genes (U.S. Patent No. 5,231 ,020). Co-suppression constructs in plants previously have been designed by focusing on overexpression of a nucleic acid sequence having homology to an endogenous mRNA, in the sense orientation, which results in the reduction of all RNA having homology to the overexpressed sequence (Vaucheret et al., Plant J. 16:651-659 (1998); Gura, Nature 404:804-808 (2000)). The overall efficiency of this phenomenon is low, and the extent of the RNA reduction is widely variable. More recent work has described the use of "hairpin" structures that incorporate all, or part, of an mRNA encoding sequence in a complementary orientation that results in a potential "stem-loop" structure for the expressed RNA (PCT Publication No. WO 99/53050; PCT Publication No. WO 02/00904). This increases the frequency of co-suppression in the recovered transgenic plants. Another variation describes the use of plant viral sequences to direct the suppression, or "silencing", of proximal mRNA encoding sequences (PCT
Publication No. WO 98/36083). Both of these co-suppressing phenomena have not been elucidated mechanistically, although genetic evidence has begun to unravel this complex situation (Elmayan et al., Plant Cell 10:1747-1757 (1998)).
The term "oleaginous" refers to those organisms that tend to store their energy source in the form of lipid (Weete, In: Fungal Lipid Biochemistry, 2nd Ed., Plenum, 1980). A class of plants identified as oleaginous are commonly referred to as "oilseed" plants. Examples of oilseed plants include, but are not limited to: soybean (Glycine and Soja sp.), flax (Linum sp.), rapeseed (Brassica sp.), maize, cotton, safflower (Carthamus sp.) and sunflower (Helianthus sp.). Within oleaginous microorganisms the cellular oil or TAG content generally follows a sigmoid curve, wherein the concentration of lipid increases until it reaches a maximum at the late logarithmic or early stationary growth phase and then gradually decreases during the late stationary and death phases (Yongmanitchai and Ward, Appl. Environ. Microbiol. 57:419-25 (1991)). The term "oleaginous yeast" refers to those microorganisms classified as yeasts that make oil. It is not uncommon for oleaginous microorganisms to accumulate in excess of about 25% of their dry cell weight as oil. Examples of oleaginous yeast include, but are no means limited to, the following genera: Yarrowia, Candida, Rhodotorula, Rhodosporidium, Cryptococcus, Trichosporon and Lipomyces. As used herein, the term "biomass" refers specifically to spent or used yeast cellular material resulting from the fermentation of a recombinant production host producing PUFAs in commercially significant amounts, wherein the preferred production host is a recombinant strain of the oleaginous yeast, Yarrowia lipolytica. The biomass may be in the form of whole cells, whole cell lysates, homogenized cells, partially hydrolyzed cellular material, and/or partially purified cellular material (e.g., microbially produced oil).
The term "Euglenophyceae" refers to a group of unicellular colorless or photosynthetic flagellates ("euglenoids") found living in freshwater, marine, soil, and parasitic environments. The class is characterized by solitary unicells, wherein most are free-swimming and have two flagella (one of which may be nonemergent) arising from an anterior invagination known as a reservoir. Photosynthetic euglenoids contain one to many grass-green chloroplasts, which vary from minute disks to expanded plates or ribbons. Colorless euglenoids depend on osmotrophy or phagotrophy for nutrient assimilation. About 1000 species have been described and classified into about 40 genera and 6 orders. Examples of Euglenophyceae include, but are by no means limited to, the following genera: Euglena, Eutreptiella and Tetruetreptia. The term "plant" refers to whole plants, plant organs, plant tissues, seeds, plant cells, seeds and progeny of the same. Plant cells include, without limitation, cells from seeds, suspension cultures, embryos, meristematic regions, callus tissue, leaves, roots, shoots, gametophytes, sporophytes, pollen and microspores. "Progeny" comprises any subsequent generation of a plant. An Overview: Microbial Biosynthesis of Fatty Acids and Triacylglvcerols
In general, lipid accumulation in oleaginous microorganisms is triggered in response to the overall carbon to nitrogen ratio present in the growth medium. This process, leading to the de novo synthesis of free palmitate (16:0) in oleaginous microorganisms, is described in detail in U.S. Patent 7,238,482. Palmitate is the precursor of longer-chain saturated and unsaturated fatty acid derivates, which are formed through the action of elongases and desaturases (FIG. 1).
TAGs (the primary storage unit for fatty acids) are formed by a series of reactions that involve: (1) the esterification of one molecule of acyl-CoA to glycerol- 3-phosphate via an acyltransferase to produce lysophosphatidic acid; (2) the esterification of a second molecule of acyl-CoA via an acyltransferase to yield 1,2- diacylglycerol phosphate (commonly identified as phosphatidic acid); (3) removal of a phosphate by phosphatidic acid phosphatase to yield 1,2-diacylglycerol (DAG); and (4) the addition of a third fatty acid by the action of an acyltransferase to form TAG. A wide spectrum of fatty acids can be incorporated into TAGs, including saturated and unsaturated fatty acids and short-chain and long-chain fatty acids. Biosynthesis of Omega Fatty Acids
The metabolic process wherein oleic acid is converted to long chain omega- 3/omega-6 fatty acids involves elongation of the carbon chain through the addition of carbon atoms and desaturation of the molecule through the addition of double bonds. This requires a series of special desaturation and elongation enzymes present in the endoplasmic reticulum membrane. However, as seen in FIG. 1 and as described below, there are often multiple alternate pathways for production of a specific long chain omega-3/omega-6 fatty acid.
Specifically, all pathways require the initial conversion of oleic acid to LA, the first of the omega-6 fatty acids, by a delta-12 desaturase. Then, using the "delta-9 elongase/delta-8 desaturase pathway" and LA as substrate, long chain omega-6 fatty acids are formed as follows: (1) LA is converted to EDA by a delta-9 elongase; (2) EDA is converted to DGLA by a delta-8 desaturase; (3) DGLA is converted to ARA by a delta-5 desaturase; (4) ARA is converted to DTA by a C20/22 elongase; and, (5) DTA is converted to DPAn-6 by a delta-4 desaturase. Alternatively, the "delta-9 elongase/delta-8 desaturase pathway" can use ALA as substrate to produce long chain omega-3 fatty acids as follows: (1) LA is converted to ALA, the first of the omega-3 fatty acids, by a delta-15 desaturase; (2) ALA is converted to ETrA by a delta-9 elongase; (3) ETrA is converted to ETA by a delta-8 desaturase; (4) ETA is converted to EPA by a delta-5 desaturase; (5) EPA is converted to DPA by a C20/22 elongase; and (6) DPA is converted to DHA by a delta-4 desaturase. Optionally, omega-6 fatty acids may be converted to omega-3 fatty acids; for example, ETA and EPA are produced from DGLA and ARA, respectively, by delta-17 desaturase activity.
Alternate pathways for the biosynthesis of omega-3/omega-6 fatty acids utilize a delta-6 desaturase and C18/20 elongase (also known as delta-6 elongase, the terms can be used interchangeably) (i.e., the "delta-6 desaturase/delta-6 elongase pathway"). More specifically, LA and ALA may be converted to GLA and STA, respectively, by a delta-6 desaturase; then, a C18/20 elongase converts GLA to DGLA and/or STA to ETA. It is contemplated that the particular functionalities required to be introduced into a specific host organism for production of omega-3/omega-6 fatty acids will depend on the host cell (and its native PUFA profile and/or desaturase/elongase profile), the availability of substrate, and the desired end product(s). For example, expression of the delta-9 elongase/delta-8 desaturase pathway may be preferred in some embodiments, as opposed to expression of the delta-6 desaturase/delta-6 elongase pathway, since PUFAs produced via the former pathway are devoid of GLA.
One skilled in the art will be able to identify various candidate genes encoding each of the enzymes desired for omega-3/omega-6 fatty acid biosynthesis. Useful desaturase and elongase sequences may be derived from any source, e.g., isolated from a natural source (from bacteria, algae, fungi, plants, animals, etc.), produced via a semi-synthetic route or synthesized de novo. Although the particular source of the desaturase and elongase genes introduced into the host is not critical, considerations for choosing a specific polypeptide having desaturase or elongase activity include: (1 ) the substrate specificity of the polypeptide; (2) whether the polypeptide or a component thereof is a rate-limiting enzyme; (3) whether the desaturase or elongase is essential for synthesis of a desired PUFA; (4) co-factors required by the polypeptide; and/or, (5) whether the polypeptide is modified after its production (e.g., by a kinase or a prenyltransferase). The expressed polypeptide preferably has parameters compatible with the biochemical environment of its location in the host cell (see U.S. Patent 7,238,482 for additional details).
In additional embodiments, it will also be useful to consider the conversion efficiency of each particular desaturase and/or elongase. More specifically, since each enzyme rarely functions with 100% efficiency to convert substrate to product, the final lipid profile of unpurified oils produced in a host cell will typically be a mixture of various PUFAs consisting of the desired omega-3/omega-6 fatty acid, as well as various upstream intermediary PUFAs. Thus, each enzyme's conversion efficiency is also a variable to consider when optimizing biosynthesis of a desired fatty acid.
With each of the considerations above in mind, candidate genes having the appropriate desaturase and elongase activities (e.g., delta-6 desaturases, C18/20 elongases, delta-5 desaturases, delta-17 desaturases, delta-15 desaturases, delta-9 desaturases, delta-12 desaturases, C14/16 elongases, C16/18 elongases, delta-9 elongases, delta-8 desaturases, delta-4 desaturases, C20/22 elongases and DHA synthases) can be identified according to publicly available literature (e.g., GenBank), the patent literature, and experimental analysis of organisms having the ability to produce PUFAs. These genes will be suitable for introduction into a specific host organism, to enable or enhance the organism's synthesis of PUFAs. Multizvmes and Linkers
In one embodiment, the present invention concerns a multizyme comprising a single polypeptide having at least two independent and separable enzymatic activities
Examples of suitable enzymatic activities include elongases, fatty acid desaturases, transferases, acyl CoA synthases and thioesterases. For example, suitable fatty acid desaturases include, but are not limited to: delta-4 desaturase, delta-5 desaturase, delta-6 desaturase, delta-8 desaturase, delta-9 desaturase, delta-12 desaturase, delta-15 desaturase, and/or delta-17 desaturase. Examples of suitable elongases include, but are not limited to: delta-9 elongase, C14/16 elongase, Ci6/i8 elongase, C18/20 elongase, and/or C20/22 elongase.
Examples of suitable transferases include but are not limited to acyl transferases such as glycerol-3-phosphate O-acyltransferase (also called glycerol - phosphate acyl transferase or glycerol -3-phosphate acyl transferase; GPAT), 2- acylglycerol O-acyltransferase, 1-acylglycerol-3-phosphate O-acyltransferase (also called 1-acylglycerol-phosphate acyltransferase or lyso-phosphatidic acid acyltransferase; AGPAT or LPAAT or LPAT), 2-acylglycerol-3-phosphate O- acyltransferase, 1-acylglycerophosphocholine O-acyltransferase (also called lyso- lecithin acyltransferase or lyso-phosphatidylcholine acyltransferase; AGPCAT or LLAT or LPCAT), 2-acylglycerophosphocholine O-acyltransferase, diacylglycerol O- acyltransferase (also called diglyceride acyltransferase; DAGAT or DGAT) and phospholipid:diacylglycerol acyltransferase (PDAT).
An example of a suitable acyl CoA synthetase includes but is not limited to Iong-chain-fatty-acid-CoA ligase (also called acyl-activating enzyme or acyl-CoA synthetase). An example of a suitable thioesterase includes but is not limited to oleoyl- [acyl-carrier-protein] hydrolase (also called acyl-[acyl-carrier-protein] hydrolase, acyl-ACP-hydrolase or acyl-ACP-thioesterase).
Preferably, the instant multizyme should have enzymatic activities comprising at least one fatty acid elongase linked to at least one fatty acid desaturase.
The link used to form the multizyme is minimally comprised of a single polypeptide bond. In another aspect, the link may be comprised of one amino acid residue, such as proline, or a polypeptide. It may be desirable that if the link is a polypeptide then it has at least one proline amino acid residue. Preferably, the multizyme of the invention comprises a first enzymatic activity linked to a second enzymatic activity and the link is selected from the group consisting of a polypeptide bond, SEQ ID NO:198 (EgDHAsyni linker), SEQ ID NO:200 (EgDHAsyn2 linker), SEQ ID NO:235 (EaDHAsyni linker), SEQ ID NO:472, SEQ ID NO:504, and modified Yarrowia lipolytics linkers (SEQ ID NOs:438 and 445).
Also within the scope of this invention is a method for making a multizyme which comprises:
(a) linking a first polypeptide with at least a second polypeptide wherein each polypeptide has an independent and separable enzymatic activity; and
(b) evaluating the product of step (a) for the independent and separable enzymatic activities.
As was discussed above, the enzymatic activities are selected from the group consisting of fatty acid elongases, fatty acid desaturases, acyl transferases, acyl CoA synthases and thioeste rases. Preferably, the enzymatic activities comprise at least one fatty acid elongase linked to at least one fatty acid desaturase.
Examples of suitable desaturases, elongases and linkers are discussed above.
Although numerous examples of multizymes are described above, DHA synthases (comprising both C20 elongase activity and delta-4 desaturase activity) and DGLA synthases (comprising both delta-9 elongase and delta-8 desaturase activity) are of particular interest. Data described herein confirm that linking of the two domains within each synthase results in increased efficiency or flux, as compared to efficiency or flux observed when the enzymatic domains exist as independent entities, i.e., not linked together in a multizyme..
For example, when a mulltizyme comprising the Euglena gracilis C20 elongase domain and a Schizochytrium aggregatum delta-4 desaturase was expressed in Yarrowia lipolytica, the delta-4 desaturase activity was approximately 2 to 3-fold greater in the fused construct, as opposed to its activity when expressed alone (Example 28). Similarly, when the Euglena gracilis C20 elongase domain- Schizochytrium aggregatum delta-4 desaturase fusion was expressed as a multizyme in soybean, increased EPA to DHA flux was measured, as opposed to when the two enzymes were expressed independently (Example 49).
Increased efficiency (or LA to DGLA flux) was also demonstrated in various DGLA synthases that were created. A series of six delta-9 elongase/delta-8 desaturase fusion constructs were created using various combinations of delta-9 elongases derived from E. gracillis, E. anabaena UTEX 373 and Eutreptiella sp. CCMP389 and delta-8 desaturases derived from E. gracillis and E. anabaena UTEX 373; these were individually expressed in Yarrowia lipolytica (Examples 55 and 56, respectively). In all cases, the fusion gene had higher activity than the individual gene alone when expressed in Yarrowia. These data again suggested that the product of delta-9 elongase may be directly channeled as substrate of delta-8 desaturase in the fusion protein. One skilled in the art would be able to use the teachings herein to create various other multizymes that have increased efficiency or flux. Accordingly, the invention relates to any multizyme that is made using a linker derived from the sequences of the invention. Preferred multizymes are those that combine various genes of the PUFA biosynthetic pathway. Sequence Identification of Novel DHA Synthases
In the present invention, nucleotide sequences encoding DHA synthases have been isolated from Euglena gracilis and Euglena anabaena, as summarized below in Table 4.
TABLE 4 Summary Of Euαlena DHA Synthases
Figure imgf000059_0001
Figure imgf000060_0001
In some embodiments, the instant EgDHAsyni , EgDHAsyn2, EaDHAsyni , EaDHAsyn2 and EaDHAsyn3 DHA synthase sequences can be codon-optimized for expression in a particular host organism. As is well known in the art, this can be a useful means to further optimize the expression of the enzyme in the alternate host, since use of host-preferred codons can substantially enhance the expression of the foreign gene encoding the polypeptide. EgDHAsyni , for example, was codon- optimized for expression in Yarrowia lipolytica (example 54), thereby yielding EgDHAsyniS (as taught in U.S. Patent 7,238,482 and U.S. Patent 7,125,672). One skilled in the art would be able to use the teachings herein to create various other codon-optimized DHA synthase proteins suitable for optimal expression in alternate hosts, based on the wildtype EgDHAsyni , EgDHAsyn2, EaDHAsyni , EaDHAsyn2 and/or EaDHAsyn3 sequences described above in Table 4. Accordingly, the instant invention relates to any codon-optimized DHA synthase protein that is derived from a wildtype sequence of the instant invention. In some preferred embodiments, it may be desirable to modify a portion of the codons encoding EgDHAsyni , EgDHAsyn2, EaDHAsyni , EaDHAsyn2 and/or EaDHAsyn3 to enhance expression of the gene in a host organism including, but not limited to, a plant or plant part. In another embodiment, the present invention concerns an isolated polynucleotide encoding a DHA synthase comprising:
(a) a nucleotide sequence encoding a polypeptide having DHA synthase activity, wherein the polypeptide has at least 80% amino acid identity, based on the Clustal V method of alignment, when compared to an amino acid sequence as set forth in SEQ ID NO:12, SEQ ID NO:22, SEQ ID NO:95, SEQ ID NO:96, or SEQ ID NO:97;
(b) a nucleotide sequence encoding a polypeptide having DHA synthase activity wherein the nucleotide sequence has at least 80% sequence identity, based on the BLASTN method of alignment, when compared to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:205, SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, or SEQ ID NO:410;
(c) a nucleotide sequence encoding a polypeptide having DHA synthase activity, wherein the nucleotide sequence hybridizes under stringent conditions to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:205, or SEQ ID NO:410; or
(d) a complement of the nucleotide sequence of (a), (b) or (c), wherein the complement and the nucleotide sequence consist of the same number of nucleotides and are 100% complementary.
In still another aspect, this invention concerns an isolated polynucleotide comprising:
(a) a nucleotide sequence encoding a polypeptide having DHA synthase activity, wherein the polypeptide has at least 80% amino acid identity, based on the Clustal V method of alignment, when compared to an amino acid sequence as set forth in SEQ ID NO:12, SEQ ID NO:22, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, or SEQ ID NO:411 ;
(b) a nucleotide sequence encoding a polypeptide having DHA synthase activity wherein the nucleotide sequence has at least 80% sequence identity, based on the BLASTN method of alignment, when compared to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:205, SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93 or SEQ ID NO:410;
(c) a nucleotide sequence encoding a polypeptide having DHA synthase activity, wherein the nucleotide sequence hybridizes under stringent conditions to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:205, SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93 or SEQ ID NO:410; or
(d) a complement of the nucleotide sequence of (a), (b) or (c), wherein the complement and the nucleotide sequence consist of the same number of nucleotides and are 100% complementary. Preferably, an isolated polynucleotide encoding a DHA synthase comprises the sequence set forth in any of SEQ ID NO:11 , SEQ ID NO:205, SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, or SEQ ID NO:410. Identification and Isolation of Homologs Any of the instant DHA synthase sequences (i.e., EgDHAsyni , EgDHAsyn2,
EaDHAsyni , EaDHAsyn2 and EaDHAsyn3) or portions thereof may be used to search for DHA synthase homologs in the same or other bacterial, algal, fungal, euglenoid or plant species using sequence analysis software. In general, such computer software matches similar sequences by assigning degrees of homology to various substitutions, deletions, and other modifications.
Alternatively, any of the instant DHA synthase sequences or portions thereof may also be employed as hybridization reagents for the identification of DHA synthase homologs. The basic components of a nucleic acid hybridization test include a probe, a sample suspected of containing the gene or gene fragment of interest and a specific hybridization method. Probes of the present invention are typically single-stranded nucleic acid sequences that are complementary to the nucleic acid sequences to be detected. Probes are "hybridizable" to the nucleic acid sequence to be detected. Although the probe length can vary from 5 bases to tens of thousands of bases, typically a probe length of about 15 bases to about 30 bases is suitable. Only part of the probe molecule needs to be complementary to the nucleic acid sequence to be detected. In addition, the complementarity between the probe and the target sequence need not be perfect. Hybridization does occur between imperfectly complementary molecules with the result that a certain fraction of the bases in the hybridized region are not paired with the proper complementary base.
Hybridization methods are well defined. Typically the probe and sample must be mixed under conditions that will permit nucleic acid hybridization. This involves contacting the probe and sample in the presence of an inorganic or organic salt under the proper concentration and temperature conditions. The probe and sample nucleic acids must be in contact for a long enough time that any possible hybridization between the probe and sample nucleic acid may occur. The concentration of probe or target in the mixture will determine the time necessary for hybridization to occur. The higher the probe or target concentration, the shorter the hybridization incubation time needed. Optionally, a chaotropic agent may be added (e.g., guanidinium chloride, guanidinium thiocyanate, sodium thiocyanate, lithium tetrachloroacetate, sodium perchlorate, rubidium tetrachloroacetate, potassium iodide, cesium trifluoroacetate). If desired, one can add formamide to the hybridization mixture, typically 30-50% (v/v).
Various hybridization solutions can be employed. Typically, these comprise from about 20 to 60% volume, preferably 30%, of a polar organic solvent. A common hybridization solution employs about 30-50% v/v formamide, about 0.15 to 1 M sodium chloride, about 0.05 to 0.1 M buffers (e.g., sodium citrate, Tris-HCI, PIPES or HEPES (pH range about 6-9)), about 0.05 to 0.2% detergent (e.g., sodium dodecylsulfate), or between 0.5-20 mM EDTA, FICOLL (Pharmacia Inc.) (about 300-500 kdal), polyvinylpyrrolidone (about 250-500 kdal), and serum albumin. Also included in the typical hybridization solution will be unlabeled carrier nucleic acids from about 0.1 to 5 mg/mL, fragmented nucleic DNA (e.g., calf thymus or salmon sperm DNA, or yeast RNA), and optionally from about 0.5 to 2% wt/vol glycine.
Other additives may also be included, such as volume exclusion agents that include a variety of polar water-soluble or swellable agents (e.g., polyethylene glycol), anionic polymers (e.g., polyacrylate or polymethylacrylate) and anionic saccharidic polymers (e.g., dextran sulfate). Nucleic acid hybridization is adaptable to a variety of assay formats. One of the most suitable is the sandwich assay format. The sandwich assay is particularly adaptable to hybridization under non-denaturing conditions. A primary component of a sandwich-type assay is a solid support. The solid support has adsorbed to it or covalently coupled to it an immobilized nucleic acid probe that is unlabeled and complementary to one portion of the sequence.
In additional embodiments, any of the DHA synthase nucleic acid fragments described herein (or any homologs identified thereof) may be used to isolate genes encoding homologous proteins from the same or other bacterial, algal, fungal, euglenoid or plant species. Isolation of homologous genes using sequence- dependent protocols is well known in the art. Examples of sequence-dependent protocols include, but are not limited to: (1) methods of nucleic acid hybridization; (2) methods of DNA and RNA amplification, as exemplified by various uses of nucleic acid amplification technologies [e.g., polymerase chain reaction (PCR), Mullis et al., U.S. Patent 4,683,202; ligase chain reaction (LCR), Tabor et al., Proc. Acad. Sci. USA 82:1074 (1985); or strand displacement amplification (SDA), Walker et al., Proc. Natl. Acad. Sci. U.S.A., 89:392 (1992)]; and (3) methods of library construction and screening by complementation. For example, genes encoding similar proteins or polypeptides to a multizyme or an individual domain thereof (such as the DHA synthases) described herein, could be isolated directly by using all or a portion of the instant nucleic acid fragments as DNA hybridization probes to screen libraries from e.g., any desired yeast or fungus using methodology well known to those skilled in the art (wherein those organisms producing DTA, DPAn-6, DPA and/or DHA would be preferred). Specific oligonucleotide probes based upon the instant nucleic acid sequences can be designed and synthesized by methods known in the art (Maniatis, supra). Moreover, the entire sequences can be used directly to synthesize DNA probes by methods known to the skilled artisan (e.g., random primers DNA labeling, nick translation or end-labeling techniques), or RNA probes using available in vitro transcription systems. In addition, specific primers can be designed and used to amplify a part of (or full-length of) the instant sequences. The resulting amplification products can be labeled directly during amplification reactions or labeled after amplification reactions, and used as probes to isolate full-length DNA fragments under conditions of appropriate stringency.
Typically, in PCR-type amplification techniques, the primers have different sequences and are not complementary to each other. Depending on the desired test conditions, the sequences of the primers should be designed to provide for both efficient and faithful replication of the target nucleic acid. Methods of PCR primer design are common and well known in the art (Thein and Wallace, "The use of oligonucleotide as specific hybridization probes in the Diagnosis of Genetic Disorders", in Human Genetic Diseases: A Practical Approach, K. E. Davis Ed., (1986) pp 33-50, IRL: Herndon, VA; and Rychlik, W., In Methods in Molecular Biology. White, B. A. Ed., (1993) Vol. 15, pp 31-39, PCR Protocols: Current Methods and Applications. Humania: Totowa, NJ).
Generally two short segments of the instant sequences may be used in PCR protocols to amplify longer nucleic acid fragments encoding homologous genes from DNA or RNA. PCR may also be performed on a library of cloned nucleic acid fragments wherein the sequence of one primer is derived from the instant nucleic acid fragments, and the sequence of the other primer takes advantage of the presence of the polyadenylic acid tracts to the 31 end of the mRNA precursor encoding eukaryotic genes. Alternatively, the second primer sequence may be based upon sequences derived from the cloning vector. For example, the skilled artisan can follow the RACE protocol (Frohman et al., Proc. Natl Acad. ScL U.S.A., 85:8998 (1988)) to generate cDNAs by using PCR to amplify copies of the region between a single point in the transcript and the 3' or 5' end. Primers oriented in the 3' and 51 directions can be designed from the instant sequences. Using commercially available 31 RACE or 5' RACE systems (Gibco/BRL, Gaithersburg, MD), specific 3' or 5' cDNA fragments can be isolated (Ohara et al., Proc. Natl Acad. Sci. U.S.A., 86:5673 (1989); Loh et al., Science 243:217 (1989)).
In other embodiments, any of the enzymes (e.g., multizymes, DHA synthases, or individual domains described herein) may be modified. As is well known to those skilled in the art, in vitro mutagenesis and selection, chemical mutagenesis, "gene shuffling" methods or other means can be employed to obtain mutations of naturally occurring genes. Alternatively, multizymes may be synthesized by domain swapping, wherein a functional domain from any enzyme may be exchanged with or added to a functional domain in an alternate enzyme to thereby result in a novel protein. Sequence Identification Of Novel C20 Elonqases
In the present invention, nucleotide sequences encoding C20 elongases have been isolated from Euglena gracilis and Euglena anabaena, as summarized below in Table 5.
TABLE 5 Summary Of Euαlena C20 Elonqases
Figure imgf000065_0001
Figure imgf000066_0001
The instant invention concerns an isolated polynucleotide encoding a C20 elongase comprising:
(a) a nucleotide sequence encoding a polypeptide having C20 elongase activity, wherein the polypeptide has at least 80% amino acid identity, based on the Clustal V method of alignment, when compared to an amino acid sequence as set forth in SEQ ID NO:12, SEQ ID NO:22, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97; SEQ ID NO:202, SEQ ID NO:204, SEQ ID NO:231 , SEQ ID NO:232, or SEQ ID NO:233; (b) a nucleotide sequence encoding a polypeptide having C20 elongase activity wherein the nucleotide sequence has at least 80% sequence identity, based on the BLASTN method of alignment, when compared to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:183, SEQ ID NO:188, SEQ ID NO:201 , SEQ ID NO:206, SEQ ID NO:203, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, or SEQ ID NO:230;
(c) a nucleotide sequence encoding a polypeptide having C20 elongase activity, wherein the nucleotide sequence hybridizes under stringent conditions to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:183, SEQ ID NO:188, SEQ ID NO:201 , SEQ ID NO:206, SEQ ID NO:203, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230; or
(d) a complement of the nucleotide sequence of (a), (b) or (c), wherein the complement and the nucleotide sequence consist of the same number of nucleotides and are 100% complementary.
Preferably, an isolated polynucleotide encoding a C20 elongase, comprises the sequence set forth in any of SEQ ID NO:183, SEQ ID NO:188, SEQ ID NO:201 , SEQ ID NO:206, SEQ ID NO:203, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, or SEQ ID NO:230. Sequence Identification of Novel Delta-4 Desaturases
In the present invention, nucleotide sequences encoding delta-4 desaturases have been isolated from Euglena gracilis and Euglena anabaena, as summarized below in Table 6.
TABLE 6 Summary Of Euplena Delta-4 Desaturases
Figure imgf000067_0001
Figure imgf000068_0001
* Note: The delta-4 desaturase domain 1 does not include the proline-rich linker of the DHA synthase from which it was derived. In contrast, the delta-4 desaturase domain 2 does include the proline-rich linker of the DHA synthase from which it was derived. In alternate embodiments, the instant delta-4 desaturase domain sequences can be codon-optimized for expression in a particular host organism. For example, the Euglena anabaena delta-4 desaturase domain of EaDHAsyn2 was codon- optimized for expression in Yarrowia lipolytica. For example, the Euglena gracilis delta-4 desaturase domain of EgDHAsyni was also codon-optimized for expression in Yarrowia lipolyticaOne skilled in the art would be able to use the teachings herein to create various other codon-optimized delta-4 desaturase proteins suitable for optimal expression in alternate hosts, based on the wildtype delta-4 desaturase domain sequences of EgDHAsyni , EgDHAsyn2, EaDHAsyni , EaDHAsyn2 and/or EaDHAsyn3 as described above in Table 6. Accordingly, the instant invention relates to any codon-optimized delta-4 desaturase protein that is derived from a wildtype sequence of the instant invention. In some preferred embodiments, it may be desirable to modify a portion of the codons encoding the delta-4 desaturase domain sequences of EgDHAsyni , EgDHAsyn2, EaDHAsyni , EaDHAsyn2 and/or EaDHAsyn3 to enhance expression of the gene in a host organism including, but not limited to, a plant or plant part.
Moreover, based on the observation that the C-terminal portion of the C20 elongase domain of the DHA synthases appears to overlap with the N-terminal portion of the delta-4 desaturase domain, functional analyses were performed to define the optimal functional delta-4 desaturase domain. As described in Examples 51 and 53 hereinbelow, deletion mutagenesis studies were performed using the codon-optimized protein sequences, EaD4S (SEQ ID NO:193) and EgD4S (SEQ ID NO:388). The following variants were produced: EaD4S-3 (SEQ ID NO:386), EaD4S-2 (SEQ ID NO:384), EaD4S-1 (SEQ ID NO:382), EgD4S-3 (SEQ ID NO:408), EgD4S-2 (SEQ ID NO:406) and EgD4S-1 (SEQ ID NO:404). One skilled in the art will recognize that since the exact boundaries of these particular delta-4 desaturase sequences from Euglena gracilis and Euglena anabaena have not been completely defined, protein fragments or polypeptides of increased or diminished lengths may have comparable delta-4 desaturase activity. Similarly, comparable truncations could readily be performed based on the wildtype delta-4 desaturase domain sequences of EgDHAsyni , EgDHAsyn2, EaDHAsyni , EaDHAsyn2 and/or EaDHAsyn3 as described above in Table 6, to produce a delta- 4 desaturase having a sufficient amount of delta-4 desaturase activity, wherein equivalent or increased delta-4 desaturase activity would be preferred. Thus, the instant invention further concerns an isolated polynucleotide encoding a delta-4 desaturase comprising:
(a) a nucleotide sequence encoding a polypeptide having delta-4 desaturase activity, wherein the polypeptide has at least 80% amino acid identity, based on the Clustal V method of alignment, when compared to an amino acid sequence as set forth in SEQ ID NO:12, SEQ ID NO:22, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:215, SEQ ID NO:217, SEQ ID NO:221 , SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , SEQ ID NO:246, SEQ ID NO:247, SEQ ID NO:248, SEQ ID NO:249, SEQ ID NO:193, SEQ ID NO:382, SEQ ID NO:384, SEQ ID NO:386, SEQ ID NO:388, SEQ ID NO:404, SEQ ID NO:406, or SEQ ID NO:408; (b) a nucleotide sequence encoding a polypeptide having delta-4 desaturase activity wherein the nucleotide sequence has at least 80% sequence identity, based on the BLASTN method of alignment, when compared to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:214, SEQ ID NO:216, SEQ ID NO:220, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:242, SEQ ID
NO:243, SEQ ID NO:244, SEQ ID NO:245, SEQ ID NO:192, SEQ ID NO:381 , SEQ ID NO:383, SEQ ID NO:385, SEQ ID NO:387, SEQ ID NO:403, SEQ ID NO:405 or SEQ ID NO:407;
(c) a nucleotide sequence encoding a polypeptide having delta-4 desaturase activity, wherein the nucleotide sequence hybridizes under stringent conditions to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:214, SEQ ID NO:216, SEQ ID NO:220, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:242, SEQ ID NO:243, SEQ ID NO:244, SEQ ID NO:245, SEQ ID NO:192, SEQ ID NO:381 , SEQ ID NO:383, SEQ ID NO:385, SEQ ID NO:387, SEQ ID NO:403, SEQ ID NO:405 or SEQ ID NO:407; or
(d) a complement of the nucleotide sequence of (a), (b) or (c), wherein the complement and the nucleotide sequence consist of the same number of nucleotides and are 100% complementary.
Preferably, an isolated polynucleotide encoding a delta-4 desaturase comprises the sequence set forth in any of SEQ ID NO:214, SEQ ID NO:220, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:242, SEQ ID NO:243, SEQ ID NO:244, SEQ ID NO:245, SEQ ID NO:192, SEQ ID NO:381 , SEQ ID NO:383, SEQ ID NO:385, SEQ ID NO:387, SEQ ID NO:403, SEQ ID NO:405, or SEQ ID NO:407.
The effect of truncating the Euglena anabaena delta-4 desaturase is that enzymatic activity is increased when compared to enzymatic activity of the wildtype sequence . This result is unexpected and unforeseeable, as one of ordinary skill in the art would expect the activity of a truncated sequence to be no better and possibly less active than the wildtype sequence. Accordingly, the invention also provides a new method for deriving a delta-4 desaturase having higher activity than the wildtype sequence, the method comprising: a) providing a wild-type delta-4 desaturase polypeptide isolated from Euglena anabena having a base-line delta-4 desaturase activity; and b) truncating the wild-type polypeptide of (a) by about 1 to about 200 amino acids (a) to create a truncated mutant polypeptide having delta-4 desaturase activity that is increased as compared with the baseline delta-4 desaturase activity. "Baseline" activity as used in this context is defined as the activity of the wildtype enzyme measured either in vivo or in vitro according to standard enzymatic protocols as described herein.
In other embodiments, any of the enzymes (e.g., multizymes, DHA synthases, C20 elongases, delta-4 desaturases, and/or any homologs) identified herein may be modified to generate new and/or improved PUFA biosynthetic pathway enzymes. As is well known to those skilled in the art, in vitro mutagenesis and selection, chemical mutagenesis, "gene shuffling" methods or other means can be employed to obtain mutations of naturally occurring genes. Alternatively, multizymes may be synthesized by domain swapping, wherein a functional domain from any enzyme may be exchanged with or added to a functional domain in an alternate enzyme to thereby result in a novel protein. Methods for Production of Various Omeqa-3 and/or Omeαa-6 Fatty Acids It is expected that introduction of chimeric genes encoding the DHA synthases described herein (i.e., EgDHAsyni , EgDHAsyn2, EaDHAsyni ,
EaDHAsyn2 and EaDHAsyn3 or other mutant enzymes, codon-optimized enzymes or homologs thereof), under the control of the appropriate promoters will result in increased production of DTA, DPAn-6, DPA and/or DHA in the transformed host organism, respectively. As such, the present invention encompasses a method for the direct production of PUFAs comprising exposing a fatty acid substrate (i.e., EPA or DPA) to the DHA synthase enzymes described herein (e.g., EgDHAsyni , EgDHAsyn2, EaDHAsyni , EaDHAsyn2 and EaDHAsyn3), such that the substrate is converted to the desired fatty acid product (i.e., DHA).
More specifically, the present invention concerns a method for transforming a host cell such that the host cell comprises in its genome a recombinant construct of the invention.
Examples of suitable host cells include, but are not limited to, plants and yeast. Preferably, the plant cells are obtained from an oilseed plant such as soybean and the like and yeast cells are obtained from oleaginous yeast such as Yarrowia sp.
Also within the scope of this invention is a method for producing a transformed plant or yeast comprising transforming a plant cell or a yeast cell with any of the polynucleotides of the invention and regenerating a plant from the transformed plant cell or growing the transformed yeast cells. More specifically, it is an object of the present invention to provide a method for the production of DPAn-6 or DHA in a host cell (e.g., plants, oleaginous yeast), wherein the host cell comprises:
(i) an isolated nucleotide molecule encoding a polypeptide having DHA synthase activity, wherein the polypeptide has at least 80% amino acid identity, based on the Clustal V method of alignment, when compared to an amino acid sequence as set forth in SEQ ID NO:12, SEQ ID NO:22, SEQ ID NO:95, SEQ ID NO:96, or SEQ ID NO:97; and, (ii) a source of ARA or EPA; wherein the host cell is grown under conditions such that the polypeptide having DHA synthase activity is expressed and the ARA is converted to DPAn-6 and/or the EPA is converted to DHA, and wherein the DPAn-6 or DHA is optionally recovered. In alternate embodiments, the present invention concerns a method for the production of DTA or DPA in a host cell (e.g., plants, oleaginous yeast), wherein the host cell comprises:
(ii) an isolated nucleotide molecule encoding a polypeptide having C20 elongase activity, wherein the polypeptide has at least 80% amino acid identity, based on the Clustal V method of alignment, when compared to an amino acid sequence as set forth in SEQ ID NO: 12, SEQ ID NO:22, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:202, SEQ ID NO:204, SEQ ID NO:231 , SEQ ID NO:232, or SEQ ID NO:233; and, (ii) a source of ARA or EPA; wherein the host cell is grown under conditions such that the polypeptide having C20 elongase activity is expressed and the ARA is converted to DTA and/or the EPA is converted to DPA, and wherein the DTA or DPA is optionally recovered.
Additionally, the invention provides a method for the production of DPAn-6 or DHA, wherein the host cell comprises:
(i) an isolated nucleotide molecule encoding a polypeptide having delta-4 desaturase activity, wherein the polypeptide has at least 80% amino acid identity, based on the Clustal V method of alignment, when compared to an amino acid sequence as set forth in SEQ ID NO: 12, SEQ ID NO:22, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ
ID NO:215, SEQ ID NO:221 , SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , SEQ ID NO:246, SEQ ID NO:247, SEQ ID NO:248, SEQ ID NO:249, SEQ ID NO:382, SEQ ID NO:384, SEQ ID NO:386, SEQ ID NO:388, SEQ ID NO:404, SEQ ID NO:406, or SEQ ID NO:408; and,
(ii) a source of DTA or DPA; wherein the host cell is grown under conditions such that the polypeptide having delta-4 desaturase activity is expressed and the DTA is converted to DPAn-6 and/or the DPA is converted to DHA, and wherein the DPAn-6 or DHA is optionally recovered.
The source of the substrate(s) ARA, DTA, EPA or DPA used in any of the methods above may be produced by the host either naturally or transgenically, or may be provided exogenously.
Linking individual domains to form a multizyme could lead to a decrease in intermediate fatty acids. For instance, linking a C20 elongase with a delta-4 desaturase in a multizyme, such as DHA synthase, may lead to a decrease in the intermediate fatty acid DPA during production of DHA. Similarly, linking a delta-9 elongase with a delta-8 desaturase using the EgDHAsyni linker to form a multizyme as described herein may lead to the production of DGLA and ETA with a decrease in EDA and ERA intermediates.
Alternatively, each multizyme gene including DHA synthase and their corresponding enzyme products described herein can be used indirectly for the production of various omega-6 and omega-3 PUFAs, including e.g., DTA, DPAn-6, DGLA, ETA, ARA, EPA, DPA and/or DHA (FIG. 1 ; see U.S. Patent 7,238,482). Indirect production of omega-3/omega-6 PUFAs occurs wherein the fatty acid substrate is converted indirectly into the desired fatty acid product, via means of an intermediate step(s) or pathway intermediate(s). Thus, it is contemplated that the DHA synthases described herein (i.e., EgDHAsyni , EgDHAsyn2, EaDHAsyni ,
EaDHAsyn2 and EaDHAsyn3, or other mutant enzymes, codon-optimized enzymes or homologs thereof) may be expressed in conjunction with additional genes encoding enzymes of the PUFA biosynthetic pathway (e.g., delta-6 desaturases, C18/20 elongases, delta-17 desaturases, delta-8 desaturases, delta-15 desaturases, delta-9 desaturases, delta-12 desaturases, C14/16 elongases, Ci6/iβ elongases, delta-9 elongases, delta-5 desaturases, delta-4 desaturases, C20/22 elongases, DHA synthases) to result in higher levels of production of longer-chain omega-3/omega-6 fatty acids (e.g., ARA, DTA, DPAn-6, EPA, DPA and/or DHA).
The specific genes included within a particular expression cassette will depend on the host cell (and its PUFA profile and/or desaturase/elongase profile), the availability of substrate and the desired end product(s).
At times, it may be desirable to minimize by-product fatty acids. The relative abundance of by-product fatty acids could be decreased by linking individual pathway enzymes together with a linker to form a multizyme. For instance, the presence of sciadonic acid (SCI) and/or juniperonic acid (JUP) [commonly found in the seed lipids of gymnosperms (Wolff et al., Lipids 35(1): 1-22 (2000)), such as those in the Pinaceae family (pine)] might be considered by-product fatty acids of a delta-6 desaturase/delta-6 elongase pathway or delta-9-elongase/delta-8 desaturase pathway. Although these fatty acids are considered to have various health-enhancing properties themselves (Nakane et al., Biol. Pharm. Bull. 23: 758- 761 (2000)), their presence as by-product fatty acids in an engineered PUFA pathway, such as in an oilseed crop, may not be desirable depending on the application. Linking a delta-9 elongase together with a delta-8 desaturase using a linker to form a multizyme (DGLA and/or ETA synthase), for example, could result in increased flux through these steps leading to reduced availability of the EDA/ERA intermediate fatty acids to delta-5 desaturase, and thus reduced concentrations of SCI and JUP. Occasionally, a delta-6 elongase may elongate fatty acids other than the intended fatty acid. For instance, delta-6 elongases generally convert GLA to DGLA but some delta-6 elongases may also convert unintended substrates such as LA or ALA to EDA or ETrA, respectively. In a delta-6 desaturase/delta-6 elongase pathway, EDA and ETrA would be considered "by-product fatty acids". Addition of a delta-8 desaturase to a delta-6 desaturase/delta-6 elongase pathway may provide a means to convert the "by-product fatty acids" EDA and ETrA back into the "intermediate fatty acids" DGLA and ETA, respectively.
In alternative embodiments, it may be useful to disrupt a host organism's native DHA synthase, C20 elongase, or delta-4 desaturase, based on the complete sequences described herein, the complement of those complete sequences, substantial portions of those sequences, codon-optimized desaturases derived therefrom, and those sequences that are substantially homologous thereto. Plant Expression Systems. Cassettes and Vectors, and Transformation
In one embodiment, this invention concerns a recombinant construct comprising any one of the isolated polynucleotides of the invention operably linked to at least one regulatory sequence suitable for expression in a host cell such as a plant. A promoter is a DNA sequence that directs cellular machinery of a plant to produce RNA from the contiguous coding sequence downstream (31) of the promoter. The promoter region influences the rate, developmental stage, and cell type in which the RNA transcript of the gene is made. The RNA transcript is processed to produce mRNA which serves as a template for translation of the RNA sequence into the amino acid sequence of the encoded polypeptide. The 5' non- translated leader sequence is a region of the mRNA upstream of the protein coding region that may play a role in initiation and translation of the mRNA. The 3' transcription termination/polyadenylation signal is a non-translated region downstream of the protein coding region that functions in the plant cell to cause termination of the RNA transcript and the addition of polyadenylate nucleotides to the 31 end of the RNA.
The origin of the promoter chosen to drive expression of the multizyme coding sequence is not important as long as it has sufficient transcriptional activity to accomplish the invention by expressing translatable mRNA for the desired nucleic acid fragments in the desired host tissue at the right time. Either heterologous or non-heterologous (i.e., endogenous) promoters can be used to practice the invention. For example, suitable promoters in plants include, but are not limited to: the alpha prime subunit of beta conglycinin promoter, the Kunitz trypsin inhibitor 3 promoter, the annexin promoter, the glycinin Gy1 promoter, the beta subunit of beta conglycinin promoter, the P34/Gly Bd m 3OK promoter, the albumin promoter, the Leg A1 promoter and the Leg A2 promoter.
The annexin, or P34, promoter is described in PCT Publication No. WO 2004/071178 (published August 26, 2004). The level of activity of the annexin promoter is comparable to that of many known strong promoters, such as: (1) the CaMV 35S promoter (Atanassova et al., Plant MoI. Biol. 37:275-285 (1998); Battraw and Hall, Plant MoI. Biol. 15:527-538 (1990); Holtorf et al., Plant MoI. Biol.
29:637-646 (1995); Jefferson et al., EMBO J. 6:3901-3907 (1987); Wilmink et al., Plant MoI. Biol. 28:949-955 (1995)); (2) the Arabidopsis oleosin promoters (Plant et al., Plant MoI. Biol. 25:193-205 (1994); Li, Texas A&M University Ph.D. dissertation, pp. 107-128 (1997)); (3) the Arabidopsis ubiquitin extension protein promoters (CaIMs et al., J Biol. Chem. 265(21):12486-93 (1990)); (4) a tomato ubiquitin gene promoter (Rollfinke et al., Gene. 211(2):267-76 (1998)); (5) a soybean heat shock protein promoter (Schoffl et al., MoI Gen Genet. 217(2-3):246-53 (1989)); and, (6) a maize H3 histone gene promoter (Atanassova et al., Plant MoI Biol. 37(2):275-85 (1989)).
Another useful feature of the annexin promoter is its expression profile in developing seeds. The annexin promoter is most active in developing seeds at early stages (before 10 days after pollination) and is largely quiescent in later stages. The expression profile of the annexin promoter is different from that of many seed-specific promoters, e.g., seed storage protein promoters, which often provide highest activity in later stages of development (Chen et al., Dev. Genet. 10:112-122 (1989); Ellerstrom et al., Plant MoI. Biol. 32:1019-1027 (1996); Keddie et al., Plant MoI. Biol. 24:327-340 (1994); Plant et al., (supra); Li, (supra)). The annexin promoter has a more conventional expression profile but remains distinct from other known seed specific promoters. Thus, the annexin promoter will be a very attractive candidate when overexpression, or suppression, of a gene in embryos is desired at an early developing stage. For example, it may be desirable to overexpress a gene regulating early embryo development or a gene involved in the metabolism prior to seed maturation.
Following identification of an appropriate promoter suitable for expression of a specific DHA synthase coding sequence, the promoter is then operably linked in a sense orientation using conventional means well known to those skilled in the art. Standard recombinant DNA and molecular cloning techniques used herein are well known in the art and are described more fully in Sambrook, J. et al., In
Molecular Cloning: A Laboratory Manual; 2nd ed.; Cold Spring Harbor Laboratory Press: Cold Spring Harbor, New York, 1989 (hereinafter "Sambrook et al., 1989" ) or Ausubel, F. M., Brent, R., Kingston, R. E., Moore, D. D., Seidman, J. G., Smith, J. A. and Struhl, K., Eds.; In Current Protocols in Molecular Biology, John Wiley and Sons: New York, 1990 (hereinafter "Ausubel et al., 1990"). For example, a fusion gene can be constructed by linking at least two DNA fragments in frame so as not to introduce a stop codon (in-frame fusion). The resulting fusion gene will be such that each DNA fragment encodes for at least one independent and separable enzymatic activity.
Once the recombinant construct has been made, it may then be introduced into a plant cell of choice by methods well known to those of ordinary skill in the art (e.g., transfection, transformation and electroporation). Oilseed plant cells are the preferred plant cells. The transformed plant cell is then cultured and regenerated under suitable conditions permitting expression of the long-chain PUFA which is then optionally recovered and purified.
The recombinant constructs of the invention may be introduced into one plant cell; or, alternatively, each construct may be introduced into separate plant cells.
Expression in a plant cell may be accomplished in a transient or stable fashion as is described above.
The desired long-chain PUFAs can be expressed in seed. Also within the scope of this invention are seeds or plant parts obtained from such transformed plants.
Plant parts include differentiated and undifferentiated tissues including, but not limited to the following: roots, stems, shoots, leaves, pollen, seeds, tumor tissue and various forms of cells and culture (e.g., single cells, protoplasts, embryos and callus tissue). The plant tissue may be in plant or in a plant organ, tissue or cell culture.
The term "plant organ" refers to plant tissue or a group of tissues that constitute a morphologically and functionally distinct part of a plant. The term "genome" refers to the following: (1) the entire complement of genetic material (genes and non-coding sequences) that is present in each cell of an organism, or virus or organelle; and/or (2) a complete set of chromosomes inherited as a (haploid) unit from one parent.
Thus, this invention also concerns a method for transforming a cell, comprising transforming a cell with the recombinant construct of the invention and selecting those cells transformed with the recombinant constructs described in the claims.
Also of interest is a method for producing a transformed plant comprising transforming a plant cell with the polynucleotides of the instant invention and regenerating a plant from the transformed plant cell.
Methods for transforming dicots (primarily by use of Agrobacterium tumefaciens) and obtaining transgenic plants have been published, among others, for: cotton (U.S. Patent No. 5,004,863; U.S. Patent No. 5,159,135); soybean (U.S. Patent No. 5,569,834; U.S. Patent No. 5,416,011); Brassica (U.S. Patent No. 5,463,174); peanut (Cheng et al. Plant Cell Rep. 15:653-657 (1996); McKently et al. Plant Cell Rep. 14:699-703 (1995)); papaya (Ling, K. et al. Bio/technology 9:752-758 (1991)); and pea (Grant et al. Plant Cell Rep. 15:254-258 (1995)). For a review of other commonly used methods of plant transformation see Newell, CA. (MoI. Biotechnol. 16:53-65 (2000)). One of these methods of transformation uses Agrobacterium rhizogenes (Tepfler, M. and Casse-Delbart, F. Microbiol. Sci. 4:24-28 (1987)). Transformation of soybeans using direct delivery of DNA has been published using PEG fusion (PCT Publication No. WO 92/17598), electroporation (Chowrira, G.M. et al., MoI. Biotechnol. 3:17-23 (1995); Christou, P. et al., Proc. Natl. Acad. Sci. U.S.A. 84:3962-3966 (1987)), microinjection and particle bombardement (McCabe, D.E. et. al., Bio/Technology 6:923 (1988); Christou et al., Plant Physiol. 87:671-674 (1988)).
There are a variety of methods for the regeneration of plants from plant tissue. The particular method of regeneration will depend on the starting plant tissue and the particular plant species to be regenerated. The regeneration, development and cultivation of plants from single plant protoplast transformants or from various transformed explants is well known in the art (Weissbach and Weissbach, In: Methods for Plant Molecular Biology, (Eds.), Academic: San Diego, CA (1988)). This regeneration and growth process typically includes the steps of selection of transformed cells and culturing those individualized cells through the usual stages of embryonic development through the rooted plantlet stage.
Transgenic embryos and seeds are similarly regenerated. The resulting transgenic rooted shoots are thereafter planted in an appropriate plant growth medium such as soil. Preferably, the regenerated plants are self-pollinated to provide homozygous transgenic plants. Otherwise, pollen obtained from the regenerated plants is crossed to seed-grown plants of agronomically important lines. Conversely, pollen from plants of these important lines is used to pollinate regenerated plants. A transgenic plant of the present invention containing a desired polypeptide is cultivated using methods well known to one skilled in the art.
In addition to the above discussed procedures, practitioners are familiar with the standard resource materials which describe specific conditions and procedures for: the construction, manipulation and isolation of macromolecules (e.g., DNA molecules, plasmids, etc.); the generation of recombinant DNA fragments and recombinant expression constructs; and, the screening and isolating of clones. See, for example: Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor: NY (1989); Maliga et al., Methods in Plant Molecular Biology, Cold Spring Harbor: NY (1995); Birren et al., Genome Analysis: Detecting Genes, Vol.1 , Cold Spring Harbor: NY (1998); Birren et al., Genome Analysis: Analyzing DNA1 Vol.2, Cold Spring Harbor: NY (1998); Plant Molecular Biology: A Laboratory Manual, eds. Clark, Springer: NY (1997).
Examples of oilseed plants include, but are not limited to: soybean, Brassica species, sunflower, maize, cotton, flax and safflower.
Examples of PUFAs having at least twenty carbon atoms and four or more carbon-carbon double bonds include, but are not limited to, omega-3 fatty acids such as EPA, DPA, and DHA. Seeds obtained from such plants are also within the scope of this invention as well as oil obtained from such seeds.
Thus, the present invention also concerns a method for altering the fatty acid profile of an oilseed plant comprising: a) transforming an oilseed plant cell with the recombinant construct of claim of the invention; and b) regenerating a plant from the transformed oilseed plant cell step (a), wherein the plant has an altered fatty acid profile. Microbial Expression Systems, Cassettes and Vectors The DHA synthase genes and gene products described herein (i.e.,
EgDHAsyni , EgDHAsyn2, EaDHAsyni , EaDHAsyn2 and EaDHAsyn3, or other mutant enzymes, codon-optimized enzymes or homologs thereof) may also be produced in heterologous microbial host cells, particularly in the cells of oleaginous yeasts (e.g., Yarrowia lipolytica). Microbial expression systems and expression vectors containing regulatory sequences that direct high level expression of foreign proteins are well known to those skilled in the art. Any of these could be used to construct chimeric genes for production of any of the gene products of the instant sequences. These chimeric genes could then be introduced into appropriate microorganisms via transformation to provide high-level expression of the encoded enzymes.
Vectors useful for the transformation of suitable microbial host cells are well known in the art. The specific choice of sequences present in the construct is dependent upon the desired expression products (supra), the nature of the host cell and the proposed means of separating transformed cells versus non-transformed cells. Typically, however, the vector contains at least one expression cassette, a selectable marker and sequences allowing autonomous replication or chromosomal integration. Suitable expression cassettes comprise a region 5' of the gene that controls transcriptional initiation (e.g., a promoter), the gene coding sequence, and a region 31 of the DNA fragment that controls transcriptional termination (i.e., a terminator). It is most preferred when both control regions are derived from genes from the transformed microbial host cell, although it is to be understood that such control regions need not be derived from the genes native to the specific species chosen as a production host.
Initiation control regions or promoters which are useful to drive expression of the instant multizymes, such as DHA synthase or individual domain ORFs, in the desired microbial host cell are numerous and familiar to those skilled in the art. Virtually any promoter capable of directing expression of these genes in the selected host cell is suitable for the present invention. Expression in a microbial host cell can be accomplished in a transient or stable fashion. Transient expression can be accomplished by inducing the activity of a regulatable promoter operably linked to the gene of interest. Stable expression can be achieved by the use of a constitutive promoter operably linked to the gene of interest. As an example, when the host cell is yeast, transcriptional and translational regions functional in yeast cells are provided, particularly from the host species (e.g., see U.S. Patent 7,238,482 and PCT Publication No. WO 2006/052870 for preferred transcriptional initiation regulatory regions for use in Yarrowia lipolytica). Any one of a number of regulatory sequences can be used, depending upon whether constitutive or induced transcription is desired, the efficiency of the promoter in expressing the ORF of interest, the ease of construction and the like.
Nucleotide sequences surrounding the translational initiation codon 'ATG' have been found to affect expression in yeast cells. If the desired polypeptide is poorly expressed in yeast, the nucleotide sequences of exogenous genes can be modified to include an efficient yeast translation initiation sequence to obtain optimal gene expression. For expression in yeast, this can be done by site-directed mutagenesis of an inefficiently expressed gene by fusing it in-frame to an endogenous yeast gene, preferably a highly expressed gene. Alternatively, one can determine the consensus translation initiation sequence in the host and engineer this sequence into heterologous genes for their optimal expression in the host of interest.
The termination region can be derived from the 3' region of the gene from which the initiation region was obtained or from a different gene. A large number of termination regions are known and function satisfactorily in a variety of hosts (when utilized both in the same and different genera and species from where they were derived). The termination region usually is selected more as a matter of convenience rather than because of any particular property. Termination control regions may also be derived from various genes native to the preferred hosts. In alternate embodiments, the 3'-region can also be synthetic, as one of skill in the art can utilize available information to design and synthesize a 3'-region sequence that functions as a transcription terminator. Optionally, a termination site may be unnecessary; however, it is most preferred if included. As one of skill in the art is aware, merely inserting a gene into a cloning vector does not ensure that it will be successfully expressed at the level needed. In response to the need for a high expression rate, many specialized expression vectors have been created by manipulating a number of different genetic elements that control aspects of transcription, translation, protein stability, oxygen limitation and secretion from the microbial host cell. More specifically, some of the molecular features that have been manipulated to control gene expression include: the nature of the relevant transcriptional promoter and terminator sequences; the number of copies of the cloned gene; whether the gene is plasmid-borne or integrated into the genome of the host cell; the final cellular location of the synthesized foreign protein; the efficiency of translation and correct folding of the protein in the host organism; the intrinsic stability of the mRNA and protein of the cloned gene within the host cell; and the codon usage within the cloned gene, such that its frequency approaches the frequency of preferred codon usage of the host cell. Each type of modification is encompassed in the present invention, as means to further optimize expression of the DHA synthases described herein. Transformation Of Microbial Host Cells
Once a cassette that is suitable for expression in an appropriate host cell has been obtained (e.g., a chimeric gene comprising a promoter, ORF and terminator), it is placed in a plasmid vector capable of autonomous replication in a host cell, or is directly integrated into the genome of the host cell. Integration of expression cassettes can occur randomly within the host genome or can be targeted through the use of constructs containing regions of homology with the host genome sufficient to target recombination within the host locus. Where constructs are targeted to an endogenous locus, all or some of the transcriptional and translational regulatory regions can be provided by the endogenous locus.
Where two or more genes are expressed from separate replicating vectors, it is desirable that each vector has a different means of selection and should lack homology to the other construct(s) to maintain stable expression and prevent reassortment of elements among constructs. Judicious choice of regulatory regions, selection means and method of propagation of the introduced construct(s) can be experimentally determined so that all introduced genes are expressed at the necessary levels to provide for synthesis of the desired products. Constructs comprising the gene(s) of interest may be introduced into a microbial host cell by any standard technique. These techniques include transformation (e.g., lithium acetate transformation [Methods in Enzymology, 194:186-187 (1991)]), protoplast fusion, biolistic impact, electroporation, microinjection, or any other method that introduces the gene(s) of interest into the host cell.
For convenience, a host cell that has been manipulated by any method to take up a DNA sequence (e.g., an expression cassette) will be referred to as "transformed" or "recombinant" herein. The transformed host will have at least one copy of the expression construct and may have two or more, depending upon whether the expression cassette is integrated into the genome or is present on an extrachromosomal element having multiple copy numbers.
The transformed host cell can be identified by various selection techniques, as described in U.S. Patents 7,238,482 and 7,259,255 and PCT Publication No. WO 2006/052870. Following transformation, substrates suitable for the instant DHA synthases
(and, optionally other PUFA enzymes that are co-expressed within the host cell) may be produced by the host either naturally or transgenically, or may be provided exogenously. Preferred Microbial Hosts For Recombinant Expression
Microbial host cells for expression of the instant genes and nucleic acid fragments may include hosts that grow on a variety of feedstocks, including simple or complex carbohydrates, fatty acids, organic acids, oils and alcohols, and/or hydrocarbons over a wide range of temperature and pH values. Based on the needs of the Applicants' Assignee, the genes described in the instant invention will be expressed in an oleaginous yeast (and in particular Yarrowia lipolytica); however, it is contemplated that because transcription, translation and the protein biosynthetic apparatus are highly conserved, any bacteria, yeast, algae, euglenoid and/or fungus will be a suitable microbial host for expression of the present nucleic acid fragments.
Preferred microbial hosts, however, are oleaginous organisms, such as oleaginous yeasts. These organisms are naturally capable of oil synthesis and accumulation, wherein the oil can comprise greater than about 25% of the cellular dry weight, more preferably greater than about 30% of the cellular dry weight, and most preferably greater than about 40% of the cellular dry weight. Genera typically identified as oleaginous yeast include, but are not limited to: Yarrowia, Candida, Rhodotorula, Rhodosporidium, Cryptococcus, Trichosporon and Lipomyces. More specifically, illustrative oil-synthesizing yeasts include: Rhodosporidium toruloides, Lipomyces starkeyii, L. lipoferus, Candida revkaufi, C. pulcherrima, C. tropicalis, C. utilis, Trichosporon pullans, T. cutaneum, Rhodotorula glutinus, R. graminis, and Yarrowia lipolytica (formerly classified as Candida lipolytica).
Most preferred is the oleaginous yeast Yarrowia lipolytica; and, in a further embodiment, most preferred are the Y. lipolytica strains designated as ATCC #20362, ATCC #8862, ATCC #18944, ATCC #76982 and/or LGAM S(7)1 (Papanikolaou S., and Aggelis G., Bioresour. Technol. 82(1):43-9 (2002)).
Historically, various strains of Y. lipolytica have been used for the manufacture and production of: isocitrate lyase; lipases; polyhydroxyalkanoates; citric acid; erythritol; 2-oxoglutaric acid; gamma-decalactone; gamma-dodecalatone; and pyruvic acid. Specific teachings applicable for transformation of oleaginous yeasts (i.e., Yarrowia lipolytica) include U.S. Patent 4,880,741 and U.S. Patent 5,071 ,764 and Chen, D. C. et al. (Appl. Microbiol. Biotechnol., 48(2):232-235 (1997)). Specific teachings applicable for engineering ARA, EPA and DHA production in Y. lipolytica are provided in U.S. Patent Application No. 11/264784 (PCT Publication No. WO 2006/055322), U.S. Patent Application No. 11/265761 (PCT Publication No. WO 2006/052870) and U.S. Patent Application No. 11/264737 (PCT Publication No. WO 2006/052871), respectively.
Detailed means for the synthesis and transformation of expression vectors comprising C20 elongases and delta-4 desaturases in oleaginous yeast (i.e., Yarrowia lipolytics) are provided in PCT Publication No. WO 2006/052871. The preferred method of expressing genes in Yarrowia lipolytica is by integration of linear DNA into the genome of the host. Integration into multiple locations within the genome can be particularly useful when high level expression of genes are desired [e.g., in the Ura3 locus (GenBank Accession No. AJ306421), the Leu2 gene locus (GenBank Accession No. AF260230), the Lys5 gene locus (GenBank Accession No. M34929), the Aco2 gene locus (GenBank Accession No. AJ001300), the Pox3 gene locus (Pox3: GenBank Accession No. XP_503244; or, Aco3: GenBank Accession No. AJ001301), the delta-12 desaturase gene locus (U.S. Patent 7,214,491), the Lip1 gene locus (GenBank Accession No. Z50020), the Lip2 gene locus (GenBank Accession No. AJ012632), and/or the Pex10 gene locus (GenBank Accession No. CAG81606)].
Termination regions useful in the disclosure herein for Yarrowia expression vectors include, for example: ~100 bp of the 3' region of the Yarrowia lipolytica extracellular protease (XPR; GenBank Accession No. M17741); the acyl-CoA oxidase (Aco3: GenBank Accession No. AJ001301 and No. CAA04661 ; Pox3: GenBank Accession No. XP_503244) terminators; the Pex20 (GenBank Accession No. AF054613) terminator; the Pex16 (GenBank Accession No. U75433) terminator; the Lip1 (GenBank Accession No. Z50020) terminator; the Lip2 (GenBank Accession No. AJ012632) terminator; and the 3-oxoacyl-CoA thiolase (OCT; GenBank Accession No. X69988) terminator.
Preferred selection methods for use in Yarrowia lipolytica are resistance to kanamycin, hygromycin, and the amino glycoside G418, as well as the ability to grow on media lacking uracil, leucine, lysine, tryptophan or histidine. In alternate embodiments, 5-fluoroorotic acid (5-fluorouracil-6-carboxylic acid monohydrate; "5- FOA") is used for selection of yeast Ura~ mutants. The compound is toxic to yeast cells that possess a functioning URA3 gene encoding orotidine 5'-monophosphate decarboxylase (OMP decarboxylase); thus, based on this toxicity, 5-FOA is especially useful for the selection and identification of Ura' mutant yeast strains (Bartel, P. L. and Fields, S., Yeast 2-Hybrid System, Oxford University: New York, v. 7, pp 109-147, 1997; see also PCT Publication No. WO 2006/052870 for 5-FOA use in Yarrowia). More specifically, one can first knockout the native Ura3 gene to produce a strain having a Ura- phenotype, wherein selection occurs based on 5- FOA resistance. Then, a cluster of multiple chimeric genes and a new Ura3 gene can be integrated into a different locus of the Yarrowia genome to produce a new strain having a Ura+ phenotype. Subsequent integration produces a new Ura3- strain (again identified using 5-FOA selection), when the introduced Ura3 gene is knocked out. Thus, the Ura3 gene (in combination with 5-FOA selection) can be used as a selection marker in multiple rounds of transformation, thereby readily permitting genetic modifications to be integrated into the Yarrowia genome in a facile manner.
Other preferred microbial hosts include oleaginous bacteria, algae, euglenoids, and other fungi; and, within this broad group of microbial hosts, of particular interest are microorganisms that synthesize omega-3/omega-6 fatty acids (or those that can be genetically engineered for this purpose [e.g., other yeast such as Saccharomyces cerevisiae]). Thus, for example, transformation of Mortierella alpina (which is commercially used for production of ARA) with any of the instant DHA synthase genes under the control of inducible or regulated promoters could yield a transformant organism capable of synthesizing increased quantities of PUFAs. The method of transformation of M. alpina is described by Mackenzie et al. (Appl. Environ. Microbiol., 66:4655 (2000)). Similarly, methods for transformation of Thraustochytriales microorganisms are disclosed in U.S. 7,001 ,772. Substrate feeding may be required.
Irrespective of the host selected for expression of the multizymes (e.g. DHA synthases), multiple transformants must be screened in order to obtain a strain displaying the desired expression level and pattern. Such screening may be accomplished by Southern analysis of DNA blots (Southern, J. MoI. Biol., 98:503 (1975)), Northern analysis of mRNA expression (Kroczek, J. Chromatogr. Biomed. Appl., 618(1-2):133-145 (1993)), Western and/or Elisa analyses of protein expression, phenotypic analysis, or GC analysis of the PUFA products. Of course, since naturally produced PUFAs in oleaginous yeast are limited to 18:2 fatty acids (i.e., LA), and less commonly, 18:3 fatty acids (i.e., ALA), in more preferred embodiments of the present invention, the oleaginous yeast will be genetically engineered to express multiple enzymes necessary for long-chain PUFA biosynthesis (thereby enabling production of e.g., ARA, EPA, DPA and DHA), in addition to the multizymes described herein.
In particularly preferred embodiments, the at least one additional recombinant DNA construct encode a DGLA synthase, such that the multizyme has both delta-9 elongase activity and delta-8 desaturase activity. In some embodiments the delta-9 elongase can be isolated or derived from lsochrysis galbana (GenBank Accession No. AF390174; lgD9e or lgD9eS) or the delta-9 elongase can be isolated or derived from Euglena gracilis or Euglena anabaena. For example, see the DGLA synthases set forth as SEQ ID NO:441 , SEQ ID NO:447, SEQ ID NO:454, SEQ ID NO:461 , SEQ ID NO:464 and SEQ ID NO:471. Metabolic Engineering of Omeqa-3 and/or Omeqa-6 Fatty Acid Biosynthesis in Microbes
Methods for manipulating biochemical pathways are well known to those skilled in the art; and, it is expected that numerous manipulations will be possible to maximize omega-3 and/or omega-6 fatty acid biosynthesis in oleaginous yeasts, and particularly, in Yarrowia lipolytica. This manipulation may require metabolic engineering directly within the PUFA biosynthetic pathway or additional manipulation of pathways that contribute carbon to the PUFA biosynthetic pathway. Methods useful for up-regulating desirable biochemical pathways and down- regulating undesirable biochemical pathways are well known to those skilled in the art.
For example, biochemical pathways competing with the omega-3 and/or omega-6 fatty acid biosynthetic pathways for energy or carbon, or native PUFA biosynthetic pathway enzymes that interfere with production of a particular PUFA end-product, may be eliminated by gene disruption or down-regulated by other means (e.g., antisense mRNA).
Detailed discussion of manipulations within the PUFA biosynthetic pathway as a means to increase ARA, EPA or DHA (and associated techniques thereof) are presented in PCT Publication Nos. WO 2006/055322 [U.S. Patent Publication No. 2006-0094092-A1], PCT Publication No. WO 2006/052870 [U.S. Patent Publication No. 2006-0115881-A1] and PCT Publication No. WO 2006/052871 [U.S. Patent Publication No. 2006-0110806-A1], respectively, as are desirable manipulations in the TAG biosynthetic pathway and the TAG degradation pathway (and associated techniques thereof)-
Within the context of the present invention, it may be useful to modulate the expression of the fatty acid biosynthetic pathway by any one of the strategies described above. For example, the present invention provides methods whereby genes encoding key enzymes in the PUFA biosynthetic pathway are introduced into oleaginous yeasts for the production of omega-3 and/or omega-6 fatty acids. It will be particularly useful to express the instant DHA synthase genes in oleaginous yeasts that do not naturally possess omega-3 and/or omega-6 fatty acid biosynthetic pathways and coordinate the expression of these genes, to maximize production of preferred PUFA products using various means for metabolic engineering of the host organism.
Microbial Fermentation Processes for PUFA Production
The transformed host cell is grown under conditions that optimize expression of chimeric genes and produce the greatest and most economical yield of desired PUFAs. In general, media conditions that may be optimized include the type and amount of carbon source, the type and amount of nitrogen source, the carbon-to- nitrogen ratio, the amount of different mineral ions, the oxygen level, growth temperature, pH, length of the biomass production phase, length of the oil accumulation phase and the time and method of cell harvest. Yarrowia lipolytics are generally grown in complex media (e.g., yeast extract-peptone-dextrose broth (YPD)) or a defined minimal media that lacks a component necessary for growth and thereby forces selection of the desired expression cassettes (e.g., Yeast Nitrogen Base (DIFCO Laboratories, Detroit, Ml)).
Fermentation media in the present invention must contain a suitable carbon source. Suitable carbon sources are taught in U.S. Patent 7,238,482. Although it is contemplated that the source of carbon utilized in the present invention may encompass a wide variety of carbon-containing sources, preferred carbon sources are sugars, glycerol, and/or fatty acids. Most preferred is glucose and/or fatty acids containing between 10-22 carbons. Nitrogen may be supplied from an inorganic (e.g., (NH4)2SO4) or organic
(e.g., urea or glutamate) source. In addition to appropriate carbon and nitrogen sources, the fermentation media must also contain suitable minerals, salts, cofactors, buffers, vitamins and other components known to those skilled in the art suitable for the growth of the oleaginous host and promotion of the enzymatic pathways necessary for PUFA production. Particular attention is given to several metal ions (e.g., Fe+2, Cu+2, Mn+2, Co+2, Zn+2, Mg+2) that promote synthesis of lipids and PUFAs (Nakahara, T. et al., Ind. Appl. Single Cell Oils, D. J. Kyle and R. Colin, eds. pp 61-97 (1992)). Preferred growth media in the present invention are common commercially prepared media, such as Yeast Nitrogen Base (DIFCO Laboratories, Detroit, Ml). Other defined or synthetic growth media may also be used and the appropriate medium for growth of the transformant host cells will be known by one skilled in the art of microbiology or fermentation science. A suitable pH range for the fermentation is typically between about pH 4.0 to pH 8.0, wherein pH 5.5 to pH 7.5 is preferred as the range for the initial growth conditions. The fermentation may be conducted under aerobic or anaerobic conditions, wherein microaerobic conditions are preferred.
Typically, accumulation of high levels of PUFAs in oleaginous yeast cells requires a two-stage process, since the metabolic state must be "balanced" between growth and synthesis/storage of fats. Thus, most preferably, a two-stage fermentation process is necessary for the production of PUFAs in oleaginous yeast (e.g., Yarrowia lipolytica). This approach is described in U.S. Patent 7,238,482, as are various suitable fermentation process designs (i.e., batch, fed-batch and continuous) and considerations during growth. Purification and Processing of PUFA Oils
PUFAs may be found in the host microorganisms and plants as free fatty acids or in esterified forms such as acylglycerols, phospholipids, sulfolipids, or glycolipids, and may be extracted from the host cells through a variety of means well-known in the art. One review of extraction techniques, quality analysis, and acceptability standards for yeast lipids is that of Z. Jacobs (Critical Reviews in Biotechnology, 12(5/6) :463-491 (1992)). A brief review of downstream processing is also available by A. Singh and O. Ward (Adv. Appl. Microbiol., 45:271-312 (1997)). In general, means for the purification of PUFAs may include extraction (e.g., U.S. Patent 6,797,303 and U.S. Patent 5,648,564) with organic solvents, sonication, supercritical fluid extraction (e.g., using carbon dioxide), saponification and physical means such as presses, or combinations thereof. One is referred to the teachings of U.S. Patent 7,238,482 for additional details. Methods of isolating seed oils are well known in the art: (Young et al., Processing of Fats and Oils, In The Lipid Handbook, Gunstone et al., eds., Chapter 5 pp 253-257; Chapman & Hall: London (1994)). For example, soybean oil is produced using a series of steps involving the extraction and purification of an edible oil product from the oil-bearing seed. Soybean oils and soybean byproducts are produced using the generalized steps shown in Table 7.
TABLE 7 Generalized Steps for Soybean Oil and Byproduct Production
Figure imgf000089_0001
More specifically, soybean seeds are cleaned, tempered, dehulled, and flaked, thereby increasing the efficiency of oil extraction. Oil extraction is usually accomplished by solvent (e.g., hexane) extraction but can also be achieved by a combination of physical pressure and/or solvent extraction. The resulting oil is called crude oil. The crude oil may be degummed by hydrating phospholipids and other polar and neutral lipid complexes that facilitate their separation from the nonhydrating, triglyceride fraction (soybean oil). The resulting lecithin gums may be further processed to make commercially important lecithin products used in a variety of food and industrial products as emulsification and release (i.e., antisticking) agents. Degummed oil may be further refined for the removal of impurities (primarily free fatty acids, pigments and residual gums). Refining is accomplished by the addition of a caustic agent that reacts with free fatty acid to form soap and hydrates phosphatides and proteins in the crude oil. Water is used to wash out traces of soap formed during refining. The soapstock byproduct may be used directly in animal feeds or acidulated to recover the free fatty acids. Color is removed through adsorption with a bleaching earth that removes most of the chlorophyll and carotenoid compounds. The refined oil can be hydrogenated, thereby resulting in fats with various melting properties and textures. Winterization (fractionation) may be used to remove stearine from the hydrogenated oil through crystallization under carefully controlled cooling conditions. Deodorization (principally via steam distillation under vacuum) is the last step and is designed to remove compounds which impart odor or flavor to the oil. Other valuable byproducts such as tocopherols and sterols may be removed during the deodorization process. Deodorized distillate containing these byproducts may be sold for production of natural vitamin E and other high-value pharmaceutical products. Refined, bleached, (hydrogenated, fractionated) and deodorized oils and fats may be packaged and sold directly or further processed into more specialized products. A more detailed reference to soybean seed processing, soybean oil production, and byproduct utilization can be found in Erickson, Practical Handbook of Soybean Processing and Utilization, The American Oil Chemists' Society and United Soybean Board (1995). Soybean oil is liquid at room temperature because it is relatively low in saturated fatty acids when compared with oils such as coconut, palm, palm kernel, and cocoa butter.
Plant and microbial oils containing PUFAs that have been refined and/or purified can be hydrogenated, thereby resulting in fats with various melting properties and textures. Many processed fats (including spreads, confectionary fats, hard butters, margarines, baking shortenings, etc.) require varying degrees of solidity at room temperature and can only be produced through alteration of the source oil's physical properties. This is most commonly achieved through catalytic hydrogenation.
Hydrogenation is a chemical reaction in which hydrogen is added to the unsaturated fatty acid double bonds with the aid of a catalyst such as nickel. For example, high oleic soybean oil contains unsaturated oleic, linoleic, and linolenic fatty acids, and each of these can be hydrogenated. Hydrogenation has two primary effects. First, the oxidative stability of the oil is increased as a result of the reduction of the unsaturated fatty acid content. Second, the physical properties of the oil are changed because the fatty acid modifications increase the melting point resulting in a semi-liquid or solid fat at room temperature.
There are many variables which affect the hydrogenation reaction, which in turn alter the composition of the final product. Operating conditions including pressure, temperature, catalyst type and concentration, agitation, and reactor design are among the more important parameters that can be controlled. Selective hydrogenation conditions can be used to hydrogenate the more unsaturated fatty acids in preference to the less unsaturated ones. Very light or brush hydrogenation is often employed to increase stability of liquid oils. Further hydrogenation converts a liquid oil to a physically solid fat. The degree of hydrogenation depends on the desired performance and melting characteristics designed for the particular end product. Liquid shortenings (used in the manufacture of baking products, solid fats and shortenings used for commercial frying and roasting operations) and base stocks for margarine manufacture are among the myriad of possible oil and fat products achieved through hydrogenation. A more detailed description of hydrogenation and hydrogenated products can be found in Patterson, H. B. W., Hydrogenation of Fats and Oils: Theory and Practice. The American Oil Chemists' Society (1994).
Hydrogenated oils have become somewhat controversial due to the presence of frans-fatty acid isomers that result from the hydrogenation process. Ingestion of large amounts of frans-isomers has been linked with detrimental health effects including increased ratios of low density to high density lipoproteins in the blood plasma and increased risk of coronary heart disease. PUFA-Containing Oils for Use in Foodstuffs. Health Food Products, Pharmaceuticals And Animal Feeds
The market place currently supports a large variety of food and feed products, incorporating omega-3 and/or omega-6 fatty acids (particularly e.g., ALA, GLA, ARA, EPA, DPA and DHA). It is contemplated that the PUFA-comprising plant/seed oils, altered seeds, and microbial biomass and/or oils of the invention will function in food and feed products to impart the health benefits of current formulations. Compared to other vegetable oils, the oils of the invention are believed to function similarly to other oils in food applications from a physical standpoint (for example, partially hydrogenated oils such as soybean oil are widely used as ingredients for soft spreads, margarine and shortenings for baking and frying).
Plant/seed oils, altered seeds, and microbial biomass and/or oils containing omega-3 and/or omega-6 fatty acids will be suitable for use in a variety of food and feed products including, but not limited to: food analogs, meat products, cereal products, baked foods, snack foods and dairy products.
Additionally, the present plant/seed oils, altered seeds, and microbial biomass and/or oils may be used in formulations to impart health benefit in medical foods including medical nutritionals, dietary supplements, infant formula as well as pharmaceutical products. One of skill in the art of food processing and food formulation will understand how the amount and composition of the plant and microbial oils may be added to the food or feed product. Such an amount will be referred to herein as an "effective" amount and will depend on the food or feed product, the diet that the product is intended to supplement or the medical condition that the medical food or medical nutritional is intended to correct or treat.
Food analogs can be made using processes well known to those skilled in the art. There can be mentioned meat analogs, cheese analogs, milk analogs and the like. Meat analogs made from soybeans contain soy protein or tofu and other ingredients mixed together to simulate various kinds of meats. These meat alternatives are sold as frozen, canned or dried foods. Usually, they can be used the same way as the foods they replace. Meat alternatives made from soybeans are excellent sources of protein, iron and B vitamins. Examples of meat analogs include, but are not limited to: ham analogs, sausage analogs, bacon analogs, and the like.
Food analogs can be classified as imitation or substitutes depending on their functional and compositional characteristics. For example, an imitation cheese need only resemble the cheese it is designed to replace. However, a product can generally be called a substitute cheese only if it is nutritionally equivalent to the cheese it is replacing and meets the minimum compositional requirements for that cheese. Thus, substitute cheese will often have higher protein levels than imitation cheeses and be fortified with vitamins and minerals. Milk analogs or nondairy food products include, but are not limited to, imitation milks and nondairy frozen desserts (e.g., those made from soybeans and/or soy protein products).
Meat products encompass a broad variety of products. In the United States "meat" includes "red meats" produced from cattle, hogs and sheep. In addition to the red meats there are poultry items which include chickens, turkeys, geese, guineas, ducks and the fish and shellfish. There is a wide assortment of seasoned and processed meat products: fresh, cured and fried, and cured and cooked. Sausages and hot dogs are examples of processed meat products. Thus, the term "meat products" as used herein includes, but is not limited to, processed meat products.
A cereal food product is a food product derived from the processing of a cereal grain. A cereal grain includes any plant from the grass family that yields an edible grain (seed). The most popular grains are barley, corn, millet, oats, quinoa, rice, rye, sorghum, triticale, wheat and wild rice. Examples of a cereal food product include, but are not limited to: whole grain, crushed grain, grits, flour, bran, germ, breakfast cereals, extruded foods, pastas, and the like.
A baked goods product comprises any of the cereal food products mentioned above and has been baked or processed in a manner comparable to baking (i.e., to dry or harden by subjecting to heat). Examples of a baked good product include, but are not limited to: bread, cakes, doughnuts, bars, pastas, bread crumbs, baked snacks, mini-biscuits, mini-crackers, mini-cookies, and mini-pretzels. As was mentioned above, oils of the invention can be used as an ingredient. A snack food product comprises any of the above or below described food products.
A fried food product comprises any of the above or below described food products that has been fried. A health food product is any food product that imparts a health benefit. Many oilseed-derived food products may be considered as health foods.
A beverage can be in a liquid or in a dry powdered form.
For example, there can be mentioned non-carbonated drinks such as fruit juices, fresh, frozen, canned or concentrate; flavored or plain milk drinks, etc. Adult and infant nutritional formulas are well known in the art and commercially available
(e.g., Similac®, Ensure®, Jevity®, and Alimentum® from Ross Products Division,
Abbott Laboratories).
Infant formulas are liquids or reconstituted powders fed to infants and young children. "Infant formula" is defined herein as an enteral nutritional product which can be substituted for human breast milk in feeding infants and typically is composed of a desired percentage of fat mixed with desired percentages of carbohydrates and proteins in an aqueous solution (e.g., see U.S. Patent No.
4,670,285). Based on the worldwide composition studies, as well as levels specified by expert groups, average human breast milk typically contains about 0.20% to 0.40% of total fatty acids (assuming about 50% of calories from fat); and, generally the ratio of DHA to ARA would range from about 1 :1 to 1 :2 (see, e.g., formulations of Enfamil LIPIL™ (Mead Johnson & Company) and Similac Advance™ (Ross Products Division, Abbott Laboratories)). Infant formulas have a special role to play in the diets of infants because they are often the only source of nutrients for infants; and, although breast-feeding is still the best nourishment for infants, infant formula is a close enough second that babies not only survive but thrive.
A dairy product is a product derived from milk. A milk analog or nondairy product is derived from a source other than milk, for example, soymilk as was discussed above. These products include, but are not limited to: whole milk, skim milk, fermented milk products such as yogurt or sour milk, cream, butter, condensed milk, dehydrated milk, coffee whitener, coffee creamer, ice cream, cheese, etc.
Additional food products into which the PUFA-containing oils of the invention could be included are, for example, chewing gums, confections and frostings, gelatins and puddings, hard and soft candies, jams and jellies, white granulated sugar, sugar substitutes, sweet sauces, toppings and syrups, and dry-blended powder mixes.
A health food product is any food product that imparts a health benefit and includes functional foods, medical foods, medical nutritionals and dietary supplements. Additionally, the plant/seed oils, altered seeds and microbial oils of the invention may be used in standard pharmaceutical compositions (e.g., the long- chain PUFA containing oils could readily be incorporated into the any of the above mentioned food products, to thereby produce a functional or medical food). More concentrated formulations comprising PUFAs include capsules, powders, tablets, softgels, gelcaps, liquid concentrates and emulsions which can be used as a dietary supplement in humans or animals other than humans.
Animal feeds are generically defined herein as products intended for use as feed or for mixing in feed for animals other than humans. The plant/seed oils, altered seeds and microbial oils of the invention can be used as an ingredient in various animal feeds.
More specifically, although not limited therein, it is expected that the oils of the invention can be used within pet food products, ruminant and poultry food products and aquacultural food products. Pet food products are those products intended to be fed to a pet (e.g., dog, cat, bird, reptile, and rodent). These products can include the cereal and health food products above, as well as meat and meat byproducts, soy protein products, grass and hay products (e.g., alfalfa, timothy, oat or brome grass, vegetables). Ruminant and poultry food products are those wherein the product is intended to be fed to an animal (e.g., turkeys, chickens, cattle, and swine). As with the pet foods above, these products can include cereal and health food products, soy protein products, meat and meat byproducts, and grass and hay products as listed above. Aquacultural food products (or "aquafeeds") are those products intended to be used in aquafarming, i.e., which concerns the propagation, cultivation, or farming of aquatic organisms and/or animals in fresh or marine waters.
EXAMPLES
The present invention is further defined in the following Examples, in which parts and percentages are by weight and degrees are Celsius, unless otherwise stated. It should be understood that these Examples, while indicating preferred embodiments of the invention, are given by way of illustration only. From the above discussion and these Examples, one skilled in the art can ascertain the essential characteristics of this invention, and without departing from the spirit and scope thereof, can make various changes and modifications of the invention to adapt it to various usages and conditions. Thus, various modifications of the invention in addition to those shown and described herein will be apparent to those skilled in the art from the foregoing description. Such modifications are also intended to fall within the scope of the appended claims. The meaning of abbreviations is as follows: "sec" means second(s), "min" means minute(s), "h" means hour(s), "d" means day(s), "μL" means microliter(s), "ml_" means milliliter(s), "L" means liter(s), "μM" means micromolar, "mM" means millimolar, "M" means molar, "mmol" means millimole(s), "μmole" mean micromole(s), "g" means gram(s), "μg" means microgram(s), "ng" means nanogram(s), "U" means unit(s), "bp" means base pair(s) and "kB" means kilobase(s). GENERAL METHODS: Nomenclature for Expression Cassettes:
The structure of an expression cassette will be represented by a simple notation system of "X::Y::Z", wherein X describes the promoter fragment, Y describes the gene coding region fragment, and Z describes the terminator fragment, which are all operably linked to one another. Transformation and Cultivation of Yarrowia lipolvtica:
Yarrowia lipolytica strains with ATCC Accession Nos. #20362, #76982 and #90812 were purchased from the American Type Culture Collection (Rockville, MD). Yarrowia lipolytica strains were typically grown at 28-30 0C in several media, according to the recipes shown below. Agar plates were prepared as required by addition of 20 g/L agar to each liquid media, according to standard methodology.
YPD agar medium (per liter): 10 g of yeast extract [Difco], 20 g of Bacto peptone [Difco]; and 20 g of glucose.
Basic Minimal Media (MM) (per liter): 20 g glucose; 1.7 g yeast nitrogen base without amino acids; 1.0 g proline; and pH 6.1 (not adjusted). Minimal Media + Uracil (MM+uracil or MMU) (per liter): Prepare MM media as above and add 0.1 g uracil and 0.1 g uridine.
Minimal Media + Uracil + Sulfonylurea (MMU+SU) (per liter): Prepare MMU media as above and add 280 mg sulfonylurea.
Minimal Media + Leucine (MM+leucine or MMLeu) (per liter): Prepare MM media as above and add 0.1 g leucine. Minimal Media + Leucine + Uracil (MMLeuUra) (per liter): Prepare MM media as above and add 0.1 g leucine, 0.1 g uracil and 0.1 g uridine.
Minimal Media + Leucine + Lysine (MMLeuLvs) (per liter): Prepare MM media as above and add 0.1 g lysine, 0.1 g leucine.
Minimal Media + 5-Fluoroorotic Acid (MM + 5-FOA) (per liter): 2O g glucose, 6.7 g Yeast Nitrogen base, 75 mg uracil, 75 mg uridine and appropriate amount of FOA (Zymo Research Corp., Orange, CA), based on FOA activity testing against a range of concentrations from 100 mg/L to
1000 mg/L (since variation occurs within each batch received from the supplier).
High Glucose Media (HGM) (per liter): 80 glucose, 2.58 g KH2PO4 and 5.36 g K2HPO4, pH 7.5 (do not need to adjust).
Transformation of Yarrowia lipolytics was performed according to the method of Chen, D. C. et al. (Appl. Microbiol. Biotechnol. 48(2):232-235 (1997)), unless otherwise noted. Briefly, Yarrowia was streaked onto a YPD plate and grown at 30 0C for approximately 18 h. Several large loopfuls of cells were scraped from the plate and resuspended in 1 mL of transformation buffer, comprising: 2.25 mL of 50% PEG, average MW 3350; 0.125 mL of 2 M lithium acetate, pH 6.0; 0.125 mL of 2 M DTT; and (optionally) 50 μg sheared salmon sperm DNA. Then, approximately 500 ng of linearized plasmid DNA were incubated in 100 μL of resuspended cells and maintained at 390C for 1 hr with vortex mixing at 15 min intervals. The cells were plated onto selection media plates, which were maintained at 30 0C for 2 to 3 days.
Fatty Acid Analysis of Yarrowia lipolytica:
For fatty acid analysis, cells were collected by centrifugation and lipids were extracted as described in Bligh, E. G. & Dyer, W. J. (Can. J. Biochem. Physiol.
37:911 -917 (1959)). Fatty acid methyl esters were prepared by transesterification of the lipid extract with sodium methoxide (Roughan, G. and Nishida I., Arch Biochem Biophys. 276(1 ):38-46 (1990)) and subsequently analyzed with a Hewlett-Packard 6890 GC fitted with a 30 m X 0.25 mm (i.d.) HP-INNOWAX (Hewlett-Packard) column. The oven temperature was from 170 0C (25 min hold) to 185 0C at 3.5 °C/min.
For direct base transesterification, Yarrowia culture (3 ml_) was harvested, washed once in distilled water, and dried under vacuum in a Speed-Vac for 5-10 min. Sodium methoxide (100 μl_ of 1 %) was added to the sample, which was then vortexed and rocked for 20 min. After adding 3 drops of 1 M NaCI and 400 μl_ hexane, the sample was vortexed and spun. The upper layer was removed and analyzed by GC as described above. Construction Of Yarrowia lipolvtica Strain Y4305U3:
Y. lipolytica strain Y4305U3 was used as the host in Examples 52, 53 and 54, infra. The following description is a summary of the construction of strain Y4305U3, derived from Yarrowia lipolytica ATCC #20362. Strain Y4305U3 is capable of producing about 53.2% EPA relative to the total lipids via expression of a delta-9 elongase/ delta-8 desaturase pathway (FIG. 44).
The development of strain Y4305U3 required the construction of strain Y2224 (a FOA resistant mutant from an autonomous mutation of the Ura3 gene of wildtype Yarrowia strain ATCC #20362), strain Y4001 (producing 17% EDA with a Leu- phenotype), strain Y4001 U1 (producing 17% EDA with a Leu- and Ura- phenotype), strain Y4036 (producing 18% DGLA with a Leu- phenotype), strain Y4036U (producing 18% DGLA with a Leu- and Ura- phenotype), strain Y4070 (producing 12% ARA with a Ura- phenotype), strain Y4086 (producing 14% EPA), strain
Y4086U1 (Ura3-)t strain Y4128 (producing 37% EPA), strain Y4128U3 {Ura-), strain Y4217 (producing 42% EPA), strain Y4217U2 (Ura-), strain Y4259 (producing 46.5% EPA) and strain Y4259U2 (Ura-). Generation Of Strain Y2224: Strain Y2224 was isolated in the following manner: Yarrowia lipolytica ATCC #20362 cells from a YPD agar plate were streaked onto a MM plate (75 mg/L each of uracil and uridine, 6.7 g/L YNB with ammonia sulfate, without amino acids, and 20 g/L glucose) containing 250 mg/L 5- FOA (Zymo Research). Plates were incubated at 28 0C, and four of the resulting colonies were patched separately onto MM plates containing 200 mg/mL 5-FOA and MM plates lacking uracil and uridine. This was done to confirm uracil Ura3 auxotrophy.
Generation Of Strain Y4001 To Produce About 17% EDA Of Total Lipids: Strain Y4001 was created via integration of construct pZKLeuN-29E3 (FIG. 45A). This construct, comprising four chimeric genes (i.e., a delta-12 desaturase, a C-iβm elongase, and two delta-9 elongases), was integrated into the Leu2 loci of strain Y2224 to thereby enable production of EDA.
Construct pZKLeuN-29E3 contained the components shown below in Table 8.
TABLE 8 Description of Plasmid pZKLeuN-29E3 (SEQ ID NO:315)
Figure imgf000099_0001
Figure imgf000100_0001
Plasmid pZKLeuN-29E3 was digested with Asc\/Sph\ and then used for transformation of Y. lipolytica strain Y2224 (i.e., ATCC #20362 Ura3-) according to the General Methods. The transformed cells were plated onto MMLeu media plates, and plates were maintained at 30 °C for 2 to 3 days. The colonies were picked and streaked onto MM and MMLeu selection plates. The colonies that could grow on MMLeu plates but not on MM plates were selected as Leu- strains. Single colonies of Leu- strains were used to inoculate liquid MMLeu, and the liquid cultures were shaken at 250 rpm/min for 2 days at 30 C. The cells were collected by centrifugation, and lipids were extracted. Fatty acid methyl esters were prepared by trans-esterification and subsequently analyzed with a Hewlett-Packard 6890 GC. GC analyses showed the presence of EDA in the transformants containing the 4 chimeric genes of pZKLeuN-29E3, but not in the Yarrowia Y2224 control strain. Most of the selected 36 Leu- strains produced about 12 to 16.9% EDA of total lipids. Three strains, designated as strains Y4001 , Y4002, and Y4003, produced about 17.4%, 17%, and 17.5% EDA of total lipids, respectively. Single colonies of Y4001 , Y4002, and Y4003 strains were used to inoculate liquid MMLeu, and the liquid cultures were shaken at 250 rpm/min for 2 days at 30 °C. The cells were collected by centhfugation, resuspended in HGM, and then shaken at 250 rpm/min for 5 days. The cells were collected by centhfugation, and lipids were extracted. Fatty acid methyl esters were prepared by trans-esterification and subsequently analyzed with a Hewlett-Packard 6890 GC. GC analyses showed that the Y4001 , Y4002, and Y4003 strains produced about 24% EDA of total lipids.
Generation Of Strain Y4001U (Leu-, Ura-): Strain Y4001 U was created via temporary expression of the Cre recombinase enzyme in plasmid pY116 (FIG. 45B) within strain Y4001 to produce a Leu- and Ura- phenotype. Construct pY116 contained the following components:
TABLE 9 Description of Plasmid pY116 (SEQ ID NO:323)
Figure imgf000101_0001
Plasmid pY116 was used for transformation of freshly grown Y4001 cells according to the General Methods. The transformed cells were plated onto MMLeuUra plates containing 280 μg/mL sulfonylurea (chlorimuron ethyl, E. I. duPont de Nemours & Co., Inc., Wilmington, DE), and plates were maintained at 30
°C for 3 to 4 days. Four colonies were picked and then used to inoculate 3 ml_ liquid YPD . The liquid cultures were shaken at 250 rpm/min for 1 day at 30 C. The cultures were diluted to 1 :50,000 with liquid MMLeuUra media, and 100 μl_ were plated onto new YPD plates. The plates were maintained at 30 °C for 2 days. Colonies were picked and streaked onto MMLeu and MMLeuUra selection plates. The colonies that could grow on MMLeuUra plates but not on MMLeu plates were selected and analyzed by GC to confirm the presence of C20:2 (EDA). Several strains, each having a Leu- and lira- phenotype, produced about 17% EDA of total lipids and collectively, were designated as Y4001 U. One of these strains was designated as Y4001 U1.
Generation Of Y4036 Strain To Produce About 18% DGLA Of Total Lipids: Construct pKO2UF8289 (FIG. 46A; SEQ ID NO:324) was generated to integrate four chimeric genes (comprising a delta-12 desaturase, one delta-9 elongase, and two mutant delta-8 desaturases) into the delta-12 loci of strain Y4001 U1 , to thereby enable production of DGLA. Construct pKO2UF8289 contained the following components:
TABLE 10 Description of Plasmid pKO2UF8289 (SEQ ID NO:324)
Figure imgf000102_0001
Figure imgf000103_0001
The pKO2UF8289 plasmid was digested with Asc\/Sph\ and then used for transformation of strain Y4001 U1 according to the General Methods. The transformed cells were plated onto MMLeu plates, and plates were maintained at 30 C for 2 to 3 days. The colonies were picked and streaked onto MMLeu selection plates at 30 C for 2 days. These cells were then used to inoculate liquid MMLeu media, and liquid cultures were shaken at 250 rpm/min for 2 days at 30 °C. The cells were collected by centrifugation, and lipids were extracted. Fatty acid methyl esters were prepared by trans-estehfication and subsequently analyzed with a Hewlett-Packard 6890 GC. GC analyses showed the presence of DGLA in the transformants containing the 4 chimeric genes of pKO2UF8289, but not in the parent Y4001 U1 strain. Most of the selected 96 strains produced between 7% and 13% DGLA of total lipids. Six strains, designated as Y4034, Y4035, Y4036, Y4037, Y4038, and Y4039, produced about 15%, 13.8%, 18.2%, 13.1%, 15.6%, and 13.9% DGLA of total lipids, respectively.
Generation Of Strain Y4036U {Leu-, Ura3-): Construct pY116 (FIG. 45B; SEQ ID NO:323) was utilized to temporarily express a Cre recombinase enzyme in strain Y4036. This released the LoxP sandwiched Ura3 gene from the genome. Plasmid pY116 was used to transform strain Y4036 according to the General
Methods. Following transformation, the cells were plated onto MMLeuUra plates, and plates were maintained at 30 C for 2 to 3 days. The individual colonies grown on MMLeuUra plates were picked and streaked into YPD liquid media. Liquid cultures were shaken at 250 rpm/min for 1 day at 30 °C to cure the pY116 plasmid. Cells from the grown cultures were streaked on MMLeuUra plates. After two days at 30 °C, the individual colonies were re-streaked on MMLeuUra, MMU and MMLeu plates. Those colonies that could grow on MMLeuUra, but not on MMU or MMLeu plates, were selected. One strain with Leu- and Ura- phenotypes was designated as Y4036U (Ura-, Leu-). Generation Of Y4069 And Y4070 Strains To Produce About 12% ARA Of
Total Lipids: Construct pZKSL-555R (FIG. 46B; SEQ ID NO:331) was generated to integrate three delta-5 desaturase genes into the Lys loci of strain Y4036U, to thereby enable production of ARA. The pZKSL-555R plasmid contained the following components: TABLE 11
Description of Plasmid pZKSL-555R (SEQ ID NO:331)
Figure imgf000104_0001
Figure imgf000105_0001
The pZKSL-555R plasmid was digested with AscUSphl and then used for transformation of strain Y4036U according to the General Methods. The transformed cells were plated onto MMLeuLys plates, and plates were maintained at 30 °C for 2 to 3 days. Single colonies were then re-streaked onto MMLeuLys plates, and the resulting colonies were used to inoculate liquid MMLeuLys. Liquid cultures were then shaken at 250 rpm/min for 2 days at 30 °C. The cells were collected by centrifugation, and lipids were extracted. Fatty acid methyl esters were prepared by trans-esterification and subsequently analyzed with a Hewlett-Packard 6890 GC.
GC analyses showed the presence of ARA in the transformants containing the 3 chimeric genes of pZKSL-555R, but not in the parent Y4036U strain. Most of the selected 96 strains produced -10% ARA of total lipids. Four strains, designated as Y4068, Y4069, Y4070, and Y4071 , produced about 11.7%, 11.8%, 11.9% and 11.7% ARA of total lipids, respectively. Further analyses showed that the three chimeric genes of pZKSL-555R were not integrated into the Lys5 site in the Y4068, Y4069, Y4070 and Y4071 strains. All strains possessed a Lys+ phenotype.
The final genotype of strain Y4070, with respect to wildtype Yarrowia lipolytica ATCC #20362, was Ura-, unknown 1-, unknown 3-, Leu+, Lys+, GPD::FmD12::Pex20, YAT1 ::FmD12::OCT, YAT1 ::ME3S::Pex16, GPAT::EgD9e::Lip2, EXP1 ::EgD9eS::Lip1 , FBAINm::EgD9eS::Lip2, FBAINm::EgD8M::Pex20, EXP1::EgD8M::Pex16, FBAIN::EgD5::Aco, EXP1 ::EgD5S::Pex20, YAT1 ::RD5S::OCT.
Generation Of Y4086 Strain To Produce About 14% EPA Of Total Lipids: Construct pZP3-Pa777U (FIG. 47A; SEQ ID NO:338) was generated to integrate three delta-17 desaturase genes into the Pox3 loci (GenBank Accession No. AJ001301) of strain Y4070, to thereby enable production of EPA. The pZP3- Pa777U plasmid contained the following components:
TABLE 12 Description of Plasmid pZP3-Pa777U (SEQ ID NO:338)
Figure imgf000106_0001
Figure imgf000107_0001
The pZP3-Pa777U plasmid was digested with Asc\/Sph\ and then used for transformation of strain Y4070 according to the General Methods. The transformed cells were plated onto MM plates, and plates were maintained at 30 C for 2 to 3 days. Single colonies were then re-streaked onto MM plates, and the resulting colonies were used to inoculate liquid MMLeuLys. Liquid cultures were shaken at 250 rpm/min for 2 days at 30 C. The cells were collected by centrifugation, and lipids were extracted. Fatty acid methyl esters were prepared by trans-esterification and subsequently analyzed with a Hewlett-Packard 6890 GC. GC analyses showed the presence of EPA in the transformants containing the 3 chimeric genes of pZP3-Pa777U, but not in the parent Y4070 strain. Most of the selected 96 strains produced 10-13% EPA of total lipids. Two strains, designated as Y4085 and Y4086, produced about 14.2% and 13.8% EPA of total lipids, respectively. The final genotype of strain Y4086, with respect to wildtype Yarrowia lipolytica ATCC #20362, was Ura3+, Leu+, Lys+, unknown 1-, unknown 2-, YALI0F24167g-, GPD::FmD12::Pex20, YAT1 ::FmD12::OCT, YAT1 ::ME3S::Pex16, GPAT::EgD9e::Lip2, EXP1 ::EgD9eS::Lip1 , FBAINm::EgD9eS::Lip2, FBAINm::EgD8M::Pex20, EXP1 ::EgD8M::Pex16, FBAIN::EgD5::Aco, EXP1 ::EgD5S::Pex20, YAT1 ::RD5S::OCT, YAT1 ::PaD17S::Lip1 , EXP1::PaD17::Pex16, FBAINm::PaD17::Aco.
Generation Of Strain Y4086U1 (Ura3-): Strain Y4086U1 was created via temporary expression of the Cre recombinase enzyme in construct pY117 (FIG. 47B; SEQ ID NO:343) within strain Y4086 to produce a Ura- phenotype. This released the LoxP sandwiched Ura3 gene from the genome. The mutated Yarrowia AHAS enzyme in plasmid pY117 conferred SUR, which was used as a positive screening marker.
Plasmid pY117 was derived from plasmid pY116 (described supra, and in U.S. Patent Application No. 11/635258) by inserting the mutant AHAS gene flanked by Pacl-Swal sites into Pacl-Swal digested pY116, thereby replacing the LEU selectable marker with the sulfonylurea marker. Construct pY117 thereby contained the following components:
TABLE 13 Description of Plasmid ρY117 (SEQ ID NO:343)
Figure imgf000108_0001
Figure imgf000109_0001
Plasmid pY117 was used to transform strain Y4086 according to the General Methods. Following transformation, the cells were plated onto MMU+SU (280 μg/mL sulfonylurea; also known as chlorimuron ethyl, E. I. duPont de Nemours & Co., Inc., Wilmington, DE) plates, and plates were maintained at 30 °C for 2 to 3 days. The individual SUR colonies grown on MMU+SU plates were picked and streaked into YPD liquid media, and liquid cultures were shaken at 250 rpm/min for 1 day at 30 C to cure the pY117 plasmid. Cells from the grown cultures were streaked onto MMU plates. After two days at 30 C, the individual colonies were re- streaked onto MM and MMU plates. Those colonies that could grow on MMU, but not on MM plates were selected. Two of these strains with Ura- phenotypes were designated as Y4086U1 and Y4086U2 (Ura-).
Generation Of Y4128 Strain To Produce About 37% EPA Of Total Lipids: Construct pZP2-2988 (FIG. 48A; SEQ ID NO:345) was generated to integrate one delta-12 desaturase gene, two delta-8 desaturase genes, and one delta-9 elongase gene into the Pox2 loci (GenBank Accession No. AJ001300) of strain Y4086U1 , to thereby enable higher level production of EPA. The pZP2-2988 plasmid contained the following components:
TABLE 14 Description of Plasmid pZP2-2988 (SEQ ID NO:345)
Figure imgf000109_0002
Figure imgf000110_0001
The pZP2-2988 plasmid was digested with Asc\/Sph\ and then used for transformation of strain Y4086U1 according to the General Methods. The transformed cells were plated onto MM plates, and plates were maintained at 30 C for 2 to 3 days. Single colonies were re-streaked onto MM plates, and the resulting colonies were used to inoculate liquid MMLeuLys. Liquid cultures were shaken at 250 rpm/min for 2 days at 30 C. The cells were collected by centrifugation, resuspended in HGM, and then shaken at 250 rpm/min for 5 days. The cells were collected by centrifugation, and lipids were extracted. Fatty acid methyl esters were prepared by trans-esterification and subsequently analyzed with a Hewlett-Packard 6890 GC.
GC analyses showed that most of the selected 96 strains produced 12-15.6% EPA of total lipids. Two strains, designated as Y4128 and Y4129, produced about 37.6% and 16.3% EPA of total lipids, respectively.
The final genotype of strain Y4128, with respect to wildtype Yarrowia lipolytics ATCC #20362, was: YALI0F24167g-, PexW-, unknown 1-, unknown 2-, GPD::FmD12::Pex20, YAT1 ::FmD12::0CT, GPM/FBAIN::FmD12S::OCT, YAT1::ME3S::Pex16, GPAT::EgD9e::Lip2, EXP1 ::EgD9eS::Lip1, FBAINm::EgD9eS::Lip2, FBA::EgD9eS::Pex20, FBAINm::EgD8M::Pex20, EXP1 ::EgD8M::Pex16, GPDIN::EgD8M::Lip1 , YAT1 ::EgD8M::Aco, FBAIN::EgD5::Aco, EXP1 ::EgD5S::Pex20, YAT1 ::RD5S::OCT, YAT1 ::PaD17S::Lip1 , EXP1 ::PaD17::Pex16, FBAINm::PaD17::Aco. Yarrowia lipolytica strain Y4128 was deposited with the American Type Culture Collection on August 23, 2007 and bears the designation ATCC PTA-8614.
Generation Of Y4128U Strains: In order to disrupt the Ura3 gene in strain Y4128, construct pZKUE3S (FIG. 48B; SEQ ID NO:351) was created to integrate a EXP1 ::ME3S::Pex20 chimeric gene into the Ura3 gene of strain Y4128. Plasmid pZKUE3S contained the following components: TABLE 15
Description of Plasmid pZKUE3S(SEQ ID NO:351)
Figure imgf000111_0001
Figure imgf000112_0001
Plasmid pZKUE3S was digested with Sph\/Pacl and then used to transform strain Y4128 according to the General Methods. Following transformation, cells were plated onto MM + 5-FOA selection plates, and plates were maintained at 30 C for 2 to 3 days.
A total of 24 transformants grown on MM + 5-FOA selection plates were picked and re-streaked onto fresh MM + 5-FOA plates. The cells were stripped from the plates, and lipids were extracted. Fatty acid methyl esters were prepared by trans-esterification and subsequently analyzed with a Hewlett-Packard 6890 GC. GC analyses showed the presence of between 10-15% EPA in all of the transformants with pZKUE3S from plates. The strains designated as Y4128U1 , Y4128U2, Y4128U3, Y4128U4, Y4128U5, and Y4128U6 produced 12.9%, 14.4%, 15.2%, 15.4%, 14%, and 10.9% EPA, respectively (collectively, Y4128U).
The discrepancy in the % EPA quantified in Y4128 (37.6%) versus Y4128U (average 13.8%) is based on differing growth conditions. Specifically, the former culture was analyzed following two days of growth in liquid culture, while the latter culture was analyzed after growth on an agar plate. The Applicants have observed a 2-3 fold increase in % EPA, when comparing results from agar plates to those in liquid culture. Thus, although results are not directly comparable, both Y4128 and Y4128U strains demonstrate high production of EPA.
Generation Of Y4217 Strain To Produce About 42% EPA Of Total Lipids: Construct pZKL2-5U89GC (FIG. 49A; SEQ ID NO:348) was generated to integrate one delta-9 elongase gene, one delta-8 desaturase gene, one delta-5 desaturase gene, and one Yarrowia lipolytics diacylglycerol cholinephosphotransferase gene (CPT1) into the Lip2 loci (GenBank Accession No. AJ012632) of strain Y4128U3 to thereby enable higher level production of EPA. The pZKL2-5U89GC plasmid contained the following components: TABLE 16 Description of Plasmid pZKL2-5U89GC (SEQ ID NO:348)
Figure imgf000113_0001
EgD5S: codon-optimized delta-5 desaturase (SEQ ID NO:332), derived from Euglena gracilis (Patent Publication US 2007-0292924-A1);
Aco: Aco terminator sequence from Yarrowia Aco gene (GenBank Accession No. AJ001300)
The pZKL2-5U89GC plasmid was digested with Asc\/Sph\ and then used for transformation of strain Y4128U3 according to the General Methods. The transformed cells were plated onto MM plates, and plates were maintained at 30 °C for 3 to 4 days. Single colonies were re-streaked onto MM plates, and the resulting colonies were then used to inoculate liquid MM. Liquid cultures were shaken at 250 rpm/min for 2 days at 30 C. The cells were collected by centrifugation, resuspended in HGM, and then shaken at 250 rpm/min for 5 days. The cells were collected by centrifugation, and lipids were extracted. Fatty acid methyl esters were prepared by trans-esterification and subsequently analyzed with a Hewlett-Packard 6890 GC.
GC analyses showed that most of the selected 96 strains produced 32-39.9% EPA of total lipids. Six strains, designated as Y4215, Y4216, Y4217, Y4218, Y4219 and Y4220, produced about 41.1%, 41.8%, 41.7%, 41.1 %, 41% and 41.1% EPA of total lipids, respectively. The final genotype of each strain, with respect to wild type Yarrowia lipolytica ATCC #20362, was: YALI0C18711g-, Pex10-, YALI0F24167g-, unknown 1-, unknown 3-, GPD::FmD12::Pex20, YAT1 ::FmD12::OCT, GPM/FBAIN::FmD12S::OCT, YAT1 ::ME3S::Pex16, EXP1 ::ME3S::Pex20, GPAT::EgD9e::Lip2, EXP1 ::EgD9eS::Lip1 , FBAINm::EgD9eS::Lip2, FBA::EgD9eS::Pex20, GPD::EgD9eS::Lip2, FBAINm::EgD8M::Pex20, FBAIN::EgD8M::Lip1 , EXP1 ::EgD8M::Pex16, GPDIN::EgD8M::Lip1 , YAT1 ::EgD8M::Aco, FBAIN::EgD5::Aco, EXP1 ::EgD5S::Pex20, YAT1 ::EgD5S::Aco, YAT1 ::RD5S::OCT, YAT1 ::PaD17S::Lip1 , EXP1 ::PaD17::Pex16, FBAINm::PaD17::Aco, YAT1 ::YICPT1 ::ACO. Generation Of Strain Y4217U2 (Ura3-): In order to disrupt the Ura3 gene in strain Y4217, construct pZKUE3S (FIG. 48B; SEQ ID NO:351) was used to integrate a chimeric EXP1 ::ME3S::Pex20 gene into the Ura3 gene of strain Y4217. Following transformation, cells were plated onto MM + 5-FOA selection plates, and plates were maintained at 30 °C for 3 to 4 days.
A total of 6 transformants grown on MM + 5-FOA plates were picked and re- streaked onto MM plates and MM + 5-FOA plates. All 6 strains had a Ura- phenotype (i.e., cells could grow on MM + 5-FOA plates, but not on MM plates). The cells were scraped from the MM + 5-FOA plates, and lipids were extracted. Fatty acid methyl esters were prepared by trans-esterification and subsequently analyzed with a Hewlett-Packard 6890 GC.
GC analyses showed the presence of 18.7% to 28.6% EPA in all of the transformants with pZKUE3S grown on MM + 5-FOA plates. Two strains, designated as strains Y4217U1 and Y4217U2, produced 22.5% and 28.6% EPA, respectively. Generation Of Y4259 Strain To Produce About 46.5% EPA Of Total Lipids:
Construct pZKL1-2SP98C (FIG. 49B; SEQ ID NO:352) was generated to integrate one delta-9 elongase gene, one delta-8 desaturase gene, one delta-12 desaturase gene, and one Yarrowia lipolytica diacylglycerol cholinephosphotransferase gene (CPT1) into the Lip1 loci (GenBank Accession No. Z50020) of strain Y4217U2, to thereby enable higher level production of EPA. The pZKL1-2SP98C plasmid contained the following components:
TABLE 17 Description of Plasmid pZKL1-2SP98C (SEQ ID NO:352)
Figure imgf000115_0001
Figure imgf000116_0001
The pZKL1-2SP98C plasmid was digested with Asc\/Sph\ and then used for transformation of strain Y4217U2 according to the General Methods. The transformed cells were plated onto MM plates, and plates were maintained at 30 °C for 3 to 4 days. Single colonies were re-streaked onto MM plates, and the resulting colonies were then used to inoculate liquid MM. The liquid cultures were then shaken at 250 rpm/min for 2 days at 30 C. The cells were collected by centrifugation, resuspended in HGM, and then shaken at 250 rpm/min for 5 days. The cells were collected by centrifugation, and lipids were extracted. Fatty acid methyl esters were prepared by trans-esterification and subsequently analyzed with a Hewlett-Packard 6890 GC.
GC analyses showed that most of the selected 72 strains produced 40-44% EPA of total lipids. Six strains, designated as Y4259, Y4260, Y4261 , Y4262, Y4263, and Y4264, produced about 46.5%, 44.5%, 44.5%, 44.8%, 44.5%, and 44.3% EPA of total lipids, respectively.
The final genotype of strain Y4259 with respect to wild type Yarrowia lipolytics ATCC #20362 was: YALI0C18711g-, Pex10-, YALI0F24167g-, unknown 1- , unknown 3-, unknown 8-, GPD::FmD12::Pex20, YAT1 ::FmD12::OCT, GPM/FBAIN::FmD12S::OCT, EXP1 ::FmD12S::Aco, YAT1 ::ME3S::Pex16, EXP1 ::ME3S::Pex20 (2 copies), GPAT::EgD9e::Lip2, EXP1 ::EgD9eS::Lip1 , FBAINm::EgD9eS::l_ip2, FBA::EgD9eS::Pex20, GPD::EgD9eS::Lip2, YAT1 ::EgD9eS::Lip2, FBAINm::EgD8M::Pex20, FBAIN::EgD8M::Lip1 (2 copies), EXP1 ::EgD8M::Pex16, GPDIN::EgD8M::Lip1 , YAT1 ::EgD8M::Aco, FBAIN::EgD5::Aco, EXP1 ::EgD5S::Pex20, YAT1 ::EgD5S::Aco, YAT1 ::RD5S::OCT, YAT1 ::PaD17S::Lip1 , EXP1 ::PaD17::Pex16, FBAINm::PaD17::Aco, YAT1 ::YICPT1 ::ACO, GPD::YICPT1 ::ACO.
Generation Of Strain Y4259U2 (Ura3-)\ In order to disrupt the Ura3 gene in Y4259 strain, construct pZKUM (FIG. 5OA; SEQ ID NO:353) was used to integrate a Ura3 mutant gene into the Ura3 gene of strain Y4259. The plasmid pZKUM contained the following components:
TABLE 18 Description of Plasmid pZKUM (SEQ ID NO:353)
Figure imgf000117_0001
Figure imgf000118_0001
A total of 3 transformants grown on MM + 5-FOA plates were picked and re- streaked onto MM plates and MM + 5-FOA plates. All 3 strains had a Ura- phenotype (i.e., cells could grow on MM + 5-FOA plates, but not on MM plates). The cells were scraped from the MM + 5-FOA plates, and lipids were extracted. Fatty acid methyl esters were prepared by trans-esterification and subsequently analyzed with a Hewlett-Packard 6890 GC.
GC analyses showed the presence of 31.4%, 31% and 31.3% EPA in the #1 , #2 and #3 transformants with pZKUM grown on MM + 5-FOA plates. These three strains were designated as strains Y4259U1 , Y4259U2 and Y4259U3, respectively (collectively, Y4259U).
Generation Of Y4305 Strain To Produce About 53% EPA Of Total Lipids: Construct pZKD2-5U89A2 (FIG. 5OB; SEQ ID NO:355) was generated to integrate one delta-9 elongase gene, one delta-5 desaturase gene, one delta-8 desaturase gene, and one delta-12 desaturase gene into the diacylglycerol acyltransferase (DGAT2) loci of strain Y4259U2, to thereby enable higher level production of EPA. The pZKD2-5U89A2 plasmid contained the following components:
TABLE 19 Description of Plasmid pZKD2-5U89A2 (SEQ ID NO:355)
Figure imgf000118_0002
Figure imgf000119_0001
The pZKD2-5U89A2 plasmid was digested with Asc\/Sph\ and then used for transformation of strain Y4259U2 according to the General Methods. The transformed cells were plated onto MM plates, and plates were maintained at 3O C for 3 to 4 days. Single colonies were re-streaked onto MM plates, and the resulting colonies were used to inoculate liquid MM. Liquid cultures were shaken at 250 rpm/min for 2 days at 30 C. The cells were collected by centrifugation, resuspended in HGM, and then shaken at 250 rpm/min for 5 days. The cells were collected by centrifugation, and lipids were extracted. Fatty acid methyl esters were prepared by trans-esterification and subsequently analyzed with a Hewlett-Packard 6890 GC.
GC analyses showed that most of the selected 96 strains produced 40-46% EPA of total lipids. Four strains, designated as Y4305, Y4306, Y4307 and Y4308, produced about 53.2%, 46.4%, 46.8 %, and 47.8% EPA of total lipids, respectively. The complete lipid profile of Y4305 is as follows: 16:0 (2.8%), 16:1 (0.7%), 18:0 (1.3%), 18:1 (4.9%), 18:2 (17.6%), ALA (2.3%), EDA (3.4%), DGLA (2.0%), ARA (0.6%), ETA (1.7%), and EPA (53.2%). The total lipid % dry cell weight (dew) was 27.5.
The final genotype of strain Y4305 with respect to wild type Yarrowia lipolytica ATCC #20362 was SCP2- (YALI0E01298g), YALI0C18711g-, Pex10-, YALI0F24167g-, unknown 1-, unknown 3-, unknown 8-, GPD::FmD12::Pex20, YAT1 ::FmD12::OCT, GPM/FBAIN::FmD12S::OCT, EXP1 ::FmD12S::Aco, YAT1 ::FmD12S::Lip2, YAT1 ::ME3S::Pex16, EXP1 ::ME3S::Pex20 (3 copies), GPAT::EgD9e::Lip2, EXP1 ::EgD9eS::Lip1 , FBAINm::EgD9eS::Lip2, FBA::EgD9eS::Pex20, GPD::EgD9eS::Lip2, YAT1 ::EgD9eS::Lip2, YAT1 ::E389D9eS::OCT, FBAINm::EgD8M::Pex20, FBAIN::EgD8M::Lip1 (2 copies), EXP1 ::EgD8M::Pex16, GPDIN::EgD8M::ϋp1 , YAT1 ::EgD8M::Aco, FBAIN::EgD5::Aco, EXP1 ::EgD5S::Pex20, YAT1 ::EgD5S::Aco, EXP1 ::EgD5S::ACO, YAT1 ::RD5S::OCT, YAT1 ::PaD17S::Lip1 , EXP1 ::PaD17::Pex16, FBAINm::PaD17::Aco, YAT1 ::YICPT1 ::ACO, GPD::YICPT1 ::ACO.
Generation Of Strain Y4305U3 (Ura3-): In order to disrupt the Ura3 gene in strain Y4305, construct pZKUM (FIG. 5OA; SEQ ID NO:353) was used to integrate a Ura3 mutant gene into the Ura3 gene of strain Y4305. A total of 8 transformants grown on MM + 5-FOA plates were picked and re-streaked onto MM plates and MM + 5-FOA plates, separately. All 8 strains had a Ura- phenotype (i.e., cells could grow on MM + 5-FOA plates, but not on MM plates). The cells were scraped from the MM + 5-FOA plates, and lipids were extracted. Fatty acid methyl esters were prepared by trans-esterification and subsequently analyzed with a Hewlett-Packard
6890 GC.
GC analyses showed the presence of 37.6%, 37.3% and 36.5% EPA in pZKUM transformants #1 , #6 and #7 grown on MM + 5-FOA plates. These three strains were designated as strains Y4305U1 , Y4305U2 and Y4305U3, respectively
(collectively, Y4305U).
Construction Of Yarrowia lipolvtica Strain Y4184U
Y. lipolytica strain Y4184U was used as the host in Examples 32, 33, 34 and
51 , infra. Strain Y4184U was derived from Y. lipolytica ATCC #20362, and is capable of producing about 31% EPA relative to the total lipids via expression of a delta-9 elongase/ delta-8 desaturase pathway.
The development of strain Y4184U required the construction of strain Y2224, strain Y4001 , strain Y4001 U, strain Y4036, strain Y4036U and strain Y4069 (supra).
Further development of strain Y4184U (diagrammed in FIG. 51A) required construction of strain Y4084 (producing 14% EPA), strain Y4084U1 (Um-), strain
Y4127 (producing 18% EPA), strain Y4127U2 (Um-), strain Y4158 (producing 25%
EPA), strain Y4158U1 (Um-), and strain 4184 (producing 30.7% EPA). Although the details concerning transformation and selection of the EPA-producing strains developed after strain Y4069 will not be elaborated herein, the methodology used for isolation of strain Y4084, strain Y4084U1 , strain Y4127, strain Y4127U2, strain
Y4158, strain Y4158U1 , strain Y4184, and strain Y4184U was as described during construction of strain Y4305, supra.
Briefly, construct pZP3-Pa777U (FIG. 47A; SEQ ID NO:338) was utilized to integrate three delta-17 desaturase genes into the Pox3 loci (GenBank Accession No. AJ001301 ) of strain Y4069, thereby resulting in isolation of strain Y4084
(producing 14% EPA). Strain Y4084U1 was created via temporary expression of the Cre recombinase enzyme in construct pY117 (FIG. 47B; SEQ ID NO:343) within strain Y4084 to produce a Ura- phenotype. Construct pZP2-2988 (FIG. 48A; SEQ
ID NO:345) was then utilized to integrate one delta-12 desaturase gene, two delta-8 desaturase genes, and one delta-9 elongase gene into the Pox2 loci (GenBank
Accession No. AJ001300) of strain Y4084U1 , thereby resulting in isolation of strain
Y4127 (producing 18% EPA). Yarrowia lipolytica strain Y4127 was deposited with the American Type Culture Collection on November 29, 2007 and bears the designation ATCC PTA-8802.
Strain Y4127U2 was created by disrupting the Ura3 gene in strain Y4127 via construct pZKUE3S (FIG. 48B; SEQ ID NO:351), comprising a chimeric EXP1 ::ME3S::Pex20 gene targeted for the Ura3 gene. Construct pZKL1-2SP98C (FIG. 49B; SEQ ID NO:352) was utilized to integrate one delta-9 elongase gene, one delta-8 desaturase gene, one delta-12 desaturase gene, and one Yarrowia lipolytica diacylglycerol cholinephosphotransferase gene (CPT1) into the Lip1 loci (GenBank Accession No. Z50020) of strain Y4127U2, thereby resulting in isolation of strain Y4158 (producing 25% EPA). A Ura- derivative (i.e., strain Y4158U1) was then created, via transformation with construct pZKUE3S (FIG. 48B; SEQ ID NO:351), comprising a chimeric EXP1 ::ME3S::Pex20 gene targeted for the Ura3 gene. Finally, construct pZKL2-5U89GC (FIG. 49A; SEQ ID NO:348) was utilized to integrate one delta-9 elongase gene, one delta-8 desaturase gene, one delta-5 desaturase gene, and one Yarrowia lipolytica CPT1 into the Lip2 loci (GenBank
Accession No. AJ012632) of Y4158U1 , thereby resulting in isolation of strain Y4184.
The complete lipid profile of strain Y4184 is as follows: 16:0 (3.1%), 16:1 (1.5%), 18:0 (1.8%), 18:1 (8.7%), 18:2 (31.5%), ALA (4.9%), EDA (5.6%), DGLA (2.9%), ARA (0.6%), ETA (2.4%), and EPA (28.9%). The total lipid % dry cell weight (dew) was 23.9.
The final genotype of strain Y4184 with respect to wildtype Yarrowia lipolytica ATCC #20362 was unknown 1-, unknown 2-, unknown 4-, unknown 5-, unknown 6-, unknown 7-, YAT1::ME3S::Pex16, EXP1 ::ME3S::Pex20 (2 copies), GPAT::EgD9e::Lip2, FBAINm::EgD9eS::Lip2, EXP1 ::EgD9eS::Lip1 , FBA::EgD9eS::Pex20, YAT1 ::EgD9eS::Lip2, GPD::EgD9eS::Lip2, GPDIN::EgD8M::Lip1 , YAT1 ::EgD8M::Aco, EXP1 ::EgD8M::Pex16, FBAINm::EgD8M::Pex20, FBAIN::EgD8M::Lip1 (2 copies), GPM/FBAIN::FmD12S::Oct, EXP1 ::FmD12S::Aco, YAT1 ::FmD12::Oct, GPD::FmD12::Pex20, EXP1 ::EgD5S::Pex20, YAT1 ::EgD5S::Aco, YAT1 ::Rd5S::Oct, FBAIN::EgD5::Aco, FBAINm::PaD17::Aco, EXP1 ::PaD17::Pex16, YAT1 ::PaD17S::Lip1 , YAT1 ::YICPT1 ::Aco, GPD::YICPT1 ::Aco. In order to disrupt the Ura3 gene in strain Y4184, construct pZKUM (FIG. 5OA; SEQ ID NO:353) was used to integrate a Ura3 mutant gene into the Ura3 gene of strain Y4184.
A total of 11 transformants grown on MM + 5-FOA plates were picked and re- streaked onto MM plates and MM + 5-FOA plates, separately. All 11 strains had a lira- phenotype (i.e., cells could grow on MM + 5-FOA plates, but not on MM plates). The cells were scraped from the MM + 5-FOA plates; lipids were extracted; and fatty acid methyl esters were prepared by trans-esterification and subsequently analyzed with a Hewlett-Packard 6890 GC. GC analyses showed the presence of 11.2%, 10.6%, and 15.5% EPA in the
#7, #8 and #10 transformants with pZKUM grown on MM + 5-FOA plates. These three strains were designated as strains Y4184U1 , Y4184U2 and Y4184U4, respectively (collectively, Y4184U).
EXAMPLE 1 Euplena gracilis Growth Conditions, Lipid Profile and mRNA Isolation
Euglena gracilis was obtained from Dr. Richard Triemer's lab at Michigan State University (East Lansing, Ml). From 10 mL of actively growing culture, a 1 mL aliquot was transferred into 250 mL of Euglena gracilis (Eg) Medium in a 500 mL glass bottle. Eg medium was made by combining 1 g of sodium acetate, 1 g of beef extract (Cat. No. U 126-01 , Difco Laboratories, Detroit, Ml), 2 g of Bacto® tryptone (0123-17-3, Difco Laboratories), and 2 g of Bacto® yeast extract (Cat. No. 0127-17- 9, Difco Laboratories) in 970 mL of water. After filter sterilizing, 30 mL of soil-water supernatant (Cat. No. 15-3790, Carolina Biological Supply Company, Burlington, NC) were aseptically added to produce the final Eg medium. Euglena gracilis cultures were grown at 23 CC with a 16 h light, 8 h dark cycle for 2 weeks with no agitation.
After 2 weeks, 10 mL of culture were removed for lipid analysis and centrifuged at 1 ,800 x g for 5 min. The pellet was washed once with water and re- centrifuged. The resulting pellet was dried for 5 min under vacuum, resuspended in 100 μL of trimethylsulfonium hydroxide (TMSH), and incubated at room temperature for 15 min with shaking. After this, 0.5 mL of hexane were added, and the vials were incubated for 15 min at room temperature with shaking. Fatty acid methyl esters (5 μL injected from hexane layer) were separated and quantified using a Hewlett-Packard 6890 Gas Chromatograph fitted with an Omegawax 320 fused silica capillary column (Supelco Inc., Cat. No. 24152). The oven temperature was programmed to hold at 220 0C for 2.7 min, increase to 240 0C at 20 0C /min, and then hold for an additional 2.3 min. Carrier gas was supplied by a Whatman hydrogen generator. Retention times were compared to those for methyl esters of standards commercially available (Nu-Chek Prep, Inc. Cat. No. U-99-A), and the resulting chromatogram is shown in FIG. 27.
The remaining 2 week culture (240 mL) was pelleted by centrifugation at 1 ,800 x g for 10 min, washed once with water, and re-centrifuged. Total RNA was extracted from the resulting pellet using the RNA STAT-60™ reagent (TEL-TEST, Inc., Friendswood, TX) and following the manufacturer's protocol provided (use 5 mL of reagent, dissolved RNA in 0.5 mL of water). In this way, 1 mg of total RNA (2 mg/mL) was obtained from the pellet. The mRNA was isolated from 1 mg of total RNA using the mRNA Purification Kit (Amersham Biosciences, Piscataway, NJ) following the manufacturer's protocol provided. In this way, 85 μg of mRNA were obtained.
EXAMPLE 2 Euglena gracilis cDNA Synthesis, Library Construction and Sequencing
A cDNA library was generated using the Cloneminer™ cDNA Library Construction Kit (Cat. No.18249-029, Invitrogen Corporation, Carlsbad, CA) and following the manufacturer's protocol provided (Version B, 25-0608). Using the non- radiolabeling method, cDNA was synthesized from 3.2 μg of mRNA (described above) using the Biotin-atfB2-Oligo(dT) primer. After synthesis of the first and second strand, the atiBλ adapter was added; ligation was performed; and the cDNA was size fractionated using column chromatography. DNA from fractions 7 and 8 (size ranging from ~800-1500 bp) were concentrated, recombined into pDONR™222, and transformed into E. coli ElectroMAX™ DH10B™ T1 Phage- Resistant cells (Invitrogen Corporation). The Euglena gracilis library was named eeg1c. For sequencing, clones first were recovered from archived glycerol cultures grown/frozen in 384-well freezing media plates. Using an automatic QPix colony picker (Genetix), cells were picked and then used to inoculate 96-well deep-well plates containing LB + 50 μg/mL kanamycin. After growing 20 h at 37 0C, cells were pelleted by centrifugation and stored at -20 0C. Plasmids then were isolated on an Eppendorf 5Prime robot, using a modified 96-well format alkaline lysis miniprep method (Eppendorf PerfectPrep). Briefly, a filter and vacuum manifold were used to facilitate removal of cellular debris after acetate precipitation. Plasmid DNA was then bound on a second filter plate directly from the filtrate, washed, dried, and eluted.
Plasmids were end-sequenced in 384-well plates, using vector-primed M13F Universal primer (SEQ ID NO:1) and the ABI BigDye version 3 Prism sequencing kit. For the sequencing reaction, 100-200 ng of template and 6.4 pmol of primer were used, and the following reaction conditions were repeated 25 times: 96 0C for 10 sec, 50 0C for 5 sec and 60 0C for 4 min. After ethanol-based cleanup, cycle sequencing reaction products were resolved and detected on Perkin-Elmer ABI 3700 automated sequencers. EXAMPLE 3
Identification of C20-PUFA Elongating Enzyme Homologs from Euglena gracilis cDNA Library eegic cDNA clones encoding C20-PUFA elongating enzyme homologs (i.e., "C20- PUFA EIo") were identified by conducting BLAST (Basic Local Alignment Search Tool; Altschul et al., J. Mot. Biol. 215:403-410 (1993)) searches for similarity to sequences contained in the BLAST "nr" database (comprising all non-redundant GenBank CDS translations, sequences derived from the 3-dimensional structure Brookhaven Protein Data Bank, the last major release of the SWISS-PROT protein sequence database, EMBL and DDBJ databases). The cDNA sequences obtained in Example 2 were analyzed for similarity to all publicly available DNA sequences contained in the "nr" database using the BLASTN algorithm provided by the National Center for Biotechnology Information (NCBI). The DNA sequences were translated in all reading frames and compared for similarity to all publicly available protein sequences contained in the "nr" database using the BLASTX algorithm (Gish and States, Nat. Genet. 3:266-272 (1993)) provided by the NCBI. For convenience, the P-value (probability) of observing a match of a cDNA sequence to a sequence contained in the searched databases merely by chance as calculated by BLAST are reported herein as "pLog" values, which represent the negative of the logarithm of the reported P-value. Accordingly, the greater the pLog value, the greater the likelihood that the cDNA sequence and the BLAST "hit" represent homologous proteins.
The BLASTX search using the nucleotide sequences from clone eeg1c.pkOO5.p14.f revealed similarity of the protein encoded by the cDNA to the C20-PUFA EIo from Pavlova sp. CCMP459 (SEQ ID NO:2) (NCBI Accession No. AAV33630 (Gl 54307108), locus AAV33630, CDS AY630573; Pereira et al., Biochem. J. 384:357-366 (2004)). The sequence of a portion of the cDNA insert from clone eeg1c.pkOO5.p14.f is shown in SEQ ID NO:3 (5' end of cDNA insert). Subsequently, the full insert sequence (i.e., eeg1c.pkOO5.p14.f:fis) was obtained and is shown in SEQ ID NO:4. Sequence for the coding sequence (CDS) is shown in SEQ ID NO:5. Sequence for the corresponding deduced amino acid sequence is shown in SEQ ID NO:6.
Full insert sequencing (FIS) was carried out using a modified transposition protocol. Clones identified for FIS were recovered from archived glycerol stocks as single colonies, and plasmid DNA was isolated via alkaline lysis. Plasmid templates were transposed via the Template Generation System (TGS II) transposition kit (Finnzymes Oy, Espoo, Finland), following the manufacturer's protocol. The transposed DNA was transformed into EH 10B electro-competent cells (Edge BioSystems, Gaithersburg, MD) via electroporation. Multiple transformants were randomly selected from each transposition reaction, plasmid DNA was prepared, and templates were sequenced as above (ABI BigDye v3.1) outward from the transposition event site, utilizing unique primers SeqE (SEQ ID NO:7) and SeqW (SEQ ID NO:8). Sequence data was collected (ABI Prism Collections software) and assembled using the Phrap sequence assembly program (P. Green, University of Washington, Seattle). Assemblies were viewed by the Consed sequence editor (D. Gordon, University of Washington, Seattle) for final editing.
The amino acid sequence set forth in SEQ ID NO:6 was evaluated by BLASTP, yielding a pLog value of 61.22 (E value of 6e-62) versus the Pavlova sp. CCMP459 C20-PUFA EIo (SEQ ID NO:2). The amino acid sequence set forth in SEQ ID NO:6 is 45.1% identical to the Pavlova sp. CCMP459 C20-PUFA EIo sequence (SEQ ID NO:2) using the Jotun Hein method. Sequence percent identity calculations performed by the Jotun Hein method (Hein, J. J., Meth. Enz. 183:626-645 (1990)) were done using the MegAlign™ v6.1 program of the LASERGENE bioinformatics computing suite (DNASTAR Inc., Madison, Wl) with the default parameters for pairwise alignment (KTUPLE=2). The amino acid sequence set forth in SEQ ID NO:6 is 40.4% identical to the Pavlova sp. CCMP459 C20-PUFA EIo sequence (SEQ ID NO:2) using the Clustal V method. Sequence percent identity calculations performed by the Clustal V method (Higgins, D. G. and Sharp, P.M., Comput. Appl. Biosci. 5:151-153 (1989); Higgins et al., Comput. Appl.
Biosci. 8:189-191 (1992)) were done using the MegAlign™ v6.1 program of the LASERGENE bioinformatics computing suite (supra) with the default parameters for pairwise alignment (KTUPLE=I , GAP PENALTY=3, WINDOW=5 and DIAGONALS SAVED=5 and GAP LENGTH PENALTY=IO). BLAST scores and probabilities indicate that the instant nucleic acid fragment (SEQ ID NO:5) encodes an entire Euglena gracilis C20-PUFA EIo gene, hereby named EgC20elo1. FIG. 25 summarizes BLASTP and percent identity values for EgC20elo1
(Example 3), EgDHAsyni (Example 4, infra), and EgDHAsyn2 (Example 5, infra).
EXAMPLE 4
Identification of DHA synthase 1 (EgDHAsvnP from Euαlena gracilis cDNA Library eegic cDNA clones encoding additional C20-PUFA EIo homologs were identified by conducting BLAST searches for similarity to sequences contained in the BLAST "nr" database as described in Example 3.
The BLASTX search using the nucleotide sequences from clone eeg1c.pkO16.e6.f (also called pKR1049) revealed similarity of the protein encoded by the cDNA to the C20-PUFA EIo from Pavlova sp. CCMP459 (SEQ ID NO:2)
(NCBI Accession No. AAV33630 (Gl 54307108), locus AAV33630, CDS AY630573; Pereira et al., Biochem. J. 384:357-366 (2004)). The sequence of a portion of the cDNA insert from clone eeg1c.pkO16.e6.f is shown in SEQ ID NO:9 (5' end of cDNA insert). Subsequently, the full insert sequence (eeg1c.pkO16.e6.f:fis) was obtained as described in Example 3 and is shown in SEQ ID NO: 10. The coding sequence is shown in SEQ ID NO:11 ; the corresponding deduced amino acid sequence is shown in SEQ ID NO:12. The amino acid sequence set forth in SEQ ID NO: 12 was evaluated by BLASTP as described in Example 3. Interestingly, SEQ ID NO:12 was found to be similar to both C20-PUFA EIo and delta-4 fatty acid desaturase. The N-terminus of SEQ ID NO:12 (from approximately amino acids 16-268) yields a pLog value of 60.30 (E value of 5e-61 ; 124/258 identical amino acids; 48% identity) versus the Pavlova sp. CCMP459 C20-PUFA EIo (SEQ ID NO:2). The C-terminus of SEQ ID NO: 12 (from approximately amino acids 253-793) yields an E value of 0.0 (535/541 identical amino acids; 98% identity), versus the delta-4 fatty acid desaturase from Euglena gracilis (SEQ ID NO: 13) (NCBI Accession No. AAQ19605 (Gl 33466346), locus AAQ19605, CDS AY278558; Meyer et al., Biochemistry 42(32): 9779-9788 (2003)). BLAST scores and probabilities indicate that the instant nucleic acid fragment (SEQ ID NO:11) encodes an entire Euglena gracilis C20-PUFA Elo/delta-4 fatty acid desaturase fusion gene, hereby named Euglena gracilis DHA synthase 1 (EgDHAsyni). The amino acid sequence of EgDHAsyni (SEQ ID NO:12) is 47.8% identical to the C20-PUFA EIo from Pavlova sp. CCMP459 (SEQ ID NO:2) and 98.9% identical to the delta-4 fatty acid desaturase from Euglena gracilis (SEQ ID NO: 13), using the Jotun Hein method as described in Example 3. The amino acid sequence of EgDHAsyni (SEQ ID NO: 12) is 41.2% identical to the C20-PUFA EIo from Pavlova sp. CCMP459 (SEQ ID NO:2) and 98.9% identical to the delta-4 fatty acid desaturase from Euglena gracilis (SEQ ID NO: 13), using the Clustal V method as described in Example 3.
FIG 27 summarizes BLASTP and percent identity values for EgDHAsyni (Example 4), EgC20elo1 (Example 3, supra) and EgDHAsyn2 (Example 5, infra). EXAMPLE 5
Identification of DHA synthase 2 (EqDHAsvn2) from
Euglena gracilis cDNA Library eegic
Approximately 17,000 clones of the Euglena gracilis cDNA library eegic were plated onto three large square (24 cm x 24 cm) petri plates (Corning, Corning, NY) each containing LB + 50 μg/mL kanamycin agar media. Cells were grown overnight at 37 0C, and plates were then cooled to room temperature. Colony Lifts:
Biodyne B 0.45 μm membrane (Cat. No. 60207, Pall Corporation, Pensacola, FL) was trimmed to approximately 22 cm x 22 cm, and the membrane was carefully laid on top of the agar to avoid air bubbles. After incubation for 2 min at room temperature, the membrane was marked for orientation, lifted off with tweezers, and placed colony-side up on filter paper soaked with 0.5 M sodium hydroxide and 1.5 M sodium chloride. After denaturation for 4 min, the sodium hydroxide was neutralized by placing the membrane on filter paper soaked with 0.5 M Tris-HCL (pH 7.5) and 1.5 M sodium chloride for 4 min. This step was repeated, and the membrane was rinsed briefly in 2X SSC buffer (2OX SSC is 3M sodium chloride, 0.3 M sodium citrate; pH 7.0) and air dried on filter paper. Hybridization:
Membranes were pre-hybridized at 65 0C in 200 mL hybridization solution for 2 hr. Hybridization solution contained 6X SSPE (2OX SSPE is 3 M sodium chloride, 0.2 M sodium phosphate, 20 mM EDTA; pH 7.4), 5X Denhardt's reagent (100X Denhardt's reagent is 2%(w/v) Ficoll, 2% (w/v) polyvinylpyrrolidone, 2% (w/v) acetylated bovine serum albumin), 0.5% sodium dodecyl sulfate (SDS), 100 μg/mL sheared salmon sperm DNA, and 5% dextran sulfate.
A DNA probe was made using an agarose gel purified NcoUNott DNA fragment, containing EgDHAsyni*, from pY141 (described in Example 10 herein) labeled with P32 dCTP using the Rad Prime DNA Labeling System (Cat. No. 18428- 011 , Invitrogen, Carlsbad, CA), following the manufacturer's instructions. Unincorporated P32 dCTP was separated using a NICK column (Cat. No. 17-0855- 02, Amersham Biosciences, Piscataway, NJ), following the manufacturer's instructions. The probe was denatured for 5 min at 100 0C and placed on ice for 3 min; then, half was added to the hybridization solution.
The membrane was hybridized with the probe overnight at 65 0C with gentle shaking and then washed the following day twice with 2X SSC containing 0.5% SDS (5 min each) and twice with 0.2X SSC containing 0.1% SDS (15 min each). After washing, hyperfilm (Cat. No. RPN30K, Amersham Biosciences) was exposed to the membrane overnight at -80 0C.
Based on alignment of plates with the exposed hyperfilm, positive colonies were picked using the blunt end of a Pasteur pipette into 1 mL of water and then vortexed. Several dilutions were made and plated onto small round Petri dishes (82 mm) containing LB media plus 50 μg/mL kanamycin to obtain around 100 well isolated colonies on a single plate. Lifts were done as described above except NytranN membrane circles (Cat, No. 10416116, Schleicher & Schuell, Keene, NH) were used, and hybridization was carried out in 100 mL using the remaining radiolabeled probe. In this way, one positive clone was identified (designated eeg1c-1). The plasmid from eeg1c-1 may also be referred to as pLF116. The individual positive clone was grown at 37 0C in LB + 50 μg/mL kanamycin liquid media, and plasmid was purified using the QIAprep® Spin Miniprep Kit (Qiagen Inc., Valencia, CA) following the manufacturer's protocol. The plasmid insert was sequenced as described in Example 2, with the ABI BigDye version 3 Prism sequencing kit using vector-primed M13F Universal primer (SEQ ID NO:1), vector-primed M13rev primer (SEQ ID NO:14), and the poly(A) tail-primed WobbleT oligonucleotides. Briefly, the WobbleT primer is an equimolar mix of 21 mer poly(T)A, poly(T)C, and poly(T)G, used to sequence the 3' end of cDNA clones. Based on initial sequence data, additional internal fragment sequence was obtained in a similar way using oligonucleotides oEUGel4-1 (SEQ ID NO: 15), EgEloD4Mut-5 (SEQ ID NO:16), oEUGel4-2 (SEQ ID NO:17), EgDHAsynδ1 (SEQ ID NO: 18), and EgDHAsyn3' (SEQ ID NO: 19). In this way, the full insert sequence of eeg1c-1 was obtained and is shown in SEQ ID NO:20. The coding sequence is shown as SEQ ID NO:21 , while the corresponding deduced amino acid sequence is shown as SEQ ID NO:22.
The amino acid sequence set forth in SEQ ID NO:22 was evaluated by BLASTP as described in Example 3. As was the case for EgDHAsyni , SEQ ID NO:22 was also found to be similar to both C20-PUFA EIo and delta-4 fatty acid desaturase. The N-terminus of SEQ ID NO:22 (from approximately amino acids 41- 268) yields a pLog value of 61.0 (E value of 1e-61 ; 118/231 identical amino acids; 51 % identity) versus the Pavlova sp. CCMP459 C20-PUFA EIo (SEQ ID NO:2). The C-terminus of SEQ ID NO:22 (from approximately amino acids 253-793) yields an E value of 0.0 (541/541 identical amino acids; 100% identity), versus the amino acid sequence of delta-4 fatty acid desaturase from Euglena gracilis (SEQ ID NO: 13). BLAST scores and probabilities indicate that the instant nucleic acid fragment (SEQ ID NO:21) encodes an entire Euglena gracilis C20-PUFA Elo/delta-4 fatty acid desaturase fusion gene, hereby named Euglena gracilis DHA synthase 2 (EgDHAsyn2).
The amino acid sequence of EgDHAsyn2 (SEQ ID NO:22) is 48.2% identical to the C20-PUFA EIo from Pavlova sp. CCMP459 (SEQ ID NO:2) and 100% identical to the delta-4 fatty acid desaturase from Euglena gracilis (SEQ ID NO:13), using the Jotun Hein method as described in Example 3. The amino acid sequence of EgDHAsyn2 (SEQ ID NO:22) is 41.2% identical to the C20-PUFA EIo from Pavlova sp. CCMP459 (SEQ ID NO:2) and 100% identical to the delta-4 fatty acid desaturase from Euglena gracilis (SEQ ID NO:13), using the Clustal V method as described in Example 3.
FIG. 25 summarizes BLASTP and percent identity values for EgDHAsyn2 (Example 5), EgC20elo1 (Example 3, supra) and EgDHAsyni (Example 4, supra).
EXAMPLE 6
Primary Structure Analysis of EqC20elo1. EgDHAsvni and EgDHAsvn2 Given the 100% amino acid identity between the C-terminus of EgDHAsyn2
(SEQ ID NO:22) and the Euglena gracilis delta-4 desaturase (SEQ ID NO:13), a nucleotide sequence alignment was carried out between the coding sequence of EgDHAsyn2 (SEQ ID NO:21), the cDNA sequence of the Euglena gracilis delta-4 desaturase (SEQ ID NO:23) (NCBI Accession No. AY278558 (Gl 33466345), locus AY278558, Meyer et al., Biochemistry 42(32):9779-9788 (2003)), and the coding sequence of the Euglena gracilis delta-4 desaturase (SEQ ID NO:24) (Meyer et al., supra). Sequence alignment was performed by the Clustal W method (using the
MegAlign™ v6.1 program of the LASERGENE bioinformatics computing suite (DNASTAR Inc.) with the default parameters for multiple alignment (GAP PENALTY=IO, GAP LENGTH PENALTY=0.2, Delay Divergen Seqs(%)=30, DNA Transition Weight=0.5, Protein Weight Matrix=Gonnet Series, DNA Weight Matrix=IUB ). The alignment is shown in FIG. 2. The Euglena gracilis delta-4 desaturase coding sequence is named EgD4_CDS (SEQ ID NO:24); the Euglena gracilis delta-4 desaturase cDNA sequence is named EgD4_cDNA (SEQ ID NO:23); and the Euglena gracilis DHA synthase 2 coding sequence is named EgDHAsyn2_CDS (SEQ ID NO:21).
The 5' end (where the sequences are divergent) and the 3' end (where the sequences are identical) of the alignment are truncated in order to fit the alignment on one page. FIG. 2 illustrates that the sequences are highly divergent from the start of the Euglena gracilis delta-4 desaturase cDNA to 83 bp upstream of the coding sequence (CDS) start site. It is clear from the alignment that the nucleotide sequences for EgD4_cDNA and EgDHAsyn2_CDS are identical from 83 bp upstream of the CDS start site of the Euglena gracilis delta-4 desaturase cDNA sequence (SEQ ID NO:23), which is equivalent to nucleotide 674 of the EgDHAsyn2_CDS (SEQ ID NO:21), through to the end of the sequences. At the exact point of divergence, a Not\ site can be found in the Euglena gracilis cDNA sequence (nucleotides 656-663 of SEQ ID NO:23), and since Not\ linkers were used in the original cloning of the Euglena gracilis delta-4 desaturase cDNA (see Meyer et al., supra), it is likely that what was cloned was an incomplete, not full- length, transcript for EgDHAsyn2.
The amino acid sequence EgDHAsyni (SEQ ID NO:12) was compared to EgDHAsyn2 (SEQ ID NO:22) and EgC20elo1 (SEQ ID NO:6) using the Clustal W method as described above, and the alignment is shown in FIGs. 3A and 3B.
Compared to EgDHAsyni and EgDHAsyn2, EgC20elo1 has a deletion of 7 amino acids (i.e., A L D L A [V/l] L) and 2 other amino acid substitutions (i.e., W47R, T48I; based on numbering for EgDHAsyni) at the N-terminus. After amino acid 289 of EgC20elo1 , the sequences are very different when compared to the DHA synthases. EgDHAsyni and EgDHAsyn2 have an additional 498 amino acids at their C-terminal ends with homology to delta-4 fatty acid desaturases, while EgC20elo1 ends after only 9 additional amino acids. The amino acid sequences of EgDHAsyni (SEQ ID NO:12) and EgDHAsyn2 (SEQ ID NO:22) have 8 amino acid differences between the 2 sequences (i.e., V25I, G54V, A305T, L310P, V380I, S491 N, I744T, R747P; based on numbering for EgDHAsyni). The last four differences occur in the delta-4 desaturase domain.
FIGs. 4A and 4B show the Clustal W alignment of the N-terminus of EgDHAsyni (SEQ ID NO:12) and the N-terminus of EgDHAsyn2 (SEQ ID NO:22) with EgC20elo1 (SEQ ID NO:6), Pavlova sp. CCMP459 C20-PUFA EIo (SEQ ID NO:2), Ostreococcus tauri PUFA elongase 2 (SEQ ID NO:25) (NCBI Accession No. AAV67798 (Gl 55852396), locus AAV67798, CDS AY591336; Meyer et al., J. Lipid Res. 45(10): 1899-1909 (2004)), and Thalassiosira pseudonana PUFA elongase 2 (SEQ ID NO:26) (NCBI Accession No. AAV67800 (Gl 55852441), locus AAV67800, CDS AY591338; Meyer et al., J. Lipid Res., supra). In FIGs. 4A and 4B, the Pavlova, Ostreococcus, and Thalassiosira proteins are labeled as PavC20elo, OtPUFAelo2, and TpPUFAelo2, respectively.
FIGs. 5A, 5B1 5C, and 5D show the Clustal W alignment of the C-terminus of EgDHAsyni (EgDHAsyn1_CT; amino acids 253-793 of SEQ ID NO:12; the N- terminus of EgDHAsyni is not shown and is indicated by "...") and the C-terminus of EgDHAsyn2 (EgDHAsyn2_CT; amino acids 253-793 of SEQ ID NO:22, the N- terminus of EgDHAsyn2 is not shown and is indicated by "...") with Euglena gracilis delta-4 fatty acid desaturase (SEQ ID NO: 13), Thraustochytrium aureum delta-4 desaturase (SEQ ID NO:27) (NCBI Accession No. AAN75707(GI 25956288), locus AAN75707, CDS AF391543), Schizochytrium aggregatum delta-4 desaturase (SEQ ID NO:28) (PCT Publication No. WO 2002/090493), Thalassiosira pseudonana delta-4 desaturase (SEQ ID NO:29) (NCBI Accession No. AAX14506 (Gl 60173017), locus AAX14506, CDS AY817156; Tonon et al., FEBS J. 272 (13):3401-3412 (2005)), and lsochrysis galbana delta-4 desaturase (SEQ ID NO:30) (NCBI Accession No. AAV33631 (Gl 54307110), locus AAV33631 , CDS AY630574; Pereira et al., Biochem. J. 384(2), :357-366 (2004) and PCT Publication No. WO 2002/090493). In FIGs. 5A, 5B, 5C, and 5D, the Euglena, Thraustochytrium, Thalassiosira, and lsochrysis proteins are labeled as EgD4, TaD4, TpD4, and lgD4, respectively.
FIG. 6 shows an alignment of interior fragments of EgDHAsyni (labeled as "EgDHAsyn1_ NCT.pro"; amino acids 253-365 of SEQ ID NO:12) and EgDHAsyn2 (labeled as "EgDHAsyn2_NCT.pro"; amino acids 253-365 of SEQ ID NO:22), spanning both the C20 elongase region and the delta-4 desaturase domain (based on homology), with the C-termini of C20 elongases (EgC20elo1_CT.pro, amino acids 246-298 of SEQ ID NO:6; PavC20elo_CT.pro, amino acids 240-277 of SEQ ID NO:2; OtPUFAelo2_CT.pro, amino acids 256-300 of SEQ ID NO:25; TpPUFAelo2_CT.pro, amino acids 279-358 of SEQ ID NO:26) and the N-termini of delta-4 desaturases (EgD4_NT.pro, amino acids 1-116 of SEQ ID NO:13; TaD4_NT.pro, amino acids 1-47 of SEQ ID NO:27; SaD4_NT.pro, amino acids 1-47 of SEQ ID NO:28; TpD4_NT.pro, amino acids 1-82 of SEQ ID NO:29; lgD4_NT.pro, amino acids 1-43 of SEQ ID NO:30) is shown. A conserved motif at the C-terminus of all the C20 elongase domains (i.e., VLFXXFYXXXY (SEQ ID NO:180)) is also present at the N-terminus of EgD4 and further supports EgD4 being an incomplete DHA synthase.
At the C-terminus of the C20 elongase domain for each of EgDHAsyni , EgDHAsyn2, and EgC20elo1 , there is a repeated sequence containing an NG motif (i.e., KNGK (SEQ ID NO:186), PENGA (SEQ ID NO:187), PENGA (SEQ ID NO:187), and PCENGTV (SEQ ID NO:191); called NG repeats and indicated in FIG. 6 with lines under the sequence). Although the pattern occurs with a high probability of occurrence, a scan of the NG repeated region using Prosite shows the last NG motif (i.e., NGTV) in this region as a potential N-glycosylation site. After the NG repeat region, both EgDHAsyni and EgDHAsyn2 contain a proline-rich region (labeled "Proline-rich linker" in FIG. 6), which may act as a linker between the C20 elongase and delta-4 desaturase domains. The linker may play a role in keeping the C20 elongase and delta-4 desaturase domains in the proper structural orientation to allow efficient conversion of EPA to DHA. Although the proline-rich linker is shown in FIG. 6 as extending from P304 to V321 (based on numbering for EgDHAsyni), the NG repeat region is also somewhat proline-rich and may also play a role in this linker function.
The nucleotide and corresponding amino acid sequences for the proline-rich linker of EgDHAsyni , as defined in FIG. 6, are set forth in SEQ ID NO:197 and SEQ ID NO: 198, respectively. The nucleotide and corresponding amino acid sequences for the proline-rich linker of EgDHAsyn2, as defined in FIG. 6, are set forth in SEQ ID NO:199 and SEQ ID NO:200, respectively.
The nucleotide and corresponding amino acid sequences for the EgDHAsyni C20 elongase domain from EgDHAsyni are set forth in SEQ ID NO:201 and SEQ ID NO:202, respectively. The nucleotide and corresponding amino acid sequences for the EgDHAsyn2 C20 elongase domain are set forth in SEQ ID NO:203 and SEQ ID NO:204, respectively.
EXAMPLE 7 Construction of PDMW263 Plasmid pY5-30 (which was previously described in U.S. Patent 7,259,255
(the contents of which are hereby incorporated by reference)), is a shuttle plasmid that can replicate both in E. coli and Yarrowia lipolytica. Plasmid pY5-30 contains the following: a Yarrowia autonomous replication sequence (ARS18); a CoIEI plasmid origin of replication; an ampicillin-resistance gene (AmpR), for selection in E. coli; a Yarrowia LEU2 gene, for selection in Yarrowia; and a chimeric TEF::GUS::XPR gene. Plasmid pDMW263 (SEQ ID NO:31) was created from pY5- 30, by replacing the TEF promoter with the Yarrowia lipolytica FBAINm promoter (U.S. Patent 7,202,356), using techniques well known to one skilled in the art. Briefly, the FBAIN promoter is located in the 5' upstream untranslated region in front of the 'ATG' translation initiation codon of the fructose-bisphosphate aldolase enzyme (E. C. 4.1.2.13), encoded by the fba1 gene. This promoter is necessary for expression and includes a portion of 5' coding region that has an intron. The modified promoter, FBAINm, has a 52 bp deletion between the ATG translation initiation codon and the intron of the FBAIN promoter (thereby including only 22 amino acids of the N-terminus) and a new translation consensus motif after the intron. Table 20 summarizes the components of pDMW263 (SEQ ID NO:31 ; also described in PCT Publication No. WO 2007/061845).
TABLE 20 Components of Plasmid pDMW263
Figure imgf000135_0001
EXAMPLE 8 Construction of Yarrowia lipolytica Expression Vector pY115 and
Gateway® Destination Vectors pBY1 and pY159 The Λ/col/Sa/l DNA fragment from pDMW263 (SEQ ID NO:31) (see construction in Example 7), containing the Yarrowia lipolytica FBAINm promoter, was cloned into the NcoUSall DNA fragment of pDMW237 (SEQ ID NO:32), previously described in PCT Publication No. WO 2006/012325 (the contents of which are hereby incorporated by reference). pDMW237contains a synthetic delta-9 elongase gene derived from lsochrysis galbana and codon-optimized for expression in Yarrowia lipolytica (lgD9e). In this way, plasmid pY115 (SEQ ID NO:33; FIG. 7A) was produced. In FIG. 7A, the modified FBAINm promoter is called FBA1 + Intron. The modified FBAINm promoter is referred to in other figures as either FBA1 + Intron or YAR FBA1 PRO + Intron; these terms are used interchangeably with FBAINm. Plasmid pY115 (SEQ ID NO:33) was digested with Nco\/Not\, and the resulting DNA ends were filled using Klenow. After filling to form blunt ends, the DNA fragments were treated with calf intestinal alkaline phosphatase and separated using agarose gel electrophoresis. The 6989 bp fragment containing the Yarrowia lipolytica FBAINm promoter was excised from the agarose gel and purified using the QIAquick® Gel Extraction Kit (Qiagen Inc., Valencia, CA), following the manufacturer's protocol. The purified 6989 bp fragment was ligated with cassette rfA using the Gateway Vector Conversion System (Cat. No. 11823-029, Invitrogen Corporation), following the manufacturer's protocol, to form Yarrowia lipolytica Gateway® destination vector pBY1 (SEQ ID NO:34; FIG. 7B). In constructing pBY1 , the filled Λ/col site provides an ATG start for translation initiation. Thus, genes transferred to this expression vector are expressed as fusion proteins and must be in the correct frame after Gateway® cloning. Also, 5' untranslated sequence results in additional amino acids being added to the N- terminus of the resulting protein. For this reason, a second Gateway® destination vector was made which had the vector-derived ATG start codon removed, thus allowing for translational start from the gene inserted.
The FBAINm promoter was amplified from plasmid pY115 (SEQ ID NO:33), using PCR with oligonucleotide primers oYFBAI (SEQ ID NO:35) and 0YFBAI-6 (SEQ ID NO:36). Primer oYFBAI (SEQ ID NO:35) was designed to introduce a BgIW site at the 5' end of the promoter, and primer 0YFBAI-6 (SEQ ID NO:36) was designed to introduce a Not\ site at the 3' end of the promoter while removing the Λ/col site and thus, the ATG start codon. The resulting PCR fragment was digested with BgIW and Not\ and cloned into the BglW/Not\ fragment of pY115, containing the vector backbone, to form pY158 (SEQ ID NO:37).
Plasmid pY158 (SEQ ID NO:37) was digested with Not\, and the resulting DNA ends were filled. After filling to form blunt ends, the DNA fragments were treated with calf intestinal alkaline phosphatase and separated using agarose gel electrophoresis. The 6992 bp fragment containing the Yarrowia lipolytica FBAINm promoter was excised from the agarose gel and purified using the QIAquick® Gel Extraction Kit (Qiagen Inc., Valencia, CA), following the manufacturer's protocol. The purified 6992 bp fragment was ligated with cassette rfA using the Gateway Vector Conversion System (Cat. No. 11823-029, Invitrogen Corporation), following the manufacturer's protocol, to form Yarrowia lipolytica Gateway® destination vector pY159 (SEQ ID NO:38; FIG. 7C).
EXAMPLE 9
Construction of Yarrowia lipolytica Expression Vectors pBY-EgC20elo1 (EqC20elo1). PY132 (EqDHAsvnP, pY161 (EqDHAsvnP and pY164 (EqDHAsvn2) Plasmid was purified from clones eeg1c.pkOO5.p14.f (Example 3), eeg1c.pkO16.e6.f (Example 4), and eeg1c-1 (Example 5) using the QIAprep® Spin Miniprep Kit (Qiagen Inc., Valencia, CA), following the manufacturer's protocol. Using the Gateway® LR Clonase™ Il enzyme mix (Cat. No. 11791-020, Invitrogen Corporation) and following the manufacturer's protocol, the cDNA inserts from eeg1c.pkOO1.p14.f (comprising EgC20elo1) and eeg1c.pkO16.e6.f (comprising EgDHAsyni) were transferred to pBY1 (SEQ ID NO:34; FIG. 7B) to form pBY- EgC20elo1 (SEQ ID NO:39, FIG. 7D) and pY132 (SEQ ID NO:40; FIG. 8A), respectively. The cDNA insert from eeg1c-1 (comprising EgDHAsyn2) was not transferred to pBY1 , because it would have resulted in the wrong translation frame being expressed.
Using the Gateway® LR Clonase™ Il enzyme mix (Cat. No. 11791-020, Invitrogen Corporation) and following the manufacturer's protocol, the cDNA inserts from eeg1c.pkO16.e6.f and eeg1c-1 were transferred to pY159 (SEQ ID NO:38; Example 8) to form pY161 (SEQ ID NO:41 , FIG. 8B) and pY164 (SEQ ID NO:42; FIG. 8C), respectively.
EXAMPLE 10 Construction of Yarrowia lipolytics Expression Vectors
PY141 (EqDHAsvni*), pY143 (EqDHAsvn1*C20EloDom1) and pY149
(EqDHAsvn1*C20EloDom2Linker)
EgDHAsyni was amplified from clone eeg1c.pkOO1.e6.f with oligonucleotide primers EgEPAEIoDom-5 (SEQ ID NO:43) and oEUG el4-3 (SEQ ID NO:44), using the Phusion™ High-Fidelity DNA Polymerase (Cat. No. F553S, Finnzymes Oy, Finland) following the manufacturer's protocol. The resulting DNA fragment was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pKR1062 (SEQ ID NO:45). An internal Λ/col site at nucleotides 619-624 was removed from EgDHAsyni in pKR1062 using the Quickchange® Site Directed Mutagenesis kit (Cat. No. 200518, Stratagene, La JoIIa, CA), with oligonucleotides EgEloD4Mut-5 (SEQ ID NO:46) and EgEloD4Mut-3 (SEQ ID NO:47), following the manufacturer's protocol. After extensive sequencing, a clone with the Λ/col site removed (i.e., a ccatgg to ccttgg mutation) and no further nucleotide changes made was chosen for further study. This clone was designated pLF115-7 (SEQ ID NO:48). The nucleotide sequence for EgDHAsyni having the Λ/col site removed (EgDHAsyni*) is set forth in SEQ ID NO:205. The corresponding amino acid sequence is identical to SEQ ID NO:12. Construction Of Plasmid pY141. Expressing EgDHAsvni*: The Nco\/Not\
DNA fragment from pLF115-7 (SEQ ID NO:48), containing EgDHAsyni (SEQ ID NO:205; without the internal Λ/col site; at nt 621 of the EgDHAsyni CDS; ccatgg to ccttgg), was cloned into the Λ/col/Λ/ofl DNA fragment from pY115, containing the Yarrowia lipolytics FBAINm promoter, to produce pY141 (SEQ ID NO:49; FIG. 8D). Thus, plasmid pY141 contains the full length EgDHAsyni* gene (labeled as ΕgDHAsyni(-Ncol)" in FIG.), under control of the Yarrowia lipolytica FBAINm promoter (PCT Publication No. WO 2005/049805; U.S. Patent 7,202,356; labeled as "Fba1+lntron" in FIG.), and the Pex20 terminator sequence from Yarrowia Pex20 gene (GenBank Accession No. AF054613).
Construction Of Plasmid pY143. Expressing EαDHAsvn1-C20EloDom1 : The nucleotide sequence for the EgDHAsyni* C20 elongase domain (EgDHAsyn1C20EloDom1) in pY141 is set forth in SEQ ID NO:206 (identical to SEQ ID NO:201 but Λ/col site removed). The corresponding amino acid sequence is identical to SEQ ID NO:202.
The EgDHAsyn1C20EloDom1 (SEQ ID NO:206) was amplified from pLF115- 7 with oligonucleotide primers EgEPAEIoDom-5 (SEQ ID NO:43) and EgDPAEIoDom-3 (SEQ ID NO:50) using the Phusion™ High-Fidelity DNA
Polymerase (Cat. No. F553S, Finnzymes Oy, Finland) following the manufacturer's protocol. The resulting DNA fragment was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pHD16 (SEQ ID NO:51). The Nco\/Not\ DNA fragment from pHD16 (SEQ ID NO:51), containing the
EgDHAsyn1C20EloDom1 (without the internal Λ/col site), was cloned into the NcoUNoti DNA fragment from pY115, containing the Yarrowia lipolytica FBAINm promoter, to produce pY143 (SEQ ID NO:52; FIG. 9A). Plasmid pY143 contains the N-terminal domain of EgDHAsyni* (EgDHAsyn1C20EloDom1) and does not include the proline-hch linker or delta-4 desaturase domain.
Construction Of Plasmid pY149, Expressing EgDHAsvni- C20EloDom2Linker: The EgDHAsyni* C20 elongase domain (SEQ ID NO:206) and proline-hch linker (SEQ ID NO:197), were amplified from pLF115-7 (SEQ ID NO:48) with oligonucleotide primers EgEPAEIoDom-5 (SEQ ID NO:43) and oEUGsyn6-2 (SEQ ID NO:53) using the Phusion™ High-Fidelity DNA Polymerase (Cat. No. F553S, Finnzymes Oy, Finland ) following the manufacturer's protocol. The resulting DNA fragment was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pKR1071 (SEQ ID NO:54). The Λ/col/Ec/132ll DNA fragment from pKR1071 (SEQ ID NO:54) was cloned into the Nco\/Not\ DNA fragment from pY115 (where the Noti site had been filled in), containing the Yarrowia lipolytica FBAINm promoter, to produce pY149 (SEQ ID
NO:55; FIG. 9B). Plasmid pY149 contains the EgDHAsyn1C20EloDom1/proline- rich linker fusion gene (i.e., EgDHAsyn1C20EloDom2Linker; SEQ ID NO:207), but does not contain the delta-4 desaturase domain. The amino acid sequence of EgDHAsyn1C20EloDom2Linker is set forth in SEQ ID NO:208. In addition to the amino acids from EgDHAsyni* C20 elongase domain and proline-rich linker, an additional 4 amino acids (i.e., SCRT) were added after the linker region as a result of how the fragment was synthesized and cloned.
EXAMPLE 11 Construction of Yarrowia lipolytica Expression Vectors for Generation of
Novel C20 Elongase/Delta-4 Desaturase Fusion Proteins In order to synthesize novel C20 elongase/delta-4 desaturase fusion proteins, a unique Sbft site was added to the 3' end of the C20 elongase domain of EgDHAsyni* after the proline-rich linker region (EgDHAsyn1C20EloDom3Linker). EgDHAsyn1C20EloDom3 was amplified from pLF115-7 (SEQ ID NO:48) with oligonucleotide primers EgEPAEIoDom-5 (SEQ ID NO:43) and oEUGsyn6-3 (SEQ ID NO:56) using the Phusion™ High-Fidelity DNA Polymerase (Cat. No. F553S, Finnzymes Oy1 Finland) following the manufacturer's protocol. The resulting DNA fragment was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pKR1091 (SEQ ID NO:57). The Λ/col/Ec/136ll DNA fragment from pKR1091 (SEQ ID NO:57), containing
EgDHAsyn1C20EloDom3Linker, was cloned into the NcoUNoft DNA fragment from pY115 (where the Not\ was filled to form a blunt end), containing the Yarrowia lipolytica FBAINm promoter, to produce pY155 (SEQ ID NO:58).
In order to synthesize novel C20 elongase/delta-4 desaturase fusion proteins, a unique Sbft site was added to the 5' end of various delta-4 desaturases. In each case, the Sbfl site is located after the ATG start site of each coding sequence and resulted in the addition and/or replacement of a few amino acids at the N-terminus of the delta-4 desaturase coded for by the genes.
Construction Of Plasmid pY156. Expressing EqDHAsvn1-C20EloDom3- lgD4*: The lsochrysis galbana delta-4 desaturase (SEQ ID NO:209; lgD4) was amplified from pRIG6 (previously described in PCT Publication No. WO 2002/090493) with oligonucleotides oRIG6-1 (SEQ ID NO:59) and oRIGβ-2 (SEQ ID NO:60) using the Phusion™ High-Fidelity DNA Polymerase (Cat. No. F553S) following the manufacturer's protocol. The resulting DNA fragment, which contains the lgD4 CDS and is identical to SEQ ID NO:209 except that an Sbft site was added at the 5' end after the start codon (lgD4*; SEQ ID NO:210), was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pKR1067 (SEQ ID NO:61). The amino acid sequence for lgD4* from pKR1067 is set forth in SEQ ID NO:211 and is identical to that to lgD4 (SEQ ID NO:30) except that the first 4 amino acids (i.e., MCNA) have been changed to MALQ due to the addition of the Sbfl site in the nucleotide sequence. The Nco\/Not\ DNA fragment from pKR1067 (SEQ ID NO:61), containing lgD4*, was cloned into the Nco\/Not\ DNA fragment from pY115, containing the Yarrowia lipolytica FBAINm promoter, to produce pY150 (SEQ ID NO:62; FIG. 9C). In FIG. 9C, lgD4* is labeled as "Ig d4 DS". In this way, lgD4* could be expressed alone in Yarrowia. The XbaUSbft DNA fragment from pKR1091 (SEQ ID NO:57; supra), containing EgDHAsyn1C20EloDom3Linker, was cloned into the X/?al/Sbfl DNA fragment from pKR1067 (SEQ ID NO:61), containing lgD4*, to produce pKR1097 (SEQ ID NO:63). Thus, an in-frame fusion was made between EgDHAsyn1C20EloDom3Linker and lgD4*, separated by the proline-rich linker region (called EgDHAsyn1C20EloDom3-lgD4; SEQ ID NO:212). The amino acid sequence for EgDHAsyn1C20EloDom3-lgD4 is set forth in SEQ ID NO:213.
The Nco\/Not\ DNA fragment from pKR1097 (SEQ ID NO:63), containing the EgDHAsyn1C20EloDom3-lgD4, was cloned into the Nco\/Not\ DNA fragment from pY115, containing the Yarrowia lipolytica FBAINm promoter, to produce pY156 (SEQ ID NO:64; FIG. 9D). In FIG. 9D, the EgDHAsyn1C20EloDom3-lgD4 is labeled as "EGel-IGd4".
Construction Of Plasmid pY152. Expressing EqDHAsvn1-D4Dom1*: A region of the C-terminus of EgDHAsyni* (SEQ ID NO:205) containing the delta-4 desaturase domain (EgDHAsyn1 D4Dom1 ; SEQ ID NO:214; corresponding amino acid sequence for EgDHAsyni D4Dom1 is set forth in SEQ ID NO:215), starting just after the end of the proline-rich linker region, was amplified from pLF115-7 (as described in Example 10) with oligonucleotides oEGslne6-1 (SEQ ID NO:65) and oEUGel4-3 (SEQ ID NO:44) using the Phusion™ High-Fidelity DNA Polymerase (Cat. No. F553S) following the manufacturer's protocol. Oligonucleotide oEGslneθ- 1 (SEQ ID NO:65) introduced an ATG start codon at the 5' end of the PCR product followed by an Sbfi site. The resulting DNA fragment was cloned into the pCR- Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pKR1069 (SEQ ID NO:66). The new CDS and amino acid sequences containing EgDHAsyn1 D4Dom1 from pKR1069 (i.e., EgDHAsyn1 D4Dom1*) are set forth in SEQ ID NO:216 and SEQ ID NO:217, respectively. The amino acid sequence for EgDHAsyn1 D4Dom1* (SEQ ID NO:217) is identical to that of EgDHAsyn1 D4Dom1 (SEQ ID NO:215), except that the first 2 amino acids (i.e., SG) have been changed to MAL due to the addition of the Sbf\ site in the nucleotide sequence.
The Λ/col/Λ/ofl DNA fragment from pKR1069 (SEQ ID NO:66), containing the EgDHAsyni D4Dom1*, was cloned into the Λ/col/Λ/ofl DNA fragment from pY115, containing the Yarrowia lipolytics FBAINm promoter, to produce pY152 (SEQ ID NO:67; FIG. 10A). In FIG. 1OA, the EgDHAsyn1 D4Dom1* is labeled as "EUG d4 (fus test)". In this way, the EgDHAsyn1 D4Dom1* could be expressed alone in Yarrowia.
Construction Of Plasmid pY157. Expressing EqDHAsvn1-C20EloDom3- EqD4Dom1 : The XbaUSbΑ DNA fragment from pKR1091 (SEQ ID NO:57), containing EgDHAsyn1C20EloDom3-l_inker, was cloned into the XbaUSbfl DNA fragment from pKR1069, containing the EgDHAsyn1 D4Dom1*, to produce pKR1099 (SEQ ID NO:68). In this way, an in-frame fusion was made between the EgDHAsyn1C20EloDom and the EgDHAsyn1 D4Dom1*, separated by the proline- rich linker region (called EgDHAsyn1C20EloDom3-EgD4Dom1 ; SEQ ID NO:218). The amino acid sequence of EgDHAsyn1 C20EloDom3-EgD4Dom1 (SEQ ID NO:219) is almost identical to EgDHAsyni except one amino acid (i.e., G323L based on numbering for EgDHAsyni) was changed due to the Sbft cloning site and fusion junction.
The Λ/col/Λ/ofl DNA fragment from pKR1099 (SEQ ID NO:68), containing the EgDHAsyni C20EloDom3-EgD4Dom1 , was cloned into the NcoUNoti DNA fragment from pY115, containing the Yarrowia lipolytica FBAINm promoter, to produce pY157 (SEQ ID NO:69; FIG. 10B). In FIG. 10B, the EgDHAsyn1C20EloDom3-EgD4Dom1 is labeled as "EGel-EGd4 fus". Construction Of Plasmid pY153. Expressing EqDHAsvn1*D4Dom2: A region of the C-terminus of EgDHAsyni containing the delta-4 desaturase domain and some of the C20 elongase domain (EgDHAsyn1 D4Dom2; SEQ ID NO:220; corresponding amino acid sequence for EgDHAsyn1 D4Dom2 is set forth in SEQ ID NO:221), which corresponds to the amino acid sequence identified as EgD4 (SEQ ID NO: 13; Meyer et al., Biochemistry 42(32):9779-9788 (2003)), was amplified from pLF115-7 (described in Example 10) with oligonucleotides oEUGel4-4 (SEQ ID NO:70) and oEUGel4-3 (SEQ ID NO:44) using the Phusion™ High-Fidelity DNA Polymerase (Cat. No. F553S) following the manufacturer's protocol. The resulting DNA fragment was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pKR1073 (SEQ ID NO:71).
The PaVNotl DNA fragment from pKR1073 (SEQ ID NO:71), containing the EgDHAsyn1 D4Dom2, was cloned into the NcoUNott DNA fragment from pY115, containing the Yarrowia lipolytica FBAINm promoter, to produce pY153 (SEQ ID NO:72; FIG. 10C). In FIG. 10C, the EgDHAsyn1 D4Dom2 is labeled as EUG d4 (HZ). In this way, the EgDHAsynD4Dom2 could be expressed alone in Yarrowia.
Construction Of Plasmid pY160, Expressing EgDHAsvn1-C20EloDom3- SaD4*: The Schizochytrium aggregatum delta-4 desaturase (SEQ ID NO:222; SaD4) was amplified from pRSA-1 (previously described in PCT Publication No. WO 2002/090493) with oligonucleotides oRSA1-1 (SEQ ID NO:73) and oRSA1-2 (SEQ ID NO:74) using the Phusion™ High-Fidelity DNA Polymerase (Cat. No. F553S) following the manufacturer's protocol. The resulting DNA fragment, which contains the SaD4 CDS and is identical to SEQ ID NO:222, except that an Sbf\ site was added at the 5' end after the start codon (SaD4*; SEQ ID NO:223), was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pKR1068 (SEQ ID NO:75). The amino sequence for SaD4* from pKR1068 is set forth in SEQ ID NO:224 and is identical to that to SaD4 (SEQ ID NO:28) except that the first 3 amino acids (i.e., MTV) have been changed to MALQ due to the addition of the Sbft site in the nucleotide sequence.
The NcoUNott DNA fragment from pKR1068 (SEQ ID NO:75) (partial digest to avoid internal Λ/col site), containing the SaD4*, was cloned into the NcoUNoti DNA fragment from pY115, containing the Yarrowia lipolytica FBAINm promoter, to produce pY151 (SEQ ID NO:76; FIG. 10D). In FIG. 1OD, the SaD4* is labeled as "RSA d4 DS". In this way, the SaD4* could be expressed alone in Yarrowia.
The Sbfl/Not\ DNA fragment from pKR1068 (SEQ ID NO:75), containing the SaD4*, was cloned into the SbWNott DNA fragment from pY157 (SEQ ID NO:69), containing the EgDHAsyn1 C20EloDom3Linker, to produce pY160 (SEQ ID NO:77; FIG. 11). In this way, an in-frame fusion was made between the EgDHAsyn1C20EloDom3 and the SaD4*, separated by the proline-rich linker region (i.e., EgDHAsyn1C20EloDom3-SaD4; SEQ ID NO:225). The amino acid sequence for EgDHAsyn1C20EloDom3-SaD4 is set forth in SEQ ID NO:226.
EXAMPLE 12
Euαlena anabaena Growth Conditions. Lipid Profile and mRNA Isolation Euglena anabaena was obtained from Dr. Richard Triemer's lab at Michigan State University (East Lansing, Ml). Approximately 2 mL of culture were removed for lipid analysis and centrifuged at 1 ,800 x g for 5 min. The pellet was washed once with water and re-centrifuged. The resulting pellet was dried for 5 min under vacuum, resuspended in 100 μL of trimethylsulfonium hydroxide (TMSH), and incubated at room temperature for 15 min with shaking. After this, 0.5 mL of hexane were added, and the vials were incubated for 15 min at room temperature with shaking. Fatty acid methyl esters (5 μL injected from hexane layer) were separated and quantified using a Hewlett-Packard 6890 Gas Chromatograph fitted with an Omegawax 320 fused silica capillary column (Supelco Inc., Cat. No. 24152). The oven temperature was programmed to hold at 170 0C for 1.0 min, increase to 240 0C at 5 0C /min, and then hold for an additional 1.0 min. Carrier gas was supplied by a Whatman hydrogen generator. Retention times were compared to those for methyl esters of standards commercially available (Nu-Chek Prep, Inc. Cat. No. U- 99-A), and the resulting chromatogram is shown in FIG. 12. The presence of EPA and DHA in the fatty acid profile suggested that Euglena anabaena would be a good source for long-chain PUFA biosynthetic genes such as, but not limited to, C20 elongases, delta-4 desaturases, and/or DHA synthases.
The remaining 5 mL of an actively growing culture was transferred into 25 mL of AF-6 Medium (Watanabe & Hiroki, NIES-Collection List of Strains, 5th ed., National Institute for Environmental Studies, Tsukuba, 127 pp (2004)) in a 125 mL glass flask. Euglena anabaena cultures were grown at 22 0C with a 16 hr light, 8 hr dark cycle for 2 weeks with very gentle agitation.
After 2 weeks, the culture (25 ml_) was transferred to 100 ml_ of AF-6 medium in a 500 ml_ glass bottle, and the culture was grown for 1 month as described above. After this time, two 50 mL aliquots were transferred into two separate 500 ml_ glass bottles containing 250 mL of AF-6 medium, and the cultures were grown for two months as described above (giving a total of ~600 mL of culture). After this, the cultures were pelleted by centrifugation at 1 ,800 x g for 10 min, washed once with water, and re-centrifuged. Total RNA was extracted from one of the resulting pellets using the RNA STAT-60™ reagent (TEL-TEST, Inc., Friendswood, TX) and following the manufacturer's protocol (use 5 mL of reagent, dissolved RNA in 0.5 mL of water). In this way, 340 μg of total RNA (680 ug/mL) were obtained from the pellet. The remaining pellet was frozen in liquid nitrogen and stored at -80 0C. The mRNA was isolated from all 340 μg of total RNA using the mRNA Purification Kit (Amersham Biosciences, Piscataway, NJ), following the manufacturer's protocol. In this way, 9.0 μg of mRNA were obtained.
EXAMPLE 13
Euplena anabaena cDNA Synthesis, Library Construction and Identification of DHA Synthases from cDNA Library eugic A cDNA library was generated using the Cloneminer™ cDNA Library
Construction Kit (Cat. No.18249-029, Invitrogen Corporation, Carlsbad, CA), following the manufacturer's protocol (Version B, 25-0608). Using the non- radiolabeling method, cDNA was synthesized from 5.12 μg of mRNA (Example 12) using the Biotin-atfB2-Oligo(dT) primer. After synthesis of the first and second strand, the affB1 adapter was added; ligation was performed; and the cDNA was size fractionated using column chromatography. DNA from fractions were concentrated, recombined into pDONR™222, and transformed into E. coli
ElectroMAX™ DM 0B™ T1 Phage-Resistant cells (Invitrogen Corporation). The Euglena anabaena library was named eug1c. Approximately 17,000 clones of cDNA library eugic were plated onto 3 large square (24 cm x 24 cm) petri plates (Corning, Corning, NY), each containing LB + 50 μg/mL kanamycin agar media. Cells were grown, transferred to Biodyne B membrane, and hybridized with a labeled Nco\/Not\ DNA fragment, containing EgDHAsyni*, from pY141 , exactly as described in Example 5. In this way, 11 positive clones were identified (designated as eug1c-1 to eug1c-11).
The positive clones were grown, and DNA was purified and sequenced as described in Example 2 using vector-primed M13F Universal primer (SEQ ID NO:1), vector-primed M13-28Rev primer (SEQ ID NO:14), and the poly(A) tail-primed WobbleT oligonucleotides. Based on initial sequence data, additional internal fragment sequence was obtained in a similar way using oligonucleotides EaDHAsynδ1 (SEQ ID NO:78), EaDHAsyn5'2 (SEQ ID NO:79), EaDHAsyn5'3 (SEQ ID NO:80), EaDHAsyn5'4 (SEQ ID NO:81), EaDHAsyn3' (SEQ ID NO:82),
EaDHAsyn3'2 (SEQ ID NO:83), EaDHAsyn3'3 (SEQ ID NO:84), EaDHAsyn3'4 (SEQ ID NO:85), and EaDHAsyn3'5 (SEQ ID NO:86). In this way, the full insert sequences of the eugic clones were obtained.
Sequences were aligned and compared using Sequencher™ (Version 4.2, Gene Codes Corporation, Ann Arbor, Ml), and in this way, the clones could be categorized into one of four distinct groups based on insert sequence (identified as EaDHAsyni to EaDHAsyn4). Representative clones containing the cDNA for each class of sequence were chosen for further study, and sequences for each representative plasmid (i.e., pLF117-1 , pl_F117-2, pLF117-3 and pLF117-4) are shown as SEQ ID NO: 87, SEQ ID NO:88, SEQ ID NO:89, and SEQ ID NO:90, respectively. The sequence of pLF117-1 shown by a string of NNNN's represents a region of the polyA tail which was not sequenced. The coding sequences for EaDHAsyni , EaDHAsyn2, EaDHAsyn3, and EaDHAsyn4 are shown as SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, and SEQ ID NO:94, respectively. The corresponding amino acid sequences for EaDHAsyni , EaDHAsyn2, EaDHAsyn3, and EaDHAsyn4 are shown as SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, and SEQ ID NO:98, respectively.
The amino acid sequences for EaDHAsyni (SEQ ID NO:95), EaDHAsyn2 (SEQ ID NO:96), EaDHAsyn3 (SEQ ID NO:97), and EaDHAsyn4 (SEQ ID NO:98) were evaluated by BLASTP as described in Example 3 and, as was the case for EgDHAsyni (SEQ ID NO: 12) and EgDHAsyn2 (SEQ ID NO:22), all four EaDHAsyn sequences were also found to be similar to both C20-PUFA EIo and delta-4 fatty acid desaturases. The N-termini of EaDHAsyni (SEQ ID NO:95), EaDHAsyn2 (SEQ ID NO:96), EaDHAsyn3 (SEQ ID NO:97), and EaDHAsyn4 (SEQ ID NO:98) each yielded a pLog value of 58.5 (E value of 3e-59; 114/247 identical amino acids; 46% identity) versus the Pavlova sp. CCMP459 C20-PUFA EIo (SEQ ID NO:2). The C-termini of EaDHAsyni (SEQ ID NO:95), EaDHAsyn2 (SEQ ID NO:96), EaDHAsyn3 (SEQ ID NO:97), and EaDHAsyn4 (SEQ ID NO:98) yielded E values of 0.0 (378/538 identical amino acids; 70% identity), 0.0 (378/538 identical amino acids; 70% identity), 0.0 (379/538 identical amino acids; 70% identity), and 0.0 (368/522 identical amino acids; 70% identity), respectively, versus the amino acid sequence of delta-4 fatty acid desaturase from Euglena gracilis (SEQ ID NO: 13). BLAST scores and probabilities indicate that the instant nucleic acid fragments encode entire Euglena anabaena C20-PUFA Elo/delta-4 fatty acid desaturases. The amino acid sequences for EaDHAsyni (SEQ ID NO:95), EaDHAsyn2 (SEQ ID NO:96), EaDHAsyn3 (SEQ ID NO:97), and EaDHAsyn4 (SEQ ID NO:98) were compared using the Clustal W method as described in Example 6, and the alignment is shown in FIGs. 13A, 13B, and 13C. Interestingly, due to a single bp deletion in the nucleotide sequence, the C-terminus of the resulting amino acid sequence for EaDHAsyn4 (approximately last 35 amino acids) is highly divergent and smaller than the other three EaDHAsyn proteins.
When compared to the amino acid sequence of EgDHAsyni (SEQ ID NO: 12) using BLASTP, the amino acid sequences of EaDHAsyni (SEQ ID NO:95),
EaDHAsyn2 (SEQ ID NO:96), EaDHAsyn3 (SEQ ID NO:97), and EaDHAsyn4 (SEQ ID NO:98) were 70% (558/791), 70% (558/791), 70% (559/791) and 70% (548/775) identical, respectively.
As was the case for EgDHAsyni (SEQ ID NO: 12) and EgDHAsyn2 (SEQ ID NO:22), all four EaDHAsyn sequences have a proline-rich linker region (from approximately P300 to T332 based on numbering for EaDHAsyn 1). The linker appears to be slightly longer than that for EgDHAsyni (SEQ ID NO: 12) or EgDHAsyn2 (SEQ ID NO:22). All four EaDHAsyn sequences also lack the NG repeat motif found upstream of the proline-rich motif of EgDH Asyni and EgDHAsyn2; but, this region, as was the case for EgDHAsyni and EgDHAsyn2, is also slightly proline-rich in all four EaDHAsyn sequences and may play a role in the linker function. The nucleotide sequences for the C20 elongase domains of EaDHAsyni , EaDHAsyn2, EaDHAsyn3, and EaDHAsyn4 are set forth in SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, and SEQ ID NO:230, respectively. The amino acid sequences for the C20 elongase domains of EaDHAsyni , EaDHAsyn2, and EaDHAsyn3 are set forth in SEQ ID NO:231 , SEQ ID NO:232, and SEQ ID NO:233, respectively. The amino acid sequence of the C20 elongase domain of EaDHAsyn4 is identical to that for EaDHAsyni .
The nucleotide and amino acid sequences for the proline-rich linker of EaDHAsyni are set forth in SEQ ID NO:234 and SEQ ID NO:235, respectively. The nucleotide and amino acid sequences for the proline-rich linkers of EaDHAsyn2, EaDHAsyn3, and EaDHAsyn4 are identical to that for EaDHAsyni .
The nucleotide sequences for the delta-4 desaturase domain 1 of each of EaDHAsyni , EaDHAsyn2, and EaDHAsyn4 are set forth in SEQ ID NO:236, SEQ ID NO:237, and SEQ ID NO:238, respectively. The amino acid sequences for the delta-4 desaturase domains of EaDHAsyni , EaDHAsyn2, and EaDHAsyn4 are set forth in SEQ ID NO:239, SEQ ID NO:240, and SEQ ID NO:241 , respectively. The nucleotide and amino acid sequence of the delta-4 desaturase domain 1 of EaDHAsyn3 is identical to that of EaDHAsyni .
The nucleotide sequences for the delta-4 desaturase domain 2 of EaDHAsyni , EaDHAsyn2, EaDHAsyn3, and EaDHAsyn4, including the proline-rich linker and a portion of the 3' end of the C20 elongase domain, are set forth in SEQ ID NO:242, SEQ ID NO:243, SEQ ID NO:244, and SEQ ID NO:245, respectively. The amino acid sequences for the delta-4 desaturase domains of EaDHAsyni , EaDHAsyn2, EaDHAsyn3, and EaDHAsyn4 are set forth in SEQ ID NO:246, SEQ ID NO:247, SEQ ID NO:248, and SEQ ID NO:249, respectively.
FIG. 29 summarizes the Euglena anabaena DHA synthase domain sequences.
EXAMPLE 14
Construction of Yarrowia lipolvtica Expression Vectors pY165. PY166, pY167 and pY168
Using the Gateway® LR Clonase™ Il enzyme mix (Cat. No. 11791-020, Invitrogen Corporation) and following the manufacturer's protocol, the cDNA inserts from pLF117-1 (SEQ ID NO:87), pLF117-2 (SEQ ID NO:88), pLF117-3 (SEQ ID NO:89), and pLF117-4 (SEQ ID NO:90) were transferred to pY159 (SEQ ID NO:38; Example 8) to form pY165 (SEQ ID NO:99, FIG. 14A)1 pY166 (SEQ ID NO:100; FIG. 14B), pY167 (SEQ ID NO:101 ; FIG. 14C), and pY168 (SEQ ID NO:102; FIG. 14D), respectively. Thus, each plasmid contains the full length EaDHAsyn gene, under control of the Yarrowia lipolytica FBAINm promoter (PCT Publication No. WO 2005/049805; U.S. Patent 7,202,356; labeled as "Yar Fba1 Pro+lntron" in FIG.), and the Pex20 terminator sequence from Yarrowia Pex20 gene (GenBank Accession No. AF054613).
EXAMPLE 15 Construction of Soybean Expression Vector pKR1061 For Co-Expression of the
Euplena gracilis DHA Synthase 1 (EgDHAsynP With the Saprolepnia diclina Delta-17 Desaturase (SdD17) The present Example describes construction of a soybean vector for co- expression of EgDHAsyni (SEQ ID NO:12) with SdD17 and a hygromycin phosphotransferase selectable marker (hpt).
EgDHAsyni was amplified from pKR1049 (clone eeg1c.pkO16.e6.f) with oligonucleotide primers oEGel2-1 (SEQ ID NO:103) and oEUG el4-3 (SEQ ID NO:44), using the Phusion™ High-Fidelity DNA Polymerase (Cat. No. F553S, Finnzymes Oy, Finland) following the manufacturer's protocol. The resulting DNA fragment was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pKR1055 (SEQ ID NO: 104).
A starting plasmid pKR72 (ATCC Accession No. PTA-6019; SEQ ID NO: 105, 7085 bp sequence), a derivative of pKS123 which was previously described in PCT Publication No. WO 2002/008269 (the contents of which are hereby incorporated by reference), contains the hygromycin B phosphotransferase gene (HPT) (Gritz, L. and Davies, J., Gene 25:179-188 (1983)), flanked by the T7 promoter and transcription terminator (T7prom/HPT/T7term cassette), and a bacterial origin of replication (ori) for selection and replication in bacteria (e.g., E. coli). In addition, pKR72 also contains HPT, flanked by the 35S promoter (Odell et al., Nature
313:810-812 (1985)) and NOS 3' transcription terminator (Depicker et al., J. MoI. Appl. Genet. 1 :561-570 (1982)) (35S/HPT/NOS31 cassette), for selection in plants such as soybean. pKR72 also contains a Not\ restriction site, flanked by the promoter for the α' subunit of β-conglycinin (Beachy et al., EMBO J. 4:3047-3053 (1985)) and the 3' transcription termination region of the phaseolin gene (Doyle et al., J. Biol. Chem. 261 :9228-9238 (1986)), thus allowing for strong tissue-specific expression in the seeds of soybean of genes cloned into the Not\ site. The βcon/Λ/ofl/Phas3' cassette in plasmid pKR72 (SEQ ID NO: 105, having
ATCC Accession No. PTA-6019) was amplified using oligonucleotide primers oCon- 1 (SEQ ID NO:106) and oCon-2 (SEQ ID NO:107) using the VentR® DNA Polymerase (Catalog No. M0254S, New England Biolabs Inc., Beverly, MA) following the manufacturer's protocol. The resulting DNA fragment was digested with Xba\ and cloned into the Xba\ site of pUC19, to produce pKR179 (SEQ ID NO:108).
EgDHAsyni was released from pKR1055 (SEQ ID NO: 104) by digestion with Nott and was cloned into the Λ/ofl site of plasmid pKR179 (SEQ ID NO: 108) to produce pKR1057 (SEQ ID NO: 109). The Sbfl fragment of pKR1057 (SEQ ID NO:109), containing the βcon/EgDHAsyn1/Phas3' cassette was cloned into the Sbfl site of pKR328 (SEQ ID NO: 110; which is described in PCT Publication No. WO 2004/071467 and the contents of which are hereby incorporated by reference), containing SdD17, to produce vector pKR1061 (SEQ ID NO: 111). A schematic depiction of pKR1061 is shown in FIG. 15A.
EXAMPLE 16
Construction of Soybean Expression Vector pKR973 For Co-Expression of the Paylova lutheri Delta-8 Desaturase (PavD8) With the Euglena gracilis Delta-9 Elongase (EgD9elo) and the Mortierella alpina Delta-5 Desaturase (MaD5)
Euplena gracilis delta-9 elongase (EgD9elo):
A clone from the Euglena cDNA library (eegic), called eeg1c.pk001.n5f, containing the Euglena gracilis delta-9 elongase (EgD9elo; SEQ ID NO:112; which is described in U.S. Application No. 11/601 ,563 (filed November 16, 2006, which published May 24, 2007 as US-2007-0118929-A1 ; Attorney Docket No. BB-1562) the contents of which are hereby incorporated by reference) was used as template to amplify EgD9elo with oligonucleotide primers oEugEL1-1 (SEQ ID NO:113) and oEugEL1-2 (SEQ ID NO:114) using the VentR® DNA Polymerase (Cat. No. M0254S, New England Biolabs Inc., Beverly, MA) following the manufacturer's protocol. The resulting DNA fragment was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pKR906 (SEQ ID NO:115). Plasmid pKR906 was digested with Not\, and the fragment containing the
Euglena gracilis delta-9 elongase was cloned into plasmid pKR132 (SEQ ID NO: 116; which is described in PCT Publication No. WO 2004/071467) to produce pKR953 (SEQ ID NO:117). Mortierella alpina delta-5 desaturase (MaD5): Vector pKR287 (SEQ ID NO:118; which is described in PCT Publication No.
WO 2004/071467, published August 26, 2004; the contents of which are hereby incorporated by reference), contains the Mortierella alpina delta-5 desaturase (MaD5; SEQ ID NO:119, which is described in U.S. Patent No. 6,075,183 and PCT Publication Nos. WO 2004/071467 and WO 2005/047479, the contents of which are hereby incorporated by reference), flanked by the soybean glycinin Gy1 promoter and the pea leguminA2 3' termination region (Gy1/MaD5/legA2 cassette). Vector pKR287 was digested with Sbf\/Bsi\N\, and the fragment containing the Gy1/MaD5/legA2 cassette was cloned into the Sbft/BsN\l\ fragment of pKR277 (SEQ ID NO: 120; which is described in PCT Publication No. WO 2004/071467, the contents of which are hereby incorporated by reference) to produce pK952 (SEQ ID NO:121).
Vector pKR457 (SEQ ID NO: 122), which was previously described in PCT Publication No. WO 2005/047479 (the contents of which are hereby incorporated by reference), contains a Not\ site flanked by the Kunitz soybean Trypsin Inhibitor (KTi) promoter (Jofuku et al., Plant Cell 1 :1079-1093 (1989)) and the KTi 3' termination region, the isolation of which is described in U.S. Patent No. 6,372,965, followed by the soy albumin transcription terminator, which was previously described in PCT Publication No. WO 2004/071467 (Kti/Λ/ofl/Kti3'Salb3' cassette). Through a number of sub-cloning steps, sequences containing >4sp718 restriction sites were added to the 5' and 3' ends of the Kti/Λ/ofl/Kti3'Salb3' cassette to produce SEQ ID NO:123. Paylova lutheri delta-8 desaturase (PavD8):
Pavlova lutheri (CCMP459) was obtained from the Culture of Marine Phytoplankton (CCMP, West Boothbay Harbor, ME) and grown in 250 ml_ flasks containing 50 mL of F/2-Si medium (made using F/2 Family Medium Kit-KIT20F2 and Filtered Seqwater-SEA2 from CCMP) at 26 0C with shaking at 150 rpm. Cultures were transferred to new medium on a weekly basis using a 1 :4 (old culture:new medium) dilution. Cultures from 28 flasks (1400 mL) were combined, and cells were pelleted by centrifugation at 1 ,800 x g for 10 min, washed once with water, and re-centrifuged.
Total RNA was extracted from the resulting pellet using the RNA STAT-60™ reagent (TEL-TEST, Inc., Friendswood, TX), following the manufacturer's protocol. In this way, 2.6 mg of total RNA (2.6 mg/mL) were obtained from the pellet. The mRNA was isolated from 1.25 mg of total RNA using the mRNA Purification Kit (Amersham Biosciences, Piscataway, NJ), following the manufacturer's protocol. In this way, 112 μg of mRNA were obtained. cDNA was synthesized from 224 ng of mRNA using the Superscript™ First- Strand Synthesis System for RT-PCR Kit (Invitrogen™ Life Technologies, Carlsbad, CA) with the provided oligo(dT) primer, according to the manufacturer's protocol. After RNase H treatment as per the protocol, the Pavlova lutheri delta-8 desaturase (PavD8; SEQ ID NO:124; which is described in U.S. Patent Application No. 11/737772 (filed April 20, 2007; Attorney Docket No. BB-1566) the contents of which are hereby incorporated by reference) was amplified from the resulting cDNA with oligonucleotide primers PvDES5'Not-1 (SEQ ID NO: 125) and PvDES3'Not-1 (SEQ ID NO: 126) using the conditions described below. cDNA (2 μL) from the reaction described above was combined with 50 pmol of PvDES5'Not-1 (SEQ ID NO:125), 50 pmol of PvDES3'Not-1 (SEQ ID NO:126), 1 μL of PCR nucleotide mix (10 mM, Promega, Madison, Wl), 5 μL of 10X PCR buffer (Invitrogen Corporation), 1.5 μL of MgCl2 (50 mM, Invitrogen Corporation), 0.5 μL of Taq polymerase (Invitrogen Corporation), and water to 50 μL. The reaction conditions were 94 0C for 3 min followed by 35 cycles of 94 0C for 45 sec, 55 0C for 45 sec, and 72 0C for 1 min. The PCR was finished at 72 0C for 7 min, and then held at 4 0C. The PCR reaction was analyzed by agarose gel electrophoresis on 5 μL, and a DNA band with molecular weight around 1.3 kb was observed. The remaining product was separated by agarose gel electrophoresis, and the DNA was purified using the Zymoclean™ Gel DNA Recovery Kit (Zymo Research, Orange, CA), following the manufacturer's protocol.
The PavD8, flanked by Not\ sites, was cloned into the Not\ site of the modified Kti/Λ/ofl/Kti3'Salb3' cassette (SEQ ID NO: 123), and then the DNA fragment was digested with /\sp718 and cloned into the Sbft site of pKR952 (SEQ ID NO: 121) to produce pKR970 (SEQ ID NO: 127). V
Plasmid pKR953 (SEQ ID NO:117) was digested with Pst\, and the fragment containing the Euglena gracilis delta-9 elongase was cloned into the Sbft site of pKR970 (SEQ ID NO:127) to produce pKR973 (SEQ ID NO:128, FIG. 15B). In this way, the Pavlova lutheri delta-8 desaturase could be co-expressed with the Mortierella alpina delta-5 desaturase and the Euglena gracilis delta-9 elongase behind strong, seed-specific promoters.
EXAMPLE 17
Construction of Soybean Expression Vector pKR1064 For Co-Expression of the Euαlena gracilis DHA Synthase 1 (EgDHAsvnP With the Saprolepnia diclina Delta-17 Desaturase (Sd D 17) The present Example describes construction of a soybean vector for co- expression of EgDHAsyni with SdD17 and the acetolactate synthase (ALS) selectable marker. The Pst\ fragment, containing the Ann/Sdd17/BD30 cassette from pKR271
(SEQ ID NO:129; which is described in PCT Publication No. WO 2004/071467 and the contents of which are hereby incorporated by reference), was cloned into the Sbft site of pKR226 (SEQ ID NO: 130; which is also described in PCT Publication No. WO 2004/071467) to produce vector pKR886r (SEQ ID NO:131). In this way, the Saprolegnia diclina delta-17 desaturase (SdD17) was cloned behind the annexin promoter which is strong and seed specific.
The Sbfl fragment of pKR 1057 (SEQ ID NO: 109), containing the βcon/EgDHAsyn1/Phas3' cassette, was cloned into the Sbft site of pKR886r (SEQ ID NO:131), containing SdD17, to produce vector pKR1064 (SEQ ID NO:132). A schematic depiction of pKR1064 is shown in FIG. 15C. EXAMPLE 18
Construction of Soybean Expression Vector pKR1133 For Co-Expression of the Euplena gracilis DHA Synthase 1 (EqDHAsvnP With the Euαlena gracilis Delta-9
Elongase (EqD9elo) and the Mortierella alpina Delta-5 Desaturase (MaD5) The glycinin Gy1 promoter was PCR amplified from pZBL119 (SEQ ID
NO: 133; which is described in PCT Publication No. WO 2004/071467 and the contents of which are hereby incorporated by reference) using primers oSGIy-2 (SEQ ID NO:134) and oSGIy-3 (SEQ ID NO:135). The resulting PCR fragment was subcloned into the intermediate cloning vector pCR-Script AMP SK(+) (Stratagene), according to the manufacturer's protocol, to produce plasmid pPSgly32 (SEQ ID NO:136).
The Pstt/Nott fragment of plasmid pSGIy32 (SEQ ID NO:136), containing the Gy1 promoter, was cloned into the Pst\/Not\ fragment from plasmid pKR142 (SEQ ID NO:137; which is described in PCT Publication No. WO 2004/071467 and the contents of which are hereby incorporated by reference), containing the leguminA2 3' transcription termination region, an ampicillin resistance gene, and bacterial oh, to produce pKR264 (SEQ ID NO: 138). Thus, vector pKR264 contains a Λ/ofl site flanked by the promoter for the glycinin Gy1 gene and the leguminA2 3' transcription termination region (Gy1/Λ/of//legA2 cassette). EgDHAsyni was released from pKR1055 (SEQ ID NO: 104; Example 15) by digestion with Nott and was cloned into the Λ/ofl site of plasmid pKR264 (SEQ ID NO:138), to produce pKR1128 (SEQ ID NO:139).
The Λ/ofl fragment of pKS129 (SEQ ID NO:140; which is described in PCT Publication No. WO 2004/071467 and the contents of which are hereby incorporated by reference), containing the MaD5 was cloned into the Λ/ofl site of pKR457 (SEQ ID NO:122; Example 16), to produce pKR606 (SEQ ID NO:141).
Vector pKR606 (SEQ ID NO: 141) was digested with 8s/WI and after filling to blunt the ends, the fragment containing the Gy1/MaD5/legA2 cassette was cloned into the filled Λ/α/oMI site of pKR277 (SEQ ID NO: 120) to produce pKR804 (SEQ ID NO:142).
The Bs/WI fragment from pKR1128 (SEQ ID NO: 139), containing the Gy1/EgDHAsyn1/legA2 cassette, was cloned into the Ss/WI site of pKR804 (SEQ ID NO:142) to produce pKR1130 (SEQ ID NO:143). Plasmid pKR953 (SEQ ID NO:117) was digested with βsΛ/VI; ends were blunted by filling; and pKR953 was then digested with BamH\. The filled Ss/WI/SamHI fragment of pKR953, containing the Salb/EgD9Elo/Phas3' cassette, was cloned into the PmeUBamHl sites of pNEB193 (New England Biolabs, Ipswich, MA) to produce pKR1131 (SEQ ID NO:144).
Plasmid pKR1131 (SEQ ID NO: 144) was digested with Pst\ and the fragment containing the Euglena gracilis delta-9 elongase was cloned into the Sbft site of pKR1130 (SEQ ID NO:143) to produce pKR1133 (SEQ ID NO:145, FIG. 15D).
In this way, the Euglena gracilis DHA synthase 1 could be co-expressed with the Mortierella alpina delta-5 desaturase and the Euglena gracilis delta-9 elongase behind strong, seed-specific promoters.
EXAMPLE 19
Construction of Soybean Expression Vector pKR1105 For Co-Expression of the Euαlena gracilis DHA Synthase 1 C20 Elongase Domain (EqDHAsvn1C20EloDom1) with the Schizochytήum apprepatum Delta-4 Desaturase (SaD4)
The βcon/Λ/of//Phas cassette was PCR amplified from pKS123 (SEQ ID NO:146; which is described in PCT Publication No. WO 2004/071467 and the contents of which are hereby incorporated by reference) using primers oKtiδ (SEQ ID NO:147) and oKtiθ (SEQ ID NO:148). The resulting PCR fragment was digested with Bsi\N\ and cloned into the BsΛΛ/l site of pKR124 (SEQ ID NO:149; which is described in PCT Publication No. WO 2004/071467 and the contents of which are hereby incorporated by reference), containing the bacterial origin of replication and selection, to produce plasmid pKR193 (SEQ ID NO: 150).
EgDHAsyn1C20Elodom1 was released from pHD16 (SEQ ID NO:51 ; Example 10) by digestion with Not\ and was cloned into the Not\ site of plasmid pKR193 (SEQ ID NO: 150) to produce pKR1103 (SEQ ID NO: 151).
The SsΛ/vϊ fragment, containing the EgDHAsyn1C20Elodom1 , was released from pKR1103 (SEQ ID NO: 151) and was cloned into the BsΛΛ/l site of pKR226 (SEQ ID NO:130; Example 17) to produce vector pKR1104 (SEQ ID NO:152). Vector pKR300 (SEQ ID NO:153; which is described in PCT Publication No.
WO 2004/071467, published August 26, 2004; the contents of which are hereby incorporated by reference), contains the Schizochytrium aggregatum delta-4 desaturase (SaD4), which is described in U.S. Patent No. 7,045,683 and PCT Publication No WO 02/090493, the contents of which are hereby incorporated by reference), flanked by the Λ/ofl restriction sites. The Asc\ site present within the SaD4 was removed without affecting the corresponding amino acid sequence to produce a new sequence (SEQ ID NO: 154) which remains flanked by the Λ/ofl sites. The Λ/ofl fragment (SEQ ID NO: 154) was cloned into the Λ/ofl site of plasmid pKR457 (SEQ ID NO: 122; Example 16) to produce pKR1102 (SEQ ID NO: 155).
Plasmid pKR1102 (SEQ ID NO: 155) was digested with Pst\, and the fragment containing the SaD4 was cloned into the Sbft site of pKR1104 (SEQ ID NO:152) to produce pKR1105 (SEQ ID NO:156; FIG. 16A). In this way, the Euglena gracilis DHA synthase 1 C20 elongase domain could be co-expressed with the Schizochytrium aggregatum delta-4 desaturase behind strong, seed-specific promoters.
EXAMPLE 20
Construction of Soybean Expression Vector pKR1134 For Expression of the Euglena gracilis DHA Synthase 1 C20 Elongase Doma\n/Schizochvthum apprepatum Delta-4 Desaturase Fusion (EgDHAsvn1C20EloDom3-SaD4)
EgDHAsyn1C20EloDom3 was amplified from pKR1091 with oligonucleotide primers EgEPAEIoDom-5 (SEQ ID NO:43) and oEUGsyn6-4 (SEQ ID NO: 157) using the Phusion™ High-Fidelity DNA Polymerase (Cat. No. F553S, Finnzymes Oy, Finland) following the manufacturer's protocol. The resulting DNA fragment was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pKR1107 (SEQ ID NO:158).
Plasmid pKR1107 (SEQ ID NO: 158) was digested with Λ/ofl, and the fragment containing the EgDHAsyn1C20EloDom3 was religated to form pKR1112 (SEQ ID NO:159).
The Xba\/Pst\ DNA fragment from pKR1112 (SEQ ID NO:159), containing EgDHAsyn1C20EloDom3, was cloned into the Xba\/Sbft DNA fragment from pKR1068 (SEQ ID NO:75; Example 11), containing the SaD4, to produce pKR1115 (SEQ ID NO: 160). In this way, the EgDHAsyn1C20Elodom3-SaD4 was re-created without an internal Sbft site but codes for an identical amino acid sequence as that described in Example 11. EgDHAsyn1C20Elodom3-SaD4 was released from pKR1115 (SEQ ID NO: 160) by digestion with Not\ and was cloned into the Not\ site of plasmid pKR1104 (SEQ ID NO:152), containing an ALS selectable marker, to produce pKR1134 (SEQ ID NO:161 ; FIG. 16B). EXAMPLE 21
Construction of Soybean Expression Vector pKR1095 For Co-Expression of the Tetruetreptia pomαuetensis CCMP1491 Delta-8 Desaturase (TpomD8) With the
Saprolegnia diclina Delta-17 Desaturase (SdD17) The present Example describes construction of a soybean vector for co- expression of TpomD8 with SdD17 and a hygromycin phosphotransferase selectable marker (hpt).
Tetruetreptia pomquetensis CCMP1491 cells (from 1 liter of culture) were purchased from the Provasoli-Guillard National Center for Culture of Marine Phytoplakton (CCMP) (Bigelow Laboratory for Ocean Sciences, West Boothbay Harbor, Maine). Total RNA was isolated using the trizol reagent (Invitrogen, Carlsbad, CA), according to the manufacturer's protocol. The cell pellet was resuspended in 0.75 mL of trizol reagent, mixed with 0.5 mL of 0.5 mm glass beads, and homogenized in a Biospec mini beadbeater (Bartlesville, OK) at the highest setting for 3 min. The mixture was centrifuged in an Eppendorf centrifuge for 30 sec at 14,000 rpm to remove debri and glass beads. Supernatant was extracted with 150 μL of 24:1 chloroform :isoamy alcohol. The upper aqueous phase was used for RNA isolation.
For RNA isolation, the aqueous phase was mixed with 0.375 mL of isopropyl alcohol and allowed to incubate at room temperature for 5 min. Precipitated RNA was collected by centrifugation at 8,000 rpm and kept at 4 0C for 5 min. The pellet was washed once with 0.7 mL of 80% ethanol and air dried. Thus, 95 μg of total RNA were obtained from Tetruetreptia pomquetensis CCMP1491.
Total RNA (0.95 μg of total RNA in 1 μL) was used as template to synthesize double stranded cDNA. The Creator™ SMART™ cDNA Library Construction Kit from BD Bioscience Clontech (Palo Alto, CA) was used. Total RNA (1 μL) was mixed with 1 μL of SMART IV oligonucleotide (SEQ ID NO:181) 1 μL of the Adaptor Primer from Invitrogen 3'-RACE kit (SEQ ID NO:182), and 2 μL of water. The mixture was heated to 75 0C for 5 min and then cooled on ice for 5 min. To the mixture was added: 2 μl_ of 5X first strand buffer, 1 μl_ 20 mM DTT, 1 μL of dNTP mix (10 mM each of dATP, dCTP, dGTP and dTTP), and 1 μL of PowerScript reverse transcriptase. The sample was incubated at 42 0C for 1 h. The resulting first strand cDNAs were then used as templates for amplification. The Tetruetreptia pomquetensis CCMP1491 delta-8 desaturase (TpomDδ;
SEQ ID NO:162; which is described in U.S. Patent Application No. 11/876,115 (filed October 22, 2007; Attorney Docket No. BB-1574) the contents of which are hereby incorporated by reference) was amplified from the cDNA with oligonucleotide primers TpomNot-5 (SEQ ID NO:163) and TpomNot-3 (SEQ ID NO:164) using Taq polymerase (Invitrogen Corporation) following the manufacturer's protocol.
Tetruetreptia pomquetensis CCMP1491 cDNA (1 μL) was combined with 50 pmol of TpomNot-5 (SEQ ID NO: 163), 50 pmol of TpomNot-3 (SEQ ID NO: 164), 1 μL of PCR nucleotide mix (10 mM, Promega, Madison, Wl), 5 μL of 10X PCR buffer (Invitrogen Corporation), 1.5 μL of MgC^ (50 mM, Invitrogen Corporation), 0.5 μL of Taq polymerase (Invitrogen Corporation) and water to 50 μL. The reaction conditions were 94 0C for 3 min followed by 35 cycles of 94 0C for 45 sec, 55 0C for 45 sec and 72 0C for 1 min. The PCR was finished at 72 0C for 7 min and then held at 4 0C. 5 μL of the PCR reaction were analyzed by agarose gel electrophoresis, and a DNA band with molecular weight around 1.3 kb was observed. The remaining product was separated by agarose gel electrophoresis, and the DNA was purified using the Zymoclean™ Gel DNA Recovery Kit (Zymo
Research, Orange, CA)1 following the manufacturer's protocol. The resulting DNA was cloned into the pGEM®-T Easy Vector (Promega), following the manufacturer's protocol, to produce pLF114-10 (SEQ ID NO:165). TpomDδ was released from pLF114-10 (SEQ ID NO:165) by digestion with
Λ/ofl and was cloned into the Not\ site of plasmid pKR179 (SEQ ID NO: 108;
Example 15) to produce pKR1002 (SEQ ID NO:166).
The Pstt fragment of pKR1002 (SEQ ID NO:166), containing the βconπ"pomD8/Phas3' cassette was cloned into the Sbfl site of pKR328 (SEQ ID NO: 110; Example 15), containing the SdD17, to produce vector pKR1095 (SEQ ID
NO: 167). A schematic depiction of pKR1095 is shown in FIG. 16C. EXAMPLE 22
Construction of Soybean Expression Vector pKR1132 For Co-Expression of the Tetrυetreotia pomαuetensis CCMP1491 Delta-8 Desaturase (TpomDδ) with the
Euglena gracilis Delta-9 Elongase (EqD9elo) and the Mortierella alpina Delta-5 Desaturase (MaD5)
TPomD8 was released from pLF114-10 (SEQ ID NO: 165; Example 21) by digestion with Λ/ofl and was cloned into the Not\ site of plasmid pKR264 (SEQ ID NO:138; Example 18) to produce pKR1127 (SEQ ID NO:168).
The SsM/l fragment from pKR1127 (SEQ ID NO: 168), containing the Gy1/TPomD8/legA2 cassette, was cloned into the Bs/WI site of pKR804 (SEQ ID NO:142; Example 18) to produce pKR1129 (SEQ ID NO:169).
Plasmid pKR1131 (SEQ ID NO: 144; Example 18) was digested with Pst\, and the fragment containing the Euglena gracilis delta-9 elongase was cloned into the Sbf\ site of pKR1129 (SEQ ID NO:169) to produce pKR1132 (SEQ ID NO:170, FIG. 16D). In this way, tbeTetruetreptia pomquetensis delta-8 desaturase could be co- expressed with the Mortierella alpina delta-5 desaturase and the Euglena gracilis delta-9 elongase behind strong, seed-specific promoters.
EXAMPLE 23
Construction of Soybean Expression Vector KS373 For Expression of a Euglena gracilis Delta-9 Elonqase/Etvg/ena gracilis DHA Synthase 1 Linker/Pav7oi/a lutheri Delta-8 Desaturase Fusion (EqD9elo-EqDHAsvn1 Link-PavD8)
An in-frame fusion between the Euglena gracilis delta-9 elongase (EgD9elo; Example 16; SEQ ID NO:112), the Euglena gracilis DHA synthase 1 proline-rich linker (EgDHAsyni Link; SEQ ID NO:171 ; described in Example 6 and shown in FIG. 6), and the Pavlova lutheri delta-8 desaturase (PavD8; Example 16; SEQ ID NO: 124) was constructed using the conditions described below.
An initial in-frame fusion between the EgD9elo and the EgDHAsyniϋnk (EgD9elo-EgDHAsyn1 Link) was made, flanked by a Λ/col site at the 5'end and a Not\ site at the 3' end, by PCR amplification. EgD9elo (SEQ ID NO:112) was amplified with oligonucleotides MWG507 (SEQ ID NO: 172) and MWG509 (SEQ ID NO:173), using the Phusion™ High-Fidelity DNA Polymerase (Cat. No. F553S, Finnzymes Oy, Finland), following the manufacturer's protocol. EgDHAsyni Link (SEQ ID NO: 171) was amplified in a similar way with oligonucleotides MWG510 (SEQ ID NO:174) and MWG511 (SEQ ID NO:175). The two resulting PCR products were combined and re-amplifed using MWG507 (SEQ ID NO:172) and MWG511 (SEQ ID NO: 175) to form EgD9elo-EgDHAsyn1 Link. The sequence of the EgD9elo- EgDHAsyni Link is shown in SEQ ID NO:176. EgD9elo-EgDHAsyn1 Link does not contain an in-frame stop codon upstream of the Λ/ofl site at the 3' end, and therefore, a DNA fragment cloned into the Λ/ofl site can give rise to an in-frame fusion with the EgD9elo-EgDHAsyn1 Link.
Plasmid KS366 (SEQ ID NO: 177) contains unique Λ/col and Λ/ofl restriction sites, flanked by the promoter for the α' subunit of β-conglycinin (Beachy et al., EMBO J. 4:3047-3053 (1985)) and the 3' transcription termination region of the phaseolin gene (Doyle et al., J. Biol. Chem. 261 :9228-9238 (1986)). Other than the replacement of the unique Λ/ofl site in pKR72 (SEQ ID NO: 105) with a unique Λ/col/Λ/ofl multiple cloning site, the Bcon/NcolNotl/PhasS' cassette in KS366 is identical to that found in pKR72 (SEQ ID NO:105), except that the flanking Hind\\\ sites were replaced by BamH\ sites. The Bcon/NcolNotl/PhasS' cassette of KS366 was cloned into the BamH\ site of pBluescript Il SK(+) vector (Stratagene).
The NcoUNotl DNA fragment, containing EgD9elo-EgDHAsyn1 Link (SEQ ID NO: 176), was cloned into the Λ/col/Λ/ofl DNA fragment from KS366 (SEQ ID NO: 177), containing the promoter for the α' subunit of β-conglycinin, to produce KS366-EgD9elo-EgDHAsyn1 Link (SEQ ID NO: 178).
The Not\ fragment containing PavD8 (generated as described in Example 16) was cloned into the Λ/ofl fragment of KS366-EgD9elo-EgDHAsyn1 Link (SEQ ID NO:178) to produce KS373 (SEQ ID NO:179; FIG. 17). EXAMPLE 24
Construction of Alternate Soybean Expression Vectors For Expression of
DHA Synthases, C20 Elongase Domains, Delta-4 Desaturase Domains.
Synthetic C20 Elongase/Delta-4 Desaturase Fusion Proteins and
Other Synthetic Elonαase/Desaturase Fusion Proteins In addition to the genes, promoters, terminators and gene cassettes described herein, one skilled in the art can appreciate that other promoter/gene/terminator cassette combinations can be synthesized in a way similar to, but not limited to, that described herein for expression of EgDHAsyni . Similarly, it may be desirable to express other PUFA genes (such as those described in Table 23), for co-expression with any of the DHA synthases of the present invention or DHA synthase domains (i.e., C20 elongase domain or delta-4 desaturase domain expressed individually). Additionally, synthetic fusions between an elongase domain and a desaturase domain separated by a suitable linker region could be made and expressed. For instance, a synthetic fusion between a C20 elongase (or C20 elongase domain from a DHA synthase) and a suitable delta-4 desaturase (or delta-4 desaturase domain from a DHA synthase) could be made and expressed. Alternatively, other elongases or desaturases could be used such as, but not limited to, the synthetic fusion described herein between a delta-9 elongase and delta-8 desaturase separated by a linker from a DHA synthase (i.e., Example 23).
For instance, PCT Publication Nos. WO 2004/071467 and WO 2004/071178 describe the isolation of a number of promoter and transcription terminator sequences for use in embryo-specific expression in soybean. Furthermore, PCT Publication Nos. WO 2004/071467, WO 2005/047479 and WO 2006/012325 describe the synthesis of multiple promoter/gene/terminator cassette combinations by ligating individual promoters, genes, and transcription terminators together in unique combinations. Generally, a Not\ site flanked by the suitable promoter (such as those listed in, but not limited to, Table 21) and a transcription terminator (such as those listed in, but not limited to, Table 22) is used to clone the desired gene. Not\ sites can be added to a gene of interest such as those listed in, but not limited to, Table 23 using PCR amplification with oligonucleotides designed to introduce Λ/ofl sites at the 5' and 3' ends of the gene. The resulting PCR product is then digested with Λ/ofl and cloned into a suitable promoter/Λ/ofl/terminator cassette.
In addition, PCT Publication Nos. WO 2004/071467, WO 2005/047479 and WO 2006/012325 describe the further linking together of individual gene cassettes in unique combinations, along with suitable selectable marker cassettes, in order to obtain the desired phenotypic expression. Although this is done mainly using different restriction enzymes sites, one skilled in the art can appreciate that a number of techniques can be utilized to achieve the desired promoter/gene/transcription terminator combination. In so doing, any combination of embryo-specific promoter/gene/transcription terminator cassettes can be achieved. One skilled in the art can also appreciate that these cassettes can be located on individual DNA fragments or on multiple fragments where co-expression of genes is the outcome of co-transformation of multiple DNA fragments.
TABLE 21 Seed-specific Promoters
Figure imgf000162_0001
TABLE 22 Transcription Terminators
Figure imgf000162_0002
TABLE 23 PUFA Biosvnthetic Pathway Genes
Figure imgf000162_0003
Figure imgf000163_0001
Figure imgf000164_0001
EXAMPLE 25 Production and Model System Transformation of Somatic Soybean Embryo Cultures with Soybean Expression Vectors Culture Conditions:
Soybean embryogenic suspension cultures (cv. Jack) are maintained in 35 ml_ liquid medium SB196 (infra) on a rotary shaker, 150 rpm, 26 0C with cool white fluorescent lights on 16:8 hr day/night photoperiod at light intensity of 60-85 μE/m2/s. Cultures are subcultured every 7 days to two weeks by inoculating approximately 35 mg of tissue into 35 ml_ of fresh liquid SB196 (the preferred subculture interval is every 7 days).
Soybean embryogenic suspension cultures are transformed with the soybean expression plasmids by the method of particle gun bombardment (Klein et al., Nature 327:70 (1987)) using a DuPont Biolistic PDS1000/HE instrument (helium retrofit) for all transformations.
Soybean Embryoqenic Suspension Culture Initiation:
Soybean cultures are initiated twice each month with 5-7 days between each initiation. Pods with immature seeds from available soybean plants are picked 45- 55 days after planting. Seeds are removed from the pods and placed into a sterilized magenta box. The soybean seeds are sterilized by shaking them for 15 min in a 5% Clorox solution with 1 drop of Ivory soap (i.e., 95 ml_ of autoclaved distilled water plus 5 ml_ Clorox and 1 drop of soap, mixed well). Seeds are rinsed using 2 1 -liter bottles of sterile distilled water and those less than 4 mm are placed on individual microscope slides. The small end of the seed is cut and the cotyledons are pressed out of the seed coat. When cultures are being prepared for production transformation, cotyledons are transferred to plates containing SB1 medium (25-30 cotyledons per plate). Plates are wrapped with fiber tape and are maintained at 26 0C with cool white fluorescent lights on 16:8 h day/night photoperiod at light intensity of 60-80 μE/m2/s for eight weeks, with a media change after 4 weeks. When cultures are being prepared for model system experiments, cotyledons are transferred to plates containing SB199 medium (25-30 cotyledons per plate) for 2 weeks, and then transferred to SB1 for 2-4 weeks. Light and temperature conditions are the same as described above. After incubation on SB1 medium, secondary embryos are cut and placed into SB196 liquid media for 7 days. Preparation of DNA for Bombardment:
Either an intact plasmid or a DNA plasmid fragment containing the genes of interest and the selectable marker gene are used for bombardment. Fragments from soybean expression plasmids, the construction of which is described herein, are obtained by gel isolation of digested plasmids. In each case, 100 μg of plasmid DNA is used in 0.5 ml_ of the specific enzyme mix described below. Plasmids are digested with Asc\ (100 units) in NEBuffer 4 (20 mM Tris-acetate, 10 mM magnesium acetate, 50 mM potassium acetate, 1 mM dithiothreitol, pH 7.9), 100 μg/mL BSA, and 5 mM beta-mercaptoethanol at 37 0C for 1.5 hr. The resulting DNA fragments are separated by gel electrophoresis on 1% SeaPlaque GTG agarose (BioWhitaker Molecular Applications), and the DNA fragments containing gene cassettes are cut from the agarose gel. DNA is purified from the agarose using the GELase digesting enzyme following the manufacturer's protocol.
A 50 μl_ aliquot of sterile distilled water containing 3 mg of gold particles (3 mg gold) is added to 30 μL of a 10 ng/μL DNA solution (either intact plasmid or DNA fragment prepared as described herein), 25 μL 5M CaCI2, and 20 μL of 0.1 M spermidine. The mixture is shaken 3 min on level 3 of a vortex shaker and spun for 10 sec in a bench microfuge. The supernatant is removed, followed by a wash with 400 μl_ 100% ethanol and another brief centrifugation. The 400 ul ethanol is removed, and the pellet is resuspended in 40 μl_ of 100% ethanol. Five μL of DNA suspension is dispensed to each flying disk of the Biolistic PDS1000/HE instrument disk. Each 5 μL aliquot contains approximately 0.375 mg gold per bombardment (e.g., per disk).
For model system transformations, the protocol is identical except for a few minor changes (i.e., 1 mg of gold particles is added to 5 μL of a 1 μg/μL DNA solution; 50 μL of a 2.5M CaCfe is used; and the pellet is ultimately resuspended in 85 μL of 100% ethanol thus providing 0.058 mg of gold particles per bombardment). Tissue Preparation and Bombardment with DNA:
Approximately 150-200 mg of seven day old embryogenic suspension cultures is placed in an empty, sterile 60 x 15 mm petri dish, and the dish is covered with plastic mesh. The chamber is evacuated to a vacuum of 27-28 inches of mercury, and tissue is bombarded one or two shots per plate with membrane rupture pressure set at 1100 PSI. Tissue is placed approximately 3.5 inches from the retaining /stopping screen. Model system transformation conditions are identical except 100-150 mg of embryogenic tissue is used; rupture pressure is set at 650 PSI; and tissue is placed approximately 2.5 inches from the retaining screen. Selection of Transformed Embryos: Transformed embryos are selected either using hygromycin (when the hygromycin B phosphotransferase (HPT) gene is used as the selectable marker) or chlorsulfuron (when the acetolactate synthase (ALS) gene is used as the selectable marker).
Following bombardment, the tissue is placed into fresh SB196 media and cultured as described above. Six to eight days post-bombardment, the SB196 is exchanged with fresh SB196 containing either 30 mg/L hygromycin or 100 ng/mL chlorsulfuron, depending on the selectable marker used. The selection media is refreshed weekly. Four to six weeks post-selection, green, transformed tissue is observed growing from untransformed, necrotic embryogenic clusters. Embryo Maturation:
For production transformations, isolated, green tissue is removed and inoculated into multiwell plates to generate new, clonally propagated, transformed embryogenic suspension cultures. Transformed embryogenic clusters are cultured for four-six weeks in multiwell plates at 26 0C in SB196 under cool white fluorescent (Phillips cool white Econowatt F40/CW/RS/EW) and Agro (Phillips F40 Agro) bulbs (40 watt) on a 16:8 hr photoperiod with light intensity of 90-120 μE/m2s. After this time, embryo clusters are removed to a solid agar media, SB166, for one-two weeks and then subcultured to SB103 medium for 3-4 weeks to mature embryos. After maturation on plates in SB103, individual embryos are removed from the clusters, dried, and screened for alterations in their fatty acid compositions as described supra.
For model system transformations, embryos are matured in soybean histodifferentiation and maturation liquid medium (SHaM liquid media; Schmidt et al., Cell Biology and Morphogenesis 24:393 (2005)), using a modified procedure. Briefly, after 4 weeks of selection in SB196, as described above, embryo clusters are removed to 35 ml_ of SB228 (SHaM liquid media) in a 250 ml_ Erlenmeyer flask. Tissue is maintained in SHaM liquid media on a rotary shaker at 130 rpm and 26 0C, with cool white fluorescent lights on a 16:8 hr day/night photoperiod at a light intensity of 60-85 μE/m2/s for 2 weeks as embryos matured. Embryos grown for 2 weeks in SHaM liquid media are equivalent in size and fatty acid content to embryos cultured on SB166/SB103 for 5-8 weeks.
After maturation in SHaM liquid media, individual embryos are removed from the clusters, dried, and screened for alterations in their fatty acid compositions as described supra. Media Recipes:
SB 196 - FN Lite Liquid Proliferation Medium (per liter) MS FeEDTA - 100x Stock 1 1O mL MS Sulfate - 100x Stock 2 10 mL
FN Lite Halides - 100x Stock 3 10 mL
FN Lite P, B, Mo - 100x Stock 4 1O mL
B5 vitamins (1 mL/L) 1.0 mL
2,4-D (10mg/L final concentration) 1.0 mL KNO3 2.83 gm
(NH4)2SO4 0.463 gm asparagine 1.0 gm sucrose (1 %) 10 gm pH 5.8
FN Lite Stock Solutions
Stock Number 100O mL 50O mL
1 MS Fe EDTA IOOx Stock
Na2 EDTA* 3.724 g 1.862 g
FeSO4 - 7H2O 2.784 g 1.392 g
*Add first, dissolve in dark bottle while stirring
MS Sulfate 10Ox stock
MgSO4 - 7H2O 37.O g 18.5 g
MnSO4 - H2O 1.69 g 0.845 g
ZnSO4 - 7H2O 0.86 g 0.43 g
CuSO4 - 5H2O 0.0025 g 0.00125 g
FN Lite Halides 10Ox Stock
CaCI2 - 2H2O 30.O g 15.O g
Kl 0.083 g 0.0715 g
CoCI2 - 6H2O 0.0025 g 0.00125 g
FN Lite P, B, Mo 10Ox Stock
KH2PO4 18.5 g 9.25 g
H3BO3 0.62 g 0.31 g
Na2MoO4 - 2H2O 0.025 g 0.0125 g
SB1 Solid Medium (per liter)
1 package MS salts (Gibco/ BRL - Cat. No. 11117-066)
1 mL B5 vitamins 1000X stock 31.5 g glucose
2 mL 2,4-D (20 mg/L final concentration) pH 5.7
8 g TC agar SB199 Solid Medium (per liter) 1 package MS salts (Gibco/ BRL - Cat. No. 11117-066)
1 ml. B5 vitamins 100OX stock 3Og Sucrose 4 ml 2,4-D (40 mg/L final concentration) pH 7.0
2 gm Gelrite
SB 166 Solid Medium (per liter) 1 package MS salts (Gibco/ BRL - Cat. No. 11117-066)
1 mL B5 vitamins 1000X stock 60 g maltose
750 mg MgCI2 hexahydrate 5 g activated charcoal pH 5.7
2 g gelrite
SB 103 Solid Medium (per liter)
1 package MS salts (Gibco/ BRL - Cat. No. 11117-066) 1 mL B5 vitamins 100OX stock
60 g maltose
750 mg MgCI2 hexahydrate pH 5.7
2 g gelrite
SB 71-4 Solid Medium (per liter)
1 bottle Gamborg's B5 salts w/ sucrose (Gibco/ BRL - Cat. No. 21153-036) pH 5.7 5 g TC agar
2,4-D Stock Obtain premade from Phytotech Cat. No. D 295 - concentration 1 mg/mL B5 Vitamins Stock (per 100 mϋ Store aliquots at -20 0C 10 g myo-inositol 100 mg nicotinic acid 100 mg pyridoxine HCI
1 g thiamine
If the solution does not dissolve quickly enough, apply a low level of heat via the hot stir plate.
SB 228- Soybean Histodifferentiation & Maturation (SHaM) (per liter)
DDI H2O 600 mL
FN-Lite Macro Salts for SHaM 10X 100 mL MS Micro Salts 1000x 1 mL
MS FeEDTA IOOx 1O mL CaCMOOx 6.82 mL
B5 Vitamins 1000x 1 mL
L-Methionine 0.149 g
Sucrose 30 g
Sorbitol 30 g Adjust volume to 900 mL pH 5.8 Autoclave
Add to cooled media (<30 0C):
*Glutamine (final concentration 30 mM) 4% 11O mL *Note: Final volume will be 1010 mL after glutamine addition.
Since glutamine degrades relatively rapidly, it may be preferable to add immediately prior to using media. Expiration 2 weeks after glutamine is added; base media can be kept longer without glutamine.
FN-lite Macro for SHAM 10X- Stock #1 (per liter)
(NH-O2SO4 (ammonium sulfate) 4.63 g
KNO3 (potassium nitrate) 28.3 g
MgSO4 *7H20 (magnesium sulfate heptahydrate) 3.7 g KH2PO4 (potassium phosphate, monobasic) 1.85 g
Bring to volume
Autoclave
MS Micro 10QOX- Stock #2 (per 1 liter)
H3BO3 (boric acid) 6.2 g
MnSO4*H2O (manganese sulfate monohydrate) 16.9 g
ZnSO4*7H20 (zinc sulfate heptahydrate) 8.6 g
Na2MoO4 *2H20 (sodium molybdate dihydrate) 0.25 g CuSO4*5H20 (copper sulfate pentahydrate) 0.025 g
CoCI2*6H20 (cobalt chloride hexahydrate) 0.025 g
Kl (potassium iodide) 0.8300 g
Bring to volume Autoclave
FeEDTA 100X- Stock #3 (per liter)
Na2EDTA* (sodium EDTA) 3.73 g
FeSO4 *7H20 (iron sulfate heptahydrate) 2.78 g
*EDTA must be completely dissolved before adding iron. Bring to Volume
Solution is photosensitive. Bottle(s) should be wrapped in foil to omit light. Autoclave
Ca 100X- Stock #4 (per liter)
CaCI2 *2H20 (calcium chloride dihydrate) 44 g
Bring to Volume
Autoclave
B5 Vitamin 1000X- Stock #5 (per liter)
Thiamine*HCI 10 g
Nicotinic Acid 1 9
Pyridoxine*HCI i g
Myo-lnositol 100 g Bring to Volume Store frozen
4% Glutamine- Stock #6 (per liter) DDI water heated to 30 0C 900 ml_
L-Glutamine 40 g
Gradually add while stirring and applying low heat. Do not exceed 35 0C. Bring to Volume Filter Sterilize Store frozen* *Note: Warm thawed stock in 31 0C bath to fully dissolve crystals.
EXAMPLE 26 Chlorsulfuron Selection (ALS) and Plant Regeneration
Chlorsulfuron (ALS) Selection:
Following bombardment, the tissue is divided between 2 flasks with fresh SB196 media and cultured as described in Example 25. Six to seven days post- bombardment, the SB196 is exchanged with fresh SB196 containing selection agent of 100 ng/mL chlorsulfuron (chlorsulfuron stock is 1 mg/mL in 0.01 N ammonium hydroxide). The selection media is refreshed weekly. Four to six weeks post selection, green, transformed tissue may be observed growing from untransformed, necrotic embryogenic clusters. Isolated, green tissue is removed and inoculated into multiwell plates containing SB196, and embryos are matured as described in Example 25.
Regeneration of Soybean Somatic Embryos Into Plants:
In order to obtain whole plants from embryogenic suspension cultures, the tissue must be regenerated. Embryos are matured as described in Example 25. After subculturing on medium SB103 for 3 weeks, individual embryos can be removed from the clusters and screened for alterations in their fatty acid compositions as described herein. It should be noted that any detectable phenotype, resulting from the expression of the genes of interest, could be screened at this stage. This would include, but not be limited to, alterations in fatty acid profile, protein profile and content, carbohydrate content, growth rate, viability, or the ability to develop normally into a soybean plant.
Matured individual embryos are desiccated by placing them into an empty, small petri dish (35 x 10 mm) for approximately 4 to 7 days. The plates are sealed with fiber tape (creating a small humidity chamber). Desiccated embryos are planted into SB71-4 medium where they are left to germinate under the same culture conditions described above. Germinated plantlets are removed from germination medium and rinsed thoroughly with water and then are planted in Redi- Earth in 24-cell pack tray, covered with clear plastic dome. After 2 weeks the dome is removed, and plants are hardened off for a further week. If plantlets look hardy, they are transplanted to 10" pot of Redi-Earth with up to 3 plantlets per pot. After 10 to 16 weeks, mature seeds are harvested, chipped, and analyzed for fatty acids.
EXAMPLE 27 Functional Analysis of the Euplena gracilis and Euαlena anabaena DHA Synthases in Yarrowia lipolvtica
Each of the expression vectors described below in Table 24 was transformed into Yarrowia lipolytica strain Y2224 (a uracil ura3 auxotrophic strain of Yarrowia lipolytica) as described in the General Methods above.
Single colonies of transformant Yarrowia lipolytica containing an appropriate Yarrowia expression vector (see Table 24) were grown in 3 ml_ MM lacking uracil supplemented with 0.2% tergitol at 30 0C for 1 day. After this, 0.05 ml_ was transferred to 3 ml_ of the same medium supplemented with either no fatty acid or EPA to 0.175 mM. These were incubated for 16 hr at 30 0C, 250 rpm and then pellets were obtained by centrifugation. Cells were washed once with water, pelleted by centrifugation and air dried. Pellets were transesterified (Roughan, G. and Nishida, I., Arch. Biochem. Biophys. 276(1 ):38-46 (1990)) with 500 μl_ of 1 % sodium methoxide for 30 min at 50 0C after which 500 μl_ of 1 M sodium chloride and 100 μL of heptane were added. After thorough mixing and centrifugation, fatty acid methyl esters (FAMEs) were analyzed by GC as described supra (see General Methods). TABLE 24
Figure imgf000174_0001
The fatty acid profile for Yarrowia expressing pBY-EgC20elo1 showed no elongation of EPA to DPA. The fatty acid profiles, calculated % elongation and calculated % desaturation for the remaining clones are shown in FIG. 18. Percent C20 elongation (% C20 Elong) was calculated by dividing the sum of the weight percent (wt. %) for DPA and DHA by the sum of the wt. % for EPA, DPA and DHA and multiplying by 100 to express as a %. Similarly, percent delta-4 desaturation (% D4 Desat) was calculated by dividing the wt. % for DHA by the sum of the wt. % for DPA and DHA and multiplying by 100 to express as a %. Averages are indicated by Ave. followed by appropriate header.
In summary of FIG. 18, all of the DHA synthases except for EaDHAsyn4 functioned as both C20 elongases (elongating EPA to DPA) and as delta-4 desaturases (desaturating DPA to DHA) in Yarrowia. EaDHAsyn4, which contained a substantially different amino acid sequence at the C-terminus due to a frameshift in the nucleotide sequence, had considerably lower elongation function and no desaturase activity was detected. Expressing EgDHAsyni in pY132 consistently resulted in higher activity in Yarrowia when compared to the other EgDHAsyni constructs, likely due to the fact that EgDHAsyni was expressed as an in-frame fusion between some vector sequence, the 5' UTR of EgDHAsyni and the EgDHAsyni coding sequence. The resulting fusion created may lead to enhanced activity because of enhanced expression in Yarrowia or because of an inherent increase in activity to the enzyme itself. When only the coding sequence of EgDHAsyni* is expressed (i.e., with no 5'UTR; see pY141), the activity is higher than when the 5'UTR is present but not translated as a fusion (i.e., see pY161). This observation is likely due to a decrease in expression of EgDHAsyni due to the presence of the 5'UTR.
EXAMPLE 28
Functional Analysis of EgDHAsyni Independent C20 Elongase and Delta-4 Desaturase Domains and Comparison with Heterologous Fusions
Each of the expression vectors described below in Table 25 (and a vector only control) was transformed into Yarrowia lipolytica strain Y2224, as described in Example 27. EPA and/or DPA was fed to the transformed Yarrowia cells. A schematic showing the relative domain structure for each construct in Table 25 is shown in FIG. 21.
TABLE 25 Summary of Vectors Expressed in Yarrowia lipolytica
Figure imgf000175_0001
Figure imgf000176_0001
Figure imgf000177_0001
Fatty acid profiles of the transformant cells were subsequently analyzed as described in Example 27.
The results for feeding EPA to a vector only control, pY141 , pY143, pY149, pY156, pY157 and pY160 are shown in FIG. 19. The fatty acid profiles for Yarrowia expressing pY150, pY151 , pY152 and pY153 showed no elongation of EPA to DPA and are not shown in FIG. 19.
The results for feeding DPA to a vector only control, pY141 , pY150, pY151 , pY152, pY153, pY156, pY157 and pY160 are shown in FIG. 20. The fatty acid profiles for Yarrowia expressing pY143 and pY149 showed no desaturation of DPA to DHA and are not shown in FIG. 20.
Percent C20 elongation (% C20 Elong), percent delta-4 desaturation (% D4 Desat) and averages were calculated as described in Example 27.
In summary of FIG. 19 and FIG. 20, when the EgDHAsyn1C20Elo domain is expressed alone (with or without the linker; i.e., pY143 and pY149), the average percent C20 elongation increases by about 40% compared to the native EgDHAsyni* (pY141 ; SEQ ID NO:49). The opposite occurs with the EgDHAsyni delta-4 desaturase domain where there is no activity with EgDHAsyn1 D4Dom1 (pY152; SEQ ID NO:67; see FIG. 20) and about 50% less with EgDHAsyn1 D4Dom2 (pY153; SEQ ID NO:72; see FIG. 20) when expressed alone compared to
EgDHAsyni* (pY141 ; SEQ ID NO:49; see FIG. 20) fed DPA. The lgD4 has no delta-4 desaturase activity when expressed alone (pY150; SEQ ID NO:62; see FIG. 20) or as a fusion (pY156; SEQ ID NO:64; see FIGs. 19 and 20) and even causes an approximately 50% decrease in elongation activity when fused to the EgDHAsyn1C20 elongase domain (pY156; SEQ ID NO:64; see FIG. 19), possibly due to incorrect folding. In contrast, the SaD4 expressed alone (pY151 ; SEQ ID NO:76; see FIG. 20) has approximately the same delta-4 desaturase activity as that for the native EgDHAsyni* (pY141 ; SEQ ID NO:49; see FIG. 20). Interestingly, the delta-4 desaturase activity of SaD4 increases approximately 2-fold when fused to the EgDHAsyni C20 elongase domain (pY160; SEQ ID NO:77; see FIG. 20). When fused to EgDHAsyni C20 elongase domain and fed EPA (pY160; SEQ ID NO:77; see FIG. 19), the delta-4 desaturase activity is approximately 3-fold higher than when DPA is fed (pY160; SEQ ID NO:77; see FIG. 20) suggesting the linking of the two domains results in increased efficiency or flux, perhaps due to substrate channeling.
EXAMPLE 29 Substrate Specificity of EgDHAsyni* GLA1 STA, EDA, ERA, DGLA, ETA, ARA, EPA and DPA were fed to
Yarrowia cells transformed with pY141 (EgDHAsyni*; SEQ ID NO:49) and a vector only control and fatty acid profiles were analyzed as described in Example 27.
The results for feeding EPA, ARA and DPA are shown in FIG. 22. The fatty acid profiles for Yarrowia fed with GLA, STA1 EDA, ERA, DGLA and ETA showed no elongation and are not shown in FIG. 22. Percent C20 elongation (% C20 Elong) and percent delta-4 desaturation (% D4 Desat) and averages were calculated as described in Example 27 when fed EPA or DPA. When fed ARA, percent C20 elongation (% C20 Elong) was calculated by dividing the sum of the wt. % for docosatetraenoic acid [DTA; 22:4 (7,10,13,16)] and omega-6 docosapentaenoic acid [DPAn-6; 22:5(4,7,10,13,16)] by the sum of the wt. % for ARA, DTA and DPAn- 6 and multiplying by 100 to express as a %. Similarly, percent delta-4 desaturation (% D4 Desat) when fed ARA was calculated by dividing the wt. % for DPAn-6 by the sum of the wt. % for DTA and DPAn-6 and multiplying by 100 to express as a %.
In summary of FIG. 22, EgDHAsyni* elongates both ARA and EPA although it has a slight preference (approximately 40% more active) for EPA. The elongation product of ARA (i.e., DTA) is also desaturated in the delta-4 position by EgDHAsyni to produce DPAn-6 and the activity is approximately 40% higher for DTA than DPA. EXAMPLE 30
Co-expression of the Euplena gracilis DHA Synthase 1 with the Paylova lutheri Delta-8 Desaturase, the Mortierella alpina Delta-5 Desaturase, the Saproleαnia diclina Delta-17 Desaturase and the Euαlena gracilis Delta-9 Elonqase in Soybean Embryos Transformed with Soybean Expression Vectors pKR973 and
PKR1064
Mature somatic soybean embryos are a good model for zygotic embryos. While in the globular embryo state in liquid culture, somatic soybean embryos contain very low amounts of triacylglycerol or storage proteins typical of maturing, zygotic soybean embryos. At this developmental stage, the ratio of total triacylglyceride to total polar lipid (phospholipids and glycolipid) is about 1 :4, as is typical of zygotic soybean embryos at the developmental stage from which the somatic embryo culture was initiated. At the globular stage as well, the mRNAs for the prominent seed proteins, α'-subunit of β-conglycinin, kunitz trypsin inhibitor 3, and seed lectin are essentially absent. Upon transfer to hormone-free media to allow differentiation to the maturing somatic embryo state, triacylglycerol becomes the most abundant lipid class. As well, mRNAs for α'-subunit of β-conglycinin, kunitz trypsin inhibitor 3 and seed lectin become very abundant messages in the total mRNA population. On this basis, the somatic soybean embryo system behaves very similarly to maturing zygotic soybean embryos in vivo, and is thus a good and rapid model system for analyzing the phenotypic effects of modifying the expression of genes in the fatty acid biosynthesis pathway (see PCT Publication No. WO 2002/00904, Example 3). Most importantly, the model system is also predictive of the fatty acid composition of seeds from plants derived from transgenic embryos. Fatty Acid Analysis of Transgenic Somatic Soybean Embryos Expressing pKR973 and pKR1064:
Soybean embryogenic suspension cultures (cv. Jack) were transformed with the Asc\ fragments of pKR973 and pKR1064 (fragments containing the expression cassettes), as described for production in Example 25 and as summarized in Table 26. TABLE 26 Summary of Vectors Expressed in Soybean
Figure imgf000180_0001
A subset of soybean embryos generated from each event (ten embryos per event) were harvested and picked into glass GC vials and fatty acid methyl esters were prepared by transesterification. For transesterification, 50 μl_ of trimethylsulfonium hydroxide (TMSH) and 0.5 ml_ of hexane were added to the embryos in glass vials and incubated for 30 min at room temperature while shaking. Fatty acid methyl esters (5 μl_ injected from hexane layer) were separated and quantified using a Hewlett-Packard 6890 Gas Chromatograph fitted with an Omegawax 320 fused silica capillary column (Cat. No. 24152, Supelco Inc.). The oven temperature was programmed to hold at 220 0C for 2.6 min, increase to 240 0C at 20 °C/min and then hold for an additional 2.4 min. Carrier gas was supplied by a Whatman hydrogen generator. Retention times were compared to those for methyl esters of standards commercially available (Nu-Chek Prep, Inc.). Events having good phenotype were re-analyzed by GC using identical conditions except the oven temperature held at 150 0C for 1 min and then increased to 240 0C at 5 0C. The fatty acid profiles for individual embryos from a representative event are shown in FIG. 23. Fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), 18:2(LA), GLA, 18:3 (ALA), EDA, DGLA, ARA, ERA, JUN, EPA, 22:3(10,13,16) (docosatrienoic acid), DTA, DPA and DHA; and, fatty acid compositions listed in FIG. 23 are expressed as a weight percent (wt. %) of total fatty acids. The activity of EgDHAsyni is expressed as percent C20 elongation (% C20
Elong) and/or percent delta-4 desaturation (% D4 Desat), calculated according to the following formula: ([product]/[substrate + product])*100. More specifically, the percent elongation for EPA is shown as % C20 Elong, determined as: ([DPA + DHA]/[EPA + DPA + DHA])*100. The percent delta-4 desaturation for DPA is shown as % D4 Desat, determined as ([DHA]/[DPA + DHA])*100. Other fatty acids that may be elongated or desaturated were not included in this calculation.
In addition to elongation and desaturation products for EPA and ARA, it appears that in soybean, DGLA is also elongated by the EgDHAsyni as a significant amount of the fatty acid 22:3(10,13,16) was made. The fatty acid was identified as 22:3(10,13,16) because it was found to have a mass for 22:3 by GC- MS and had an MS profile that agrees with that for 22:3(10,13,16).
EXAMPLE 31
Expression of the Euplena gracilis Delta-9 Elongase/ Paylova lutheri Delta-8 Desaturase Fusion (EqD9elo-EαDHAsvn1 Link-PavD8) in
Soybean Embryos Transformed With Soybean Expression Vectors KS373 Soybean embryogenic suspension culture (cv. Jack) was transformed with
KS373 (SEQ ID NO:179; FIG. 17) and KS120 (which is described in PCT Publication No. WO 2004/071467 and the contents of which are hereby incorporated by reference) as described for the model system in Example 25. KS120 contains the hygromycin selection. KS373, produced in Example 23, enabled expression of a fusion protein comprising the Euglena gracilis delta-9 elongase and the Pavlova lutheri delta-8 desaturase, wherein the two domains were linked with Euglena gracilis DHA Synthase 1 Linker (i.e., EgDHAsyni Link).
The fatty acid profiles for five individual embryos from 31 events were obtained as described in Example 30. Results from the five best elongation events are shown in FIG. 24. Fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), 18:2 (LA), GLA, 18:3 (ALA), EDA, DGLA, ERA and ETA; and, fatty acid compositions listed in FIG. 24 are expressed as a weight percent (wt. %) of total fatty acids.
The activity of EgD9elo-EgDHAsyn1 Link-PavD8 is expressed as percent delta-9 elongation (% D9 Elong) and/or percent delta-8 desaturation (% D8 Desat), calculated according to the following formula: ([product]/[substrate + product])*100.
More specifically, the percent delta-9 elongation is shown as % D9 Elong, determined as: ([EDA + ERA + DGLA + ETA]/[LA + ALA + EDA + ERA + DGLA + ETA])*100. The percent delta-8 desaturation is shown as % D8 Desat, determined as ([DGLA + ETA]/[EDA + ERA + DGLA + ETA])*100.
The best % D9 Elong event had an average elongation of 22.1 % with an average % D8 Desat of 92.7%. Elongation is slightly lower than that seen when the delta-9 elongase is expressed alone in soybean embryos although this might be due to the small numbers of events looked at. In contrast, desaturation is considerably higher when the PavDδ is fused with the EgD9elo and EgDHAsynHink than when the PavDδ is expressed alone in soybean embryos, reaching almost 100% conversion in some events. This enhanced conversion by the delta-8 desaturase might be due to increased efficiency or flux, perhaps due to substrate channeling.
EXAMPLE 32 Synthesis And Functional Analysis of a Codon-Optimized C20 Elongase Gene
(EgC20ES), From Eualena gracilis in Yarrowia lipolytics The codon usage of the C20 elongase domain of EgDHAsyni (EgDHAsyn1C20EloDom1) of Euglena gracilis (i.e., corresponding to amino acids 1- 303 of SEQ ID NO: 12) was optimized for expression in Yarrowia lipolytica, in a manner similar to that described in PCT Publication No. WO 2004/101753 and U.S. Patent 7,125,672. Specifically, a codon-optimized C20 elongase gene (designated "EgC20ES" and having the nucleotide sequence as set forth in SEQ ID NO: 183 and the amino acid sequence as set forth in SEQ ID NO:184) was designed, based on the coding sequence of the C20 elongase domain of EgDHAsyni (SEQ ID NO:201), according to the Yarrowia codon usage pattern (PCT Publication No. WO 2004/101753), the consensus sequence around the 'ATG' translation initiation codon, and the general rules of RNA stability (Guhaniyogi, G. and J. Brewer, Gene, 265(1-2):11-23 (2001)). In addition to the modification of the translation initiation site, 163 bp of the 909 bp coding region were modified (17.9%) and 147 codons were optimized (48.5%). None of the modifications in the codon-optimized gene changed the amino acid sequence of the encoded protein (i.e., SEQ ID NO: 184 is 100% identical in sequence to amino acids 1-303 of SEQ ID NO: 12). The designed EgC20ES gene (SEQ ID NO: 183) was synthesized by GenScript Corporation (Piscataway, NJ) and cloned into pUC57 (GenBank Accession No. Y14837) to generate pEgC20ES (FIG. 51 B; SEQ ID NO:185). To analyze the function of the codon-optimized EgC20ES gene, plasmid pZuFmEgC20ES (FIG. 52A; SEQ ID NO:360) comprising a chimeric FBAINm::EgC20ES::Pex20 gene was constructed. Plasmid pZuFmEgC20ES contained the following components:
TABLE 27 Components Of Plasmid pZuFmEqC20ES (SEQ ID NO:360)
Figure imgf000183_0001
Plasmid pZuFmEgC20ES was transformed into Yarrowia lipolytica strain Y4184U4, as described in the General Methods. The transformants were selected on MM plates. After 2 days growth at 30 °C, 10 transformants grown on the MM plates were picked and re-streaked onto fresh MM plates. Once grown, 10 strains were individually inoculated into 3 ml_ liquid MM at 30 C and shaken at 250 rpm/min for 2 days. The cells were collected by centrifugation; lipids were extracted; and fatty acid methyl esters were prepared by trans-esterification and subsequently analyzed with a Hewlett-Packard 6890 GC.
GC analyses showed that there were about 6.2% EPA and 2.8% DPA of total lipids produced in all ten transformants, wherein the conversion efficiency of EPA to DPA in these 10 strains was determined to be about 31% (calculated as described in Example 27). Thus, this experimental data demonstrated that the synthetic Euglena gracilis C20 elongase codon-optimized for expression in Yarrowia lipolytica (i.e., EgC20ES, as set forth in SEQ ID NO: 183) actively converts EPA to DPA.
EXAMPLE 33
Synthesis And Functional Analysis of a Codon-Optimized C20 Elongase Gene (EaC20ES) From Euαlena anabaena in Yarrowia lipolytica
The codon usage of the C20 elongase domain of EaDHAsyn2 (SEQ ID NO:228) of Euglena anabaena (i.e., corresponding to amino acids 1-299 of SEQ ID NO:96) was optimized for expression in Yarrowia lipolytica, in a manner similar to that described in PCT Publication No. WO 2004/101753, U.S. Patent 7,125,672 and above in Example 32. Specifically, a codon-optimized C20 elongase gene
(designated "EaC20ES" and having the nucleotide sequence as set forth in SEQ ID NO: 188 and the amino acid sequence as set forth in SEQ ID NO: 189) was designed, based on the coding sequence of the C20 elongase domain of EaDHAsyn2 (SEQ ID NO:92), according to the Yarrowia codon usage pattern (PCT Publication No. WO 2004/101753), the consensus sequence around the 'ATG' translation initiation codon, and the general rules of RNA stability (Guhaniyogi, G. and J. Brewer, Gene, 265(1-2):11-23 (2001)). In addition to the modification of the translation initiation site, 143 bp of the 897 bp coding region were modified (15.9%) and 134 codons were optimized (44.8%). None of the modifications in the codon- optimized gene changed the amino acid sequence of the encoded protein (i.e., SEQ ID NO:189 is 100% identical in sequence to amino acids 1-299 of SEQ ID NO:96). The designed EaC20ES gene (SEQ ID NO:188) was synthesized by GenScript Corporation (Piscataway, NJ) and was cloned into pUC57 (GenBank Accession No. Y14837) to generate pEaC20ES (SEQ ID NO:190). To analyze the function of the codon-optimized EaC20ES gene, plasmid pZuFmEaC20ES (SEQ ID NO:361) was constructed comprising a chimeric FBAINm::EaC20ES::Pex20 gene. Plasmid pZuFmEaC20ES (SEQ ID NO: 361) was identical in construction to that of plasmid pZuFmEgC20ES (SEQ ID NO:360; FIG. 52A), with the exception that EaC20ES (SEQ ID NO: 188) was used in place of EgC20ES (SEQ ID NO:183).
Plasmid pZuFmEaC20ES (SEQ ID NO:361) was transformed into Yarrowia lipolytica strain Y4184U4, as described in the General Methods. The transformants were selected on MM plates. After 2 days growth at 30 °C, 20 transformants grown on the MM plates were picked and re-streaked onto fresh MM plates. Once grown, 20 strains were individually inoculated into 3 ml_ liquid MM at 30 °C and shaken at 250 rpm/min for 2 days. The cells were collected by centrifugation; lipids were extracted; and fatty acid methyl esters were prepared by trans-esterification and subsequently analyzed with a Hewlett-Packard 6890 GC.
GC analyses showed that there were about 7.4% EPA and 1% DPA of total lipids produced in all 20 transformants, wherein the conversion efficiency of EPA to DPA in these 20 strains was determined to be about 11% (calculated as described in Example 27). Thus, this experimental data demonstrated that the synthetic Euglena anabaena C20 elongase codon-optimized for expression in Yarrowia lipolytica (i.e., EaC20ES, as set forth in SEQ ID NO:188) actively converts EPA to DPA.
EXAMPLE 34 Synthesis And Functional Analysis of a Codon-Optimized Delta-4 Desaturase Gene (EaD4S) From Euglena anabaena in Yarrowia lipolytica
The codon usage of the delta-4 desaturase domain of EaDHAsyn2 (SEQ ID NO:243) of Euglena anabaena (i.e., corresponding to amino acids 259-841 of SEQ ID NO:96) was optimized for expression in Yarrowia lipolytica, in a manner similar to that described in PCT Publication No. WO 2004/101753, U.S. Patent 7,125,672 and above in Examples 32 and 33. Specifically, a codon-optimized delta-4 desaturase gene (designated "EaD4S" and having the nucleotide sequence as set forth in SEQ ID NO:192 and the amino acid sequence as set forth in SEQ ID NO: 193) was designed, based on the coding sequence of the delta-4 desaturase domain of EaDHAsyn2 (SEQ ID NO:92), which is also provided as SEQ ID NO:194 (nucleotide) and SEQ ID NO: 195 (amino acid). In addition to the modification of the translation initiation site, 307 bp of the 1752 (including the TAA stop codon) bp coding region were modified (17.5%) and 285 codons were optimized (48.8%). Additionally, a Ncol site was introduced around the translation start codon by changing the second amino acid of the wild type delta-4 desaturase domain (i.e., amino acid residue 260 of SEQ ID NO:96 or amino acid residue 2 of SEQ ID
NO: 195) from a leucine to a valine residue in the synthetic EaD4S gene; thus, the amino acid sequence of EaD4S is set forth in SEQ ID NO: 193. The designed EaD4S gene (SEQ ID NO: 192) was synthesized by GenScript Corporation (Piscataway, NJ) and was cloned into pUC57 (GenBank Accession No. Y14837) to generate pEaD4S (SEQ ID NO: 196).
To analyze the function of the codon-optimized EaD4S gene, plasmid pZKL4- 220EA4 (FIG. 52B; SEQ ID NO:362) was constructed to integrate two chimeric C20 elongase genes and the chimeric EaD4S gene into the lipase 4 like locus (GenBank Accession No. XM_503825) of Yarrowia lipolytica strain Y4184U4. Plasmid pZKL4- 220EA4 contained the following components:
TABLE 28 Components Of Plasmid PZKL4-220EA4 (SEQ ID NO:362)
Figure imgf000186_0001
Figure imgf000187_0001
Plasmid pZKL4-220EA4 was digested with Asc\/Sph\, and then transformed into Yarrowia lipolytica strain Y4184U4, as described in the General Methods. The transformants were selected on MM plates. After 5 days growth at 30 C, 8 transformants grown on the MM plates were picked and re-streaked onto fresh MM plates. Once grown, these strains were individually inoculated into 3 ml_ liquid MM at 30 °C and shaken at 250 rpm/min for 2 days. The cells were collected by centrifugation, resuspended in HGM and then shaken at 250 rpm/min for 5 days. The cells were collected by centrifugation, lipids were extracted, and fatty acid methyl esters were prepared by trans-esterification, and subsequently analyzed with a Hewlett-Packard 6890 GC.
GC analyses showed that there were an average of 0.4% DHA and 10.2% DPA of total lipids produced in all 8 transformants, wherein the conversion efficiency of DPA to DHA in these 8 strains was determined to be about 4.2% (calculated as described in Example 27). Thus, this experimental data demonstrated that the synthetic Euglena anabaena delta-4 desaturase codon-optimized for expression in Yarrowia lipolytica (i.e., EaD4S, as set forth in SEQ ID NO: 192) is active, but functions with relatively low conversion efficiency.
EXAMPLE 35 Co-expression of the Euplena gracilis DHA Synthase 1 C20 Elongase Domain
(EqDHAsvn1C20EloDom1) with the Schizochvtrium aααreαatum Delta-4 Desaturase (SaD4) in Soybean Embryos Transformed with Soybean Expression Vector pKR1105
The following example describes the generation of transgenic soybean events expressing EgDHAsyn1C20EloDom1 and SaD4 that, when generated into plants, could be crossed with EPA-producing soybean events to generate DHA- producing plants.
Soybean embryogenic suspension culture (cv. Jack) was transformed with the /Ascl fragment of pKR1105 (SEQ ID NO:156; FIG. 16A; fragment containing the expression cassette) and embryos were matured as described for production in Example 25 but with the following change. After maturation on SB103 for 10-12 days, a single cluster of embryos for each event was removed to 4 mL of SB148 liquid media (recipe below) containing 0.02% tergitol and 0.33 mM EPA in a six-well micro-titer plate.
SB 103 Solid Medium (per liter)
1 package MS salts (Gibco/ BRL - Cat. No. 11117-066) 1 mL B5 vitamins 1000X stock 60 g maltose pH 5.7
Clusters were carefully broken up to release individual embryos and micro- titer plates were shaken on a rotary shaker at 150 rpm and 26 0C under cool white fluorescent lights on a 16:8 hr day/night photoperiod at a light intensity of 60-85 μE/m2/s for 48 hrs. After 48 hrs, embryos were rinsed with water, dried and five embryos per event were picked into glass GC vials. Fatty acid methyl esters were prepared by transesterification with TMSH and were quantified using a Hewlett- Packard 6890 Gas Chromatograph fitted with an Omegawax 320 fused silica capillary column (Cat. No. 24152, Supelco Inc.) as described in Example 30. The oven temperature was programmed to hold at 150 CC for 1 min and then was increased to 240 0C at 5 0C. Retention times were compared to those for methyl esters of standards commercially available (Nu-Chek Prep, Inc.).
In this way, 122 events transformed with pKR1105 were analyzed. From the 122 events analyzed, 49 were identified that elongated EPA (C20/delta-5 elongase activity) and of these, 41 were identified that desaturated DPA (delta-4 desaturase activity) to produce DHA. The events with the best C20/delta-5 elongase and delta- 4 desaturase activities were advanced and the fatty acid profiles from feeding embryos with EPA are shown in FIG. 26.
Fatty acids in FIG. 26 are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA, ALA, EPA, 22:0 (docosanoic acid), DPA, 24:0 (tetracosanoic acid), DHA and 24:1 (nevonic acid); and, fatty acid compositions listed in FIG. 26 are expressed as a weight percent (wt. %) of total fatty acids. The activity of the EgDHAsyn1C20EloDom1 is expressed as percent C20/delta-5 elongation (% C20/delta-5 elong), calculated according to the following formula: ([product]/[substrate + product])*100. More specifically, the combined percent elongation for EPA is shown as "% C20/delta-5 elong", determined as: ([DPA + DHA]/[EPA + DPA + DHA])*100.
The activity of the SaD4 is expressed as percent delta-4 desaturation (% delta-4 desat), calculated according to the following formula: ([product]/[substrate + product])*100. More specifically, the combined percent desaturation for DPA is shown as "% delta-4 desat", determined as: (DHA/[DPA + DHA])*100.
EXAMPLE 36
Identification of a Delta-9 Elongase From Euplena anabaena UTEX 373 The present example describes the identification of delta-9 elongases from a
Euglena anabaena UTEX 373 cDNA library. This work is also described in U.S. Provisional Application No. 60/911 ,925 (filed April 16, 2007; Attorney Docket No. BB- 1613; the contents of which are hereby incorporated by reference). Growth of Euαlena anabaena UTEX 373 and preparation of RNA Amplified cDNA library eugic was plated and colonies lifted as described in
Example 13. A DNA probe was made using an agarose gel purified Ncol/Notl DNA fragment containing the Euglena gracilis delta-9 elongase gene, from pKR906 (SEQ ID NO: 115; Example 16 and WO 2007/061845, which published May 31 , 2007; Attorney Docket No. BB-1562; the contents of which are hereby incorporated by reference) labeled with P32 dCTP using the RadPrime DNA Labeling System (Cat. No. 18428-011 , Invitrogen, Carlsbad, CA) following the manufacturer's instructions.
Colony lifts were probed and positives were identified and confirmed as described in Example 13. Plasmid DNA was isolated and sequenced exactly as described in Example 2 and sequences were aligned and compared using Sequencher™ (Version 4.2, Gene Codes Corporation, Ann Arbor, Ml). In this way, the clones could be categorized into one of two distinct groups based on insert sequence (designated EaD9Elo1 and EaD9Elo2). Representative clones containing the cDNA for each class of sequence were chosen for further study, and the sequences for each representative plasmid (pLF121-1 and pLF121-2) are shown in SEQ ID NO:250 and SEQ ID NO:251 , respectively. The sequence shown by a string of NNNN's represents a region of the polyA tail which was not sequenced. The coding sequences for EaD9Elo1 and EaD9Elo2 are shown in SEQ ID NO:252 and SEQ ID NO:253, respectively. The corresponding amino acid sequences for EaD9Elo1 and EaD9Elo2 are shown in SEQ ID NO:254 and SEQ ID NO:255, respectively.
EXAMPLE 37
Identification of a Delta-5 Desaturase From Eualena anabaena UTEX 373 The present Example describes the identification of a delta-5 desaturase from
Euglena anabaena UTEX 373. This work is also described in U.S. Provisional Application No. 60/915733 (filed May 3, 2007; Attorney Docket No. BB-1614; the contents of which are hereby incorporated by reference).
Amplified cDNA library eugic was plated and colonies lifted as described in Example 13. A DNA probe was made using an agarose gel purified Ncol/Notl DNA fragment containing the Euglena gracilis delta-5 desaturase gene (EgD5; SEQ ID NO:267) from pDMW367, previously described in PCT Publication No. WO 2007/136877 (published November 29, 2007; Attorney Docket No. BB1629; the contents of which are hereby incorporated by reference), labeled with P32. Colony lifts were probed and positives were identified and confirmed as described in Example 13. Plasmid DNA was isolated and sequenced exactly as described in Example 2, and sequences were aligned and compared using
Sequencher™ (Version 4.2, Gene Codes Corporation, Ann Arbor, Ml).
A representative clone containing a cDNA (pLF119) is shown in SEQ ID NO:256 and the gene contained within the cDNA was called EaD5Des1. The coding sequence for EaD5Des1 is shown in SEQ ID NO:257. The corresponding amino acid sequence for EaD5Des1 is shown in SEQ ID NO:258.
EXAMPLE 38
Construction of Soybean Expression Vector pKR1183 for Expression of a Euαlena anabaena delta-9 e\onQase-Tetruetreotia pomαuetensis CCMP1491 Delta-8
Desaturase Fusion Gene (Hybrid 1 -HGLA Synthase) An in-frame fusion between the Euglena anabaena delta-9 elongase (EaD9Elo1 ; SEQ ID NO:252; Example 36), the Euglena gracilis DHA synthase 1 proline-rich linker (EgDHAsyni Link; SEQ ID NO: 197; Example 6) and the Tetruetreptia pomquetensis CCMP1491 delta-8 desaturase (TpomDδ; SEQ ID
NO:162; Example 21 ; see also Applicants' Assignee's co-pending application having U.S. Patent Application No. 11/876115 (filed October 22, 2007; Attorney Docket No. BB-1574)) was constructed using the conditions described below. An initial in-frame fusion between the EaD9Elo1 and the EgDHAsyni Link (EaD9elo-EgDHAsyn1 Link) was made, flanked by an Λ/col site at the 5'end and a Λ/ofl site at the 3' end, by PCR amplification. EaD9Elo1 (SEQ ID NO:252) was amplified from pLF121-1 (SEQ ID NO:250) with oligonucleotides EaD9-5Bbs (SEQ ID NO:259) and EaD9-3fusion (SEQ ID NO:260), using the Phusion™ High-Fidelity DNA Polymerase (Cat. No. F553S, Finnzymes Oy, Finland) following the manufacturer's protocol. EgDHAsyni Link (SEQ ID NO:197) was amplified in a similar way from pKR1049 (Example 4) with oligonucleotides EgDHAsyni Link- δfusion (SEQ ID NO:261) and MWG511 (SEQ ID NO: 175). The two resulting PCR products were combined and re-amplified using EaD9-5Bbs (SEQ ID NO:259) and MWG511 (SEQ ID NO:175) to form EaD9Elo1-EgDHAsyn1 Link. The sequence of the EaD9Elo1-EgDHAsyn1 Link is shown in SEQ ID NO:262. EaD9Elo1- EgDHAsyni Link does not contain an in-frame stop codon upstream of the Noti site at the 3' end and therefore, a DNA fragment cloned into the Not\ site can give rise to an in-frame fusion with the EgD9elo1-EgDHAsyn1 Link if the correct frame is chosen. EaD9Elo1-EgDHAsyn1 Link was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pLF124 (SEQ ID NO:263).
The BbsUNoti DNA fragment of pLF124 (SEQ ID NO:263), containing EaD9Elo1-EgDHAsyn1 Link, was cloned into the Nco\/Not\ DNA fragment from
KS366 (SEQ ID NO:177; Example 23), containing the promoter for the α' subunit of β-conglycinin, to produce pKR1177 (SEQ ID NO:264).
The BamHI DNA fragment of pKR1177 (SEQ ID NO:264), containing EaD9Elo1-EgDHAsyn1 Link, was cloned into the BamHI DNA fragment of pKR325, previously described in PCT Publication No. WO 2006/012325 (the contents of which are hereby incorporated by reference) to produce pKR1179 (SEQ ID NO:265).
The Not\ fragment from pLF114-10 (Example 21 ; SEQ ID NO:165), containing TpomDδ was cloned into the Not\ fragment of pKR1179 (SEQ ID NO:265) to produce pKR1183 (SEQ ID NO:266; FIG. 28). In Fig. 28, the fusion gene (Hybrid 1 -HGLA synthase) is called EAd9ELONG-TPOMd8DS. EXAMPLE 39 Construction of Soybean Expression Vector pKR1253 for Expression of a Eualena anabaena delta-9 elonqase- Tetruetreptia pomguetensis CCMP1491 Delta-8 Desaturase Fusion Gene (Hybrid 1 -HGLA Synthase) with a Euαlena gracilis delta-5 desaturase
Through a number of subcloning steps, a Λ/ofl site was added to the 5' end of the Euglena gracilis delta-5 desaturase (EgD5; SEQ ID NO:267) from pDMW367 and this Λ/ofl fragment containing EgD5 was cloned into the Λ/ofl site of pKR457 (SEQ ID NO:122; Example 16) to produce pKR1237 (SEQ ID NO:268). The Asc\ fragment of pKR1183 (SEQ ID NO:266; Example 38), containing the Hybrid1-HGLA synthase, was cloned into the Asc\ fragment of pKR277 (SEQ ID NO: 120, which was previously described in PCT Publication No. WO 2004/071467 and published 8/26/2004; Attorney Docket No. BB-1538 (the contents of which are hereby incorporated by reference) to produce pKR1252 (SEQ ID NO:269). The Ss/WI fragment of pKR1237 (SEQ ID NO:268), containing the EgD5 gene, was cloned into the Ss/WI site of pKR1252 (SEQ ID NO:269) to produce PKR1253 (SEQ ID NO:270; FIG. 30).
EXAMPLE 40
Construction of Soybean Vector pKR1139 for Expression of a Euαlena anabaena delta-5 desaturase
The present example describes the cloning of a delta-5 desaturase from Euglena anabaena UTEX 373 into a soybean expression vector. This work is also described in U.S. Provisional Application No. 60/915733 (filed May 3, 2007; Attorney Docket No. BB-1614 (the contents of which are hereby incorporated by reference)).
EaD5Des1 (SEQ ID NO:257) was amplified from_pLF119 (SEQ ID NO:256, Example 37) with oEAd5-1-1 (SEQ ID NO:271) and oEAd5-1-2 (SEQ ID NO:272), using the Phusion™ High-Fidelity DNA Polymerase (Cat. No. F553S, Finnzymes Oy, Finland) following the manufacturer's protocol. The resulting PCR product was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pKR1136 (SEQ ID NO:273). The Notl fragment for pKR1136 (SEQ ID NO:273) containing the EaD5Des1 was cloned into the Λ/ofl fragment of pKR974, previously described in PCT Publication No. WO 2007/136877, published November 29, 2007 (Attorney Docket No. BB-1629 (the contents of which are hereby incorporated by reference)), to produce pKR1139 (SEQ ID NO:274).
EXAMPLE 41 Construction of Soybean Expression Vector pKR1255 for Expression of a Euplena anabaena delta-9 elonqase- Tetruetreptia pomquetensis CCMP1491 Delta-8 Desaturase Fusion Gene (Hybrid 1 -HGLA Synthase) with a Euplena gracilis delta-5 desaturase and a Euplena anabaena delta-5 desaturase
Plasmid pKR1139 (SEQ ID NO:274; Example 40) was digested with SbΑ and the fragment containing the EaD5Des1 was cloned into the Sbf\ site of pKR1253 (SEQ ID NO:270; Example 39) to produce pKR1255 (SEQ ID NO:275; FIG. 31).
EXAMPLE 42 Construction of Soybean Expression Vector pKR1189 For Down-Regulating
Expression of Soybean Fad3
The present example describes a soybean expression vector designed to decrease fad3 expression in soybean.
A starting vector pKR561 (SEQ ID NO:276) was assembled by inserting the BsNSIl fragment of pKR268 (previously described in PCT Publication No. WO
04/071467) containing the annexin promoter into the Bs/WI site of pKR145, which is described in PCT Publication No. WO 04/071467.
Plasmid XF1 , described in PCT Publication No. WO 93/11245 (which was published on June 10, 1993; also U.S. Patent No. 5,952,544; the contents of which are hereby incorporated by reference), contains the soybean delta-15 desaturase (fad3) gene (SEQ ID NO:277; GenBank Accession No. L22964; also called GmFAD3B).
A portion of the 5' end of the fad3 gene was amplified from XF1 with the Phusion™ High-Fidelity DNA Polymerase (Cat. No. F553S, Finnzymes Oy, Finland) following the manufacturer's protocol, using HPfad3-1 (SEQ ID NO:278) and
HPfad3-2 (SEQ ID NO:279) to produce a DNA fragment called HPfad3AB (SEQ ID NO:280). A portion of the 3' end of the fad3 gene was amplified from XF1 with the Phusion™ High-Fidelity DNA Polymerase, using HPfad3-3 (SEQ ID NO:281) and HPfad3-1 (SEQ ID NO:278) to produce a DNA fragment called HPfad3A'-2 (SEQ ID NO:282). HPfad3AB and HPfad3A'-2 were combined and amplified using the
Phusion™ High-Fidelity DNA Polymerase with HPfad3-1 (SEQ ID NO:278) to produce HPfad3ABA'-2 (SEQ ID NO:283). HPfad3ABA'-2 (SEQ ID NO:283) has a Λ/ofl site at both the 5' and 3' end of the DNA fragment. The resulting PCR product was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pLF129 (SEQ ID NO:284).
The Λ/ofl fragment for pLF129 (SEQ ID NO:284) containing the fad3 hairpin was cloned into the Λ/ofl fragment of pKR561 (SEQ ID NO:276) to produce pKR1189 (SEQ ID NO:285; FIG. 32). In FIG. 32, the A and A' domains for fad3 are indicated by the designation TR1 while the B domain is indicated by TR2.
EXAMPLE 43 Construction of Soybean Expression Vector pKR1249 For Down-Regulating
Soybean Fad3 and Soybean Fad3c
The Nott/HindlU fragment of pLF129 (SEQ ID NO:284) containing the TR1 and TR2 domains of fad3, as indicated in FIG. 32, was cloned into the Not\/Hind\\\ backbone fragment of pLF129 (SEQ ID NO:284) to produce pKR1209 (SEQ ID NO:286).
The coding sequence of GmFad3C (GenBank Accession No. AY204712) (Bilyeu et al., Crop Sci. 43:1833-1838 (2003); Anai et al., Plant Sci. 168:1615-1623 (2005)) is shown in SEQ ID NO:287 and the corresponding amino acid sequence is shown in SEQ ID NO:288. A portion of the fad3c gene was amplified from the soybean cDNA library described in PCT Publication No. WO 93/11245 (which was published on June 10, 1993; also U.S. Patent No. 5,952,544) (the contents of which are hereby incorporated by reference) with the Phusion™ High-Fidelity DNA Polymerase (Cat. No. F553S, Finnzymes Oy, Finland) following the manufacturer's protocol, using fad3c-5 (SEQ ID NO:289) and fad3c-3 (SEQ ID NO:290). The resulting DNA fragment was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pKR1213 (SEQ ID NO:291).
The EcoRV/XΛol fragment of pKR1213 (SEQ ID NO:291) containing the fragment of fad3c was cloned into the Λ/ofl(filled)/X/7θl site of pKR1209 (SEQ ID NO:286) to produce pKR1218 (SEQ ID NO:292).
The Not\/Hind\\\ fragment of pLF129 (SEQ ID NO:284) containing the TR1 domain only from fad3, as indicated in FIG. 32, was cloned into the Not\/Hind\\\ backbone fragment of pLF129 (SEQ ID NO:284) to produce pKR1210 (SEQ ID NO:293). The EcoRV/XΛol fragment of pKR1213 (SEQ ID NO:291) containing the fragment of fad3c was cloned into the Λ/ofl(filled)/X/?ol site of pKR1210 (SEQ ID NO:293) to produce pKR1219 (SEQ ID NO:294).
The X/jol(filled)/H//7crtll fragment of pKR1218 (SEQ ID NO:292) containing the fragment of fad3c as well as fad3 TR1 and TR2 domains was cloned into the /W/L/I(filled)//-///7C/1 Il site of pKR1219 (SEQ ID NO:294), containing the fragment of fad3c as well as the fad3 TR1 only domain, to produce pKR1225 (SEQ ID NO:295). In this way, a new hairpin including fad3 and fad3c and flanked by Not\ sites was formed.
The Λ/ofl fragment for pKR1225 (SEQ ID NO:295) containing the new hairpin including fad3 and fad3c was cloned into the Λ/ofl fragment of pKR561 (SEQ ID
NO:276; Example 42) to produce pKR1229 (SEQ ID NO:296; FIG. 33). In this way, the fad3/fad3c hairpin can be expressed from a strong, seed-specific promoter with hygromycin selection in plants.
The Bs/WI fragment for pKR1225 (SEQ ID NO:295) containing the new hairpin including fad3 and fad3c was cloned into the Ss/WI fragment of pKR226 (SEQ ID NO:130; Example 17) to produce pKR1249 (SEQ ID NO:297; FIG. 34). In FIG. 34, pKR1249 is labeled pKR1249_PHP33240. In this way, the fad3/fad3c hairpin can be expressed from a strong, seed-specific promoter with chlorsulfuron (ALS) selection in plants. EXAMPLE 44
Construction of Soybean Expression Vector pKR1322 for Expression of a Eualena anabaena delta-9 elongase- Tetruetreptia pomαuetensis CCMP1491 Delta-8 Desaturase-Et/g/eπa anabaena delta-5 Desaturase Fusion Gene (EaD9Elo1- TpomD8-EaD5Des1 fusion)
The present example describes the construction of an in-frame fusion gene between the Euglena anabaena delta-9 elongase (EaD9Elo1; SEQ ID NO:252, Example 36), the Tetruetreptia pomquetensis CCMP1491 delta-8 Desaturase (TpomD8; SEQ ID NO: 162; Example 21) and the Euglena anabaena delta-5 desaturase (EaD5Des1 ; SEQ ID NO:257; Example 37). Each domain is separated by the EgDHAsyni linker (EgDHAsyni Link; SEQ ID NO:197; Example 6).
The EaD9Elo1-EgDHAsyn1 Link (SEQ ID NO:262; Example 38) was amplified from pLF124 (SEQ ID NO:263) with oligonucleotides oEAd9el1-1 (SEQ ID NO:298) and oLINK-1 (SEQ ID NO:299), using the Phusion™ High-Fidelity DNA Polymerase (Cat. No. F553S, Finnzymes Oy, Finland) following the manufacturer's protocol. EaD9Elo1-EgDHAsyn1 Link is flanked by Not\ at the 5' end and Eag\ at the 5' and 3' ends and does not contain an in-frame stop codon upstream of the Eag\ site at the 3' end. Therefore, a DNA fragment cloned into the Eag\ site can give rise to an in-frame fusion with the EgD9elo-EgDHAsyn1 Link if the correct frame is chosen. The resulting DNA fragment containing EaD9Elo1-EgDHAsyn1 Link was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pKR1298 (SEQ ID NO:300).
An in-frame fusion between the TpomDδ and the EgDHAsyni Link (TpomDδ- EgDHAsyni Link) was made which contained a Noti site at the 5'end and Eag\ sites at the 5' and 3' ends, by PCR amplification. TpomD8 (SEQ ID NO:162) was amplified from pLF114-10 (SEQ ID NO:165) with oligonucleotides oTPd8-1 (SEQ ID NO:301) and oTPd8fus-1 (SEQ ID NO:302), using the Phusion™ High-Fidelity DNA Polymerase (Cat. No. F553S, Finnzymes Oy, Finland) following the manufacturer's protocol. EgDHAsyni Link (SEQ ID NO:197) was amplified in a similar way from pKR1049 (Example 4) with oligonucleotides oLINK-2 (SEQ ID NO:303) and oLINK-1 (SEQ ID NO:304). The two resulting PCR products were combined and re-amplifed using oTPd8-1 (SEQ ID NO:301) and oLINK-1 (SEQ ID NO:304) to form TpomDδ- EgDHAsyni Link (SEQ ID NO:305). TpomD8-EgDHAsyn1 Link was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pKR1291 (SEQ ID NO:306). The Eag\ fragment of pKR1291 , containing the TpomD8-EgDHAsyn1 Link was cloned into the Nott site of pBluescript Il SK(+) vector (Stratagene) to form either pKR1301 (SEQ ID NO:307) in one orientation or pKR1301 R (SEQ ID NO:308) in the opposite orientation.
Plasmid pKR1301 (SEQ ID NO:307) was digested with Mfe\/BamH\, the DNA fragment containing TpomD8-EgDHAsyn1 Link was completely filled in, and the resulting DNA fragment was re-ligated to form pKR1311 (SEQ ID NO:309).
Plasmid pKR1301 R (SEQ ID NO:308) was digested with EcoR\, and the fragment containing the 5' end of TpomD8-EgDHAsyn1 Link (called TPOMD8TR2) and vector backbone was re-ligated to form pKR1304 (SEQ ID NO:310). The Eag\ site of pKR1298 (SEQ ID NO:300) containing the EaD9Elo1-
EgDHAsyni Link was cloned into the Eag\ site of pKR1304 (SEQ ID NO:310) to produce pKR1309 (SEQ ID NO:311).
The Not\ site of pKR1298 (SEQ ID NO:300) containing the EaD9Elo1- EgDHAsyni ϋnk was cloned into the Eag\ site of pKR1304 (SEQ ID NO:310) to produce pKR1309 (SEQ ID NO:311).
The Not\ fragment for pKR1136 (SEQ ID NO:273; Example 40) containing the EaD5Des1 was cloned into the Eag\ site of pKR1311 (SEQ ID NO:309) to produce pKR1313 (SEQ ID NO:312).
The Mfe\/Ecl136\\ fragment of pKR1313 (SEQ ID NO:312) was cloned into the EcofiV/Mfel sites of pKR1309 (SEQ ID NO:311) to produce pKR1315 (SEQ ID NO:313).
The Not\ fragment of pKR1315 (SEQ ID NO:313) was cloned into the Not\ site of pKR72 (SEQ ID NO: 105; Example 15) to produce pKR1322 (SEQ ID NO:314; FIG. 35). In FIG. 35, the EaD9Elo1-TpomD8-EaD5Des1 fusion is labeled as
EAd9el+TPd8ds+EAd5ds fusion. EXAMPLE 45 Down-regulation of the Soybean fad3 and fad3c Genes In Soybean Somatic
Embryos by Transformation with pKR1189 or pKR1229 The present example describes the transformation and expression in soybean somatic embryos of pKR1189 (SEQ ID NO:285, Example 42), containing a fad3 hairpin construct or pKR1229 (SEQ ID NO:296; Example 43), containing a fad3 and fad3c hairpin construct. Both constructs also have the hygromycin phosphotransferase gene for selection on hygromycin.
Soybean embryogenic suspension culture (cv. Jack) was transformed with pKR1189 (SEQ ID NO:285) or pKR1229 (SEQ ID NO:296) and embryos were matured in soybean histodifferentiation and maturation liquid medium (SHaM liquid media; Schmidt et al., Cell Biology and Morphogenesis, 24:393 (2005)) as described in Example 25 and previously described in PCT Publication No. WO 2007/136877, published November 29, 2007 (the contents of which are hereby incorporated by reference).
After maturation in SHaM liquid media, individual embryos were removed from the clusters, dried and screened for alterations in their fatty acid compositions as described in Example 1. In each case, a subset of soybean embryos (i.e., five embryos per event) transformed with either pKR1189 (SEQ ID NO:285) or pKR1229 (SEQ ID NO:296) were harvested and analyzed.
In this way, 41 events transformed with pKR1189 (SEQ ID NO:285; Experiment 2148) or pKR1229 (SEQ ID NO:296; Experiment 2165) were analyzed. The fatty acid profiles for the five events having the lowest average ALA content (average of the 5 embryos analyzed) along with an event (2148-3-8-1) having a fatty acid profile typical of wild type embryos for this experiment, are shown in FIG. 36. In FIG. 36, fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA, and ALA. Fatty acid compositions are expressed as a weight percent (wt. %) of total fatty acids.
ALA content in somatic embryos expressing either a fad3 hairpin construct (event number 2148, FIG. 36) or a fad3c hairpin construct (event number 2165, FIG. 36) showed at least a 50% reduction when compared to typical wild type embryos (FIG. 36). This strongly indicates that either hairpin construct is functional to decrease ALA content in soybean embryos. EXAMPLE 46
Soybean somatic embryos transformed with pKR1183 for expression of a Eualena anabaena delta-9 elonαase- Tetruetreotia pomαuetensis CCMP1491 Delta-8
Desaturase Fusion Gene (Hybrid 1 -HGLA Synthase) The present example describes the transformation and expression in soybean somatic embryos of pKR1183 (SEQ ID NO:266; Example 38) containing the Euglena anabaena delta-9 e\ongase-Tetruetreptia pomquetensis CCM P 1491 delta- 8 Desaturase Fusion Gene (Hybrid1-HGLA Synthase) and the hygromycin phosphotransferase gene for selection on hygromycin. Soybean embryogenic suspension culture (cv. Jack) was transformed with pKR1183 (SEQ ID NO:266) and embryos were matured in soybean histodifferentiation and maturation liquid medium (SHaM liquid media; Schmidt et al., Cell Biology and Morphogenesis, 24:393 (2005)) as described in Example 25 and previously described in PCT Publication No. WO 2007/136877, published November 29, 2007 (the contents of which are hereby incorporated by reference).
After maturation in SHaM liquid media a subset of soybean embryos (i.e., four embryos per event) transformed with pKR1183 (SEQ ID NO:266) were harvested and analyzed as described herein.
In this way, 20 events transformed with pKR1183 (SEQ ID NO:266; Experiment 2145) were analyzed. The fatty acid profiles for the five events having the highest average DGLA content (average of the 5 embryos analyzed) are shown in FIG. 37. In FIG. 37, fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA, ALA, EDA, ERA, DGLA and ETA. Fatty acid compositions are expressed as a weight percent (wt. %) of total fatty acids. EXAMPLE 47
Soybean Embryos Transformed with Soybean Expression Vectors PKR1253 for
Expression of a Euglena anabaena delta-9 elongase- Tefruefrepf/a pompuetensis
CCMP1491 Delta-8 Desaturase Fusion Gene (Hvbrid1-HGLA Synthase) with a
Euplena gracilis delta-5 desaturase and pKR1249 For Down-Regulating Soybean Fad3 and Soybean Fad3c
Soybean embryogenic suspension culture (cv. Jack) was transformed with the Asc\ fragments of pKR1249 (SEQ ID NO:297; Example 43) and pKR1253 (SEQ ID NO:270; Example 39) as described in Example 25. A subset of soybean embryos generated from each event (ten embryos per event) were harvested, picked into glass GC vials and fatty acid methyl esters (FAMEs) were prepared by transesterification and analyzed by GC as described in Example 1. Retention times were compared to those for methyl esters of standards commercially available (Nu- Chek Prep, Inc.).
In this way, 142 events transformed with pKR1249 (SEQ ID NO:297) and pKR1253 (SEQ ID NO:270) (experiment called Heal 25) were analyzed. From the 142 events analyzed, 90 were identified that produced ARA in at least one embryo out of ten analyzed at a relative abundance greater than 1.0% of the total fatty acids. Of these, 64 were identified that produced ARA in at least one embryo out of ten analyzed at a relative abundance greater than 10.0% of the total fatty acids. Of these, 44 events were identified that produced ARA in at least one embryo out of ten analyzed at a relative abundance greater than 20.0% of the total fatty acids.
The average fatty acid profiles (Average of 10 embryos) for 20 events having the highest ARA are shown in FIG. 38. Fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA, ALA, EDA, SCI, DGLA, ARA, ERA, JUN, ETA and EPA; and fatty acid compositions listed in FIG. 38 are expressed as a weight percent (wt. %) of total fatty acids. For FIG. 38, fatty acids listed as "others" include: 18:2 (5,9), 18:3 (5,9,12), STA, 20:0, 20:1(11), 20:2 (7,11) or 20:2 (8,11) and DPA. Each of these fatty acids is present at a relative abundance of less than 2.0% of the total fatty acids. Average total omega-3 fatty acid (Total n-3) is the sum of the averages of all omega-3 fatty acids).
The actual fatty acid profiles for each embryo from one event (AFS 5416-8-1- 1) having an average ARA content of 17.0% and an average EPA content of 1.5% is shown in FIG. 39. Fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA, ALA, EDA, SCI, DGLA, ARA, ERA, JUN, ETA and EPA; and fatty acid compositions listed in FIG. 39 are expressed as a weight percent (wt. %) of total fatty acids. For FIG. 39, fatty acids listed as "others" include: 18:2 (5,9), 18:3 (5,9,12), STA, 20:0, 20:1 (11), 20:2 (7,11) or 20:2 (8,11) and DPA. Each of these fatty acids is present at a relative abundance of less than 2.0% of the total fatty acids. Total omega-3 fatty acid (Total n-3) is the sum of all omega-3 fatty acids). Because ALA contents is generally 1.5- to 3-fold higher in soybean somatic embryos than it is in seed (i.e., 15%-30% in embryos (see, for example, typical wild type embryo in FIG 36), depending on maturation conditions and time versus 7-10% in a seed (Bilyeu et al., 2005, Crop Sci. 45:1830-1836), it is expected that omega-3 contents in general and EPA contents specifically, will be significantly lower in seed than somatic embryos.
EXAMPLE 48
Soybean Embryos Transformed with Soybean Expression Vectors pKR1255 for Expression of a Eualena anabaena delta-9 elongase- Tetruetreptia pomαuetensis CCMP1491 Delta-8 Desaturase Fusion Gene (Hvbrid1-HGLA Synthase) with a
Euglena gracilis delta-5 desaturase and a Euglena anabaena delta-5 desaturase and PKR1249 For Down-Regulating Soybean Fad3 and Soybean Fad3c Soybean embryogenic suspension culture (cv. Jack) was transformed with the Asc\ fragments of pKR1249 (SEQ ID NO:297; Example 43) and pKR1255 (SEQ ID NO:275; Example 41) as described in Example 25. A subset of soybean embryos generated from each event (ten embryos per event) were harvested, picked into glass GC vials and fatty acid methyl esters (FAMEs) were prepared by transesterification and analyzed by GC as described in Example 1. Retention times were compared to those for methyl esters of standards commercially available (Nu- Chek Prep, Inc.).
In this way, 197 events transformed with pKR1249 (SEQ ID NO:297) and pKR1255 (SEQ ID NO:275) (experiment called Heal 26) were analyzed. From the 197 events analyzed, 128 were identified that produced ARA in at least one embryo out of ten analyzed at a relative abundance greater than 1.0% of the total fatty acids. Of these, 105 were identified that produced ARA in at least one embryo out of ten analyzed at a relative abundance greater than 10.0% of the total fatty acids. And of these, 83 events were identified that produced ARA in at least one embryo out of ten analyzed at a relative abundance greater than 20.0% of the total fatty acids.
The average fatty acid profiles (Average of 9 or 10 embryos) for 20 events having the highest ARA are shown in FIG. 40. Fatty.acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA, ALA, EDA, SCI, DGLA, ARA, ERA, JUN, ETA and EPA; and, fatty acid compositions listed in FIG. 40 are expressed as a weight percent (wt. %) of total fatty acids. For FIG. 40, fatty acids listed as "others" include: 18:2 (5,9), 18:3 (5,9,12), STA, 20:0, 20:1(11), 20:2 (7,11) or 20:2 (8,11) and DPA. Each of these fatty acids is present at a relative abundance of less than 2.0% of the total fatty acids. Average total omega-3 fatty acid (Total n-3) is the sum of the averages of all omega-3 fatty acids). EXAMPLE 49
Expression of the Eualena gracilis DHA Synthase 1 C20 Elonqase
Doma\n/Schizochvtrium aααreαatum Delta-4 Desaturase Fusion
(EqDHAsvn1C20EloDom3-SaD4) Transformed with Soybean Expression Vector
PKR1134 Soybean embryogenic suspension culture (cv. Jack) was transformed with the Asc\ fragment of pKR1134 (SEQ ID NO:161 ; Example 20; fragment containing the expression cassette) and embryos were matured as described for production in Example 25. Substrate feeding of EPA and analysis was carried out as described in Example 35. In this way, 198 events transformed with pKR1134 (Experiment called
Heal24) were analyzed. From the 198 events analyzed, 193 were identified that elongated EPA (C20/delta-5 elongase activity) and all of these desaturated DPA (delta-4 desaturase activity) to produce DHA to some extent. The events with the best C20/delta-5 elongase and delta-4 desaturase activities were advanced and the fatty acid profiles from feeding embryos with EPA are shown in FIG. 41.
Fatty acids in FIG. 41 are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA, ALA, EPA, 22:0 (docosanoic acid), DPA, 24:0 (tetracosanoic acid), DHA and 24:1 (nevonic acid); and fatty acid compositions listed in FIG. 41 are expressed as a weight percent (wt. %) of total fatty acids. The activity of the EgDHAsyn1C20EloDom1 is expressed as percent C20/delta-5 elongation (% C20/delta-5 elong), calculated according to the following formula: ([product]/[substrate + product])*100. More specifically, the combined percent elongation for EPA is shown as "% C20/delta-5 elong", determined as: ([DPA + DHA]/[EPA + DPA + DHA])*100. The activity of the SaD4 is expressed as percent delta-4 desaturation (% delta-4 desat), calculated according to the following formula: ([product]/[substrate + product])*100. More specifically, the combined percent desaturation for DPA is shown as "% delta-4 desat", determined as: (DHA/[DPA + DHA])*100. In addition to the 122 events analyzed for soy transformed with pKR1105 (Experiment called Heal23) as described in Example 35, 20 more events were analyzed since the Example 35 was written bringing the total analyzed to 142 events. From the 20 new events analyzed, 18 were identified that elongated EPA (C20/delta-5 elongase activity) and 17 of these also desaturated DPA (delta-4 desaturase activity) to produce DHA to some extent. The events with the best C20/delta-5 elongase and delta-4 desaturase activities from the 20 new events analyzed for soy transformed with pKR1105 were advanced and the fatty acid profiles from feeding embryos with EPA are shown in FIG. 42. Relative activities of events transformed with either pKR1105 (C20 elongase and delta-4 desaturase expressed individually) or pKR1134 (C20 elongase and delta-4 desaturase expressed as a fusion) are compared by plotting %DHA (wt. %) vs. %DPA (wt. %) for all events where embryos were fed EPA. The results are plotted in FIG. 43. In FIG. 43, Heal23 is the name of the experiment where pKR1105 was transformed and Heal24 is the experiment where pKR1134 was transformed. From FIG. 43, it is clear that overall, DHA concentrations have increased while DPA concentrations have decreased (to as low as undetectable), when the C20 elongase is fused to the delta-4 desaturase. Thus, fusing the 2 independent enzymes together as one fusion protein separated by a linker region increased flux from EPA to DHA.
EXAMPLE 50
Expression of a Euglena anabaena delta-9 elonqase- Tetruetreptia pomαuetensis CCMP1491 Delta-8 Desaturase-Etvg/eπa anabaena delta-5 Desaturase Fusion
Gene (EaD9Elo1-TpomD8-EaD5Des1 fusion) Soybean embryogenic suspension culture (cv. Jack) is transformed with pKR1322 (SEQ ID NO:314) and embryos are matured and analyzed for fatty acid profiles as described herein.
Soybean somatic embryos transformed with pKR1322 (SEQ ID NO:314) will elongate LA to EDA, EDA will be desaturated to DGLA, and DGLA will be further desaturated to ARA. Because wild-type soybean also contains ALA, some ALA will be elongated to ERA, ERA will be desaturated to ETA, and ETA will be further desaturated to EPA. Soybean plants expressing the fusion gene from pKR1322 can be regenerated from embryos as described in Example 26 and seeds can be obtained.
In backgrounds that contain high ALA1 or where a delta-15 desaturase and or delta-17 desaturase has been co-expressed, EPA will predominate. Conversely, ARA can be enriched by using a background low in ALA (for example by crossing to a low ALA or low lin plant) or by knocking out the endogenous fad3 gene(s) as described herein. Intermediate fatty acids (i.e., EDA and DGLA or ERA and ETA) will be lower when the fusion is used than when individual activities are transformed independently (i.e., not a fusion). Other gene combinations (including linker combinations) can be fused together in a similar way as described in Example 24. Similarly, other promoter/gene fusion/terminator combinations can be made.
EXAMPLE 51
Determination Of The Functional Domain In The Synthetic Delta-4 Desaturase Derived From Euglena anabaena And Codon-Optimized For Expression In Yarrowia lipolvtica (EaD4S)
As schematically diagrammed in FIG. 52C, the C-terminal portion of the C20 elongase domain of EaDHAsyni (labeled as "EaC20E" in the figure) appears to overlap with the N-terminal portion of the delta-4 desaturase domain of EaDHAsyni (labeled as "EaD4" in the figure). This is suggested by sequence comparison.
In order to define the functional delta-4 desaturase domain in EaD4S (SEQ ID NO:192; Example 34), three EaD4S* mutants with different N-terminal truncations were generated. Specifically, pZuFmEaD4S (SEQ ID NO:364) was constructed by replacing the NcoUNott fragment of pZuFmlgD9ES (SEQ ID NO:365) with the NcoUNott EaD4S fragment of pEaD4S (SEQ ID NO: 196; Example 34). A Λ/col site was introduced into pZuFmEaD4S (SEQ ID NO:364) by site-directed mutagenesis using primer pairs YL921 and YL922 (SEQ ID NOs:366 and 367, respectively), YL923 and YL924 (SEQ ID NOs:368 and 369, respectively) and YL925 and YL926 (SEQ ID NOs:370 and 371 , respectively) to generate pZuFmEaD4S-M1 (SEQ ID NO:372), pZuFmEaD4S-M2 (SEQ ID NO:373) and pZuFmEaD4S-M3 (SEQ ID NO:374), respectively. The pZuFmEaD4S-M1 , pZuFmEaD4S-M2 and pZuFmEaD4S-M3 plasmids were digested with Λ/col, and then self-ligated to generate pZuFmEaD4S-1 (SEQ ID NO:375), pZuFmEaD4S-2 (SEQ ID NO:376) and pZuFmEaD4S-3 (SEQ ID NO:377) constructs. The NcoVNott fragments containing different truncations of EaD4S from pZuFmEaD4S-1 , pZuFmEaD4S-2, pZuFmEaD4S-3 were used to produce pZKL4-220EA4-1 (SEQ ID NO:378), pZKL4-220EA4-2 (SEQ ID NO:379) and pZKL4-220EA4-3 (SEQ ID NO:380) constructs. These three constructs were exactly the same as pZKL4- 220EA4 (SEQ ID NO:362 and FIG. 52B, as described in Table 28 of Example 34), except that the coding region of EaD4S was truncated at the N-terminal region. Specifically, instead of the 583 amino acid long coding sequence of EaD4S (SEQ ID NO: 193), the truncated EaD4S* polypeptide was 547 amino acids in length in pZKL4-220EA4-1 (i.e., SEQ ID NO:382), 527 amino acids in length in pZKL4-
220EA4-2 (i.e., SEQ ID NO:384) and 512 amino acids in length in pZKL4-220EA4-3 (i.e., SEQ ID NO:386). The N-terminal region of these polypeptides (corresponding to amino acids 1-90 of SEQ ID NO:193) are aligned in FIG. 53A.
Plasmids pZKL4-220EA4 (SEQ ID NO:362), pZKL4-220EA4-1 (SEQ ID NO:378), pZKL4-220EA4-2 (SEQ ID NO:379) and pZKL4-220EA4-3 (SEQ ID
NO:380) were digested with Asc\/Sph\, and then transformed into Yarrowia lipolytica strain Y4184U4, as described in the General Methods. Transformants were selected on MM plates. After 5 days growth at 30 'C, 5 transformants grown on the MM plates from each construct were picked and re-streaked onto fresh MM plates. Once grown, these strains were individually inoculated into 3 ml_ liquid MM at 30 C and shaken at 250 rpm/min for 2 days. The cells were collected by centrifugation, resuspended in HGM and then shaken at 250 rpm/min for 5 days. The cells were collected by centrifugation, lipids were extracted, and fatty acid methyl esters were prepared by trans-esterification, and subsequently analyzed with a Hewlett-Packard 6890 GC.
The GC results are shown below in Table 29. The composition of DPA and DHA are presented as a % of the total fatty acids. The conversion efficiency ("Conv. Effiα") was measured according to the following formula: ([DHA]/[DPA+DHA])*100. The delta-4 desaturase activity of each truncated EaD4S* (i.e., the polypeptides of 547 amino acids, 527 amino acids or 512 amino acids in Yarrowia transformants of PZKL4-220EA4-1 , pZKL4-220EA4-2 and pZKL4-220EA4-3, respectively) was compared to that of EaD4S (i.e., Yarrowia transformants of pZKL4-220EA4) in the column labeled "% Delta-4 Activity". TABLE 29
Functional Analysis Of EaD4S And Truncated Variants In Yarrowia lipolytica Strain Y4184U4
Figure imgf000206_0001
These data demonstrated that the N-terminal 37 amino acids of EaD4S (i.e., amino acids 1-37 of SEQ ID NO: 193) have a negative effect on the activity of the delta-4 desaturase. Elevated delta-4 desaturase activity is measured with respect to EaD4S in each of the EaD4S* truncated proteins, although the EaD4S* polypeptide of 547 amino acids (SEQ ID NO:382) is superior in activity as compared to the EaD4S* polypeptides lacking additional amino acids from their N-terminus (i.e., SEQ ID NOs:384 and 386).
Example 52
Synthesis And Functional Analysis Of A Codon-Optimized Delta-4 Desaturase Gene (EgD4S) From Euplena gracilis In Yarrowia lipolytica The codon usage of the delta-4 desaturase domain of EgDHAsyni (SEQ ID
NO:221) of Euglena gracilis (corresponding to amino acids 253-793 of SEQ ID NO: 12) was optimized for expression in Yarrowia lipolytica, in a manner similar to that described in PCT Publication No. WO 2004/101753, U.S. Patent 7,125,672, and Examples 32, 33, and 34 herein. Specifically, a codon-optimized delta-4 desaturase gene (designated "EgD4S"; SEQ ID NO:387) was designed, based on the coding sequence of the delta-4 desaturase domain of EgDHAsyni , according to the Yarrowia codon usage pattern (PCT Publication No. WO 2004/101753), the consensus sequence around the 'ATG' translation initiation codon, and the general rules of RNA stability (Guhaniyogi, G. and J. Brewer, Gene, 265(1 -2): 11-23 (2001)). In addition to the modification of the translation initiation site, 282 bp of the 1623 bp coding region were modified (17.4%) and 270 codons were optimized (49.9%). The codon-optimized coding region of EgD4S is 1623 bp in length, thereby encoding a polypeptide of 540 amino acids (SEQ ID NO:388). Thus, EgD4S is one amino acid shorter in length than the wildtype delta-4 desaturase domain of EgDHAsyni (i.e., SEQ ID NO:221; specifically, the leucine residue corresponding to amino acid position 2 of SEQ ID NO:13 was removed in EgD4S (SEQ ID NO:388). The designed EgD4S gene (SEQ ID NO:387) was synthesized by GenScript Corporation (Piscataway, NJ) and cloned into pUC57 (GenBank Accession No. Y14837) to generate pEgD4S (SEQ ID NO:389).
To analyze the function of the codon-optimized EgD4S gene, plasmid pZKL4- 220Eg4 (SEQ ID NO:390) was constructed to integrate two chimeric C20 elongase genes and the chimeric EgD4S gene into the lipase 4 like locus (GenBank Accession No. XM_503825) of Yarrowia lipolytica strain Y4305U3. Plasmid pZKL4- 220Eg4 (SEQ ID NO:390) was identical in construction to that of plasmid pZKL4- 220Ea4 (SEQ ID NO:362; FIG. 52B; Table 28 of Example 34), with the exception that EgD4S (SEQ ID NO:387) was used in place of EaD4S (SEQ ID NO: 192).
Plasmid pZKL4-220Eg4 was digested with Asc\/Sph\, and then transformed into Yarrowia lipolytica strain Y4305U3, as described in the General Methods. The transformants were selected on MM plates. After 5 days growth at 30 °C, 14 transformants grown on the MM plates were picked and re-streaked onto fresh MM plates. Once grown, these strains were individually inoculated into 3 ml_ liquid MM at 30 °C and shaken at 250 rpm/min for 2 days. The cells were collected by centrifugation, resuspended in HGM and then shaken at 250 rpm/min for 5 days. The cells were collected by centrifugation, lipids were extracted, and fatty acid methyl esters were prepared by trans-esterification, and subsequently analyzed with a Hewlett-Packard 6890 GC.
GC analyses showed that there were an average of 4.9% DHA and 17.8% DPA of total lipids produced in all 14 transformants, wherein the conversion efficiency of DPA to DHA was determined to be about 21.5% (calculated as described in Example 27). Thus, this experimental data demonstrated that the synthetic Euglena gracilis delta-4 desaturase codon-optimized for expression in Yarrowia lipolytica (i.e., EgD4S, as set forth in SEQ ID NO:387) was active to convert DPA to DHA. EXAMPLE 53
Determination Of The Functional Domain In The Synthetic Delta-4 Desaturase Derived From Eualena gracilis And Codon-Qptimized For Expression In Yarrowia lipolvtica (EαD4S) In a manner similar to that observed in EaDHAsyni (FIG. 52C), the C- terminal portion of the C20 elongase domain of EgDHAsyni appears to overlap with the N-terminal portion of the delta-4 desaturase domain of EgDHAsyni , based on sequence comparison.
In order to define the functional delta-4 desaturase domain in EgD4S (SEQ ID NO:387), three EgD4S mutants with different N-terminal truncations were generated. A Λ/col site was introduced into pEgD4S (SEQ ID NO:389; Example 52) by site-directed mutagenesis using primer pairs YL935 and YL936 (SEQ ID NOs:391 and 392, respectively), YL937 and YL938 (SEQ ID NOs:393 and 394, respectively) and YL939 and YL940 (SEQ ID NOs:395 and 396, respectively) to generate pEgD4S-M1 (SEQ ID NO:397), pEgD4S-M2 (SEQ ID NO:398) and pEgD4S-M3 (SEQ ID NO:399) constructs, respectively. The Nco\/Not\ fragments containing different truncations of EgD4S from pEgD4S-M1 , pEgD4S-M2 and pEgD4S-M3 were used to generate pZKL4-220Eg4-1 (SEQ ID NO:400), pZKL4- 220Eg4-2 (SEQ ID NO:401) and pZKL4-EgD4-3 (SEQ ID NO:402) constructs. These three constructs were exactly the same as pZKL4-220Eg4 (SEQ ID NO:390; Example 52), except that the coding region of EgD4S was truncated at the N- terminal region. Specifically, instead of the 540 amino acid long coding sequence of EgD4S (SEQ ID NO:388), the truncated EgD4S* polypeptide was 513 amino acids in length in pZKL4-220Eg4-1 (i.e., SEQ ID NO:404), 490 amino acids in length in pZKL4-220Eg4-2 (i.e., SEQ ID NO:406) and 474 amino acids in length in pZKL4- 220Eg4-3 (i.e., SEQ ID NO:408). The N-terminal region of these polypeptides (corresponding to amino acids 1-80 of SEQ ID NO:388) are aligned in FIG. 53B.
Plasmids pZKL4-220Eg4 (SEQ ID NO:390), pZKL4-220Eg4-1 (SEQ ID NO:400), pZKL4-220Eg4-2 (SEQ ID NO:401) and pZKL4-EgD4-3 (SEQ ID NO:402) were digested with Asc\/Sph\, and then transformed into Yarrowia lipolytica strain Y4305U3 individually, as described in the General Methods. Transformants were selected on MM plates. After 5 days growth at 30 C, 4 transformants grown on the MM plates from each construct were picked and re-streaked onto fresh MM plates. Once grown, these strains were individually inoculated into 3 ml_ liquid MM at 30 C and shaken at 250 rpm/min for 2 days. The cells were collected by centrifugation, resuspended in HGM and then shaken at 250 rpm/min for 5 days. The cells were collected by centrifugation, lipids were extracted, and fatty acid methyl esters were prepared by trans-esterification, and subsequently analyzed with a Hewlett-Packard 6890 GC.
The GC results are shown below in Table 30. The composition of DPA and DHA are presented as a % of the total fatty acids. The conversion efficiency ("Conv. Effic") was measured according to the following formula: ([DHA]/[DPA+DHA])*100. The delta-4 desaturase activity of each truncated EgD4S* (i.e., the polypeptides of 513 amino acids, 490 amino acids or 474 amino acids in Yarrowia transformants of pZKL4-220Eg4-1 , pZKL4-220Eg4-2 and pZKL4-220Eg4-3, respectively) was compared to that of EgD4S (i.e., Yarrowia transformants of pZKL4-220Eg4) in the column labeled "% Delta-4 Activity".
TABLE 30
Functional Analysis Of EgD4S And Truncated Variants In Yarrowia lipolvtica Strain Y4305U3
Figure imgf000209_0001
These data demonstrated that the N-terminal 28 amino acids of EgD4S* are dispensable (i.e., see pZKL4-220Eg4-1 , where the first 28 amino acids of EgD4S were truncated and the truncated protein set forth as SEQ ID NO:404 retained full delta-4 desaturase activity). Reduced delta-4 desaturase activity was measured in transformants with pZKL4-220EgD4-2 and pZKL4-220EgD4-3, when additional amino acids were truncated from the 5' portion of the EgD4S protein, thereby resulting in SEQ ID NO:406 and SEQ ID NO:408. EXAMPLE 54 Synthesis And Functional Analysis Of A Codon-Qptimized EqDHAsvni Gene
(EqDHAsvniS) From Eualena gracilis In Yarrowia lipolvtica Plasmid pZKLY-G204 (FIG.54A; SEQ ID NO:409) was designed to integrate a chimeric gene containing a codon-optimized EgDHAsyni coding region (i.e., EgDHAsyniS, set forth as SEQ ID NOs:410 and 411) into the lipase 7 locus (GenBank Accession No. AJ549519) of Yarrowia lipolytica strain Y4305U3. In addition to the modification of the translation initiation site, 417 bp of the 2382 bp coding region were modified (17.5%), and 391 codons of the total 794 codons were optimized (49.2%). The amino acid sequence of the codon-optimized EgDHAsyniS (SEQ ID NO:411) is 100% identical in sequence to that of EgDHAsyni (SEQ ID NO:12).
To generate pZKLY-G204, a Kpn\ site was introduced into pEgC20ES (SEQ ID NO: 185; see FIG. 51 B, Example 32) to generate pEgC20ES-K (SEQ ID NO:412; FIG. 54B) by site-directed mutagenesis using oligonucleotides YL973 and YL974 (SEQ ID NOs:413 and 414, respectively) as primers and pEgC20ES as template. A 732 bp Pme\/Nco\ fragment containing the YAT1 promoter (Patent Publication US 2006/0094102-A1) of pYNTGUS1-CNP (FIG. 54C; SEQ ID NO:415), the 873 bp Nco\/Kpn\ fragment of pEgC20ES-K containing the codon-optimized N-terminal portion of EgDHAsyniS and the 1512 bp Kpn\INco\ fragment of pEgD4S (SEQ ID NO:389; Example 52) containing the codon-optimized C-terminal portion of EgDHAsyniS were isolated, and then used to replace the PmeUNott fragment of pZKLY (FIG. 54D; SEQ ID NO:416) to generate pZKLY-G204 (FIG. 54A). Thus, pZKLY-G204 contained the following components: TABLE 31
Components Of Plasmid pZKLY-G204 (SEQ ID NO:409)
Figure imgf000210_0001
Figure imgf000211_0001
The pZKLY-G204 plasmid was digested with AscUSphl, and then transformed into Yarrowia lipolytica strain Y4305U3, as described in the General Methods. The transformants were selected on MM plates. After 5 days growth at 30 C, 8 transformants grown on the MM plates were picked and re-streaked onto fresh MM plates. Once grown, these strains were individually inoculated into 3 ml_ liquid MM at 30 C and shaken at 250 rpm/min for 2 days. The cells were collected by centrifugation, resuspended in HGM and then shaken at 250 rpm/min for 5 days. The cells were collected by centrifugation, lipids were extracted, and fatty acid methyl esters were prepared by trans-esterification, and subsequently analyzed with a Hewlett-Packard 6890 GC.
GC analyses showed that there were about 5.0% DHA, 13.5% DPA and 26.5% EPA of total lipids produced in all 8 transformants. The conversion efficiency of EPA to DPA and DHA was determined to be about 41%; and, the conversion efficiency of DPA to DHA was determined to be about 27% in these eight strains (calculated as described in Example 27). Thus, this experimental data demonstrated that the synthetic Euglena gracilis DHA synthase codon-optimized for expression in Yarrowia lipolytica (i.e., EgDHAsyniS as set forth in SEQ ID NO:410) contained both C20 elongase activity and delta-4 desaturase activity. EgDHAsyniS could use EPA as substrate to produce DHA.
EXAMPLE 55 Creation Of Delta-9 Elongase/Delta-8 Desaturase Gene Fusions For Expression In
Yarrowia lipolvtica
In order to improve the enzyme activity of delta-9 elongase and delta-8 desaturase in Yarrowia lipolytica, a series of six delta-9 elongase/delta-8 gene fusions (multizymes) are created in the present Example, using two variant linker sequences (i.e., SEQ ID NO:438 [GPARPAGLPPATYYDSLAV ] and SEQ ID NO:445 [GAGPARPAGLPPATYYDSLAVMGS]) derived from the EgDHAsyni proline-rich linker (SEQ ID NO:198; PARPAGLPPATYYDSLAV). This work required: identification of appropriate delta-9 elongases and delta-8 desaturases for expression in Yarrowia lipolytics; and, construction of plasmid pZUFmG9G8fu (comprising a EgD9ES/EgD8M gene fusion), plasmid pZuFmG9G8fu-B (comprising a EgD9ES/EgD8M gene fusion), plasmid pZUFmG9A8 (comprising a EgD9ES/EaD8S gene fusion), plasmid pZUFmA9G8 (comprising a EaD9ES/EgD8M gene fusion), plasmid pZUFmA9A8 (comprising a EaD9ES/EaD8S gene fusion) and plasmid pZUFmR9G8 (comprising a E389D9eS/EgD8M gene fusion). All plasmids shared a common vector backbone and thus were distinguished only by the gene fusion that each comprised.
Functional analysis of the activity of each gene fusion is tested infra, in Example 56.
Description Of Synthetic Delta-9 Elongase And Delta-8 Desaturase Genes Codon- Optimized For Expression In Yarrowia lipolytics
The Applicants have performed considerable analyses of various delta-9 elongases and delta-8 desaturases, to determine those enzymes having optimal substrate specificity and/or substrate selectivity when expressed in Yarrowia lipolytica. Based on these analyses, the genes described below in Table 32, and codon-optimized genes derived there from (based on methodology of U.S. Patent 7,125,672), were identified as preferred for expression in Y. lipolytica. Those genes highlighted in bold text were subsequently utilized to create delta-9 elongase/delta-8 desaturase gene fusions.
TABLE 32 Preferred Desaturases And Elonqases For Creation Of Delta-9/Delta-8 Gene Fusions In Yarrowia lipolvtica
4 4
Figure imgf000213_0001
Notes:
• EgD9e was described as "EgD9elo" in Example 16 herein.
• EaD9e was identified as "EaD9Elo1" in U.S. Provisional Patent Application No. 60/911925 and in Example 36, herein.
• EgD8 was identified as "Eg5" in U.S. Patent 7,256,033.
• EgD8S was identified as "D8SF" in U.S. Patent 7,256,033. i
• EgD8M was identified as "EgD8S-23" in U.S. Patent Application No. 11/635258. 4 - -
The LA to EDA conversion efficiency of EgD9eS, E389D9eS and EaD9eS is reported in each of the applications cited above. Briefly, however, when each delta- 9 elongase was expressed as a chimeric gene in Yarrowia lipolytics strain Y2224 (FIG. 44), under the control of a Yarrowia FBAINm promoter (PCT Publication No. WO 2005/049805; U.S. Patent 7,202,356) and a Pex20 terminator sequence from the Yarrowia Pex20 gene (GenBank Accession No. AF054613), the following substrate conversions were independently measured: EgD9eS (20.1%); EaD9eS (13%); and, E389D9eS (12%). All synthetic codon-optimized genes functioned with greater substrate conversion efficiency than the corresponding wildtype gene. U.S. Patent 7,256,033 discloses a E. gracilis delta-8 desaturase ("EgD8") able to desaturate EDA and EtrA to DGLA and ETA, respectively, as well as a synthetic delta-8 desaturase derived from EgD8 and codon-optimized for expression in Yarrowia lipolytica ( "EgD8S"). Despite the usefulness of EgD8 and EgD8S, a synthetically engineered mutant delta-8 desaturase identified herein as EgD8M (SEQ ID NOs:327 and 328) is used in more preferred embodiments for expression in Yarrowia lipolytica. As described in U.S. Patent Application No. 11/635258, "EgDδM" (identified therein as "EgD8S-23") was created by making multiple rounds of targeted mutations within EgD8S. The effect of each mutation on the delta-8 desaturase activity of the resulting mutant was screened to ensure functional equivalence with the delta-8 desaturase activity of EgD8S (SEQ ID NO:426). As a result of this work, mutant EgDδM (SEQ ID NO:328) comprises the following 24 amino acid mutations with respect to the synthetic codon-optimized EgDδS sequence set forth as SEQ ID NO:426: 4S to A, 5K to S, 12T to V, 16T to K, 17T to V, 66P to Q, 67S to A, 10δS to L, 117G to A, 11δY to F, 120L to M, 121 M to L, 125Q to H, 126M to L, 132V to L, 133 L to V, 162L to V, 163V to L, 293L to M, 407A to S, 40δV to Q, 41δA to G, 419G to A and 422L to Q. Pairwise alignment of the EgDδM and EgDδS protein sequences using default parameters of Vector NTI®'s AlignX program (Invitrogen Corporation, Carlsbad, CA) revealed 94.3% sequence identity and 97.9% consensus between the two proteins over a length of 422 amino acids. Average EDA to DGLA substrate conversion by this mutant delta-δ desaturase was determined to be 37%, when EgDδM was expressed in Yarrowia lipolytica strain Y4001 (FIG. 44), under the control of a Yarrowia FBAINm promoter (PCT Publication No. WO 2005/049605; U.S. Patent 7,202,356) and a Pex20 terminator sequence from the Yarrowia Pex20 gene (GenBank Accession No.
AF054613).
When EaDδS was expressed in Yarrowia lipolytica strain Y4001 U (FIG. 44), under the control of a Yarrowia FBAINm promoter U.S. Patent 7,202,356) and a Pex20 terminator sequence from the Yarrowia Pex20 gene (GenBank Accession
No. AF054613), there was 41% conversion efficiency to DGLA with endogenous
EDA as substrate.
Generation Of Construct pZUFmEαD9ES-Na. Comprising EqD9ES
Plasmid pZuFmEgD9ES (SEQ ID NO:431), which was previously described in Patent Publication US 2007-0117190 A1 , comprises a chimeric
FBAINm::EgD9ES::Pex20 gene, a CoIEI plasmid origin of replication, an ampicillin- resistance gene (AmpR) for selection in E. coli, a
Yarrowia autonomous replication sequence (ARS18; GenBank Accession No.
A17608), and a Yarrowia Ura 3 gene (GenBank Accession No. AJ306421). A Nar I site was introduced into pZuFmEgD9ES to generate pZuFmEgD9ES-
Na (SEQ ID NO:432) using oligonucleotides YL989 and YL990 (SEQ ID NOs:433 and 434, respectively) as primers and pZuFmEgD9ES as template. The introduced
Nar I site (i.e., GGCGCC) was located just before the translation stop codon of
EgD9ES; therefore, the coding region of EgD9ES was extended with two additional amino acids (i.e., a glycine and an alanine).
Generation Of Construct pZUFmG9G8fu. Comprising A EqD9ES/EqD8M Gene
Fusion
The N-terminal portion of EgDδM (SEQ ID NO:327) was amplified by PCR using oligonucleotides YL991 and YL992 (SEQ ID NOs:435 and 436, respectively) as primers and pKO2UFm8A (SEQ ID NO:437) as template. Oligonucleotide YL991 contained a Nar I site at its 5' end and DNA sequence encoding a modified variant of the EgDHAsyni proline-rich linker (i.e., GPARPAGLPPATYYDSLAV, as set forth in SEQ ID NO:438 versus PARPAGLPPATYYDSLAV, as set forth in SEQ ID
NO: 198). This linker possessed an additional glycine at the 5' end, with respect to the EgDHAsyni proline-rich linker (SEQ ID NO: 198).
The Nar UBgI Il digested PCR product comprising the 51 portion of EgDδM and the BgI W/Not I digested fragment of pKO2UFm8A comprising the 3' portion of
EgDδM was used to replace the Nar UNot I fragment of pZUFmEgD9ES-Na to generate pZUFmG9G8fu (FIG. 55A)1 which thereby contained the following components:
TABLE 33 Components Of Plasmid pZUFmG9G8fu (SEQ ID NO:439)
Figure imgf000216_0001
Generation Of Construct pZuFmG9G8fu-B. Comprising A EgD9ES/EgD8M Gene Fusion
A BamH I site (i.e., GGATCC) was introduced into pZUFmG9G8fu to generate pZUFmG9G8fu-B (SEQ ID NO:442) using oligonucleotides YL1043 and YL1044 (SEQ ID NOs:443 and 444, respectively) as primers and pZuFmG9G8fu as template. The BamH I site was located just after the translation start codon ATG and was in the reading frame of EgD8M, which resulted in a two amino acid insertion (i.e., glycine and serine) between the methionine amino acid residue and the remaining portion of the EgD8M polypeptide. This modification caused the linker region between EgD9ES and EgD8M to become a peptide having the sequence set forth as SEQ ID NO:445 (i.e., GAGPARPAGLPPATYYDSLAVMGS); the nucleotide and translated amino acid sequence of the full-length EgD9ES/EgD8M gene fusion is set forth as SEQ ID NOs:446 and 447, respectively. Generation Of Construct pZUFmG9A8. Comprising A EqD9ES/EaD8S Gene Fusion Plasmid pEaD8S (SEQ ID NO:448) was created when the EaD8S gene (SEQ ID NO:429) was cloned into pUC57 (GenBank Accession No. Y14837). Then, a BamH I site was introduced into pEaD8S using oligonucleotides YL1059 and YL1060 (SEQ ID NOs:449 and 450, respectively) as primers and pEaD8S as template, to generate pEaD8S-B (SEQ ID NO:451). The introduced BamH I site (i.e., GGATCC) was located just before the translation start codon of EaD8S in pEaD8S-B, and is in the same reading frame with EaD8S.
The BamH UNot I fragment of pEaD8S-B comprising EaD8S was used to replace the BamH UNot I fragment of pZUFmG9G8fu-B (SEQ ID NO:442) to generate pZUFmG9A8 (SEQ ID NO:452; FIG. 55B) (thereby introducing EaD8S in place of EgDδM). The linker region between EgD9ES and EaDδS was a peptide having the sequence set forth in SEQ ID NO:445 (i.e.,
GAGPARPAGLPPATYYDSLAVMGS). Thus, plasmid pZUFmG9Aδ contained the EgD9ES/EaDδS gene fusion, flanked by the Yarrowia lipolytica FBAINm promoter and a Pex20 terminator (GenBank Accession No. AF054613). The nucleotide and translated amino acid sequence of the full-length EgD9ES/EaDδS gene fusion is set forth as SEQ ID NOs:453 and 454, respectively.
Generation Of Construct pZUFmA9Gδ. Comprising A EaD9ES/EqD8M Gene Fusion Plasmid pZUFmEaD9ES (SEQ ID NO:455) contained the EaD9ES gene, flanked by the Yarrowia lipolytica FBAINm promoter and a Pex20 terminator
(GenBank Accession No. AF054613). A Nar I site was introduced into the plasmid to generate pZUFmEaD9ES-Na (SEQ ID NO.456) using oligonucleotides YL1049 and YL1050 (SEQ ID NOs:457 and 45δ, respectively) as primers and pZUFmEaD9ES as template. The introduced Nar I site (i.e., GGCGCC) was located just before the translation stop codon of EaD9ES, and is in the same reading frame of EaD9ES.
The Nco \INar I fragment of pZUFmEaD9ES-Na (SEQ ID NO:456) comprising EaD9ES was used to replace the Nco \INar I fragment of pZUFmG9G8fu-B (SEQ ID NO:442) to generate pZUFmA9G8 (SEQ ID NO:459) (thereby introducing EaD9ES in place of EgD9ES). The linker region between EaD9ES and EgD8M was a peptide having the sequence set forth in SEQ ID NO:445 (i.e., GAGPARPAGLPPATYYDSLAVMGS). Thus, plasmid pZUFmA9G8 contained the EaD9ES/EgD8M gene fusion, flanked by the Yarrowia lipolytics FBAINm promoter and a Pex20 terminator (GenBank Accession No. AF054613). The nucleotide and translated amino acid sequence of the full-length EaD9ES/EgD8M gene fusion is set forth as SEQ ID NOs:460 and 461 , respectively. Generation Of Construct pZUFmA9A8. Comprising A EaD9ES/EaD8S Gene Fusion The BamH I \/Not I fragment of pEaD8S-B (SEQ ID NO:451) comprising
EaD8S was used to replace the BamH VNot I fragment of pZUFmA9G8 (SEQ ID NO:459) to generate pZUFmA9A8 (SEQ ID NO:462) (thereby introducing EaD8S in place of EgD8M). The linker region between EaD9ES and EaD8S was a peptide having the sequence set forth in SEQ ID NO:445 (i.e., GAGPARPAGLPPATYYDSLAVMGS). Thus, plasmid pZUFmA9A8 contained the EaD9ES/EaD8S gene fusion, flanked by the Yarrowia lipolytica FBAINm promoter and a Pex20 terminator (GenBank Accession No. AF054613). The nucleotide and translated amino acid sequence of the full-length EaD9ES/EaD8S gene fusion is set forth as SEQ ID NOs:463 and 464, respectively. Generation Of Construct pZUFmR9G8. Comprising A E389D9eS/EqD8M Gene Fusion
Plasmid pE389S (SEQ ID NO:465) was created when the E389D9eS gene (SEQ ID NO:358) was cloned into pUC57 (GenBank Accession No. Y14837). Then, a Naή site was introduced into pE389S to generate pE389S-Na (SEQ ID NO:466) using oligonucleotides YL1051 and YL1052 (SEQ ID NOs:467 and 468, respectively) as primers and pE389S as template. The introduced Nar \ (i.e., GGCGCC) site was located just before the translation stop codon and was in the same reading frame of E389D9eS.
The Nco \INar I fragment of pE389S-Na comprising E389D9eS was used to replace the Nco UNar I fragment of pZUFmG9G8fu-B comprising EgD9ES to generate pZUFmR9G8 (SEQ ID NO:469) (thereby introducing E389D9eS in place of EgD9ES). The linker region between E389D9eS and EgD8M was a peptide having the sequence set forth in SEQ ID NO:445 (i.e., GAGPARPAGLPPATYYDSLAVMGS). Thus, plasmid pZUFmR9G8 contained the E389D9eS/EgD8M gene fusion, flanked by the Yarrowia lipolytics FBAINm promoter and a Pex20 terminator (GenBank Accession No. AF054613). The nucleotide and translated amino acid sequence of the full-length E389D9eS/EgD8M gene fusion is set forth as SEQ ID NOs:470 and 471 , respectively.
Example 56 Functional Analyses Of Delta-9 Elongase/Delta-8 Desaturase Gene Fusions In
Yarrowia lipolvtica Strain Y2224 The plasmids from Example 55 [i.e., pZUFmEgD9ES (SEQ ID NO:431), pZUFMEgD9ES-Na (SEQ ID NO:432), pZUFMG9G8fu (SEQ ID NO:439), pZUFmG9G8fu-B (SEQ ID NO:442), pZUFmG9A8 (SEQ ID NO:452), pZUFmA9G8 (SEQ ID NO:459), pZuFmA9A8 (SEQ ID NO:462) and pZUFmR9G8 (SEQ ID NO:469)] were transformed into Yarrowia lipolytica strain Y2224 individually, as described in the General Methods. The transformants were selected on MM plates. After 2 days growth at 30 C, eight transformants from each transformation reaction were streaked out onto new MM plates and incubated for an additional 2 days at 30 °C. Once grown, these strains were individually inoculated into 3 mL liquid MM at 30 °C and shaken at 250 rpm/min for 2 days. The cells were collected by centrifugation, the supernatant was removed and 3 mL of HGM was added. These strains were grown in a 30 °C incubator shaking at 250 rpm for an additional 5 days. The cells were collected by centrifugation, lipids were extracted, and fatty acid methyl esters were prepared by trans-esterification, and subsequently analyzed with a Hewlett-Packard 6890 GC.
GC analyses showed that there were both delta-9 elongase and delta-8 desaturase activities in all strains having a delta-9 elongase/delta-8 desaturase gene fusion. The results are summarized below in Table 34. Delta-9 elongase activity was calculated by dividing the sum of the weight percent (wt %) for EDA and DGLA by the sum of the wt % for LA, EDA and DGLA and multiplying by 100 to express as a percent; similarly, delta-8 desaturase activity was calculated by dividing the wt % for DGLA by the sum of the wt % for EDA and DGLA and multiplying by 100 to express as a percent. TABLE 34
O
4 4
0
K*
Figure imgf000220_0001
i O 4 - -
Delta-9 Elonαase And Delta-8 Desatu rase Activities In Yarrowia Transformed
With Various Gene Fusions
In summary of Table 34, the data showed that all six fusion genes had both delta-9 elongase and delta-8 desaturase activities; thus, the fusion proteins from the fusion genes effectively permitted expression of two independent and separable enzymatic activities. More importantly, fusing the two independent enzymes together as one fusion protein separated by a linker region increased flux from LA to DGLA. In all cases, the fusion gene had higher activity than at least one of the individual genes when expressed alone in Yarrowia. These data suggested that the product of delta-9 elongase may be directly channeled as substrate of delta-8 desaturase in the fusion protein.
More specifically, in the case of the EgD9ES/EgD8M gene fusion (i.e., pZuFmG9G8fu-B) and the EgD9ES/EaD8S gene fusion (i.e., pZuFmG9A8), the EgD9ES delta-9 elongase (21% conversion) performed in a manner comparable to that when EgD9ES was expressed alone (20% conversion [pZuFmEgD9ES data and Example 55]). In contrast, however, the EgD8M delta-8 desaturase activity in Yarrowia expressing pZuFmG9G8-B was about 97% more efficient than when EgD8M was expressed alone (73% versus 37% conversion [Example 55]). Similarly, the EaD8S delta-8 desaturase activity in Yarrowia expressing pZuFmG9A8 was about 63% more efficient than when EaD8S was expressed alone (67% versus 41% conversion [Example 55]).
In the case of the EaD9ES/EgD8M gene fusion (i.e., pZuFmA9G8) and the EaD9ES/EaD8S gene fusion (i.e., pZuFmA9A8), the EaD9ES delta-9 elongase activity was about 15% and 38% more efficient, respectively, than when EaD9ES was expressed alone (15% and 18% conversion, respectively, versus 13% conversion [Example 55]). Likewise, the EgD8M delta-8 desaturase activity in Yarrowia expressing pZuFmA9G8 was about 46% more efficient than when EgD8M was expressed alone (54% versus 37% conversion [Example 55]). Similarly, the EaD8S delta-8 desaturase activity in Yarrowia expressing pZuFmA9A8 was about 32% more efficient than when EaD8S was expressed alone (58% versus 41 % conversion [Example 55]).
Finally, in the case of the E389D9eS/EgD8M gene fusion (i.e., pZuFmR9G8), the E389D9eS delta-9 elongase activity was about 50% more efficient than when E389D9eS was expressed alone (18% versus 12% conversion [Example 55]). Likewise, the EgDδM delta-8 desaturase activity in Yarrowia expressing pZuFmR9G8 was about 89% more efficient than when EgDδM was expressed alone (70% versus 37% conversion [Example 55]). Table 34 also demonstrated that the modified linker
GAGPARPAGLPPATYYDSLAVMGS (SEQ ID NO:445) was preferred as opposed to the linker set forth as SEQ ID NO:438 in Yarrowia lipolytica, when fusing delta-9 elongase and delta-8 desaturase genes together.
It will be obvious to one of skill in the art that other PUFA desaturase and elongase genes that are preferred for expression in Yarrowia lipolytica (including, for example, any of those genes described in Tables 8-19) can be fused together in a manner similar to that described above and expressed in Yarrowia lipolytica. Preferred promoters and terminators suitable for construction of an expression cassette (wherein the ORF expressed encodes a multizyme) may be selected from those described in Tables 8-19. It is hypothesized that increased efficiency or flux would be observed in the fusion gene as opposed to when either (or both) individual genes are expressed alone.
EXAMPLE 57 Creation Of Delta-9 Elongase/Delta-8 Desaturase Gene Fusions For Expression in Sov
In order to characterize multizyme fusions between delta-9 elongases and delta-8 desaturases in soy, a series of delta-9 elongase/delta-8 multizymes were created. Delta-9 elongase and delta-8 desaturase domains were separated by the EgDHAsyni proline-rich linker (SEQ ID NO:198). For comparison, constructs that co-expressed individual delta-9 elongase and delta-8 desaturase genes were also created. Delta-9 elongases used include EgD9elo (Example 16; SEQ ID NO:112; also referred to as EgD9e and EgD9E herein, but they are identical) and EaD9elo1 (SEQ ID NO:252; Example 36; also referred to as EaD9E and EaD9e herein, but they are all identical). Delta-8 desaturases used include TpomD8 (SEQ ID NO:162; Example 21) and the Euglena anabaena delta-8 desaturase (EaD8Des3; SEQ ID NO:427; also referred to as EaD8 but they are identical; described in U.S. Provisional Application No. 60/910831 (filed April 10, 2007; Attorney Docket No. BB- 1615). In the present Example and for Example 23, which describes the synthesis of the EgD9elo-EgDHAsyn1 Link-PavD8 fusion, additional nucleotides were added to the 31 end of the EgDHAsyni proline-rich linker sequence to enable cloning when making the fusions for all constructs. Thus, an additional 4 amino acids were included between the end of the EgDHAsyni proline-rich linker (SEQ ID NO: 198; PARPAGLPPATYYDSLAV) and the start of the delta-8 desaturase used (i.e. SEQ ID NO:472; PARPAGLPPATYYDSLAVSGRT).
Plasmid pKR1183 (SEQ ID NO:266) comprising the Euglena anabaena delta- 9 elongase-Tetruetreptia pomquetensis CCMP1491 delta-8 desaturase fusion (Hybrid 1 -HGLA Synthase; also called EaD9e/TpomD8) was described in Example 38.
Other plasmids described in the present example include: pKR1014 (described in U.S. Patent Application No. 11/876,115 (filed October 22, 2007; Attorney Docket No. BB-1574; comprising EgD9e and TpomDδ expressed individually), pKR1152 (comprising an EgD9e and EaD8 expressed individually), pKR1151 (comprising an EaD9e and TpomDδ expressed individually), pKR1150 (comprising an EaD9e and EaD8 expressed individually), pKR1184 (comprising a EaD9e/EaD8 gene fusion), pKR1199 (comprising a EgD9e/TpomD8 gene fusion), and pKR1200 (comprising a EgD9e/EaD8 gene fusion). A summary of the constructs made and respective genes tested along with the SEQ ID NOs for the nucleotide and amino acid sequences produced is shown in Table 35.
Functional analysis of the activity of each gene fusion is tested infra, in Example 60.
TABLE 35 Preferred Desaturases And Elonqases For Creation Of Delta-9/Delta-8 Gene
Figure imgf000223_0001
Figure imgf000224_0001
Construction of pKR1014
Vector pKR123r, which was previously described in PCT Publication No. WO 2004/071467 (published August 26, 2004), contains a Λ/ofl site flanked by the Kunitz soybean Trypsin Inhibitor (KTi3) promoter (Jofuku et al., Plant Ce// 1 :1079-1093 (1989)) and the KTi 3' termination region, the isolation of which is described in U.S. Patent No. 6,372,965 (KTi3/Λ/ofl/KTi3' cassette). TpomDδ (SEQ ID NO:162; Example 21) was released from pLF114-10 (SEQ ID NO:165; Example 21) by digestion with Λ/ofl and cloned into the Λ/ofl site of pKR123r to produce pKR1007 (SEQ ID NO:473).
Vector pKR912, which was previously described in US-2007-0118929-A1 and published May 24, 2007, contains the hygromycin B phosphotransferase gene, flanked by the 35S promoter (Odell et al., Nature 313:810-812 (1985)) and NOS 3' transcription terminator (Depicker et al., J. MoI. Appl. Genet. 1 :561-570 (1982)) (35S/hpt/NOS3' cassette) for selection in plants such as soybean. Vector pKR912 also contains EgD9e (SEQ ID NO:112), flanked by the promoter for the α' subunit of β-conglycinin (Beachy et al., EMBO J. 4:3047-3053 (1985)) and the 3' transcription termination region of the phaseolin gene (Doyle et al., J. Biol. Chem. 261 :9228-9238 (1986)), thus allowing for strong tissue-specific expression of EgD9e in the seeds of soybean.
Plasmid pKR1007 (SEQ ID NO:473) was digested with Pstl, and the fragment containing the Tetruetreptia pomquetensis delta-8 desaturase was cloned into the Sbfl site of pKR912, to give pKR1014 (SEQ ID NO:474). In this way, the Tetruetreptia pomquetensis delta-8 desaturase is co-expressed with the Euglena gracilis delta-9 elongase behind strong, seed-specific promoters. A schematic depiction of pKR1014 is shown in FIG. 56. In FIG. 56, TpomD8 is called Tetruetreptia pomquetensis 1491 delta-8 Desaturase, and EgD9e is called eug eh . Construction of pKR1151
In order to introduce Notl and Λ/col restriction sites at the 5' end of the coding sequences and a Not\ site at the 3' end of the coding sequences, EaD8 (SEQ ID NO:427) was amplified with oligonucleotide primers EaD8-5 (SEQ ID NO:475) and EaD8-3 (SEQ ID NO:476) using the Phusion™ High-Fidelity DNA Polymerase (Cat. No. F553S, Finnzymes Oy, Finland) following the manufacturer's protocol. The resulting DNA fragment was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pLF120-3 (SEQ ID NO:477).
Vector pLF120-3 (SEQ ID NO:477), was digested with Notl, and the fragment containing EaD8 was cloned into the Not\ site of pKR457 (SEQ ID NO:122; Example 16), to produce pKR1138 (SEQ ID NO:478).
Vector pKR1138 (SEQ ID NO:478) was digested with BsiWI, and the fragment containing EaD8 was cloned into the BsiWI site of pKR912 to give pKR1152 (SEQ ID NO:479). A schematic depiction of pKR1152 is shown in FIG. 57. In FIG. 57, EaD8 is called EaD8Des3, and EgD9e is called eug ell Construction of pKR1152
In order to introduce Notl and Ncol restriction sites at the 5' end of the coding sequences and a Notl site at the 3' end of the coding sequences, EaD9e was PCR amplified from pLF121-1 (SEQ ID NO:250; Example 36) with oligonucleotide primers oEAd9el1-1 (SEQ ID NO:298; Example 44) and oEAd9el1-2 (SEQ ID NO:480) using the Phusion™ High-Fidelity DNA Polymerase (Cat. No. F553S, Finnzymes Oy, Finland) following the manufacturer's protocol. The resulting DNA fragments were cloned into the pCR-BluntD cloning vector using the Zero BluntD PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pKR1137 (SEQ ID NO:481 ).
EaD9e was released from pKR1137 (SEQ ID NO:481) by digestion with Notl and cloned into the Notl site of pKR72 (SEQ ID NO: 105; Example 15) to produce pKR1140 (SEQ ID NO:482). TpomD8 was released from pLF114-10 (SEQ ID NO: 165; Example 21) by digestion with Λ/ofl and was cloned into the Λ/ofl site of plasmid pKR457 (SEQ ID NO:122; Example 16) to produce pKR1145 (SEQ ID NO:483).
Vector pKR1145 (SEQ ID NO:483) was digested with BsiWI and the fragment containing TpomDδ was cloned into the BsiWI site of pKR1140 (SEQ ID NO:482) to give pKR1151 (SEQ ID NO:484). A schematic depiction of pKR1151 is shown in FIG. 58. In FIG. 58, TpomDδ is called Tetruetreptia pomquetensis 1491 delta-8 Desaturase, and EaD9e is called EAdθelong.
Construction of pKR1150 Vector pKR1138 (SEQ ID NO:478) was digested with BsiWI, and the fragment containing EaD8 was cloned into the BsiWI site of pKR1140 (SEQ ID NO:482) to give pKR1150 (SEQ ID NO:485). A schematic depiction of pKR1150 is shown in FIG. 59. In FIG. 59, EaD8 is called EaD8Des3, and EaD9e is called EAd9elong. Construction of pKR1199
The Λ/col/Λ/ofl DNA fragment of KS373 (SEQ ID NO: 179; Example 23), containing EgD9elo-EgDHAsyn1 Link, was cloned into the Nco\/Not\ DNA fragment from pKR1177 (SEQ ID NO:264; Example 38), containing the promoter for the α' subunit of β-conglycinin, to produce pKR1190 (SEQ ID NO:486). The Λ/ofl fragment from pLF114-10 (SEQ ID NO: 165; Example 21), containing TpomDδ, was cloned into the Λ/ofl fragment of pKR1190 (SEQ ID NO:486) to produce pKR1195 (SEQ ID NO:4δ7).
The BamH\ DNA fragment of pKR1195 (SEQ ID NO:4δ7), containing the EgD9e/TpomDδ fusion gene, was cloned into the BamYW DNA fragment of pKR325, previously described in PCT Publication No. WO 2006/012325 to produce pKR1199 (SEQ ID NO:4δδ). A schematic depiction of pKR1199 is shown in FIG. 60. In FIG. 60, EgD9e/TpomDδ is called EGd9elong-TPOMdδDS.
Construction of pKR1200
The Λ/ofl fragment from pLF120-3 (SEQ ID NO:477), containing EaDδ was cloned into the Λ/ofl fragment of pKR1190 (SEQ ID NO:4δ6) to produce pKR1196 (SEQ ID NO:489).
The BamH\ DNA fragment of pKR1196 (SEQ ID NO:489), containing the EgD9e/EaDδ fusion gene, was cloned into the SamHI DNA fragment of pKR325, previously described in PCT Publication No. WO 2006/012325 to produce pKR1200 (SEQ ID NO:490). A schematic depiction of pKR1200 is shown in FIG. 61. In FIG.
61 , EgD9e/EaD8 is called EGd9ELONG-EAd8DS.
Construction of pKR1184 The Not\ fragment from pLF120-3 (SEQ ID NO:477), containing EaD8, was cloned into the Not\ fragment of pKR1179 (SEQ ID NO:265) to produce pKR1184 (SEQ ID NO:491 ). A schematic depiction of pKR1184 is shown in FIG. 62. In FIG.
62, EaD9e/EaD8 is called EAd9ELONG-EAd8DS.
EXAMPLE 58 Construction of Soybean Expression Vector pKR1321 for Expression of a
Tetruetreptia pomαuetensis CCMP1491 Delta-8 Desaturase-Ei/α/ena anabaena delta-9 elongase Fusion Gene (TpomD8-EaD9Elo1 fusion) The present example describes the construction of an in-frame fusion gene between the Tetruetreptia pomquetensis CCMP1491 delta-8 Desaturase (TpomDδ; SEQ ID NO:162; Example 21) and the Euglena anabaena delta-9 elongase (EaD9e; SEQ ID NO:252, Example 36). Each domain is separated by the EgDHAsyni linker with an additional 4 amino acids included between the end of the EgDHAsyni proline-rich linker and the start of the EaD9e, as described in Example 57 (i.e. SEQ ID NO:472; PARPAGLPPATYYDSLAVSGRT). Plasmid pKR1301 (SEQ ID NO:307; Example 44) was digested with EcoR\, and the DNA fragment containing the 3' end of TpomD8-EgDHAsyn1 Link (called TpomD8+L1TR1) was re-ligated to form pKR1303 (SEQ ID NO:497).
The Notl fragment of pKR1137 (SEQ ID NO:481 ; Example 57), containing the EaD9e, was cloned into the Eag\ site of pKR1303 (SEQ ID NO:497) to produce pKR1308 (SEQ ID NO:498). In this way, EaD9e was fused to the 3' end of TpomDδ.
The Gy1/Pavelo/legA2 cassette was released from plasmid pKR336 (described in PCT Publication Nos. WO 04/071467; the contents of which are hereby incorporated by reference) by digestion with Pstl/BamHI and cloned into the Pstl/BamHi site of pKR268 (described in PCT Publication Nos. WO 04/071467) to produce pKR393 (SEQ ID NO:499). The Pavelo gene was released from pKR393 (SEQ ID NO:499) by digestion with Notl, and the vector was re-ligated to form pKR407 (SEQ ID NO:500). TpomD8 was released from pLF114-10 (SEQ ID NO:165; Example 21) by digestion with Λ/ofl and was cloned into the Λ/ofl site of plasmid pKR407 (SEQ ID NO:500) to produce pKR1018 (SEQ ID NO:501).
Plasmid pKR1018 (SEQ ID NO:501) was digested with Hindlll/EcoRI, and the fragment containing the 5' end of the Tpomdδ was cloned into the Hindlll/EcoRI site of pKR1308 (SEQ ID NO:498) to produce pKR1312 (SEQ ID NO:502). In this way, the TpomD8 sequence was restored, and the TpomD8/EaD9e fusion was formed.
The Notl fragment of pKR1312 (SEQ ID NO:502), containing the TpomD8/EaD9e fusion, was cloned into the Notl site of pKR72 (SEQ ID NO: 105; Example 23) to produce pKR1321 (SEQ ID NO:503). A schematic depiction of pKR1321 is shown in FIG. 63. In FIG. 63, TpomD8/EaD9e is called TPd8ds-EAd9el fusion.
EXAMPLE 59
Construction of Soybean Expression Vector pKR1326 for Expression of a Euglena anabaena delta-9 elongase- Tetruetreptia pomguetensis CCMP1491 Delta-8
Desaturase Fusion Gene Using the Euglena anabaena DHAsvni proline-rich linker
The present example describes the construction of an in-frame fusion gene between the Euglena anabaena delta-9 elongase (EaD9e; SEQ ID NO:252, Example 36) and the Tetruetreptia pomquetensis CCMP1491 delta-8 Desaturase (TpomDδ; SEQ ID NO:162; Example 21). Each domain is separated by the
EaDHAsyni proline-rich linker (SEQ ID NO:235), but with an additional 3 amino acids included between the end of the EaDHAsyni proline-rich linker (EaDHAsyni Link) and the start of the EaD9e (i.e. SEQ ID NO:504; PGGPGKPSEIASLPPPIRPVGNPPAAYYDALATGRT). Cloning was performed as similarly described in Example 57.
An initial in-frame fusion between the EaD9e and the EaDHAsyni Link (EaD9elo-EgDHAsyn1 Link) was made by PCR amplification and was flanked by a Notl and Λ/col site at the 5'end and a Λ/ofl site at the 3' end. EaD9e (SEQ ID NO:252) was amplified from pLF121-1 (SEQ ID NO:250) with oligonucleotides oEAd9el1-1 (SEQ ID NO:298) and EaLinki (SEQ ID NO:505), using the Phusion™ High-Fidelity DNA Polymerase (Cat. No. F553S, Finnzymes Oy, Finland) following the manufacturer's protocol. EaDHAsyni Link (SEQ ID NO:234) was amplified in a similar way from pLF117-1 (SEQ ID NO:87; Example 13) with oligonucleotides EaLink2 (SEQ ID NO:506) and Eaϋnk3 (SEQ ID NO:507). The two resulting PCR products were combined and re-amplified usingoEAd9el1-1 (SEQ ID NO:298) and EaLink3 (SEQ ID NO:507) to form EaD9e-EaDHAsyn1Link. The sequence of the EaD9e-EaDHAsyn1 Link is shown in SEQ ID NO:508. EaD9e-EaDHAsyn1 Link was cloned into the pCR-Blunt® cloning vector using the Zero Blunt® PCR Cloning Kit (Invitrogen Corporation), following the manufacturer's protocol, to produce pKR1305 (SEQ ID NO:509).
The Eag\ DNA fragment of pKR1305 (SEQ ID NO:509), containing EaD9e- EaDHAsyni Link, was cloned into the Not\ site pKR1304 (SEQ ID NO:310; Example 44) to produce pKR1317 (SEQ ID NO:510). In this way, the 51 end of the TpomD8 was fused to EaD9e-EaDHAsyn1 Link.
The EcoRI/Asp718 fragment of pKR1127 (SEQ ID NO: 168; Example 22), containing the 31 end of the TpomDδ was cloned into the EcoRI/Asp718 fragment of pKR1317 (SEQ ID NO:510), containing EaD9e-EaDHAsyn1 Link to produce pKR1320 (SEQ ID NO:511).
The Not\ fragment from pKR1320 (SEQ ID NO:511), containing the fusion, was cloned into the Not\ fragment of pKR72 (SEQ ID NO: 105; Example 15) to produce pKR1326 (SEQ ID NO:512). A schematic depiction of pKR1326 is shown in FIG. 64. In FIG. 64, EaD9e/TpomD8 with the EaDHAsyni proline-rich linker is called EAd9el-TPOMd8ds L2fusion.
EXAMPLE 60
Functional Analyses Of Delta-9 Elongase/Delta-8 Desaturase Gene Fusions In Soy The present example describes the transformation and expression in soybean somatic embryos of pKR1014 (SEQ ID NO:474), pKR1152 (SEQ ID NO:479), pKR1151 (SEQ ID NO:484), pKR1150 (SEQ ID NO:485), pKR1199 (SEQ ID NO:488), pKR1200 (SEQ ID NO:490), and pKR1184 (SEQ ID NO:491), the syntheses of which were previously described in Example 57. Functional analyses of pKR1183 (SEQ ID NO:266) and KS373 (SEQ ID NO: 179) were previously described in Examples 46 and 31 , respectively. Soybean embryogenic suspension culture (cv. Jack) was transformed with each of the vectors above, and embryos were matured in soybean histodifferentiation and maturation liquid medium (SHaM liquid media; Schmidt et al., Cell Biology and Morphogenesis, 24:393 (2005)), as described in Example 25 and previously described in PCT Publication No. WO 2007/136877, published November 29, 2007 (the contents of which are hereby incorporated by reference).
After maturation in SHaM liquid media, a subset of transformed soybean embryos (i.e., 5-6 embryos per event) were harvested and analyzed as described herein.
In this way, approximately 30 events transformed with pKR1014 (SEQ ID NO:474), pKR1152 (SEQ ID NO:479), pKR1151 (SEQ ID NO:484), pKR1150 (SEQ ID NO:485), pKR1199 (SEQ ID NO:488), pKR1200 (SEQ ID NO:490), or pKR1184 (SEQ ID NO:491) were analyzed. The five events having the highest average DGLA content (average of the 5 embryos analyzed) are shown in FIGs. 65, 66, 67, 68, 69, 70, or 71 , respectively. In FIGs. 65-71 , fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA1 ALA, EDA, ERA, DGLA, and ETA. Fatty acid compositions are expressed as a weight percent (wt. %) of total fatty acids. Table 36 summarizes the vector, genes used, experiment number (MSE#), and corresponding FIG.
In FIGs. 65-71 , elongation activity is expressed as % delta-9 elongation of C18 fatty acids (C18 % delta-9 elong), calculated according to the following formula: ([product]/[substrate + product])*100. More specifically, the combined percent elongation for LA and ALA is determined as: ([DGLA + ETA + EDA + ERA]/[LA + ALA + DGLA + ETA + EDA + ERA])*100.
In FIGs. 65-71 , the combined percent desaturation for EDA and ERA is shown as "C20 % delta-8 desat", determined as: ([DGLA + ETA]/[DGLA + ETA + EDA + ERA])*100. This is also referred to as the overall % desaturation.
TABLE 36 Functional analysis of Delta-9/Delta-8 Gene Fusions In Soy
Figure imgf000230_0001
Figure imgf000231_0001
*ln FIG. 37. MSE2145 is listed as MSE2144
A comparison of individually expressed delta-9 elongases with delta-8 desaturases versus the equivalent delta-9 elongase-delta-8 desaturase fusion is shown in FIG. 72. In FIG. 72, each data point represents the average %DGLA or %EDA for 5-6 embryos (as a % of total fatty acids) for all events analyzed, and avg. %DGLA is plotted vs. avg. % EDA. In FIG. 72A1 EgTpom represents EgD9e co- expressed with TpomDδ (pKR1014), and EgTpomfus represents the EgD9e/TpomD8 fusion (pKR1199). In FIG. 72B, EgEa represents EgD9e co- expressed with EaD8 (pKR1152), and EgEafus represents the EgD9e/EaD8 fusion (pKR1200). In FIG. 72C, EaTpom represents EaD9e co-expressed with TpomDδ (pKR1151 ), and EaTpomfus represents the EaD9e/TpomD8 fusion (pKR1183). In FIG. 72D, EaEa represents EaD9e co-expressed with EaD8 (pKR1150), and EaEafus represents the EaD9e/EaD8 fusion (pKR1200).
EXAMPLE 61 Functional Analyses Of Delta-9 Elongase/Delta-8 Desaturase/Delta-5 Desaturase
Gene Fusion
The present example describes the transformation and expression in soybean somatic embryos of pKR1322 (SEQ ID NO:314; Example 50) comprising a EaD9Elo1-TpomD8-EaD5Des1 triple fusion (also called EaD9e/TpomD8/EaD5). Each domain is separated by the EgDHAsyni linker with an additional 4 amino acids (i.e. SEQ ID NO:472; PARPAGLPPATYYDSLAVSGRT).
Soybean embryogenic suspension culture (cv. Jack) was transformed with pKR1322, and embryos were matured in soybean histodifferentiation and maturation liquid medium (SHaM liquid media; Schmidt et al., Cell Biology and Morphogenesis, 24:393 (2005)) as described in Example 25 and previously described in PCT Publication No. WO 2007/136877, published November 29, 2007 (the contents of which are hereby incorporated by reference). After maturation in SHaM liquid media, a subset of transformed soybean embryos (i.e., δembryos per event) were harvested and analyzed as described herein.
In this way, approximately 30 events transformed with pKR1322 (Experiment MSE2274) were analyzed, and the five events having the highest average ARA and EPA content (average of the 5 embryos analyzed) are shown in FIG. 73. In FIG. 73, fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), 18:2 (5,9), LA, ALA, EDA, ERA, SCI, DGLA, JUN (also called JUP), ETA, ARA, and EPA. Fatty acid compositions are expressed as a weight percent (wt. %) of total fatty acids. In FIG. 73, elongation activity is expressed as % delta-9 elongation of C18 fatty acids (%Elo), calculated according to the following formula: ([product]/[substrate + product])* 100. More specifically, the combined percent elongation for LA and ALA is determined as: ([DGLA + ETA + EDA + ERA + EPA + ARA]/[LA + ALA + DGLA + ETA + EDA + ERA + EPA + ARA])*100. In FIG. 73, the combined percent delta-8 desaturation for EDA and ERA is shown as "%D8", determined as: ([DGLA + ETA + EPA + ARA]/[DGLA + ETA + EDA + ERA + EPA + ARA])M OO. This is also referred to as the overall % delta-8 desaturation.
In FIG. 73, the combined percent delta-5 desaturation for DGLA and ETA is shown as "%D5", determined as: ([EPA + ARA]/[DGLA + ETA + EPA + ARA])*100. This is also referred to as the overall % delta-5 desaturation.
In summary of FIG. 73, all three domains are functional. This fusion could be referred to as either EPA synthase or ARA synthase.
EXAMPLE 62 Functional Analyses Of Delta-9 Elonqase/Delta-8 Desaturase and Delta-8
Desaturase/Delta-9 Elongase Gene Fusions
The present example describes the transformation and expression in soybean somatic embryos of either pKR1326 (SEQ ID NO:512), comprising an EaD9Elo1- TpomD8 fusion (also called EaD9e/TpomD8) separated by the EaDHAsyni linker with an additional 3 amino acids (i.e. SEQ ID NO:504; PGGPGKPSEIASLPPPIRPVGNPPAAYYDALATGRT), or pKR1321 (SEQ ID NO:503; Example 58), comprising TpomD8/EaD9e fusion separated by the EgDHAsyni linker.
Soybean embryogenic suspension culture (cv. Jack) was transformed with pKR1326 or pKR1321 , and embryos were matured in soybean histodifferentiation and maturation liquid medium (SHaM liquid media; Schmidt et al., Cell Biology and Morphogenesis, 24:393 (2005)), as described in Example 25 and previously described in PCT Publication No. WO 2007/136877, published November 29, 2007 (the contents of which are hereby incorporated by reference).
After maturation in SHaM liquid media, a subset of transformed soybean embryos (i.e., δembryos per event) were harvested and analyzed as described herein. In this way, approximately 30 events transformed with pKR1326 (Experiment
MSE2275) were analyzed, and the five events having the highest average DGLA and ETA content (average of the 5 embryos analyzed) are shown in FIG. 74. In FIG. 74, fatty acids are identified as 16:0 (palmitate), 18:0 (stearic acid), 18:1 (oleic acid), LA, ALA, EDA, ERA, DGLA and DGLA, and ETA. Fatty acid compositions are expressed as a weight percent (wt. %) of total fatty acids.
In FIG. 74, elongation activity is expressed as % delta-9 elongation of C18 fatty acids (C18 % delta-9 elong), calculated according to the following formula: ([product]/[substrate + product])*100. More specifically, the combined percent elongation for LA and ALA is determined as: ([DGLA + ETA + EDA + ERA]/[LA + ALA + DGLA + ETA + EDA + ERA])*100.
In FIG. 74, the combined percent desaturation for EDA and ERA is shown as "C20 % delta-8 desat", determined as: ([DGLA + ETA]/[DGLA + ETA + EDA + ERA])*100. This is also referred to as the overall % desaturation.
In summary of FIG. 74, the EaDHAsyni linker functions similarly to the EgDHAsyni linker. No activity was detected for any of the events transformed with pKR1321 where TpomDδ was fused to EaD9e with the EgDHAsyni linker.

Claims

CLAIMS What is claimed is:
1. A multizyme comprising a single polypeptide having at least two independent and separable enzymatic activities.
2. The multizyme of claim 1 , wherein the enzymatic activities are selected from the group consisting of fatty acid elongases, fatty acid desaturases, acyl transferases, acyl CoA synthases, and thioesterases.
3. The multizyme of claim 1 , wherein the enzymatic activities comprises at least one fatty acid elongase linked to at least one fatty acid desaturase.
4. The multizyme of claim 2 or 3, wherein the fatty acid desaturase is selected from the group consisting of a delta-4 desaturase, a delta-5 desaturase, a delta-6 desaturase, a delta-8 desaturase, a delta-9 desaturase, a delta-12 desaturase, a delta-15 desaturase, or a delta-17 desaturase.
5. The multizyme of claim 2 or 3, wherein the fatty acid elongase is selected from the group consisting of a delta-9 elongase, a C14/16 elongase, a C16/18 elongase, a C18/20 elongase, or a C20/22 elongase.
6. The multizyme of claim 1 , 2, or 3, wherein a first enzymatic activity is linked to a second enzymatic activity and said link is selected from the group consisting of a polypeptide bond, SEQ ID NO:198 (EgDHAsyni linker), SEQ ID NO:200 (EgDHAsyn2 linker) and SEQ ID NO:235 (EaDHAsyni linker), SEQ ID NO:438, SEQ ID NO:472, SEQ ID NO:445, and SEQ ID NO:504.
7. An isolated polynucleotide encoding a DHA synthase comprising:
(a) a nucleotide sequence encoding a polypeptide having DHA synthase activity, wherein the polypeptide has at least 80% amino acid identity, based on the Clustal V method of alignment, when compared to an amino acid sequence as set forth in SEQ ID NO:12, SEQ ID NO:22, SEQ ID NO:95, SEQ ID NO:96, or SEQ ID NO:97;
(b) a nucleotide sequence encoding a polypeptide having DHA synthase activity wherein the nucleotide sequence has at least 80% sequence identity, based on the BLASTN method of alignment, when compared to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:205, SEQ ID NO:21 , SEQ ID NO:91, SEQ ID NO:92, SEQ ID NO:93, or SEQ ID NO:410; (c) a nucleotide sequence encoding a polypeptide having DHA synthase activity, wherein the nucleotide sequence hybridizes under stringent conditions to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:205, SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, or SEQ ID NO:410; or
(d) a complement of the nucleotide sequence of (a), (b), or (c), wherein the complement and the nucleotide sequence consist of the same number of nucleotides and are 100% complementary.
8. The polynucleotide of claim 7, wherein the nucleotide sequence comprises SEQ ID NO:11 , SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, or SEQ ID NO: 410.
9. The polypeptide of claim 7, wherein the amino acid sequence of the polypeptide comprises SEQ ID NO:12, SEQ ID NO:22, SEQ ID NO:95, SEQ ID NO:96, or SEQ ID NO:97.
10. An isolated polynucleotide encoding a C20 elongase comprising:
(a) a nucleotide sequence encoding a polypeptide having C20 elongase activity, wherein the polypeptide has at least 80% amino acid identity, based on the Clustal V method of alignment, when compared to an amino acid sequence as set forth in SEQ ID NO:12, SEQ ID NO:22, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:202 (EgDHAsyni C20 elongase domain), SEQ ID NO:204 (EgDHAsyn2 C20 elongase domain), SEQ ID NO:231 (EaDHAsyni C20 elongase domain), SEQ ID NO:232 (EaDHAsyn2 C20 elongase domain), or SEQ ID NO:233 (EaDHAsyn3 C20 elongase domain);
(b) a nucleotide sequence encoding a polypeptide having C20 elongase activity wherein the nucleotide sequence has at least 80% sequence identity, based on the BLASTN method of alignment, when compared to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:183, SEQ ID NO:188, SEQ ID NO:201 (EgDHAsyni C20 elongase domain), SEQ ID NO:206 (EgDHAsyni* C20 elongase domain), SEQ ID NO:203 (EgDHAsyn2 C20 elongase domain), SEQ ID NO:227 (EaDHAsyni C20 elongase domain), SEQ ID NO:228 (EaDHAsyn2 C20 elongase domain), SEQ ID NO:229 (EaDHAsyn3 C20 elongase domain), or SEQ ID NO:230 (EaDHAsyn4 C20 elongase domain); (c) a nucleotide sequence encoding a polypeptide having C20 elongase activity, wherein the nucleotide sequence hybridizes under stringent conditions to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:183, SEQ ID NO:188, SEQ ID NO:201 (EgDHAsyni C20 elongase domain), SEQ ID NO:206
(EgDHAsyni* C20 elongase domain), SEQ ID NO:203 (EgDHAsyn2 C20 elongase domain), SEQ ID NO:227 (EaDHAsyni C20 elongase domain), SEQ ID NO:228 (EaDHAsyn2 C20 elongase domain), SEQ ID NO:229 (EaDHAsyn3 C20 elongase domain) or SEQ ID NO:230 (EaDHAsyn4 C20 elongase domain); or (d) a complement of the nucleotide sequence of (a), (b), or (c), wherein the complement and the nucleotide sequence consist of the same number of nucleotides and are 100% complementary.
11. An isolated polynucleotide encoding a delta-4 desaturase comprising:
(a) a nucleotide sequence encoding a polypeptide having delta-4 desaturase activity, wherein the polypeptide has at least 80% amino acid identity, based on the Clustal V method of alignment, when compared to an amino acid sequence as set forth in SEQ ID NO:12, SEQ ID NO:22, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:193, SEQ ID NO:215, SEQ ID NO:217, SEQ ID NO:221 , SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , SEQ ID NO:246, SEQ ID NO:247, SEQ ID NO:248, SEQ ID NO:249, SEQ ID NO:382, SEQ ID NO:384, SEQ ID NO:386, SEQ ID NO:388, SEQ ID NO:404, SEQ ID NO:406 or SEQ ID NO:408;
(b) a nucleotide sequence encoding a polypeptide having delta-4 desaturase activity wherein the nucleotide sequence has at least 80% sequence identity, based on the BLASTN method of alignment, when compared to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:192, SEQ ID NO:214, SEQ ID NO:216, SEQ ID NO:220, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:242, SEQ ID NO:243, SEQ ID NO:244, SEQ ID NO:245; SEQ ID NO:381 , SEQ ID NO:383, SEQ ID NO:385, SEQ ID NO:387, SEQ ID NO:403, SEQ ID NO:405, or SEQ ID NO:407;
(c) a nucleotide sequence encoding a polypeptide having delta-4 desaturase activity, wherein the nucleotide sequence hybridizes under stringent conditions to a nucleotide sequence as set forth in SEQ ID NO:11 , SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:214, SEQ ID NO:216, SEQ ID NO:220, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:242, SEQ ID NO:243, SEQ ID NO:244, SEQ ID NO:245, SEQ ID NO:192, SEQ ID NO:381 , SEQ ID NO:383, SEQ ID NO:385, SEQ ID NO:387, SEQ ID NO:403, SEQ ID NO:405, or SEQ ID NO:407; or
(d) a complement of the nucleotide sequence of (a), (b), or (c), wherein the complement and the nucleotide sequence consist of the same number of nucleotides and are 100% complementary.
12. An isolated polynucleotide encoding a DHA synthase, said polynucleotide comprising the sequence set forth in any of SEQ ID NO:11 , SEQ ID NO:205, SEQ ID NO:21 , SEQ ID NO:91 , SEQ ID NO:92, SEQ ID NO:93, or SEQ ID NO:410.
13. An isolated polynucleotide encoding a C20 elongase, said polynucleotide comprising the sequence set forth in any of SEQ ID NO:183, SEQ ID NO:188, SEQ
ID NO:201 (EgDHAsyni C20 elongase domain, SEQ ID NO:206 (EgDHAsyni* C20 elongase domain), SEQ ID NO:203 (EgDHAsyn2 C20 elongase domain), SEQ ID NO:227 (EaDHAsyni C20 elongase domain), SEQ ID NO:228 (EaDHAsyn2 C20 elongase domain), SEQ ID NO:229 (EaDHAsyn3 C20 elongase domain), or SEQ ID NO:230 (EaDHAsyn4 C20 elongase domain).
14. An isolated polynucleotide encoding a delta-4 desaturase, said polynucleotide comprising the sequence set forth in SEQ ID NO:192, SEQ ID NO:214, SEQ ID NO:220, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:242, SEQ ID NO:243, SEQ ID NO:244, SEQ ID NO:245, SEQ ID NO:381 , SEQ ID NO:383, SEQ ID NO:385, SEQ ID NO:387, SEQ ID NO:403, SEQ ID NO:405, or SEQ ID NO:407.
15. A recombinant construct comprising any of the isolated polynucleotides of claims 7-14 operably linked to at least one regulatory sequence.
16. A host cell comprising in its genome the recombinant construct of claim 15.
17. The host cell of claim 16, wherein said cell is selected from the group consisting of plants and yeast.
18. A transformed Yarrowia sp. comprising the recombinant construct of Claims 15
19. A method for transforming a cell, comprising transforming a cell with the recombinant construct of claim 15 and selecting those cells transformed with said recombinant construct.
20. A method for producing a transformed plant comprising transforming a plant cell with any of the polynucleotides of claims 7-14 and regenerating a plant from the transformed plant cell.
21. The method of claim 20 wherein the plant is a soybean plant.
22. A method for producing yeast comprising transforming a yeast cell with any of the polynucleotides of claims 7-14 and growing yeast from the transformed yeast cell.
23. A plant comprising in its genome the recombinant construct of claim 15.
24. The plant of claim 23, wherein the plant is an oilseed plant.
25. The plant of claim 23 or 24, wherein the plant is soybean.
26. Seed obtained from the plant of claim 23 or 24.
27. Seed obtained from the plant of claim 25.
28. Oil obtained from the seed of claim 26.
29. Oil obtained from the seed of claim 27.
30. Food or feed incorporating the oil of claim 26.
31. Food or feed incorporating the oil of claim 27.
32. A beverage incorporating the oil of claim 26.
33. A beverage incorporating the oil of claim 27.
34. An isolated nucleic acid molecule which encodes a C20 elongase as set forth in SEQ ID NO: 183 wherein at least 147 codons are codon-optimized for expression in Yarrowia sp.
35. An isolated nucleic acid molecule which encodes a C20 elongase as set forth in SEQ ID NO: 188 wherein at least 134 codons are codon-optimized for expression in Yarrowia sp.
36. An isolated nucleic acid molecule which encodes a delta-4 desaturase enzyme as set forth in SEQ ID NO: 192 wherein at least 285 codons are codon- optimized for expression in Yarrowia sp. multizyme comprising an elongase joined to a desaturase.
37. A method for making a multizyme which comprises:
(a) linking a first polypeptide with at least a second polypeptide wherein each polypeptide has an independent and separable enzymatic activity; and (b) evaluating the product of step (a) for the independent and separable enzymatic activities.
38. The method of claim 37, wherein the enzymatic activities are selected from the group consisting of fatty acid elongases, fatty acid desaturases, acyl transferases, acyl CoA synthases, and thioesterases.
39. The method of claim 37, wherein the enzymatic activities comprises at least one fatty acid elongase linked to at least one fatty acid desaturase.
40. The method of claims 38 or 39, wherein the fatty acid desaturase is selected from the group consisting of a delta-4 desaturase, a delta-5 desaturase, a delta-6 desaturase, a delta-8 desaturase, a delta-9 desaturase, a delta-12 desaturase, a delta-15 desaturase, or a delta-17 desaturase.
41. The method of claims 38 or 39, wherein the fatty acid elongase is selected from the group consisting of a delta-9 elongase, a C14/16 elongase, a Ci6/i8 elongase, a C18/20 elongase, or a C20/22 elongase.
42. The method of claims 37, 38, or 39, wherein the link is selected from the group consisting of a polypeptide bond, SEQ ID NO:198 (EgDHAsyni linker amino acid sequence), SEQ ID NO:200 (EgDHAsyn2 linker), SEQ ID NO:235 (EaDHAsyni linker), SEQ ID NO:435, SEQ ID NO:438, SEQ ID NO:472, and SEQ ID NO:504.
43. A method for altering the fatty acid profile of an oilseed plant comprising:
(a) transforming an oilseed plant cell with the recombinant construct of claim 15; and
(b) regenerating a plant from the transformed oilseed plant cell step (a), wherein the plant has an altered fatty acid profile.
44. The method of claim 43, wherein the oilseed plant is soybean.
45. Progeny of the plant of claim 23.
46. A recombinant microbial host cell comprising the multizyme of Claim 6, wherein the first enzymatic activity is a delta-9 elongase and the second enzymatic activity is a delta-8 desaturase.
47. A recombinant microbial host cell comprising the multizyme of Claim 6, wherein the first enzymatic activity is a C20 elongase and the second enzymatic activity is a delta-4 desaturase.
48. The recombinant host cell of either Claim 46 or 47, wherein the host cell is an oleaginous yeast.
49. A method for the conversion of linoleic acid to dihomo gamma-linolenic acid comprising: a) providing a recombinant microbial host cell comprising: i) a DGLA synthase comprising: 1) at least one polypeptide encoding a delta-9 elongase;
2) at least one polypeptide encoding a delta-8 desaturase; and
3) a polypeptide linker; wherein the linker is interposed between the delta-9 elongase and the delta-8 desaturase; and ii) a source of linoleic acid; and b) growing the host cell of (a) under conditions whereby dihomo gamma-linolenic acid is produced.
50. A method for the conversion of α-linolenic acid to eicosatrienoic acid comprising: a) providing a recombinant microbial host cell comprising: i) a DGLA synthase comprising:
1) at least one polypeptide encoding a delta-9 elongase;
2) at least one polypeptide encoding a delta-8 desaturase; and
3) a polypeptide linker; wherein the linker is interposed between the delta-9 elongase and the delta-8 desaturase; and ii) a source of α-linolenic acid; and b) growing the host cell of (a) under conditions whereby eicosatrienoic acid is produced.
51. A method for the conversion of eicosapentaenoic acid to docosahexaenoic acid comprising: a) providing a recombinant microbial host cell comprising: i) a DHA synthase comprising: 1) at least one polypeptide encoding a C20 elongase;
2) at least one polypeptide encoding a delta-4 desaturase; and
3) a polypeptide linker wherein the linker is interposed between the C20 elongase and the delta-4 desaturase; and ii) a source of eicosapentaenoic acid; and b) growing the host cell of (a) under conditions whereby docosahexaenoic acid is produced.
52. A method for the conversion of arachidonic acid to docosapentaenoic acid comprising: a) providing a recombinant microbial host cell comprising: i) a DHA synthase comprising:
1) at least one polypeptide encoding a C20 elongase;
2) at least one polypeptide encoding a delta-4 desaturase; and
3) a polypeptide linker; wherein the linker is interposed between the C20 elongase and the delta-4 desaturase; and ii) a source of arachidonic acid; and b) growing the host cell of (a) under conditions whereby docosapentaenoic acid is produced.
53. The method of claim 49 or 50, wherein the DGLA synthase has the amino acid sequence selected from the group consisting of SEQ ID NO:441 , SEQ ID NO:447, SEQ ID NO:454, SEQ ID NO:461 , SEQ ID NO:464, SEQ ID NO:471 , SEQ ID NO:515, SEQ ID NO:516, SEQ ID NO:517, SEQ ID NO:518, and SEQ ID No:519.
54. The method of claim 51 or 52, wherein the DHA synthase has the amino acid sequence selected from the group consisting of SEQ ID NO:12, SEQ ID NO:22, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, and SEQ ID NO:411.
55. The method of any one of claims 49 - 52, wherein the polypeptide linker has an amino acid sequence selected from the group consisting of SEQ ID NO:198,
SEQ ID NO:200, SEQ ID NO:235, SEQ ID NO:438, SEQ ID NO:445, SEQ ID NO:472, and SEQ ID NO:504.
56. The method of claim 49 or 50, wherein the delta-9 elongase has an amino acid sequence selected from the group consisting of SEQ ID NO: 254, SEQ ID NO:255, SEQ ID NO:319, SEQ ID NO:359, SEQ ID NO:420, SEQ ID NO:422, and SEQ ID NO:513.
57. The method of claim 49 or 50, wherein the delta-8 desaturase has an amino acid sequence selected from the group consisting of SEQ ID NO: 328, SEQ ID NO:424, SEQ ID NO:426, SEQ ID NO:428, SEQ ID NO:430, and SEQ ID NO:514.
58. The method of claim 51 or 52, wherein the C20 elongase has an amino acid sequence selected from the group consisting of SEQ ID NO:12, SEQ ID NO:22, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:202, SEQ ID NO:204, SEQ ID NO:231 , SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:184, and SEQ ID NO: 189.
59. The method of claim 51 or 52, wherein the delta-4 desaturase has an amino acid sequence selected from the group consisting of SEQ ID NO:12, SEQ ID NO:22, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:215, SEQ ID NO:221 , SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241 , SEQ ID NO:246, SEQ ID NO:247, SEQ ID NO:248, SEQ ID NO:249, SEQ ID NO:382, SEQ ID NO:384, SEQ ID NO:386, SEQ ID NO:388, SEQ ID NO:404, SEQ ID NO:406, SEQ ID NO:408, and SEQ ID NO:193.
60. A method for the identification of a polypeptide having improved delta-4 desaturase activity comprising: a) providing a wild-type delta-4 desaturase polypeptide isolated from
Euglena anabena having a base-line delta-4 desaturase activity; and b) truncating the wild-type polypeptide of (a) by from about 1 to about 200 amino acids to create a truncated mutant polypeptide having delta-4 desaturase activity that is increased as compared with the base-line delta-4 desaturase activity.
61. The method of claim 60, wherein the wild-type polypeptide is truncated at the N-terminal portion of the polypeptide
62. The method of claim 60, wherein the wild-type polypeptide is truncated by from about 1 to about 70 amino acids.
63. A microbial host cell which produces a polyunsaturated fatty acid and expresses polypeptides encoding enzymes in the following sequential pathway :
1) a delta-9 desaturase, 2) a delta-12 desaturase,
3) a delta-9 elongase,
4) a delta-8 desaturase,
5) a delta-5 desaturase,
6) a delta-17 desaturase, 7) a C20/22 elongase, and
8) a delta-4 desaturase; wherein the polypeptides comprise at least one multizyme, the fusion comprising a fusion between at least one contiguous enzyme pair.
64. The microbial host of claim 63, wherein the polyunsaturated fatty acid is selected from the group consisting of an ω-3 fatty acid and an ω-6 fatty acid.
65. An isolated polynucleotide encoding a DGLA synthase comprising:
(a) a nucleotide sequence encoding a polypeptide having DGLA synthase activity, wherein the polypeptide is set forth in SEQ ID NO:441 , SEQ ID NO:447, SEQ ID NO:454, SEQ ID NO:461 , SEQ ID NO:464, SEQ ID NO:471 , SEQ ID NO:515, SEQ ID NO:516, SEQ ID NO:517, SEQ ID NO:518, or SEQ ID NO:519;
(b) a nucleotide sequence encoding a polypeptide having DGLA synthase activity wherein the nucleotide sequence is set forth in SEQ ID NO:440, SEQ ID NO:446, SEQ ID NO:453, SEQ ID NO:460, SEQ ID NO:463, SEQ ID NO:470, SEQ ID NO:492, SEQ ID NO:493, SEQ ID NO:494, SEQ ID NO:495, or SEQ ID NO:496;
(c) a nucleotide sequence encoding a polypeptide having DGLA synthase activity, wherein the nucleotide sequence hybridizes under stringent conditions to a nucleotide sequence as set forth in SEQ ID NO:440, SEQ ID NO:446, SEQ ID NO:453, SEQ ID NO:460, SEQ ID NO:463, SEQ ID NO:470, SEQ ID NO:492, SEQ ID NO:493, SEQ ID NO:494, SEQ ID NO:495, or SEQ ID NO:496; or
(d) a complement of the nucleotide sequence of (a), (b) or (c), wherein the complement and the nucleotide sequence consist of the same number of nucleotides and are 100% complementary.
PCT/US2008/004377 2007-04-03 2008-04-03 Multizymes and their use in making polyunsaturated fatty acids WO2008124048A2 (en)

Priority Applications (12)

Application Number Priority Date Filing Date Title
EP08727277.9A EP2129777B1 (en) 2007-04-03 2008-04-03 Multizymes and their use in making polyunsaturated fatty acids
MX2009010574A MX2009010574A (en) 2007-04-03 2008-04-03 Multizymes and their use in making polyunsaturated fatty acids.
RSP-2009/0413A RS20090413A (en) 2007-04-03 2008-04-03 Multizymes and their use in making polyunsaturated fatty acids
JP2010502136A JP2010523113A (en) 2007-04-03 2008-04-03 Multizyme and its use in the production of polyunsaturated fatty acids
AU2008236723A AU2008236723B2 (en) 2007-04-03 2008-04-03 Multizymes and their use in making polyunsaturated fatty acids
CA002679988A CA2679988A1 (en) 2007-04-03 2008-04-03 Multizymes and their use in making polyunsaturated fatty acids
CN200880018608XA CN101765658B (en) 2007-04-03 2008-04-03 Multizymes and use thereof in making polyunsaturated fatty acids
BRPI0808577A BRPI0808577A2 (en) 2007-04-03 2008-04-03 multenzyme, isolated polynuleotides, recombinant construct, host cell, yarrowia sp. cell transformation method, method for producing a transformed plant, yeast, plant, seeds, oils, food or feed, beverages, isolated nucleic acid molecules, multienzyme production method, method for altering the profile fatty acids, progeny, recombinant microbial host cells, methods for converting linoleic acid to dihomo gamma-linolenic acid, α-linolenic acid to eicosatrienoic acid, eicosapentaenoic acid to docosahexaenoic acid, arachidonic acid to docosapentaenoic acid, method for identification of a polypeptide
UAA200910197A UA103595C2 (en) 2007-04-03 2008-04-03 Multizymes and using thereof in production of polyunsaturated fatty acids
DK08727277.9T DK2129777T3 (en) 2007-04-03 2008-04-03 MULTIZYMER AND USE THEREOF FOR THE PRODUCTION OF polyunsaturated fatty acids
ES08727277.9T ES2559312T3 (en) 2007-04-03 2008-04-03 Multienzymes and their use in the manufacture of polyunsaturated fatty acids
RU2009140397/10A RU2517608C2 (en) 2007-04-03 2008-04-03 Multizymes and use thereof in producing polyunsaturated fatty acids

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US90979007P 2007-04-03 2007-04-03
US60/909,790 2007-04-03
US2789808P 2008-02-12 2008-02-12
US61/027,898 2008-02-12

Publications (3)

Publication Number Publication Date
WO2008124048A2 true WO2008124048A2 (en) 2008-10-16
WO2008124048A3 WO2008124048A3 (en) 2009-02-05
WO2008124048A8 WO2008124048A8 (en) 2011-11-03

Family

ID=39650608

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2008/004377 WO2008124048A2 (en) 2007-04-03 2008-04-03 Multizymes and their use in making polyunsaturated fatty acids

Country Status (14)

Country Link
US (2) US8828690B2 (en)
EP (1) EP2129777B1 (en)
JP (1) JP2010523113A (en)
CN (2) CN101765658B (en)
BR (1) BRPI0808577A2 (en)
CA (1) CA2679988A1 (en)
DK (1) DK2129777T3 (en)
ES (1) ES2559312T3 (en)
HU (1) HUE028692T2 (en)
MX (1) MX2009010574A (en)
RS (1) RS20090413A (en)
RU (2) RU2517608C2 (en)
UA (1) UA103595C2 (en)
WO (1) WO2008124048A2 (en)

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009124225A1 (en) * 2008-04-03 2009-10-08 E. I. Du Pont De Nemours And Company Multizymes
US20100192238A1 (en) * 2007-07-31 2010-07-29 Bioriginal Food & Science Corp. Elongases and methods for producing polyunsaturated fatty acids in transgenic organisms
WO2011006948A1 (en) * 2009-07-17 2011-01-20 Basf Plant Science Company Gmbh Novel fatty acid desaturases and elongases and uses thereof
WO2011008510A2 (en) 2009-06-30 2011-01-20 E. I. Du Pont De Nemours And Company Plant seeds with altered storage compound levels, related constructs and methods involving genes encoding cytosolic pyrophosphatase
WO2011008803A3 (en) * 2009-07-17 2011-04-07 Abbott Laboratories Novel δ9-elongase for production of polyunsaturated fatty acid-enriched oils
WO2011053898A2 (en) 2009-10-30 2011-05-05 E. I. Du Pont De Nemours And Company Plants and seeds with altered storage compound levels, related constructs and methods involving genes encoding proteins with similarity to bacterial 2,4-dihydroxy-hept-2-ene-1,7-dioic acid class ii-like aldolase proteins
WO2011054801A1 (en) 2009-11-03 2011-05-12 Dsm Ip Assets B.V. Vegatable oil comprising a polyunsaturaded fatty acid having at least 20 carbon atoms
WO2011054800A1 (en) 2009-11-03 2011-05-12 Dsm Ip Assets B.V. Composition comprising cells and a polyunsaturated fatty acid having at least 20 carbon atoms (lc-pufa)
WO2011062748A1 (en) 2009-11-23 2011-05-26 E.I. Du Pont De Nemours And Company Sucrose transporter genes for increasing plant seed lipids
WO2011079005A1 (en) 2009-12-24 2011-06-30 E.I. Dupont De Nemours And Company Plant membrane bound o-acyl transferase (mboat) family protein sequences and their uses for altering fatty acid compositions
WO2011109618A2 (en) 2010-03-03 2011-09-09 E. I. Du Pont De Nemours And Company Plant seeds with altered storage compound levels, related constructs and methods involving genes encoding oxidoreductase motif polypeptides
WO2012003207A2 (en) 2010-07-01 2012-01-05 E. I. Du Pont De Nemours And Company Plant seeds with altered storage compound levels, related constructs and methods involving genes encoding pae and pae-like polypeptides
WO2012134978A2 (en) 2011-04-01 2012-10-04 Ice House America, Llc Ice bagging apparatus and methods
JP2012529913A (en) * 2009-06-16 2012-11-29 イー・アイ・デュポン・ドウ・ヌムール・アンド・カンパニー Yarrowia lipolytica optimized strain improved to produce high eicosapentaenoic acid
JP2012530182A (en) * 2009-06-16 2012-11-29 イー・アイ・デュポン・ドウ・ヌムール・アンド・カンパニー High eicosapentaenoic acid oil with improved YARROWIALIPOLYTICA optimized strain
EP2137304B1 (en) * 2007-04-16 2013-08-14 E. I. Du Pont de Nemours and Company Delta 9 elongases and their use in making polyunsaturated fatty acids
JP2013535987A (en) * 2010-08-26 2013-09-19 イー・アイ・デュポン・ドウ・ヌムール・アンド・カンパニー Recombinant microbial host cells for high level eicosapentaenoic acid production
EP3160482A4 (en) * 2014-06-27 2018-02-14 Commonwealth Scientific and Industrial Research Organisation Lipid comprising docosapentaenoic acid
WO2018109059A1 (en) 2016-12-15 2018-06-21 Dsm Ip Assets B.V. Blend formulation comprising silicate and microbial and / or plant cells comprising a polyunsaturated fatty acid having at least 20 carbon atoms (lc-pufa)
US10125084B2 (en) 2013-12-18 2018-11-13 Commonwealth Scientific And Industrial Research Organisation Lipid comprising docosapentaenoic acid

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103451246B (en) 2004-04-22 2018-02-16 联邦科学技术研究组织 With recombinant cell synthesis of long-chain polyunsaturated fatty acids
BRPI0510132A (en) 2004-04-22 2007-10-02 Commw Scient Ind Res Org long chain polyunsaturated fatty acid synthesis by recombinant cells
CN101578363A (en) 2006-08-29 2009-11-11 联邦科学技术研究组织 Synthesis of fatty acids
US7943365B2 (en) * 2007-05-03 2011-05-17 E.I. Du Pont De Nemours And Company Δ-5 desaturases and their use in making polyunsaturated fatty acids
EP2195415A1 (en) * 2007-10-03 2010-06-16 E. I. du Pont de Nemours and Company Optimized strains of yarrowia lipolytica for high eicosapentaenoic acid production
US8168858B2 (en) 2008-06-20 2012-05-01 E. I. Du Pont De Nemours And Company Delta-9 fatty acid elongase genes and their use in making polyunsaturated fatty acids
CN107858204B (en) 2008-11-18 2021-10-29 联邦科学技术研究组织 Enzymes and methods for producing omega-3 fatty acids
BRPI0917722A2 (en) 2008-12-18 2017-05-30 Du Pont transgenic organism and method for manipulating malonate content in a transgenic organism
DK2443248T3 (en) 2009-06-16 2018-03-12 Du Pont IMPROVEMENT OF LONG-CHAIN POLYUM Saturated OMEGA-3 AND OMEGA-6 FATTY ACID BIOS SYNTHESIS BY EXPRESSION OF ACYL-CoA LYSOPHOSPHOLIPID ACYL TRANSFERASES
US8399226B2 (en) * 2009-06-16 2013-03-19 E I Du Pont De Nemours And Company High eicosapentaenoic acid oils from improved optimized strains of Yarrowia lipolytica
WO2011087981A2 (en) 2010-01-15 2011-07-21 E. I. Du Pont De Nemours And Company Clinical benefits of eicosapentaenoic acid in humans
EP2542671A2 (en) * 2010-03-02 2013-01-09 Massachusetts Institute Of Technology Microbial engineering for the production of fatty acids and fatty acid derivatives
CA2795460A1 (en) 2010-04-22 2011-10-27 E. I. Du Pont De Nemours And Company Method for obtaining polyunsaturated fatty acid-containing compositions from microbial biomass
WO2012027698A1 (en) 2010-08-26 2012-03-01 E.I. Du Pont De Nemours And Company Mutant hpgg motif and hdash motif delta-5 desaturases and their use in making polyunsaturated fatty acids
US8980589B2 (en) 2010-08-26 2015-03-17 E I Du Pont De Nemours And Company Mutant delta-9 elongases and their use in making polyunsaturated fatty acids
KR20140000711A (en) 2010-12-30 2014-01-03 이 아이 듀폰 디 네모아 앤드 캄파니 Use of saccaromyces cerevisiae suc2 gene in yarrowia lipolytica for sucrose utilization
US20130040340A1 (en) 2011-02-07 2013-02-14 E. I. Du Pont De Nemours And Company Production of alcohol esters in situ using alcohols and fatty acids produced by microorganisms
WO2012109545A2 (en) 2011-02-11 2012-08-16 E. I. Du Pont De Nemours And Company Method for obtaining a lipid-containing composition from microbial biomass
EP2714724A1 (en) 2011-05-26 2014-04-09 E. I. Du Pont de Nemours and Company Expression of caleosin in recombinant oleaginous microorganisms to increase oil content therein
US20150089689A1 (en) * 2012-01-23 2015-03-26 E I Du Pont Nemours And Company Down-regulation of gene expression using artificial micrornas for silencing fatty acid biosynthetic genes
US8946460B2 (en) 2012-06-15 2015-02-03 Commonwealth Scientific And Industrial Research Organisation Process for producing polyunsaturated fatty acids in an esterified form
JP6351582B2 (en) 2012-06-19 2018-07-04 イー・アイ・デュポン・ドウ・ヌムール・アンド・カンパニーE.I.Du Pont De Nemours And Company Improved production of polyunsaturated fatty acids by co-expression of acyl CoA: lysophosphatidylcholine acyltransferase and phospholipid: diacylglycerol acyltransferase
JP2016502851A (en) 2012-12-21 2016-02-01 イー・アイ・デュポン・ドウ・ヌムール・アンド・カンパニーE.I.Du Pont De Nemours And Company Down-regulation of polynucleotides encoding SOU2 sorbitol-utilizing proteins to modify lipogenesis in microbial cells
EP3215611B1 (en) 2014-11-06 2019-08-21 E. I. du Pont de Nemours and Company Peptide-mediated delivery of rna-guided endonuclease into cells
US10393225B2 (en) * 2015-01-05 2019-08-27 Goodrich Corporation Integrated multi-function propulsion belt for air cushion supported aircraft cargo loading robot
WO2018129440A1 (en) 2017-01-09 2018-07-12 University Of Massachusetts Complexes for gene deletion and editing

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002057465A2 (en) * 2001-01-19 2002-07-25 Basf Plant Science Gmbh Method for producing polyunsaturated fatty acids, novel biosynthesis genes and novel plant expression constructs
WO2002057464A2 (en) * 2001-01-19 2002-07-25 Basf Plant Science Gmbh Method for the expression of biosynthetic genes in plant seeds using multiple expression constructs
WO2004090123A2 (en) * 2003-04-08 2004-10-21 Basf Plant Science Gmbh Δ-4 desaturases from euglena gracilis, expressing plants, and oils containing pufa
US20050273885A1 (en) * 2004-04-22 2005-12-08 Singh Surinder P Synthesis of long-chain polyunsaturated fatty acids by recombinant cells
WO2006052807A2 (en) * 2004-11-04 2006-05-18 E.I. Dupont De Nemours And Company A dna molecule of mortierella alpina lpaat homolog

Family Cites Families (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH07501701A (en) 1991-12-04 1995-02-23 イー・アイ・デユポン・ドウ・ヌムール・アンド・カンパニー Plant-derived fatty acid desaturase gene
US5547868A (en) * 1993-06-09 1996-08-20 Regents Of The University Of California Cholesterol disposal fusion enzymes
US5968809A (en) 1997-04-11 1999-10-19 Abbot Laboratories Methods and compositions for synthesis of long chain poly-unsaturated fatty acids
US6075183A (en) 1997-04-11 2000-06-13 Abbott Laboratories Methods and compositions for synthesis of long chain poly-unsaturated fatty acids in plants
US6677145B2 (en) 1998-09-02 2004-01-13 Abbott Laboratories Elongase genes and uses thereof
US6403349B1 (en) 1998-09-02 2002-06-11 Abbott Laboratories Elongase gene and uses thereof
AU776711B2 (en) 1998-12-07 2004-09-16 Washington State University Research Foundation Desaturases and methods of using them for synthesis of polyunsaturated fatty acids
US6825017B1 (en) 1998-12-07 2004-11-30 Washington State University Research Foundation Desaturases and methods of using them for synthesis of polyunsaturated fatty acids
ATE485385T1 (en) * 2000-01-19 2010-11-15 Martek Biosciences Corp SOLVENT-FREE EXTRACTION PROCESS
WO2002026946A2 (en) 2000-09-28 2002-04-04 Bioriginal Food & Science Corporation Fad4, fad5, fad5-2, and fad6, fatty acid desaturase family members and uses thereof
EP1392823B1 (en) 2001-01-25 2011-11-30 Abbott Laboratories Desaturase genes and uses thereof
GB0107510D0 (en) 2001-03-26 2001-05-16 Univ Bristol New elongase gene and a process for the production of -9-polyunsaturated fatty acids
US7045683B2 (en) 2001-05-04 2006-05-16 Abbott Laboratories Δ4-desaturase genes and uses thereof
AU2003296638B2 (en) * 2002-12-19 2009-06-11 University Of Bristol Method for the production of polyunsaturated fatty acids
US20040172682A1 (en) 2003-02-12 2004-09-02 Kinney Anthony J. Production of very long chain polyunsaturated fatty acids in oilseed plants
US7214491B2 (en) 2003-05-07 2007-05-08 E. I. Du Pont De Nemours And Company Δ-12 desaturase gene suitable for altering levels of polyunsaturated fatty acids in oleaginous yeasts
US7125672B2 (en) 2003-05-07 2006-10-24 E. I. Du Pont De Nemours And Company Codon-optimized genes for the production of polyunsaturated fatty acids in oleaginous yeasts
US7238482B2 (en) 2003-05-07 2007-07-03 E. I. Du Pont De Nemours And Company Production of polyunsaturated fatty acids in oleaginous yeasts
EP2169053B1 (en) * 2003-08-01 2015-09-09 BASF Plant Science GmbH Method for production of polyunsaturated fatty acids in transgenic organisms
CA2542574C (en) 2003-11-12 2014-03-18 E. I. Du Pont De Nemours And Company Delta-15 desaturases suitable for altering levels of polyunsaturated fatty acids in oleaginous plants and yeast
US7504259B2 (en) 2003-11-12 2009-03-17 E. I. Du Pont De Nemours And Company Δ12 desaturases suitable for altering levels of polyunsaturated fatty acids in oleaginous yeast
US9458436B2 (en) * 2004-02-27 2016-10-04 Basf Plant Science Gmbh Method for producing polyunsaturated fatty acids in transgenic plants
BRPI0510132A (en) 2004-04-22 2007-10-02 Commw Scient Ind Res Org long chain polyunsaturated fatty acid synthesis by recombinant cells
AU2005267202B2 (en) 2004-06-25 2011-10-27 Corteva Agriscience Llc Delta-8 desaturase and its use in making polyunsaturated fatty acids
US7470532B2 (en) 2005-10-19 2008-12-30 E.I. Du Pont De Nemours And Company Mortierella alpina C16/18 fatty acid elongase
CA2625855C (en) * 2005-11-23 2016-04-19 E. I. Du Pont De Nemours And Company Delta-9 elongases and their use in making polyunsaturated fatty acids

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002057465A2 (en) * 2001-01-19 2002-07-25 Basf Plant Science Gmbh Method for producing polyunsaturated fatty acids, novel biosynthesis genes and novel plant expression constructs
WO2002057464A2 (en) * 2001-01-19 2002-07-25 Basf Plant Science Gmbh Method for the expression of biosynthetic genes in plant seeds using multiple expression constructs
WO2004090123A2 (en) * 2003-04-08 2004-10-21 Basf Plant Science Gmbh Δ-4 desaturases from euglena gracilis, expressing plants, and oils containing pufa
US20050273885A1 (en) * 2004-04-22 2005-12-08 Singh Surinder P Synthesis of long-chain polyunsaturated fatty acids by recombinant cells
WO2006052807A2 (en) * 2004-11-04 2006-05-18 E.I. Dupont De Nemours And Company A dna molecule of mortierella alpina lpaat homolog

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
CHIRALA SUBRAHMANYAM S ET AL: "Animal fatty acid synthase: Functional mapping and cloning and expression of the domain I constituent activities" PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, vol. 94, no. 11, 1997, pages 5588-5593, XP002491682 ISSN: 0027-8424 *
DOMERGUE FRÉDÉRIC ET AL: "Cloning and functional characterization of Phaeodactylum tricornutum front-end desaturases involved in eicosapentaenoic acid biosynthesis." EUROPEAN JOURNAL OF BIOCHEMISTRY / FEBS AUG 2002, vol. 269, no. 16, August 2002 (2002-08), pages 4105-4113, XP002228745 ISSN: 0014-2956 *
MEYER A ET AL: "Biosynthesis of docosahexaenoic acid in Euglena gracilis: Biochemical and molecular evidence for the involvement of a DELTA4-fatty acyl group desaturase" BIOCHEMISTRY, AMERICAN CHEMICAL SOCIETY, vol. 42, no. 32, 19 August 2003 (2003-08-19), pages 9779-9788, XP002298344 ISSN: 0006-2960 *
MEYER ASTRID ET AL: "Novel fatty acid elongases and their use for the reconstitution of docosahexaenoic acid biosynthesis" JOURNAL OF LIPID RESEARCH, vol. 45, no. 10, October 2004 (2004-10), pages 1899-1909, XP009046591 ISSN: 0022-2275 *
PEREIRA SUZETTE L ET AL: "Identification of two novel microalgal enzymes involved in the conversion of the omega3-fatty acid, eicosapentaenoic acid, into docosahexaenoic acid" BIOCHEMICAL JOURNAL, vol. 384, no. Part 2, 1 December 2004 (2004-12-01), pages 357-366, XP004594626 ISSN: 0264-6021 *

Cited By (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2137304B1 (en) * 2007-04-16 2013-08-14 E. I. Du Pont de Nemours and Company Delta 9 elongases and their use in making polyunsaturated fatty acids
EP2134837B1 (en) * 2007-04-16 2014-01-01 E. I. Du Pont de Nemours and Company Delta 9 elongases and their use in making polyunsaturated fatty acids
US20100192238A1 (en) * 2007-07-31 2010-07-29 Bioriginal Food & Science Corp. Elongases and methods for producing polyunsaturated fatty acids in transgenic organisms
US8318914B2 (en) * 2007-07-31 2012-11-27 Bioriginal Food & Science Corp. Elongases and methods for producing polyunsaturated fatty acids in transgenic organisms
WO2009124225A1 (en) * 2008-04-03 2009-10-08 E. I. Du Pont De Nemours And Company Multizymes
JP2016101172A (en) * 2009-06-16 2016-06-02 イー・アイ・デュポン・ドウ・ヌムール・アンド・カンパニーE.I.Du Pont De Nemours And Company High eicosapentaenoic acid oils from improved optimized strains of yarrowia lipolytica
JP2012530182A (en) * 2009-06-16 2012-11-29 イー・アイ・デュポン・ドウ・ヌムール・アンド・カンパニー High eicosapentaenoic acid oil with improved YARROWIALIPOLYTICA optimized strain
JP2012529913A (en) * 2009-06-16 2012-11-29 イー・アイ・デュポン・ドウ・ヌムール・アンド・カンパニー Yarrowia lipolytica optimized strain improved to produce high eicosapentaenoic acid
WO2011008510A2 (en) 2009-06-30 2011-01-20 E. I. Du Pont De Nemours And Company Plant seeds with altered storage compound levels, related constructs and methods involving genes encoding cytosolic pyrophosphatase
WO2011008803A3 (en) * 2009-07-17 2011-04-07 Abbott Laboratories Novel δ9-elongase for production of polyunsaturated fatty acid-enriched oils
US9493520B2 (en) 2009-07-17 2016-11-15 Basf Plant Science Company Gmbh Fatty acid desaturases and elongases and uses thereof
US10351870B2 (en) 2009-07-17 2019-07-16 Basf Plant Science Company Gmbh Uses of novel fatty acid desaturases and elongases and products thereof
US8188335B2 (en) 2009-07-17 2012-05-29 Abbott Laboratories Δ9-elongase for production of polyunsaturated fatty acid-enriched oils
RU2558302C2 (en) * 2009-07-17 2015-07-27 Эбботт Лэборетриз Novel delta-9-elongase for producing polyunsaturated fatty acid-enriched oils
WO2011006948A1 (en) * 2009-07-17 2011-01-20 Basf Plant Science Company Gmbh Novel fatty acid desaturases and elongases and uses thereof
AU2010273508B2 (en) * 2009-07-17 2015-02-05 Abbott Laboratories Novel delta9-elongase for production of polyunsaturated fatty acid-enriched oils
WO2011053898A2 (en) 2009-10-30 2011-05-05 E. I. Du Pont De Nemours And Company Plants and seeds with altered storage compound levels, related constructs and methods involving genes encoding proteins with similarity to bacterial 2,4-dihydroxy-hept-2-ene-1,7-dioic acid class ii-like aldolase proteins
WO2011054800A1 (en) 2009-11-03 2011-05-12 Dsm Ip Assets B.V. Composition comprising cells and a polyunsaturated fatty acid having at least 20 carbon atoms (lc-pufa)
WO2011054801A1 (en) 2009-11-03 2011-05-12 Dsm Ip Assets B.V. Vegatable oil comprising a polyunsaturaded fatty acid having at least 20 carbon atoms
WO2011062748A1 (en) 2009-11-23 2011-05-26 E.I. Du Pont De Nemours And Company Sucrose transporter genes for increasing plant seed lipids
US8637733B2 (en) 2009-12-24 2014-01-28 E.I. Du Pont De Nemours And Company Plant membrane bound O-acyl transferase (MBOAT) family protein sequences and their uses for altering fatty acid compositions
WO2011079005A1 (en) 2009-12-24 2011-06-30 E.I. Dupont De Nemours And Company Plant membrane bound o-acyl transferase (mboat) family protein sequences and their uses for altering fatty acid compositions
US9006514B2 (en) 2009-12-24 2015-04-14 E. I. Du Pont De Nemours And Company Plant membrane O-acyl transferase (MBOAT) family protein sequences and their uses for altering fatty acid compositions
EP2860254A1 (en) 2010-03-03 2015-04-15 E. I. du Pont de Nemours and Company Plant seeds with altered storage compound levels, related constructs and methods involving genes encoding oxidoreductase motif polypeptides
EP2865761A1 (en) 2010-03-03 2015-04-29 E. I. Du Pont de Nemours and Company Plant seeds with altered storage compound levels, related constructs and methods involving genes encoding oxidoreductase motif polypeptides
WO2011109618A2 (en) 2010-03-03 2011-09-09 E. I. Du Pont De Nemours And Company Plant seeds with altered storage compound levels, related constructs and methods involving genes encoding oxidoreductase motif polypeptides
WO2012003207A2 (en) 2010-07-01 2012-01-05 E. I. Du Pont De Nemours And Company Plant seeds with altered storage compound levels, related constructs and methods involving genes encoding pae and pae-like polypeptides
EP2609203B1 (en) * 2010-08-26 2018-06-20 E. I. du Pont de Nemours and Company Recombinant microbial host cells for high eicosapentaenoic acid production
JP2013535987A (en) * 2010-08-26 2013-09-19 イー・アイ・デュポン・ドウ・ヌムール・アンド・カンパニー Recombinant microbial host cells for high level eicosapentaenoic acid production
WO2012134978A2 (en) 2011-04-01 2012-10-04 Ice House America, Llc Ice bagging apparatus and methods
US10125084B2 (en) 2013-12-18 2018-11-13 Commonwealth Scientific And Industrial Research Organisation Lipid comprising docosapentaenoic acid
US11623911B2 (en) 2013-12-18 2023-04-11 Commonwealth Scientific And Industrial Research Organisation Lipid comprising docosapentaenoic acid
EP3160482A4 (en) * 2014-06-27 2018-02-14 Commonwealth Scientific and Industrial Research Organisation Lipid comprising docosapentaenoic acid
WO2018109059A1 (en) 2016-12-15 2018-06-21 Dsm Ip Assets B.V. Blend formulation comprising silicate and microbial and / or plant cells comprising a polyunsaturated fatty acid having at least 20 carbon atoms (lc-pufa)

Also Published As

Publication number Publication date
ES2559312T3 (en) 2016-02-11
RU2517608C2 (en) 2014-05-27
BRPI0808577A2 (en) 2019-09-24
CN104152423A (en) 2014-11-19
MX2009010574A (en) 2009-10-22
US20150031096A1 (en) 2015-01-29
EP2129777B1 (en) 2015-10-28
HUE028692T2 (en) 2016-12-28
DK2129777T3 (en) 2016-01-25
CA2679988A1 (en) 2008-10-16
US20080254191A1 (en) 2008-10-16
CN101765658A (en) 2010-06-30
CN101765658B (en) 2013-11-20
WO2008124048A3 (en) 2009-02-05
WO2008124048A8 (en) 2011-11-03
JP2010523113A (en) 2010-07-15
RS20090413A (en) 2010-06-30
EP2129777A2 (en) 2009-12-09
AU2008236723A1 (en) 2008-10-16
RU2009140397A (en) 2011-05-10
UA103595C2 (en) 2013-11-11
RU2014102130A (en) 2015-08-27
US8828690B2 (en) 2014-09-09

Similar Documents

Publication Publication Date Title
US8828690B2 (en) Multizymes comprising delta-9 elongase and delta-8 desaturase and their use in making polyunsaturated fatty acids
EP2121737B1 (en) Delta-8 desaturases and their use in making polyunsaturated fatty acids
AU2008240028B2 (en) Delta 9 elongases and their use in making polyunsaturated fatty acids
AU2008247766B2 (en) Delta-5 desaturases and their use in making polyunsaturated fatty acids
US20110184200A1 (en) Delta-8 desaturase and its use in making polyunsaturated fatty acids
WO2007136877A2 (en) Delta-5 desaturase and its use in making polyunsaturated fatty acids
US20080095915A1 (en) Delta-8 Desaturases And Their Use In Making Polyunsaturated Fatty Acids
AU2008236723B2 (en) Multizymes and their use in making polyunsaturated fatty acids
AU2013245497A1 (en) Multizymes and their use in making polyunsaturated fatty acids
AU2014202508A1 (en) Delta 9 elongases and their use in making polyunsaturated fatty acids
AU2014240293A1 (en) Delta-8 desaturases and their use in making polyunsaturated fatty acids

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200880018608.X

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08727277

Country of ref document: EP

Kind code of ref document: A2

WWE Wipo information: entry into national phase

Ref document number: 2008236723

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: 2679988

Country of ref document: CA

WWE Wipo information: entry into national phase

Ref document number: 5826/DELNP/2009

Country of ref document: IN

ENP Entry into the national phase

Ref document number: 2008236723

Country of ref document: AU

Date of ref document: 20080403

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 2008727277

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: P-2009/0413

Country of ref document: RS

WWE Wipo information: entry into national phase

Ref document number: MX/A/2009/010574

Country of ref document: MX

ENP Entry into the national phase

Ref document number: 2010502136

Country of ref document: JP

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 12009501889

Country of ref document: PH

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 2009140397

Country of ref document: RU

ENP Entry into the national phase

Ref document number: PI0808577

Country of ref document: BR

Kind code of ref document: A2

Effective date: 20090930