CA2478297C - Optimizing glycan processing in plants - Google Patents

Optimizing glycan processing in plants Download PDF

Info

Publication number
CA2478297C
CA2478297C CA2478297A CA2478297A CA2478297C CA 2478297 C CA2478297 C CA 2478297C CA 2478297 A CA2478297 A CA 2478297A CA 2478297 A CA2478297 A CA 2478297A CA 2478297 C CA2478297 C CA 2478297C
Authority
CA
Canada
Prior art keywords
leu
pro
gly
val
arg
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CA2478297A
Other languages
French (fr)
Other versions
CA2478297A1 (en
Inventor
Hendrikus Antonius Cornelis Bakker
Dionisius Elisabeth Antonius Florack
Hendrik Jan Bosch
Gerard Johan Adolph Rouwendal
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Stichting Dienst Landbouwkundig Onderzoek DLO
Original Assignee
Stichting Dienst Landbouwkundig Onderzoek DLO
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Stichting Dienst Landbouwkundig Onderzoek DLO filed Critical Stichting Dienst Landbouwkundig Onderzoek DLO
Priority to CA2799805A priority Critical patent/CA2799805C/en
Publication of CA2478297A1 publication Critical patent/CA2478297A1/en
Application granted granted Critical
Publication of CA2478297C publication Critical patent/CA2478297C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/1048Glycosyltransferases (2.4)
    • C12N9/1051Hexosyltransferases (2.4.1)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/62DNA sequences coding for fusion proteins
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/52Genes encoding for enzymes or proenzymes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8242Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
    • C12N15/8257Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits for the production of primary gene products, e.g. pharmaceutical products, interferon
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8242Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
    • C12N15/8257Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits for the production of primary gene products, e.g. pharmaceutical products, interferon
    • C12N15/8258Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits for the production of primary gene products, e.g. pharmaceutical products, interferon for the production of oral vaccines (antigens) or immunoglobulins
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2317/00Immunoglobulins specific features
    • C07K2317/10Immunoglobulins specific features characterized by their source of isolation or production
    • C07K2317/13Immunoglobulins specific features characterized by their source of isolation or production isolated from plants
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2317/00Immunoglobulins specific features
    • C07K2317/40Immunoglobulins specific features characterized by post-translational modification
    • C07K2317/41Glycosylation, sialylation, or fucosylation
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/01Fusion polypeptide containing a localisation/targetting motif
    • C07K2319/02Fusion polypeptide containing a localisation/targetting motif containing a signal sequence

Abstract

The invention is directed to methods for optimizing glycan processing in organisms (and in particular, plants) so that a glycoprotein having complex type bi-antennary glycans and thus 5 containing galactose residues on both arms and which are devoid of (or reduce in) xylose and fucose can be obtained.
The invention is further directed to said glycoprotein obtained and host system comprising said protein.

Description

OPTIMIZING GLYCAN PROCESSING IN PLANTS

FIELD OF THE INVENTION
The invention is directed to methods for optimizing glycan processing of cell or an organism containing glycoproteins with N-glycans, in particular plants so that a glycoprotein having an N-glycan, high mannose type, hybrid or preferably complex type N-glycans, including but not limited to bi-antennary N-glycans, and containing a galactose residue on at least one arm of the N-glycan and which are devoid of (or reduced in) xylose and fucose residues can be obtained. The invention is further directed to said glycoprotein obtained and in particular a plant host system comprising said protein.

BACKGROUND OF THE INVENTION
N-linked glycans, specific oligosaccharide structures attached to asparagine residues of glycoproteins, can contribute significantly to the properties of the protein and, in turn, to the properties of the organism. Plant proteins can carry N-linked glycans but in marked contrast to mammals only few biological processes are known to which they contribute.
Biogenesis of N-linked glycans begins with the synthesis of a lipid linked oligosaccharide moiety (G1c3Man9G1cNAc2-) which is transferred en bloc to the nascent polypeptide chain in the endoplasmic reticulum (ER). Through a series of trimming reactions by exoglycosidases in the ER
and cis-Golgi compartments, the so-called "high mannose" (Man9G1cNAc2 to Man5G1cNAc2) glycans are formed. Subsequently, the formation of complex type glycans starts with the transfer of the first GlcNAc onto Man5G1cNAc2 by GnTI and further trimming by mannosidase II (Mani') to form G1cNAcMan3G1cNAc2. Complex glycan biosynthesis continues while the glycoprotein is progressing through the secretory pathway with the transfer in the Golgi apparatus of the second GlcNAc residue by GnTII as well as other monosaccharide residues onto the GlcNAcMan3G1cNAc2 under the action of several other glycosyl transferases.
Plants and mammals differ with respect to the formation of complex glycans (see Figure 1, which compares the glycosylation pathway of glycoproteins in plants and mammals). In plants, complex glycans are characterized by the presence off3(1,2)-xylose residues linked to the Man-3 and/or an a(1,3)-fucose residue linked to G1cNAc-1, instead of an a(1,6)-fucose residue linked to the G1cNAc-1. Genes encoding the corresponding xylosyl (XylT) and fucosyl (FucT) transferases have been isolated [Strasser et al., "Molecular cloning and functional expression of beta1,2-xylosyltransferase cDNA from Arabidopsis thaliana," FEBS Lett. 472:105 (2000);
Leiter et al., "Purification, cDNA cloning, and expression of GDP-L-Fuc:Asn-linked GlcNAc alpha 1,3-fucosyltransferase from mung beans," J. Biol. Chem. 274:21830 (1999)]. Plants do not possess f3(1,4)-galactosyltransferases nor a(2,6)sialyltransferases and consequently plant glycans lack the f3(1,4)-galactose and terminal a(2,6)NeuAc residues often found on mammalian glycans.
The final glycan structures are not only determined by the mere presence of enzymes involved in their biosynthesis and transport but to a large extent by the specific sequence of the various enzymatic reactions. The latter is controlled by discrete sequestering and relative position of these enzymes throughout the ER and Golgi, which is mediated by the interaction of determinants of the transferase and specific characteristics of the sub-Golgi compartment for which the transferase is destined. A number of studies using hybrid molecules have identified that the transmembrane domains of several glycosyltransferases, including that ofp(1,4)-galactosyltransferases, play a central role in their sub-Golgi sorting [Grabenhorst et al., J. Biol. Chem 274:36107 (1999); Colley, K., Glycobiology 7:1 (1997); Munro, S., Trends Cell Biol. 8:11(1998); Gleeson, P.A., Histochem.
Cell Biol. 109:517 (1998)1 Although plants and mammals have diverged a relatively long time ago, N-linked glycosylation seems at least partly conserved. This is evidenced by the similar though not identical glycan structures and by the observation that a mammalian GlcNAcTI gene complements a Arabidopsis mutant that is deficient in GlcNAcTI activity, and vice versa. The differences in glycan structures can have important consequences. For example, xylose and a(1,3)-fucose epitopes are known to be highly immunogenic and possibly allergenic in some circumstances, which may pose a problem when plants are used for the production of therapeutic glycoproteins.
Moreover, blood serum of many allergy patients contains IgE directed against these epitopes but also 50% of non-allergic blood donors contains in their sera antibodies specific for core-xylose whereas 25% have antibodies for core-alpha 1,3-fucose (Bardor et al., 2002, in press, Glycobiology) (Advance Access published December 17, 2002) which make these individuals at risk to treatments with recombinant proteins produced in plants containing fucose and/or xylose. In addition, this carbohydrate directed IgE in sera might cause false positive reaction in in vitro tests using plant extracts since there is evidence that these carbohydrate specific TgE's are not relevant for the allergenic reaction. In sum, a therapeutic failure with a glycoprotein produced in plants might be the result of accelerated clearance of the recombinant glycoprotein having xylose and/or fucose.
Accordingly, there is a need to better control glycosylation in plants, and particularly, glycosylation of glycoproteins intended for therapeutic use.

DEFINITIONS
To facilitate understanding of the invention, a number of terms as used in this specification are defined below.
The term "vector" refers to any genetic element, such as a plasmid, phage, transposon, cosmid, chromosome, retrovirus, virion, or similar genetic element, which is capable of replication when associated with the proper control elements and which can transfer gene sequences into cells and/or between cells. Thus, this term includes cloning and expression vehicles, as well as viral vectors.
The term "expression vector" as used herein refers to a recombinant DNA
molecule containing a desired coding sequence (or coding sequences) ¨ such as the coding sequence(s) for the hybrid enzyme(s) described in more detail below - and appropriate nucleic acid sequences necessary for the expression of the operably linked coding sequence in a particular host cell or organism.
Nucleic acid sequences necessary for expression in prokaryotes usually include a promoter, an operator (optional), and a ribosome binding site, often along with other sequences. Eukaryotic cells are known to utilize promoters, enhancers, and termination and polyadenylation signals. It is not intended that the present invention be limited to particular expression vectors or expression vectors with particular elements.
The term "transgenic" when used in reference to a cell refers to a cell which contains a transgene, or whose genome has been altered by the introduction of a transgene. The term "transgenic" when used in reference to a cell, tissue or to a plant refers to a cell, tissue or plant, respectively, which comprises a transgene, where one or more cells of the tissue contain a transgene (such as a gene encoding the hybrid enzyme(s) of the present invention), or a plant whose genome has been altered by the introduction of a transgene. Transgenic cells, tissues and plants may be produced by several methods including the introduction of a "transgene"
comprising nucleic acid (usually DNA) into a target cell or integration of the transgene into a chromosome of a target cell by way of human intervention, such as by the methods described herein.
The term "transgene" as used herein refers to any nucleic acid sequence which is introduced into the genome of a cell by experimental manipulations. A transgene may be an "endogenous DNA
sequence," or a "heterologous DNA sequence" (i.e., "foreign DNA"). The term "endogenous DNA
sequence" refers to a nucleotide sequence which is naturally found in the cell into which it is introduced so long as it does not contain some modification (e.g., a point mutation, the presence of a selectable marker gene, or other like modifications) relative to the naturally-occurring sequence.
The term "heterologous DNA sequence" refers to a nucleotide sequence which is ligated to, or is manipulated to become ligated to, a nucleic acid sequence to which it is not ligated in nature, or to which it is ligated at a different location in nature. Heterologous DNA is not endogenous to the cell into which it is introduced, but has been obtained from another cell.
Heterologous DNA also includes an endogenous DNA sequence which contains some modification.
Generally, although not necessarily, heterologous DNA encodes RNA and proteins that are not normally produced by the cell into which it is expressed. Examples of heterologous DNA include reporter genes, transcriptional and translational regulatory sequences, selectable marker proteins (e.g., proteins which confer drug resistance), or other similar elements.
The term "foreign gene" refers to any nucleic acid (e.g., gene sequence) which is introduced into the genome of a cell by experimental manipulations and may include gene sequences found in that cell so long as the introduced gene contains some modification (e.g., a point mutation, the presence of a selectable marker gene, or other like modifications) relative to the naturally-occurring gene.
The term "fusion protein" refers to a protein wherein at least one part or portion is from a first protein and another part or portion is from a second protein. The term "hybrid enzyme" refers to a fusion protein which is a functional enzyme, wherein at least one part or portion is from a first species and another part or portion is from a second species. Preferred hybrid enzymes of the present invention are functional glycosyltransferases (or portions thereof) wherein at least one part or portion is from a plant and another part or portion is from a mammal (such as human).

The term "introduction into a cell" or "introduction into a host cell" in the context of nucleic acid (e.g., vectors) is intended to include what the art calls "transformation" or "transfection" or "transduction." Transformation of a cell may be stable or transient ¨ and the present invention contemplates introduction of vectors under conditions where, on the one hand, there is stable expression, and on the other hand, where there is only transient expression.
The term "transient transformation" or "transiently transformed" refers to the introduction of one or more transgenes into a cell in the absence of integration of the transgene into the host cell's genome. Transient transformation may be detected by, for example, enzyme-linked immunosorbent assay (ELISA) which detects the presence of a polypeptide encoded by one or more of the transgenes. Alternatively, transient transformation may be detected by detecting the activity of the protein (e.g., antigen binding of an antibody) encoded by the transgene (e.g., the antibody gene).
The term "transient transformant" refers to a cell which has transiently incorporated one or more transgenes. In contrast, the term "stable transformation" or "stably transformed" refers to the introduction and integration of one or more transgenes into the genome of a cell. Stable transformation of a cell may be detected by Southern blot hybridization of genomic DNA of the cell with nucleic acid sequences which are capable of binding to one or more of the transgenes. Alternatively, stable transformation of a cell may also be detected by the polymerase chain reaction (PCR) of genomic DNA of the cell to amplify transgene sequences. The term "stable transformant" refers to a cell which has stably integrated one or more transgenes into the genomic DNA. Thus, a stable transformant is distinguished from a transient transformant in that, whereas genomic DNA from the stable transformant contains one or more transgenes, genomic DNA from the transient transformant does not contain a transgene.
The term "host cell" includes both mammalian (e.g. human B cell clones, Chinese hamster ovary cells, hepatocytes) and non-mammalian cells (e.g. insect cells, bacterial cells, plant cells). In one embodiment, the host cells are mammalian cells and the introduction of a vector expressing a hybrid protein of the present invention (e.g TmGnTII-GalT) inhibits (or at least reduces) fucosylation in said mammalian cells.
The term "nucleotide sequence of interest" refers to any nucleotide sequence, the manipulation of which may be deemed desirable for any reason (e.g., confer improved qualities, use for production of therapeutic proteins), by one of ordinary skill in the art.
Such nucleotide sequences include, but are not limited to, coding sequences of structural genes (e.g., reporter genes, selection marker genes, oncogenes, antibody genes, drug resistance genes, growth factors, and other like genes), and non-coding regulatory sequences which do not encode an mRNA or protein product, (e.g., promoter sequence, polyadenylation sequence, termination sequence, enhancer sequence, and other like sequences). The present invention contemplates host cells expressing a heterologous protein encoded by a nucleotide sequence of interest along with one or more hybrid enzymes.
The term "isolated" when used in relation to a nucleic acid, as in "an isolated nucleic acid sequence" refers to a nucleic acid sequence that is identified and separated from one or more other components (e.g., separated from a cell containing the nucleic acid, or separated from at least one contaminant nucleic acid, or separated from one or more proteins, one or more lipids) with which it is ordinarily associated in its natural source. Isolated nucleic acid is nucleic acid present in a form or setting that is different from that in which it is found in nature. In contrast, non-isolated nucleic acids are nucleic acids such as DNA and RNA which are found in the state they exist in nature. For example, a given DNA sequence (e.g., a gene) is found on the host cell chromosome in proximity to neighboring genes; RNA sequences, such as a specific mRNA sequence encoding a specific protein, are found in the cell as a mixture with numerous other mRNAs which encode a multitude of proteins. However, an isolated nucleic acid sequence comprising SEQ ID NO:1 includes, by way of example, such nucleic acid sequences in cells which ordinarily contain SEQ ID
NO:1 where the nucleic acid sequence is in a chromosomal or extrachromosomal location different from that of natural cells, or is otherwise flanked by a different nucleic acid sequence than that found in nature.
The isolated nucleic acid sequence may be present in single-stranded or double-stranded form.
When an isolated nucleic acid sequence is to be utilized to express a protein, the nucleic acid sequence will contain at a minimum at least a portion of the sense or coding strand (i.e., the nucleic acid sequence may be single-stranded). Alternatively, it may contain both the sense and anti-sense strands (i.e., the nucleic acid sequence may be double-stranded).
As used herein, the term "purified" refers to molecules, either nucleic or amino acid sequences, that are removed from their natural environment, isolated or separated. An "isolated nucleic acid sequence" is therefore a purified nucleic acid sequence.
"Substantially purified"
molecules are at least 60% free, preferably at least 75% free, and more preferably at least 90% free, from other components with which they are naturally associated. The present invention contemplates both purified (including substantially purified) and unpurified hybrid enzyme(s) (which are described in more detail below).
As used herein, the terms "complementary" or "complementarity" are used in reference to nucleotide sequences related by the base-pairing rules. For example, the sequence 5'-AGT-3' is complementary to the sequence 5'-ACT-3'. Complementarity can be "partial" or "total." "Partial"
complementarity is where one or more nucleic acid bases is not matched according to the base pairing rules. "Total" or "complete" complementarity between nucleic acids is where each and every nucleic acid base is matched with another base under the base pairing rules.
The degree of complementarity between nucleic acid strands has significant effects on the efficiency and strength of hybridization between nucleic acid strands.
A "complement" of a nucleic acid sequence as used herein refers to a nucleotide sequence whose nucleic acids show total complementarity to the nucleic acids of the nucleic acid sequence.
For example, the present invention contemplates the complements of SEQ ID NOS:
1, 3, 5, 9, 27, 28, 29, 30, 31, 32, 33, 34, 35, 37, 38, 40, 41 and 43.
The term "homology" when used in relation to nucleic acids refers to a degree of complementarity. There may be partial homology (i.e., partial identity) or complete homology (L e., complete identity). A partially complementary sequence is one that at least partially inhibits a completely complementary sequence from hybridizing to a target nucleic acid and is referred to using the functional term "substantially homologous." The inhibition of hybridization of the completely complementary sequence to the target sequence may be examined using a hybridization assay (Southern or Northern blot, solution hybridization and the like) under conditions of low stringency. A substantially homologous sequence or probe (i.e., an oligonucleotide which is capable of hybridizing to another oligonucleotide of interest) will compete for and inhibit the binding (i.e., the hybridization) of a completely homologous sequence to a target under conditions of low stringency. This is not to say that conditions of low stringency are such that non-specific binding is permitted; low stringency conditions require that the binding of two sequences to one another be a specific (i.e., selective) interaction. The absence of non-specific binding may be tested by the use of a second target which lacks even a partial degree of complementarity (e.g., less than about 30%
identity); in the absence of non-specific binding the probe will not hybridize to the second non-complementary target.
When used in reference to a double-stranded nucleic acid sequence such as a cDNA or genomic clone, the term "substantially homologous" refers to any probe which can hybridize to either or both strands of the double-stranded nucleic acid sequence under conditions of low stringency as described infra.
When used in reference to a single-stranded nucleic acid sequence, the term "substantially homologous" refers to any probe which can hybridize to the single-stranded nucleic acid sequence under conditions of low stringency as described infra.
The term "hybridization" as used herein includes "any process by which a strand of nucleic acid joins with a complementary strand through base pairing." [Coombs J (1994) Dictionary of Biotechnology, Stockton Press, New York NY]. Hybridization and the strength of hybridization (i.e., the strength of the association between the nucleic acids) is impacted by such factors as the degree of complementarity between the nucleic acids, stringency of the conditions involved, the Tm of the formed hybrid, and the G:C ratio within the nucleic acids.
As used herein, the term "Tm" is used in reference to the "melting temperature." The melting temperature is the temperature at which a population of double-stranded nucleic acid molecules becomes half dissociated into single strands. The equation for calculating the Tm of nucleic acids is well known in the art. As indicated by standard references, a simple estimate of the Tn, value may be calculated by the equation: Tn, = 81.5 + 0.41(% G + C), when a nucleic acid is in aqueous solution at 1 M NaC1 [see e.g., Anderson and Young, Quantitative Filter Hybridization, in: Nucleic Acid Hybridization (1985)]. Other references include more sophisticated computations which take structural as well as sequence characteristics into account for the calculation of Tm.
Low stringency conditions when used in reference to nucleic acid hybridization comprise conditions equivalent to binding or hybridization at 68 C in a solution consisting of 5X SSPE
(Saline, Sodium Phosphate, EDTA) (43.8 g/lNaC1, 6.9 g/lNaH2PO4=H20 and 1.85 g/1 EDTA
(Ethylenediaminetetracetic Acid), pH adjusted to 7.4 with NaOH), 0.1% SDS
(Sodium dodecyl sulfate), 5X Denhardt's reagent [50X Denhardt's contains the following per 500 ml: 5 g Ficoll (Type 400, Pharmacia), 5 g BSA (Bovine Serum Albumin) (Fraction V; Sigma)] and 100 jig/m1 denatured salmon sperm DNA followed by washing in a solution comprising between 0.2X and 2.0X SSPE, and 0.1% SDS at room temperature when a DNA probe of about 100 to about 1000 nucleotides in length is employed.
High stringency conditions when used in reference to nucleic acid hybridization comprise conditions equivalent to binding or hybridization at 68 C in a solution consisting of 5X SSPE, 1%
SDS, 5X Denhardt's reagent and 100 g/ml denatured salmon sperm DNA followed by washing in a solution comprising 0.1X SSPE, and 0.1% SDS at 68 C when a probe of about 100 to about 1000 nucleotides in length is employed.
The term "equivalent" when made in reference to a hybridization condition as it relates to a hybridization condition of interest means that the hybridization condition and the hybridization condition of interest result in hybridization of nucleic acid sequences which have the same range of percent (%) homology. For example, if a hybridization condition of interest results in hybridization of a first nucleic acid sequence with other nucleic acid sequences that have from 50% to 70%
homology to the first nucleic acid sequence, then another hybridization condition is said to be equivalent to the hybridization condition of interest if this other hybridization condition also results in hybridization of the first nucleic acid sequence with the other nucleic acid sequences that have from 50% to 70% homology to the first nucleic acid sequence.
When used in reference to nucleic acid hybridization the art knows well that numerous equivalent conditions may be employed to comprise either low or high stringency conditions; factors such as the length and nature (DNA, RNA, base composition) of the probe and nature of the target (DNA, RNA, base composition, present in solution or immobilized) and the concentration of the salts and other components (e.g., the presence or absence of formamide, dextran sulfate, polyethylene glycol) are considered and the hybridization solution may be varied to generate conditions of either low or high stringency hybridization different from, but equivalent to, the above-listed conditions.
The term "promoter," "promoter element," or "promoter sequence" as used herein, refers to a DNA sequence which when ligated to a nucleotide sequence of interest is capable of controlling the transcription of the nucleotide sequence of interest into mRNA. A promoter is typically, though not necessarily, located 5' (i.e., upstream) of a nucleotide sequence of interest whose transcription into mRNA it controls, and provides a site for specific binding by RNA polymerase and other transcription factors for initiation of transcription.
Promoters may be tissue specific or cell specific. The term "tissue specific"
as it applies to a promoter refers to a promoter that is capable of directing selective expression of a nucleotide sequence of interest to a specific type of tissue (e.g., petals) in the relative absence of expression of the same nucleotide sequence of interest in a different type of tissue (e.g., roots). Tissue specificity of a promoter may be evaluated by, for example, operably linking a reporter gene to the promoter sequence to generate a reporter construct, introducing the reporter construct into the genome of a plant such that the reporter construct is integrated into every tissue of the resulting transgenic plant, and detecting the expression of the reporter gene (e.g., detecting mRNA, protein, or the activity of a protein encoded by the reporter gene) in different tissues of the transgenic plant. The detection of a greater level of expression of the reporter gene in one or more tissues relative to the level of expression of the reporter gene in other tissues shows that the promoter is specific for the tissues in which greater levels of expression are detected. The term "cell type specific"
as applied to a promoter refers to a promoter which is capable of directing selective expression of a nucleotide sequence of interest in a specific type of cell in the relative absence of expression of the same nucleotide sequence of interest in a different type of cell within the same tissue. The term "cell type specific" when applied to a promoter also means a promoter capable of promoting selective expression of a nucleotide sequence of interest in a region within a single tissue. Cell type specificity of a promoter may be assessed using methods well known in the art, e.g., immuno-histochemical staining. Briefly, tissue sections are embedded in paraffin, and paraffin sections are reacted with a primary antibody which is specific for the polypeptide product encoded by the nucleotide sequence of interest whose expression is controlled by the promoter. A labeled (e.g., peroxidase conjugated) secondary antibody which is specific for the primary antibody is allowed to bind to the sectioned tissue and specific binding detected (e.g., with avidin/biotin) by microscopy.
Promoters may be constitutive or regulatable. The term "constitutive" when made in reference to a promoter means that the promoter is capable of directing transcription of an operably linked nucleic acid sequence in the absence of a stimulus (e.g., heat shock, chemicals, light, or similar stimuli). Typically, constitutive promoters are capable of directing expression of a transgene in substantially any cell and any tissue. In contrast, a "regulatable"
promoter is one which is capable of directing a level of transcription of an operably linked nuclei acid sequence in the presence of a stimulus (e.g., heat shock, chemicals, light, or similar stimuli) which is different from the level of transcription of the operably linked nucleic acid sequence in the absence of the stimulus.
The terms "infecting" and "infection" with a bacterium refer to co-incubation of a target biological sample, (e.g., cell, tissue, plant part) with the bacterium under conditions such that nucleic acid sequences contained within the bacterium are introduced into one or more cells of the target biological sample.
The term "Agrobacterium" refers to a soil-borne, Gram-negative, rod-shaped phytopathogenic bacterium which causes crown gall. The term "Agrobacterium"
includes, but is not limited to, the strains Agrobacterium tumefaciens, (which typically causes crown gall in infected plants), and Agrobacterium rhizogens (which causes hairy root disease in infected host plants).
Infection of a plant cell with Agrobacterium generally results in the production of opines (e.g., nopaline, agropine, octopine) by the infected cell. Thus, Agrobacterium strains which cause production of nopaline (e.g., strain LBA4301, C58, A208) are referred to as "nopaline-type"
Agrobacteria; Agrobacteriuin strains which cause production of octopine (e.g., strain LBA4404, Ach5, B6) are referred to as "octopine-type" Agrobacteria; and Agrobacterium strains which cause production of agropine (e.g., strain EHA105, EHA101, A281) are referred to as "agropine-type"
Agrobacteria.
The terms "bombarding, "bombardment," and "biolistic bombardment" refer to the process of accelerating particles towards a target biological sample (e.g., cell, tissue, plant part ¨ such as a leaf, or intact plant) to effect wounding of the cell membrane of a cell in the target biological sample and/or entry of the particles into the target biological sample. Methods for biolistic bombardment are known in the art (e.g., U.S. Patent Nos. 5,584,807 and 5,141,131, the contents of both are herein incorporated by reference), and are commercially available (e.g., the helium gas-driven microprojectile accelerator (PDS-1000/He) (BioRad).
The term "microwounding" when made in reference to plant tissue refers to the introduction of microscopic wounds in that tissue. Microwounding may be achieved by, for example, particle bombardment as described herein. The present invention specifically contemplates schemes for introducing nucleic acid which employ microwounding.
The term "organism" as used herein refers to all organisms and in particular organisms containing glycoproteins with n-linked glycans.
The term "plant" as used herein refers to a plurality of plant cells which are largely differentiated into a structure that is present at any stage of a plant's development. Such structures include, but are not limited to, a fruit, shoot, stem, root, leaf, seed, flower petal, or similar structure.
The term "plant tissue" includes differentiated and undifferentiated tissues of plants including, but not limited to, roots, shoots, leaves, pollen, seeds, tumor tissue and various types of cells in culture (e.g., single cells, protoplasts, embryos, callus, protocorm-like bodies, and other types of cells).
Plant tissue may be in planta, in organ culture, tissue culture, or cell culture. Similarly, "plant cells"
may be cells in culture or may be part of a plant.
Glycosyltransferases are enzymes that catalyze the processing reactions that determine the structures of cellular oligosaccharides, including the oligosaccharides on glycoproteins. As used herein, "glycosyltransferase" is meant to include mannosidases, even though these enzymes trim glycans and do not "transfer" a monosaccharide. Glycosyltransferases share the feature of a type II
membrane orientation. Each glycosyltransferase is comprised of an amino terminal cytoplasmic tail (shown for illustration purposes below as a made up of a string of amino acids arbitrarily labeled "X"
- without intending to suggest the actual size of the region), a signal anchor domain (shown below as made up of a string of amino acids labeled "H" for hydrophobic ¨ without intending to suggest the actual size of the domain and without intending to suggest that the domain is only made up of hydrophobic amino acids) that spans the membrane (referred to herein as a "transmembrane domain"), followed by a luminal stem (shown below as made up of a string of amino acids arbitrarily labeled "S" ¨ without intended to suggest the actual size of the region) or stalk region, and a carboxy-terminal catalytic domain (shown below as made up of a string of amino acids arbitrarily labeled "C" ¨ without intending to suggest the actual size of the domain:

NH2-XXXX>LXHHHHHIIHHSSSSSSSSCCCCCCCC

Collectively, The Cytoplasmic Tail-Transmembrane-Stem Region or "CTS" (which has been underlined in the above schematic for clarity) can be used (or portions thereof) in embodiments contemplated by the present invention wherein the catalytic domain is exchanged or "swapped" with a corresponding catalytic domain from another molecule (or portions of such regions/domains) to create a hybrid protein.
For example, in a preferred embodiment, the present invention contemplates nucleic acid encoding a hybrid enzyme (as well as vectors containing such nucleic acid, host cells containing such vectors, and the hybrid enzyme itself), said hybrid enzyme comprising at least a portion of a CTS region [e.g., the cytoplasmic tail ("C"), the transmembrane domain ("T"), the cytoplasmic tail together with the transmembrane domain ("CT"), the transmembrane domain together with the stem ("TS"), or the complete CTS region] of a first glycosyltransferase (e.g. plant glycosyltransferase) and at least a portion of a catalytic region of a second glycosyltransferase (e.g.
mammalian glycosyltransferase). To create such an embodiment, the coding sequence for the entire CTS region (or portion thereof) may be deleted from nucleic acid coding for the mammalian glycosyltransferase and replaced with the coding sequence for the entire CTS region (or portion thereof) of a plant glycosyltransferase. On the other hand, a different approach might be taken to create this embodiment; for example, the coding sequence for the entire catalytic domain (or portion thereof) may be deleted from the coding sequence for the plant glycosyltransferase and replaced with the coding sequence for the entire catalytic domain (or portion thereof) of the mammalian glycosyltransferase. In such a case, the resulting hybrid enzyme would have the amino-terminal cytoplasmic tail of the plant glycosyltransferase linked to the plant glycosyltransferase transmembrane domain linked to the stem region of the plant glycosyltransferase in the normal manner of the wild-type plant enzyme ¨ but the stem region would be linked to the catalytic domain of the mammalian glycosyltransferase (or portion thereof).
It is not intended that the present invention be limited only to the two approaches outlined above. Other variations in the approach are contemplated. For example, to create nucleic acid encoding a hybrid enzyme, said hybrid enzyme comprising at least a portion of a transmembrane region of a plant glycosyltransferase and at least a portion of a catalytic region of a mammalian glycosyltransferase, one might use less than the entire coding sequence for the CTS region (e.g., only the transmembrane domain of the plant glycosytransferase, or the complete cytoplasmic tail together with all or a portion of the transmembrane domain, or the complete cytoplasmic tail together with all of the transmembrane domain together with a portion of the stem region). One might delete the mammalian coding sequence for the entire cytoplasmic tail together with the coding sequence for the transmembrane domain (or portion thereof) ¨ followed by replacement with the corresponding coding sequence for the cytoplasmic tail and transmembrane domain (or portion thereof) of the plant glycosyltransferase. In such a case, the resulting hybrid enzyme would have the stem region of the mammalian glycosyltransferase linked to the plant glycosyltransferase transmembrane domain (or portion thereof) which in turn would be linked to the amino-terminal cytoplasmic tail of the plant glycosyltransferase, with the stem region being linked to the catalytic domain of the mammalian glycosyltransferase (i.e. two of the four regions/domains would be of plant origin and two would be of mammalian origin).

In other embodiments, the present invention contemplates nucleic acid encoding a hybrid enzyme (along with vectors, host cells containing the vectors, plants ¨ or plant parts - containing the host cells), said hybrid enzyme comprising at least a portion of an amino-terminal cytoplasmic tail of a plant glycosyltransferase and at least a portion of a catalytic region of a mammalian glycosyltransferase. In this embodiment, the hybrid enzyme encoded by the nucleic acid might or might not contain other plant sequences (e.g., the transmembrane domain or portion thereof, the stem region or portion thereof). For example, to create such an embodiment, the coding sequence for the entire cytoplasmic tail (or portion thereof) may be deleted from nucleic acid coding for the mammalian glycosyltransferase and replaced with the coding sequence for the entire cytoplasmic domain (or portion thereof) of a plant glycosyltransferase. In such a case, the resulting hybrid enzyme would have the amino-terminal cytoplasmic tail (or portion thereof) of the plant glycosyltransferase linked to the mammalian glycosyltransferase transmembrane domain, which in turn is linked to stem region of the mammalian glycosyltransferase, the stem region being linked to the catalytic domain of the mammalian glycosyltransferase. On the other hand, a different approach might be taken to create this embodiment; for example, the coding sequence for the entire catalytic domain (or portion thereof) may be deleted from the coding sequence for the plant glycosyltransferase and replaced with the coding sequence for the entire catalytic domain (or portion thereof) of the mammalian glycosyltransferase. In such a case, the resulting hybrid enzyme would have the amino-terminal cytoplasmic tail of the plant glycosyltransferase linked to the plant glycosyltransferase transmembrane domain linked to the stem region of the plant glycosyltransferase in the normal manner of the wild-type plant enzyme ¨ but the stem region would be linked to the catalytic domain of the mammalian glycosyltransferase (or portion thereof).

In the above discussion, the use of the phrase "or portion thereof' was used to expressly indicate that less than the entire region/domain might be employed in the particular case (e.g., a fragment might be used). For example, the cytoplasmic tail of glycosyltransferases ranges from approximately 5 to 50 amino acids in length, and more typically 15 to 30 amino acids, depending on the particular transferase. A "portion" of the cytoplasmic tail region is herein defined as no fewer than four amino acids and can be as large as up to the full length of the region/domain less one amino acid. It is desired that the portion function in a manner analogous to the full length region/domain ¨ but need not function to the same degree. For example, to the extent the full-length cytoplasmic tail functions as a Golgi retention region or ER retention signal, it is desired that the portion employed in the above-named embodiments also function as a Golgi or ER
retention region, albeit perhaps not as efficiently as the full-length region.
Similarly, the transmembrane domain is typically 15-25 amino acids in length and made up of primarily hydrophobic amino acids. A "portion" of the transmembrane domain is herein defined as no fewer than ten amino acids and can be as large as up to the full length of the region/domain (for the particular type of transferase) less one amino acid. It is desired that the portion function in a manner analogous to the full length region/domain ¨ but need not function to the same degree. For example, to the extent the full-length transmembrane domain functions as the primary Golgi retention region or ER retention signal, it is desired that the portion employed in the above-named embodiments also function as a Golgi or ER retention region, albeit perhaps not as efficiently as the full-length region. The present invention specifically contemplates conservative substitutions to create variants of the wild-type transmembrane domain or portions thereof. For example, the present invention contemplates replacing one or more hydrophobic amino acids (shown as "H" in the schematic above) of the wild-type sequence with one or more different amino acids, preferably also hydrophobic amino acids.
A portion of the catalytic domain can be as large as the full length of the domain less on amino acid. Where the catalytic domain is from a beta1,4-galactosyltransferase, it is preferred that the portion include at a minimum residues 345-365 which are believed to be involved in the conformation conferring an oligosaccharide acceptor binding site (it is preferred that the portion include this region at a minimum and five to ten amino acids on either side to permit the proper conformation).
The present invention also includes synthetic CTS regions and portions thereof. A "portion"

of a CTS region must include at least one (and may include more than one) entire domain (e.g., the entire transmembrane domain) but less than the entire CTS region.
Importantly, by using the term "CTS region" or "transmembrane domain" it is not intended that only wild type sequences be encompassed. Indeed, this invention is not limited to natural glycosyltransferases and enzymes involved in glycosylation, but also includes the use of synthetic enzymes exhibit the same or similar function. In one embodiment, wild type domains are changed (e.g. by deletion, insertion, replacement and the like).
Finally, by using the indicator "Tm" when referring to a particular hybrid (e.g., "TmXyl-), entire transmembrane/CTS domains (with or without changes to the wild-type sequence) as well as portions (with or without changes to the wild-type sequence) are intended to be encompassed.
SUMMARY OF THE INVENTION
The present invention contemplates nucleic acid (whether DNA or RNA) encoding hybrid enzymes (or "fusion proteins"), vectors containing such nucleic acid, host cells (including but not limited to cells in plant tissue and whole plants) containing such vectors an expressing the hybrid enzymes, and the isolated hybrid enzyme(s) themselves. In one embodiment, expression of said hybrid enzymes (or "fusion proteins") results in changes in glycosylation, such as, but not limited to, reduction of sugar moieties such as xylose, fucose, LewisAlm or other sugar structures that interfere with desired glycoform accumulation. In one embodiment, the present invention contemplates nucleic acid encoding a hybrid enzyme, said hybrid enzyme comprising a CTS
region (or portion thereof) of a glycosyltransferase (including but not limited to a plant glycosyltransferase) and a catalytic region (or portion thereof) of a non-plant glycosyltransferase (e.g., mammalian, fish, amphibian, fungal). It is preferred that, when expressed, the CTS region (or portion thereof) is linked (directly or indirectly) in operable combination to said catalytic region (or portion thereof).
The linking is preferably covalent and the combination is operable in that the catalytic region exhibits catalytic function (even if said catalytic function is reduced as compared to the wild-type enzyme). The linking can be direct in the sense that there are no intervening amino acids or other regions/domains. On the other hand, the linking can be indirect in that there are intervening amino acids (or other chemical groups) and/or other regions/domains between them. Of course, the nucleic acid used to make the nucleic acid encoding the above-described hybrid enzyme(s) can be obtained enzymatically from a physical sequence (e.g. genomic DNA, a cDNA, and the like) or alternatively, made synthetically using a reference sequence (e.g. electronic or hardcopy sequence) as a guide.
In a particular embodiment, the present invention contemplates nucleic acid encoding a hybrid enzyme, said hybrid enzyme comprising a transmembrane region (e.g., at least a transmembrane region and optionally more of the CTS region) of a plant glycosyltransferase and a catalytic region (or portion thereof) of a non-plant (such as a mammalian) glycosyltransferase.
Again, it is preferred that, when expressed, these regions are linked (directly or indirectly) in operable combination. In yet another embodiment, the present invention contemplates nucleic acid encoding a hybrid enzyme, said hybrid enzyme comprising a transmembrane domain (or portion thereof) of a plant glycosyltransferase and a catalytic region (or portion thereof) of a mammalian glycosyltransferase. Again, it is preferred that, when expressed, these regions are linked (directly or indirectly) in operable combination.
It is not intended that the present invention be limited to particular transferases. In one embodiment, the plant glycosyltransferase is a xylosyltransferase. In another embodiment, the plant glycosyltransferase is a N-acetylglucosaminyltransferase. In another embodiment, the plant glycosyltransferase is a fucosyltransferase. In a preferred embodiment, the mammalian glycosyltransferase is a human galactosyltransferase (such as the human beta 1,4-galactosyltransferase encoded by SEQ ID NO:1 wherein the nucleotides encoding the transmembrane domain are deleted and replaced).
It is not intended that the present invention is limited to the use of a plant-derived glycosyltransferase CTS-domain and a human glycosyltransferase catalytic domain but also vice versa and the use of any CTS-domain of a glycosyltransferase in combination with the catalytic fragment of at least one other glycosyltransferase. Indeed, the present invention broadly contemplates, in one embodiment, nucleic acid encoding a hybrid enzyme, said hybrid enzyme comprising a transmembrane region of a first glycosyltransferase and a catalytic region of a second glycosyltransferase. It is preferred that said first and second glycosyltransferases are from different species (and can be from a different genus or even from a different phylum).
In one embodiment, said first glycosyltransferase comprises a plant glycosyltransferase. In another embodiment, said plant glycosyltransferase is a xylosyltransferase. In yet another embodiment, said plant glycosyltransferase is a fucosyltransferase. In a preferred embodiment said second glycosyltransferase comprises a mammalian glycosyltransferase. In a particularly preferred embodiment, said mammalian glycosyltransferase is a human galactosyltransferase.
It is not intended that the present invention be limited to circumstances where the first and second glycosyltransferases are plant and non-plant, respectively. In one embodiment, said first glycosyltransferase comprises a first mammalian glycosyltransferase and said second glycosyltransferase comprises a second mammalian glycosyltransferase. In a preferred embodiment, said first mammalian glycosyltransferase is a non-human glycosyltransferase and said second mammalian glycosyltransferase is a human glycosyltransferase.
It is not intended that the present invention be limited to the type of vector. In one embodiment, the present invention contemplates an expression vector, comprising the nucleic acid encoding the above-described hybrid enzyme.
It is also not intended that the present invention be limited to the type of host cells. A variety of prokaryotic and eukaryotic host cells are commercially available for expressing proteins. In one embodiment, the present invention contemplates a host cell containing the vector comprising the nucleic acid encoding the above-described hybrid enzyme (with or without other vectors or other nucleic acid encoding other hybrid enzymes or glycosyltransferases). In a preferred embodiment, the host cell is a plant cell. In a particularly preferred embodiment, the present invention contemplates a plant comprising such a host cell.
It is not intended that the present invention be limited by the method by which host cells are made to express the hybrid enzymes of the present invention. In one embodiment, the present invention contemplates a method, comprising: a) providing: i) a host cell (such as a plant cell, whether in culture or as part of plant tissue or even as part of an intact growing plant), and ii) an expression vector comprising nucleic acid encoding a hybrid enzyme, said hybrid enzyme comprising at least a portion of a CTS region of a plant glycosyltransferase (e.g. the transmembrane domain) and at least a portion of a catalytic region of a mammalian glycosyltransferase; and b) introducing said expression vector into said plant cell under conditions such that said hybrid enzyme is expressed. Again, it is not intended that the present invention be limited to particular transferases.
In one embodiment, the plant glycosyltransferase used in the above-described method is a xylosyltransferase. In another embodiment, the plant glycosyltransferase is a N-acetylglucosaminyltransferase. In another embodiment, the plant glycosyltransferase is a fucosyltransferase. In a preferred embodiment, the mammalian glycosyltransferase used in the above-described method is a human galactosyltransferase (such as the human beta 1,4-galactosyltransferase encoded by SEQ ID NO:1 wherein the nucleotides encoding the transmembrane domain are deleted and replaced) (or simply where the nucleotides of SEQ ID NO:1 encoding the catalytic domain, or portion thereof, are taken and linked to nucleotides encoding the CTS region, or portion thereof, of a plant glycosyltransferase.).
It is not intended that the present invention be limited to a particular scheme for controlling glycosylation of a heterologous protein using the hybrid enzymes described above. In one embodiment, the present invention contemplates a method, comprising: a) providing: i) a host cell (such as a plant cell), ii) a first expression vector comprising nucleic acid encoding a hybrid enzyme, said hybrid enzyme comprising at least a portion of a CTS region (e.g. at least a transmembrane domain) of a first (such as a plant) glycosyltransferase and at least a portion of a catalytic region of a second (such as a mammalian) glycosyltransferase, and iii) a second expression vector comprising nucleic acid encoding a heterologous glycoprotein; (or portion thereof; and b) introducing said first and second expression vectors into said plant cell under conditions such that said hybrid enzyme and said heterologous protein are expressed. Alternatively, a single vector with nucleic acid encoding both the hybrid enzyme (or hybrid enzymes) and the heterologous glycoprotein might be used.
Regardless of which method is used, the invention contemplates, in one embodiment, the additional step (c) of isolating the heterologous protein ¨ as well as the isolated protein itself as a composition.
On the other hand, the present invention also contemplates introducing different vectors into different plant cells (whether they are cells in culture, part of plant tissue, or even part of an intact growing plant). In one embodiment, the present invention contemplates a method, comprising: a) providing: i) a first plant comprising a first expression vector, said first vector comprising nucleic acid encoding a hybrid enzyme (or encoding two or more hybrid enzymes), said hybrid enzyme comprising at least a portion of a CTS region (e.g. the first approximately 40-60 amino acids of the N-terminus) of a plant glycosyltransferase and at least a portion of a catalytic region of a mammalian glycosyltransferase, and ii) a second plant comprising a second expression vector, said second vector comprising nucleic acid encoding a heterologous protein (or portion thereof);
and crossing said first plant and said second plant to produce progeny expressing said hybrid enzyme and said heterologous protein. Of course, such progeny can be isolated, grown up, and analyzed for the presence of each (or both) of the proteins. Indeed, the heterologous protein can be used (typically first purified substantially free of plant cellular material) therapeutically (e.g., administered to a human or animal, whether orally, by intravenous, transdermally or by some other route of administration) to treat or prevent disease.

It is not intended that the present invention be limited to a particular heterologous protein. In one embodiment, any peptide or protein that is not endogenous to the host cell (or organism) is contemplated. In one embodiment, the heterologous protein is an antibody or antibody fragment. In a particularly preferred embodiment, the antibody is a human antibody or "humanized" antibody expressed in a plant in high yield. "Humanized" antibodies are typically prepared from non-human antibodies (e.g. rodent antibodies) by taking the hypervariable regions (the so-called CDRs) of the non-human antibodies and "grafting" them on to human frameworks. The entire process can be synthetic (provided that the sequences are known) and frameworks can be selected from a database of common human frameworks. Many times, there is a loss of affinity in the process unless either the framework sequences are modified or the CDRs are modified. Indeed, increases in affinity can be revealed when the CDRs are systematically mutated (for example, by randomization procedures) and tested.
While the present invention is particularly useful in the context of heterologous proteins, in one embodiment, the hybrid enzymes of the present invention are used to change the glycosylation of endogenous proteins, i.e. proteins normally expressed by the host cell or organism.

The present invention specifically contemplates the plants themselves. In one embodiment, the present invention contemplates a plant, comprising first and second expression vectors, said first vector comprising nucleic acid encoding a hybrid enzyme, said hybrid enzyme comprising at least a portion of a CTS region (e.g. the cytoplasmic tail together with at least a portion of the transmembrane domain) of a plant glycosyltransferase and at least a portion of a catalytic region of a mammalian glycosyltransferase, said second expression vector, said second vector comprising nucleic acid encoding a heterologous protein (or portion thereof). In a preferred embodiment, by virtue of being expressed along with the hybrid enzyme (or hybrid enzymes) of the present invention, the heterologous protein displays reduced (10% to 99%) alpha 1,3 -fucosylation (or even no fucosylation), as compared to when the heterologous protein is expressed in the plant in the absence of the hybrid enzyme (or enzymes). In a preferred embodiment, by virtue of being expressed along with the hybrid enzyme (or hybrid enzymes) of the present invention, the heterologous protein displays reduced (10% to 99%) xylosylation (or even no xylose), as compared to when the heterologous protein is expressed in the plant in the absence of the hybrid enzyme (or enzymes). In a preferred embodiment, by virtue of being expressed along with the hybrid enzyme (or hybrid enzymes) of the present invention, the heterologous protein displays both reduced fucose and xylose, as compared to when the heterologous protein is expressed in the plant in the absence of the hybrid enzyme (or enzymes).

It is not intended that the present invention be limited to a particular theory by which reduced fucose and/or xylose is achieved. Very little is known about the sub-Golgi sorting mechanism in plants. The mammalian specific r3(1,4)-galactosyltransferase (GalT) has been used (see the Examples below) as an excellent first marker to study this phenomenon since it generates glycan structures not normally found in plants. The glycan structures of plants that express galactosyltransferase has been compared with glycan structures from plants that express a chimeric galactosyltransferase of which the CTS domain is exchanged for that of a plant xylosyltransferase (or portion thereof). The change in observed glycan structures show that the galactosyltransferase is, as in mammals, confined to a specific sub-compartment of the plant Golgi. Without limiting the invention to any particular mechanism, the sorting mechanism of plants and mammals are apparently conserved even to the extent that glycosyltransferases unknown to plants are routed to specific analogous location in the Golgi. This location is later in the Golgi than where the endogenous xylosyl-, fucosyl- and GlcNAcTII (GnTII) transferases are located.
The finding that N-glycans in these plants that express relocalised variants of GaIT
containing significantly less xylose and fucose is also of biotechnological relevance. For glycoproteins intended for therapeutic use in mammals, such as humans, the approach of certain embodiments of the present invention provides methods and compositions for controlling N-linked glycosylation of glycoproteins in plants so that glycoprotein essentially free of xylose and fucose and containing at least a bi-antennary N-glycans (but not limited to bi-antennary, also include tri-antennary, and the like) and (at least one) galactose residue on at least one of the arms of the N-glycan can be obtained. Hence, it is not intended that the present invention is limited to bi-antennary N-glycans but also includes bisected bi-antennary N-glycans, tri-antennary N-glycans, and the like.
Furthermore, the invention is not limited to complex¨type N-glycans but also includes hybrid-type N-glycans and other type N-glycans. The present invention contemplates such resulting glyco-proteins. In addition, the methods and compositions of the present invention may be applicable for plants and non-plant systems where besides xylose, fucose, LewismBix type N-glycan modifications (131-3-GalT, al-4-FucT, other) or other sugars, "interfere" with desired glycoform accumulation.
In one embodiment, the invention is directed to controlling N-linked glycosylation of plants by modulating the localization of enzymes involved in glycan biosynthesis in the Golgi apparatus.

Specifically, embodiments of the invention are directed to a method of producing in a plant host system a glycoprotein having bi-antennary glycans and containing at least one galactose residues on at least one of the arms and which are devoid (or reduced in) of xylose and fucose, comprising:(a) preventing (or inhibiting) addition of xylose and fucose on the core of the glycan of said glycoprotein and (b) adding one or preferably two galactose residues to said arms.
Addition of xylose and fucose to said heterologous glycoprotein may be reduced or even prevented by introducing to said plant host system a nucleic acid encoding a hybrid enzyme comprising a CTS region (or portion thereof) of a protein, particularly an enzyme such as plant xylosyltransferase and catalytic region (or portion thereof) of a galactosyltransferase not normally found in a plant, or a modified galactosyltransferase where its transmembrane portion has been removed and endoplasmic reticulum retention signal have been inserted, wherein said protein or enzyme acts earlier in the Golgi apparatus of a plant cell in said plant host system than said galactosyltransferase. It is preferred that the galactosyltransferase is a mammalian galactosyltransferase and in particular, a human galactosyltransferase. In a most specific embodiment, said galactosyltransferase is human 131,4 galactosyltransferase (GalT). In a preferred embodiment, said xylosyltransferase is a P1,2-xylosyltransferase. The exchange of the CTS region or CTS fragment of a mammalian glycosyltransferase (such as a galactosyltransferase) by one from the group of enzymes that act earlier in the Golgi apparatus than galactosyltransferase including but not limited to those from of XylT, FucT, GnTI, GnTII, GnTIII, GnTIV, GnTV, GnTVI, ManI, ManII and ManIII results in strongly reduced amounts of glycans that contain the undesired xylose and fucose residues (see Figure 2). In addition, galactosylation is improved and the diversity in glycans is reduced. While not limited to any particular mechanism, the increase in galactosylated glycans that carry neither xylose nor fucose is believed to be mainly attributed to the accumulation of GalGNMan5, GNMan5 or GalGNMan4. Also, galactosylation occurs on one glycan arm only.
Apparently, the galactosylation earlier in the Golgi inhibits trimming of the said glycoforms by Mannosidase II (ManII) to GalGNMan3. Also addition of the second GlcNAc by GlcNAcTII
(GnTII) is inhibited.

Therefore, in one embodiment, a further step is contemplated to obtain the desired glycoprotein that has both arms galactosylated and yet is essentially devoid of xylose and fucose.
Thus, in one embodiment, the method of the invention as noted above further comprises adding galactose residues to the arms of said glycoprotein (see Figure 3). In one embodiment of the invention, galactose residues are added onto both arms by introducing to said plant host system (a) a nucleic acid sequence encoding a first hybrid enzyme comprising the CTS region (or fragment, such as one including the transmembrane domain) of GnTI and the active domain (or portion thereof) of GnTII; (b) a nucleic acid sequence encoding the second hybrid enzyme comprising the CTS region (or fragment, such as one including the transmembrane of GnTI and the active domain of ManII and (c) a nucleic acid sequence encoding a third hybrid enzyme comprising the CTS
region (or fragment, such as one including the transmembrane domain) of XylT and the active domain (or portion thereof) of human galactosyltransferse (TmXyl-GalT). In another embodiment of the invention, galactose residues are added onto both arms by introducing to said plant host system (a) a nucleic acid sequence encoding a first hybrid enzyme comprising the CTS region (or fragment, such as one including the transmembrane domain) of Mani and the active domain (or portion thereof) of GnTI;
(b) a nucleic acid sequence encoding the second hybrid enzyme comprising the CTS region (or fragment, such as one including the transmembrane domain) of ManI and the active domain (or portion thereof) of GnTII; (c) a nucleic acid sequence encoding the third hybrid enzyme comprising the CTS region (or fragment, such as one including the transmembrane domain) of ManI and the active domain (or portion thereof) of ManII, and (d) a nucleic acid sequence encoding a fourth hybrid enzyme comprising the CTS region (or fragment, such as one including the transmembrane domain) of XylT and the active domain (or portion thereof) of human galactosyltransferse (TmXyl-GalT).

It is not intended that the present invention be limited to particular combinations of hybrid enzymes or the number of such hybrid enzymes employed in a single cell, plant tissue or plant. In a preferred embodiment, the present invention contemplates host cells expressing TmXyl-GalT plus TmGnTI-GnTII plus TmGnTI-ManII. In one embodiment of the invention, galactose residues are added to said arms by introducing to said plant host system (a) a nucleic acid sequence encoding a first hybrid enzyme comprising a CTS region (or fragment thereof) of a protein, particularly an enzyme, including but not limited to N-acetylglucosaminyltransferase I (GnTI) and a catalytic region (or portion thereof) of a mannosidase II (ManII), wherein said enzyme acts earlier in the Golgi apparatus of a plant cell in said plant host system than said mannosidase II
or modified mannosidase II where its transmembrane portion has been deleted and endoplasmic reticulum retention signal have been inserted and (b) a nucleic acid sequence encoding a second hybrid enzyme comprising a CTS region (or fragment, such as one including the transmembrane domain) of an enzyme including but not limited to N-acetyl-glucosaminyltransferase I (GnTI) and a catalytic region (or portion thereof) of a N-acetylglucosaminyl-transferase II (GnTII), wherein said enzyme acts earlier in the Golgi apparatus of a plant cell in said plant host system than said N
acetylglucosaminyl-transferaseII
(GnTII) or modified N-acetylglucosaminyltransferase II (GnTII) where its transmembrane portion has been deleted and an endoplasmic reticulum retention signal have been inserted. The sequences encoding N-acetylglucosaminyltransferases or mannosidase II or the said transmembrane fragments can originate form plants or from eukaryotic non-plant organisms (e.g., mammals).

In yet another preferred embodiment, the present invention contemplates a host cell expressing TmXyl-GaIT plus TmManI-GnTI plus TmManI-ManII plus TmManI-GnTII. In another embodiment of the invention, galactose residues are added to said arms by introducing to said plant host system (a) a nucleic acid sequence encoding a first hybrid enzyme comprising a CTS region (or fragment, such as one including the transmembrane domain) of a protein, particularly an enzyme, including but not limited to Mannosidase I (ManI) and a catalytic region (or portion thereof) of a N
acetylglucosaminyltransferase I (GnTI), wherein said enzyme acts earlier in the Golgi apparatus of a plant cell in said plant host system than said N-acetylglucosaminyl-transferase I (GnTI) or modified N acetylglucosaminyltransferase I (GnTI) where its transmembrane portion has been deleted and endoplasmic reticulum retention signal have been inserted and (b) a nucleic acid sequence encoding a second hybrid enzyme comprising a CTS region (or fragment, such as one including the transmembrane domain) of an enzyme including but not limited to Mannosidase I
(Mani) and a catalytic region (or portion thereof) of a Mannosidase II (ManII), wherein said enzyme acts earlier in the Golgi apparatus of a plant cell in said plant host system than said Mannosidase II (ManII) or modified Mannosidase II (ManII) where its transmembrane portion has been deleted and an endoplasmic reticulum retention signal have been inserted and (c) a nucleic acid sequence encoding a third hybrid enzyme comprising a CTS region (or fragment, such as one including the transmembrane domain) of an enzyme including but not limited to Mannosidase I
(ManI) and a catalytic region (or portion thereof) of a N-acetylglucos-aminyltransferase II
(GnTII), wherein said enzyme acts earlier in the Golgi apparatus of a plant cell in said plant host system than said N-acetylglucosaminyltransferase II (GnTII) or modified N-acetylglucosaminyltransferase II (GnTII) where its transmembrane portion has been deleted and an endoplasmic reticulum retention signal have been inserted. The sequences encoding N-acetylglucosaminyltransferases or mannosidases or the said transmembrane fragments can originate from plants or from eukaryotic non-plant organisms (e.g., mammals).

In still another preferred embodiment, the present invention contemplates host cells expressing TmXyl-GaIT plus ManIII. In another embodiment of the invention, galactose residues are added to said arms by introducing to said plant host system (a) a nucleic acid sequence encoding a Mannosidase III (ManIII, wildtype gene sequence but not limited to: also ManIII with endoplasmic reticulum retention signal; ManIII with transmembrane fragment of early (cis-) Golgi apparatus glycosyltransferase (GnTI, Mani, GnTIII). The sequences encoding Mannosidase III can originate form insects, preferably from Spodopterafrugiperda or Drosophila melanogaster (but not limited to), human or from other organisms.

In still another preferred embodiment, the present invention contemplates a host cell expressing TmXyl-GalT plus ManIII plus TmGnTI-GnTII. In yet another preferred embodiment, the present invention contemplates a host cell expressing TmXyl-GalT plus ManIII plus TmManI-GnTI plus TmManI-GnTII.

The method of the invention may optionally comprise, in one embodiment, introducing into said plant host system a mammalian N-acetylglucosaminyltransferase GnTIII, particularly a human GnTIII or hybrid protein comprising a catalytic portion of mammalian GnTIII
and a transmembrane portion of a protein, said protein residing in the ER or earlier compartment of the Golgi apparatus of a eukaryotic cell. For example, in one embodiment, the hybrid enzyme TmXyl-GnTIII is contemplated (along with nucleic acid coding for such a hybrid enzyme, vectors containing such nucleic acid, host cells containing such vectors, and plants ¨ or plant parts ¨ containing such host cells). In another embodiment, the hybrid enzyme TmFuc-GnTIII is contemplated (along with nucleic acid coding for such a hybrid enzyme, vectors containing such nucleic acid, host cells containing such vectors, and plants ¨ or plant parts ¨ containing such host cells). The present invention specifically contemplates host cells expressing such hybrid enzymes (with or without additional hybrid enzymes or other glycosyltransferases).

The invention is further directed to said hybrid and modified enzymes, nucleic acid sequences encoding said hybrid enzymes, vectors comprising said nucleic acid sequences and methods for obtaining said hybrid enzymes. Furthermore, the invention is directed to a plant host system comprising a heterologous glycoprotein having preferably complex type bi-antennary glycans and containing at least one galactose residue on at least one of the arms and are devoid of xylose and fucose. A "heterologous glycoprotein" is a glycoprotein originating from a species other than the plant host system. The glycoprotein may include but is not limited to antibodies, hormones, growth factors and growth factor receptors and antigens.
Indeed, the present invention is particularly useful for controlling the glycosylation of heterologous glycoproteins, such as antibodies or antibody fragments (single chain antibodies, Fab fragments, Fab2 fragments, Fv fragments, and the like). To control the glycosylation of an antibody, the gene construct encoding a hybrid enzyme of the present invention (e.g., the TmXyl-GalT gene construct) can be introduced in transgenic plants expressing an antibody (e.g., monoclonal antibody) or antibody fragment. On the other hand, the gene(s) encoding the antibody (or antibody fragment) can be introduced by retransformation of plant expressing TmXyl-GalT gene construct. In still another embodiment, the binary vector harbouring the TmXyl-GalT expression cassette can be co-transformed to plants together with a plant binary vector harbouring the expression cassettes comprising both light and heavy chain sequences of a monoclonal antibody on a single T-DNA or with binary vectors harbouring the expression cassettes for light and heavy chain sequences both separately on independent T-DNA's but both encoding a monoclonal antibody. The present invention specifically contemplates, in one embodiment, crossing plants expressing antibodies with plant expressing the hybrid glycosyltransferase(s) of the present invention.
A "host system" may include but is not limited to any organism containing glycoproteins with N-glycans.
A "plant host system" may include but is not limited to a plant or portion thereof, which includes but is not limited to a plant cell, plant organ and/or plant tissue.
The plant may be a monocotyledon (monocot) which is a flowering plant whose embryos have one cotyledon or seed leaf and includes but is not limited to lilies, grasses, corn (Zea mays), rice, grains including oats, wheat and barley, orchids, irises, onions and palms. Alternatively, the plant may be a dicotyledenon (dicot) which includes but is not limited to tobacco (Nicotiana), tomatoes, potatoes, legumes (e.g, alfalfa and soybeans), roses, daises, cacti, violets and duckweed. The plant may also be a moss which includes but is not limited to Physcomitrella patens.
The invention is further directed to a method for obtaining said plant host system. The method comprises crossing a plant expressing a heterologous glycoprotein with a plant comprising (a) a hybrid enzyme comprising a catalytic region (or portion thereof) of a galactosyltransferase not normally found in a plant and a CTS region (or fragment, such as one including the transmembrane domain) of a protein, wherein said protein acts earlier in the Golgi apparatus of a plant cell in said plant host system than said galactosyltransferase or a modified galactosyltransferase where its transmembrane portion has been deleted and endoplasmic reticulum retention signal has been inserted; (b) a hybrid enzyme comprising a CTS
region (or portion thereof, such as one including the transmembrane domain) of a protein, particularly an enzyme, including but not limited to N-acetylglucosaminyltransferase I (GnTI) and a catalytic region (or portion thereof) of a mannosidase II (Mani , wherein said enzyme acts earlier in the Golgi apparatus of a plant cell in said plant host system than said mannosidase II or modified mannosidase II
where its transmembrane portion has been deleted and endoplasmic reticulum retention signal have been inserted and (c) a hybrid enzyme comprising at least a transmembrane region of an enzyme (such as the first 40-60 amino acids of the N-terminus) of a glycosyltransferase including but not limited to N-acetylglucosaminyltransferase I (GnTI) and a catalytic region of a N-acetylglucos-aminyltransferase II (GnTII), wherein said enzyme acts earlier in the Golgi apparatus of a plant cell in said plant host system than said N acetylglucosaminyltransferase II (GnTII) or modified N-acetylglucosaminyl-transferase II (GnTII) where its transmembrane portion has been deleted and an endoplasmic reticulum retention signal have been inserted., harvesting progeny from said crossing and selecting a desired progeny plant expressing said heterologous glycoprotein.
The invention is further directed to said plant or portion thereof which would constitute a plant host system. Said plant host system may further comprise a mammalian GnTIII enzyme or hybrid protein comprising a catalytic portion of mammalian GnTIII and a transmembrane portion of a protein, said protein residing in the ER or earlier compartment of the Golgi apparatus of a eukaryotic cell.
Additionally, the invention also provides the use of a plant host system to produce a desired glycoprotein or functional fragment thereof. The invention additionally provides a method for obtaining a desired glycoprotein or functional fragment thereof comprising cultivating a plant according to the invention until said plant has reached a harvestable stage, for example when sufficient biomass has grown to allow profitable harvesting, followed by harvesting said plant with established techniques known in the art and fractionating said plant with established techniques known in the art to obtain fractionated plant material and at least partly isolating said glycoprotein from said fractionated plant material.
Alternatively, said plant host cell system comprising said heterologous glycoprotein may also be obtained by introducing into a plant host cell system or portion thereof (a) a nucleic acid sequence encoding a hybrid enzyme comprising a catalytic region of a galactosyltransferase not normally found in a plant and at least the transmembrane region (or more of the CTS) of a protein, wherein said protein acts earlier in the Golgi apparatus of a plant cell in said plant host system than said galactosyltransferase or a modified galactosyltransferase where its transmembrane portion has been deleted and endoplasmic reticulum retention signal have been inserted;
(b) a nucleic acid sequence encoding a first hybrid enzyme comprising at least the transmembrane region (or more of the CTS if desired) of a protein, particularly an enzyme, including but not limited to N-acetylglucosaminyltransferase I (GnTI) and a catalytic region of a mannosidase II (ManII) , wherein said enzyme acts earlier in the Golgi apparatus of a plant cell in said plant host system than said mannosidase II, or modified mannosidase II where its transmembrane portion has been deleted and endoplasmic reticulum retention signal have been inserted and (c) a nucleic acid sequence encoding a second hybrid enzyme comprising at least a transmembrane region (more of the CTS if desired) of an enzyme including but not limited to N-acetylglucosaminyl-transferase I
(GnTI) and a catalytic region of a N-acetylglucosaminyltransferase II (GnTII), wherein said enzyme acts earlier in the -23-, Golgi apparatus of a plant cell in said plant host system than said N-acetylglucos-aminyltransferase-II (GnTII) or modified N-acetylglucosaminyltransferase II (GnTII) where its transmembrane portion has been deleted and an endoplasmic reticulum retention signal have been inserted, and isolating a plant or portion thereof expressing said heterologous glycoprotein (or portion thereof). In one embodiment, one vector comprising all of the nucleic acid sequences is introduced into said plant host system. In another embodiment, each nucleic acid sequence is inserted into separate vectors and these vectors are introduced into said plant host system. In another embodiment combinations of two or more nucleic acid sequences are inserted into separate vectors which are than combined into said plant host system by retransformation or co-transformation or by crossing.
The invention also provides use of such a plant-derived glycoprotein or functional fragment thereof according to the invention for the production of a composition, particularly, pharmaceutical composition, for example for the treatment of a patient with an antibody, a hormone, a vaccine antigen, an enzyme, or the like. Such a pharmaceutical composition comprising a glycoprotein or functional fragment thereof is now also provided.
Finally, it is contemplated that the above-described approach may be useful in reducing the overall diversity in glycans in plants expressing one or more of the hybrid enzymes of the present invention (as compared to wild-type plants or plants simply transformed with only mammalian GalT).

BRIEF DESCRIPTION OF THE FIGURES
Figure 1 compares the glycosylation pathway of glycoproteins in plants and in mammals.
Figure 2 shows the effect of exchanging the CTS fragment of galactosyltransferase with xylosyltransferase Figure 3 shows the further effect of relocalizing mannosidase H and GlcNAcTII.
Figure 4 top panel shows a T-DNA construct carrying the genes encoding glycan modifying enzymes to produce efficiently galactosylated glycans that are devoid of immunogenic xylose and fucose and the bottom panel shows a T-DNA construct carrying antibody light chain and heavy chain genes.
Figure 5 shows the nucleic acid sequence (SEQ ID NO:1) for a human galactosyltransferase (human B1,4-galactosyltransferase ¨ GalT).
Figure 6 shows the nucleic acid sequence of Figure 5 along with the corresponding amino acid sequence (SEQ ID NO:2).
Figure 7 shows an illustrative mutated sequence (SEQ ID NO:59) derived the wild type amino acid sequence (SEQ ID NO:2) for a human galactosyltransferase, wherein a serine has been deleted from the cytoplasmic tail and a G-I-Y motif has been repeated. Of course, such changes are merely illustrative of the many possible changes within the scope of the present invention. For example, in one embodiment, the present invention contemplates mutated sequences wherein only deletions (one or more) are employed (e.g. deletions in the cytoplasmic tail domain or the stem domain) ¨ with no insertions or repeats. Similarly, in one embodiment, the present invention contemplates mutated sequences wherein only (one or more) insertions or replacements (e.g. in the transmembrane domain) are employed ¨ with no deletions.
Figure 8 shows the nucleic acid sequence (SEQ ID NO:3) encoding a hybrid enzyme comprising human galactosyltransferase (human B1,4-galactosyltransferase ¨
GalT). The upper case letters are nucleotides of Arabidopsis thaliana mRNA for beta 1,2-xylosyltransferase (database entry: EMBL:ATH277603, the TmXyl-fragment used involves nucleotides 135-297 of this database sequence).
Figure 9 shows the nucleic acid sequence of Figure 8 along with the corresponding amino acid sequence (SEQ ID NO:4).
Figure 10 shows the amino acid sequence (SEQ ID NO:4) for the hybrid enzyme encoded by the nucleic acid shown in Figure 8.
Figure 11 shows the nucleic acid sequence (SEQ ID NO:5) for the human glycosyltransferase GnTIII (along with additional sequence encoding a myc-tag) (primary accession number Q09327 GNT3 HUMAN).
Figure 12 shows the nucleic acid sequence of Figure 11 along with the corresponding amino acid sequence (SEQ ID NO:6).
Figure 13 shows the amino acid sequence (SEQ ID NO:6) for a human GnTIII
(along with additional amino acid sequence of the myc epitope tag SEQ ID NO:7).
Figure 14 shows the nucleic acid sequence (SEQ ID NO:9) encoding one embodiment of a hybrid enzyme of the present invention, said hybrid enzyme comprising the transmembrane domain of a plant xylosyltransferase (TmXyl-) and the catalytic domain (along with other regions) for human GnTIII (TmXyl-GnTIII) (along with additional sequence encoding a myc-tag).
Figure 15 shows the nucleic acid sequence of Figure 14 along with the corresponding amino acid sequence (SEQ ID NO:10).
Figure 16 shows the amino acid sequence (SEQ ID NO:10) for hybrid enzyme encoded by the nucleic acid of Figure 14 (along with additional sequence for the myc epitope tag SEQ ID NO:7).
Figure 17 shows the complete nucleic acid sequence (SEQ ID NO:27) for a cassette encoding the hybrid enzymes TmXyl-GalT plus TmGnTI-GnTII plus TmGnTI-ManII).
Figure 18 shows the complete nucleic acid sequence (SEQ ID NO:28) for a cassette encoding the hybrid enzyme TmGnTI-ManII (with the RbcS1 promoter sequence SEQ ID NO:39 shown).
Figure 19 shows the nucleic acid sequence (SEQ ID NO:29) encoding the hybrid enzyme TmGnTI-ManII.
Figure 20 shows the nucleic acid sequence (SEQ ID NO:30) encoding the hybrid enzyme TmGnTI-GnTII.
Figure 21 shows the nucleic acid sequence (SEQ ID NO:31) encoding the hybrid enzyme TmGnTI-GnTII, wherein the transmembrane fragment used (designated TmGntI) has the nucleic acid sequence set forth in SEQ ID NO:32.

Figure 22A shows the nucleic acid sequence (SEQ ID NO:32) encoding one embodiment of a transmembrane domain fragment (TmGnTI). Figure 22B shows the nucleic acid sequence (SEQ

ID NO:33) encoding another embodiment of a transmembrane domain fragment (TmManI).

Figure 23 shows the complete nucleic acid sequence (SEQ ID NO:34) for a triple cassette embodiment of the present invention.

Figure 24 shows the nucleic acid sequence (SEQ ID NO:35) for a hybrid gene expression cassette (TmManI-GnTI).

Figure 25 shows the nucleic acid sequence (SEQ ID NO :36) for the histone 3.1 promoter.
Figure 26 shows the nucleic acid sequence (SEQ ID NO:37) for the hybrid gene fusion (TmManI-TmGnTI).
Figure 27 shows the nucleic acid sequence (SEQ ID NO:38) for the hybrid gene fusion TmManI-ManII (with the RbcS1 promoter sequence SEQ ID NO:39 shown).

Figure 28 shows the nucleic acid sequence (SEQ ID NO:39) for the RbcS1 promoter.

Figure 29 shows the nucleic acid sequence (SEQ ID NO:40) for the hybrid gene TmManI-ManII wherein the nucleic acid sequence (SEQ ID NO:33) encoding the transmembrane fragment is shown.
Figure 30 shows the nucleic acid sequence (SEQ ID NO:41) for the hybrid gene TmManI-GnTII.

Figure 31 shows the nucleic acid sequence (SEQ ID NO :42) for the Lhca promoter.
Figure 32 shows the nucleic acid sequence (SEQ ID NO:43) for the hybrid gene TmManI-GnTII wherein the nucleic acid sequence (SEQ ID NO:33) encoding the transmembrane fragment is shown Figure 33 shows the nucleic acid sequence (SEQ ID NO:44) for the terminator sequence used (see below).
Figure 34 is a Western Blot which examines total protein glycosylation of plants of the present invention compared to control plants.

Figure 35 is a lectin blot with RCA on Fl progeny of crossed plants, said progeny made according to one embodiment of the present invention Figure 36 is a Western Blot. Panel A was assayed with anti-IgG antibody. Panel B was assayed with an anti-HRP antibody. Panel C was assayed with a specific anti-Xyl antibody fraction.

Panel D was assayed with a specific anti-Fucose antibody fraction. Panel E was assayed with the lectin RCA.
Figure 37 shows the nucleic acid sequence (SEQ ID NO:49) of a hybrid gene wherein the aminoterminal CTS region of an insect Mannosidase III gene is replaced by a mouse signal peptide and a carboxyterminal endoplasmic reticulum retention signal (1(DEL) was added.
Figure 38 shows the corresponding amino acid sequence (SEQ ID NO:50) for the nucleic acid sequence of Figure 37.

Figure 39 shows the nucleic acid sequence (SEQ ID NO:51) of a hybrid gene wherein the aminoterminal CTS region of a human beta-1,4-galactosyltransferase (GalT) gene is replaced by a mouse signal peptide and a carboxyterminal endoplasmic reticulum retention signal (KDEL) was added.
Figure 40 shows the corresponding amino acid sequence (SEQ ID NO:52) for the nucleic acid sequence of Figure 39.
Figure 41 shows the nucleic acid sequence (SEQ ID NO:53) of a hybrid gene wherein the aminoterminal CTS region of an Arabidopsis thaliana GnTI gene is replaced by a mouse signal peptide and a carboxyterminal endoplasmic reticulum retention signal (KDEL) was added.
Figure 42 shows the corresponding amino acid sequence (SEQ ID NO:54) for the nucleic acid sequence of Figure 41.
Figure 43 shows the nucleic acid sequence (SEQ ID NO:55) of a hybrid gene wherein the aminoterminal CTS region of an Arabidopsis thaliana GnTII gene is replaced by a mouse signal peptide and a carboxyterminal endoplasmic reticulum retention signal (KDEL) was added.
Figure 44 shows the corresponding amino acid sequence (SEQ ID NO:56) for the nucleic acid sequence of Figure 43.
Figure 45 shows the nucleic acid sequence (SEQ ID NO:57) of a hybrid gene wherein the aminoterminal CTS region of a human beta-1,4-galactosyltransferase (GalT) gene is replaced by the CTS region of the human gene for GnTI.
Figure 46 shows the corresponding amino acid sequence (SEQ ID NO:58) for the nucleic acid sequence of Figure 45.
Figure 47 is a schematic of how enzymes might be localized to the Golgi.
Figure 48 is a non-limiting speculative schematic of how the "swapping" of regions of transferases might cause relocalization.
DETAILED DESCRIPTION OF THE INVENTION

Hybrid Enzymes The nucleic acid sequences encoding the various glycosylation enzymes such as mannosidases, GlcNAcTs, galactosyltransferases may be obtained using various recombinant DNA
procedures known in the art, such as polymerase chain reaction (PCR) or screening of expression libraries to detect cloned DNA fragments with shared structural features. See, e.g., Innis et al., 1990, PCR: A Guide to Methods and Application, Academic Press, New York. Other nucleic acid amplification procedures such as ligase chain reaction (LCR), ligated activated transcription (LAT) and nucleic acid sequence-based amplification (NASBA) or long range PCR may be used.
Once the DNA fragments are generated, identification of the specific DNA
fragment containing the desired gene may be accomplished in a number of ways. For example, if an amount of a portion of a gene or its specific RNA, or a fragment thereof, is available and can be purified and labeled, the generated DNA fragments may be screened by nucleic acid hybridization to the labeled probe [Benton and Davis, Science 196:180 (1977); Grunstein and Hogness, Proc.
Natl. Acad. Sci.
U.S.A. 72:3961 (1975)]. Alternatively, the presence of the gene may be detected by assays based on the physical, chemical, or immunological properties of its expressed product.
For example, cDNA
clones, or DNA clones which hybrid-select the proper mRNAs, can be selected which produce a protein that, e.g., has similar or identical electrophoretic migration, isoelectric focusing behavior, proteolytic digestion maps, or antigenic properties as known for the protein of interest.
A nucleic acid sequence encoding a hybrid enzyme comprising a transmembrane portion of a first enzyme and a catalytic portion of a second enzyme may be obtained as follows. The sequence encoding the transmembrane portion is removed from the second enzyme, leaving a nucleic acid sequence comprising a nucleic acid sequence encoding the C-terminal portion of the second enzyme, which encompasses the catalytic site. The sequence encoding the transmembrane portion of the first enzyme is isolated or obtained via PCR and ligated to the sequence encoding a sequence comprising the C-terminal portion of the second enzyme.
Modified Enzymes A nucleic acid sequence encoding a protein, particularly enzymes such as galactosyltransferases, mannosidases and N-acetylglucosamine transferases that are retained in the ER may be obtained by removing the sequence encoding the transmembrane fragment and substituting it for a methionine (initiation of translation) codon and by inserting between the last codon and the stop codon of galactosyltransferase the nucleic acid sequence encoding an ER
retention signal such as the sequence encoding KDEL (amino acid residue sequence: lysine-aspartic acid-glutamic acid-leucine) [Rothman Cell 50:521 (1987)].

Using Domains and Portions Thereof As noted above, the phrases "at least a portion of" or a "fragment of' refers to the minimal amino acid sequence necessary for a protein or a peptide to retain its natural or native function. For example, the function of an enzyme could refer to its enzymatic or catalytic role, its ability to anchor a protein in the Golgi apparatus, or as a signal peptide. Thus, the phrases "at least a portion of a transmembrane domain" or "a fragment of a transmembrane domain" each refer to the smallest amino acid segment of a larger transmembrane domain that still retains at least part of the native transmembrane functionality (for example, the function may be evident, albeit decreased). As another example, the phrases "at least a portion of a catalytic region" or "a fragment of a catalytic region" each refer to the smallest amino acid segment of a larger catalytic region that still retains at least part of the native catalytic functionality (again, even if somewhat decreased). As discussed herein, one skilled in the art will know the minimal amino acid segment that is necessary for a protein or a peptide to retain at least some of the functionality of the native protein or peptide.

The glycosyltransferase enzymes are typically grouped into families based on the type of sugar they transfer (galactosyltransferases, sialyltransferases, etc.). Based on amino-acid sequence similarity and the stereochemical course of the reaction, glycosyltransferases can be classified into at least 27 and perhaps as many as 47 different families [Campbell et al., Biocheni. J. 326:929-939 (1997), Biochem. J. 329:719 (1998)]. The majority of glycosyltransferases cloned to date are type II
transmembrane proteins (i.e., single transmembrane domain with the NH2 terminus in the cytosol and the COOH terminus in the lumen of the Golgi apparatus). Regardless of how they are classified, all glycosyltransferases share some common structural features: a short NH2-terminal cytoplasmic tail, a 16-20 amino acid signal-anchor or transmembrane domain, and an extended stem region which is followed by the large COOH-terminal catalytic domain. The cytoplasmic tail appears to be involved in the specific localization of some types of glycosyltransferases to the Golgi [Milland et al., I Biol. Chem. 277:10374-10378]. The signal anchor domains can act as both uncleavable signal peptides and as membrane-spanning regions that orient the catalytic domains of the glycosyltransferases within the lumen of the Golgi apparatus.
In one embodiment of the present invention, a portion defined by the N-terminal 77 amino acids of Nicotiana benthamiana (tobacco) acetylglucosaminyltransferase I are contemplated for use in the hybrid enzyme(s), since this portion has been found to be sufficient to target to and to retain a reporter protein in the plant Golgi apparatus [Essl et al., FEBS Lett 453:169-173 (1999)].
Subcellular localization in tobacco of various fusion proteins between the putative cytoplasmic, transmembrane and stem domains revealed that the cytoplasmic-transmembrane domains alone were sufficient to sustain Golgi retention of f3 1,2-xylosyltransferase without the contribution of any luminal sequences [Dirnberger et al., Plant MoL Biol. 50:273-281 (2002)].
Thus, as noted above, certain embodiments of the present invention utilize portions of the CTS
region which involve only the cytoplasmic-transmembrane domains (or portions thereof) without utilizing the stem region of the CTS region. However, while some types of glycosyltransferases rely primarily on their transmembrane domain for Golgi retention, other types require their transmembrane region and sequences flanking one or both sides of this region [Colley, Glycobiology 7:1-13 (1997)]. For example, the N-terminal peptide encompassing amino acids 1 to 32 appears to be the minimal targeting signal sufficient to localize 13 1,6 N-acetylglucosaminyltransferase to the Golgi. This peptide makes up the cytoplasmic and transmembrane domains of this enzyme [Zerfaoui et al., Glycobiology 12:15-24].
A great deal of information is available on the amino acid sequences of the domains for specific glycosyltransferases. For example, the amino acid sequence of the mammalian galactosyltransferase provided in GenBank Accession No. AAM17731 has the "stem" and "catalytic" domains spanning residues 19 to 147 and residues 148 to 397, respectively [U.S. Patent No. 6,416,988, hereby incorporated by reference] ¨ and the present invention, in certain embodiments, specifically contemplates such portions for use in the hybrid enzyme(s). The amino acid sequence of the rat liver sialyltransferase provided in GenBank Accession No. AAC91156 has a 9-amino acid NH2-terminal cytoplasmic tail, a 17-amino acid signal-anchor domain, and a luminal domain that includes an exposed stem region followed by a 41 kDa catalytic domain [Hudgin et al., Can. J. Biochem. 49:829-837 (1971); U.S. Patent Nos. 5,032,519 and 5,776,772, hereby incorporated by reference]. Known human and mouse f3 1,3-galactosyltransferases have a catalytic domain with eight conserved regions [Kolbinger et al., J. Biol. Chem. 273:433-440 (1998);
Hennet et al., J. Biol.
Chem. 273:58-65 (1998); U.S. Patent No. 5,955,282, hereby incorporated by reference]. For example, the amino acid sequence of mouse UDP-galactose: P-N-acetylglucosamine p 1,3-galactosyltransferase-I provided in GenBank Accession No. NM020026 has the following catalytic regions: region 1 from residues 78-83; region 2 from residues 93-102; region 3 from residues 116-119; region 4 from residues 147-158; region 5 from residues 172-183; region 6 from residues 203-206; region 7 from amino acid residues 236-246; and region 8 from residues 264-275. [Hennet et al., supra.] ¨ all of which are contemplated in certain embodiments of the present invention as useful portions in the context of the hybrid enzyme(s) discussed above.
While earlier comparisons amongst known cDNA clones of glycosyltransferases had revealed very little sequence homology between the enzymes [Paulson et al., J.
Biol. Chem.
264:17615-618 (1989)], more recent advances have made it possible to deduce conserved domain structures in glycosyltransferases of diverse specificity [Kapitonov et al., Glycobiology 9:961-978 (1999)]. For example, the nucleic acid and amino acid sequences of a number of glycosyltransferases have been identified using sequence data provided by the complete genomic sequences obtained for such diverse organisms as Homo sapiens (humans), Caenorhabditis elegans (soil nematode), Arabidopsis thaliana (thale cress, a mustard) and Oiyza sativa (rice).
As a result of extensive studies, common amino acid sequences have been deduced for homologous binding sites of various families of glycosyltransferases. For example, sialyltransferases have sialyl motifs that appear to participate in the recognition of the donor substrate, CMP-sialic acid [Paulson et al., J. Biol. Chem., 264:17615-17618 (1989); Datta et al., J.
Biol. Chem., 270:1497-1500 (1995); Katsutoshi, Trends Glycosci. Glycotech. 8:195-215 (1996)]. The hexapeptide RDICKND in Gal al-3 galactosyltransferase and RDKKNE in GlcNAc 01-4 galactosyltransferase have been suggested as the binding site for UDP-Gal [(Joziasse etal., J. Biol. Chem., 260:4941-4951 (1985), J.
Biol. Chem., 264:14290-14297 (1989); Joziasse, Glycobiology, 2:271-277 (1992)].
A small, highly-conserved motif formed by two aspartic acid residues (DXD), which is frequently surrounded by a hydrophobic region, has been identified in a large number of different eukaryotic transferases, including a-1, 3-mannosyltransferase, p-i, 4-galactosyltransfereases, a-1, 3-galactosyltransferases, glucuronyltransferases, fucosyltransferases, glycogenins and others [Wiggins et at., Proc. Natl. Acad. Sci. U.S.A. 95:7945-7950 (1998)]. Mutation studies indicate that this motif is necessary for enzymatic activity [Busch etal., J. BioL Chem. 273:19566-19572 (1998); Wang et at., J.. Biol. Chem. 277:18568-18573 (2002)]. Multiple peptide alignment showed several motifs corresponding to putative catalytic domains that are conserved throughout all members of the P 3-galactosyltransferase family, namely, a type II transmembrane domain, a conserved DxD motif, an N-glycosylation site and five conserved cysteines [Gromova et al., Mol.
Carcinog. 32:61-72 (2001)].
Through the use of BLAST searches and multiple alignments, the E-X7-E motif was found to be a highly conserved among the members of four families of retaining glycosyltransferases [Cid et al., J. Biol. Chem. 275:33614-33621 (2000)]. The 0-linked acetylglucosaminyltransferases (GleNAc) add a single P-N-acetylglucosamine moiety to specific serine or threonine hydroxyls.
BLAST analyses, consensus secondary structure predictions and fold recognition studies indicate that a conserved motif in the second Rossmann domain points to the UDP-GleNAc donor-binding site [Wrabl et al., J. Mol. Biol. 314:365-374 (2001)]. The Pl, 3-glycosyltransferase enzymes identified to date share several conserved regions and conserved cysteine residues, all being located in the putative catalytic domain. Site-directed mutagenesis of the murine P3GatT-I gene (Accession No. AF029790) indicate that the conserved residues W101 and W162 are involved in the binding of the UDP-galactose donor, the residue W315 in the binding of the N-acetylglucosamine- p-p-nitrophenol acceptor, and the domain including E264 appears to participate in the binding of both substrates [Malissard et al., Eur. J. Biochem. 269:233-239 (2002)].

Expression of Proteins of Interest in Plant Host System The nucleic acid encoding the hybrid or modified enzymes or other heterologous proteins, such as a heterologous glycoprotein may be inserted according to certain embodiments of the present invention into an appropriate expression vector, i.e., a vector which contains the necessary elements for the transcription and translation of the inserted coding sequence, or in the case of an RNA viral vector, the necessary elements for replication and translation, as well as selectable markers. These include but are not limited to a promoter region, a signal sequence, 5' untranslated sequences, initiation codon (depending upon whether or not the structural gene comes equipped with one), and transcription and translation termination sequences. Methods for obtaining such vectors are known in the art (see WO 01/29242 for review).
Promoter sequences suitable for expression in plants are described in the art, e.g., WO
91/198696. These include non-constitutive promoters or constitutive promoters, such as, the nopaline synthetase and octopine synthetase promoters, cauliflower mosaic virus (CaMV) 19S and 35S promoters and the figwort mosaic virus (FMV) 35 promoter (see U.S. Pat.
Nos. 5, 352,605 and 6,051,753, both of which are hereby incorporated by reference). Promoters used may also be tissue specific promoters targeted for example to the endosperm, aleurone layer, embryo, pericarp, stem, leaves, tubers, roots, and the like.
A signal sequence allows processing and translocation of a protein where appropriate. The signal can be derived from plants or could be non-plant signal sequences. The signal peptides direct the nascent polypeptide to the endoplasmic reticulum, where the polypeptide subsequently undergoes post-translational modification. Signal peptides can routinely be identified by those of skill in the art.
They typically have a tripartite structure, with positively charged amino acids at the N-terminal end, followed by a hydrophobic region and then the cleavage site within a region of reduced hydrophobicity.
The transcription termination is routinely at the opposite end from the transcription initiation regulatory region. It may be associated with the transcriptional initiation region or from a different gene and may be selected to enhance expression. An example is the NOS
terminator from Agrobacterium Ti plasmid and the rice alpha-amylase terminator.
Polyadenylation tails may also be added. Examples include but are not limited to Agrobacterium octopine synthetase signal, [Gielen et al., EMBO J. 3:835-846 (1984)1 or nopaline synthase of the same species [Depicker et al., MoL
AppL Genet. 1:561-573 (1982)].
Enhancers may be included to increase and/or maximize transcription of the heterologous protein. These include, but are not limited to peptide export signal sequence, codon usage, introns, polyadenylation, and transcription termination sites ( see WO 01/29242).
Markers include preferably prokaryote selectable markers. Such markers include resistance toward antibiotics such as ampicillin, tetracycline, kanamycin, and spectinomycin. Specific examples include but are not limited to streptomycin phosphotransferase (spt) gene coding for streptomycin resistance, neomycin phosphotransferase (nptIl) gene encoding kanamycin or geneticin resistance, hygromycin phosphotransferase (hpt) gene encoding resistance to hygromycin.
The vectors constructed may be introduced into the plant host system using procedures known in the art (reviewed in WO 01/29242 and WO 01/31045). The vectors may be modified to intermediate plant transformation plasmids that contain a region of homology to an Agrobacterium tumefaciens vector, a T-DNA border region from A. tumefaciens. Alternatively, the vectors used in the methods of the present invention may be Agrobacteriuni vectors. Methods for introducing the vectors include but are not limited to microinjection, velocity ballistic penetration by small particles with the nucleic acid either within the matrix of small beads or particles, or on the surface and electroporation. The vector may be introduced into a plant cell, tissue or organ. In a specific embodiment, once the presence of a heterologous gene is ascertained, a plant may be regenerated using procedures known in the art. The presence of desired proteins may be screened using methods known in the art, preferably using screening assays where the biologically active site is detected in such a way as to produce a detectable signal. This signal may be produced directly or indirectly.
Examples of such assays include ELISA or a radioimmunoassay.

Transient Expression The present invention specifically contemplates both stable and transient expression of the above-described hybrid enzymes. Techniques for transforming a wide variety of higher plant species for transient expression of an expression cassette are well known [see, for example, Weising et al., Ann. Rev. Genet. 22:421-477(1988)]. Variables of different systems include type nucleic acid transferred (DNA, RNA, plasmid, viral), type of tissue transformed, means of introducing transgene(s), and conditions of transformation. For example, a nucleic acid construct may be introduced directly into a plant cell using techniques ranging from electroporation, PEG poration, particle bombardment, silicon fiber delivery, microinjection of plant cell protoplasts or embryogenic callus or other plant tissue, or Agrobacterium-mediated transformation [Hiei et al., Plant J. 6:271-282 (1994)]. Because transformation efficiencies are variable, internal standards (eg, 35S-Luc) are often used to standardize transformation efficiencies.

Expression constructs for transient assays include plasmids and viral vectors.
A variety of plant viruses that can be employed as vectors are known in the art and include cauliflower mosaic virus (CaMV), geminivirus, brome mosaic virus, and tobacco mosaic virus.

Plant tissues suitable for transient expression include cultured cells, either intact or as protoplasts (in which the cell wall is removed), cultured tissue, cultured plants, and plant tissue such as leaves.
Some transient expression methods utilize gene transfer into plant cell protoplasts mediated by electroporation or polyethylene glycol (PEG). These methods require the preparation and culture of plant protoplasts, and involve creating pores in the protoplast through which nucleic acid is transferred into the interior of the protoplast.
Exemplary electroporation techniques are described in Fromm et al., Proc.
Natl. Acad. Sci.
82: 5824 (1985). The introduction of DNA constructs using polyethylene glycol precipitation is described in Paszkowski et at., EMBO J. 3: 2717-2722 (1984). PEG-mediated transformation of tobacco protoplasts, which includes the steps of isolation, purification, and transformation of the protoplasts, are described in Lyck et al., (1997) Planta 202: 117-125 and Scharf et al., (1998) Mol Cell Biol 18: 2240-2251, and Kirschner et al., (2000) The Plant J24(3): 397-411. These methods have been used, for example, to identify cis-acting elements in promoters activated by external stimuli, Abel and Theologis (1994) Plant J5: 421-427; Hattori et at., (1992) Genes Dev 6: 609-618; Sablowski et at., (1994) EMBO J13: 128-137; and Solano et al., (1995) EMBO J14: 1773-1784), as well as for other gene expression studies (U. S. Patent 6,376,747, hereby incorporated by reference).
Ballistic transformation techniques are described in Klein et al., (1987) Nature 327: 70-73.

Biolistic transient transformation is used with suspension cells or plant organs. For example, it has been developed for use in Nicotiana tabacum leaves, Godon et al (1993) Biochimie 75(7): 591-595.
It has also been used in investigating plant promoters, (Baum et al., (1997) Plant J12: 463-469;
Stromvik et al., (1999) Plant Mol Biol 41(2): 217-31, Tuerck and Fromm (1994) Plant Cell 6:
1655-1663; and U. S. Patent 5,847,102, hereby incorporated by reference), and to characterize transcription factors (Goff et al., (1990) EMBO J9: 2517-2522; Gubler et at., (1999) Plant J 17: 1-9; and Sainz et at., (1997) Plant Cell 9: 611-625).
Other methods allow visualization of transient expression of genes in situ, such as with onion epidermal peels, in which GFP expression in various cellular compartments was observed (Scott et at., (1999) Biotechniques 26(6): 1128-1132 Nucleic acids can also be introduced into plants by direct injection.
Transient gene expression can be obtained by injection of the DNA into reproductive organs of a plant (see, for example, Pena et al., (1987) Nature, 325.:274), such as by direct DNA transfer into pollen (see, for example, Zhou et al., (1983) Methods in Enzymology, 101:433; D. Hess (1987) Intern Rev. Cytol., 107:367; Luo et al., (1988) Plant Mol. Biol. Reporter, 6:165. DNA can also be injected directly into the cells of immature embryos (see, for example, Neuhaus et al., (1987) Theor.
AppL Genet: 75:30;
and Benbrook et al., (1986) in Proceedings Bio Expo 1986, Butterworth, Stoneham, Mass., pp.
27-54).
Agrobacterium-mediated transformation is applicable to both dicots and monocots.
Optimized methods and vectors for Agrobacterium-mediated transformation of plants in the family Graminae, such as rice and maize have been described (see, for example, Heath et al., (1997) Mol.
Plant-Microbe Interact. 10:221-227; Hiei et al., (1994) Plant J. 6:271-282 and Ishida et al., (1996) Nat. Biotech. 14:745-750). The efficiency of maize transformation is affected by a variety of factors including the types and stages of tissue infected, the concentration of Agrobacterium, the tissue culture media, the Ti vectors and the maize genotype.
Another useful basic transformation protocol involves a combination of wounding by particle bombardment, followed by use of Agrobacterium for DNA delivery (see, for example, Bidney et al., (1992) Plant MoL Biol. 18:301-313). Both intact meristem transformation and a split meristem transformation methods are also known (U. S. Patent 6,300,545, hereby incorporated by reference).
Additional methods utilizing Agrobacteria include agroinfection and agroinfiltration. By inserting a viral genome into the T-DNA, Agrobacterium can be used to mediate the viral infection of plants (see, for example, U. S. Patent 6,300,545, hereby incorporated by reference). Following transfer of the T-DNA to the plant cell, excision of the viral genome from the T-DNA (mobilization) is required for successful viral infection. This Agrobacterium-mediated method for introducing a virus into a plant host is known as agroinfection (see, for example, Grimsley, "Agroinfection" pp.
325-342, in Methods in Molecular Biology, vol 44: Agrobacterium Protocols, ed.
Gartland and Davey, Humana Press, Inc., Totowa, N.J.; and Grimsley (1990) Physiol. Plant.
79:147-153).
The development of plant virus gene vectors for expression of foreign genes in plants provides a means to provide high levels of gene expression within a short time.
Suitable viral replicons include double-stranded DNA from a virus having a double stranded DNA
genome or replication intermediate. The excised viral DNA is capable of acting as a replicon or replication intermediate, either independently, or with factors supplied in trans. The viral DNA may or may not encode infectious viral particles and furthermore may contain insertions, deletions, substitutions, rearrangements or other modifications. The viral DNA may contain heterologous DNA, which is any non-viral DNA or DNA from a different virus. For example, the heterologous DNA may comprise an expression cassette for a protein or RNA of interest.
Super binary vectors carrying the vir genes of Agrobacterium strains A281 and A348 are useful for high efficiency transformation of monocots. However, even without the use of high efficiency vectors, it has been demonstrated that T-DNA is transferred to maize at an efficiency that results in systemic infection by viruses introduced by agroinfection, although tumors are not formed (Grimsley et al., (1989) Mol. Gen. Genet. 217:309-316). This is because integration of the T-DNA
containing the viral genome is not required for viral multiplication, since the excised viral genome acts as an independent replicon.
Another Agrobacteria-mediated transient expression assay is based on Agrobacterium-mediated transformation of tobacco leaves in planta (Yang et al., (2000) The Plant J22(6): 543-551). The method utilizes infiltration of agrobacteria carrying plasmid constructs into tobacco leaves, and is referred to as agroinfiltration; it has been utilized used to analyze in vivo expression of promoters and transcription factors in as little as 2-3 days. It also allows examination of effects of external stimuli such as pathogen infections and environmental stresses on promoter activity in situ.

Example 1 An Arabidopsis thaliana cDNA encoding 131,2-xylosyltransferase was isolated from a cDNA
library by a previously described PCR based sibling selection procedure [Bakker et al., BBRC 261:829 (1999)]. Xylosyltransferase activity was confirmed by immunostaining of transfected CHO cells with a xylose specific antibody purified from rabbit-anti-horseradish-peroxidase antiserum. A DNA fragment covering the N-terminal part of the xylosyltransferase was amplified using primers:
XylTpvuF:ATACTCGAGTTAACAATGAGTAAACGGAATC (SEQ ID NO:45) and XylTpvuR:TTCTCGATCGCCGATTGGTTATTC (SEQ ID NO:46) XhoI and HpaI restriction sites were introduced in front of the start codon and a PvuI was introduced at the reverse end. A C-terminal fragment from Human 131,4galactosyltransferase (acc.no. x55415, Aoki 1992) was amplified using primers GalTpvuF:GCCGCCGCGATCGGGCAGTCCTCC (SEQ
ID NO:47) and GalTrev:AACGGATCCACGCTAGCTCGGTGTCCCGAT (SEQ ID NO:48) thus introducing PvuI and BamHI sites. The XhoI/PvuI and PvuI/BamHI digested PCR
fragments were ligated in XhoI/BamHI digested pBluescriptSK+ and sequenced. The resulting open reading frame encodes a fusion protein containing the first 54 amino acids of A. thaliana 01,2-xylosyltransferase fused with amino acid 69 to 398 of human [31,4galactosyltransferase and is designated as TmXyl-GalT. The fragment was cloned into a plant expression vector between the CaMV35S promoter and Nos terminator, using HpaI/BamHI. The clone was introduced into Nicotiana tabacum (samsun NN) as described for native human 131,4galactosyltransferase [Bakker et al., Proc.
Nat. Acad. Sci. USA
98:2899 (2001)].
Protein extract of transgenic plants and Western Blots were made as described [Bakker et al., Proc. Nat. Acad. Sci. USA 98:2899 (2001)]. Based on reaction with the lectin RCA, a transgenic plant expressing TmXylGalT was selected for further glycan analysis by MALDI-TOF [Elbers et al., Plant Physiology 126:1314 (2001] and compared with glycans isolated from plants expressing native 131,4galactosyltransferase and with glycans from wild-type plants. Relative peak areas of the MALDI-TOF spectrum are given in Table 1. That is to say, Table 1 is a comparison of the results of mass spec (MALDI-TOF) analysis of N-glycans of endogenous glycoproteins of control tobacco ("Tobacco"), transgenic tobacco expressing human beta-1,4-galactosyltransferase ("GalT") and transgenic tobacco plants expressing the beta-1,4-galactosyltransferase gene of which the CTS

region has been replaced with that of beta-1,2-xylosyltransferase ("TmXyl-GalT").

m/z Type Tobacco GaIT TmXyl-GaIT

1298 GaIGNM3 6 1460 GaIGNM4 11 1576 GaIGNFXM3 5 1606 GaIGNFM4 1622 GaIGNM5 9 1768 GaIGNFM5 3 1779 GaIGN2FXM3 2 1941 Gal2GN2FXM3 2 These data show that:

1. In TmXylGalT plants, xylosylation and fucosylation of the glycans is dramatically reduced: 82% of the glycans do not carry xylose nor fucose as compared to 14%
in wild-type plants.

2. Galactosylation has increased from 9% in GalT plants to 32% in TmXylGalT
plants.

Example 2 A transgenic plant expressing said TmXyl-GalT gene (TmXyl-GalT-12 plant) was selected (above) based on lectin blotting using biotin-labelled RCA (Vector Laboratories, Burlingame, California). Comparison of protein extracts of MGR48 transgenic (control) plant, a selected transgenic plant expressing the unmodified human P1,4-galactosyltransferase gene and TmXyl-GalT-12 plant for the presence of xylose and fucose using anti-HRP
(horseradish peroxidase) polyclonal antibody (known for high anti-xylose and anti-fucose reactivity) clearly showed reduced xylose and fucose (Figure 34: "Anti-HRP"). Western blotting using an anti-xylose fraction of the anti-HRP and an anti-fucose fraction (each of which can be prepared by affinity chromatography over the appropriate ligand) showed that especially xylose was reduced compared to control plants (Figure 34: anti-Fuc" and "anti-Xyl").

Example 3 The TmXyl-Ga1T-12 plant was crossed with a transgenic plant expressing the monoclonal antibody MGR48 from a single T-DNA integration event (MGR48-31) and which was first made homozygous by selecting offspring plants not segregating for the kanamycin resistance marker and antibody production (MGR48-31-4). Pollen of MGR48-31-4 was used for pollination of emasculated TmXyl-GalT-12 plants. Vice versa, pollen of TmXyl-GaIT-12 plant was used for fertilization on emasculated MGR48-31-4 plants. A number of Fl plants were analyzed for the presence of MGR48 by western blotting and for galactosylation of endogenous glycoproteins by lectin blotting using RCA (Figure 35). One plant expressing MGR48 and showing galactosylation of endogenous glycoproteins was selected for further analysis. This plant was identified as XGM8.
Seeds from TmXyl-GaIT-12 (y) x MGR48-31-4 (6) were sown and Fl offspring plants (XGM) were analysed for antibody production by Western blotting and for galactosylation by lectin blotting using biotinylated RCA120 (Vector Labs., Burlingame, California) using standard techniques as described before. All plants as expected expressed the monoclonal antibody MGR48 and the majority also had galactosylated glycans as depicted from lectin blotting using RCA120. A
single plant expressing both antibody MGR48 and having galactosylated N-glycans was chosen for further analysis (XGM8) (TmXyl-Ga1T-12 X MGR48-31-4 offpring plant 8). The monoclonal recombinant MGR48 antibody was purified from this plant as described before and submitted to N-glycan analysis by MALDI-TOF.
Briefly, XGM8 plant was grown in greenhouse for antibody production under optimal conditions [Elbers et al., Plant Physiology 126:1314 (2001)]. Protein extract of leaves of transgenic XGM8 plant was made and monoclonal antibody was purified using protein G
chromatography as described [Bakker et al, Proc. Nat. Acad. Sci. USA 98:2899 (2001)]. MALDI-TOF
of N-glycans of purified monoclonal antibody was as described (Elbers et al., 2001, supra).
The presence of galactose on glycans was established by enzyme sequencing using bovine testis P-galactosidase as described (Bakker et al., 2001, supra; Table 2). Table 2 (below) is a comparison of the results of mass spec (MALDI-TOF) analysis of N-glycans of endogenous glycoproteins ("Xyl-GalT Endo") of a Fl hybrid of TmXyl-Ga1T-12plant and plant producing rec-mAb (MGR48) and of N-glycans of rec-mA13 purified by protein G chromatography from said Fl hybrid.

TABLE 2 Xyl-GaIT Xyl-GaIT
mk Type Endo IgG

1095 M4 _ 5 5 1298 GaIGNM3 3 4 1430 GaIGNXM3 1444 GaIGNFM3 1460 GaIGNM4 8 10 1501 GaIGN2M3 1 1 1576 GaIGNFXM3 2 3 1593 GaIGNXM4 1 2 1606 GaIGNFM4 1622 GaIGNM5 6 6 1647 GaIGN2FM3 1 1 1663 Gal2GN2M3 1 1 1738 GaIGNFXM4 1754 GaIGNXM5 1 2 1768 GaIGNFM5 _ 2 3 1784 GaIGNM6 1809 Gal2GN2FM3 2 1 1900 GaIGNFXM5 These data show that:
1. In the Fl hybrid, xylosylation and fucosylation of the glycans is dramatically reduced: 43% of the glycans of endogenous glycoproteins lack xylose and fucose as compared to only 14% in wild-type tobacco plants.
2. The glycans of purified mAb of this Fl hybrid have reduced xylose and fucose, 47% compared to 14% for wildtype tobacco. See also Figure 36, panels B-D.
3. Galactosylation of endogenous glycoproteins of F1 hybrid has increased from 9% in GalT plants to 37% in Fl TmXyl-GalT X MGR48 plant. See also Figure 35.
4. Purified rec-mAb from said Fl (see Figure 36, panel A) shows increased galactosylation; that is to say, 46% has galactose. See also Figure 36, panel E.
It should however be noted that the observed quantities (MALDI-TOF) do not necessarily reflect the molar rations of said glycoforms in vivo. Quantification based on MALDI-TOF
can be under- or overestimated depending on the specific glycoform under study. Also, since there is no molecular weight difference between Gal and Man, some peaks can not be annotated unambiguously unless there are clear differences in relative height of specific molecules before and after galactosidase treatment.

Example 4 A more direct comparison of xylose, fucose and galactose content was done by examining the MGR48 IgG antibodies from hybridoma, transgenic tobacco and TmXyl-GalT
transgenic tobacco. As mentioned above, the TmXyl-Ga1T-12 plant was crossed with tobacco plant expressing MGR48 IgG (MGR48 tobacco) resulting in an Fl hybrid harbouring MGR48 TmXyl-GalT. An Fl plant was chosen for extraction and purification of MGR48 IgG. Antibodies from said plants (tobacco and TmXyl-GalT) were isolated and purified using protein G
chromatography (Elbers et al., 2001. Plant Physiology 126: 1314-1322). 300 nanograms amounts of each, hybridoma MGR48 and plant-derived recMGR48, were loaded on precast 12% SDS-PAGE gels (BioRad) and run. The contents of each lane were as follows: Lane 1, MGR48 from hybridoma; Lane 2, purified recMGR48 from normal transgenic tobacco plant; and Lane 3, purified recMGR48 from TmXyl-GalT transgenic plant. Following SDS-PAGE proteins were transferred to nitrocellulose using CAPS buffer. Blots were incubated with A, anti-mouse IgG; B, polyclonal rabbit anti-IIRP (anti-xylose/(alpha 1,3-fucose); C, anti-xylose; D, anti-(alpha 1,3-) fucose antibodies; and E, biotinylated RCA. Detection was with LumiLight on Lumi Imager following incubation with HRP-labelled sheep anti-mouse (panel A) or goat-anti-rabbit (panels B-D) antibodies and HRP-labeled streptavidin (E).

Panel A shows that approximately similar amounts of the MGR48 IgG was loaded for all lanes (1-3). L refers to Light chain and H, heavy chain of MGR48 IgG.

Panel B shows that the heavy chain of MGR48 antibody in lane 2 (tobacco) strongly reacts with anti-HRP as expected, whereas the heavy chain of hybridoma derived MGR48 (lane 1) does not (as expected). Hybridoma derived antibodies do not carry xylose and alpha 1, 3-fuctose residues.

Remarkably, MGR48 antibodies from TmXyl-GalT tobacco plant also do not react, suggesting that the heavy chain of antibody from this plant have significantly reduced (perhaps by 90% or more) the amounts of xylose and fucose residues on the N-glycans. This is confirmed by experiments depicted in panels C (anti-xylose) and D (anti-fucose). Panel E shows that the heavy chain of MGR48 antibody of hybridoma (lane 1) has a galactosylated N-glycan, whereas tobacco-derived MGR48 (lane 2) has not, both as expected. Heavy chain of MGR48 from the TmXyl-GalT
plant (lane 3) also has galactosylated N-glycan due to the presence of the construct expressing the hybrid enzyme.

These data are in agreement with the data obtained from similar experiments using total protein extracts from similar plants (tobacco and TmXyl-GalT-12 plant) as shown previously and confirm that the novel trait introduced in tobacco from expression of TmXyl-GalT gene can be stably transmitted to offspring and a recombinant monoclonal antibody.

Example 5 Further characterization of the above-described F1 hybrid was performed by treatement with beta-galactosidase. Table 3 is a comparison of the results of mass spec (MALDI-TOF) analysis of N-glycans of rec-mAbs purified by protein G chromatography from an Fl hybrid of TmXyl-GalT

and MGR48 plant before and after treatment of the glycans with beta-galactosidase.

TABLE 3 Xyl-GaIT Xyl-GaIT

miz Type IgG- IgG+beta-galactosidase 1298 GaIGNM3 4 4 1430 GaIGNXM3 1444 GaIGNFM3 1460 GaIGNM4 10 14 1501 GaIGN2M3 1 1576 GaIGNFXM3 3 3 1593 GaIGNXM4 2 2 1606 GaIGNFM4 1622 GaIGNM5 6 1 1647 GaIGN2FM3 1 1663 Gal2GN2M3 1 1738 GaIGNFXM4 1754 GaIGNXM5 2 1 1768 GaIGNFM5 3 1 1784 GaIGNM6 1809 Gal2GN2FM3 1 1900 GaIGNFXM5 These data show that:
1. Rec-mAbs from Fl hybrid contain galactose which can be deduced from the observed reduction of specific (galactose-containing) glycoforms after beta-galactosidase treatment and increase of glycoforms lacking galactose. Note the observed reduction of m/z 1622 from 6 to 1% and simultaneous increase of m/z 1460 from 10 to 14% which is the result of the removal of galactose from Ga1GNM5 to give rise to GNM5. The same is true for miz 1768 (3 to 1%
decrease) and corresponding m/z 1606 peak (4 to 6% increase). See also Figure 36, panel E.
2. Similarly a number of peaks that can be attributed to galactose containing glycans vanish upon treatment with galactosidase, especially m/z 1501, 1647 and 1663 confirming the presence of galactose.

Example 6 In another embodiment, the aminoterminal CTS region of an insect Mannosidase III gene (accession number: AF005034; mistakenly annotated as a Mannosidase II gene!) is replaced by a mouse signal peptide coding sequence for import into the endoplasmic reticulum (see Figure 37) .
The signal peptide sequence encodes a fully active signal peptide normally present at the aminoterminus of IgG sequences and has been used successfully in plants and other organisms before. Furthermore a synthetic sequence coding for a so-called endoplasmic reticulum retention sequence (KDEL) is added to the carboxyterminus of the gene part encoding the catalytic fragment for ER retention. The hybrid Mannosidase III protein encoded by this gene sequence will hence accumulate preferentially in the endoplasmic reticulum.

Example 7 In another embodiment, the aminoterminal CTS region of the human beta-1,4-galactosyltransferase (GalT) gene (accession A52551) is replaced by a mouse signal peptide coding sequence for import into the endoplasmic reticulum (see Figure 39). The signal peptide sequence encodes a fully active signal peptide normally present at the aminoterminus of IgG sequences and has been used successfully in plants and other organisms before. Furthermore a synthetic sequence coding for a so-called endoplasmic reticulum retention sequence (KDEL) is added to the carboxyterminus of the gene part encoding the catalytic fragment for ER
retention. The hybrid beta-1,4-galactosyl-transferase protein encoded by this gene sequence will hence accumulate preferentially in the endoplasmic reticulum.

Example 8In another embodiment, the aminoterminal CTS region of Arabidopsis thaliana GnTI (acc.
AJ243198) is replaced by a mouse signal peptide coding sequence for import into the endoplasmic reticulum (see Figure 41). The signal peptide sequence encodes a fully active signal peptide normally present at the aminoterminus of IgG sequences and has been used successfully in plants and other organisms before. Furthermore a synthetic sequence coding for a so-called endoplasmic reticulum retention sequence (KDEL) is added to the carboxyterminus of the gene part encoding the catalytic fragment for ER retention. The hybrid GnTI protein encoded by this gene sequence will hence accumulate preferentially in the endoplasmic reticulum.

Example 9 In another embodiment, the aminoterminal CTS region of an Arabidopsis thaliana GnTII
(acc. AJ249274) is replaced by a mouse signal peptide coding sequence for import into the endoplasmic reticulum (see Figure 43). The signal peptide sequence encodes a fully active signal peptide normally present at the aminoterminus of IgG sequences and has been used successfully in plants and other organisms before. Furthermore a synthetic sequence coding for a so-called endoplasmic reticulum retention sequence (KDEL) is added to the carboxyterminus of the gene part encoding the catalytic fragment for ER retention. The hybrid GnTII protein encoded by this gene sequence will hence accumulate preferentially in the endoplasmic reticulum.

Example 10 In another embodiment, the aminoterminal CTS region of the human gene for beta-1,4-galactosyltransferase (GalT) gene is replaced by the CTS region of the human gene for GnTI

(TmhuGnTI-GalT) (see Figure 45).

It is understood that the present invention is not limited to any particular mechanism. Nor is it necessary to understand the mechanism in order to successfully use the various embodiments of the invention. Nonetheless, it is believed that there is a sequential distribution of Golgi enzymes (Figure 47) and that the swapping in of transmembrane domains of plant glycosyltransferases causes relocalization (Figure 48).

It is understood that the present invention is not limited to the particular methodology, protocols, cell lines, vectors, and reagents described herein, as these may vary. It is also to be understood that the terminology used herein is used for the purpose of describing particular embodiments only, and is not intend to limit the scope of the present invention. It must be noted that as used herein and in the appended claims, the singular forms "a", "an", and "the" include plural reference unless the context clearly dictates otherwise. Unless defined otherwise, all technical and scientific terms used herein have the same meanings as commonly understood by one of ordinary skill in the art to which this invention belongs.
The invention described and claimed herein is not to be limited in scope by the specific embodiments herein disclosed, since these embodiments are intended as illustrations of several aspects of the invention. Any equivalent embodiments are intended to be within the scope of this invention. Indeed, various modifications of the invention in addition to those shown and described herein will become apparent to those skilled in the art from the foregoing description. Such modifications are also intended to fall within the scope of the appended claims.
Various references are cited herein, the disclosures of which are incorporated by reference in their entireties.

SEQUENCE LISTING

<110> PLANT RESEARCH INTERNATIONAL BV

<120> Optimizing glycan processing in plants <130> 08901224CA

<140>
<141> 2003-03-18 <150> US-60/365,735 <151> 2002-03-19 <160> 59 <170> PatentIn version 3.2 <210> 1 <211> 1197 <212> DNA
<213> Homo sapiens <400> 1 atgaggcttc gggagccgct cctgagcggc agcgccgcga tgccaggcgc gtccctacag cgggcctgcc gcctgctcgt ggccgtctgc gctctgcacc ttggcgtcac cctcgtttac tacctggctg gccgcgacct gagccgcctg ccccaactgg tcggagtctc cacaccgctg cagggcggct cgaacagtgc cgccgccatc gggcagtcct ccggggagct ccggaccgga ggggcccggc cgccgcctcc tctaggcgcc tcctcccagc cgcgcccggg tggcgactcc agcccagtcg tggattctgg ccctggcccc gctagcaact tgacctcggt cccagtgccc cacaccaccg cactgtcgct gcccgcctgc cctgaggagt ccccgctgct tgtgggcccc atgctgattg agtttaacat gcctgtggac ctggagctcg tggcaaagca gaacccaaat gtgaagatgg gcggccgcta tgcccccagg gactgcgtct ctcctcacaa ggtggccatc atcattccat tccgcaaccg gcaggagcac ctcaagtact ggctatatta tttgcaccca gtcctgcagc gccagcagct ggactatggc atctatgtta tcaaccaggc gggagacact atattcaatc gtgctaagct cctcaatgtt ggctttcaag aagccttgaa ggactatgac tacacctgct ttgtgtttag tgacgtggac ctcattccaa tgaatgacca taatgcgtac aggtgttttt cacagccacg gcacatttcc gttgcaatgg ataagtttgg attcagccta ccttatgttc agtattttgg aggtgtctct gctctaagta aacaacagtt tctaaccatc aatggatttc ctaataatta ttggggctgg ggaggagaag atgatgacat ttttaacaga ttagttttta gaggcatgtc tatatctcgc ccaaatgctg tggtcgggag gtgtcgcatg 1020 atccgccact caagagacaa gaaaaatgaa cccaatcctc agaggtttga ccgaattgca 1080 cacacaaagg agacaatgct ctctgatggt ttgaactcac tcacctacca ggtgctggat gtacagagat acccattgta tacccaaatc acagtggaca tcgggacacc gagctag <210> 2 <211> 398 <212> PRT
<213> Homo sapiens <400> 2 Met Arg Leu Arg Glu Pro Leu Leu Ser Gly Ser Ala Ala Met Pro Gly Ala Ser Leu Gln Arg Ala Cys Arg Leu Leu Val Ala Val Cys Ala Leu His Leu Gly Val Thr Leu Val Tyr Tyr Leu Ala Gly Arg Asp Leu Ser Arg Leu Pro Gln Leu Val Gly Val Ser Thr Pro Leu Gln Gly Gly Ser Asn Ser Ala Ala Ala Ile Gly Gln Ser Ser Gly Glu Leu Arg Thr Gly Gly Ala Arg Pro Pro Pro Pro Lou Gly Ala Ser Ser Gln Pro Arg Pro Gly Gly Asp Ser Ser Pro Val Val Asp Ser Gly Pro Gly Pro Ala Ser Asn Lou Thr Ser Val Pro Val Pro His Thr Thr Ala Lou Ser Leu Pro Ala Cys Pro Glu Glu Ser Pro Leu Leu Val Gly Pro Met Lou Ile Glu Phe Asn Met Pro Val Asp Leu Glu Lou Val Ala Lys Gln Asn Pro Asn Val Lys Met Gly Gly Arg Tyr Ala Pro Arg Asp Cys Val Ser Pro His Lys Val Ala Ile Ile Ile Pro Phe Arg Asn Arg Gln Glu His Lou Lys Tyr Trp Lou Tyr Tyr Lou His Pro Val Lou Gln Arg Gln Gln Leu Asp Tyr Gly Ile Tyr Val Ile Asn Gln Ala Gly Asp Thr Ile Phe Asn Arg Ala Lys Lou Lou Asn Val Gly Phe Gln Glu Ala Leu Lys Asp Tyr Asp Tyr Thr Cys Phe Val Phe Ser Asp Val Asp Lou Ile Pro Met Asn Asp His Asn Ala Tyr Arg Cys Phe Ser Gln Pro Arg His Ile Ser Val Ala Met Asp Lys Phe Gly Phe Ser Leu Pro Tyr Val Gln Tyr Phe Gly Gly Val Ser Ala Leu Ser Lys Gln Gln Phe Lou Thr Ile Asn Gly Phe Pro Asn Asn Tyr Trp Gly Trp Gly Gly Glu Asp Asp Asp Ile Phe Asn Arg Lou Val Phe Arg Gly Met Ser Ile Ser Arg Pro Asn Ala Val Val Gly Arg Cys Arg Met Ile Arg His Ser Arg Asp Lys Lys Asn Glu Pro Asn Pro Gln Arg Phe Asp Arg Ile Ala His Thr Lys Glu Thr Met Leu Ser Asp Gly Lou Asn Ser Lou Thr Tyr Gln Val Leu Asp Val Gln Arg Tyr Pro Lou Tyr Thr Gln Ile Thr Val Asp Ile Gly Thr Pro Ser <210> 3 <211> 1152 <212> DNA
<213> hybrid <400> 3 atgagtaaac ggaatccgaa gattctgaag atttttctgt atatgttact tctcaactct 60 ctctttctca tcatctactt cgtttttcac tcatcgtcgt tttcaccgga gcagtcacag 120 cctcctcata tataccacgt ttcagtgaat aaccaatcgg cgatcgggca gtcctccggg 180 gagctccgga ccggaggggc ccggccgccg cctcctctag gcgcctcctc ccagccgcgc 240 ccgggtggcg actccagccc agtcgtggat tctggccctg gccccgctag caacttgacc 300 tcggtcccag tgccccacac caccgcactg tcgctgcccg cctgccctga ggagtccccg 360 ctgcttgtgg gccccatgct gattgagttt aacatgcctg tggacctgga gctcgtggca 420 aagcagaacc caaatgtgaa gatgggcggc cgctatgccc ccagggactg cgtctctcct 480 cacaaggtgg ccatcatcat tccattccgc aaccggcagg agcacctcaa gtactggcta 540 tattatttgc acccagtcct gcagcgccag cagctggact atggcatcta tgttatcaac 600 caggcgggag acactatatt caatcgtgct aagctcctca atgttggctt tcaagaagcc 660 ttgaaggact atgactacac ctgctttgtg tttagtgacg tggacctcat tccaatgaat 720 gaccataatg cgtacaggtg tttttcacag ccacggcaca tttccgttgc aatggataag 780 tttggattca gcctacctta tgttcagtat tttggaggtg tctctgctct aagtaaacaa 840 cagtttctaa ccatcaatgg atttcctaat aattattggg gctggggagg agaagatgat 900 gacattttta acagattagt ttttagaggc atgtctatat ctcgcccaaa tgctgtggtc 960 gggaggtgtc gcatgatccg ccactcaaga gacaagaaaa atgaacccaa tcctcagagg 1020 tttgaccgaa ttgcacacac aaaggagaca atgctctctg atggtttgaa ctcactcacc 1080 taccaggtgc tggatgtaca gagataccca ttgtataccc aaatcacagt ggacatcggg 1140 acaccgagct ag 1152 <210> 4 <211> 383 <212> PRT
<213> hybrid <400> 4 Met Ser Lys Arg Asn Pro Lys Ile Leu Lys Ile Phe Leu Tyr Met Leu Leu Leu Asn Ser Leu Phe Leu Ile Ile Tyr Phe Val Phe His Ser Ser Ser Phe Ser Pro Glu Gin Ser Gin Pro Pro His Ile Tyr His Val Ser Val Asn Asn Gin Ser Ala Ile Gly Gin Ser Ser Gly Glu Leu Arg Thr Gly Gly Ala Arg Pro Pro Pro Pro Leu Gly Ala Ser Ser Gin Pro Arg Pro Gly Gly Asp Ser Ser Pro Val Val Asp Ser Gly Pro Gly Pro Ala Ser Asn Leu Thr Ser Val Pro Val Pro His Thr Thr Ala Leu Ser Leu Pro Ala Cys Pro Glu Glu Ser Pro Leu Leu Val Gly Pro Met Leu Ile Glu Phe Asn Met Pro Val Asp Leu Glu Leu Val Ala Lys Gin Asn Pro Asn Val Lys Met Gly Gly Arg Tyr Ala Pro Arg Asp Cys Val Ser Pro His Lys Val Ala Ile Ile Ile Pro Phe Arg Asn Arg Gin Glu His Leu Lys Tyr Trp Leu Tyr Tyr Leu His Pro Val Leu Gin Arg Gin Gin Leu Asp Tyr Gly Ile Tyr Val Ile Asn Gin Ala Gly Asp Thr Ile Phe Asn Arg Ala Lys Leu Leu Asn Val Gly Phe Gln Glu Ala Leu Lys Asp Tyr Asp Tyr Thr Cys Phe Val Phe Ser Asp Val Asp Leu Ile Pro Met Asn Asp His Asn Ala Tyr Arg Cys Phe Ser Gln Pro Arg His Ile Ser Val Ala Met Asp Lys Phe Gly Phe Ser Leu Pro Tyr Val Gln Tyr Phe Gly Gly Val Ser Ala Leu Ser Lys Gln Gln Phe Leu Thr Ile Asn Gly Phe Pro Asn Asn Tyr Trp Gly Trp Gly Gly Glu Asp Asp Asp Ile Phe Asn Arg Leu Val Phe Arg Gly Met Ser Ile Ser Arg Pro Asn Ala Val Val Gly Arg Cys Arg Met Ile Arg His Ser Arg Asp Lys Lys Asn Glu Pro Asn Pro Gln Arg Phe Asp Arg Ile Ala His Thr Lys Glu Thr Met Leu Ser Asp Gly Leu Asn Ser Leu Thr Tyr Gln Val Leu Asp Val Gln Arg Tyr Pro Leu Tyr Thr Gln Ile Thr Val Asp Ile Gly Thr Pro Ser <210> 5 <211> 1642 <212> DNA
<213> Homo sapiens <400> 5 ccatggtgat gagacgctac aagctctttc tcatgttctg tatggccggc ctgtgcctca 60 tctccttcct gcacttcttc aagaccctgt cctatgtcac cttcccccga gaactggcct 120 ccctcagccc taacctggtg tccagctttt tctggaacaa tgccccggtc acgccccagg 180 ccagccccga gccaggaggc cctgacctgc tgcgtacccc actctactcc cactcgcccc 240 tgctgcagcc gctgccgccc agcaaggcgg ccgaggagct ccaccgggtg gacttggtgc 300 tgcccgagga caccaccgag tatttcgtgc gcaccaaggc cggcggcgtc tgcttcaaac 360 ccggcaccaa gatgctggag aggccgcccc cgggacggcc ggaggagaag cctgaggggg 420 ccaacggctc ctcggcccgg cggccacccc ggtacctcct gagcgcccgg gagcgcacgg 480 ggggccgagg cgcccggcgc aagtgggtgg agtgcgtgtg cctgcccggc tggcacggac 540 ccagctgcgg cgtgcccact gtggtgcagt actccaacct gcccaccaag gagcggctgg 600 tgcccaggga ggtgccgcgc cgcgtcatca acgccatcaa cgtcaaccac gagttcgacc 660 tgctggacgt gcgcttccac gagctgggcg acgtggtgga cgcctttgtg gtgtgcgagt 720 ccaacttcac ggcttatggg gagccgcggc cgctcaagtt ccgggagatg ctgaccaatg 780 gcaccttcga gtacatccgc cacaaggtgc tctatgtctt cctggaccac ttcccgcccg 840 gcggccggca ggacggctgg atcgccgacg actacctgcg caccttcctc acccaggacg 900 gcgtctcgcg gctgcgcaac ctgcggcccg acgacgtctt catcattgac gatgcggacg 960 agatcccggc ccgtgacggc gtccttttcc tcaagctcta cgatggctgg accgagccct 1020 tcgccttcca catgcgcaag tcgctctacg gcttcttctg gaagcagccg ggcaccctgg 1080 aggtggtgtc aggctgcacg gtggacatgc tgcaggcagt gtatgggctg gacggcatcc 1140 gcctgcgccg ccgccagtac tacaccatgc ccaacttcag acagtatgag aaccgcaccg 1200 gccacatcct ggtgcagtgg tcgctgggca gccccctgca cttcgccggc tggcactgct 1260 cctggtgctt cacgcccgag ggcatctact tcaagctcgt gtccgcccag aatggcgact 1320 tcccacgctg gggtgactac gaggacaagc gggacctgaa ctacatccgc ggcctgatcc 1380 gcaccggggg ctggttcgac ggcacgcagc aggagtaccc gcctgcagac cccagcgagc 1440 acatgtatgc gcccaagtac ctgctgaaga actacgaccg gttccactac ctgctggaca 1500 acccctacca ggagcccagg agcacggcgg cgggcgggtg gcgccacagg ggtcccgagg 1560 gaaggccgcc cgcccggggc aaactggacg aggcggaagt cgaacaaaaa ctcatctcag 1620 aagaggatct gaattaggat cc 1642 <210> 6 <211> 544 <212> PRT
<213> Homo sapiens <400> 6 Met Val Met Arg Arg Tyr Lys Lou Phe Leu Met Phe Cys Met Ala Gly Lou Cys Lou Ile Ser Phe Leu His Phe Phe Lys Thr Lou Ser Tyr Val Thr Phe Pro Arg Glu Leu Ala Ser Leu Ser Pro Asn Leu Val Ser Ser Phe Phe Trp Asn Asn Ala Pro Val Thr Pro Gin Ala Ser Pro Glu Pro Gly Gly Pro Asp Leu Leu Arg Thr Pro Leu Tyr Ser His Ser Pro Leu Leu Gin Pro Leu Pro Pro Ser Lys Ala Ala Glu Glu Lou His Arg Val Asp Leu Val Lou Pro Glu Asp Thr Thr Glu Tyr Phe Val Arg Thr Lys Ala Gly Gly Val Cys Phe Lys Pro Gly Thr Lys Met Lou Glu Arg Pro Pro Pro Gly Arg Pro Glu Glu Lys Pro Glu Gly Ala Asn Gly Ser Ser Ala Arg Arg Pro Pro Arg Tyr Lou Lou Ser Ala Arg Glu Arg Thr Gly Gly Arg Gly Ala Arg Arg Lys Trp Val Glu Cys Val Cys Leu Pro Gly Trp His Gly Pro Ser Cys Gly Val Pro Thr Val Val Gin Tyr Ser Asn Lou Pro Thr Lys Glu Arg Leu Val Pro Arg Glu Val Pro Arg Arg Val Ile Asn Ala Ile Asn Val Asn His Glu Phe Asp Leu Lou Asp Val Arg Phe His Glu Leu Gly Asp Val Val Asp Ala Phe Val Val Cys Glu Ser Asn Phe Thr Ala Tyr Gly Glu Pro Arg Pro Leu Lys Phe Arg Glu Met Leu Thr Asn Gly Thr Phe Glu Tyr Ile Arg His Lys Val Lou Tyr Val Phe Lou Asp His Phe Pro Pro Gly Gly Arg Gin Asp Gly Trp Ile Ala Asp Asp Tyr Lou Arg Thr Phe Lou Thr Gin Asp Gly Val Ser Arg Leu Arg Asn Lou Arg Pro Asp Asp Val Phe Ile Ile Asp Asp Ala Asp Glu Ile Pro Ala Arg Asp Gly Val Leu Phe Leu Lys Leu Tyr Asp Gly Trp Thr Glu Pro Phe Ala Phe His Met Arg Lys Ser Leu Tyr Gly Phe Phe Trp Lys Gin Pro Gly Thr Leu Glu Val Val Ser Gly Cys Thr Val Asp Met Leu Gin Ala Val Tyr Gly Leu Asp Gly Ile Arg Leu Arg Arg Arg Gin Tyr Tyr Thr Met Pro Asn Phe Arg Gin Tyr Glu Asn Arg Thr Gly His Ile Leu Val Gin Trp Ser Leu Gly Ser Pro Leu His Phe Ala Gly Trp His Cys Ser Trp Cys Phe Thr Pro Glu Gly Ile Tyr Phe Lys Leu Val Ser Ala Gin Asn Gly Asp Phe Pro Arg Trp Gly Asp Tyr Glu Asp Lys Arg Asp Leu Asn Tyr Ile Arg Gly Leu Ile Arg Thr Gly Gly Trp Phe Asp Gly Thr Gin Gin Glu Tyr Pro Pro Ala Asp Pro Ser Glu His Met Tyr Ala Pro Lys Tyr Leu Leu Lys Asn Tyr Asp Arg Phe His Tyr Leu Leu Asp Asn Pro Tyr Gin Glu Pro Arg Ser Thr Ala Ala Gly Gly Trp Arg His Arg Gly Pro Glu Gly Arg Pro Pro Ala Arg Gly Lys Leu Asp Glu Ala Glu Val Glu Gin Lys Leu Ile Ser Glu Glu Asp Leu Asn <210> 7 <211> 10 <212> PRT
<213> Artificial Sequence <220>
<223> Synthetic <400> 7 Glu Gin Lys Leu Ile Ser Glu Glu Asp Leu <210> 8 <211> 31 <212> PRT
<213> Homo sapiens <400> 8 Gin Glu Pro Arg Ser Thr Ala Ala Gly Gly Trp Arg His Arg Gly Pro Glu Gly Arg Pro Pro Ala Arg Gly Lys Leu Asp Glu Ala Glu Val <210> 9 <211> 1614 <212> DNA
<213> hybrid <400> 9 catgagtaaa cggaatccga agattctgaa gatttttctg tatatgttac ttctcaactc 60 tctctttctc atcatctact tcgtttttca ctcatcgtcg ttttcaccgg agcagtcaca 120 gcctcctcat atataccacg tttcagtgaa taaccaatcg gcacatggag gccctgacct 180 gctgcgtacc ccactctact cccactcgcc cctgctgcag ccgctgccgc ccagcaaggc 240 ggccgaggag ctccaccggg tggacttggt gctgcccgag gacaccaccg agtatttcgt 300 gcgcaccaag gccggcggcg tctgcttcaa acccggcacc aagatgctgg agaggccgcc 360 cccgggacgg ccggaggaga agcctgaggg ggccaacggc tcctcggccc ggcggccacc 420 ccggtacctc ctgagcgccc gggagcgcac ggggggccga ggcgcccggc gcaagtgggt 480 ggagtgcgtg tgcctgcccg gctggcacgg acccagctgc ggcgtgccca ctgtggtgca 540 gtactccaac ctgcccacca aggagcggct ggtgcccagg gaggtgccgc gccgcgtcat 600 caacgccatc aacgtcaacc acgagttcga cctgctggac gtgcgcttcc acgagctggg 660 cgacgtggtg gacgcctttg tggtgtgcga gtccaacttc acggcttatg gggagccgcg 720 gccgctcaag ttccgggaga tgctgaccaa tggcaccttc gagtacatcc gccacaaggt 780 gctctatgtc ttcctggacc acttcccgcc cggcggccgg caggacggct ggatcgccga 840 cgactacctg cgcaccttcc tcacccagga cggcgtctcg cggctgcgca acctgcggcc 900 cgacgacgtc ttcatcattg acgatgcgga cgagatcccg gcccgtgacg gcgtcctttt 960 cctcaagctc tacgatggct ggaccgagcc cttcgccttc cacatgcgca agtcgctcta 1020 cggcttcttc tggaagcagc cgggcaccct ggaggtggtg tcaggctgca cggtggacat 1080 gctgcaggca gtgtatgggc tggacggcat ccgcctgcgc cgccgccagt actacaccat 1140 gcccaacttc agacagtatg agaaccgcac cggccacatc ctggtgcagt ggtcgctggg 1200 cagccccctg cacttcgccg gctggcactg ctcctggtgc ttcacgcccg agggcatcta 1260 cttcaagctc gtgtccgccc agaatggcga cttcccacgc tggggtgact acgaggacaa 1320 gcgggacctg aactacatcc gcggcctgat ccgcaccggg ggctggttcg acggcacgca 1380 gcaggagtac ccgcctgcag accccagcga gcacatgtat gcgcccaagt acctgctgaa 1440 gaactacgac cggttccact acctgctgga caacccctac caggagccca ggagcacggc 1500 ggcgggcggg tggcgccaca ggggtcccga gggaaggccg cccgcccggg gcaaactgga 1560 cgaggcggaa gtcgaacaaa aactcatctc agaagaggat ctgaattagg atcc 1614 <210> 10 <211> 535 <212> PRT
<213> hybrid <400> 10 Met Ser Lys Arg Asn Pro Lys Ile Leu Lys Ile Phe Leu Tyr Met Leu Leu Leu Asn Ser Leu Phe Leu Ile Ile Tyr Phe Val Phe His Ser Ser Ser Phe Ser Pro Glu Gin Ser Gin Pro Pro His Ile Tyr His Val Ser Val Asn Asn Gln Ser Ala His Gly Gly Pro Asp Leu Leu Arg Thr Pro Leu Tyr Ser His Ser Pro Leu Leu Gin Pro Leu Pro Pro Ser Lys Ala Ala Glu Glu Leu His Arg Val Asp Leu Val Leu Pro Glu Asp Thr Thr Glu Tyr Phe Val Arg Thr Lys Ala Gly Gly Val Cys Phe Lys Pro Gly Thr Lys Met Leu Glu Arg Pro Pro Pro Gly Arg Pro Glu Glu Lys Pro Glu Gly Ala Asn Gly Ser Ser Ala Arg Arg Pro Pro Arg Tyr Leu Leu Ser Ala Arg Glu Arg Thr Gly Gly Arg Gly Ala Arg Arg Lys Trp Val Glu Cys Val Cys Leu Pro Gly Trp His Gly Pro Ser Cys Gly Val Pro Thr Val Val Gin Tyr Ser Asn Leu Pro Thr Lys Glu Arg Leu Val Pro Arg Glu Val Pro Arg Arg Val Ile Asn Ala Ile Asn Val Asn His Glu Phe Asp Leu Leu Asp Val Arg Phe His Glu Leu Gly Asp Val Val Asp Ala Phe Val Val Cys Glu Ser Asn Phe Thr Ala Tyr Gly Glu Pro Arg Pro Leu Lys Phe Arg Glu Met Leu Thr Asn Gly Thr Phe Glu Tyr Ile Arg His Lys Val Leu Tyr Val Phe Leu Asp His Phe Pro Pro Gly Gly Arg Gin Asp Gly Trp Ile Ala Asp Asp Tyr Leu Arg Thr Phe Leu Thr Gin Asp Gly Val Ser Arg Leu Arg Asn Leu Arg Pro Asp Asp Val Phe Ile Ile Asp Asp Ala Asp Glu Ile Pro Ala Arg Asp Gly Val Leu Phe Leu Lys Leu Tyr Asp Gly Trp Thr Glu Pro Phe Ala Phe His Met Arg Lys Ser Leu Tyr Gly Phe Phe Trp Lys Gin Pro Gly Thr Leu Glu Val Val Ser Gly Cys Thr Val Asp Met Leu Gin Ala Val Tyr Gly Leu Asp Gly Ile Arg Leu Arg Arg Arg Gin Tyr Tyr Thr Met Pro Asn Phe Arg Gin Tyr Glu Asn Arg Thr Gly His Ile Leu Val Gin Trp Ser Leu Gly Ser Pro Leu His Phe Ala Gly Trp His Cys Ser Trp Cys Phe Thr Pro Glu Gly Ile Tyr Phe Lys Leu Val Ser Ala Gin Asn Gly Asp Phe Pro Arg Trp Gly Asp Tyr Glu Asp Lys Arg Asp Leu Asn Tyr Ile Arg Gly Leu Ile Arg Thr Gly Gly Trp Phe Asp Gly Thr Gin Gin Glu Tyr Pro Pro Ala Asp Pro Ser Glu His Met Tyr Ala Pro Lys Tyr Leu Leu Lys Asn Tyr Asp Arg Phe His Tyr Leu Leu Asp Asn Pro Tyr Gin Glu Pro Arg Ser Thr Ala Ala Gly Gly Trp Arg His Arg Gly Pro Glu Gly Arg Pro Pro Ala Arg Gly Lys Leu Asp Glu Ala Glu Val Glu Gin Lys Leu Ile Ser Glu Glu Asp Leu Asn <210> 11 <211> 13 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic <400> 11 aatacttcca ccc 13 <210> 12 <211> 34 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic <400> 12 ccacccgtta acaatgaaga tgagacgcta caag 34 <210> 13 <211> 29 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic <400> 13 gggccatgga gatgagacgc tacaagctc 29 <210> 14 <211> 28 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic <400> 14 ggatccaatg aagatgagac gctacaag 28 <210> 15 <211> 41 <212> DNA
<213> Artificial Sequence <220>

<223> Synthetic <400> 15 gggcccggga gatcctaatt cagatcctct tctgagatga g 41 <210> 16 <211> 35 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic <400> 16 cccggatcct aattcagatc ctcttctgag atgag 35 <210> 17 <211> 38 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic <400> 17 gggtctagat cctaattcag atcctcttct gagatgag 38 <210> 18 <211> 35 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic <400> 18 ccacccgtta acaatgagta aacggaatcc gaaga 35 <210> 19 <211> 35 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic <400> 19 gggccatggg taaacggaat ccgaagattc tgaag 35 <210> 20 <211> 34 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic <400> 20 cccggatcca tgagtaaacg gaatccgaag attc 34 <210> 21 <211> 29 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic <400> 21 gcgccccggg acgctagctc ggtgtcccg 29 <210> 22 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic <400> 22 cccggatcca cgctagctcg gtgtc 25 <210> 23 <211> 31 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic <400> 23 gggtctagat ccacgctagc tcggtgtccc g 31 <210> 24 <211> 39 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic <400> 24 ccacccgtta acaatgaggc ttcgggagcc gctcctgag 39 <210> 25 <211> 31 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic <400> 25 gggccatggg gcttcgggag ccgctcctga g 31 <210> 26 <211> 35 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic <400> 26 cccggatcca tgaggcttcg ggagccgctc ctgag 35 <210> 27 <211> 7155 <212> DNA
<213> hybrid <400> 27 ggcgcgcctc gaggcgatcg cagatctaat ctaaccaatt acgatacgct ttgggtacac 60 ttgatttttg tttcagtggt tacatatatc ttgttttata tgctatcttt aaggatctgc 120 acaaagatta tttgttgatg ttcttgatgg ggctcagaag atttgatatg atacactcta 180 atctttagga gataccagcc aggattatat tcagtaagac aatcaaattt tacgtgttca 240 aactcgttat cttttcattc aaaggatgag ccagaatctt tatagaatga ttgcaatcga 300 gaatatgttc ggccgatatg cctttgttgg cttcaatatt ctacatatca cacaagaatc 360 gaccgtattg taccctcttt ccataaagga aaacacaata tgcagatgct tttttcccac 420 atgcagtaac atataggtat tcaaaaatgg ctaaaagaag ttggataaca aattgacaac 480 tatttccatt tctgttatat aaatttcaca acacacaaaa gcccgtaatc aagagtctgc 540 ccatgtacga aataacttct attatttggt attgggccta agcccagctc agagtacgtg 600 ggggtaccac atataggaag gtaacaaaat actgcaagat agccccataa cgtaccagcc 660 tctccttacc acgaagagat aagatataag acccaccctg ccacgtgtca catcgtcatg 720 gtggttaatg ataagggatt acatccttct atgtttgtgg acatgatgca tgtaatgtca 780 tgagccacag gatccaatgg ccacaggaac gtaagaatgt agatagattt gattttgtcc 840 gttagatagc aaacaacatt ataaaaggtg tgtatcaata ggaactaatt cactcattgg 900 attcatagaa gtccattcct cctaagtatc tagaaaccat ggcgaggatc tcgtgtgact 960 tgagatttct tctcatcccg gcagctttca tgttcatcta catccagatg aggcttttcc 1020 agacgcaatc acagtatgca gatcgcctca gttccgctat cgaatctgag aaccattgca 1080 ctagtcaaat gcgaggcctc atagatgaag ttagcatcaa acagtcgcgg attgttgccc 1140 tcgaagatat gaagaaccgc caggacgaag aacttgtgca gcttaaggat ctaatccaga 1200 cgtttgaaaa aaaaggaata gcaaaactca ctcaaggtgg agccatggat tccaattcag 1260 gcgccgtcgt tgatatcaca actaaagatc tatacgatag gattgagttt cttgatacag 1320 atggtggtcc atggaaacaa ggttggagag ttacgtataa agacgatgag tgggagaaag 1380 agaagctcaa aatcttcgtt gttcctcatt ctcataacga tcctggttgg aaattgactg 1440 tagaggagta ttatcagaga caatccagac atattcttga caccattgtt gagactttat 1500 ctaaggtatg acgaaagttt ttgcttttgg ttttaatatt ttaattctct cccatggtta 1560 tcccgtgaac aatcttaaat gtcttaaaat tctcatgacg tcattaaact ctataaccaa 1620 acttctttgc tgggttctgt ttttttttag tttcgtgatg aaacagagtt ctagaagttc 1680 gttcttttgg aaaatttgaa gtctttggag ctaaagtttg tttttttatt actgggtttt 1740 gagattgaag gatagctaga atcttatttg tgtgggggtt tgttttgaat atgtttaata 1800 ggattcaaga agaaagttta tatgggagga gatgtcatat ctggagagat ggtggagaga 1860 cgcttcacct aataaacaag aagctttgac taaattggtt aaggatgggc agctagagat 1920 tgttggaggt ggctgggtta tgaatgatga ggctaattca cattattttg ccataattga 1980 acagatagca gagggtaata tgtggctgaa tgacacaatt ggggttattc ctaagaattc 2040 ttgggctata gatccctttg gctattcatc aaccatggct tatcttctcc ggcgtatggg 2100 ttttgaaaac atgcttattc aaaggactca ttacgagctc aagaaagacc ttgcccagca 2160 taagaatctt gaatatattt ggcgtcagag ctgggatgct atggaaacca cagatatctt 2220 tgttcatatg atgccgtttt attcatacga tatcccacac acttgtggac cagagcctgc 2280 aatttgctgt cagtttgatt tcgctcggat gcggggattt aagtatgaac tttgtccatg 2340 gggaaagcac ccagtggaga ccacactaga aaatgtgcag gagagggcat taaagcttct 2400 ggatcaatac aggaaaaaat ccactctata tcgaactaat acacttctta tacctcttgg 2460 agatgatttt aggtacatta gtatcgatga agccgaggct cagttccgta actaccagat 2520 gttgtttgat cacatcaact ctaatcctag tctaaacgca gaagcaaagt ttggtacttt 2580 ggaggattat ttcagaacag tccgagaaga agcagacaga gtgaattatt ctcgtcctgg 2640 tgaggttggc tctggtcagg ttgttggttt cccttctctg tcaggtgact tctttacata 2700 tgcagatagg caacaagact attggagtgg ttattatgtt tcaagacctt tcttcaaagc 2760 tgttgatcgt gtgctcgagc atacccttcg tggagctgag atcatgatgt catttctgct 2820 aggttattgc catcgaattc aatgtgagaa atttccaaca agttttacgt ataagttgac 2880 tgctgcaaga agaaatctgg ctcttttcca gcaccatgat ggggtaactg gaactgctaa 2940 ggattatgtg gtacaagatt acggcacccg gatgcatact tcattgcaag accttcagat 3000 ctttatgtct aaagcaatcg aagttcttct tgggatccgc cacgagaaag aaaaatctga 3060 tcaatcccca tcatttttcg aggcagagca aatgagatca aagtatgatg ctcggccagt 3120 tcacaagcca attgctgccc gggaaggaaa ttcgcacaca gttatactct tcaatccatc 3180 agaacagacg agagaggagg tggtgacggt tgttgttaac cgcgctgaaa tctcggtttt 3240 ggactcaaac tggacttgtg tccctagcca aatttctcct gaagtgcagc atgacgatac 3300 caaactattc accggcagac atcgccttta ctggaaagct tccatcccag ctcttggtct 3360 gagaacatat ttcattgcta atgggaatgt cgagtgtgag aaagctactc cgtctaaact 3420 caaatacgct tctgagtttg acccatttcc ttgtcctcct ccatattcct gctccaaact 3480 ggacaacgac gttactgaga tccgaaatga acatcagact cttgtgtttg atgtgaagaa 3540 cggatcactg cggaagatag tccatagaaa cggatcagag actgttgtgg gagaagagat 3600 13%38 aggtatgtac tctagtccag agagtggagc ttacctgttc aaaccagatg gtgaagctca 3660 gccaattgtt caacctgatg gacatgtagt cacctctgag ggtctgctgg ttcaagaagt 3720 cttctcttac cctaaaacca aatgggagaa atcacccctc tctcagaaaa ctcgtcttta 3780 cactggaggt aatacgcttc aggatcaagt ggtcgagata gaatatcatg ttgagcttct 3840 tggtaatgat tttgatgacc gggaattgat tgtccggtac aagactgatg ttgacaacaa 3900 gaaggtcttc tattcagatc tcaatggttt ccaaatgagc aggagagaaa cttatgataa 3960 gatccctctt caaggaaact actacccaat gccatctctc gcatttatcc aaggatccaa 4020 tggtcagaga ttctccgtgc actctcgtca atctctcggt gttgcaagcc tcaaagaggg 4080 ttggttggag attatgctgg acagacggtt ggttcgtgat gacggacggg gtctagggca 4140 aggtgtgatg gataaccgcg caatgaccgt ggtatttcac cttcttgcgg aatctaacat 4200 ttctcaagca gaccctgctt ccaacactaa cccgaggaac ccttcgcttc tctctcacct 4260 cataggtgct cacttaaact accccataaa cacattcatt gccaagaaac cgcaagacat 4320 atctgtgcgt gttccacaat acggttcctt tgctccttta gccaaaccgt taccatgtga 4380 cctccacatt gtaaatttca aggttcctcg tccatccaaa tactctcagc aattggaaga 4440 agacaagcca aggttcgctc ttatcctcaa tagacgagct tgggattcag cttattgcca 4500 taaaggaaga caagtaaact gcacaagcat ggctaatgaa ccagtaaact tttccgacat 4560 gttcaaagat cttgcagctt caaaggtaaa accaacttca ctgaatctct tgcaagaaga 4620 tatggagatt cttgggtacg atgaccaaga gctacctcga gatagttcac agccacggga 4680 aggacgtgtc tcgatctctc ccatggaaat acgagcttat aagcttgaac tgcgacctca 4740 caagtgaacc tgctgaagat ccgctagagt ccgcaaaaat caccagtctc tctctacaaa 4800 tctatctctc tctatttttc tccagaataa tgtgtgagta gttcccagat aagggaatta 4860 gggttcttat agggtttcgc tcatgtgttg agcatataag aaacccttag tatgtatttg 4920 tatttgtaaa atacttctat caataaaatt tctaatccta aaaccaaaat cccgcgcgcg 4980 cctcgaggcg atcgcagatc tcattatacc gttagaagca tagttaaaat ctaaagcttg 5040 tcgttaattc tagtcatttt acattgttgg gttctacatt attaatgaat tttctaatgc 5100 aaatacagaa tttaaatcaa aattgttgaa ttatgctaaa catgtaacat acgtatatct 5160 ccgccttgtg tgttgtatta acttgaagtt atcataagaa ccacaaatac actagtaaat 5220 ctatgagaag gcaggtggca acacaaacaa gagtatctaa gattttcatt tgtgactata 5280 ggaatataat atctcttatc tgatttaatg aatccacatg ttcacttctc atttgtccac 5340 aagatcacaa ctttatcttc aatattcaca acttgttata tccaccacaa tttcattctt 5400 ttcacttagc cccacaaaat actttgtccc cttatttgcc accttttgta tttaatttat 5460 tcttgtggag ctaagtgttc atattattct tcttctcaaa aaaacaaaaa caaaaaaaaa 5520 gagaagaaaa ccatggcgag gatctcgtgt gacttgagat ttcttctcat cccggcagct 5580 ttcatgttca tctacatcca gatgaggctt ttccagacgc aatcacagta tgcagatcgc 5640 ctcagttccg ctatcgaatc tgagaaccat tgcactagtc aaatgcgagg cctcatagat 5700 gaagttagca tcaaacagtc gcggattgtt gccctcgaag atatgaagaa ccgccaggac 5760 gaagaacttg tgcagcttaa ggatctaatc cagacgtttg aaaaaaaagg aatagcaaaa 5820 ctcactcaag gtggagccat ggctctaagg ttgcatagaa ggaaccattt ttcgcctaga 5880 aatacggatc tgttcccgga tttggcaaaa gatcgtgtgg ttatcgtctt gtatgtgcat 5940 aatcgggctc agtattttcg agtcacagtg gaaagtttgt cgaaggttaa aggtataagt 6000 gagacattgt tgattgttag tcatgatggt tactttgaag agatgaatag gattgtggag 6060 agtattaagt tttgtcaagt gaaacagatt ttctcgcctt attcgcctca tatatatcgt 6120 actagcttcc cgggtgtgac cctgaatgat tgtaagaaca agggtgatga ggcaaagggg 6180 cattgtgaag gtaatcctga tcagtatggg aatcatcggt ctccgaagat tgtatctttg 6240 aagcatcact ggtggtggat gatgaacact gtatgggatg ggttggaaga gactaaagga 6300 catgaggggc atatcctttt cattgaagaa gatcattttc tgtttcctaa tgcctatcgt 6360 aacatacaga ctcttacgag gctgaaaccc gcaaagtgtc ctgactgttt tgctgctaat 6420 ttagcaccgt ctgatgtgaa gtcaagagga gaagggcttg aaagtttggt tgcagagaga 6480 atgggaaatg ttgggtattc ttttaataga agtgtgtggg agaatattca tcagaaggca 6540 agagagtttt gtttctttga tgattacaac tgggatataa cgatgtgggc aacggttttc 6600 ccgtcgtttg gttccccggt gtacacattg cgagggccta ggactagtgc ggtacacttt 6660 ggaaaatgtg ggttgcatca aggtagagga gatgagggtg attgcatcga taatggggtc 6720 gtaaacatag aagttaagga aacagataaa gttgtgaaca taaaagaagg atggggagtt 6780 cgggtgtata agcatcaagc gggttataaa gccggtttcg aaggttgggg aggttggggc 6840 gatgataggg accgacattt atgtttggat tttgccacta tgtatcgtta cagcagtagc 6900 agtgcatctc catgaaacgg atccgctaga gtccgcaaaa atcaccagtc tctctctaca 6960 aatctatctc tctctatttt tctccagaat aatgtgtgag tagttcccag ataagggaat 7020 tagggttctt atagggtttc gctcatgtgt tgagcatata agaaaccctt agtatgtatt 7080 tgtatttgta aaatacttct atcaataaaa tttctaatcc taaaaccaaa atcccgcgag 7140 agacctctta attaa 7155 <210> 28 <211> 4993 <212> DNA
<213> hybrid <400> 28 ggcgcgcctc gaggcgatcg cagatctaat ctaaccaatt acgatacgct ttgggtacac 60 ttgatttttg tttcagtggt tacatatatc ttgttttata tgctatcttt aaggatctgc 120 acaaagatta tttgttgatg ttcttgatgg ggctcagaag atttgatatg atacactcta 180 atctttagga gataccagcc aggattatat tcagtaagac aatcaaattt tacgtgttca 240 aactcgttat cttttcattc aaaggatgag ccagaatctt tatagaatga ttgcaatcga 300 gaatatgttc ggccgatatg cctttgttgg cttcaatatt ctacatatca cacaagaatc 360 gaccgtattg taccctcttt ccataaagga aaacacaata tgcagatgct tttttcccac 420 atgcagtaac atataggtat tcaaaaatgg ctaaaagaag ttggataaca aattgacaac 480 tatttccatt tctgttatat aaatttcaca acacacaaaa gcccgtaatc aagagtctgc 540 ccatgtacga aataacttct attatttggt attgggccta agcccagctc agagtacgtg 600 ggggtaccac atataggaag gtaacaaaat actgcaagat agccccataa cgtaccagcc 660 tctccttacc acgaagagat aagatataag acccaccctg ccacgtgtca catcgtcatg 720 gtggttaatg ataagggatt acatccttct atgtttgtgg acatgatgca tgtaatgtca 780 tgagccacag gatccaatgg ccacaggaac gtaagaatgt agatagattt gattttgtcc 840 gttagatagc aaacaacatt ataaaaggtg tgtatcaata ggaactaatt cactcattgg 900 attcatagaa gtccattcct cctaagtatc tagaaaccat ggcgaggatc tcgtgtgact 960 tgagatttct tctcatcccg gcagctttca tgttcatcta catccagatg aggcttttcc 1020 agacgcaatc acagtatgca gatcgcctca gttccgctat cgaatctgag aaccattgca 1080 ctagtcaaat gcgaggcctc atagatgaag ttagcatcaa acagtcgcgg attgttgccc 1140 tcgaagatat gaagaaccgc caggacgaag aacttgtgca gcttaaggat ctaatccaga 1200 cgtttgaaaa aaaaggaata gcaaaactca ctcaaggtgg agccatggat tccaattcag 1260 gcgccgtcgt tgatatcaca actaaagatc tatacgatag gattgagttt cttgatacag 1320 atggtggtcc atggaaacaa ggttggagag ttacgtataa agacgatgag tgggagaaag 1380 agaagctcaa aatcttcgtt gttcctcatt ctcataacga tcctggttgg aaattgactg 1440 tagaggagta ttatcagaga caatccagac atattcttga caccattgtt gagactttat 1500 ctaaggtatg acgaaagttt ttgcttttgg ttttaatatt ttaattctct cccatggtta 1560 tcccgtgaac aatcttaaat gtcttaaaat tctcatgacg tcattaaact ctataaccaa 1620 acttctttgc tgggttctgt ttttttttag tttcgtgatg aaacagagtt ctagaagttc 1680 gttcttttgg aaaatttgaa gtctttggag ctaaagtttg tttttttatt actgggtttt 1740 gagattgaag gatagctaga atcttatttg tgtgggggtt tgttttgaat atgtttaata 1800 ggattcaaga agaaagttta tatgggagga gatgtcatat ctggagagat ggtggagaga 1860 cgcttcacct aataaacaag aagctttgac taaattggtt aaggatgggc agctagagat 1920 tgttggaggt ggctgggtta tgaatgatga ggctaattca cattattttg ccataattga 1980 acagatagca gagggtaata tgtggctgaa tgacacaatt ggggttattc ctaagaattc 2040 ttgggctata gatccctttg gctattcatc aaccatggct tatcttctcc ggcgtatggg 2100 ttttgaaaac atgcttattc aaaggactca ttacgagctc aagaaagacc ttgcccagca 2160 taagaatctt gaatatattt ggcgtcagag ctgggatgct atggaaacca cagatatctt 2220 tgttcatatg atgccgtttt attcatacga tatcccacac acttgtggac cagagcctgc 2280 aatttgctgt cagtttgatt tcgctcggat gcggggattt aagtatgaac tttgtccatg 2340 gggaaagcac ccagtggaga ccacactaga aaatgtgcag gagagggcat taaagcttct 2400 ggatcaatac aggaaaaaat ccactctata tcgaactaat acacttctta tacctcttgg 2460 agatgatttt aggtacatta gtatcgatga agccgaggct cagttccgta actaccagat 2520 gttgtttgat cacatcaact ctaatcctag tctaaacgca gaagcaaagt ttggtacttt 2580 ggaggattat ttcagaacag tccgagaaga agcagacaga gtgaattatt ctcgtcctgg 2640 tgaggttggc tctggtcagg ttgttggttt cccttctctg tcaggtgact tctttacata 2700 tgcagatagg caacaagact attggagtgg ttattatgtt tcaagacctt tcttcaaagc 2760 tgttgatcgt gtgctcgagc atacccttcg tggagctgag atcatgatgt catttctgct 2820 aggttattgc catcgaattc aatgtgagaa atttccaaca agttttacgt ataagttgac 2880 tgctgcaaga agaaatctgg ctcttttcca gcaccatgat ggggtaactg gaactgctaa 2940 ggattatgtg gtacaagatt acggcacccg gatgcatact tcattgcaag accttcagat 3000 ctttatgtct aaagcaatcg aagttcttct tgggatccgc cacgagaaag aaaaatctga 3060 tcaatcccca tcatttttcg aggcagagca aatgagatca aagtatgatg ctcggccagt 3120 tcacaagcca attgctgccc gggaaggaaa ttcgcacaca gttatactct tcaatccatc 3180 agaacagacg agagaggagg tggtgacggt tgttgttaac cgcgctgaaa tctcggtttt 3240 ggactcaaac tggacttgtg tccctagcca aatttctcct gaagtgcagc atgacgatac 3300 caaactattc accggcagac atcgccttta ctggaaagct tccatcccag ctcttggtct 3360 gagaacatat ttcattgcta atgggaatgt cgagtgtgag aaagctactc cgtctaaact 3420 caaatacgct tctgagtttg acccatttcc ttgtcctcct ccatattcct gctccaaact 3480 ggacaacgac gttactgaga tccgaaatga acatcagact cttgtgtttg atgtgaagaa 3540 cggatcactg cggaagatag tccatagaaa cggatcagag actgttgtgg gagaagagat 3600 aggtatgtac tctagtccag agagtggagc ttacctgttc aaaccagatg gtgaagctca 3660 gccaattgtt caacctgatg gacatgtagt cacctctgag ggtctgctgg ttcaagaagt 3720 cttctcttac cctaaaacca aatgggagaa atcacccctc tctcagaaaa ctcgtcttta 3780 cactggaggt aatacgcttc aggatcaagt ggtcgagata gaatatcatg ttgagcttct 3840 tggtaatgat tttgatgacc gggaattgat tgtccggtac aagactgatg ttgacaacaa 3900 gaaggtcttc tattcagatc tcaatggttt ccaaatgagc aggagagaaa cttatgataa 3960 gatccctctt caaggaaact actacccaat gccatctctc gcatttatcc aaggatccaa 4020 tggtcagaga ttctccgtgc actctcgtca atctctcggt gttgcaagcc tcaaagaggg 4080 ttggttggag attatgctgg acagacggtt ggttcgtgat gacggacggg gtctagggca 4140 aggtgtgatg gataaccgcg caatgaccgt ggtatttcac cttcttgcgg aatctaacat 4200 ttctcaagca gaccctgctt ccaacactaa cccgaggaac ccttcgcttc tctctcacct 4260 cataggtgct cacttaaact accccataaa cacattcatt gccaagaaac cgcaagacat 4320 atctgtgcgt gttccacaat acggttcctt tgctccttta gccaaaccgt taccatgtga 4380 cctccacatt gtaaatttca aggttcctcg tccatccaaa tactctcagc aattggaaga 4440 agacaagcca aggttcgctc ttatcctcaa tagacgagct tgggattcag cttattgcca 4500 taaaggaaga caagtaaact gcacaagcat ggctaatgaa ccagtaaact tttccgacat 4560 gttcaaagat cttgcagctt caaaggtaaa accaacttca ctgaatctct tgcaagaaga 4620 tatggagatt cttgggtacg atgaccaaga gctacctcga gatagttcac agccacggga 4680 aggacgtgtc tcgatctctc ccatggaaat acgagcttat aagcttgaac tgcgacctca 4740 caagtgaacc tgctgaagat ccgctagagt ccgcaaaaat caccagtctc tctctacaaa 4800 tctatctctc tctatttttc tccagaataa tgtgtgagta gttcccagat aagggaatta 4860 gggttcttat agggtttcgc tcatgtgttg agcatataag aaacccttag tatgtatttg 4920 tatttgtaaa atacttctat caataaaatt tctaatccta aaaccaaaat cccgcgagag 4980 acctcttaat taa 4993 <210> 29 <211> 3825 <212> DNA
<213> hybrid <400> 29 ccatggcgag gatctcgtgt gacttgagat ttcttctcat cccggcagct ttcatgttca 60 tctacatcca gatgaggctt ttccagacgc aatcacagta tgcagatcgc ctcagttccg 120 ctatcgaatc tgagaaccat tgcactagtc aaatgcgagg cctcatagat gaagttagca 180 tcaaacagtc gcggattgtt gccctcgaag atatgaagaa ccgccaggac gaagaacttg 240 tgcagcttaa ggatctaatc cagacgtttg aaaaaaaagg aatagcaaaa ctcactcaag 300 gtggagccat ggattccaat tcaggcgccg tcgttgatat cacaactaaa gatctatacg 360 ataggattga gtttcttgat acagatggtg gtccatggaa acaaggttgg agagttacgt 420 ataaagacga tgagtgggag aaagagaagc tcaaaatctt cgttgttcct cattctcata 480 acgatcctgg ttggaaattg actgtagagg agtattatca gagacaatcc agacatattc 540 ttgacaccat tgttgagact ttatctaagg tatgacgaaa gtttttgctt ttggttttaa 600 tattttaatt ctctcccatg gttatcccgt gaacaatctt aaatgtctta aaattctcat 660 gacgtcatta aactctataa ccaaacttct ttgctgggtt ctgttttttt ttagtttcgt 720 gatgaaacag agttctagaa gttcgttctt ttggaaaatt tgaagtcttt ggagctaaag 780 tttgtttttt tattactggg ttttgagatt gaaggatagc tagaatctta tttgtgtggg 840 ggtttgtttt gaatatgttt aataggattc aagaagaaag tttatatggg aggagatgtc 900 atatctggag agatggtgga gagacgcttc acctaataaa caagaagctt tgactaaatt 960 ggttaaggat gggcagctag agattgttgg aggtggctgg gttatgaatg atgaggctaa 1020 ttcacattat tttgccataa ttgaacagat agcagagggt aatatgtggc tgaatgacac 1080 aattggggtt attcctaaga attcttgggc tatagatccc tttggctatt catcaaccat 1140 ggcttatctt ctccggcgta tgggttttga aaacatgctt attcaaagga ctcattacga 1200 gctcaagaaa gaccttgccc agcataagaa tcttgaatat atttggcgtc agagctggga 1260 tgctatggaa accacagata tctttgttca tatgatgccg ttttattcat acgatatccc 1320 acacacttgt ggaccagagc ctgcaatttg ctgtcagttt gatttcgctc ggatgcgggg 1380 atttaagtat gaactttgtc catggggaaa gcacccagtg gagaccacac tagaaaatgt 1440 gcaggagagg gcattaaagc ttctggatca atacaggaaa aaatccactc tatatcgaac 1500 taatacactt cttatacctc ttggagatga ttttaggtac attagtatcg atgaagccga 1560 ggctcagttc cgtaactacc agatgttgtt tgatcacatc aactctaatc ctagtctaaa 1620 cgcagaagca aagtttggta ctttggagga ttatttcaga acagtccgag aagaagcaga 1680 cagagtgaat tattctcgtc ctggtgaggt tggctctggt caggttgttg gtttcccttc 1740 tctgtcaggt gacttcttta catatgcaga taggcaacaa gactattgga gtggttatta 1800 tgtttcaaga cctttcttca aagctgttga tcgtgtgctc gagcataccc ttcgtggagc 1860 tgagatcatg atgtcatttc tgctaggtta ttgccatcga attcaatgtg agaaatttcc 1920 aacaagtttt acgtataagt tgactgctgc aagaagaaat ctggctcttt tccagcacca 1980 tgatggggta actggaactg ctaaggatta tgtggtacaa gattacggca cccggatgca 2040 tacttcattg caagaccttc agatctttat gtctaaagca atcgaagttc ttcttgggat 2100 ccgccacgag aaagaaaaat ctgatcaatc cccatcattt ttcgaggcag agcaaatgag 2160 atcaaagtat gatgctcggc cagttcacaa gccaattgct gcccgggaag gaaattcgca 2220 cacagttata ctcttcaatc catcagaaca gacgagagag gaggtggtga cggttgttgt 2280 taaccgcgct gaaatctcgg ttttggactc aaactggact tgtgtcccta gccaaatttc 2340 tcctgaagtg cagcatgacg ataccaaact attcaccggc agacatcgcc tttactggaa 2400 agcttccatc ccagctcttg gtctgagaac atatttcatt gctaatggga atgtcgagtg 2460 tgagaaagct actccgtcta aactcaaata cgcttctgag tttgacccat ttccttgtcc 2520 tcctccatat tcctgctcca aactggacaa cgacgttact gagatccgaa atgaacatca 2580 gactcttgtg tttgatgtga agaacggatc actgcggaag atagtccata gaaacggatc 2640 agagactgtt gtgggagaag agataggtat gtactctagt ccagagagtg gagcttacct 2700 gttcaaacca gatggtgaag ctcagccaat tgttcaacct gatggacatg tagtcacctc 2760 tgagggtctg ctggttcaag aagtcttctc ttaccctaaa accaaatggg agaaatcacc 2820 cctctctcag aaaactcgtc tttacactgg aggtaatacg cttcaggatc aagtggtcga 2880 gatagaatat catgttgagc ttcttggtaa tgattttgat gaccgggaat tgattgtccg 2940 gtacaagact gatgttgaca acaagaaggt cttctattca gatctcaatg gtttccaaat 3000 gagcaggaga gaaacttatg ataagatccc tcttcaagga aactactacc caatgccatc 3060 tctcgcattt atccaaggat ccaatggtca gagattctcc gtgcactctc gtcaatctct 3120 cggtgttgca agcctcaaag agggttggtt ggagattatg ctggacagac ggttggttcg 3180 tgatgacgga cggggtctag ggcaaggtgt gatggataac cgcgcaatga ccgtggtatt 3240 tcaccttctt gcggaatcta acatttctca agcagaccct gcttccaaca ctaacccgag 3300 gaacccttcg cttctctctc acctcatagg tgctcactta aactacccca taaacacatt 3360 cattgccaag aaaccgcaag acatatctgt gcgtgttcca caatacggtt cctttgctcc 3420 tttagccaaa ccgttaccat gtgacctcca cattgtaaat ttcaaggttc ctcgtccatc 3480 caaatactct cagcaattgg aagaagacaa gccaaggttc gctcttatcc tcaatagacg 3540 agcttgggat tcagcttatt gccataaagg aagacaagta aactgcacaa gcatggctaa 3600 tgaaccagta aacttttccg acatgttcaa agatcttgca gcttcaaagg taaaaccaac 3660 ttcactgaat ctcttgcaag aagatatgga gattcttggg tacgatgacc aagagctacc 3720 tcgagatagt tcacagccac gggaaggacg tgtctcgatc tctcccatgg aaatacgagc 3780 ttataagctt gaactgcgac ctcacaagtg aacctgctga agatc 3825 <210> 30 <211> 2181 <212> DNA
<213> hybrid <400> 30 ggcgcgcctc gaggcgatcg cagatctcat tataccgtta gaagcatagt taaaatctaa 60 agcttgtcgt taattctagt cattttacat tgttgggttc tacattatta atgaattttc 120 taatgcaaat acagaattta aatcaaaatt gttgaattat gctaaacatg taacatacgt 180 atatctccgc cttgtgtgtt gtattaactt gaagttatca taagaaccac aaatacacta 240 gtaaatctat gagaaggcag gtggcaacac aaacaagagt atctaagatt ttcatttgtg 300 actataggaa tataatatct cttatctgat ttaatgaatc cacatgttca cttctcattt 360 gtccacaaga tcacaacttt atcttcaata ttcacaactt gttatatcca ccacaatttc 420 attcttttca cttagcccca caaaatactt tgtcccctta tttgccacct tttgtattta 480 atttattctt gtggagctaa gtgttcatat tattcttctt ctcaaaaaaa caaaaacaaa 540 aaaaaagaga agaaaaccat ggcgaggatc tcgtgtgact tgagatttct tctcatcccg 600 gcagctttca tgttcatcta catccagatg aggcttttcc agacgcaatc acagtatgca 660 gatcgcctca gttccgctat cgaatctgag aaccattgca ctagtcaaat gcgaggcctc 720 atagatgaag ttagcatcaa acagtcgcgg attgttgccc tcgaagatat gaagaaccgc 780 caggacgaag aacttgtgca gcttaaggat ctaatccaga cgtttgaaaa aaaaggaata 840 gcaaaactca ctcaaggtgg agccatggct ctaaggttgc atagaaggaa ccatttttcg 900 cctagaaata cggatctgtt cccggatttg gcaaaagatc gtgtggttat cgtcttgtat 960 gtgcataatc gggctcagta ttttcgagtc acagtggaaa gtttgtcgaa ggttaaaggt 1020 ataagtgaga cattgttgat tgttagtcat gatggttact ttgaagagat gaataggatt 1080 gtggagagta ttaagttttg tcaagtgaaa cagattttct cgccttattc gcctcatata 1140 tatcgtacta gcttcccggg tgtgaccctg aatgattgta agaacaaggg tgatgaggca 1200 aaggggcatt gtgaaggtaa tcctgatcag tatgggaatc atcggtctcc gaagattgta 1260 tctttgaagc atcactggtg gtggatgatg aacactgtat gggatgggtt ggaagagact 1320 aaaggacatg aggggcatat ccttttcatt gaagaagatc attttctgtt tcctaatgcc 1380 tatcgtaaca tacagactct tacgaggctg aaacccgcaa agtgtcctga ctgttttgct 1440 gctaatttag caccgtctga tgtgaagtca agaggagaag ggcttgaaag tttggttgca 1500 gagagaatgg gaaatgttgg gtattctttt aatagaagtg tgtgggagaa tattcatcag 1560 aaggcaagag agttttgttt ctttgatgat tacaactggg atataacgat gtgggcaacg 1620 gttttcccgt cgtttggttc cccggtgtac acattgcgag ggcctaggac tagtgcggta 1680 cactttggaa aatgtgggtt gcatcaaggt agaggagatg agggtgattg catcgataat 1740 ggggtcgtaa acatagaagt taaggaaaca gataaagttg tgaacataaa agaaggatgg 1800 ggagttcggg tgtataagca tcaagcgggt tataaagccg gtttcgaagg ttggggaggt 1860 tggggcgatg atagggaccg acatttatgt ttggattttg ccactatgta tcgttacagc 1920 agtagcagtg catctccatg aaacggatcc gctagagtcc gcaaaaatca ccagtctctc 1980 tctacaaatc tatctctctc tatttttctc cagaataatg tgtgagtagt tcccagataa 2040 gggaattagg gttcttatag ggtttcgctc atgtgttgag catataagaa acccttagta 2100 tgtatttgta tttgtaaaat acttctatca ataaaatttc taatcctaaa accaaaatcc 2160 cgcgagagac ctcttaatta a 2181 <210> 31 <211> 1394 <212> DNA
<213> hybrid <400> 31 ccatggcgag gatctcgtgt gacttgagat ttcttctcat cccggcagct ttcatgttca 60 tctacatcca gatgaggctt ttccagacgc aatcacagta tgcagatcgc ctcagttccg 120 ctatcgaatc tgagaaccat tgcactagtc aaatgcgagg cctcatagat gaagttagca 180 tcaaacagtc gcggattgtt gccctcgaag atatgaagaa ccgccaggac gaagaacttg 240 tgcagcttaa ggatctaatc cagacgtttg aaaaaaaagg aatagcaaaa ctcactcaag 300 gtggagccat ggctctaagg ttgcatagaa ggaaccattt ttcgcctaga aatacggatc 360 tgttcccgga tttggcaaaa gatcgtgtgg ttatcgtctt gtatgtgcat aatcgggctc 420 agtattttcg agtcacagtg gaaagtttgt cgaaggttaa aggtataagt gagacattgt 480 tgattgttag tcatgatggt tactttgaag agatgaatag gattgtggag agtattaagt 540 tttgtcaagt gaaacagatt ttctcgcctt attcgcctca tatatatcgt actagcttcc 600 cgggtgtgac cctgaatgat tgtaagaaca agggtgatga ggcaaagggg cattgtgaag 660 gtaatcctga tcagtatggg aatcatcggt ctccgaagat tgtatctttg aagcatcact 720 ggtggtggat gatgaacact gtatgggatg ggttggaaga gactaaagga catgaggggc 780 atatcctttt cattgaagaa gatcattttc tgtttcctaa tgcctatcgt aacatacaga 840 ctcttacgag gctgaaaccc gcaaagtgtc ctgactgttt tgctgctaat ttagcaccgt 900 ctgatgtgaa gtcaagagga gaagggcttg aaagtttggt tgcagagaga atgggaaatg 960 ttgggtattc ttttaataga agtgtgtggg agaatattca tcagaaggca agagagtttt 1020 gtttctttga tgattacaac tgggatataa cgatgtgggc aacggttttc ccgtcgtttg 1080 gttccccggt gtacacattg cgagggccta ggactagtgc ggtacacttt ggaaaatgtg 1140 ggttgcatca aggtagagga gatgagggtg attgcatcga taatggggtc gtaaacatag 1200 aagttaagga aacagataaa gttgtgaaca taaaagaagg atggggagtt cgggtgtata 1260 agcatcaagc gggttataaa gccggtttcg aaggttgggg aggttggggc gatgataggg 1320 accgacattt atgtttggat tttgccacta tgtatcgtta cagcagtagc agtgcatctc 1380 catgaaacgg atcc 1394 <210> 32 <211> 312 <212> DNA
<213> Arabidopsis thaliana <400> 32 ccatggcgag gatctcgtgt gacttgagat ttcttctcat cccggcagct ttcatgttca 60 tctacatcca gatgaggctt ttccagacgc aatcacagta tgcagatcgc ctcagttccg 120 ctatcgaatc tgagaaccat tgcactagtc aaatgcgagg cctcatagat gaagttagca 180 tcaaacagtc gcggattgtt gccctcgaag atatgaagaa ccgccaggac gaagaacttg 240 tgcagcttaa ggatctaatc cagacgtttg aaaaaaaagg aatagcaaaa ctcactcaag 300 gtggagccat gg 312 <210> 33 <211> 276 <212> DNA
<213> Glycine max <400> 33 ccatggcgag agggagcaga tcagtgggta gcagcagcag caaatggagg tactgcaacc 60 cttcctatta cttgaagcgc ccaaagcgtc ttgctctgct cttcatcgtt ttcgtttgtg 120 tctctttcgt tttctgggac cgtcaaactc tcgtcagaga gcaccaggtt gaaatttctg 180 agctgcagaa agaagtgact gatttgaaaa atttggtgga tgatttaaat aacaaacaag 240 gtggtacctc tgggaaaact gacttgggga ccatgg 276 <210> 34 <211> 9240 <212> DNA
<213> hybrid <400> 34 ggcgcgcctc gaggcgatcg cagatccgat ataacaaaat ttgaatcgca cagatcgatc 60 tctttggaga ttctatacct agaaaatgga gacgattttc aaatctctgt aaaaattctg 120 gtttcttctt gacggaagaa gacgacgact ccaatatttc ggttagtact gaaccggaaa 180 gtttgactgg tgcaaccaat ttaatgtacc gtacgtaacg caccaatcgg attttgtatt 240 caatgggcct tatctgtgag cccattaatt gatgtgacgg cctaaactaa atccgaacgg 300 tttatttcag cgatccgcga cggtttgtat tcagccaata gcaatcaatt atgtagcagt 360 ggtgatcctc gtcaaaccag taaagctaga tctggaccgt tgaattggtg caagaaagca 420 catgttgtga tatttttacc cgtacgatta gaaaacttga gaaacacatt gataatcgat 480 aaaaaccgtc cgatcatata aatccgcttt accatcgttg cctataaatt aatatcaata 540 gccgtacacg cgtgaagact gacaatatta tctttttcga attcggagct caagtttgaa 600 attcggagaa gctagagagt tttctgataa ccatggcgag agggagcaga tcagtgggta 660 gcagcagcag caaatggagg tactgcaacc cttcctatta cttgaagcgc ccaaagcgtc 720 ttgctctgct cttcatcgtt ttcgtttgtg tctctttcgt tttctgggac cgtcaaactc 780 tcgtcagaga gcaccaggtt gaaatttctg agctgcagaa agaagtgact gatttgaaaa 840 atttggtgga tgatttaaat aacaaacaag gtggtacctc tgggaaaact gacttgggga 900 ccatgggaca gatgcctgtg gctgctgtag tggttatggc ctgcagtcgt gcagactatc 960 ttgaaaggac tgttaaatca gttttaacat atcaaactcc cgttgcttca aaatatcctc 1020 tatttatatc tcaggatgga tctgatcaag ctgtcaagag caagtcattg agctataatc 1080 aattaacata tatgcagcac ttggattttg aaccagtggt cactgaaagg cctggcgaac 1140 tgactgcgta ctacaagatt gcacgtcact acaagtgggc actggaccag ttgttttaca 1200 aacacaaatt tagtcgagtg attatactag aagatgatat ggaaattgct ccagacttct 1260 ttgattactt tgaggctgca gctagtctca tggataggga taaaaccatt atggctgctt 1320 catcatggaa tgataatgga cagaagcagt ttgtgcatga tccctatgcg ctataccgat 1380 cagatttttt tcctggcctt gggtggatgc tcaagagatc gacttgggat gagttatcac 1440 caaagtggcc aaaggcttac tgggatgatt ggctgagact aaaggaaaac cataaaggcc 1500 gccaattcat tcgaccggaa gtctgtagaa catacaattt tggtgaacat gggtctagtt 1560 tgggacagtt tttcagtcag tatctggaac ctataaagct aaacgatgtg acggttgact 1620 ggaaagcaaa ggacctggga tacctgacag agggaaacta taccaagtac ttttctggct 1680 tagtgagaca agcacgacca attcaaggtt ctgaccttgt cttaaaggct caaaacataa 1740 aggatgatgt tcgtatccgg tataaagacc aagtagagtt tgaacgcatt gcaggggaat 1800 ttggtatatt tgaagaatgg aaggatggtg tgcctcgaac agcatataaa ggagtagtgg 1860 tgtttcgaat ccagacaaca agacgtgtat tcctggttgg gccagattct gtaatgcagc 1920 ttggaattcg aaattcctga tgcggatccg ctagagtccg caaaaatcac cagtctctct 1980 ctacaaatct atctctctct atttttctcc agaataatgt gtgagtagtt cccagataag 2040 ggaattaggg ttcttatagg gtttcgctca tgtgttgagc atataagaaa cccttagtat 2100 gtatttgtat ttgtaaaata cttctatcaa taaaatttct aatcctaaaa ccaaaatccc 2160 gcgcctcgag gcgatcgcag atctaatcta accaattacg atacgctttg ggtacacttg 2220 atttttgttt cagtggttac atatatcttg ttttatatgc tatctttaag gatctgcaca 2280 aagattattt gttgatgttc ttgatggggc tcagaagatt tgatatgata cactctaatc 2340 tttaggagat accagccagg attatattca gtaagacaat caaattttac gtgttcaaac 2400 tcgttatctt ttcattcaaa ggatgagcca gaatctttat agaatgattg caatcgagaa 2460 tatgttcggc cgatatgcct ttgttggctt caatattcta catatcacac aagaatcgac 2520 cgtattgtac cctctttcca taaaggaaaa cacaatatgc agatgctttt ttcccacatg 2580 cagtaacata taggtattca aaaatggcta aaagaagttg gataacaaat tgacaactat 2640 ttccatttct gttatataaa tttcacaaca cacaaaagcc cgtaatcaag agtctgccca 2700 tgtacgaaat aacttctatt atttggtatt gggcctaagc ccagctcaga gtacgtgggg 2760 gtaccacata taggaaggta acaaaatact gcaagatagc cccataacgt accagcctct 2820 ccttaccacg aagagataag atataagacc caccctgcca cgtgtcacat cgtcatggtg 2880 gttaatgata agggattaca tccttctatg tttgtggaca tgatgcatgt aatgtcatga 2940 gccacaggat ccaatggcca caggaacgta agaatgtaga tagatttgat tttgtccgtt 3000 agatagcaaa caacattata aaaggtgtgt atcaatagga actaattcac tcattggatt 3060 catagaagtc cattcctcct aagtatctag aaaccatggc gagagggagc agatcagtgg 3120 gtagcagcag cagcaaatgg aggtactgca acccttccta ttacttgaag cgcccaaagc 3180 gtcttgctct gctcttcatc gttttcgttt gtgtctcttt cgttttctgg gaccgtcaaa 3240 ctctcgtcag agagcaccag gttgaaattt ctgagctgca gaaagaagtg actgatttga 3300 aaaatttggt ggatgattta aataacaaac aaggtggtac ctctgggaaa actgacttgg 3360 ggaccatgga ttccaattca ggcgccgtcg ttgatatcac aactaaagat ctatacgata 3420 ggattgagtt tcttgataca gatggtggtc catggaaaca aggttggaga gttacgtata 3480 aagacgatga gtgggagaaa gagaagctca aaatcttcgt tgttcctcat tctcataacg 3540 atcctggttg gaaattgact gtagaggagt attatcagag acaatccaga catattcttg 3600 acaccattgt tgagacttta tctaaggtat gacgaaagtt tttgcttttg gttttaatat 3660 tttaattctc tcccatggtt atcccgtgaa caatcttaaa tgtcttaaaa ttctcatgac 3720 gtcattaaac tctataacca aacttctttg ctgggttctg ttttttttta gtttcgtgat 3780 gaaacagagt tctagaagtt cgttcttttg gaaaatttga agtctttgga gctaaagttt 3840 gtttttttat tactgggttt tgagattgaa ggatagctag aatcttattt gtgtgggggt 3900 ttgttttgaa tatgtttaat aggattcaag aagaaagttt atatgggagg agatgtcata 3960 tctggagaga tggtggagag acgcttcacc taataaacaa gaagctttga ctaaattggt 4020 taaggatggg cagctagaga ttgttggagg tggctgggtt atgaatgatg aggctaattc 4080 acattatttt gccataattg aacagatagc agagggtaat atgtggctga atgacacaat 4140 tggggttatt cctaagaatt cttgggctat agatcccttt ggctattcat caaccatggc 4200 ttatcttctc cggcgtatgg gttttgaaaa catgcttatt caaaggactc attacgagct 4260 caagaaagac cttgcccagc ataagaatct tgaatatatt tggcgtcaga gctgggatgc 4320 tatggaaacc acagatatct ttgttcatat gatgccgttt tattcatacg atatcccaca 4380 cacttgtgga ccagagcctg caatttgctg tcagtttgat ttcgctcgga tgcggggatt 4440 taagtatgaa ctttgtccat ggggaaagca cccagtggag accacactag aaaatgtgca 4500 ggagagggca ttaaagcttc tggatcaata caggaaaaaa tccactctat atcgaactaa 4560 tacacttctt atacctcttg gagatgattt taggtacatt agtatcgatg aagccgaggc 4620 tcagttccgt aactaccaga tgttgtttga tcacatcaac tctaatccta gtctaaacgc 4680 agaagcaaag tttggtactt tggaggatta tttcagaaca gtccgagaag aagcagacag 4740 agtgaattat tctcgtcctg gtgaggttgg ctctggtcag gttgttggtt tcccttctct 4800 gtcaggtgac ttctttacat atgcagatag gcaacaagac tattggagtg gttattatgt 4860 ttcaagacct ttcttcaaag ctgttgatcg tgtgctcgag catacccttc gtggagctga 4920 gatcatgatg tcatttctgc taggttattg ccatcgaatt caatgtgaga aatttccaac 4980 aagttttacg tataagttga ctgctgcaag aagaaatctg gctcttttcc agcaccatga 5040 tggggtaact ggaactgcta aggattatgt ggtacaagat tacggcaccc ggatgcatac 5100 ttcattgcaa gaccttcaga tctttatgtc taaagcaatc gaagttcttc ttgggatccg 5160 ccacgagaaa gaaaaatctg atcaatcccc atcatttttc gaggcagagc aaatgagatc 5220 aaagtatgat gctcggccag ttcacaagcc aattgctgcc cgggaaggaa attcgcacac 5280 agttatactc ttcaatccat cagaacagac gagagaggag gtggtgacgg ttgttgttaa 5340 ccgcgctgaa atctcggttt tggactcaaa ctggacttgt gtccctagcc aaatttctcc 5400 tgaagtgcag catgacgata ccaaactatt caccggcaga catcgccttt actggaaagc 5460 ttccatccca gctcttggtc tgagaacata tttcattgct aatgggaatg tcgagtgtga 5520 gaaagctact ccgtctaaac tcaaatacgc ttctgagttt gacccatttc cttgtcctcc 5580 tccatattcc tgctccaaac tggacaacga cgttactgag atccgaaatg aacatcagac 5640 tcttgtgttt gatgtgaaga acggatcact gcggaagata gtccatagaa acggatcaga 5700 gactgttgtg ggagaagaga taggtatgta ctctagtcca gagagtggag cttacctgtt 5760 caaaccagat ggtgaagctc agccaattgt tcaacctgat ggacatgtag tcacctctga 5820 gggtctgctg gttcaagaag tcttctctta ccctaaaacc aaatgggaga aatcacccct 5880 ctctcagaaa actcgtcttt acactggagg taatacgctt caggatcaag tggtcgagat 5940 agaatatcat gttgagcttc ttggtaatga ttttgatgac cgggaattga ttgtccggta 6000 caagactgat gttgacaaca agaaggtctt ctattcagat ctcaatggtt tccaaatgag 6060 caggagagaa acttatgata agatccctct tcaaggaaac tactacccaa tgccatctct 6120 cgcatttatc caaggatcca atggtcagag attctccgtg cactctcgtc aatctctcgg 6180 tgttgcaagc ctcaaagagg gttggttgga gattatgctg gacagacggt tggttcgtga 6240 tgacggacgg ggtctagggc aaggtgtgat ggataaccgc gcaatgaccg tggtatttca 6300 ccttcttgcg gaatctaaca tttctcaagc agaccctgct tccaacacta acccgaggaa 6360 cccttcgctt ctctctcacc tcataggtgc tcacttaaac taccccataa acacattcat 6420 tgccaagaaa ccgcaagaca tatctgtgcg tgttccacaa tacggttcct ttgctccttt 6480 agccaaaccg ttaccatgtg acctccacat tgtaaatttc aaggttcctc gtccatccaa 6540 atactctcag caattggaag aagacaagcc aaggttcgct cttatcctca atagacgagc 6600 ttgggattca gcttattgcc ataaaggaag acaagtaaac tgcacaagca tggctaatga 6660 accagtaaac ttttccgaca tgttcaaaga tcttgcagct tcaaaggtaa aaccaacttc 6720 actgaatctc ttgcaagaag atatggagat tcttgggtac gatgaccaag agctacctcg 6780 agatagttca cagccacggg aaggacgtgt ctcgatctct cccatggaaa tacgagctta 6840 taagcttgaa ctgcgacctc acaagtgaac ctgctgaaga tccgctagag tccgcaaaaa 6900 tcaccagtct ctctctacaa atctatctct ctctattttt ctccagaata atgtgtgagt 6960 agttcccaga taagggaatt agggttctta tagggtttcg ctcatgtgtt gagcatataa 7020 gaaaccctta gtatgtattt gtatttgtaa aatacttcta tcaataaaat ttctaatcct 7080 aaaaccaaaa tcccgcgcgc gcctcgaggc gatcgcagat ctcattatac cgttagaagc 7140 atagttaaaa tctaaagctt gtcgttaatt ctagtcattt tacattgttg ggttctacat 7200 tattaatgaa ttttctaatg caaatacaga atttaaatca aaattgttga attatgctaa 7260 acatgtaaca tacgtatatc tccgccttgt gtgttgtatt aacttgaagt tatcataaga 7320 accacaaata cactagtaaa tctatgagaa ggcaggtggc aacacaaaca agagtatcta 7380 agattttcat ttgtgactat aggaatataa tatctcttat ctgatttaat gaatccacat 7440 gttcacttct catttgtcca caagatcaca actttatctt caatattcac aacttgttat 7500 atccaccaca atttcattct tttcacttag ccccacaaaa tactttgtcc ccttatttgc 7560 caccttttgt atttaattta ttcttgtgga gctaagtgtt catattattc ttcttctcaa 7620 aaaaacaaaa acaaaaaaaa agagaagaaa accatggcga gagggagcag atcagtgggt 7680 agcagcagca gcaaatggag gtactgcaac ccttcctatt acttgaagcg cccaaagcgt 7740 cttgctctgc tcttcatcgt tttcgtttgt gtctctttcg ttttctggga ccgtcaaact 7800 ctcgtcagag agcaccaggt tgaaatttct gagctgcaga aagaagtgac tgatttgaaa 7860 aatttggtgg atgatttaaa taacaaacaa ggtggtacct ctgggaaaac tgacttgggg 7920 accatggctc taaggttgca tagaaggaac catttttcgc ctagaaatac ggatctgttc 7980 ccggatttgg caaaagatcg tgtggttatc gtcttgtatg tgcataatcg ggctcagtat 8040 tttcgagtca cagtggaaag tttgtcgaag gttaaaggta taagtgagac attgttgatt 8100 gttagtcatg atggttactt tgaagagatg aataggattg tggagagtat taagttttgt 8160 caagtgaaac agattttctc gccttattcg cctcatatat atcgtactag cttcccgggt 8220 gtgaccctga atgattgtaa gaacaagggt gatgaggcaa aggggcattg tgaaggtaat 8280 cctgatcagt atgggaatca tcggtctccg aagattgtat ctttgaagca tcactggtgg 8340 tggatgatga acactgtatg ggatgggttg gaagagacta aaggacatga ggggcatatc 8400 cttttcattg aagaagatca ttttctgttt cctaatgcct atcgtaacat acagactctt 8460 acgaggctga aacccgcaaa gtgtcctgac tgttttgctg ctaatttagc accgtctgat 8520 gtgaagtcaa gaggagaagg gcttgaaagt ttggttgcag agagaatggg aaatgttggg 8580 tattctttta atagaagtgt gtgggagaat attcatcaga aggcaagaga gttttgtttc 8640 tttgatgatt acaactggga tataacgatg tgggcaacgg ttttcccgtc gtttggttcc 8700 ccggtgtaca cattgcgagg gcctaggact agtgcggtac actttggaaa atgtgggttg 8760 catcaaggta gaggagatga gggtgattgc atcgataatg gggtcgtaaa catagaagtt 8820 aaggaaacag ataaagttgt gaacataaaa gaaggatggg gagttcgggt gtataagcat 8880 caagcgggtt ataaagccgg tttcgaaggt tggggaggtt ggggcgatga tagggaccga 8940 catttatgtt tggattttgc cactatgtat cgttacagca gtagcagtgc atctccatga 9000 aacggatccg ctagagtccg caaaaatcac cagtctctct ctacaaatct atctctctct 9060 atttttctcc agaataatgt gtgagtagtt cccagataag ggaattaggg ttcttatagg 9120 gtttcgctca tgtgttgagc atataagaaa cccttagtat gtatttgtat ttgtaaaata 9180 cttctatcaa taaaatttct aatcctaaaa ccaaaatccc gcgagagacc tcttaattaa 9240 <210> 35 <211> 2180 <212> DNA
<213> hybrid <400> 35 ggcgcgcctc gaggcgatcg cagatccgat ataacaaaat ttgaatcgca cagatcgatc 60 tctttggaga ttctatacct agaaaatgga gacgattttc aaatctctgt aaaaattctg 120 gtttcttctt gacggaagaa gacgacgact ccaatatttc ggttagtact gaaccggaaa 180 gtttgactgg tgcaaccaat ttaatgtacc gtacgtaacg caccaatcgg attttgtatt 240 caatgggcct tatctgtgag cccattaatt gatgtgacgg cctaaactaa atccgaacgg 300 tttatttcag cgatccgcga cggtttgtat tcagccaata gcaatcaatt atgtagcagt 360 ggtgatcctc gtcaaaccag taaagctaga tctggaccgt tgaattggtg caagaaagca 420 catgttgtga tatttttacc cgtacgatta gaaaacttga gaaacacatt gataatcgat 480 aaaaaccgtc cgatcatata aatccgcttt accatcgttg cctataaatt aatatcaata 540 gccgtacacg cgtgaagact gacaatatta tctttttcga attcggagct caagtttgaa 600 attcggagaa gctagagagt tttctgataa ccatggcgag agggagcaga tcagtgggta 660 gcagcagcag caaatggagg tactgcaacc cttcctatta cttgaagcgc ccaaagcgtc 720 ttgctctgct cttcatcgtt ttcgtttgtg tctctttcgt tttctgggac cgtcaaactc 780 tcgtcagaga gcaccaggtt gaaatttctg agctgcagaa agaagtgact gatttgaaaa 840 atttggtgga tgatttaaat aacaaacaag gtggtacctc tgggaaaact gacttgggga 900 ccatgggaca gatgcctgtg gctgctgtag tggttatggc ctgcagtcgt gcagactatc 960 ttgaaaggac tgttaaatca gttttaacat atcaaactcc cgttgcttca aaatatcctc 1020 tatttatatc tcaggatgga tctgatcaag ctgtcaagag caagtcattg agctataatc 1080 aattaacata tatgcagcac ttggattttg aaccagtggt cactgaaagg cctggcgaac 1140 tgactgcgta ctacaagatt gcacgtcact acaagtgggc actggaccag ttgttttaca 1200 aacacaaatt tagtcgagtg attatactag aagatgatat ggaaattgct ccagacttct 1260 ttgattactt tgaggctgca gctagtctca tggataggga taaaaccatt atggctgctt 1320 catcatggaa tgataatgga cagaagcagt ttgtgcatga tccctatgcg ctataccgat 1380 cagatttttt tcctggcctt gggtggatgc tcaagagatc gacttgggat gagttatcac 1440 caaagtggcc aaaggcttac tgggatgatt ggctgagact aaaggaaaac cataaaggcc 1500 gccaattcat tcgaccggaa gtctgtagaa catacaattt tggtgaacat gggtctagtt 1560 tgggacagtt tttcagtcag tatctggaac ctataaagct aaacgatgtg acggttgact 1620 ggaaagcaaa ggacctggga tacctgacag agggaaacta taccaagtac ttttctggct 1680 tagtgagaca agcacgacca attcaaggtt ctgaccttgt cttaaaggct caaaacataa 1740 aggatgatgt tcgtatccgg tataaagacc aagtagagtt tgaacgcatt gcaggggaat 1800 ttggtatatt tgaagaatgg aaggatggtg tgcctcgaac agcatataaa ggagtagtgg 1860 tgtttcgaat ccagacaaca agacgtgtat tcctggttgg gccagattct gtaatgcagc 1920 ttggaattcg aaattcctga tgcggatccg ctagagtccg caaaaatcac cagtctctct 1980 ctacaaatct atctctctct atttttctcc agaataatgt gtgagtagtt cccagataag 2040 ggaattaggg ttcttatagg gtttcgctca tgtgttgagc atataagaaa cccttagtat 2100 gtatttgtat ttgtaaaata cttctatcaa taaaatttct aatcctaaaa ccaaaatccc 2160 gcgagagacc tcttaattaa 2180 <210> 36 <211> 615 <212> DNA
<213> Arabidopsis thaliana <400> 36 ggatccgata taacaaaatt tgaatcgcac agatcgatct ctttggagat tctataccta 60 gaaaatggag acgattttca aatctctgta aaaattctgg tttcttcttg acggaagaag 120 acgacgactc caatatttcg gttagtactg aaccggaaag tttgactggt gcaaccaatt 180 taatgtaccg tacgtaacgc accaatcgga ttttgtattc aatgggcctt atctgtgagc 240 ccattaattg atgtgacggc ctaaactaaa tccgaacggt ttatttcagc gatccgcgac 300 ggtttgtatt cagccaatag caatcaatta tgtagcagtg gtgatcctcg tcaaaccagt 360 aaagctagat ctggaccgtt gaattggtgc aagaaagcac atgttgtgat atttttaccc 420 gtacgattag aaaacttgag aaacacattg ataatcgata aaaaccgtcc gatcatataa 480 atccgcttta ccatcgttgc ctataaatta atatcaatag ccgtacacgc gtgaagactg 540 acaatattat ctttttcgaa ttcggagctc aagtttgaaa ttcggagaag ctagagagtt 600 ttctgataac catgg 615 <210> 37 <211> 1319 <212> DNA
<213> hybrid <400> 37 ccatggcgag agggagcaga tcagtgggta gcagcagcag caaatggagg tactgcaacc 60 cttcctatta cttgaagcgc ccaaagcgtc ttgctctgct cttcatcgtt ttcgtttgtg 120 tctctttcgt tttctgggac cgtcaaactc tcgtcagaga gcaccaggtt gaaatttctg 180 agctgcagaa agaagtgact gatttgaaaa atttggtgga tgatttaaat aacaaacaag 240 gtggtacctc tgggaaaact gacttgggga ccatgggaca gatgcctgtg gctgctgtag 300 tggttatggc ctgcagtcgt gcagactatc ttgaaaggac tgttaaatca gttttaacat 360 atcaaactcc cgttgcttca aaatatcctc tatttatatc tcaggatgga tctgatcaag 420 ctgtcaagag caagtcattg agctataatc aattaacata tatgcagcac ttggattttg 480 aaccagtggt cactgaaagg cctggcgaac tgactgcgta ctacaagatt gcacgtcact 540 acaagtgggc actggaccag ttgttttaca aacacaaatt tagtcgagtg attatactag 600 aagatgatat ggaaattgct ccagacttct ttgattactt tgaggctgca gctagtctca 660 tggataggga taaaaccatt atggctgctt catcatggaa tgataatgga cagaagcagt 720 ttgtgcatga tccctatgcg ctataccgat cagatttttt tcctggcctt gggtggatgc 780 tcaagagatc gacttgggat gagttatcac caaagtggcc aaaggcttac tgggatgatt 840 ggctgagact aaaggaaaac cataaaggcc gccaattcat tcgaccggaa gtctgtagaa 900 catacaattt tggtgaacat gggtctagtt tgggacagtt tttcagtcag tatctggaac 960 ctataaagct aaacgatgtg acggttgact ggaaagcaaa ggacctggga tacctgacag 1020 agggaaacta taccaagtac ttttctggct tagtgagaca agcacgacca attcaaggtt 1080 ctgaccttgt cttaaaggct caaaacataa aggatgatgt tcgtatccgg tataaagacc 1140 aagtagagtt tgaacgcatt gcaggggaat ttggtatatt tgaagaatgg aaggatggtg 1200 tgcctcgaac agcatataaa ggagtagtgg tgtttcgaat ccagacaaca agacgtgtat 1260 tcctggttgg gccagattct gtaatgcagc ttggaattcg aaattcctga tgcggatcc 1319 <210> 38 <211> 4957 <212> DNA
<213> hybrid <400> 38 ggcgcgcctc gaggcgatcg cagatctaat ctaaccaatt acgatacgct ttgggtacac 60 ttgatttttg tttcagtggt tacatatatc ttgttttata tgctatcttt aaggatctgc 120 acaaagatta tttgttgatg ttcttgatgg ggctcagaag atttgatatg atacactcta 180 atctttagga gataccagcc aggattatat tcagtaagac aatcaaattt tacgtgttca 240 aactcgttat cttttcattc aaaggatgag ccagaatctt tatagaatga ttgcaatcga 300 gaatatgttc ggccgatatg cctttgttgg cttcaatatt ctacatatca cacaagaatc 360 gaccgtattg taccctcttt ccataaagga aaacacaata tgcagatgct tttttcccac 420 atgcagtaac atataggtat tcaaaaatgg ctaaaagaag ttggataaca aattgacaac 480 tatttccatt tctgttatat aaatttcaca acacacaaaa gcccgtaatc aagagtctgc 540 ccatgtacga aataacttct attatttggt attgggccta agcccagctc agagtacgtg 600 ggggtaccac atataggaag gtaacaaaat actgcaagat agccccataa cgtaccagcc 660 tctccttacc acgaagagat aagatataag acccaccctg ccacgtgtca catcgtcatg 720 gtggttaatg ataagggatt acatccttct atgtttgtgg acatgatgca tgtaatgtca 780 tgagccacag gatccaatgg ccacaggaac gtaagaatgt agatagattt gattttgtcc 840 gttagatagc aaacaacatt ataaaaggtg tgtatcaata ggaactaatt cactcattgg 900 attcatagaa gtccattcct cctaagtatc tagaaaccat ggcgagaggg agcagatcag 960 tgggtagcag cagcagcaaa tggaggtact gcaacccttc ctattacttg aagcgcccaa 1020 agcgtcttgc tctgctcttc atcgttttcg tttgtgtctc tttcgttttc tgggaccgtc 1080 aaactctcgt cagagagcac caggttgaaa tttctgagct gcagaaagaa gtgactgatt 1140 tgaaaaattt ggtggatgat ttaaataaca aacaaggtgg tacctctggg aaaactgact 1200 tggggaccat ggattccaat tcaggcgccg tcgttgatat cacaactaaa gatctatacg 1260 ataggattga gtttcttgat acagatggtg gtccatggaa acaaggttgg agagttacgt 1320 ataaagacga tgagtgggag aaagagaagc tcaaaatctt cgttgttcct cattctcata 1380 acgatcctgg ttggaaattg actgtagagg agtattatca gagacaatcc agacatattc 1440 ttgacaccat tgttgagact ttatctaagg tatgacgaaa gtttttgctt ttggttttaa 1500 tattttaatt ctctcccatg gttatcccgt gaacaatctt aaatgtctta aaattctcat 1560 gacgtcatta aactctataa ccaaacttct ttgctgggtt ctgttttttt ttagtttcgt 1620 gatgaaacag agttctagaa gttcgttctt ttggaaaatt tgaagtcttt ggagctaaag 1680 tttgtttttt tattactggg ttttgagatt gaaggatagc tagaatctta tttgtgtggg 1740 ggtttgtttt gaatatgttt aataggattc aagaagaaag tttatatggg aggagatgtc 1800 atatctggag agatggtgga gagacgcttc acctaataaa caagaagctt tgactaaatt 1860 ggttaaggat gggcagctag agattgttgg aggtggctgg gttatgaatg atgaggctaa 1920 ttcacattat tttgccataa ttgaacagat agcagagggt aatatgtggc tgaatgacac 1980 aattggggtt attcctaaga attcttgggc tatagatccc tttggctatt catcaaccat 2040 ggcttatctt ctccggcgta tgggttttga aaacatgctt attcaaagga ctcattacga 2100 gctcaagaaa gaccttgccc agcataagaa tcttgaatat atttggcgtc agagctggga 2160 tgctatggaa accacagata tctttgttca tatgatgccg ttttattcat acgatatccc 2220 acacacttgt ggaccagagc ctgcaatttg ctgtcagttt gatttcgctc ggatgcgggg 2280 atttaagtat gaactttgtc catggggaaa gcacccagtg gagaccacac tagaaaatgt 2340 gcaggagagg gcattaaagc ttctggatca atacaggaaa aaatccactc tatatcgaac 2400 taatacactt cttatacctc ttggagatga ttttaggtac attagtatcg atgaagccga 2460 ggctcagttc cgtaactacc agatgttgtt tgatcacatc aactctaatc ctagtctaaa 2520 cgcagaagca aagtttggta ctttggagga ttatttcaga acagtccgag aagaagcaga 2580 cagagtgaat tattctcgtc ctggtgaggt tggctctggt caggttgttg gtttcccttc 2640 tctgtcaggt gacttcttta catatgcaga taggcaacaa gactattgga gtggttatta 2700 tgtttcaaga cctttcttca aagctgttga tcgtgtgctc gagcataccc ttcgtggagc 2760 tgagatcatg atgtcatttc tgctaggtta ttgccatcga attcaatgtg agaaatttcc 2820 aacaagtttt acgtataagt tgactgctgc aagaagaaat ctggctcttt tccagcacca 2880 tgatggggta actggaactg ctaaggatta tgtggtacaa gattacggca cccggatgca 2940 tacttcattg caagaccttc agatctttat gtctaaagca atcgaagttc ttcttgggat 3000 ccgccacgag aaagaaaaat ctgatcaatc cccatcattt ttcgaggcag agcaaatgag 3060 atcaaagtat gatgctcggc cagttcacaa gccaattgct gcccgggaag gaaattcgca 3120 cacagttata ctcttcaatc catcagaaca gacgagagag gaggtggtga cggttgttgt 3180 taaccgcgct gaaatctcgg ttttggactc aaactggact tgtgtcccta gccaaatttc 3240 tcctgaagtg cagcatgacg ataccaaact attcaccggc agacatcgcc tttactggaa 3300 agcttccatc ccagctcttg gtctgagaac atatttcatt gctaatggga atgtcgagtg 3360 tgagaaagct actccgtcta aactcaaata cgcttctgag tttgacccat ttccttgtcc 3420 tcctccatat tcctgctcca aactggacaa cgacgttact gagatccgaa atgaacatca 3480 gactcttgtg tttgatgtga agaacggatc actgcggaag atagtccata gaaacggatc 3540 agagactgtt gtgggagaag agataggtat gtactctagt ccagagagtg gagcttacct 3600 gttcaaacca gatggtgaag ctcagccaat tgttcaacct gatggacatg tagtcacctc 3660 tgagggtctg ctggttcaag aagtcttctc ttaccctaaa accaaatggg agaaatcacc 3720 cctctctcag aaaactcgtc tttacactgg aggtaatacg cttcaggatc aagtggtcga 3780 gatagaatat catgttgagc ttcttggtaa tgattttgat gaccgggaat tgattgtccg 3840 gtacaagact gatgttgaca acaagaaggt cttctattca gatctcaatg gtttccaaat 3900 gagcaggaga gaaacttatg ataagatccc tcttcaagga aactactacc caatgccatc 3960 tctcgcattt atccaaggat ccaatggtca gagattctcc gtgcactctc gtcaatctct 4020 cggtgttgca agcctcaaag agggttggtt ggagattatg ctggacagac ggttggttcg 4080 tgatgacgga cggggtctag ggcaaggtgt gatggataac cgcgcaatga ccgtggtatt 4140 tcaccttctt gcggaatcta acatttctca agcagaccct gcttccaaca ctaacccgag 4200 gaacccttcg cttctctctc acctcatagg tgctcactta aactacccca taaacacatt 4260 cattgccaag aaaccgcaag acatatctgt gcgtgttcca caatacggtt cctttgctcc 4320 tttagccaaa ccgttaccat gtgacctcca cattgtaaat ttcaaggttc ctcgtccatc 4380 caaatactct cagcaattgg aagaagacaa gccaaggttc gctcttatcc tcaatagacg 4440 agcttgggat tcagcttatt gccataaagg aagacaagta aactgcacaa gcatggctaa 4500 tgaaccagta aacttttccg acatgttcaa agatcttgca gcttcaaagg taaaaccaac 4560 ttcactgaat ctcttgcaag aagatatgga gattcttggg tacgatgacc aagagctacc 4620 tcgagatagt tcacagccac gggaaggacg tgtctcgatc tctcccatgg aaatacgagc 4680 ttataagctt gaactgcgac ctcacaagtg aacctgctga agatccgcta gagtccgcaa 4740 aaatcaccag tctctctcta caaatctatc tctctctatt tttctccaga ataatgtgtg 4800 agtagttccc agataaggga attagggttc ttatagggtt tcgctcatgt gttgagcata 4860 taagaaaccc ttagtatgta tttgtatttg taaaatactt ctatcaataa aatttctaat 4920 cctaaaacca aaatcccgcg agagacctct taattaa 4957 <210> 39 <211> 921 <212> DNA

<213> Chrysanthemum x morifolium <400> 39 agatctaatc taaccaatta cgatacgctt tgggtacact tgatttttgt ttcagtggtt 60 acatatatct tgttttatat gctatcttta aggatctgca caaagattat ttgttgatgt 120 tcttgatggg gctcagaaga tttgatatga tacactctaa tctttaggag ataccagcca 180 ggattatatt cagtaagaca atcaaatttt acgtgttcaa actcgttatc ttttcattca 240 aaggatgagc cagaatcttt atagaatgat tgcaatcgag aatatgttcg gccgatatgc 300 ctttgttggc ttcaatattc tacatatcac acaagaatcg accgtattgt accctctttc 360 cataaaggaa aacacaatat gcagatgctt ttttcccaca tgcagtaaca tataggtatt 420 caaaaatggc taaaagaagt tggataacaa attgacaact atttccattt ctgttatata 480 aatttcacaa cacacaaaag cccgtaatca agagtctgcc catgtacgaa ataacttcta 540 ttatttggta ttgggcctaa gcccagctca gagtacgtgg gggtaccaca tataggaagg 600 taacaaaata ctgcaagata gccccataac gtaccagcct ctccttacca cgaagagata 660 agatataaga cccaccctgc cacgtgtcac atcgtcatgg tggttaatga taagggatta 720 catccttcta tgtttgtgga catgatgcat gtaatgtcat gagccacagg atccaatggc 780 cacaggaacg taagaatgta gatagatttg attttgtccg ttagatagca aacaacatta 840 taaaaggtgt gtatcaatag gaactaattc actcattgga ttcatagaag tccattcctc 900 ctaagtatct agaaaccatg g 921 <210> 40 <211> 3789 <212> DNA
<213> hybrid <400> 40 ccatggcgag agggagcaga tcagtgggta gcagcagcag caaatggagg tactgcaacc 60 cttcctatta cttgaagcgc ccaaagcgtc ttgctctgct cttcatcgtt ttcgtttgtg 120 tctctttcgt tttctgggac cgtcaaactc tcgtcagaga gcaccaggtt gaaatttctg 180 agctgcagaa agaagtgact gatttgaaaa atttggtgga tgatttaaat aacaaacaag 240 gtggtacctc tgggaaaact gacttgggga ccatggattc caattcaggc gccgtcgttg 300 atatcacaac taaagatcta tacgatagga ttgagtttct tgatacagat ggtggtccat 360 ggaaacaagg ttggagagtt acgtataaag acgatgagtg ggagaaagag aagctcaaaa 420 tcttcgttgt tcctcattct cataacgatc ctggttggaa attgactgta gaggagtatt 480 atcagagaca atccagacat attcttgaca ccattgttga gactttatct aaggtatgac 540 gaaagttttt gcttttggtt ttaatatttt aattctctcc catggttatc ccgtgaacaa 600 tcttaaatgt cttaaaattc tcatgacgtc attaaactct ataaccaaac ttctttgctg 660 ggttctgttt ttttttagtt tcgtgatgaa acagagttct agaagttcgt tcttttggaa 720 aatttgaagt ctttggagct aaagtttgtt tttttattac tgggttttga gattgaagga 780 tagctagaat cttatttgtg tgggggtttg ttttgaatat gtttaatagg attcaagaag 840 aaagtttata tgggaggaga tgtcatatct ggagagatgg tggagagacg cttcacctaa 900 taaacaagaa gctttgacta aattggttaa ggatgggcag ctagagattg ttggaggtgg 960 ctgggttatg aatgatgagg ctaattcaca ttattttgcc ataattgaac agatagcaga 1020 gggtaatatg tggctgaatg acacaattgg ggttattcct aagaattctt gggctataga 1080 tccctttggc tattcatcaa ccatggctta tcttctccgg cgtatgggtt ttgaaaacat 1140 gcttattcaa aggactcatt acgagctcaa gaaagacctt gcccagcata agaatcttga 1200 atatatttgg cgtcagagct gggatgctat ggaaaccaca gatatctttg ttcatatgat 1260 gccgttttat tcatacgata tcccacacac ttgtggacca gagcctgcaa tttgctgtca 1320 gtttgatttc gctcggatgc ggggatttaa gtatgaactt tgtccatggg gaaagcaccc 1380 agtggagacc acactagaaa atgtgcagga gagggcatta aagcttctgg atcaatacag 1440 gaaaaaatcc actctatatc gaactaatac acttcttata cctcttggag atgattttag 1500 gtacattagt atcgatgaag ccgaggctca gttccgtaac taccagatgt tgtttgatca 1560 catcaactct aatcctagtc taaacgcaga agcaaagttt ggtactttgg aggattattt 1620 cagaacagtc cgagaagaag cagacagagt gaattattct cgtcctggtg aggttggctc 1680 tggtcaggtt gttggtttcc cttctctgtc aggtgacttc tttacatatg cagataggca 1740 acaagactat tggagtggtt attatgtttc aagacctttc ttcaaagctg ttgatcgtgt 1800 gctcgagcat acccttcgtg gagctgagat catgatgtca tttctgctag gttattgcca 1860 tcgaattcaa tgtgagaaat ttccaacaag ttttacgtat aagttgactg ctgcaagaag 1920 aaatctggct cttttccagc accatgatgg ggtaactgga actgctaagg attatgtggt 1980 acaagattac ggcacccgga tgcatacttc attgcaagac cttcagatct ttatgtctaa 2040 agcaatcgaa gttcttcttg ggatccgcca cgagaaagaa aaatctgatc aatccccatc 2100 atttttcgag gcagagcaaa tgagatcaaa gtatgatgct cggccagttc acaagccaat 2160 tgctgcccgg gaaggaaatt cgcacacagt tatactcttc aatccatcag aacagacgag 2220 agaggaggtg gtgacggttg ttgttaaccg cgctgaaatc tcggttttgg actcaaactg 2280 gacttgtgtc cctagccaaa tttctcctga agtgcagcat gacgatacca aactattcac 2340 cggcagacat cgcctttact ggaaagcttc catcccagct cttggtctga gaacatattt 2400 cattgctaat gggaatgtcg agtgtgagaa agctactccg tctaaactca aatacgcttc 2460 tgagtttgac ccatttcctt gtcctcctcc atattcctgc tccaaactgg acaacgacgt 2520 tactgagatc cgaaatgaac atcagactct tgtgtttgat gtgaagaacg gatcactgcg 2580 gaagatagtc catagaaacg gatcagagac tgttgtggga gaagagatag gtatgtactc 2640 tagtccagag agtggagctt acctgttcaa accagatggt gaagctcagc caattgttca 2700 acctgatgga catgtagtca cctctgaggg tctgctggtt caagaagtct tctcttaccc 2760 taaaaccaaa tgggagaaat cacccctctc tcagaaaact cgtctttaca ctggaggtaa 2820 tacgcttcag gatcaagtgg tcgagataga atatcatgtt gagcttcttg gtaatgattt 2880 tgatgaccgg gaattgattg tccggtacaa gactgatgtt gacaacaaga aggtcttcta 2940 ttcagatctc aatggtttcc aaatgagcag gagagaaact tatgataaga tccctcttca 3000 aggaaactac tacccaatgc catctctcgc atttatccaa ggatccaatg gtcagagatt 3060 ctccgtgcac tctcgtcaat ctctcggtgt tgcaagcctc aaagagggtt ggttggagat 3120 tatgctggac agacggttgg ttcgtgatga cggacggggt ctagggcaag gtgtgatgga 3180 taaccgcgca atgaccgtgg tatttcacct tcttgcggaa tctaacattt ctcaagcaga 3240 ccctgcttcc aacactaacc cgaggaaccc ttcgcttctc tctcacctca taggtgctca 3300 cttaaactac cccataaaca cattcattgc caagaaaccg caagacatat ctgtgcgtgt 3360 tccacaatac ggttcctttg ctcctttagc caaaccgtta ccatgtgacc tccacattgt 3420 aaatttcaag gttcctcgtc catccaaata ctctcagcaa ttggaagaag acaagccaag 3480 gttcgctctt atcctcaata gacgagcttg ggattcagct tattgccata aaggaagaca 3540 agtaaactgc acaagcatgg ctaatgaacc agtaaacttt tccgacatgt tcaaagatct 3600 tgcagcttca aaggtaaaac caacttcact gaatctcttg caagaagata tggagattct 3660 tgggtacgat gaccaagagc tacctcgaga tagttcacag ccacgggaag gacgtgtctc 3720 gatctctccc atggaaatac gagcttataa gcttgaactg cgacctcaca agtgaacctg 3780 ctgaagatc 3789 <210> 41 <211> 2145 <212> DNA
<213> hybrid <400> 41 ggcgcgcctc gaggcgatcg cagatctcat tataccgtta gaagcatagt taaaatctaa 60 agcttgtcgt taattctagt cattttacat tgttgggttc tacattatta atgaattttc 120 taatgcaaat acagaattta aatcaaaatt gttgaattat gctaaacatg taacatacgt 180 atatctccgc cttgtgtgtt gtattaactt gaagttatca taagaaccac aaatacacta 240 gtaaatctat gagaaggcag gtggcaacac aaacaagagt atctaagatt ttcatttgtg 300 actataggaa tataatatct cttatctgat ttaatgaatc cacatgttca cttctcattt 360 gtccacaaga tcacaacttt atcttcaata ttcacaactt gttatatcca ccacaatttc 420 attcttttca cttagcccca caaaatactt tgtcccctta tttgccacct tttgtattta 480 atttattctt gtggagctaa gtgttcatat tattcttctt ctcaaaaaaa caaaaacaaa 540 aaaaaagaga agaaaaccat ggcgagaggg agcagatcag tgggtagcag cagcagcaaa 600 tggaggtact gcaacccttc ctattacttg aagcgcccaa agcgtcttgc tctgctcttc 660 atcgttttcg tttgtgtctc tttcgttttc tgggaccgtc aaactctcgt cagagagcac 720 caggttgaaa tttctgagct gcagaaagaa gtgactgatt tgaaaaattt ggtggatgat 780 ttaaataaca aacaaggtgg tacctctggg aaaactgact tggggaccat ggctctaagg 840 ttgcatagaa ggaaccattt ttcgcctaga aatacggatc tgttcccgga tttggcaaaa 900 gatcgtgtgg ttatcgtctt gtatgtgcat aatcgggctc agtattttcg agtcacagtg 960 gaaagtttgt cgaaggttaa aggtataagt gagacattgt tgattgttag tcatgatggt 1020 tactttgaag agatgaatag gattgtggag agtattaagt tttgtcaagt gaaacagatt 1080 ttctcgcctt attcgcctca tatatatcgt actagcttcc cgggtgtgac cctgaatgat 1140 tgtaagaaca agggtgatga ggcaaagggg cattgtgaag gtaatcctga tcagtatggg 1200 aatcatcggt ctccgaagat tgtatctttg aagcatcact ggtggtggat gatgaacact 1260 gtatgggatg ggttggaaga gactaaagga catgaggggc atatcctttt cattgaagaa 1320 gatcattttc tgtttcctaa tgcctatcgt aacatacaga ctcttacgag gctgaaaccc 1380 gcaaagtgtc ctgactgttt tgctgctaat ttagcaccgt ctgatgtgaa gtcaagagga 1440 gaagggcttg aaagtttggt tgcagagaga atgggaaatg ttgggtattc ttttaataga 1500 agtgtgtggg agaatattca tcagaaggca agagagtttt gtttctttga tgattacaac 1560 tgggatataa cgatgtgggc aacggttttc ccgtcgtttg gttccccggt gtacacattg 1620 cgagggccta ggactagtgc ggtacacttt ggaaaatgtg ggttgcatca aggtagagga 1680 gatgagggtg attgcatcga taatggggtc gtaaacatag aagttaagga aacagataaa 1740 gttgtgaaca taaaagaagg atggggagtt cgggtgtata agcatcaagc gggttataaa 1800 gccggtttcg aaggttgggg aggttggggc gatgataggg accgacattt atgtttggat 1860 tttgccacta tgtatcgtta cagcagtagc agtgcatctc catgaaacgg atccgctaga 1920 gtccgcaaaa atcaccagtc tctctctaca aatctatctc tctctatttt tctccagaat 1980 aatgtgtgag tagttcccag ataagggaat tagggttctt atagggtttc gctcatgtgt 2040 tgagcatata agaaaccctt agtatgtatt tgtatttgta aaatacttct atcaataaaa 2100 tttctaatcc taaaaccaaa atcccgcgag agacctctta attaa 2145 <210> 42 <211> 541 <212> DNA
<213> Solanum tuberosum <400> 42 agatctcatt ataccgttag aagcatagtt aaaatctaaa gcttgtcgtt aattctagtc 60 attttacatt gttgggttct acattattaa tgaattttct aatgcaaata cagaatttaa 120 atcaaaattg ttgaattatg ctaaacatgt aacatacgta tatctccgcc ttgtgtgttg 180 tattaacttg aagttatcat aagaaccaca aatacactag taaatctatg agaaggcagg 240 tggcaacaca aacaagagta tctaagattt tcatttgtga ctataggaat ataatatctc 300 ttatctgatt taatgaatcc acatgttcac ttctcatttg tccacaagat cacaacttta 360 tcttcaatat tcacaacttg ttatatccac cacaatttca ttcttttcac ttagccccac 420 aaaatacttt gtccccttat ttgccacctt ttgtatttaa tttattcttg tggagctaag 480 tgttcatatt attcttcttc tcaaaaaaac aaaaacaaaa aaaaagagaa gaaaaccatg 540 <210> 43 <211> 1358 <212> DNA
<213> hybrid <400> 43 ccatggcgag agggagcaga tcagtgggta gcagcagcag caaatggagg tactgcaacc 60 cttcctatta cttgaagcgc ccaaagcgtc ttgctctgct cttcatcgtt ttcgtttgtg 120 tctctttcgt tttctgggac cgtcaaactc tcgtcagaga gcaccaggtt gaaatttctg 180 agctgcagaa agaagtgact gatttgaaaa atttggtgga tgatttaaat aacaaacaag 240 gtggtacctc tgggaaaact gacttgggga ccatggctct aaggttgcat agaaggaacc 300 atttttcgcc tagaaatacg gatctgttcc cggatttggc aaaagatcgt gtggttatcg 360 tcttgtatgt gcataatcgg gctcagtatt ttcgagtcac agtggaaagt ttgtcgaagg 420 ttaaaggtat aagtgagaca ttgttgattg ttagtcatga tggttacttt gaagagatga 480 ataggattgt ggagagtatt aagttttgtc aagtgaaaca gattttctcg ccttattcgc 540 ctcatatata tcgtactagc ttcccgggtg tgaccctgaa tgattgtaag aacaagggtg 600 atgaggcaaa ggggcattgt gaaggtaatc ctgatcagta tgggaatcat cggtctccga 660 agattgtatc tttgaagcat cactggtggt ggatgatgaa cactgtatgg gatgggttgg 720 aagagactaa aggacatgag gggcatatcc ttttcattga agaagatcat tttctgtttc 780 ctaatgccta tcgtaacata cagactctta cgaggctgaa acccgcaaag tgtcctgact 840 gttttgctgc taatttagca ccgtctgatg tgaagtcaag aggagaaggg cttgaaagtt 900 tggttgcaga gagaatggga aatgttgggt attcttttaa tagaagtgtg tgggagaata 960 ttcatcagaa ggcaagagag ttttgtttct ttgatgatta caactgggat ataacgatgt 1020 gggcaacggt tttcccgtcg tttggttccc cggtgtacac attgcgaggg cctaggacta 1080 gtgcggtaca ctttggaaaa tgtgggttgc atcaaggtag aggagatgag ggtgattgca 1140 tcgataatgg ggtcgtaaac atagaagtta aggaaacaga taaagttgtg aacataaaag 1200 aaggatgggg agttcgggtg tataagcatc aagcgggtta taaagccggt ttcgaaggtt 1260 ggggaggttg gggcgatgat agggaccgac atttatgttt ggattttgcc actatgtatc 1320 gttacagcag tagcagtgca tctccatgaa acggatcc 1358 <210> 44 <211> 237 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic <400> 44 ggatccgcta gagtccgcaa aaatcaccag tctctctcta caaatctatc tctctctatt 60 tttctccaga ataatgtgtg agtagttccc agataaggga attagggttc ttatagggtt 120 tcgctcatgt gttgagcata taagaaaccc ttagtatgta tttgtatttg taaaatactt 180 ctatcaataa aatttctaat cctaaaacca aaatcccgcg agagacctct taattaa 237 <210> 45 <211> 31 <212> DNA

<213> Artificial Sequence <220>
<223> Synthetic <400> 45 atactcgagt taacaatgag taaacggaat c 31 <210> 46 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic <400> 46 ttctcgatcg ccgattggtt attc 24 <210> 47 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic <400> 47 gccgccgcga tcgggcagtc ctcc 24 <210> 48 <211> 30 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic <400> 48 aacggatcca cgctagctcg gtgtcccgat 30 <210> 49 <211> 3327 <212> DNA
<213> hybrid <400> 49 atgggcatca agatggagac acattctcag gtctttgtat acatgttgct gtggttgtct 60 ggtgtcgaca tgaagcactt caaatcttcc ctcactcaca ccgtcaagag ccgagacgag 120 ccaactccgg atcaatgccc tgcattgaag gaaagcgaag cggacatcga caccgtggcg 180 atatacccaa cttttgattt tcagccgagc tggttgcgta caaaggaatt ttgggacaag 240 tccttcgagg atcggtatga aagaattcat aacgacacta cacggcctag actgaaggta 300 atcgtggttc ctcactcaca caacgacccg ggatggctga agacgtttga acagtacttc 360 gagtggaaga ccaagaacat tatcaacaac atagtgaaca aactgcacca gtaccccaac 420 atgaccttca tttggaccga gatatcgttt ctgaatgcct ggtgggaaag gtcgcaccct 480 gtcaaacaaa aggcattgaa aaaacttatc aaagaaggtc gtctcgagat cacgacgggc 540 ggctgggtga tgccggacga agcctgcacg catatctatg cgctaattga ccagtttatt 600 gaaggacatc actgggtgaa aactaatctc ggcgtcatcc cgaagacagg atggtctatt 660 gaccccttcg gccacggggc cactgtgcct tacctgctag accagagcgg ccttgaggga 720 accattatac agagaatcca ttatgcgtgg aaacagtggc tggcggagcg acagattgag 780 gagttttact ggctggcgag ttgggctact acgaagccgt ccatgatagt gcacaatcag 840 ccgtttgata tttattcaat aaaaagcacg tgtggcccgc acccttcaat ttgtctcagt 900 ttcgacttca ggaagattcc cggcgaatat tctgaataca cagctaagca cgaagacatc 960 acggaacaca acttgcacag caaggcaaag actttgatag aggagtacga ccgtatcggg 1020 tccctgactc cacacaacgt ggtgctggtg ccgctcggag acgacttcag atacgagtac 1080 agcgtcgagt ttgatgccca atacgtcaat tatatgaaaa tgtttaacta catcaatgct 1140 cacaaggaaa tcttcaacgc tgacgtacag ttcggaactc ctctcgatta ctttaacgcc 1200 atgaaagaaa gacatcaaaa tatacccagc ttaaagggag atttcttcgt ttactccgat 1260 attttcagcg aaggtaaacc agcgtactgg tcaggttact acactactag accctaccaa 1320 aaaatcctcg cccgtcagtt cgaacaccaa ctgcgatcgg cagagatttt attcaccctt 1380 gtatcgaact acatcagaca gatgggtcgc caaggagagt tcggagcttc tgagaaaaag 1440 ttagaaaaat cttacgagca gcttatctat gctcgacgga acttgggtct gtttcaacat 1500 cacgatgcga ttactggaac atcaaagtcc agtgtgatgc aagattacgg aaccaaactg 1560 ttcacaagtc tgtatcactg catccgcctg caggaggccg cgctcaccac catcatgttg 1620 cctgaccagt cgttgcactc gcagagcatt atacaaagcg aggttgagtg ggaaacttac 1680 ggaaaaccgc ccaagaagct gcaagtgtcc ttcattgaca agaagaaagt tatacttttt 1740 aatccgttgg ctgagactcg aactgaagtg gtcacggtta gatccaacac gtccaacatc 1800 cgggtgtacg atacacacaa gaggaagcac gtcttgtatc agataatgcc cagcatcaca 1860 atccaagaca acggcaagag tatcgtaagc gacaccacgt tcgacataat gttcgtggcc 1920 accatcccgc ccctcacctc catctcgtac aagctgcagg agcacaccaa cacttcccac 1980 cactgcgtca ttttctgcaa caactgcgaa caataccaga aatccaatgt gttccaaatt 2040 aagaaaatga tgcctggtga catacaatta gaaaatgcag tgctaaaact tctcgttaat 2100 aggaacaccg gctttctgag acaagtctat agaaaggaca tccggaagag aactgtcgtt 2160 gacgtacaat tcggcgcata tcaaagtgcc caaagacatt ctggtgctta cctcttcatg 2220 cctcattacg actcacctga gaagaatgtt ctgcatccct acactaatca gaacaacatg 2280 caagatgata acataatcat agtgtccgga cctatttcta cggaaatcac gaccatgtac 2340 ttgcccttct tggtgcacac tattaggata tacaacgtgc cggacccggt actgtcgcgt 2400 gctattctat tagagaccga tgtagatttc gaggcgccac ctaagaacag agagactgag 2460 ttatttatga gattacagac tgatatacaa aacggtgaca ttcccgaatt ttacaccgat 2520 cagaacggat tccagtacca aaagagggtc aaagtgaata aactaggaat agaagctaat 2580 tactacccga tcactaccat ggcgtgcctg caagacgagg agacccggct cactctgctg 2640 acgaaccacg ctcaaggcgc tgctgcatac gaaccaggac gcttagaagt catgctcgat 2700 cgtcgaactc tttatgatga cttcagagga atcggtgaag gagtagtcga taacaaaccg 2760 acgactttcc agaactggat tttaattgaa tccatgccag gcgtgacgcg agccaagaga 2820 gacactagtg aaccaggttt caaatttgtt aatgaacgtc gttttggccc cggccagaag 2880 gaaagccctt accaagtacc gtcgcagact gcggactacc tgagcaggat gttcaattac 2940 ccggtgaacg tgtacctggt ggacactagc gaggttggcg agatcgaggt gaagccgtac 3000 cagtcgttcc tgcagagctt cccgcccggc atccacctgg tcaccctgcg caccatcacc 3060 gacgacgtgc tcgaactctt ccccagcaac gaaagctaca tggtactgca ccgaccagga 3120 tacagctgcg ctgtcggaga gaagccagtc gccaagtctc ccaagttttc gtccaaaacc 3180 aggttcaatg gtctgaacat tcagaacatc actgcagtca gcctgaccgg cctgaagtca 3240 ctccgacctc tcacaggtct gagtgacatc cacctgaacg ctatggaggt aaaaacttac 3300 aagatcaggt ttaaggacga gctttaa 3327 <210> 50 <211> 1108 <212> PRT
<213> hybrid <400> 50 Met Gly Ile Lys Met Glu Thr His Ser Gin Val Phe Val Tyr Met Leu Leu Trp Leu Ser Gly Val Asp Met Lys His Phe Lys Ser Ser Leu Thr His Thr Val Lys Ser Arg Asp Glu Pro Thr Pro Asp Gin Cys Pro Ala 35 40 '45 Leu Lys Glu Ser Glu Ala Asp Ile Asp Thr Val Ala Ile Tyr Pro Thr Phe Asp Phe Gin Pro Ser Trp Leu Arg Thr Lys Glu Phe Trp Asp Lys Ser Phe Glu Asp Arg Tyr Glu Arg Ile His Asn Asp Thr Thr Arg Pro Arg Leu Lys Val Ile Val Val Pro His Ser His Asn Asp Pro Gly Trp Leu Lys Thr Phe Glu Gin Tyr Phe Glu Trp Lys Thr Lys Asn Ile Ile Asn Asn Ile Val Asn Lys Leu His Gin Tyr Pro Asn Met Thr Phe Ile Trp Thr Glu Ile Ser Phe Leu Asn Ala Trp Trp Glu Arg Ser His Pro Val Lys Gin Lys Ala Leu Lys Lys Leu Ile Lys Glu Gly Arg Leu Glu Ile Thr Thr Gly Gly Trp Val Met Pro Asp Glu Ala Cys Thr His Ile Tyr Ala Leu Ile Asp Gin Phe Ile Glu Gly His His Trp Val Lys Thr Asn Leu Gly Val Ile Pro Lys Thr Gly Trp Ser Ile Asp Pro Phe Gly His Gly Ala Thr Val Pro Tyr Leu Leu Asp Gin Ser Gly Leu Glu Gly Thr Ile Ile Gin Arg Ile His Tyr Ala Trp Lys Gin Trp Leu Ala Glu Arg Gin Ile Glu Glu Phe Tyr Trp Leu Ala Ser Trp Ala Thr Thr Lys Pro Ser Met Ile Val His Asn Gin Pro Phe Asp Ile Tyr Ser Ile Lys Ser Thr Cys Gly Pro His Pro Ser Ile Cys Leu Ser Phe Asp Phe Arg Lys Ile Pro Gly Glu Tyr Ser Glu Tyr Thr Ala Lys His Glu Asp Ile Thr Glu His Asn Leu His Ser Lys Ala Lys Thr Leu Ile Glu Glu Tyr Asp Arg Ile Gly Ser Leu Thr Pro His Asn Val Val Leu Val Pro Leu Gly Asp Asp Phe Arg Tyr Glu Tyr Ser Val Glu Phe Asp Ala Gin Tyr Val Asn Tyr Met Lys Met Phe Asn Tyr Ile Asn Ala His Lys Glu Ile Phe Asn Ala Asp Val Gin Phe Gly Thr Pro Leu Asp Tyr Phe Asn Ala Met Lys Glu Arg His Gin Asn Ile Pro Ser Leu Lys Gly Asp Phe Phe Val Tyr Ser Asp Ile Phe Ser Glu Gly Lys Pro Ala Tyr Trp Ser Gly Tyr Tyr Thr Thr Arg Pro Tyr Gin Lys Ile Leu Ala Arg Gin Phe Glu His Gin Leu Arg Ser Ala Glu Ile Leu Phe Thr Leu Val Ser Asn Tyr Ile Arg Gin Met Gly Arg Gin Gly Glu Phe Gly Ala Ser Glu Lys Lys Lou Glu Lys Ser Tyr Glu Gin Leu Ile Tyr Ala Arg Arg Asn Leu Gly Leu Phe Gin His His Asp Ala Ile Thr Gly Thr Ser Lys Ser Ser Val Met Gin Asp Tyr Gly Thr Lys Leu Phe Thr Ser Leu Tyr His Cys Ile Arg Leu Gin Glu Ala Ala Leu Thr Thr Ile Met Leu Pro Asp Gin Ser Leu His Ser Gin Ser Ile Ile Gin Ser Glu Val Glu Trp Glu Thr Tyr Gly Lys Pro Pro Lys Lys Leu Gin Val Ser Phe Ile Asp Lys Lys Lys Val Ile Leu Phe Asn Pro Lou Ala Glu Thr Arg Thr Glu Val Val Thr Val Arg Ser Asn Thr Ser Asn Ile Arg Val Tyr Asp Thr His Lys Arg Lys His Val Leu Tyr Gin Ile Met Pro Ser Ile Thr Ile Gin Asp Asn Gly Lys Ser Ile Val Ser Asp Thr Thr Phe Asp Ile Met Phe Val Ala Thr Ile Pro Pro Lou Thr Ser Ile Ser Tyr Lys Lou Gin Glu His Thr Asn Thr Ser His His Cys Val Ile Phe Cys Asn Asn Cys Glu Gin Tyr Gin Lys Ser Asn Val Phe Gin Ile Lys Lys Met Met Pro Gly Asp Ile Gin Lou Glu Asn Ala Val Leu Lys Lou Lou Val Asn Arg Asn Thr Gly Phe Leu Arg Gin Val Tyr Arg Lys Asp Ile Arg Lys Arg Thr Val Val Asp Val Gin Phe Gly Ala Tyr Gin Ser Ala Gin Arg His Ser Gly Ala Tyr Leu Phe Met Pro His Tyr Asp Ser Pro Glu Lys Asn Val Lou His Pro Tyr Thr Asn Gin Asn Asn Met Gin Asp Asp Asn Ile Ile Ile Val Ser Gly Pro Ile Ser Thr Glu Ile Thr Thr Met Tyr Leu Pro Phe Leu Val His Thr Ile Arg Ile Tyr Asn Val Pro Asp Pro Val Leu Ser Arg Ala Ile Lou Lou Glu Thr Asp Val Asp Phe Glu Ala Pro Pro Lys Asn Arg Glu Thr Glu Leu Phe Met Arg Lou Gin Thr Asp Ile Gin Asn Gly Asp Ile Pro Glu Phe Tyr Thr Asp Gin Asn Gly Phe Gin Tyr Gin Lys Arg Val Lys Val Asn Lys Leu Gly Ile Glu Ala Asn Tyr Tyr Pro Ile Thr Thr Met Ala Cys Leu Gin Asp Glu Glu Thr Arg Leu Thr Leu Leu Thr Asn His Ala Gin Gly Ala Ala Ala Tyr Glu Pro Gly Arg Leu Glu Val Met Leu Asp Arg Arg Thr Leu Tyr Asp Asp Phe Arg Gly Ile Gly Glu Gly Val Val Asp Asn Lys Pro Thr Thr Phe Gin Asn Trp Ile Leu Ile Glu Ser Met Pro Gly Val Thr Arg Ala Lys Arg Asp Thr Ser Glu Pro Gly Phe Lys Phe Val Asn Glu Arg Arg Phe Gly Pro Gly Gin Lys Glu Ser Pro Tyr Gin Val Pro Ser Gin Thr Ala Asp Tyr Leu Ser Arg Met Phe Asn Tyr Pro Val Asn Val Tyr Leu Val Asp Thr Ser Glu Val Gly Glu Ile Glu Val Lys Pro Tyr Gin Ser Phe Leu Gin Ser Phe Pro Pro Gly Ile His Leu Val Thr Leu Arg Thr Ile Thr Asp Asp Val Leu Glu Leu Phe Pro Ser Asn Glu Ser Tyr Met Val Leu His Arg Pro Gly Tyr Ser Cys Ala Val Gly Glu Lys Pro Val Ala Lys Ser Pro Lys Phe Ser Ser Lys Thr Arg Phe Asn Gly Leu Asn Ile Gin Asn Ile Thr Ala Val Ser Leu Thr Gly Leu Lys Ser Leu Arg Pro Leu Thr Gly Leu Ser Asp Ile His Leu Asn Ala Met Glu Val Lys Thr Tyr Lys Ile Arg Phe Lys Asp Glu Leu <210> 51 <211> 1068 <212> DNA
<213> hybrid <400> 51 atgggcatca agatggagac acattctcag gtctttgtat acatgttgct gtggttgtct 60 ggtgtcgaca tgcagtcctc cggggagctc cggaccggag gggcccggcc gccgcctcct 120 ctaggcgcct cctcccagcc gcgcccgggt ggcgactcca gcccagtcgt ggattctggc 180 cctggccccg ctagcaactt gacctcggtc ccagtgcccc acaccaccgc actgtcgctg 240 cccgcctgcc ctgaggagtc cccgctgctt gtgggcccca tgctgattga gtttaacatg 300 cctgtggacc tggagctcgt ggcaaagcag aacccaaatg tgaagatggg cggccgctat 360 gcccccaggg actgcgtctc tcctcacaag gtggccatca tcattccatt ccgcaaccgg 420 caggagcacc tcaagtactg gctatattat ttgcacccag tcctgcagcg ccagcagctg 480 gactatggca tctatgttat caaccaggcg ggagacacta tattcaatcg tgctaagctc 540 ctcaatgttg gctttcaaga agccttgaag gactatgact acacctgctt tgtgtttagt 600 gacgtggacc tcattccaat gaatgaccat aatgcgtaca ggtgtttttc acagccacgg 660 cacatttccg ttgcaatgga taagtttgga ttcagcctac cttatgttca gtattttgga 720 ggtgtctctg ctctaagtaa acaacagttt ctaaccatca atggatttcc taataattat 780 tggggctggg gaggagaaga tgatgacatt tttaacagat tagtttttag aggcatgtct 840 atatctcgcc caaatgctgt ggtcgggagg tgtcgcatga tccgccactc aagagacaag 900 aaaaatgaac ccaatcctca gaggtttgac cgaattgcac acacaaagga gacaatgctc 960 tctgatggtt tgaactcact cacctaccag gtgctggatg tacagagata cccattgtat 1020 acccaaatca cagtggacat cgggacaccg agcaaggacg agctttag 1068 <210> 52 <211> 355 <212> PRT
<213> hybrid <400> 52 Met Gly Ile Lys Met Glu Thr His Ser Gin Val Phe Val Tyr Met Leu Leu Trp Leu Ser Gly Val Asp Met Gin Ser Ser Gly Glu Leu Arg Thr Gly Gly Ala Arg Pro Pro Pro Pro Leu Gly Ala Ser Ser Gin Pro Arg Pro Gly Gly Asp Ser Ser Pro Val Val Asp Ser Gly Pro Gly Pro Ala Ser Asn Leu Thr Ser Val Pro Val Pro His Thr Thr Ala Leu Ser Leu Pro Ala Cys Pro Glu Glu Ser Pro Leu Leu Val Gly Pro Met Leu Ile Glu Phe Asn Met Pro Val Asp Leu Glu Leu Val Ala Lys Gin Asn Pro Asn Val Lys Met Gly Gly Arg Tyr Ala Pro Arg Asp Cys Val Ser Pro His Lys Val Ala Ile Ile Ile Pro Phe Arg Asn Arg Gin Glu His Leu Lys Tyr Trp Leu Tyr Tyr Leu His Pro Val Leu Gin Arg Gin Gin Leu Asp Tyr Gly Ile Tyr Val Ile Asn Gin Ala Gly Asp Thr Ile Phe Asn Arg Ala Lys Leu Leu Asn Val Gly Phe Gin Glu Ala Leu Lys Asp Tyr Asp Tyr Thr Cys Phe Val Phe Ser Asp Val Asp Leu Ile Pro Met Asn Asp His Asn Ala Tyr Arg Cys Phe Ser Gin Pro Arg His Ile Ser Val Ala Met Asp Lys Phe Gly Phe Ser Leu Pro Tyr Val Gin Tyr Phe Gly Gly Val Ser Ala Leu Ser Lys Gin Gin Phe Leu Thr Ile Asn Gly Phe Pro Asn Asn Tyr Trp Gly Trp Gly Gly Glu Asp Asp Asp Ile Phe Asn Arg Leu Val Phe Arg Gly Met Ser Ile Ser Arg Pro Asn Ala Val Val Gly Arg Cys Arg Met Ile Arg His Ser Arg Asp Lys Lys Asn Glu Pro Asn Pro Gin Arg Phe Asp Arg Ile Ala His Thr Lys Glu Thr Met Leu Ser Asp Gly Leu Asn Ser Leu Thr Tyr Gin Val Leu Asp Val Gin Arg Tyr Pro Leu Tyr Thr Gin Ile Thr Val Asp Ile Gly Thr Pro Ser Lys Asp Glu Leu <210> 53 <211> 1119 <212> DNA
<213> hybrid <400> 53 atgggcatca agatggagac acattctcag gtctttgtat acatgttgct gtggttgtct 60 ggtgtcgaca tgggacagat gcctgtggct gctgtagtgg ttatggcctg cagtcgtgca 120 gactatcttg aaaggactgt taaatcagtt ttaacatatc aaactcccgt tgcttcaaaa 180 tatcctctat ttatatctca ggatggatct gatcaagctg tcaagagcaa gtcattgagc 240 tataatcaat taacatatat gcagcacttg gattttgaac cagtggtcac tgaaaggcct 300 ggcgaactga ctgcgtacta caagattgca cgtcactaca agtgggcact ggaccagttg 360 ttttacaaac acaaatttag tcgagtgatt atactagaag atgatatgga aattgctcca 420 gacttctttg attactttga ggctgcagct agtctcatgg atagggataa aaccattatg 480 gctgcttcat catggaatga taatggacag aagcagtttg tgcatgatcc ctatgcgcta 540 taccgatcag atttttttcc tggccttggg tggatgctca agagatcgac ttgggatgag 600 ttatcaccaa agtggccaaa ggcttactgg gatgattggc tgagactaaa ggaaaaccat 660 aaaggccgcc aattcattcg accggaagtc tgtagaacat acaattttgg tgaacatggg 720 tctagtttgg gacagttttt cagtcagtat ctggaaccta taaagctaaa cgatgtgacg 780 gttgactgga aagcaaagga cctgggatac ctgacagagg gaaactatac caagtacttt 840 tctggcttag tgagacaagc acgaccaatt caaggttctg accttgtctt aaaggctcaa 900 aacataaagg atgatgttcg tatccggtat aaagaccaag tagagtttga acgcattgca 960 ggggaatttg gtatatttga agaatggaag gatggtgtgc ctcgaacagc atataaagga 1020 gtagtggtgt ttcgaatcca gacaacaaga cgtgtattcc tggttgggcc agattctgta 1080 atgcagcttg gaattcgaaa ttccaaggac gagctttga 1119 <210> 54 <211> 372 <212> PRT
<213> hybrid <400> 54 Met Gly Ile Lys Met Glu Thr His Ser Gin Val Phe Val Tyr Met Leu Leu Trp Leu Ser Gly Val Asp Met Gly Gin Met Pro Val Ala Ala Val Val Val Met Ala Cys Ser Arg Ala Asp Tyr Leu Glu Arg Thr Val Lys Ser Val Leu Thr Tyr Gin Thr Pro Val Ala Ser Lys Tyr Pro Leu Phe Ile Ser Gin Asp Gly Ser Asp Gin Ala Val Lys Ser Lys Ser Leu Ser Tyr Asn Gin Leu Thr Tyr Met Gin His Leu Asp Phe Glu Pro Val Val Thr Glu Arg Pro Gly Glu Leu Thr Ala Tyr Tyr Lys Ile Ala Arg His Tyr Lys Trp Ala Leu Asp Gin Leu Phe Tyr Lys His Lys Phe Ser Arg Val Ile Ile Leu Glu Asp Asp Met Glu Ile Ala Pro Asp Phe Phe Asp Tyr Phe Glu Ala Ala Ala Ser Leu Met Asp Arg Asp Lys Thr Ile Met Ala Ala Ser Ser Trp Asn Asp Asn Gly Gin Lys Gin Phe Val His Asp Pro Tyr Ala Leu Tyr Arg Ser Asp Phe Phe Pro Gly Leu Gly Trp Met Leu Lys Arg Ser Thr Trp Asp Glu Leu Ser Pro Lys Trp Pro Lys Ala Tyr Trp Asp Asp Trp Leu Arg Leu Lys Glu Asn His Lys Gly Arg Gin Phe Ile Arg Pro Glu Val Cys Arg Thr Tyr Asn Phe Gly Glu His Gly Ser Ser Leu Gly Gin Phe Phe Ser Gin Tyr Leu Glu Pro Ile Lys Leu Asn Asp Val Thr Val Asp Trp Lys Ala Lys Asp Leu Gly Tyr Leu Thr Glu Gly Asn Tyr Thr Lys Tyr Phe Ser Gly Leu Val Arg Gin Ala Arg Pro Ile Gin Gly Ser Asp Leu Val Leu Lys Ala Gin Asn Ile Lys Asp Asp Val Arg Ile Arg Tyr Lys Asp Gin Val Glu Phe Glu Arg Ile Ala Gly Glu Phe Gly Ile Phe Glu Glu Trp Lys Asp Gly Val Pro Arg Thr Ala Tyr Lys Gly Val Val Val Phe Arg Ile Gin Thr Thr Arg Arg Val Phe Leu Val Gly Pro Asp Ser Val Met Gin Leu Gly Ile Arg Asn Ser Lys Asp Glu Leu <210> 55 <211> 1158 <212> DNA
<213> hybrid <400> 55 atgggcatca agatggagac acattctcag gtctttgtat acatgttgct gtggttgtct 60 ggtgtcgaca tggctctaag gttgcataga aggaaccatt tttcgcctag aaatacggat 120 ctgttcccgg atttggcaaa agatcgtgtg gttatcgtct tgtatgtgca taatcgggct 180 cagtattttc gagtcacagt ggaaagtttg tcgaaggtta aaggtataag tgagacattg 240 ttgattgtta gtcatgatgg ttactttgaa gagatgaata ggattgtgga gagtattaag 300 ttttgtcaag tgaaacagat tttctcgcct tattcgcctc atatatatcg tactagcttc 360 ccgggtgtga ccctgaatga ttgtaagaac aagggtgatg aggcaaaggg gcattgtgaa 420 ggtaatcctg atcagtatgg gaatcatcgg tctccgaaga ttgtatcttt gaagcatcac 480 tggtggtgga tgatgaacac tgtatgggat gggttggaag agactaaagg acatgagggg 540 catatccttt tcattgaaga agatcatttt ctgtttccta atgcctatcg taacatacag 600 actcttacga ggctgaaacc cgcaaagtgt cctgactgtt ttgctgctaa tttagcaccg 660 tctgatgtga agtcaagagg agaagggctt gaaagtttgg ttgcagagag aatgggaaat 720 gttgggtatt cttttaatag aagtgtgtgg gagaatattc atcagaaggc aagagagttt 780 tgtttctttg atgattacaa ctgggatata acgatgtggg caacggtttt cccgtcgttt 840 ggttccccgg tgtacacatt gcgagggcct aggactagtg cggtacactt tggaaaatgt 900 gggttgcatc aaggtagagg agatgagggt gattgcatcg ataatggggt cgtaaacata 960 gaagttaagg aaacagataa agttgtgaac ataaaagaag gatggggagt tcgggtgtat 1020 aagcatcaag cgggttataa agccggtttc gaaggttggg gaggttgggg cgatgatagg 1080 gaccgacatt tatgtttgga ttttgccact atgtatcgtt acagcagtag cagtgcatct 1140 ccaaaggacg agctttga 1158 <210> 56 <211> 385 <212> PRT
<213> hybrid <400> 56 Met Gly Ile Lys Met Glu Thr His Ser Gin Val Phe Val Tyr Met Leu Leu Trp Leu Ser Gly Val Asp Met Ala Leu Arg Leu His Arg Arg Asn His Phe Ser Pro Arg Asn Thr Asp Leu Phe Pro Asp Leu Ala Lys Asp Arg Val Val Ile Val Leu Tyr Val His Asn Arg Ala Gin Tyr Phe Arg Val Thr Val Glu Ser Leu Ser Lys Val Lys Gly Ile Ser Glu Thr Leu Leu Ile Val Ser His Asp Gly Tyr Phe Glu Glu Met Asn Arg Ile Val Glu Ser Ile Lys Phe Cys Gin Val Lys Gin Ile Phe Ser Pro Tyr Ser Pro His Ile Tyr Arg Thr Ser Phe Pro Gly Val Thr Leu Asn Asp Cys Lys Asn Lys Gly Asp Glu Ala Lys Gly His Cys Glu Gly Asn Pro Asp Gin Tyr Gly Asn His Arg Ser Pro Lys Ile Val Ser Leu Lys His His Trp Trp Trp Met Met Asn Thr Val Trp Asp Gly Leu Glu Glu Thr Lys Gly His Glu Gly His Ile Leu Phe Ile Glu Glu Asp His Phe Leu Phe Pro Asn Ala Tyr Arg Asn Ile Gin Thr Leu Thr Arg Leu Lys Pro Ala Lys Cys Pro Asp Cys Phe Ala Ala Asn Leu Ala Pro Ser Asp Val Lys Ser Arg Gly Glu Gly Leu Glu Ser Leu Val Ala Glu Arg Met Gly Asn Val Gly Tyr Ser Phe Asn Arg Ser Val Trp Glu Asn Ile His Gin Lys Ala Arg Glu Phe Cys Phe Phe Asp Asp Tyr Asn Trp Asp Ile Thr Met Trp Ala Thr Val Phe Pro Ser Phe Gly Ser Pro Val Tyr Thr Leu Arg Gly Pro Arg Thr Ser Ala Val His Phe Gly Lys Cys Gly Leu His Gln Gly Arg Gly Asp Glu Gly Asp Cys Ile Asp Asn Gly Val Val Asn Ile Glu Val Lys Glu Thr Asp Lys Val Val Asn Ile Lys Glu Gly Trp Gly Val Arg Val Tyr Lys His Gin Ala Gly Tyr Lys Ala Gly Phe Glu Gly Trp Gly Gly Trp Gly Asp Asp Arg Asp Arg His Leu Cys Leu Asp Phe Ala Thr Met Tyr Arg Tyr Ser Ser Ser Ser Ala Ser Pro Lys Asp Glu Leu <210> 57 <211> 1152 <212> DNA
<213> Homo sapiens <400> 57 atgctgaaga agcagtctgc agggcttgtg ctgtggggcg ctatcctctt tgtggcctgg 60 aatgccctgc tgctcctctt cttctggacg cgcccagcac ctggcaggcc accctcagtc 120 agcgctctcg atggcgaccc cgccagcctc acccgggaag tcgacatgca gtcctccggg 180 gagctccgga ccggaggggc ccggccgccg cctcctctag gcgcctcctc ccagccgcgc 240 ccgggtggcg actccagccc agtcgtggat tctggccctg gccccgctag caacttgacc 300 tcggtcccag tgccccacac caccgcactg tcgctgcccg cctgccctga ggagtccccg 360 ctgcttgtgg gccccatgct gattgagttt aacatgcctg tggacctgga gctcgtggca 420 aagcagaacc caaatgtgaa gatgggcggc cgctatgccc ccagggactg cgtctctcct 480 cacaaggtgg ccatcatcat tccattccgc aaccggcagg agcacctcaa gtactggcta 540 tattatttgc acccagtcct gcagcgccag cagctggact atggcatcta tgttatcaac 600 caggcgggag acactatatt caatcgtgct aagctcctca atgttggctt tcaagaagcc 660 ttgaaggact atgactacac ctgctttgtg tttagtgacg tggacctcat tccaatgaat 720 gaccataatg cgtacaggtg tttttcacag ccacggcaca tttccgttgc aatggataag 780 tttggattca gcctacctta tgttcagtat tttggaggtg tctctgctct aagtaaacaa 840 cagtttctaa ccatcaatgg atttcctaat aattattggg gctggggagg agaagatgat 900 gacattttta acagattagt ttttagaggc atgtctatat ctcgcccaaa tgctgtggtc 960 gggaggtgtc gcatgatccg ccactcaaga gacaagaaaa atgaacccaa tcctcagagg 1020 tttgaccgaa ttgcacacac aaaggagaca atgctctctg atggtttgaa ctcactcacc 1080 taccaggtgc tggatgtaca gagataccca ttgtataccc aaatcacagt ggacatcggg 1140 acaccgagct ag 1152 <210> 58 <211> 383 <212> PRT
<212> Homo sapiens <400> 58 Met Leu Lys Lys Gin Ser Ala Gly Leu Val Leu Trp Gly Ala Ile Leu Phe Val Ala Trp Asn Ala Leu Leu Leu Leu Phe Phe Trp Thr Arg Pro Ala Pro Gly Arg Pro Pro Ser Val Ser Ala Leu Asp Gly Asp Pro Ala Ser Leu Thr Arg Glu Val Asp Met Gln Ser Ser Gly Glu Leu Arg Thr Gly Gly Ala Arg Pro Pro Pro Pro Leu Gly Ala Ser Ser Gin Pro Arg Pro Gly Gly Asp Ser Ser Pro Val Val Asp Ser Gly Pro Gly Pro Ala Ser Asn Leu Thr Ser Val Pro Val Pro His Thr Thr Ala Leu Ser Leu Pro Ala Cys Pro Glu Glu Ser Pro Leu Leu Val Gly Pro Met Leu Ile Glu Phe Asn Met Pro Val Asp Leu Glu Leu Val Ala Lys Gin Asn Pro Asn Val Lys Met Gly Gly Arg Tyr Ala Pro Arg Asp Cys Val Ser Pro His Lys Val Ala Ile Ile Ile Pro Phe Arg Asn Arg Gin Glu His Leu Lys Tyr Trp Leu Tyr Tyr Leu His Pro Val Leu Gin Arg Gin Gin Leu Asp Tyr Gly Ile Tyr Val Ile Asn Gin Ala Gly Asp Thr Ile Phe Asn Arg Ala Lys Leu Leu Asn Val Gly Phe Gin Glu Ala Leu Lys Asp Tyr Asp Tyr Thr Cys Phe Val Phe Ser Asp Val Asp Leu Ile Pro Met Asn Asp His Asn Ala Tyr Arg Cys Phe Ser Gin Pro Arg His Ile Ser Val Ala Met Asp Lys Phe Gly Phe Ser Leu Pro Tyr Val Gin Tyr Phe Gly Gly Val Ser Ala Leu Ser Lys Gin Gin Phe Leu Thr Ile Asn Gly Phe Pro Asn Asn Tyr Trp Gly Trp Gly Gly Glu Asp Asp Asp Ile Phe Asn Arg Leu Val Phe Arg Gly Met Ser Ile Ser Arg Pro Asn Ala Val Val Gly Arg Cys Arg Met Ile Arg His Ser Arg Asp Lys Lys Asn Glu Pro Asn Pro Gin Arg Phe Asp Arg Ile Ala His Thr Lys Glu Thr Met Leu Ser Asp Gly Lou Asn Ser Leu Thr Tyr Gin Val Lou Asp Val Gin Arg Tyr Pro Leu Tyr Thr Gin Ile Thr Val Asp Ile Gly Thr Pro Ser <210> 59 <211> 400 <212> PRT
<213> Homo sapiens <400> 59 Net Arg Leu Arg Glu Pro Lou Leu Ser Gly Ala Ala Net Pro Gly Ala Ser Lou Gin Arg Ala Cys Arg Leu Lou Val Ala Val Cys Ala Leu His Leu Gly Val Thr Leu Val Tyr Tyr Lou Ala Gly Arg Asp Leu Ser Arg Leu Pro Gin Leu Val Gly Val Ser Thr Pro Leu Gin Gly Gly Ser Asn Ser Ala Ala Ala Ile Gly Gin Ser Ser Gly Glu Leu Arg Thr Gly Gly Ala Arg Pro Pro Pro Pro Leu Gly Ala Ser Ser Gin Pro Arg Pro Gly Gly Asp Ser Ser Pro Val Val Asp Ser Gly Pro Gly Pro Ala Ser Asn Leu Thr Ser Val Pro Val Pro His Thr Thr Ala Leu Ser Leu Pro Ala Cys Pro Glu Glu Ser Pro Leu Leu Val Gly Pro Met Leu Ile Glu Phe Asn Net Pro Val Asp Leu Glu Leu Val Ala Lys Gin Asn Pro Asn Val Lys Met Gly Gly Arg Tyr Ala Pro Arg Asp Cys Val Ser Pro His Lys Val Ala Ile Ile Ile Pro Phe Arg Asn Arg Gin Glu His Lou Lys Tyr Trp Lou Tyr Tyr Leu His Pro Val Leu Gin Arg Gin Gin Lou Asp Tyr Gly Ile Tyr Gly Ile Tyr Val Ile Asn Gin Ala Gly Asp Thr Ile Phe Asn Arg Ala Lys Leu Leu Asn Val Gly Phe Gin Glu Ala Leu Lys Asp Tyr Asp Tyr Thr Cys Phe Val Phe Ser Asp Val Asp Leu Ile Pro Met Asn Asp His Asn Ala Tyr Arg Cys Phe Ser Gin Pro Arg His Ile Ser Val Ala Met Asp Lys Phe Gly Phe Ser Lou Pro Tyr Val Gin Tyr Phe Gly Gly Val Ser Ala Leu Ser Lys Gin Gin Phe Leu Thr Ile Asn Gly Phe Pro Asn Asn Tyr Trp Gly Trp Gly Gly Glu Asp Asp Asp Ile Phe Asn Arg Leu Val Phe Arg Gly Met Ser Ile Ser Arg Pro Asn Ala Val Val Gly Arg Cys Arg Met Ile Arg His SeneArg Asp Lys Lys Asn Glu Pro Asn Pro Gin Arg Phe Asp Arg Ile Ala His Thr Lys Glu Thr Met Leu Ser Asp Gly Leu Asn Ser Leu Thr Tyr Gin Val Leu Asp Val Gin Arg Tyr Pro Leu Tyr Thr Gin Ile Thr Val Asp Ile Gly Thr Pro Ser

Claims (15)

WHAT IS CLAIMED IS:
1. A nucleic acid encoding a hybrid enzyme that comprises a transmembrane domain of a plant N-acetylglucosaminyltransferase I (GnTI) and a catalytic domain of a mammalian mannosidase II (ManII) or N-acetylglucosaminyltransferase II
(GnTII).
2. The nucleic acid of claim 1, wherein the transmembrane domain of the plant GnTI comprises the amino acid sequence encoded by a nucleic acid comprising the nucleotide sequence defined by SEQ ID NO:32.
3. The nucleic acid of claim 1, wherein the hybrid enzyme comprises the amino acid sequence encoded by a nucleic acid comprising the nucleotide sequence defined by SEQ ID
NO:29, 30, or 31.
4. The nucleic acid of claim 1, wherein the nucleic acid comprises the nucleotide sequence of SEQ ID NO:29, 30, or 31.
5. An expression vector comprising a nucleic acid of any one of claims 1-4.
6. A host cell comprising the expression vector of claim 5.
7. The host cell of claim 6, further comprising a nucleic acid coding for a heterologous glycoprotein.
8. The host cell of claim 7, wherein the heterologous glycoprotein is an antibody or a fragment thereof.
9. The host cell of any one of claims 6-8, wherein the host cell is a plant cell.
10. A method for producing a transgenic plant cell, comprising providing (i) a plant cell, and (ii) the expression vector of claim 5, and transforming the plant cell with the expression vector under conditions such that the hybrid enzyme encoded by the expression vector is expressed.
11. The method of claim 10, further comprising transforming the plant cell with another expression vector comprising a nucleic acid sequence coding for a heterologous glycoprotein under conditions such that the heterologous glycoprotein is expressed.
12. A method for producing a heterologous glycoprotein, comprising providing a transgenic plant cell that comprises the expression vector of claim 5 and another expression vector comprising a nucleotide sequence encoding the heterologous glycoprotein, and culturing the plant cell under conditions allowing expression of the hybrid enzyme encoded by the expression vector of claim 5 and the heterologous glycoprotein.
13. A method for producing a heterologous glycoprotein, comprising:
providing (i) a plant cell, and (ii) an expression vector that comprises a first nucleotide sequence coding for a hybrid enzyme and a second nucleotide sequence coding for a heterologous glycoprotein, and transforming the plant cell with the expression vector under conditions such that both the hybrid enzyme and the heterologous glycoprotein are expressed, wherein the hybrid enzyme comprises a transmembrane domain of a plant N-acetylglucosaminyltransferase I (GnTI) and a catalytic domain of a mammalian mannosidase II
(Mann) or N-acetylglucosaminyltransferase II (GnTII).
14. The method of claim 12 or claim 13, wherein the heterologous glycoprotein is an antibody or a fragment thereof.
15. The method of any one of claims 12-14, further comprising isolating the heterologous glycoprotein.
CA2478297A 2002-03-19 2003-03-18 Optimizing glycan processing in plants Expired - Fee Related CA2478297C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CA2799805A CA2799805C (en) 2002-03-19 2003-03-18 Optimizing glycan processing in plants

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US36573502P 2002-03-19 2002-03-19
US60/365,735 2002-03-19
PCT/IB2003/001626 WO2003078637A2 (en) 2002-03-19 2003-03-18 Optimizing glycan processing in plants

Related Child Applications (1)

Application Number Title Priority Date Filing Date
CA2799805A Division CA2799805C (en) 2002-03-19 2003-03-18 Optimizing glycan processing in plants

Publications (2)

Publication Number Publication Date
CA2478297A1 CA2478297A1 (en) 2003-09-25
CA2478297C true CA2478297C (en) 2013-05-14

Family

ID=28042042

Family Applications (3)

Application Number Title Priority Date Filing Date
CA2799805A Expired - Fee Related CA2799805C (en) 2002-03-19 2003-03-18 Optimizing glycan processing in plants
CA2923247A Abandoned CA2923247A1 (en) 2002-03-19 2003-03-18 Optimizing glycan processing in plants
CA2478297A Expired - Fee Related CA2478297C (en) 2002-03-19 2003-03-18 Optimizing glycan processing in plants

Family Applications Before (2)

Application Number Title Priority Date Filing Date
CA2799805A Expired - Fee Related CA2799805C (en) 2002-03-19 2003-03-18 Optimizing glycan processing in plants
CA2923247A Abandoned CA2923247A1 (en) 2002-03-19 2003-03-18 Optimizing glycan processing in plants

Country Status (11)

Country Link
US (3) US7601891B2 (en)
EP (3) EP1485486B1 (en)
JP (4) JP4532118B2 (en)
KR (3) KR101196023B1 (en)
CN (2) CN100532558C (en)
AR (1) AR039013A1 (en)
AU (1) AU2003219418B2 (en)
CA (3) CA2799805C (en)
HK (2) HK1124364A1 (en)
NZ (1) NZ534880A (en)
WO (1) WO2003078637A2 (en)

Families Citing this family (44)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
PT1137789E (en) * 1998-12-09 2010-10-21 Phyton Holdings Llc A method for manufacturing glycoproteins having human-type glycosylation
TR200201121T2 (en) * 1999-10-26 2003-01-21 Plant Research International B.V. Mammalian-type glycosylation in plants
US7625756B2 (en) * 2000-06-28 2009-12-01 GycoFi, Inc. Expression of class 2 mannosidase and class III mannosidase in lower eukaryotic cells
WO2002057468A2 (en) 2001-01-19 2002-07-25 The Dow Chemical Company Method for secretory production of glycoprotein having human-type sugar chain using plant cell
EP2078455A1 (en) * 2001-03-06 2009-07-15 The Dow Chemical Company Plant cell having animal-type sugar chain adding function
AU2003219418B2 (en) 2002-03-19 2008-02-28 Stichting Dienst Landbouwkundig Onderzoek Optimizing glycan processing in plants
WO2003078614A2 (en) 2002-03-19 2003-09-25 Plant Research International B.V. Gntiii (udp-n-acethylglucosamine:beta-d mannoside beta (1,4)-n-acethylglucosaminy ltransferase iii) expression in plants
CL2003002461A1 (en) * 2002-11-27 2005-01-07 Dow Chemical Company Agroscien IMMUNOGLOBULIN THAT UNDERSTANDS AT LEAST ONE AFUCOSILATED GLICAN, COMPOSITION THAT CONTAINS IT, NUCLEOTIDIC SEQUENCE AND VECTOR THAT UNDERSTANDS IT, PROCEDURE TO PRODUCE IMMUNOGLOBULIN SAID IN PLANTS.
AU2003294912B2 (en) * 2002-12-20 2009-06-04 Greenovation Biotech Gmbh Production of heterologous glycosylated proteins in bryophyte cells
WO2007090290A1 (en) * 2006-02-09 2007-08-16 Medicago Inc. Synthesis of sialic acid in plants
EP1867722A1 (en) * 2006-06-16 2007-12-19 Plant Research International B.V. Synthesis of lewis X epitopes on proteins in plants
AU2007318926B2 (en) * 2006-11-08 2013-05-02 Centre National De La Recherce Scientifique-Cnrs A set of sequences for targeting expression and control of the post-translationnal modifications of a recombinant polypeptide
CN101641115B (en) 2007-03-08 2013-04-24 卡罗拜奥斯制药公司 Epha3 antibodies for the treatment of solid tumors
KR101460953B1 (en) * 2007-04-17 2014-11-13 스티칭 디엔스트 랜드보위쿤디그 온데조에크 Mammalian-type glycosylation in plants by expression of non-mammalian glycosyltransferases
KR101385050B1 (en) * 2007-06-15 2014-05-13 메디카고 인코포레이티드 Modifying glycoprotein production in plants
US9546375B2 (en) * 2008-07-18 2017-01-17 Medicago Inc. Influenza virus immunizing epitope
EP2403879A1 (en) 2009-03-06 2012-01-11 Kalobios Pharmaceuticals, Inc. Treatment of leukemias and chronic myeloproliferative diseases with antibodies to epha3
CA2765287C (en) * 2009-06-15 2018-12-11 Bayer Bioscience N.V. Nicotiana benthamiana plants deficient in xylosyltransferase activity
WO2011035884A1 (en) * 2009-09-22 2011-03-31 Volker Sandig Process for producing molecules containing specialized glycan structures
PL2488554T3 (en) 2009-10-14 2020-03-31 Humanigen, Inc. Antibodies to epha3
EP2551348B1 (en) 2011-07-29 2014-09-24 Icon Genetics GmbH Production of galactosylated N-glycans in plants
WO2013180809A1 (en) * 2012-05-29 2013-12-05 North Carolina Central University Methods for the production of cytoprotective asialo-erythropoietin in plants and its purification from plant tissues
WO2014084640A1 (en) 2012-11-28 2014-06-05 Samsung Electronics Co., Ltd. Method and apparatus for performing communication in a wireless communication system
EP3221455B1 (en) * 2014-11-20 2020-08-12 Yissum Research and Development Company of the Hebrew University of Jerusalem Ltd. Compositions and methods for producing polypeptides with a modified glycosylation pattern in plant cells
CA2977271A1 (en) 2015-03-02 2016-09-09 Synthetic Genomics, Inc. Regulatory elements from labyrinthulomycetes microorganisms
LT3271453T (en) 2015-03-17 2021-11-10 eleva GmbH Glycosylated lysosomal proteins, method of production and uses
EA201792451A1 (en) 2015-05-07 2018-05-31 Агенус Инк. ANTIBODIES TO OX40 AND METHODS OF THEIR APPLICATION
EP3512883A1 (en) 2016-09-13 2019-07-24 Humanigen, Inc. Epha3 antibodies for the treatment of pulmonary fibrosis
US10633454B2 (en) 2016-11-01 2020-04-28 Conagen Inc. Expression of modified glycoproteins and glycopeptides
WO2018085273A1 (en) * 2016-11-01 2018-05-11 Synthetic Genomics, Inc. Expression of modified glycoproteins and glycopeptides
US11499160B2 (en) 2016-12-01 2022-11-15 Plantform Corporation Transgenic plant with reduced fucosyltransferase and xylosyltransferase activity
JP7106538B2 (en) 2016-12-07 2022-07-26 アジェナス インコーポレイテッド Antibodies and methods of their use
BR112019025352A8 (en) 2017-05-29 2023-05-02 Inst Nat Sante Rech Med CANCER-ASSOCIATED IMMUNOSUPPRESSION INHIBITORS
US11306144B2 (en) 2017-08-25 2022-04-19 Five Prime Therapeutics, Inc. B7-H4 antibodies and methods of use thereof
AU2019226009A1 (en) 2018-02-21 2020-09-03 Five Prime Therapeutics, Inc. B7-H4 antibody dosing regimens
CA3091174A1 (en) 2018-02-21 2019-08-29 Five Prime Therapeutics, Inc. B7-h4 antibody formulations
CA3091801A1 (en) 2018-03-02 2019-09-06 Five Prime Therapeutics, Inc. B7-h4 antibodies and methods of use thereof
WO2020081497A1 (en) 2018-10-15 2020-04-23 Five Prime Therapeutics, Inc. Combination therapy for cancer
AU2021275361A1 (en) 2020-05-17 2023-01-19 Astrazeneca Uk Limited SARS-CoV-2 antibodies and methods of selecting and using the same
AU2021275115A1 (en) * 2020-05-19 2022-11-10 Momenta Pharmaceuticals, Inc. Hyper-sialylated immunoglobulin
JP2023537078A (en) 2020-08-10 2023-08-30 アストラゼネカ・ユーケイ・リミテッド SARS-CoV-2 Antibodies for Treating and Preventing COVID-19
US11821014B2 (en) 2020-12-15 2023-11-21 Frito-Lay North America, Inc. Production of gentisic acid 5-O-β-D xylopyranoside
TW202342095A (en) 2021-11-05 2023-11-01 英商阿斯特捷利康英國股份有限公司 Composition for treatment and prevention of covid-19
WO2023209177A1 (en) 2022-04-29 2023-11-02 Astrazeneca Uk Limited Sars-cov-2 antibodies and methods of using the same

Family Cites Families (81)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5455790A (en) 1977-10-05 1979-05-04 Tomoyuki Tajima Production of erythropoetin
JPS5616496A (en) 1979-07-20 1981-02-17 Sumitomo Chem Co Ltd Recovery of erythropoietin
JPS5653696A (en) 1979-10-09 1981-05-13 Ajinomoto Co Inc Isolation of erythropoietin by adsorption
JPS56108798A (en) 1980-02-01 1981-08-28 Green Cross Corp:The Purification of erythropoietin
JPS57149228A (en) 1981-03-11 1982-09-14 Ajinomoto Co Inc Novel erythropoietin and its preparation
JPS57169424A (en) 1981-04-10 1982-10-19 Toyo Shinyaku Kk Preparation of erythropoietin
US5352605A (en) 1983-01-17 1994-10-04 Monsanto Company Chimeric genes for transforming plant cells using viral promoters
US4956282A (en) 1985-07-29 1990-09-11 Calgene, Inc. Mammalian peptide expression in plant cells
US5179198A (en) 1988-07-11 1993-01-12 Hidechika Okada Glycoprotein and gene coding therefor
US5141131A (en) 1989-06-30 1992-08-25 Dowelanco Method and apparatus for the acceleration of a propellable matter
US6051753A (en) 1989-09-07 2000-04-18 Calgene, Inc. Figwort mosaic virus promoter and uses
US5032519A (en) 1989-10-24 1991-07-16 The Regents Of The Univ. Of California Method for producing secretable glycosyltransferases and other Golgi processing enzymes
US5541083A (en) 1989-10-24 1996-07-30 The Regents Of The University Of California Method for producing secretable glycosyltransferases and other golgi processing enzymes
US5959177A (en) 1989-10-27 1999-09-28 The Scripps Research Institute Transgenic plants expressing assembled secretory antibodies
US5202422A (en) 1989-10-27 1993-04-13 The Scripps Research Institute Compositions containing plant-produced glycopolypeptide multimers, multimeric proteins and method of their use
US5180674A (en) 1990-04-16 1993-01-19 The Trustees Of The University Of Pennsylvania Saccharide compositions, methods and apparatus for their synthesis
US5096818A (en) 1990-06-04 1992-03-17 Autogen Instruments, Inc. Nucleic acid separation method
WO1991019696A1 (en) 1990-06-21 1991-12-26 Nippon Shokubai Co., Ltd. Process for producing n-substituted aziridine compound
EP0550756B1 (en) 1991-04-18 2001-12-05 Toray Industries, Inc. Preparation of interleukin-6 compositions
US5874271A (en) 1992-08-21 1999-02-23 Takara Shuzo Co., Ltd. Human glycosyltransferase gene, compounds and method for inhibiting cancerous metastasis
WO1994012646A1 (en) * 1992-11-27 1994-06-09 Ciba-Geigy Ag Proteins having glycosyltransferase activity
CN1129953A (en) 1993-07-15 1996-08-28 尼奥斯药品公司 Synthesis of saccharic composition
WO1995018217A1 (en) 1993-12-24 1995-07-06 The Institute Of Physical And Chemical Research Novel sugar-chain synthetase and process for producing the same
CN1112943C (en) 1994-01-21 2003-07-02 粉剂注射疫苗股份有限公司 Gas driven gene delivery instrument
US5955647A (en) 1994-02-03 1999-09-21 The Scripps Research Institute Method for using tobacco mosaic virus to overproduce peptides and proteins
CA2146712C (en) 1995-04-10 2002-06-25 Jas Singh Cold induced promoter from winter brassica napus
US5723755A (en) 1995-05-16 1998-03-03 Francis E. Lefaivre Large scale production of human or animal proteins using plant bioreactors
US5939288A (en) 1995-06-07 1999-08-17 Iowa State University Research Foundation, Inc. Plant secretory signal peptides and nectarins
FR2736930B1 (en) * 1995-07-17 1997-09-19 Biocem PROCESS FOR THE PRODUCTION, BY PLANT CELLS, OF HEMINIC PROTEINS, PROTEINS THUS OBTAINED AND PRODUCTS CONTAINING THE SAME
US6020169A (en) 1995-07-20 2000-02-01 Washington State University Research Foundation Production of secreted foreign polypeptides in plant cell culture
WO1997027303A1 (en) 1996-01-24 1997-07-31 Toyo Boseki Kabushiki Kaisha Alpha-1-6 fucosyltransferases
WO1998005768A1 (en) * 1996-08-02 1998-02-12 The Austin Research Institute Improved nucleic acids encoding a chimeric glycosyltransferase
AU736993B2 (en) 1997-01-16 2001-08-09 Novo Nordisk A/S Practical in vitro sialylation of recombinant glycoproteins
US6087115A (en) 1997-01-22 2000-07-11 Cornell Research Foundation, Inc. Methods of identifying negative antagonists for G protein coupled receptors
JP3921271B2 (en) 1997-05-16 2007-05-30 生化学工業株式会社 DNA encoding glucuronyltransferase
JP4003093B2 (en) 1997-07-29 2007-11-07 東洋紡績株式会社 Method for producing saccharides
AU1585999A (en) 1997-11-12 1999-05-31 Neurotherapeutics Methods for the detection and treatment of disease using a glycosyltransferase
EP1034286B1 (en) 1997-11-18 2010-01-06 Pioneer Hi-Bred International, Inc. Mobilization of viral genomes from t-dna using site-specific recombination systems
DE19754622A1 (en) 1997-12-09 1999-06-10 Antje Von Dr Schaewen Plant GntI sequences and the use thereof for obtaining plants with reduced or missing N-acetylglucosaminyltransferase I (GnTI) activity
US6096546A (en) 1998-01-30 2000-08-01 Board Of Trustees, Rutgers, The State University Of New Jersey Methods for recovering polypeptides from plants and portions thereof
FR2774379B1 (en) 1998-01-30 2002-03-29 Groupe Limagrain Holding PROCESS FOR THE PRODUCTION OF ALPHA 1-ANTITRYPSIN AND ITS VARIANTS BY PLANT CELLS, AND PRODUCTS CONTAINING THE ALPHA-ANTITRYPSIN OBTAINED THEREBY
US5955282A (en) 1998-04-03 1999-09-21 Incyte Pharmaceuticals, Inc. Human galactosyltransferases
AU3657899A (en) 1998-04-20 1999-11-08 James E. Bailey Glycosylation engineering of antibodies for improving antibody-dependent cellular cytotoxicity
IL126967A (en) 1998-11-09 2002-09-12 Lightech Electronics Ind Ltd Electronic transformer for lighting
EP1131415A4 (en) 1998-11-18 2002-09-11 Neose Technologies Inc Low cost manufacture of oligosaccharides
PT1137789E (en) 1998-12-09 2010-10-21 Phyton Holdings Llc A method for manufacturing glycoproteins having human-type glycosylation
US6416988B1 (en) 1998-12-10 2002-07-09 Zymogenetics, Inc. Beta-1,3-galactosyltransferase homologs
AT408446B (en) 1999-02-18 2001-11-26 Altmann Friedrich Dr FUCOSYL TRANSFERASE GENE
JP2000245470A (en) 1999-02-26 2000-09-12 Noda Inst For Scient Res Production of extraneous polypeptide by plant intercellular liquid
WO2000052136A2 (en) 1999-03-02 2000-09-08 Human Genome Sciences, Inc. Human glycosylation enzymes
JP2000287692A (en) 1999-04-09 2000-10-17 Mitsui Chemicals Inc Control of content of sugar chain having different branching numbers in the composition of n-bound sugar chain contained in glycoprotein
US6376747B1 (en) 1999-08-27 2002-04-23 Her Majesty The Queen In Right Of Canada As Represented By The Minister Of Agriculture And Agri-Food Canada Plant-derived map kinase kinase
BR0014858A (en) 1999-10-15 2002-07-16 Calgene Llc Methods and vectors for recombination of specific site in plant cell plastids
WO2001029242A2 (en) * 1999-10-21 2001-04-26 Monsanto Company Post-translational modification of recombinant proteins produced in plants
TR200201121T2 (en) 1999-10-26 2003-01-21 Plant Research International B.V. Mammalian-type glycosylation in plants
ES2349427T3 (en) 1999-10-26 2011-01-03 Stichting Dienst Landbouwkundig Onderzoek GLAMMOSILATION OF MAMMAL TYPE IN PLANTS.
JP2001546898A (en) 1999-12-21 2003-06-10 Kyowa Hakko Kogyo Kk MODIFIED alpha -1,2-FUCOSYLTRANSFERASE GENE AND PROCESS FOR PRODUCING alpha -1,2-FUCOSYLTRANSFERASE AND FUCOSE-CONTAINING SUGAR CHAIN
EP1268751B1 (en) 1999-12-30 2010-03-17 Knut Kleesiek Xylosyltransferase and isoforms thereof
FR2803307A1 (en) 1999-12-30 2001-07-06 Centre Nat Rech Scient SPECIFIC TH1 TYPE CD4 T LYMPHOCYTES LINES AND METHOD FOR THEIR EX VIVO INDUCTION
AU2001235285A1 (en) 2000-02-24 2001-09-03 University Of Guelph Method of modulating glycosylation pathways
US20020174453A1 (en) 2001-04-18 2002-11-21 Henry Daniell Production of antibodies in transgenic plastids
AT409381B (en) 2000-03-03 2002-07-25 Gloessl Josef Dr Xylosyltransferase-GEN
FR2808286B1 (en) 2000-04-26 2004-12-17 Agronomique Inst Nat Rech NOVEL PLANT GLUCOSIDASE I AND ITS APPLICATION TO THE PRODUCTION OF RECOMBINANT PROTEINS WITH MODIFIED GLYCOSYLATION
AU2001250573A1 (en) 2000-04-28 2001-11-12 Michel Xilinas Treatment of pathological conditions influenced by the action of matrix metalloproteinases (mmps) using phanquinone
NZ522320A (en) 2000-05-12 2007-03-30 Neose Technologies Inc In vitro modification of glycosylation patterns of recombinant glycopeptides
WO2002000879A2 (en) * 2000-06-28 2002-01-03 Glycofi, Inc. Methods for producing modified glycoproteins
AU2002222965A1 (en) 2000-07-19 2002-01-30 Digimarc Corporation Print media with embedded messages for controlling printing
TW519716B (en) * 2000-12-19 2003-02-01 Tokyo Electron Ltd Wafer bias drive for a plasma source
WO2002057468A2 (en) 2001-01-19 2002-07-25 The Dow Chemical Company Method for secretory production of glycoprotein having human-type sugar chain using plant cell
EP2078455A1 (en) 2001-03-06 2009-07-15 The Dow Chemical Company Plant cell having animal-type sugar chain adding function
DE50214430D1 (en) 2001-06-22 2010-06-24 Siemens Ag VIDEO SMOKE DETECTION SYSTEM AND METHOD FOR VERIFYING IT
CN1473452A (en) 2001-07-09 2004-02-04 IBIDEN�ɷ����޹�˾ Ceramic heater and ceramic joined article
SE519261C2 (en) 2001-07-17 2003-02-04 Sca Hygiene Prod Ab Absorbent articles
WO2003007861A1 (en) 2001-07-20 2003-01-30 Cutmed Ltd. System for performing a micro-drilling procedure in human or animal and a method for use thereof
WO2003011878A2 (en) 2001-08-03 2003-02-13 Glycart Biotechnology Ag Antibody glycosylation variants having increased antibody-dependent cellular cytotoxicity
US20030175755A1 (en) 2002-03-05 2003-09-18 Yoshihiro Abiko Method of collecting data for estimation of susceptibility to periodontal disease
WO2003078614A2 (en) 2002-03-19 2003-09-25 Plant Research International B.V. Gntiii (udp-n-acethylglucosamine:beta-d mannoside beta (1,4)-n-acethylglucosaminy ltransferase iii) expression in plants
AU2003219418B2 (en) 2002-03-19 2008-02-28 Stichting Dienst Landbouwkundig Onderzoek Optimizing glycan processing in plants
DE10229635C1 (en) 2002-07-02 2003-07-10 Ise Gmbh Roll over guard for motor vehicle has spring extended had protector with latch to retain in normal retracted position
CL2003002461A1 (en) 2002-11-27 2005-01-07 Dow Chemical Company Agroscien IMMUNOGLOBULIN THAT UNDERSTANDS AT LEAST ONE AFUCOSILATED GLICAN, COMPOSITION THAT CONTAINS IT, NUCLEOTIDIC SEQUENCE AND VECTOR THAT UNDERSTANDS IT, PROCEDURE TO PRODUCE IMMUNOGLOBULIN SAID IN PLANTS.
KR101460953B1 (en) 2007-04-17 2014-11-13 스티칭 디엔스트 랜드보위쿤디그 온데조에크 Mammalian-type glycosylation in plants by expression of non-mammalian glycosyltransferases

Also Published As

Publication number Publication date
EP2339004A1 (en) 2011-06-29
KR101060361B1 (en) 2011-08-29
HK1124364A1 (en) 2009-07-10
KR20040091144A (en) 2004-10-27
CN100532558C (en) 2009-08-26
KR20110004490A (en) 2011-01-13
WO2003078637A2 (en) 2003-09-25
US20120011600A1 (en) 2012-01-12
KR20110079860A (en) 2011-07-08
EP1485486A2 (en) 2004-12-15
US8927810B2 (en) 2015-01-06
EP1959018A1 (en) 2008-08-20
KR101196023B1 (en) 2012-10-30
EP2339004B1 (en) 2015-02-25
WO2003078637A3 (en) 2004-03-11
JP4532118B2 (en) 2010-08-25
JP2014221077A (en) 2014-11-27
JP2012070761A (en) 2012-04-12
JP5105257B2 (en) 2012-12-26
US20100122365A1 (en) 2010-05-13
HK1153236A1 (en) 2012-03-23
CA2799805A1 (en) 2003-09-25
US8058508B2 (en) 2011-11-15
JP2005535291A (en) 2005-11-24
JP5694967B2 (en) 2015-04-01
CN1756844A (en) 2006-04-05
US20060253928A1 (en) 2006-11-09
AU2003219418B2 (en) 2008-02-28
AU2003219418A1 (en) 2003-09-29
US7601891B2 (en) 2009-10-13
JP2009232858A (en) 2009-10-15
NZ534880A (en) 2008-04-30
CA2799805C (en) 2016-05-17
KR101228276B1 (en) 2013-01-30
AR039013A1 (en) 2005-02-02
CA2478297A1 (en) 2003-09-25
CA2923247A1 (en) 2003-09-25
EP1485486B1 (en) 2012-11-21
EP1959018B1 (en) 2011-05-11
CN101613705A (en) 2009-12-30

Similar Documents

Publication Publication Date Title
CA2478297C (en) Optimizing glycan processing in plants
JP5480060B2 (en) GNTIII expression in plants
AU2012202479B2 (en) Optimizing glycan processing in plants
AU2013205260B2 (en) Optimizing glycan processing in plants
AU2008202355B2 (en) Optimizing glycan processing in plants
AU2016203395A1 (en) Optimizing glycan processing in plants

Legal Events

Date Code Title Description
EEER Examination request
MKLA Lapsed

Effective date: 20190318