US20100121868A1 - Converting a database query to a multi-dimensional expression query - Google Patents

Converting a database query to a multi-dimensional expression query Download PDF

Info

Publication number
US20100121868A1
US20100121868A1 US12/266,570 US26657008A US2010121868A1 US 20100121868 A1 US20100121868 A1 US 20100121868A1 US 26657008 A US26657008 A US 26657008A US 2010121868 A1 US2010121868 A1 US 2010121868A1
Authority
US
United States
Prior art keywords
query
filter condition
dimensional expression
normalized
operator
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US12/266,570
Inventor
Yann Le Biannic
Eric Gouthiere
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SAP France SA
Original Assignee
SAP France SA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SAP France SA filed Critical SAP France SA
Priority to US12/266,570 priority Critical patent/US20100121868A1/en
Assigned to BUSINESS OBJECTS S.A. reassignment BUSINESS OBJECTS S.A. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: LE BIANNIC, YANN
Assigned to BUSINESS OBJECTS S.A. reassignment BUSINESS OBJECTS S.A. CORRECTIVE ASSIGNMENT TO CORRECT THE LIST OF INVENTORS PREVIOUSLY RECORDED ON REEL 023425 FRAME 0396. ASSIGNOR(S) HEREBY CONFIRMS THE ...WE, YANN LE BIANNIC AND ERIC GOUTHIERE, HEREBY ASSIGN TO .... Assignors: GOUTHIERE, ERIC, LE BIANNIC, YANN
Publication of US20100121868A1 publication Critical patent/US20100121868A1/en
Assigned to SAP France S.A. reassignment SAP France S.A. CHANGE OF NAME (SEE DOCUMENT FOR DETAILS). Assignors: SAP France S.A.
Assigned to SAP France S.A. reassignment SAP France S.A. CHANGE OF NAME (SEE DOCUMENT FOR DETAILS). Assignors: BUSINESS OBJECTS S.A.
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/242Query formulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying
    • G06F16/9032Query formulation
    • G06F16/90324Query formulation using system suggestions

Definitions

  • the invention generally relates to the field of multi-dimensional data sources and systems. More particularly the invention relates to converting a database query involving a propositional formula to a multi-dimensional expression query by normalizing and translating the database query.
  • Queries are used create, modify, retrieve and manipulate data in a data source, such as, a database, a data warehouse, a plurality of reports, and the like.
  • Filtering is the application of filters.
  • a filter is a condition used to limit information retrieved from a data source to a subset of the whole result of an unfiltered query. Filters are usually expressed in form a propositional formula that states the condition.
  • OLAP tools are a subset of business intelligence tools. There are a number of commercially available OLAP tools including BusinessObjects OLAP VoyagerTM which is available from Business Objects of San Jose, Calif. OLAP tools are a report generation tool, and a tool suited to ad hoc analyses. OLAP generally refers to a technique of providing fast analysis of shared multi-dimensional information stored in a database. In some OLAP tools the data is arranged in a schema which simulates a multidimensional schema. Conceptually the information is in a hyper cube. The multi-dimensional schema means redundant information is stored, but it allows for users to initiate queries without the need to know how the data is organized.
  • MDX Multidimensional Expressions
  • SQL is a query language for relational databases.
  • an MDX statement can be used to query for a result from an OLAP data source, i.e., a cube.
  • the MDX statement can resemble SQL statements where one can ask for data on a row and columns from a cube.
  • each MDX query requires a data request (the “SELECT” clause), a starting point (the “FROM” clause), and a filter (the “WHERE” clause).
  • SQL queries are used to generate a two dimensional result—zero or more rows. This result is a relation.
  • the query process includes selecting columns; selecting from relations; and applying filters using predicates.
  • MDX queries are used to generate a multidimensional result—the result is a (hyper-) cube.
  • the process includes selecting sets of members; selecting from a cube; and applying filters.
  • the filters are set operators for members & predicates for values. Not all SQL queries can be converted MDX queries. Most filter conditions cannot be directly translated as predicates.
  • the filter condition is converted to a normalized filter condition by analyzing a semantic context of one or more member sets in the filter condition, converting the one or more member sets into a normal form and replacing logical operators between the one or more member sets with normalized operators.
  • the normalized filter condition along with the result object and the data source is then translated to a multi-dimensional expression query in a second format.
  • FIG. 1 is a flow diagram for converting a query in a first format to a multi-dimensional expression (MDX) query in a second format according to an embodiment of the invention.
  • MDX multi-dimensional expression
  • FIG. 2 is an exemplary screen display of a user interface to create a query in a first format according to an embodiment of the invention.
  • FIG. 3 is a flow diagram for normalizing a filter condition of the query in the first format according to an embodiment of the invention.
  • FIG. 4 is an illustration of semantic analysis of a filter condition in a query of FIG. 2 according to an embodiment of the invention.
  • FIG. 5 is a flow diagram for analyzing a filter condition to determine a semantic context of member sets in the filter condition according to an embodiment of the invention.
  • FIG. 6 is a flow diagram for converting the filter condition to normal form according to an embodiment of the invention.
  • FIG. 7A is a flow diagram for replacing logical operators with normalized operators in the filter condition according to an embodiment of the invention.
  • FIG. 7B continues the example in FIG. 4 and is an illustration of replacing logical operators with normalized operators in the filter condition according to an embodiment of the invention.
  • FIG. 8 is an example of converting a logical expression to conjunctive normal form according to an embodiment of the invention.
  • FIG. 9 is an example of the conversion of a part of a filter condition to a disjunctive normal form according to an embodiment of the invention.
  • FIG. 10 shows the optimizing of an example filter condition by replacing an IAND operator with CAND operator according to an embodiment of the invention.
  • FIG. 11 is a normalization graph depicting structure of a normalized filter condition according to an embodiment of the invention.
  • FIG. 12 continues the example in FIGS. 2 and 7B by showing a normalized query of FIG.2 according to an embodiment of the invention.
  • FIG. 13 furthers the example in FIG. 12 by showing a translation of a result object in a normalized query to a first part of an MDX query according to an embodiment of the invention.
  • FIG. 14 extends the example in FIGS. 7B and 13 by illustrating the translation a normalized filter condition of the normalized query to a second part of the MDX query according to an embodiment of the invention.
  • FIG. 15 continues the example in FIG. 14 by depicting the MDX query of a query of FIG. 2 according to an embodiment of the invention.
  • FIG. 16 is a flow diagram for converting a normalized query to a multi-dimensional expression query according to an embodiment of the invention.
  • FIG. 17 depicts a table of a source query filter condition, its normalized filter condition and MDX filter condition according to an embodiment of the invention.
  • FIG. 18 is a block diagram for converting a query in a first format to the MDX query in the second format according to an embodiment of the invention.
  • a dimension may have a number of hierarchical levels in it.
  • dimension geography may have at least three hierarchical levels namely country, state, and district.
  • a measure or a metric is a quantity as ascertained by comparison with a standard, usually denoted in some metric, for example, units sold and dollars.
  • FIG. 1 is a flow diagram for converting a query in a first format to a multi-dimensional expression (MDX) query in a second format according to an embodiment of the invention.
  • the MDX query is used to access data in a multi-dimensional database such as the OLAP cube.
  • a query in a first format is received.
  • An example of the first format includes a structured query language (SQL) , other declarative query languages, and a query created in a declarative way using a user interface such as a query panel in the BusinessObjects Web IntelligenceTM product provided by Business Objects Americas of San Jose in California.
  • the query is normalized into a canonical form that is suitable for a translation to the MDX query.
  • a canonical form (often called normal form or standard form) of an object is a standard way of presenting that object.
  • the normalization process converts a filter condition in the query to a structured pattern wherein each structured pattern has a defined MDX query translation.
  • the normalized query is translated to the MDX query in a second format.
  • An example of the second format includes query languages such as Analysis Services (AS) 2000 and AS 2005 provided by Microsoft of Redmond, and MaxL from Oracle Essbase. These query languages are based on a MDX specification defined by Microsoft.
  • FIG. 2 is an exemplary screen display of a user interface to create a query in a first format according to an embodiment of the invention.
  • a user may create a query in a declarative way using a user interface such as query panel 200 .
  • the query is created by a user selection of result set, data source and filter criteria parameters unlike creating a query by writing a source code.
  • the query in query panel 200 fetches a count of customers in “United States” and “California” those who have bought a product which is not in the category of “bikes”, and not in the sub category of “caps” and not in fiscal year “2002”.
  • the result set of the query “a count of customers in United States and California” is defined as customer geography 215 and customer count 220 in result objects 205 .
  • the filter criteria of the query “product not in the category of bikes, not in the sub category of caps and not in fiscal year 2002” are defined in filter condition 225 as first predicate 235 , second predicate 240 and third predicate 245 respectively.
  • An AND operator 230 defines the relation between predicates of filter condition 225 .
  • FIG. 3 is a flow diagram for normalizing a filter condition of the query in the first format according to an embodiment of the invention.
  • a query in a first format is received.
  • An example of the query in the first format includes a SQL query and a query created using query panel 200 .
  • a query specification is also received to identify a format of the query received. Based on the format received, different parts of the query such as result objects, a filter condition and a data source may be retrieved.
  • the query is parsed to determine a filter condition 225 of the query.
  • filter condition 225 is analyzed to determine a semantic context of member sets in filter condition 225 .
  • a member set is a predicate that applies on a single dimension.
  • first predicate 235 in the product category and the second predicate 240 in the product sub-category form a single member set since they share a same dimension, product.
  • the third predicate 245 which is on a time dimension forms another member set.
  • the filter condition also includes a predicate that involves a measure.
  • the measure represents a quantity such as revenue, number of units etc.
  • the member set represents a relationship or a property such as name of a product and category of a product.
  • DNF disjunctive normal form
  • the DNF is a format of logical expression.
  • the DNF is a standard way to write a logical expression that is characterized by one or more disjunctions joining two or more conjunctive clauses.
  • a disjunction is an “OR” operation.
  • the following expressions are in DNF: X Y; X; (X Y) Z and (X Y Z) ( A B C).
  • Logical expressions can be converted to DNF by using logical equivalences.
  • a conjunction is an “AND” operation.
  • An expression is in conjunctive normal form (CNF) if it is a conjunction of clauses.
  • a conjunctive clause is one or more variables, or negations of a variable—combined by zero, one or more “and” operations.
  • the following expressions are in CNF: X Y; (X Y) (X Z).
  • a normalized filter condition is created by replacing logical operators such as “AND” and “OR” between the member sets in the filter condition with normalized operators.
  • the normalized operators include crossjoin-and (CAND), intersect-and (IAND), union-or (UOR) and AGGREGATION FILTER.
  • the CAND operator computes an intersection of member sets of different dimensions.
  • the IAND operator computes an intersection of the member sets of multiple dimensions having at least one common dimension on either side of the logical AND operator.
  • the UOR operator computes a union of member sets of different dimensions.
  • the aggregation filter operator aggregates a measure over the one or more member sets.
  • the logical AND operator computes an intersection of member sets of same dimension and the logical OR operator computes a union of member sets of same dimension.
  • the normalized filter condition has the member sets in a normal form with normalized operators.
  • the normalized filter condition may be translated to the MDX query in a second format such as AS 2000.
  • FIG. 4 is an illustrated example of semantic analysis of a filter condition according to an embodiment of the invention.
  • the query of FIG. 2 is analyzed to determine a semantic context for one or more member sets.
  • a first filter condition 400 is a graphical representation of filter condition 225 .
  • the first filter condition 400 has three predicates, first predicate 405 having a product category as a filter criterion, second predicate 410 having a product sub category as a filter criterion, and third predicate 415 having a fiscal year as a filter criterion.
  • the first filter condition 400 is converted to second filter condition 420 after a semantic analysis of first filter condition 400 .
  • first predicate 405 and second predicate 410 are grouped into first member set 425 since they share a same dimension, product.
  • the third predicate 415 having a different dimension, time, is grouped as second member set 430 .
  • FIG. 5 is a flow diagram for analyzing a filter condition to determine a semantic context of member sets in the filter condition of FIG. 4 according to an embodiment of the invention.
  • the dimensions of predicates in first filter condition 400 are identified.
  • the dimension of first predicate 405 and second predicate 410 is product
  • the dimension of third predicate 415 is identified as time.
  • the predicates having a same dimension are grouped into a single member set. Therefore, first predicate 405 and second predicate 410 having the same dimension, product are grouped into first member set 425 .
  • the third predicate 415 having the dimension, time is grouped as second member set 430 .
  • the filter condition 400 has a measure. If no, the semantic analysis process ends. If yes, then a hierarchy level in the dimension of the member set for which a measure is aggregated is determined.
  • the semantic context for a measure is the list of hierarchical levels in a dimension on which the measure is aggregated.
  • the semantic context is either explicit, that is, associated to the measure within the filter condition or implicit, that is, derived from the result objects of the query. For example, in query panel 200 , if the query had below parameters:
  • Filter Condition ([Measures].[Revenue]>100) AND ([Product].[Subcategory] inlist (“beer”, “soda”))
  • the result objects involve two dimensions, customer and product.
  • the deepest hierarchical levels for these two dimensions in result objects are [Customer].[City] which is below [Customer].[Country] and [Product].[Category]. Any measure that appears in filter condition 225 of the query is assumed to be aggregated on [City] and [Category], whatever the hierarchical levels in the rest of filter condition 225 is.
  • the filter condition in the above example involves an aggregation of the measure on [Product].[Subcategory] which is a level below [Category], but [Revenue] is still aggregated on [Category].
  • FIG. 6 is a flow diagram showing an example of process block 315 used in some embodiments of the invention.
  • the flow diagram depicts converting the filter condition to disjunctive normal form.
  • the predicates After performing a semantic analysis on the filter condition of the query the predicates have been grouped by like dimensions to form member sets. Now, the member sets in the filter condition are converted to a disjunctive normal form.
  • the member sets in the filter condition are converted to a conjunctive normal form.
  • the member sets are first converted to CNF in order to separate member sets of different dimensions. Once, the member sets are separated based on their dimension, they can be translated to their respective axis in an MDX query.
  • the predicates of the filter condition are again grouped into member sets as described in FIG. 5 .
  • the operators on member sets are converted to DNF whenever the filter condition initially involves an intersection of unions of the member sets. After conversion, such a condition is expressed as a union of intersection of member set.
  • the filter condition is optimized by replacing an intersection-AND (IAND) operator with a cross-join AND (CAND) operator.
  • IAND intersection-AND
  • CAND cross-join AND
  • the IAND operator is used to compute an intersection between the member sets of multiple dimensions having at least one dimension in common on either sides of the logical AND operator.
  • a CAND operator is used to compute an intersection between member sets of different dimensions. Translating an IAND operator to an MDX equivalent is more expensive in terms of time and memory than a CAND operator. Hence, the IAND operator is replaced with CAND operator wherever possible.
  • FIG. 7A is a flow diagram for replacing logical operators with normalized operators in the filter condition according to an embodiment of the invention.
  • processing block 320 replaces the logical operators in the filter condition with normalized operators.
  • the normalized operators include but not limited to operators such as CAND, IAND, UOR, AGGREGATION FILTER.
  • a logical AND operator between the member sets of a different dimension is replaced with a CAND operator. For example, consider
  • a logical AND operator between the member sets of multiple dimensions having at least one dimension in common between the member sets on either sides of the logical AND operator is replaced with an IAND operator. For example, consider a filter condition,
  • a logical OR operator between the member sets of different dimensions is replaced with a UOR operator. For example, consider a filter condition,
  • an aggregation between the member sets is converted to an AGGREGATION FILTER operator.
  • An aggregation is a function, such as, count, sum, average, min, max and the like, defined over a dimension. It is used to compute a measure value. For example, consider a filter condition
  • the logical AND and logical OR operators are replaced with the normalized operators such as CAND, IAND, UOR, AGGREGATION FILTER because each of the normalized operators has a straight forward translation in the MDX query.
  • the process of translating the query becomes easier and efficient if a filter condition in the query is converted to the normalized filter condition.
  • FIG. 7B illustrates replacing logical operators with normalized operators in the filter condition according to an embodiment of the invention.
  • the filter condition 715 of the query in query panel 200 is normalized by replacing the logical operators with normalized operators to create a normalized filter condition 740 .
  • a first logical AND operator 720 is between a first member set 730 and second member set 735 which are of different dimensions.
  • the first logical AND operator 720 is replaced with CAND operator 745 in the normalized filter condition 740 .
  • a second logical AND operator 725 is retained in the normalized filter condition 740 since it is an AND operation between two predicates of same dimensions.
  • FIG. 8 shows an example of converting a logical expression to conjunctive normal form.
  • Such a conversion is used by some embodiments of the invention to convert filter conditions.
  • the main goal in normalizing a filter condition is to obtain Cartesian products on distinct dimensions since these operations are efficiently executed in MDX.
  • the member sets are first converted to CNF in order to separate member sets of different dimensions. Once, the member sets are separated based on their dimension, they can be projected to their respective axis in an MDX query.
  • a first logical expression 800 which is (A AND B) OR C. Converting first logical expression 800 to CNF based on logical equivalences, we get
  • FIG. 9 is a block diagram for converting the filter condition to a disjunctive normal form according to an embodiment of the invention. After converting the filter condition to CNF and grouping the predicates of the same dimension into member sets, portions of the filter condition that involves a set intersection are converted to DNF.
  • first filter condition 900 which is in CNF.
  • DNF disjunctive normal form
  • FIG. 10 is a block diagram for optimizing a normalized filter condition by replacing an IAND operator with CAND operator according to an embodiment of the invention.
  • Translating an IAND operator to an MDX equivalent is more expensive in terms of time and memory than a CAND operator.
  • the IAND operator is replaced with CAND operator wherever possible.
  • the IAND is converted to a logical AND operator, the member sets of different dimensions are separated and the logical AND between the different dimensions is converted to the CAND operator.
  • the first normalized filter condition 1000 is optimized by converting an expensive IAND operator to a less expensive CAND operator as follows:
  • Second normalized filter condition 1025 Replacing the logical operators in filter condition 1005 with the normalized operators, we get second normalized filter condition 1025 .
  • Second normalized filter condition 1025 Second normalized filter condition 1025 :
  • a first logical AND operator 1010 is replaced with a first CAND operator 1030
  • a logical OR operator 1015 is replaced with an UOR operator 1035
  • a second logical AND operator 1020 is replaced with a second CAND operator 1040 in the normalized filter condition 1025 based on rules described in FIG. 7A .
  • the normalized filter condition is then combined with a result object and a data source of the query to form a normalized query. For instance, consider a query in query panel 200 , the normalized query would consist of result objects 205 , data source (not shown in the figure) and normalized filter condition 740 .
  • FIG. 11 is a normalization graph 1100 depicting structure of a normalized filter condition 1105 according to an embodiment of the invention.
  • the normalization process as described in FIG. 3 at block 320 reduces the normalized filter condition 1105 to a format described by, and constrained to, the normalization graph 1100 or a sub graph.
  • the normalized filter condition 1105 has normalized operators that include but not limited to CAND 1110 , UOR 1115 , IAND 1125 , MEMBER SET 1130 which is a semantic wrapper that groups predicates of same dimension, and AGGREGATION FILTER 1135 apart from logical AND and OR operators.
  • the normalization graph 1100 shows the structure of the normalized filter condition 1105 , that is, order of the normalized operators and operands the normalized operators have in some embodiments.
  • the operator at the top is CAND it separates non overlapping dimensions allowing for easy association with an axis in an MDX query.
  • operator CAND 1110 may have its operands as operators
  • MEMBER SET 1130 may have a selection on dimension 1140 such as “SELECT [PRODUCT].[CATEGORY].&BIKES” as its operand.
  • the AGGREGATION FILTER 1135 may have a selection on measure 1145 such as “SELECT revenue>100” as its operand.
  • FIG. 12 depicts a normalized query of FIG. 2 according to an embodiment of the invention.
  • the normalized query 1200 has result objects 205 , a data source, and a normalized filter condition 740 of the query in query panel 200 .
  • the normalized query 1200 forms an input to the MDX query translation process.
  • FIG. 13 is a block diagram for translating a result object in a normalized query to a first part of an MDX query according to an embodiment of the invention.
  • the MDX query has a syntax with at least the following three parts:
  • query_axis specifies a result set of the MDX query, that is, result objects 1300
  • slicer_axis specifies a filter condition of the MDX query
  • data source specifies a data source such as an OLAP cube from which the data is retrieved.
  • result objects 1300 of normalized query 1200 is translated to the query_axis of the SELECT clause.
  • the result set contains two columns, Customer geography 1305 , Customer Count 1310 and two rows, one for United States and the other for California. Therefore, result objects 1300 is translated to the first part 1315 of the MDX query as follows:
  • a data source specified in normalized query 1200 is translated to a third part of the MDX query as follows:
  • FIG. 14 is a block diagram illustrating the translating of a normalized filter condition of the normalized query 1200 to a second part of the MDX query according to an embodiment of the invention.
  • the normalized filter condition 1450 is translated to the slicer_axis of the FROM SELECT clause of the MDX query.
  • normalized filter condition 1450 of normalized query 1200 is translated to second part 1455 of the MDX query.
  • the MDX query based on a syntax of the AS 2000 query language.
  • the second part 1455 has MDX operators that include but not limited to CROSSJOIN, INTERSECT, UNION, FROM SELECT UNION, and FILTER.
  • the translation of normalized filter condition 1450 is done based on the multi-dimensional expression translation rules that include but not limited to:
  • Second part FROM (SELECT CROSSJOIN ( INTERSECT ( GENERATE ( EXCEPT ( [Product].[ProductCategories].[Category].Members, ⁇ [Product].[ProductCategories].[Category].&Bikes ⁇ ), DESCENDANTS ( [Product].[Product Categories].Currentmember, [Product].[Product Categories].[Subcategory], Self)), EXCEPT ( [Product].[Product Categories].[Subcategory].Members, ⁇ [Product].[Product Categories].[Subcategory].&CAPS ⁇ )), EXCEPT ( [Date].[Fiscal].[Fiscal Year].Members, ⁇ [Date].[Fiscal].[Fiscal Year].&[2002] ⁇ ))) on columns.
  • the GENERATE clause in MDX returns a concatenated string created by evaluating a string expression over a set.
  • the EXCEPT clause evaluates two sets and removes those tuples in the first set that also exist in the second set.
  • the DESCENDANTS clause returns the set of descendants of a member at a specified level or distance.
  • FIG. 15 depicts the MDX query 1500 of a query of FIG. 2 according to an embodiment of the invention.
  • the three parts are merged to form the MDX query in AS 2000 query language. Therefore, merging first part 1305 , second part 1400 and the third part 1210 , the MDX query 1500 reads as follows:
  • MDX Query SELECT ⁇ [Measures].Customer Count] ⁇ on columns, Non Empty Union ( ⁇ [Customer].[Customer Geography].[Country].&United States ⁇ , ⁇ [Customer].[Customer Geography].[State-Province].&Ca&Us ⁇ ) on rows FROM (SELECT CROSSJOIN ( INTERSECT ( GENERATE ( EXCEPT ( [Product].[ProductCategories].[Category].Members, ⁇ [Product].[ProductCategories].[Category].&Bikes ⁇ ), DESCENDANTS ( [Product].[Product Categories].Currentmember, [Product].[Product Categories].[Subcategory], Self)), EXCEPT ( [Product].[Product Categories].[Subcategory].Members, ⁇ [Product].[Product Categories].[Subcategory].&CAPS ⁇ )), EXCEPT ( [Date].[Fiscal].[Fiscal Year].
  • FIG. 16 is a flow diagram for converting a normalized query to a multi-dimensional expression query according to an embodiment of the invention.
  • a normalized query is received.
  • the normalized query is parsed to determine a result object, a data source and a normalized filter condition.
  • the result object and the data source are translated to a first part and a third part of the MDX query.
  • the translation is done based on syntax of the MDX query language that includes but not limited to AS 2000, AS 2005 and MaxL.
  • the normalized filter condition is translated to a second part of the MDX query based on MDX translation rules.
  • the MDX translation rules include rules for translating the normalized operators to MDX operators in the MDX query as described in FIG. 14 .
  • the first part specifying a result set of the MDX query, the second part specifying the filter condition of the MDX query and the third part specifying the data source of the MDX query are merged to form the MDX query.
  • FIG. 17 depicts a table of a source query filter condition, its normalized filter condition and MDX filter condition according to an embodiment of the invention. For example, in a first row of the table,
  • a logical AND operation is between two predicates S 1 and S 2 of the same dimension, PRODUCT. Therefore, in the normalized filter, MemberSet ([Product], AND (s 1 , s 2 )), the predicates S 1 and S 2 are grouped into a member set of PRODUCT and then a logical AND operation is performed between the predicates.
  • the normalized filter translates to an INTERSECT operation between predicates (translate (S 1 ), translate (S 2 )) in the MDX filter.
  • the predicates translate (S 1 ) and translate (S 2 ) are translated to query languages such as AS 2000.
  • FIG. 18 is a block diagram of a system for converting a query in a first format to a MDX query in the second format according to an embodiment of the invention.
  • System 1800 includes query provider 1805 that provides a query in the first format which is translated to MDX query 1855 .
  • the first format includes SQL, other declarative query languages, and queries created in a declarative way in tools such as Web Intelligence from Business Objects.
  • the second format in which the MDX query is generated includes but not limited to query languages such as AS 2000, AS 2005 and MaxL.
  • query provider 1805 also provides a query specification to identify a format of the query. Based on the format of the query, different parts of the query such as result objects, a filter condition and a data source may be identified.
  • a parser 1810 in communication with query provider 1805 parses the query to determine a result object and data source 1860 and a filter condition of the query.
  • a semantic analyzer 1815 in communication with parser 1810 analyzes the filter condition of the query to determine a semantic context such as dimensions of predicates of the filter condition and a hierarchy level in the dimension of a member set to which a measure is aggregated. The semantic analyzer 1815 groups the predicates having a same dimension into a member set.
  • a normalization engine 1820 in communication with pattern matching tool 1825 normalizes the semantically analyzed filter condition by converting the filter condition to disjunctive normal form.
  • pattern matching tool 1825 includes To One Matching (TOM) framework provided by INRIA of France.
  • TOM To One Matching
  • the pattern matching tool 1825 has rules for converting logical expressions to disjunctive normal form, conjunctive normal form and converting between the two. The rules are based on logical equivalences.
  • the normalization engine 1820 has rules for replacing logical operators with normalized operators.
  • the normalization engine 1820 in communication with pattern matching tool 1825 replaces the logical operators with the normalized operators.
  • a query optimizer 1830 optimizes the filter condition by replacing an IAND operator with a CAND operator and creates normalized filter condition 1835 . This optimization may involve repeated conversions of the filter condition to disjunctive normal form and conjunctive normal form. So the query optimizer 1830 is in communication with normalization engine 1820 .
  • the normalized filter condition 1835 and result object and data source 1860 are converted to MDX query 1855 by MDX engine 1840 and translator 1850 .
  • the MDX engine 1840 converts the normalized filter condition 1835 and result object and data source 1860 to an intermediate MDX query based on MDX translation rules. Further, the intermediate format of the MDX filter is translated to MDX query 1855 by translator 1850 .
  • the MDX engine 1840 identifies the normalized operators in normalized filter condition 1835 and provides them to pattern matching tool 1825 along with result object and data source 1860 to convert them to the intermediate MDX query.
  • the pattern matching tool 1825 constructs the intermediate MDX query by
  • the translator 1850 further translates the intermediate MDX query to MDX query 1855 in a query language that includes but not limited to AS 2000, AS 2005 and MaxL.
  • the translator 1850 obtains syntax of the query language from syntax file 1845 .
  • Embodiments of the invention may include various steps as set forth above.
  • the steps may be embodied in machine-executable program code which causes a general-purpose or special-purpose processor to perform certain steps.
  • these steps may be performed by specific hardware components that contain hardwired logic for performing the steps, or by any combination of programmed computer components and custom hardware components.
  • Embodiments of the present invention may also be provided as a machine-readable medium for storing the machine-executable instructions.
  • the machine-readable medium may include, but is not limited to, flash memory, optical disks, CD-ROMs, DVD ROMs, RAMs, EPROMs, EEPROMs, magnetic or optical cards, or any other type of machine-readable media suitable for tangibly storing electronic instructions.
  • the machine readable medium can provide the instructions stored therein to a computer system comprising a processor capable of reading and executing the instructions to implement the method steps described herein.

Abstract

Disclosed is a method and system for receiving a query in a first format, parsing the query to determine a result object, a data source and a filter condition in the query. The filter condition is converted to a normalized filter condition by analyzing a semantic context of one or more member sets in the filter condition, converting the one or more member sets into a normal form and replacing logical operators between the one or more member sets with normalized operators. The normalized filter condition along with the result object and the data source is then translated to a multi-dimensional database query in a second format.

Description

    FIELD OF THE INVENTION
  • The invention generally relates to the field of multi-dimensional data sources and systems. More particularly the invention relates to converting a database query involving a propositional formula to a multi-dimensional expression query by normalizing and translating the database query.
  • BACKGROUND OF THE INVENTION
  • Queries are used create, modify, retrieve and manipulate data in a data source, such as, a database, a data warehouse, a plurality of reports, and the like. Filtering is the application of filters. A filter is a condition used to limit information retrieved from a data source to a subset of the whole result of an unfiltered query. Filters are usually expressed in form a propositional formula that states the condition.
  • OLAP tools are a subset of business intelligence tools. There are a number of commercially available OLAP tools including BusinessObjects OLAP Voyager™ which is available from Business Objects of San Jose, Calif. OLAP tools are a report generation tool, and a tool suited to ad hoc analyses. OLAP generally refers to a technique of providing fast analysis of shared multi-dimensional information stored in a database. In some OLAP tools the data is arranged in a schema which simulates a multidimensional schema. Conceptually the information is in a hyper cube. The multi-dimensional schema means redundant information is stored, but it allows for users to initiate queries without the need to know how the data is organized.
  • Multidimensional Expressions (MDX) is a query language for OLAP databases, like SQL is a query language for relational databases. Thus, an MDX statement can be used to query for a result from an OLAP data source, i.e., a cube. The MDX statement can resemble SQL statements where one can ask for data on a row and columns from a cube. As with an SQL query, each MDX query requires a data request (the “SELECT” clause), a starting point (the “FROM” clause), and a filter (the “WHERE” clause). These and other keywords provide the tools used to extract specific portions of data from a cube for analysis, e.g., to extract a slice.
  • SQL queries are used to generate a two dimensional result—zero or more rows. This result is a relation. The query process includes selecting columns; selecting from relations; and applying filters using predicates. MDX queries are used to generate a multidimensional result—the result is a (hyper-) cube. The process includes selecting sets of members; selecting from a cube; and applying filters. The filters are set operators for members & predicates for values. Not all SQL queries can be converted MDX queries. Most filter conditions cannot be directly translated as predicates.
  • SUMMARY OF THE INVENTION
  • Described are methods and systems for receiving a query in a first format, parsing the query to determine a result object, a data source and a filter condition in the query. The filter condition is converted to a normalized filter condition by analyzing a semantic context of one or more member sets in the filter condition, converting the one or more member sets into a normal form and replacing logical operators between the one or more member sets with normalized operators. The normalized filter condition along with the result object and the data source is then translated to a multi-dimensional expression query in a second format.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • The following description includes discussion of figures having illustrations given by way of example of implementations of embodiments of the invention. The drawings should be understood by way of example, and not by way of limitation. As used herein, references to one or more “embodiments” are to be understood as describing a particular feature, structure, or characteristic included in at least one implementation of the invention. Thus, phrases such as “in one embodiment” or “in an alternate embodiment” appearing herein describe various embodiments and implementations of the invention, and do not necessarily all refer to the same embodiment. However, they are also not necessarily mutually exclusive.
  • FIG. 1 is a flow diagram for converting a query in a first format to a multi-dimensional expression (MDX) query in a second format according to an embodiment of the invention.
  • FIG. 2 is an exemplary screen display of a user interface to create a query in a first format according to an embodiment of the invention.
  • FIG. 3 is a flow diagram for normalizing a filter condition of the query in the first format according to an embodiment of the invention.
  • FIG. 4 is an illustration of semantic analysis of a filter condition in a query of FIG. 2 according to an embodiment of the invention.
  • FIG. 5 is a flow diagram for analyzing a filter condition to determine a semantic context of member sets in the filter condition according to an embodiment of the invention.
  • FIG. 6 is a flow diagram for converting the filter condition to normal form according to an embodiment of the invention.
  • FIG. 7A is a flow diagram for replacing logical operators with normalized operators in the filter condition according to an embodiment of the invention.
  • FIG. 7B continues the example in FIG. 4 and is an illustration of replacing logical operators with normalized operators in the filter condition according to an embodiment of the invention.
  • FIG. 8 is an example of converting a logical expression to conjunctive normal form according to an embodiment of the invention.
  • FIG. 9 is an example of the conversion of a part of a filter condition to a disjunctive normal form according to an embodiment of the invention.
  • FIG. 10 shows the optimizing of an example filter condition by replacing an IAND operator with CAND operator according to an embodiment of the invention.
  • FIG. 11 is a normalization graph depicting structure of a normalized filter condition according to an embodiment of the invention.
  • FIG. 12 continues the example in FIGS. 2 and 7B by showing a normalized query of FIG.2 according to an embodiment of the invention.
  • FIG. 13 furthers the example in FIG. 12 by showing a translation of a result object in a normalized query to a first part of an MDX query according to an embodiment of the invention.
  • FIG. 14 extends the example in FIGS. 7B and 13 by illustrating the translation a normalized filter condition of the normalized query to a second part of the MDX query according to an embodiment of the invention.
  • FIG. 15 continues the example in FIG. 14 by depicting the MDX query of a query of FIG. 2 according to an embodiment of the invention.
  • FIG. 16 is a flow diagram for converting a normalized query to a multi-dimensional expression query according to an embodiment of the invention.
  • FIG. 17 depicts a table of a source query filter condition, its normalized filter condition and MDX filter condition according to an embodiment of the invention.
  • FIG. 18 is a block diagram for converting a query in a first format to the MDX query in the second format according to an embodiment of the invention.
  • DETAILED DESCRIPTION
  • In a multi-dimensional database such as an online analytical processing (OLAP) cube contains data in various dimensions such as products, people, geography, financial elements, and time. A dimension may have a number of hierarchical levels in it. For example, dimension geography may have at least three hierarchical levels namely country, state, and district. A measure or a metric is a quantity as ascertained by comparison with a standard, usually denoted in some metric, for example, units sold and dollars.
  • FIG. 1 is a flow diagram for converting a query in a first format to a multi-dimensional expression (MDX) query in a second format according to an embodiment of the invention. The MDX query is used to access data in a multi-dimensional database such as the OLAP cube. At process block 100, a query in a first format is received. An example of the first format includes a structured query language (SQL) , other declarative query languages, and a query created in a declarative way using a user interface such as a query panel in the BusinessObjects Web Intelligence™ product provided by Business Objects Americas of San Jose in California. At process block 105, the query is normalized into a canonical form that is suitable for a translation to the MDX query. A canonical form (often called normal form or standard form) of an object is a standard way of presenting that object. The normalization process converts a filter condition in the query to a structured pattern wherein each structured pattern has a defined MDX query translation. At process block 110, the normalized query is translated to the MDX query in a second format. An example of the second format includes query languages such as Analysis Services (AS) 2000 and AS 2005 provided by Microsoft of Redmond, and MaxL from Oracle Essbase. These query languages are based on a MDX specification defined by Microsoft.
  • FIG. 2 is an exemplary screen display of a user interface to create a query in a first format according to an embodiment of the invention. A user may create a query in a declarative way using a user interface such as query panel 200. In a declarative way, the query is created by a user selection of result set, data source and filter criteria parameters unlike creating a query by writing a source code. The query in query panel 200 fetches a count of customers in “United States” and “California” those who have bought a product which is not in the category of “bikes”, and not in the sub category of “caps” and not in fiscal year “2002”. The result set of the query, “a count of customers in United States and California” is defined as customer geography 215 and customer count 220 in result objects 205. The filter criteria of the query, “product not in the category of bikes, not in the sub category of caps and not in fiscal year 2002” are defined in filter condition 225 as first predicate 235, second predicate 240 and third predicate 245 respectively. An AND operator 230 defines the relation between predicates of filter condition 225.
  • FIG. 3 is a flow diagram for normalizing a filter condition of the query in the first format according to an embodiment of the invention. At process block 300, a query in a first format is received. An example of the query in the first format includes a SQL query and a query created using query panel 200. Along with the query, a query specification is also received to identify a format of the query received. Based on the format received, different parts of the query such as result objects, a filter condition and a data source may be retrieved. At process block 305, the query is parsed to determine a filter condition 225 of the query. At process block 310, filter condition 225 is analyzed to determine a semantic context of member sets in filter condition 225. A member set is a predicate that applies on a single dimension. For example, in filter condition 225, first predicate 235 in the product category and the second predicate 240 in the product sub-category form a single member set since they share a same dimension, product. The third predicate 245 which is on a time dimension forms another member set. The filter condition also includes a predicate that involves a measure. The measure represents a quantity such as revenue, number of units etc. The member set represents a relationship or a property such as name of a product and category of a product.
  • At process block 315, AND/OR operators applied on analyzed member sets are converted into a normal form that involves multiple layers of conjunctions and disjunctions. In an embodiment, semantically analyzed member sets may be converted into a disjunctive normal form (DNF) in the filter condition. The DNF is a format of logical expression. The DNF is a standard way to write a logical expression that is characterized by one or more disjunctions joining two or more conjunctive clauses. A disjunction is an “OR” operation. The following expressions are in DNF: X
    Figure US20100121868A1-20100513-P00001
    Y; X; (X
    Figure US20100121868A1-20100513-P00002
    Y)
    Figure US20100121868A1-20100513-P00001
    Z and (X
    Figure US20100121868A1-20100513-P00003
    Y
    Figure US20100121868A1-20100513-P00003
    Z)
    Figure US20100121868A1-20100513-P00001
    (
    Figure US20100121868A1-20100513-P00004
    A
    Figure US20100121868A1-20100513-P00002
    B
    Figure US20100121868A1-20100513-P00002
    C). Logical expressions can be converted to DNF by using logical equivalences.
  • A conjunction is an “AND” operation. An expression is in conjunctive normal form (CNF) if it is a conjunction of clauses. A conjunctive clause is one or more variables, or negations of a variable—combined by zero, one or more “and” operations. The following expressions are in CNF: X
    Figure US20100121868A1-20100513-P00002
    Y; (X
    Figure US20100121868A1-20100513-P00001
    Y)
    Figure US20100121868A1-20100513-P00002
    (X
    Figure US20100121868A1-20100513-P00001
    Z).
  • At process block 320, a normalized filter condition is created by replacing logical operators such as “AND” and “OR” between the member sets in the filter condition with normalized operators. The normalized operators include crossjoin-and (CAND), intersect-and (IAND), union-or (UOR) and AGGREGATION FILTER. The CAND operator computes an intersection of member sets of different dimensions. The IAND operator computes an intersection of the member sets of multiple dimensions having at least one common dimension on either side of the logical AND operator. The UOR operator computes a union of member sets of different dimensions. The aggregation filter operator aggregates a measure over the one or more member sets. The logical AND operator computes an intersection of member sets of same dimension and the logical OR operator computes a union of member sets of same dimension.
  • The normalized filter condition has the member sets in a normal form with normalized operators. The normalized filter condition may be translated to the MDX query in a second format such as AS 2000.
  • FIG. 4 is an illustrated example of semantic analysis of a filter condition according to an embodiment of the invention. The query of FIG. 2 is analyzed to determine a semantic context for one or more member sets. A first filter condition 400 is a graphical representation of filter condition 225. The first filter condition 400 has three predicates, first predicate 405 having a product category as a filter criterion, second predicate 410 having a product sub category as a filter criterion, and third predicate 415 having a fiscal year as a filter criterion.
  • The first filter condition 400 is converted to second filter condition 420 after a semantic analysis of first filter condition 400. As a result of semantic analysis, first predicate 405 and second predicate 410 are grouped into first member set 425 since they share a same dimension, product. The third predicate 415 having a different dimension, time, is grouped as second member set 430.
  • FIG. 5 is a flow diagram for analyzing a filter condition to determine a semantic context of member sets in the filter condition of FIG. 4 according to an embodiment of the invention. At process block 500, the dimensions of predicates in first filter condition 400 are identified. For example, the dimension of first predicate 405 and second predicate 410 is product, and the dimension of third predicate 415 is identified as time. At process block 505, the predicates having a same dimension are grouped into a single member set. Therefore, first predicate 405 and second predicate 410 having the same dimension, product are grouped into first member set 425. The third predicate 415 having the dimension, time is grouped as second member set 430.
  • At process block 510, it is checked if the filter condition 400 has a measure. If no, the semantic analysis process ends. If yes, then a hierarchy level in the dimension of the member set for which a measure is aggregated is determined. The semantic context for a measure is the list of hierarchical levels in a dimension on which the measure is aggregated. The semantic context is either explicit, that is, associated to the measure within the filter condition or implicit, that is, derived from the result objects of the query. For example, in query panel 200, if the query had below parameters:
  • Result objects=[Customer].[Country], [Customer].[City], [Product].[Category], [Measures].[Revenue]
  • Filter Condition=([Measures].[Revenue]>100) AND ([Product].[Subcategory] inlist (“beer”, “soda”))
  • The result objects involve two dimensions, customer and product. The deepest hierarchical levels for these two dimensions in result objects are [Customer].[City] which is below [Customer].[Country] and [Product].[Category]. Any measure that appears in filter condition 225 of the query is assumed to be aggregated on [City] and [Category], whatever the hierarchical levels in the rest of filter condition 225 is. The filter condition in the above example involves an aggregation of the measure on [Product].[Subcategory] which is a level below [Category], but [Revenue] is still aggregated on [Category].
  • FIG. 6 is a flow diagram showing an example of process block 315 used in some embodiments of the invention. The flow diagram depicts converting the filter condition to disjunctive normal form. After performing a semantic analysis on the filter condition of the query the predicates have been grouped by like dimensions to form member sets. Now, the member sets in the filter condition are converted to a disjunctive normal form. In a multi-dimensional environment wherein a filter condition has predicates of multiple dimensions, it is more efficient to evaluate the filter condition as a “union of intersections” rather than an “intersection of unions”.
  • At process block 600, the member sets in the filter condition are converted to a conjunctive normal form. The member sets are first converted to CNF in order to separate member sets of different dimensions. Once, the member sets are separated based on their dimension, they can be translated to their respective axis in an MDX query. Consider a filter condition of the form
      • Filter condition=(A AND B) OR C.
        Converting this to CNF based on logical equivalences, we get
      • Filter condition in CNF=(A OR C) AND (B OR C)
  • After converting the member sets to CNF, at process block 605, the predicates of the filter condition are again grouped into member sets as described in FIG. 5. At process block 610, the operators on member sets are converted to DNF whenever the filter condition initially involves an intersection of unions of the member sets. After conversion, such a condition is expressed as a union of intersection of member set. Consider a filter condition in CNF,
      • Filter condition=((Count=N) OR (Country in {INDIA, USA})) AND Country in {CHINA, INDIA}
    Converting to DNF, we get
      • Filter condition=(Country=INDIA) OR ((Count=N) AND (Country in {INDIA, CHINA}))
    Where,
      • OR operator is a union operation; and
      • AND operator is an intersect operation.
  • The details of the above conversion are described in FIG. 9. It is more efficient to translate the filter condition in DNF to an MDX query than the filter condition in CNF. Evaluating a statement in DNF is more efficient since as few as one clause needs to be evaluated. Like Evaluation translating a filter condition which is in CNF (as an intersection of unions of the member sets) is typically less efficient.
  • Finally, at process block 615, the filter condition is optimized by replacing an intersection-AND (IAND) operator with a cross-join AND (CAND) operator. The IAND operator is used to compute an intersection between the member sets of multiple dimensions having at least one dimension in common on either sides of the logical AND operator. A CAND operator is used to compute an intersection between member sets of different dimensions. Translating an IAND operator to an MDX equivalent is more expensive in terms of time and memory than a CAND operator. Hence, the IAND operator is replaced with CAND operator wherever possible.
  • For example, consider a filter condition,
      • Filter Condition: (Year>2007) IAND ((Product=SODA) OR (Year=2007)).
        Optimizing the above filter condition by replacing the IAND operator with CAND operator, we get
      • Filter condition: (Year=2007) OR ((Year>2007) CAND (Product=SODA))
  • The details of the conversion of the above filter condition are explained in FIG. 10. In another embodiment, it may not be possible to replace an IAND operator with CAND operator in complex queries.
  • FIG. 7A is a flow diagram for replacing logical operators with normalized operators in the filter condition according to an embodiment of the invention. After processing the filter condition in FIGS. 3, 5 and 6 processing block 320 replaces the logical operators in the filter condition with normalized operators. The normalized operators include but not limited to operators such as CAND, IAND, UOR, AGGREGATION FILTER.
  • At process block 700, a logical AND operator between the member sets of a different dimension is replaced with a CAND operator. For example, consider
      • Filter Condition: (Country=USA) AND (Year=2007)
        The member sets on either sides of the AND operator in the above filter condition have different dimensions, geography and time respectively. Hence, the logical AND operator becomes a CAND operator in the normalized filter condition. Therefore, the normalized filter condition reads as
      • Filter Condition: (Country=USA) CAND (Year=2007)
  • At process block 705, a logical AND operator between the member sets of multiple dimensions having at least one dimension in common between the member sets on either sides of the logical AND operator is replaced with an IAND operator. For example, consider a filter condition,
      • Filter Condition: (Year≧2007) AND ((Product=SODA) OR (Year=2007))
        In the above filter condition, there are multiple dimensions on either side of the AND operator; dimension, time on the left side and dimensions, product and time on the right side of the AND operator. But, the member set (Year≧2007) on the left side of the AND operator and the member set (Year=2007) on the right side of the operator have the same dimension, time. Therefore, the AND operator is replaced with an IAND operator. The normalized filter condition will read as
      • Filter Condition: (Year≧2007) IAND ((Product=SODA) OR (Year=2007)).
  • At process block 710, a logical OR operator between the member sets of different dimensions is replaced with a UOR operator. For example, consider a filter condition,
      • Filter Condition: (Year=2007) OR (Product=Soda)
        Since the member sets (Year=2007) and (Product=Soda) have different dimensions, time and product, the logical OR operator is replaced with the normalized UOR operator. The normalized filter condition reads as,
      • Filter Condition: (Year=2007) UOR (Product=Soda)
  • At process block 715, an aggregation between the member sets is converted to an AGGREGATION FILTER operator. An aggregation is a function, such as, count, sum, average, min, max and the like, defined over a dimension. It is used to compute a measure value. For example, consider a filter condition
      • Filter Condition: (Country=USA) AND (Revenue>1000) is converted to=AGGREGATION FILTER (Country=USA, Revenue>1000)
        The above filter condition means that a result set is first filtered by predicate (Country=USA) and then a measure such as (Revenue>1000) is applied on the filtered result set.
  • The logical AND and logical OR operators are replaced with the normalized operators such as CAND, IAND, UOR, AGGREGATION FILTER because each of the normalized operators has a straight forward translation in the MDX query. The process of translating the query becomes easier and efficient if a filter condition in the query is converted to the normalized filter condition.
  • FIG. 7B illustrates replacing logical operators with normalized operators in the filter condition according to an embodiment of the invention. Continuing the example of FIG. 4 the filter condition 715 of the query in query panel 200 is normalized by replacing the logical operators with normalized operators to create a normalized filter condition 740. A first logical AND operator 720 is between a first member set 730 and second member set 735 which are of different dimensions. Hence, the first logical AND operator 720 is replaced with CAND operator 745 in the normalized filter condition 740. A second logical AND operator 725 is retained in the normalized filter condition 740 since it is an AND operation between two predicates of same dimensions.
  • FIG. 8 shows an example of converting a logical expression to conjunctive normal form. Such a conversion is used by some embodiments of the invention to convert filter conditions. The main goal in normalizing a filter condition is to obtain Cartesian products on distinct dimensions since these operations are efficiently executed in MDX. Thus, the member sets are first converted to CNF in order to separate member sets of different dimensions. Once, the member sets are separated based on their dimension, they can be projected to their respective axis in an MDX query. Consider a first logical expression 800 which is (A AND B) OR C. Converting first logical expression 800 to CNF based on logical equivalences, we get
      • =(A AND B) OR (C)
      • =(A OR C) AND (B OR C)
      • which is the second logical expression 805.
  • FIG. 9 is a block diagram for converting the filter condition to a disjunctive normal form according to an embodiment of the invention. After converting the filter condition to CNF and grouping the predicates of the same dimension into member sets, portions of the filter condition that involves a set intersection are converted to DNF. Consider first filter condition 900 which is in CNF. Using on logical equivalences, converting the filter condition 900 to DNF,
      • =((Country in {USA, INDIA}) OR (Count=N)) AND (Country in {INDIA, CHINA})
      • =(Country in {USA, INDIA }) AND (Country in {INDIA, CHINA }) OR
      • ((Country in {INDIA, CHINA}) AND (Count=N)) which is second filter condition 905. The structural similarity between logical expressions 800 and 805 and filter conditions 900 and 905 should be apparent. Because the filter condition 905 is in disjunctive normal form with AND/Intersection operators between elements of the member sets. And because the member sets are along common dimensions the member sets with the filter conditions can be simplified.
      • =(Country=INDIA) OR
      • ((Count=N) AND (Country in {INDIA, CHINA})) which is third filter condition 910 in DNF.
  • FIG. 10 is a block diagram for optimizing a normalized filter condition by replacing an IAND operator with CAND operator according to an embodiment of the invention. Translating an IAND operator to an MDX equivalent is more expensive in terms of time and memory than a CAND operator. Hence, the IAND operator is replaced with CAND operator wherever possible. The IAND is converted to a logical AND operator, the member sets of different dimensions are separated and the logical AND between the different dimensions is converted to the CAND operator.
  • In an embodiment, the first normalized filter condition 1000 is optimized by converting an expensive IAND operator to a less expensive CAND operator as follows:
      • [(Product=SODA) UOR (Year=2007)] IAND
      • (Year>2007) AND (Country=USA)
        The UOR is converted to logical OR. The IAND is converted to logical AND. The predicate with year inequality is distributed over the clause containing OR.
      • ={[(Product=SODA) AND (Year>2007)] OR [(Year=2007) AND (Year>2007)]} AND (Country=USA)
        Since [(Year=2007) AND (Year>2007)] can be simplified.
      • ={[(Product=SODA) AND (Year>2007)] OR (Year=2007)} AND (Country=USA)
        This is filter condition 1005.
  • Replacing the logical operators in filter condition 1005 with the normalized operators, we get second normalized filter condition 1025. Second normalized filter condition 1025:
      • ={[(Product=SODA) CAND (Year>2007)] UOR (Year=2007)} CAND (Country=USA)
  • A first logical AND operator 1010 is replaced with a first CAND operator 1030, a logical OR operator 1015 is replaced with an UOR operator 1035, and a second logical AND operator 1020 is replaced with a second CAND operator 1040 in the normalized filter condition 1025 based on rules described in FIG. 7A. The normalized filter condition is then combined with a result object and a data source of the query to form a normalized query. For instance, consider a query in query panel 200, the normalized query would consist of result objects 205, data source (not shown in the figure) and normalized filter condition 740.
  • FIG. 11 is a normalization graph 1100 depicting structure of a normalized filter condition 1105 according to an embodiment of the invention. The normalization process as described in FIG. 3 at block 320 reduces the normalized filter condition 1105 to a format described by, and constrained to, the normalization graph 1100 or a sub graph. The normalized filter condition 1105 has normalized operators that include but not limited to CAND 1110, UOR 1115, IAND 1125, MEMBER SET 1130 which is a semantic wrapper that groups predicates of same dimension, and AGGREGATION FILTER 1135 apart from logical AND and OR operators. The normalization graph 1100 shows the structure of the normalized filter condition 1105, that is, order of the normalized operators and operands the normalized operators have in some embodiments. The operator at the top is CAND it separates non overlapping dimensions allowing for easy association with an axis in an MDX query. For instance, operator CAND 1110 may have its operands as operators
      • UOR 1115 as indicated by first arrow 1111,
      • IAND 1125 as indicated by second arrow 1112,
      • MEMBSER SET 1130 as indicated by third arrow 1113, and
      • AGGREGATION FILTER 1135 as indicated by fourth arrow 1113.
        Statements in DNF are faster to evaluate that those in CNF. It is faster to compute unions of low cardinality sets than intersections of high cardinality sets. Hence UOR 1115 may have its operands as operators CAND 1120 and IAND 1125; as well as MEMBSER SET 1130 and AGGREGATION FILTER 1135. Whereas operator IAND 1125 may have only MEMBSER SET 1130 and AGGREGATION FILTER 1135 as its operands.
  • Similarly, MEMBER SET 1130 may have a selection on dimension 1140 such as “SELECT [PRODUCT].[CATEGORY].&BIKES” as its operand. The AGGREGATION FILTER 1135 may have a selection on measure 1145 such as “SELECT revenue>100” as its operand.
  • FIG. 12 depicts a normalized query of FIG. 2 according to an embodiment of the invention. The normalized query 1200 has result objects 205, a data source, and a normalized filter condition 740 of the query in query panel 200. The normalized query 1200 forms an input to the MDX query translation process.
  • FIG. 13 is a block diagram for translating a result object in a normalized query to a first part of an MDX query according to an embodiment of the invention. In an embodiment, the MDX query has a syntax with at least the following three parts:
  •     SELECT <query_axis> (first part)
        FROM SELECT <slicer_axis> (second part)
        ...
        ...
        FROM <data source> (third part)
        where
        query_axis specifies a result set of the MDX query, that is,
    result objects 1300;
        slicer_axis specifies a filter condition of the MDX query; and
        data source specifies a data source such as an OLAP cube from
    which the data is retrieved.
  • The result objects 1300 of normalized query 1200 is translated to the query_axis of the SELECT clause. The result set contains two columns, Customer geography 1305, Customer Count 1310 and two rows, one for United States and the other for California. Therefore, result objects 1300 is translated to the first part 1315 of the MDX query as follows:
  • First part =
      SELECT
        {[Measures].Customer Count]} on columns,
        Non Empty Union
        ({[Customer].[Customer Geography].[Country].&United States},
        {[Customer].[Customer Geography].[State-Province].&Ca&Us})
        on rows.
  • Similarly, a data source specified in normalized query 1200 is translated to a third part of the MDX query as follows:
      • Third part=FROM (Customer_Cube)
  • FIG. 14 is a block diagram illustrating the translating of a normalized filter condition of the normalized query 1200 to a second part of the MDX query according to an embodiment of the invention. The normalized filter condition 1450 is translated to the slicer_axis of the FROM SELECT clause of the MDX query. In an embodiment, normalized filter condition 1450 of normalized query 1200 is translated to second part 1455 of the MDX query. In an embodiment the MDX query based on a syntax of the AS 2000 query language. The second part 1455 has MDX operators that include but not limited to CROSSJOIN, INTERSECT, UNION, FROM SELECT UNION, and FILTER. The translation of normalized filter condition 1450 is done based on the multi-dimensional expression translation rules that include but not limited to:
      • Translate a CAND operator in the normalized filter condition to MDX CROSSJOIN operator in the MDX query;
      • Translate an IAND operator in the normalized filter condition to MDX INTERSECT operator in the MDX query;
      • Translate an AND operator in the normalized filter condition to MDX INTERSECT operator in the MDX query;
      • Translate an OR operator in the normalized filter condition to MDX UNION operator in the MDX query;
      • Translate an UOR operator in the normalized filter condition to MDX FROM SELECT UNION operator in the MDX query;
      • Translate an AGGREGATIONFILTER operator in the normalized filter condition to MDX FILTER operator in the MDX query; and
      • Translate predicates in the normalized filter condition to the MDX query based on syntax of the MDX language.
  • Based on the above rules normalized filter condition 1450 is translated to second part 1455 as follows:
      • CAND operator 1400 is translated to CROSSJOIN operator 1445;
      • AND operator 1405 is translated to INTERSECT operator 1430; and
      • Predicates, first predicate 1410, second predicate 1415, and third predicate 1420 are translated to predicates fourth predicate 1425, fifth predicate 1435 and sixth predicate 1440 respectively based on syntax of AS 2000 query language.
        After the translation, the second part 1455 of the MDX query reads as follows:
  • Second part =
      FROM (SELECT
      CROSSJOIN (
        INTERSECT (
          GENERATE (
            EXCEPT (
              [Product].[ProductCategories].[Category].Members,
              {[Product].[ProductCategories].[Category].&Bikes}),
            DESCENDANTS (
              [Product].[Product Categories].Currentmember,
              [Product].[Product Categories].[Subcategory], Self)),
          EXCEPT (
            [Product].[Product Categories].[Subcategory].Members,
            {[Product].[Product Categories].[Subcategory].&CAPS})),
        EXCEPT (
          [Date].[Fiscal].[Fiscal Year].Members,
          {[Date].[Fiscal].[Fiscal Year].&[2002]}))) on columns.
  • The GENERATE clause in MDX returns a concatenated string created by evaluating a string expression over a set. The EXCEPT clause evaluates two sets and removes those tuples in the first set that also exist in the second set. The DESCENDANTS clause returns the set of descendants of a member at a specified level or distance.
  • FIG. 15 depicts the MDX query 1500 of a query of FIG. 2 according to an embodiment of the invention. After translating the normalized query 1200 to first part 1305 specifying a result set, second part 1400 specifying a filter condition and the third part specifying a data source of the MDX query, the three parts are merged to form the MDX query in AS 2000 query language. Therefore, merging first part 1305, second part 1400 and the third part 1210, the MDX query 1500 reads as follows:
  • MDX Query =
      SELECT
        {[Measures].Customer Count]} on columns,
        Non Empty Union
        ({[Customer].[Customer Geography].[Country].&United States},
        {[Customer].[Customer Geography].[State-Province].&Ca&Us})
        on rows
      FROM (SELECT
      CROSSJOIN (
        INTERSECT (
          GENERATE (
            EXCEPT (
              [Product].[ProductCategories].[Category].Members,
              {[Product].[ProductCategories].[Category].&Bikes}),
            DESCENDANTS (
              [Product].[Product Categories].Currentmember,
              [Product].[Product Categories].[Subcategory], Self)),
          EXCEPT (
            [Product].[Product Categories].[Subcategory].Members,
            {[Product].[Product Categories].[Subcategory].&CAPS})),
        EXCEPT (
          [Date].[Fiscal].[Fiscal Year].Members,
          {[Date].[Fiscal].[Fiscal Year].&[2002]}))) on columns
      FROM (Customer_Cube).
  • FIG. 16 is a flow diagram for converting a normalized query to a multi-dimensional expression query according to an embodiment of the invention. At process block 1600, a normalized query is received. At process block 1605, the normalized query is parsed to determine a result object, a data source and a normalized filter condition. At process block 1610, the result object and the data source are translated to a first part and a third part of the MDX query. The translation is done based on syntax of the MDX query language that includes but not limited to AS 2000, AS 2005 and MaxL.
  • At process block 1615, the normalized filter condition is translated to a second part of the MDX query based on MDX translation rules. In an embodiment, the MDX translation rules include rules for translating the normalized operators to MDX operators in the MDX query as described in FIG. 14. At process block 1620, the first part specifying a result set of the MDX query, the second part specifying the filter condition of the MDX query and the third part specifying the data source of the MDX query are merged to form the MDX query.
  • FIG. 17 depicts a table of a source query filter condition, its normalized filter condition and MDX filter condition according to an embodiment of the invention. For example, in a first row of the table,
      • Source filter=AND (s1@selection 1([Product]), s2@selection2 ([Product])
      • Normalized filter condition=MemberSet ([Product], AND (s1, s2))
      • MDX filter condition=INTERSECT (translate (s1), translate (s2))
        where
      • S1SELECTION1 [PRODUCT] is a selection condition on a PRODUCT dimension such as
      • Selection ([Product].[Category], neq ([Product].[Category].[Bikes])
      • and similarly S2 is another selection condition on the PRODUCT dimension.
  • In the source filter condition a logical AND operation is between two predicates S1 and S2 of the same dimension, PRODUCT. Therefore, in the normalized filter, MemberSet ([Product], AND (s1, s2)), the predicates S1 and S2 are grouped into a member set of PRODUCT and then a logical AND operation is performed between the predicates. The normalized filter translates to an INTERSECT operation between predicates (translate (S1), translate (S2)) in the MDX filter. The predicates translate (S1) and translate (S2) are translated to query languages such as AS 2000.
  • Other filters mentioned in rest of the rows of the table may be understood in light of the above explanation of the filter condition in the first row.
  • FIG. 18 is a block diagram of a system for converting a query in a first format to a MDX query in the second format according to an embodiment of the invention. System 1800 includes query provider 1805 that provides a query in the first format which is translated to MDX query 1855. The first format includes SQL, other declarative query languages, and queries created in a declarative way in tools such as Web Intelligence from Business Objects. The second format in which the MDX query is generated includes but not limited to query languages such as AS 2000, AS 2005 and MaxL.
  • Along with the query, query provider 1805 also provides a query specification to identify a format of the query. Based on the format of the query, different parts of the query such as result objects, a filter condition and a data source may be identified. A parser 1810 in communication with query provider 1805 parses the query to determine a result object and data source 1860 and a filter condition of the query. A semantic analyzer 1815 in communication with parser 1810 analyzes the filter condition of the query to determine a semantic context such as dimensions of predicates of the filter condition and a hierarchy level in the dimension of a member set to which a measure is aggregated. The semantic analyzer 1815 groups the predicates having a same dimension into a member set.
  • A normalization engine 1820 in communication with pattern matching tool 1825 normalizes the semantically analyzed filter condition by converting the filter condition to disjunctive normal form. In an embodiment, pattern matching tool 1825 includes To One Matching (TOM) framework provided by INRIA of France. The pattern matching tool 1825 has rules for converting logical expressions to disjunctive normal form, conjunctive normal form and converting between the two. The rules are based on logical equivalences. The normalization engine 1820 has rules for replacing logical operators with normalized operators. The normalization engine 1820 in communication with pattern matching tool 1825 replaces the logical operators with the normalized operators. A query optimizer 1830 optimizes the filter condition by replacing an IAND operator with a CAND operator and creates normalized filter condition 1835. This optimization may involve repeated conversions of the filter condition to disjunctive normal form and conjunctive normal form. So the query optimizer 1830 is in communication with normalization engine 1820.
  • The normalized filter condition 1835 and result object and data source 1860 are converted to MDX query 1855 by MDX engine 1840 and translator 1850. The MDX engine 1840 converts the normalized filter condition 1835 and result object and data source 1860 to an intermediate MDX query based on MDX translation rules. Further, the intermediate format of the MDX filter is translated to MDX query 1855 by translator 1850.
  • The MDX engine 1840 identifies the normalized operators in normalized filter condition 1835 and provides them to pattern matching tool 1825 along with result object and data source 1860 to convert them to the intermediate MDX query. The pattern matching tool 1825 constructs the intermediate MDX query by
      • replacing the normalized operators in normalized filter condition 1835 with MDX operators;
      • converting predicates in normalized filter condition 1835 to translation ready predicates. The translation ready predicates are created by concatenating a phrase “translate” with a predicate in the normalized filter condition; for instance, translate (S1) where S1 is a predicate in the normalized filter condition; and
      • converting result object and data source 1860 to translation ready result object and data source in a similar way that translation ready predicates are created.
        An example of the intermediate MDX query filter condition would be “CROSSJOIN (translate (S1), translate (S2))” which is a translation of a normalized filter condition CAND (S1, S2).
  • The translator 1850 further translates the intermediate MDX query to MDX query 1855 in a query language that includes but not limited to AS 2000, AS 2005 and MaxL. The translator 1850 obtains syntax of the query language from syntax file 1845.
  • Embodiments of the invention may include various steps as set forth above. The steps may be embodied in machine-executable program code which causes a general-purpose or special-purpose processor to perform certain steps. Alternatively, these steps may be performed by specific hardware components that contain hardwired logic for performing the steps, or by any combination of programmed computer components and custom hardware components.
  • Embodiments of the present invention may also be provided as a machine-readable medium for storing the machine-executable instructions. The machine-readable medium may include, but is not limited to, flash memory, optical disks, CD-ROMs, DVD ROMs, RAMs, EPROMs, EEPROMs, magnetic or optical cards, or any other type of machine-readable media suitable for tangibly storing electronic instructions. The machine readable medium can provide the instructions stored therein to a computer system comprising a processor capable of reading and executing the instructions to implement the method steps described herein.
  • It should be appreciated that reference throughout this specification to one embodiment or an embodiment means that a particular feature, structure or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. These references are not necessarily all referring to the same embodiment. Furthermore, the particular features, structures or characteristics may be combined as suitable in one or more embodiments of the invention.
  • Throughout the foregoing description, for the purposes of explanation, numerous specific details were set forth in order to provide a thorough understanding of the invention. It will be apparent, however, to one skilled in the art that the invention may be practiced without some of these specific details. The detailed description as set forth above includes descriptions of method steps. However, one skilled in the art will understand that the order of the steps set forth above is meant for the purposes of illustration only and the claimed invention is not meant to be limited only to the specific order in which the steps are set forth. Accordingly, the scope and spirit of the invention should be judged in terms of the claims which follow.

Claims (20)

1. An article of manufacture, comprising:
a machine readable medium having instructions which when executed by a machine cause the machine to perform operations comprising:
receiving a normalized query;
parsing the normalized query to determine a result object, a data source, and a normalized filter condition;
translating the result object to a first part of a multi-dimensional expression query;
translating the normalized filter condition to a second part of the multi-dimensional expression query by translating the normalized operators and predicates in the normalized filter condition based on multi-dimensional expression translation rules;
translating the data source to a third part of the multi-dimensional expression query; and
merging the first part, the second part and the third part to form the multi-dimensional expression query.
2. The article of manufacture in claim 1, wherein translating the result object, the normalized filter condition and the data source to the multi-dimensional expression query comprises translating to a multi-dimensional expression query language selected from a group consisting of Analysis Services 2000, Analysis Services 2005, Cubing Services, Netweaver BI 7 and MaxL.
3. The article of manufacture in claim 1, wherein the result object and the data source object are expressed in a query language selected from a group consisting of: structured query language, another declarative query languages, and query created in a declarative way using a user interface.
4. The article of manufacture in claim 1, wherein a structure of the normalized filter condition is defined by a normalization graph.
5. The article of manufacture in claim 1, wherein the normalized filter condition comprises a filter condition of a database query in a disjunctive normal form.
6. The article of manufacture in claim 1, wherein translating the result object to a multi-dimensional expression query comprises converting the result object to a query axis in SELECT clause of the multi-dimensional expression query.
7. The article of manufacture in claim 1, wherein converting the normalized filter condition to the multi-dimensional expression query comprises converting the normalized filter condition to a slicer axis in the multi-dimensional expression query.
8. The article of manufacture in claim 1, wherein the multi-dimensional expression translation rules for converting the normalized filter condition to the multi-dimensional expression query comprises a rule selected from a group consisting of:
translating a CAND operator in the normalized filter condition to CROSSJOIN operator in the multi-dimensional expression query;
translating an IAND operator in the normalized filter condition to INTERSECT operator in the multi-dimensional expression query; and
translating an AND operator in the normalized filter condition to INTERSECT operator in the multi-dimensional expression query.
9. The article of manufacture in claim 1, wherein the multi-dimensional expression translation rules for converting the normalized filter condition to the multi-dimensional expression query comprises selected from a group consisting of:
translating an OR operator in the normalized filter condition to UNION operator in the multi-dimensional expression query; and
translating an UOR operator in the normalized filter condition to FROM SELECT UNION operator in the multi-dimensional expression query.
10. The article of manufacture in claim 1, wherein the multi-dimensional expression translation rules for converting the normalized filter condition to the multi-dimensional expression query comprises translating an AGGREGATIONFILTER operator in the normalized filter condition to FILTER operator in the multi-dimensional expression query.
11. A computer implemented method for translating a normalized query to a multi-dimensional database query, the method comprising:
receiving a normalized query;
parsing the normalized query to determine a result object, a data source, and a normalized filter condition in the normalized query;
translating the result object to a first part of a multi-dimensional expression query;
translating the normalized filter condition to a second part of the multi-dimensional expression query by translating the normalized operators and predicates in the normalized filter condition based on multi-dimensional expression translation rules;
translating the data source to a third part of the multi-dimensional expression query; and
merging the first part, the second part and the third part to form the multi-dimensional expression query.
12. The computer implemented method in claim 11, wherein the normalized filter condition comprises a filter condition in a disjunctive normal form.
13. The computer implemented method in claim 11, wherein the multi-dimensional expression translation rules for translating the normalized filter condition to the multi-dimensional expression query comprises selected from a group consisting of:
translating a CAND operator in the normalized filter condition to CROSSJOIN operator in the multi-dimensional expression specification;
translating an IAND operator in the normalized filter condition to INTERSECT operator in the multi-dimensional expression specification;
translating an AND operator in the normalized filter condition to INTERSECT operator in the multi-dimensional expression specification;
translating an OR operator in the normalized filter condition to UNION operator in the multi-dimensional expression specification; and translating an UOR operator in the normalized filter condition to FROM SELECT UNION operator in the multi-dimensional expression specification.
14. The computer implemented method in claim 11, wherein a structure of the normalized filter condition is defined by a normalization graph.
15. The computer implemented method in claim 11, wherein the multi-dimensional expression translation rules for translating the normalized filter condition to the multi-dimensional expression query comprises translating an AGGREGATIONFILTER operator in the normalized filter condition to FILTER operator in the multi-dimensional expression specification.
16. The computer implemented method in claim 11, wherein translating the result object, the normalized filter condition and the data source to a multi-dimensional expression query comprises translating to a multi-dimensional expression query language selected from a group consisting of Analysis Service 2000, Analysis Service 2005, and MaxL.
17. A computer system including a processor and a memory, the memory comprising instructions that are executable by the processor, the instructions comprising:
a parser to retrieve a result object, and a data source from a query;
a multi-dimensional expression engine in communication with the parser to convert the result object, the data source and a normalized filter condition of the query to an intermediate multi-dimensional expression query based on the multi-dimensional expression rules; and
a translator in communication with the multi-dimensional expression engine to translate the intermediate multi-dimensional expression query to a multi-dimensional expression query based on syntax of a multi-dimensional expression query language.
18. The system in claim 17 further comprising a syntax file that provides syntax of a multi-dimensional query language to the translator.
19. The system in claim 17 further comprising a pattern matching tool in communication with the multi-dimensional expression engine to convert the result object, the data source and predicates in the normalized filter condition to a translation ready result object, translation ready data source and translation ready predicates that form a part of the intermediate multi-dimensional expression query.
20. The system in claim 17 further comprising a normalization engine in communication with a pattern matching tool to provide the normalized filter condition of the query.
US12/266,570 2008-11-07 2008-11-07 Converting a database query to a multi-dimensional expression query Abandoned US20100121868A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US12/266,570 US20100121868A1 (en) 2008-11-07 2008-11-07 Converting a database query to a multi-dimensional expression query

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US12/266,570 US20100121868A1 (en) 2008-11-07 2008-11-07 Converting a database query to a multi-dimensional expression query

Publications (1)

Publication Number Publication Date
US20100121868A1 true US20100121868A1 (en) 2010-05-13

Family

ID=42166154

Family Applications (1)

Application Number Title Priority Date Filing Date
US12/266,570 Abandoned US20100121868A1 (en) 2008-11-07 2008-11-07 Converting a database query to a multi-dimensional expression query

Country Status (1)

Country Link
US (1) US20100121868A1 (en)

Cited By (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100153409A1 (en) * 2008-12-12 2010-06-17 Joshi Mahesh V Computer-Implemented Systems And Methods For Providing Paginated Search Results From A Database
US20110119281A1 (en) * 2009-11-18 2011-05-19 Battelle Memorial Institute Methods for Discovering Analyst-Significant Portions of a Multi-Dimensional Database
US20110282881A1 (en) * 2010-05-13 2011-11-17 Salesforce.Com, Inc. Methods and systems for determining candidates for a custom index in a multi-tenant database environment
US8468177B2 (en) * 2009-09-29 2013-06-18 Zap Holdings Limited Content based approach to extending the form and function of a business intelligence system
CN103294699A (en) * 2012-02-24 2013-09-11 联想(北京)有限公司 Method and electronic equipment for screening object
US20130262443A1 (en) * 2012-03-30 2013-10-03 Khalifa University of Science, Technology, and Research Method and system for processing data queries
US20140032588A1 (en) * 2012-07-29 2014-01-30 Sergiy GETMANETS Systems And Methods For Providing A Simplified Application Programming Interface For Converting From Two-Dimensional Query Languages Into Multi-Dimensional Query Languages To Query Multi-Dimensional Data Sources And MDX Servers
US20140280279A1 (en) * 2013-03-15 2014-09-18 International Business Machines Corporation Minimizing result set size when converting from asymmetric to symmetric requests
US20150278347A1 (en) * 2014-03-31 2015-10-01 Petra Meyer Evaluation of variant configuration using in-memory technology
US9183272B1 (en) 2013-11-06 2015-11-10 Dell Software Inc. System and method for accessing dimensional databases
US9208209B1 (en) 2014-10-02 2015-12-08 Sas Institute Inc. Techniques for monitoring transformation techniques using control charts
US20160098454A1 (en) * 2014-10-07 2016-04-07 Unisys Corporation Handling of analytic queries
US9396287B1 (en) * 2011-10-05 2016-07-19 Cumulus Systems, Inc. System for organizing and fast searching of massive amounts of data
US9418101B2 (en) 2012-09-12 2016-08-16 International Business Machines Corporation Query optimization
US9418339B1 (en) 2015-01-26 2016-08-16 Sas Institute, Inc. Systems and methods for time series analysis techniques utilizing count data sets
US20160239582A1 (en) * 2015-02-18 2016-08-18 Ab Initio Technology Llc Querying a data source on a network
US20170262501A1 (en) * 2016-03-10 2017-09-14 Ricoh Co., Ltd. Open Query Language
US9886483B1 (en) * 2010-04-29 2018-02-06 Quest Software Inc. System for providing structured query language access to non-relational data stores
US9886474B2 (en) 2011-11-22 2018-02-06 Microsoft Technology Licensing, Llc Multidimensional grouping operators
US9892370B2 (en) 2014-06-12 2018-02-13 Sas Institute Inc. Systems and methods for resolving over multiple hierarchies
US9934259B2 (en) 2013-08-15 2018-04-03 Sas Institute Inc. In-memory time series database and processing in a distributed environment
US10073871B2 (en) 2015-11-09 2018-09-11 International Business Machines Corporation Database entity analysis
US10108648B2 (en) 2011-07-13 2018-10-23 Salesforce.Com, Inc. Creating a custom index in a multi-tenant database environment
US10157234B1 (en) 2015-02-27 2018-12-18 Quest Software Inc. Systems and methods for transforming datasets
US10169720B2 (en) 2014-04-17 2019-01-01 Sas Institute Inc. Systems and methods for machine learning using classifying, clustering, and grouping time series data
US10185755B2 (en) * 2015-12-28 2019-01-22 Business Objects Software Limited Orchestration of data query processing in a database system
US10282181B2 (en) 2013-12-06 2019-05-07 Ab Initio Technology Llc Source code translation
US10417611B2 (en) 2010-05-18 2019-09-17 Salesforce.Com, Inc. Methods and systems for providing multiple column custom indexes in a multi-tenant database environment
US10437819B2 (en) 2014-11-14 2019-10-08 Ab Initio Technology Llc Processing queries containing a union-type operation
US10521427B2 (en) 2011-05-02 2019-12-31 Ab Initio Technology Llc Managing data queries
US10560313B2 (en) 2018-06-26 2020-02-11 Sas Institute Inc. Pipeline system for time-series data forecasting
US10685283B2 (en) 2018-06-26 2020-06-16 Sas Institute Inc. Demand classification based pipeline system for time-series data forecasting
US10762099B2 (en) * 2016-06-07 2020-09-01 International Business Machines Corporation Syntactical transformation of database interaction statements
US11093223B2 (en) 2019-07-18 2021-08-17 Ab Initio Technology Llc Automatically converting a program written in a procedural programming language into a dataflow graph and related systems and methods
US20220058169A1 (en) * 2020-08-24 2022-02-24 Sap Se Automated feature engineering for multidimensional data
US11593369B2 (en) 2010-01-15 2023-02-28 Ab Initio Technology Llc Managing data queries

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050010565A1 (en) * 2003-05-27 2005-01-13 David Cushing System and method of transforming queries based upon E/R schema into multi-dimensional expression queries
US20070208721A1 (en) * 2003-12-01 2007-09-06 Zaman Kazi A Modeling multidimensional data sources
US20090100014A1 (en) * 2007-10-10 2009-04-16 Bugra Gedik Methods and Apparatus for Adaptive Source Filtering and Load Shedding for Data Stream Processing
US20090249125A1 (en) * 2008-04-01 2009-10-01 Microsoft Corporation Database querying

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050010565A1 (en) * 2003-05-27 2005-01-13 David Cushing System and method of transforming queries based upon E/R schema into multi-dimensional expression queries
US7945575B2 (en) * 2003-05-27 2011-05-17 International Business Machines Corporation Method of transforming queries based upon E/R schema into multi-dimensional expression queries
US20070208721A1 (en) * 2003-12-01 2007-09-06 Zaman Kazi A Modeling multidimensional data sources
US20090100014A1 (en) * 2007-10-10 2009-04-16 Bugra Gedik Methods and Apparatus for Adaptive Source Filtering and Load Shedding for Data Stream Processing
US20090249125A1 (en) * 2008-04-01 2009-10-01 Microsoft Corporation Database querying

Cited By (55)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8489622B2 (en) * 2008-12-12 2013-07-16 Sas Institute Inc. Computer-implemented systems and methods for providing paginated search results from a database
US20100153409A1 (en) * 2008-12-12 2010-06-17 Joshi Mahesh V Computer-Implemented Systems And Methods For Providing Paginated Search Results From A Database
US8468177B2 (en) * 2009-09-29 2013-06-18 Zap Holdings Limited Content based approach to extending the form and function of a business intelligence system
US20110119281A1 (en) * 2009-11-18 2011-05-19 Battelle Memorial Institute Methods for Discovering Analyst-Significant Portions of a Multi-Dimensional Database
US11593369B2 (en) 2010-01-15 2023-02-28 Ab Initio Technology Llc Managing data queries
US9886483B1 (en) * 2010-04-29 2018-02-06 Quest Software Inc. System for providing structured query language access to non-relational data stores
US20110282881A1 (en) * 2010-05-13 2011-11-17 Salesforce.Com, Inc. Methods and systems for determining candidates for a custom index in a multi-tenant database environment
US8583653B2 (en) * 2010-05-13 2013-11-12 Salesforce.Com, Inc. Methods and systems for determining candidates for a custom index in a multi-tenant database environment
US10417611B2 (en) 2010-05-18 2019-09-17 Salesforce.Com, Inc. Methods and systems for providing multiple column custom indexes in a multi-tenant database environment
US10521427B2 (en) 2011-05-02 2019-12-31 Ab Initio Technology Llc Managing data queries
US10108648B2 (en) 2011-07-13 2018-10-23 Salesforce.Com, Inc. Creating a custom index in a multi-tenant database environment
US9479385B1 (en) 2011-10-05 2016-10-25 Cumulus Systems, Inc. System for organizing and fast searching of massive amounts of data
US9396287B1 (en) * 2011-10-05 2016-07-19 Cumulus Systems, Inc. System for organizing and fast searching of massive amounts of data
US9886474B2 (en) 2011-11-22 2018-02-06 Microsoft Technology Licensing, Llc Multidimensional grouping operators
CN103294699A (en) * 2012-02-24 2013-09-11 联想(北京)有限公司 Method and electronic equipment for screening object
US9639575B2 (en) * 2012-03-30 2017-05-02 Khalifa University Of Science, Technology And Research Method and system for processing data queries
US20130262443A1 (en) * 2012-03-30 2013-10-03 Khalifa University of Science, Technology, and Research Method and system for processing data queries
US20140032588A1 (en) * 2012-07-29 2014-01-30 Sergiy GETMANETS Systems And Methods For Providing A Simplified Application Programming Interface For Converting From Two-Dimensional Query Languages Into Multi-Dimensional Query Languages To Query Multi-Dimensional Data Sources And MDX Servers
US9355143B2 (en) * 2012-07-29 2016-05-31 Sergiy GETMANETS Systems and methods for providing a simplified application programming interface for converting from two-dimensional query languages into multi-dimensional query languages to query multi-dimensional data sources and MDX servers
US9418101B2 (en) 2012-09-12 2016-08-16 International Business Machines Corporation Query optimization
US20140280279A1 (en) * 2013-03-15 2014-09-18 International Business Machines Corporation Minimizing result set size when converting from asymmetric to symmetric requests
US9275111B2 (en) * 2013-03-15 2016-03-01 International Business Machines Corporation Minimizing result set size when converting from asymmetric to symmetric requests
US9524318B2 (en) 2013-03-15 2016-12-20 International Business Machines Corporation Minimizing result set size when converting from asymmetric to symmetric requests
US9934259B2 (en) 2013-08-15 2018-04-03 Sas Institute Inc. In-memory time series database and processing in a distributed environment
US9495437B1 (en) 2013-11-06 2016-11-15 Dell Software Inc. System and method for accessing dimensional databases
US9183272B1 (en) 2013-11-06 2015-11-10 Dell Software Inc. System and method for accessing dimensional databases
US10282181B2 (en) 2013-12-06 2019-05-07 Ab Initio Technology Llc Source code translation
US11106440B2 (en) 2013-12-06 2021-08-31 Ab Initio Technology Llc Source code translation
US10289396B2 (en) 2013-12-06 2019-05-14 Ab Initio Technology Llc Source code translation
US20150278347A1 (en) * 2014-03-31 2015-10-01 Petra Meyer Evaluation of variant configuration using in-memory technology
US9442968B2 (en) * 2014-03-31 2016-09-13 Sap Se Evaluation of variant configuration using in-memory technology
US10474968B2 (en) 2014-04-17 2019-11-12 Sas Institute Inc. Improving accuracy of predictions using seasonal relationships of time series data
US10169720B2 (en) 2014-04-17 2019-01-01 Sas Institute Inc. Systems and methods for machine learning using classifying, clustering, and grouping time series data
US9892370B2 (en) 2014-06-12 2018-02-13 Sas Institute Inc. Systems and methods for resolving over multiple hierarchies
US9208209B1 (en) 2014-10-02 2015-12-08 Sas Institute Inc. Techniques for monitoring transformation techniques using control charts
US9727612B2 (en) * 2014-10-07 2017-08-08 Unisys Corporation Handling of analytic queries
US20160098454A1 (en) * 2014-10-07 2016-04-07 Unisys Corporation Handling of analytic queries
US10437819B2 (en) 2014-11-14 2019-10-08 Ab Initio Technology Llc Processing queries containing a union-type operation
US9418339B1 (en) 2015-01-26 2016-08-16 Sas Institute, Inc. Systems and methods for time series analysis techniques utilizing count data sets
US10417281B2 (en) * 2015-02-18 2019-09-17 Ab Initio Technology Llc Querying a data source on a network
US11308161B2 (en) * 2015-02-18 2022-04-19 Ab Initio Technology Llc Querying a data source on a network
US20160239582A1 (en) * 2015-02-18 2016-08-18 Ab Initio Technology Llc Querying a data source on a network
US10157234B1 (en) 2015-02-27 2018-12-18 Quest Software Inc. Systems and methods for transforming datasets
US10901963B2 (en) 2015-11-09 2021-01-26 International Business Machines Corporation Database entity analysis
US10073871B2 (en) 2015-11-09 2018-09-11 International Business Machines Corporation Database entity analysis
US10185755B2 (en) * 2015-12-28 2019-01-22 Business Objects Software Limited Orchestration of data query processing in a database system
US10824621B2 (en) * 2016-03-10 2020-11-03 Ricoh Co., Ltd. Open query language
US20170262501A1 (en) * 2016-03-10 2017-09-14 Ricoh Co., Ltd. Open Query Language
US10762099B2 (en) * 2016-06-07 2020-09-01 International Business Machines Corporation Syntactical transformation of database interaction statements
US10891300B2 (en) * 2016-06-07 2021-01-12 International Business Machines Corporation Syntactical transformation of database interaction statements
US10685283B2 (en) 2018-06-26 2020-06-16 Sas Institute Inc. Demand classification based pipeline system for time-series data forecasting
US10560313B2 (en) 2018-06-26 2020-02-11 Sas Institute Inc. Pipeline system for time-series data forecasting
US11093223B2 (en) 2019-07-18 2021-08-17 Ab Initio Technology Llc Automatically converting a program written in a procedural programming language into a dataflow graph and related systems and methods
US20220058169A1 (en) * 2020-08-24 2022-02-24 Sap Se Automated feature engineering for multidimensional data
US11741069B2 (en) * 2020-08-24 2023-08-29 Sap Se Automated feature engineering for multidimensional data

Similar Documents

Publication Publication Date Title
US20100121868A1 (en) Converting a database query to a multi-dimensional expression query
EP2184689B1 (en) Normalizing a filter condition of a database query
US11755575B2 (en) Processing database queries using format conversion
US7464083B2 (en) Combining multi-dimensional data sources using database operations
US7716167B2 (en) System and method for automatically building an OLAP model in a relational database
Sakr et al. A framework for querying graph-based business process models
US7945575B2 (en) Method of transforming queries based upon E/R schema into multi-dimensional expression queries
US9563662B2 (en) Detecting and processing cache hits for queries with aggregates
US8868544B2 (en) Using relational structures to create and support a cube within a relational database system
US20100017395A1 (en) Apparatus and methods for transforming relational queries into multi-dimensional queries
US20070061287A1 (en) Method, apparatus and program storage device for optimizing a data warehouse model and operation
EP2601600B1 (en) Incremental maintenance of immediate materialized views with outerjoins
US20110208690A1 (en) Viewing an olap cube as a virtual data source
US20050102613A1 (en) Generating a hierarchical plain-text execution plan from a database query
US20050120001A1 (en) SQL structure analyzer
US20050108209A1 (en) Context quantifier transformation in XML query rewrite
US20160232207A1 (en) Hierarchy modeling and query
CN110555035A (en) Method and device for optimizing query statement
US20060161525A1 (en) Method and system for supporting structured aggregation operations on semi-structured data
Dombrovskaya et al. PostgreSQL Query Optimization
Lin et al. On detecting cherry-picked generalizations
Guo et al. Multi-model query languages: taming the variety of big data
Kilias et al. INDREX: In-database relation extraction
Huang et al. Calibration: A Simple Trick for Wide-table Delta Analytics
CN113590610B (en) Blood relationship expression method based on Elastic Search

Legal Events

Date Code Title Description
AS Assignment

Owner name: BUSINESS OBJECTS S.A.,FRANCE

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:LE BIANNIC, YANN;REEL/FRAME:023425/0396

Effective date: 20081106

AS Assignment

Owner name: BUSINESS OBJECTS S.A.,FRANCE

Free format text: CORRECTIVE ASSIGNMENT TO CORRECT THE LIST OF INVENTORS PREVIOUSLY RECORDED ON REEL 023425 FRAME 0396. ASSIGNOR(S) HEREBY CONFIRMS THE ...WE, YANN LE BIANNIC AND ERIC GOUTHIERE, HEREBY ASSIGN TO ...;ASSIGNORS:LE BIANNIC, YANN;GOUTHIERE, ERIC;REEL/FRAME:023833/0167

Effective date: 20081106

AS Assignment

Owner name: SAP FRANCE S.A., FRANCE

Free format text: CHANGE OF NAME;ASSIGNOR:SAP FRANCE S.A.;REEL/FRAME:028931/0284

Effective date: 20091221

AS Assignment

Owner name: SAP FRANCE S.A., FRANCE

Free format text: CHANGE OF NAME;ASSIGNOR:BUSINESS OBJECTS S.A.;REEL/FRAME:030875/0207

Effective date: 20091221

STCB Information on status: application discontinuation

Free format text: ABANDONED -- AFTER EXAMINER'S ANSWER OR BOARD OF APPEALS DECISION