FIELD OF THE INVENTION
The present invention relates to business intelligence systems.
BACKGROUND OF THE INVENTION
Enterprises implement Business Intelligence (BI) technology to improve access to the enterprise's data sources in order to, for example, create summaries, presentations, look for trends, patterns, associations, provide aggregations, and apply multi-dimensional analysis, among other things. Sophisticated BI products such as Brio™, Cognos™ and others allow enterprises to have access to all data stored in all of the enterprises database packages, (e.g. accounts package, stock control database package, sales database, etc) in order to draw on all the available data across the enterprise. These types of sophisticated BI systems, therefore, attempt to make all the enterprises data available for the production of meaningful information by users to enable an enterprise to improve its efficiency.
Nevertheless, although Business Intelligence systems are a great improvement and do allow users to present intelligent information, drawn from a firm's entire database, in easily digestible format, they go no further than this. What tends to occur over time is that many users across an enterprise utilising a BI system produce many reports, queries, analytical documents, spreadsheets, presentations and other products enabled by the BI system so that, after a while, there may be many thousands of such BI artefacts available. These artefacts essentially embody an enterprise's “knowledge”, which can be considered as a combination of the data that the enterprise has available and the information added by the user of the BI tool to produce artefacts (i.e. the user's knowledge). This knowledge is not readily accessible. It is locked away in what may be thousands of BI artefacts.
For example, if a user of a BI system produces a report from the firm's database utilising the BI tools, in order to make that report meaningful, they may need to add information or change information. They may have to provide meaningful list names for a report, for example, “birthdays”, “expiry dates”, “transaction dates”, etc. Titles of the data as stored in the database may be fairly meaningless (usually they are technical terms which have been chosen by an enterprises IT department, and they can be quite cryptic). Users, therefore, effectively add their own knowledge to the firm's data when they utilise the BI system. This knowledge remains “locked up” in the particular BI axtefact which has been produced. Because many users are using the BI system, much knowledge becomes locked up in these disparate fragments.
Users have no proper access to this knowledge. This often results in repetition. Two or more users may design a very similar BI artefact because they will not be aware that the same or similar artefact has in fact been prepared before. Users from different departments of an enterprise may design a report which effectively uses the same data, but which include different titles, because the users perspectives are different. In other words, many users may be accessing the same data for the same ends, but this cannot be ascertained from the end appearance of the BI artefacts.
There are often in BI enabled enterprises many analysts applying their own knowledge and adding meaning to raw data that they are analysing and presenting to their superiors for business decisions. A problem emerges that there is now no authoritative source of this knowledge. Effort is duplicated, time is wasted, information may be mis-categorised, conflicting results generated, opportunities are missed and wrong decisions may be made.
Businesses attempt to at least partly address this problem by implementing solutions such as building data dictionaries, reviewing and renaming columns and tables in an enterprises databases for consistency and to reflect the user's perspective. Such solutions are very expensive and are often not completed because of the difficulty and expense in implementing them. They usually require IT experts to work from the “bottom-up” analysing the firm's available data and trying to make sense of it, consulting with firm's management, and then implementing changes. The metadata created often bears no relation to the actual use of the data in the enterprise, because no one implementing the solution really knows how the data is used across the organisation. Such projects often grind to an expensive halt, well before completion.
Another problem relates to the effect of making a change to the enterprise's IT systems (e.g. an upgrade of hardware or software, legislative changes requiring a change to the IT systems).
Which BI artefacts are going to be affected by the changes? Which departments using the BI artefacts are going to be affected by the changes? Which BI artefacts need to be addressed in order to make allowance for the changes? Finding out which BI artefacts are affected and implementing changes is a very difficult time-consuming and expensive task.
Further, in enterprises which implement BI systems successfully, increased use of the system eventually leads to capacity problems. To address overuse, a firm may decide to add more mission critical hardware. This is the simplest solution, but it is an expensive one and only addresses a single bottleneck. Further, if an analysis were made of system usage, it would in all likelihood be found that the systems are not being used efficiently. In many cases addition of more expensive hardware would be avoidable by optimising use of the systems. Optimisation can include such items as reducing or eliminating redundant documents and adding data mart and cubes. This is a further time-consuming process (and therefore also expensive), particularly when there may be many BI artefacts to analyse. The simplest solution, therefore, is often just to add more hardware, when the more effective solution would in fact be to rationalise and optimise the system.
SUMMARY OF THE INVENTION
The present invention provides a method of obtaining knowledge about an enterprise's data, comprising the steps of analysing business intelligence artefacts produced by users of an enterprise's business intelligence system, producing metadata based on the analysis, and making the metadata available to provide information about the enterprise's data.
Preferably, the metadata is made available to users so that they can query the data to find out information about the enterprise's data. Preferably, business users are able to make optimal use of their business intelligence system and technical users so that they can manage the business intelligence system better.
Business Intelligence artefacts include any artefacts of an enterprise produced from data available to the enterprise in order to provide meaningful information to the enterprise, and it includes any query, analytical document, chart, spreadsheet, presentation, and more.
Preferably, Business Intelligence artefacts are also produced by a Business Intelligence system, but the present invention is not limited to BI artefacts in the narrow sense of the term (where a BI system is implemented). Business Intelligence artifacts are produced by enterprises which do not have BI systems, and the present invention may be applied in enterprises which do not have such systems.
Preferably, the Business Intelligence artefacts are in electronic form.
Preferably, the artefacts have some structure to them. That is, they may have columns with titles or tables with titles. They are preferably not unstructured documents such as word processing systems documents which merely contain only unstructured text.
In the present invention, therefore, Business Intelligence artefacts, such as reports, documents, analyses, presentations, which are produced by the users of a Business Intelligence system (or produced by an enterprise which does not operate a BI system) are analysed to produce metadata (knowledge about Business Intelligence artefacts), and this metadata is made available for users to query to provide information. Essentially, this enables access to the “knowledge” of the enterprise embodied in the Business Intelligence artefacts. The metadata is preferably made available in a structured and therefore queryable form. Rather than working from the “bottom-up” from the enterprise's database (as present attempts to overcome this problem do), the system of the present invention accesses the knowledge of the users of the BI system and provides data about that as well as about the data stored in the enterprise's database.
Preferably, the step of analysing the business intelligence artefacts comprises, for each artefact, the step of determining attributes of the artefact according to a list of attributes. Preferably, the list of attributes is commonly applied to each of the artefacts. Preferably, each artefact is analysed in accordance with an attribute template. Preferably, the application of the common template provides a frame of reference to enable functions such as a matching function, to determine similarity of artefacts, preferably based on the artefacts characteristics.
Preferably, the method includes the step of preparing and storing attribute data relating to the attributes of the artefacts, which have been determined by the analysis process.
The attribute data may include attribute structure and attribute values (where the values imply business rules).
The attribute data preferably includes data on operational characteristics of the artefact. For example, the data may include the identity of the person that formulated the artefact, the identity of the user of the artefact, the time that the artefact was used, the time it took to produce results from the use of the artefact and the number of results which were produced by use of the artefact. When such characteristics are stored for all the artefacts in an enterprise queries can be implemented such as “which artefacts does user X use”? “Which artefacts take up a lot of system time?” “Which artefacts take up a lot of system space?”
Other characteristics of the artefacts may also be included in the attribute data. For example, the attribute data may also include information on the type of analysis applied by the artefact and data on the information within the scope of the artefact, Such data items can be used to locate artefacts which, for example, relate to the same subject matter.
Preferably, the attribute data includes database data including information identifying database tables accessed by application of the artefact. This enables identification of the parts of the enterprises' database utilised by particular artefacts. This information can be applied to rationalise a company's IT systems and also to assist in steering a process of upgrading or changing a company's IT system (the system can preferably identify which artefacts are likely to be affected by the upgrade or change).
Preferably, the attribute data includes business item data, which includes information on any business item associated with an artefact. Users of a BI system add meaning to their artefacts by for example, renaming database columns into business terms. Or they may create virtual columns by defining formulae that optionally use real database columns. Preferably the present invention identifies and stores this business item data, so that, for example, searches of the, artefacts can be implemented utilising business terms or business rules that are formulae.
Business item data may include table names, column names, renamed items, titles, access names, among others.
The method of the present invention also preferably includes the step of querying the metadata. The metadata may be queried to determine a match between artefact attribute data input by a user and attribute data associated with any stored artefacts. The match query may determine the degree of the match. This can enable the user to, for example, find any similar or same artefacts in the enterprise.
As discussed above, the step of querying the metadata may also enable a determination of how much of an enterprise's database is utilised by a particular artefact and what parts of the enterprise's database are utilised by a particular artefact.
Often, when analysts are utilising BI systems they may wish to add annotations to their observations on a particular artefact or artefacts e.g. an observation on a particular inconsistency in data. The method of the present invention preferably includes the step of allowing users to annotate the stored metadata with observations relating to the artefacts. This effectively becomes “new” knowledge which was not originally part of the Business Intelligence pool, but which is elicited from users of the system and stored with the metadata associated with the artefact.
The present invention further provides a system for obtaining knowledge about an, enterprise's data, comprising a harvester means for analysing business intelligence artefacts produced by users of an enterprise's business intelligence system and producing metadata based on that analysis.
Preferably the system of this aspect of the invention may include means for applying any or all of the method or steps discussed above.