WO2007005730B1 - System and method of making unstructured data available to structured data analysis tools - Google Patents

System and method of making unstructured data available to structured data analysis tools

Info

Publication number
WO2007005730B1
WO2007005730B1 PCT/US2006/025811 US2006025811W WO2007005730B1 WO 2007005730 B1 WO2007005730 B1 WO 2007005730B1 US 2006025811 W US2006025811 W US 2006025811W WO 2007005730 B1 WO2007005730 B1 WO 2007005730B1
Authority
WO
WIPO (PCT)
Prior art keywords
data
unstructured
tools
schema
source
Prior art date
Application number
PCT/US2006/025811
Other languages
French (fr)
Other versions
WO2007005730A2 (en
WO2007005730A3 (en
Inventor
Justin Langseth
Nithi Vivitrat
Gene Sohn
Original Assignee
Clarabridge Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Clarabridge Inc filed Critical Clarabridge Inc
Priority to EP06774414.4A priority Critical patent/EP1899855B1/en
Publication of WO2007005730A2 publication Critical patent/WO2007005730A2/en
Publication of WO2007005730A3 publication Critical patent/WO2007005730A3/en
Publication of WO2007005730B1 publication Critical patent/WO2007005730B1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/254Extract, transform and load [ETL] procedures, e.g. ETL data flows in data warehouses
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/258Data format conversion from or to a database
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/912Applications of a database

Abstract

A system and method of making unstructured data available to structured data analysis tools. The system includes middleware software that can be used in combination with structured data tools to perform analysis on both structured and unstructured data. Data can be read from a wide variety of unstructured sources. The data may then be transformed with commercial data transformation products that may, for example, extract individual pieces of data and determine relationships between the extracted data. The transformed data and relationships may then be passed through an extraction/transform/load (ETL) layer and placed in a structured schema. The structured schema may then be made available to commercial or proprietary structured data analysis tools.

Claims

AMENDED CLAIMS received by the International Bureau on 16 April 2007 (16.-04.2007)
1. A method of making unstructured data available to structured data tools comprising; accessing a source of unstructured data via extraction connectors; extracting the unstructured data;
writing the extracted unstructured data to a capture schema; sending the extracted mistructured data from the capture schema to a transformation. tool via transformation connectors; transforming the extracted unstructured data -with the tπansf oπnarion, tool; writing the transformed, extracted iinstructured data to the capture schema; processing data in the capture schema "with an extraction/transform/load layer, writing the processed data from the extraction/transf orm/Ioad layer in an analysis schema; and providing data connectors that allow structured data tools to access the analysis schema.
2. The method of claim 1 , wherein extracting includes parsing the unstructured data and associating the data source -with the parsed unstructured data.
3. The method of claim 1, -wherein the transformation tool extracts individual pieces of data and performs text and/ or data processing activities.
4. The method of claim 3, -wherein the text and data processing activities include: determining the topic of a section of text, extracting a section of text from a whole document, matching names, or matching addresses.
5. The method of claim 1, -wherein the structured data tools include business intelligence, statistical analysis, data visualization and mapping, or data mining.
6. The method of claim 1, wherein the source of unstructured data includes email, ward processing documents, spreadsheets, presentation materials, PDF files, "web pages, news/media reports, case files, transcriptions, file servers, web servers, enterprise content, enterprise search tool repositories, intranets, knowledge management systems, or document management systems.
7. The method of claim 1, -wherein the transformadon tool includes: (J) entity, concept and relationship tagging and extraction tools, (ϋ) categorization and topic extraction tools, (iii) data matching tools or, (iv) custom transformations.
8. A system for making unstructured data available to structured data took comprising: a core server, pne or more processors on the core server, a tangible medium on the core server containing instructions that -when executed by the one or more processors perf orms a method comprising; writing the extracted unstructured data to a capture schema; sending the extracted unstructured data from the capture schema to a transformation tool via transformation cormectois; transforming the extracted unstructured data with the transformation tool; writing the transformed, extracted unstructured data to the capture schema; processing data in the capture schema with an extraction/transform/load layer; -writing the processed data from the extraction/transform /load layer in an analysis schema; and providing data connectors that allow structured data tools to access the analysis schema.
9. The system of claim 8, wherein the code to extract includes code to parse die unstructured data and associate the data source -with the parsed unstructured data.
10. The system of claim 8, wherein the transformation tool extracts individual pieces of data and performs text and/or data processing activities.
11. The system of claim 10, wherein die text and data processing activities include; determining the topic of a section of text, extracting a section of text from a -whole document, matching names, or matching addresses.
12. The system of claim 8, wherein the structured data tools include business intelligence, statistical analysis, data visualization and mapping, or data mining.
13. The system of claim 8, wherein the source of unstructured, data includes email, word processing documents, spreadsheets, presentation materials, PDF files, web pages, news/media reports, case files, transcriptions, file servers, web servers, enterprise content, enterprise search tool repositories, intranets, knowledge management systems, or document management systems.
14. The system of claim 8, wherein the transformation tool includes: (i) entity, concept and relationship tagging and extraction tools, (iϊ) categorization and topic extraction, tools, (UL) data matching tools or, (rv) custom transformations.
15. A system for extracting unstructured data from a plurality of unstructured data sources and a plurality of formats comprising: a core server, one or more processors on the core server, a tangible medium on the core server containing instructions that when executed by the one or more processors operates a plurality of APIs to interface with the plurality of unstructured data sources and a single internal API that interfaces with a plurality of software components that allow structured data tools to operate on unstructured data.
16. The extraction service of claim 15, wherein the plurality of unstructured data sources includes email, word processing documents, spreadsheets, presentation materials, PDF files, web pages, news/media reports, case files, transcriptions, file servers, web servers, enterprise content, enterprise search tool repositories, intranets, knowledge management systems, and document management systems.
17. A transformation connector comprising: a core server, one or more processors on the core server, a tangible medium on the core server containing instructions that when executed by the one or more processors performs a method comprising: understanding the format of data provided by a transformation tool; and converting the data provided by a transformation tool to a data format that maps to a data capture schema, the data, capture schema comprising; a table to store data extracted from a plurality of source documents having unstructured data; and. a table to stone inf oπnation about the extracted data, and wherein the plurality of documents are assigned a unique key that identifies the document throughout a software system allowing (i) cross-analysis, (H) linking of results for further analysis, (iii) drill-down from analytical reports back to the source documents or (iv) drill-down from analytical reports back to transf ormation information stored in the schema.
18. The transformation connector of claim 17, wherein the transformation tool includes: (i) entity, concept and relationship tagging and extraction tools, (ii) categorization and topic extraction tools, (iii) data matching tools or, (iv) custom transformations.
19. The transformation connector of claim 17, wherein, the code to convert the data comprises at least one XSL transform.
20. A core server comprising code to allow parallel processing of unstructured data on a continuous real-time basis, wherein: the code is adapted to configure unstructured source extractors arid treat them as black boxes in a data workflow; the code is adapted to extract unstructured text from a plurality of data sources and source systems, the extracted unstructured text available for input for further processing; the code is adapted to configure end-to-end data flow from the plurality of data sources, through a capture schema, through one or more transformation components. back to the capture schema, through an extraction/transf orm/load layer, and into an analysis schema for analysis by structured data analysis tools; the code is adapted to retain a single key for each data source, the key being associated with data generated by the transformation components; and the code is adapted to store all extracted unstructured text, metadata and transformation data in a capture schema.
21. The core server of claim 20, wherein the code is adapted, to use a drag and drop data editor.
22. A structured data connector system that allows structured data analysis tools to analyze data in an analysis schema comprising; a core server, one or more processors on the cone server, a tangible medium on the core server containing instructions that when executed by the one or more processors performs a method comprising: providing ODBC code; providing JDBC code; and pre-populating metadata of die structured data analysis tools -with tables, columns, attributes, data and metrics from an analysis schema without performing tool customization or application specific setup,
23. The structured data connector of claim 22, further comprising pre-built reports, graphs and dashboards,
24. The structured data connector of claim 22, further comprising embedded hyperlinks that allow drill -through to underlying data sources.
25. The structured data connector of claim 24, "wherein the hyperlinks include a document ID, entity ID, or relationship ID from the analysis schema,
26. The structured data connector of claim 25, further comprising a source highlighter, the source highlighter adapted to access a capture schema and retrieve a source document or a section of a source document.
27. The structured data connector of claim 26, -wherein the source highlighter is adapted to retrieve start and end character positions, and to scroll down and highlight a relevant sentence in a retrieved source document or in a retrieved section of a source document.
PCT/US2006/025811 2005-07-05 2006-06-30 System and method of making unstructured data available to structured data analysis tools WO2007005730A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP06774414.4A EP1899855B1 (en) 2005-07-05 2006-06-30 System and method of making unstructured data available to structured data analysis tools

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/172,955 US7849048B2 (en) 2005-07-05 2005-07-05 System and method of making unstructured data available to structured data analysis tools
US11/172,955 2005-07-05

Publications (3)

Publication Number Publication Date
WO2007005730A2 WO2007005730A2 (en) 2007-01-11
WO2007005730A3 WO2007005730A3 (en) 2007-04-05
WO2007005730B1 true WO2007005730B1 (en) 2007-06-07

Family

ID=37605090

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2006/025811 WO2007005730A2 (en) 2005-07-05 2006-06-30 System and method of making unstructured data available to structured data analysis tools

Country Status (3)

Country Link
US (2) US7849048B2 (en)
EP (1) EP1899855B1 (en)
WO (1) WO2007005730A2 (en)

Families Citing this family (239)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8396859B2 (en) * 2000-06-26 2013-03-12 Oracle International Corporation Subject matter context search engine
US7441246B2 (en) * 2004-03-19 2008-10-21 Microsoft Corporation Configurable collection of computer related metric data
US7536634B2 (en) * 2005-06-13 2009-05-19 Silver Creek Systems, Inc. Frame-slot architecture for data conversion
US20060112123A1 (en) * 2004-11-24 2006-05-25 Macnica, Inc. Spreadsheet user-interfaced business data visualization and publishing system
US7849049B2 (en) 2005-07-05 2010-12-07 Clarabridge, Inc. Schema and ETL tools for structured and unstructured data
US7490108B2 (en) * 2005-10-19 2009-02-10 Hewlett-Packard Development Company, L.P. Data consolidation
US7523135B2 (en) * 2005-10-20 2009-04-21 International Business Machines Corporation Risk and compliance framework
US7620642B2 (en) * 2005-12-13 2009-11-17 Sap Ag Mapping data structures
US8407585B2 (en) * 2006-04-19 2013-03-26 Apple Inc. Context-aware content conversion and interpretation-specific views
US8140464B2 (en) * 2006-04-28 2012-03-20 Battelle Memorial Institute Hypothesis analysis methods, hypothesis analysis devices, and articles of manufacture
WO2007150005A2 (en) 2006-06-22 2007-12-27 Multimodal Technologies, Inc. Automatic decision support
US20080065671A1 (en) * 2006-09-07 2008-03-13 Xerox Corporation Methods and apparatuses for detecting and labeling organizational tables in a document
US20080091423A1 (en) * 2006-10-13 2008-04-17 Shourya Roy Generation of domain models from noisy transcriptions
WO2008054948A1 (en) * 2006-10-31 2008-05-08 Nielsen Media Research, Inc. Methods and systems to retrieve information from data sources
US8032566B2 (en) * 2006-12-04 2011-10-04 Teradata Us, Inc. Tools for defining and using custom analysis modules
WO2009009192A2 (en) * 2007-04-18 2009-01-15 Aumni Data, Inc. Adaptive archive data management
JP4395176B2 (en) * 2007-05-10 2010-01-06 インターナショナル・ビジネス・マシーンズ・コーポレーション Future technology trend prediction support apparatus, method, program, and method for providing future technology trend prediction support service
US20080294976A1 (en) * 2007-05-22 2008-11-27 Eyal Rosenberg System and method for generating and communicating digital documents
US8954476B2 (en) * 2007-08-06 2015-02-10 Nipendo Ltd. System and method for mediating transactions of digital documents
US9646083B2 (en) * 2007-12-03 2017-05-09 International Business Machines Corporation Web 2.0 system and method for dynamic categorization of heterogeneous and regulated enterprise assets
US8140584B2 (en) * 2007-12-10 2012-03-20 Aloke Guha Adaptive data classification for data mining
US7779051B2 (en) * 2008-01-02 2010-08-17 International Business Machines Corporation System and method for optimizing federated and ETL'd databases with considerations of specialized data structures within an environment having multidimensional constraints
US7949654B2 (en) * 2008-03-31 2011-05-24 International Business Machines Corporation Supporting unified querying over autonomous unstructured and structured databases
US8255192B2 (en) * 2008-06-27 2012-08-28 Microsoft Corporation Analytical map models
US8620635B2 (en) 2008-06-27 2013-12-31 Microsoft Corporation Composition of analytics models
US8411085B2 (en) * 2008-06-27 2013-04-02 Microsoft Corporation Constructing view compositions for domain-specific environments
US20090322739A1 (en) * 2008-06-27 2009-12-31 Microsoft Corporation Visual Interactions with Analytics
US8117145B2 (en) 2008-06-27 2012-02-14 Microsoft Corporation Analytical model solver framework
US8290951B1 (en) * 2008-07-10 2012-10-16 Bank Of America Corporation Unstructured data integration with a data warehouse
US20100042623A1 (en) * 2008-08-14 2010-02-18 Junlan Feng System and method for mining and tracking business documents
US8266148B2 (en) * 2008-10-07 2012-09-11 Aumni Data, Inc. Method and system for business intelligence analytics on unstructured data
US8155931B2 (en) * 2008-11-26 2012-04-10 Microsoft Corporation Use of taxonomized analytics reference model
US8103608B2 (en) * 2008-11-26 2012-01-24 Microsoft Corporation Reference model for data-driven analytics
US8145615B2 (en) * 2008-11-26 2012-03-27 Microsoft Corporation Search and exploration using analytics reference model
US8190406B2 (en) * 2008-11-26 2012-05-29 Microsoft Corporation Hybrid solver for data-driven analytics
US20100161344A1 (en) * 2008-12-12 2010-06-24 Dyson David S Methods and apparatus to prepare report requests
US8314793B2 (en) * 2008-12-24 2012-11-20 Microsoft Corporation Implied analytical reasoning and computation
US8452791B2 (en) 2009-01-16 2013-05-28 Google Inc. Adding new instances to a structured presentation
US8412749B2 (en) 2009-01-16 2013-04-02 Google Inc. Populating a structured presentation with new values
US8615707B2 (en) 2009-01-16 2013-12-24 Google Inc. Adding new attributes to a structured presentation
US8977645B2 (en) 2009-01-16 2015-03-10 Google Inc. Accessing a search interface in a structured presentation
US20100228794A1 (en) * 2009-02-25 2010-09-09 International Business Machines Corporation Semantic document analysis
US8352412B2 (en) * 2009-02-27 2013-01-08 International Business Machines Corporation System for monitoring global online opinions via semantic extraction
US8250026B2 (en) 2009-03-06 2012-08-21 Peoplechart Corporation Combining medical information captured in structured and unstructured data formats for use or display in a user application, interface, or view
US8866818B2 (en) 2009-06-19 2014-10-21 Microsoft Corporation Composing shapes and data series in geometries
US8692826B2 (en) * 2009-06-19 2014-04-08 Brian C. Beckman Solver-based visualization framework
US8259134B2 (en) * 2009-06-19 2012-09-04 Microsoft Corporation Data-driven model implemented with spreadsheets
US8531451B2 (en) * 2009-06-19 2013-09-10 Microsoft Corporation Data-driven visualization transformation
US9330503B2 (en) 2009-06-19 2016-05-03 Microsoft Technology Licensing, Llc Presaging and surfacing interactivity within data visualizations
US8493406B2 (en) * 2009-06-19 2013-07-23 Microsoft Corporation Creating new charts and data visualizations
US8788574B2 (en) * 2009-06-19 2014-07-22 Microsoft Corporation Data-driven visualization of pseudo-infinite scenes
US8352397B2 (en) * 2009-09-10 2013-01-08 Microsoft Corporation Dependency graph in data-driven model
US9361359B1 (en) * 2009-09-25 2016-06-07 Emc Corporation Accessing schema-free databases
US9600919B1 (en) 2009-10-20 2017-03-21 Yahoo! Inc. Systems and methods for assembling and/or displaying multimedia objects, modules or presentations
AU2009233605B2 (en) * 2009-10-30 2016-06-23 IFRS System Pty Limited Processing Engine
US8683311B2 (en) 2009-12-11 2014-03-25 Microsoft Corporation Generating structured data objects from unstructured web pages
US20110145710A1 (en) * 2009-12-16 2011-06-16 Sap Ag Framework to visualize networks
US9558520B2 (en) * 2009-12-31 2017-01-31 Hartford Fire Insurance Company System and method for geocoded insurance processing using mobile devices
US8805707B2 (en) 2009-12-31 2014-08-12 Hartford Fire Insurance Company Systems and methods for providing a safety score associated with a user location
WO2011085562A1 (en) * 2010-01-18 2011-07-21 Hewlett-Packard Development Company, L.P. System and method for automatically extracting metadata from unstructured electronic documents
US20120303645A1 (en) * 2010-02-03 2012-11-29 Anita Kulkarni-Puranik System and method for extraction of structured data from arbitrarily structured composite data
US20110314001A1 (en) * 2010-06-18 2011-12-22 Microsoft Corporation Performing query expansion based upon statistical analysis of structured data
US9043296B2 (en) 2010-07-30 2015-05-26 Microsoft Technology Licensing, Llc System of providing suggestions based on accessible and contextual information
US9123161B2 (en) 2010-08-04 2015-09-01 Exxonmobil Upstream Research Company System and method for summarizing data on an unstructured grid
US8959102B2 (en) 2010-10-08 2015-02-17 Mmodal Ip Llc Structured searching of dynamic structured document corpuses
US10318877B2 (en) 2010-10-19 2019-06-11 International Business Machines Corporation Cohort-based prediction of a future event
US8484255B2 (en) * 2010-12-03 2013-07-09 Sap Ag Automatic conversion of multidimentional schema entities
US9064004B2 (en) * 2011-03-04 2015-06-23 Microsoft Technology Licensing, Llc Extensible surface for consuming information extraction services
US9037529B2 (en) 2011-06-15 2015-05-19 Ceresis, Llc Method for generating visual mapping of knowledge information from parsing of text inputs for subjects and predicates
US8407165B2 (en) * 2011-06-15 2013-03-26 Ceresis, Llc Method for parsing, searching and formatting of text input for visual mapping of knowledge information
US9785982B2 (en) 2011-09-12 2017-10-10 Doco Labs, Llc Telecom profitability management
US9110904B2 (en) * 2011-09-21 2015-08-18 Verizon Patent And Licensing Inc. Rule-based metadata transformation and aggregation for programs
US9361320B1 (en) * 2011-09-30 2016-06-07 Emc Corporation Modeling big data
US9020981B2 (en) * 2011-09-30 2015-04-28 Comprehend Systems, Inc. Systems and methods for generating schemas that represent multiple data sources
US8838519B2 (en) * 2011-10-06 2014-09-16 Ut-Battelle, Llc Graph-theoretic analysis of discrete-phase-space states for condition change detection and quantification of information
US8996350B1 (en) 2011-11-02 2015-03-31 Dub Software Group, Inc. System and method for automatic document management
US9245010B1 (en) 2011-11-02 2016-01-26 Sri International Extracting and leveraging knowledge from unstructured data
US10114843B2 (en) * 2011-11-09 2018-10-30 Sap Se Content migration framework
US8725552B2 (en) * 2011-11-28 2014-05-13 Dr/Decision Resources, Llc Pharmaceutical/life science technology evaluation and scoring
US8458189B1 (en) * 2011-11-28 2013-06-04 Sap Ag Automatic tagging between structured/unstructured data
TW201324417A (en) * 2011-12-08 2013-06-16 Infopower Corp Data processing method of business intelligence software
US10387503B2 (en) 2011-12-15 2019-08-20 Excalibur Ip, Llc Systems and methods involving features of search and/or search integration
US10504555B2 (en) 2011-12-20 2019-12-10 Oath Inc. Systems and methods involving features of creation/viewing/utilization of information modules such as mixed-media modules
US10296158B2 (en) 2011-12-20 2019-05-21 Oath Inc. Systems and methods involving features of creation/viewing/utilization of information modules such as mixed-media modules
US9836805B2 (en) * 2012-01-17 2017-12-05 Sackett Solutions & Innovations, LLC System for search and customized information updating of new patents and research, and evaluation of new research projects' and current patents' potential
US20130185276A1 (en) * 2012-01-17 2013-07-18 Sackett Solutions & Innovations, LLC System for Search and Customized Information Updating of New Patents and Research, and Evaluation of New Research Projects' and Current Patents' Potential
US11099714B2 (en) 2012-02-28 2021-08-24 Verizon Media Inc. Systems and methods involving creation/display/utilization of information modules, such as mixed-media and multimedia modules
US10372741B2 (en) 2012-03-02 2019-08-06 Clarabridge, Inc. Apparatus for automatic theme detection from unstructured data
US20130232157A1 (en) * 2012-03-05 2013-09-05 Tammer Eric Kamel Systems and methods for processing unstructured numerical data
US8954376B2 (en) * 2012-03-08 2015-02-10 International Business Machines Corporation Detecting transcoding tables in extract-transform-load processes
US8583626B2 (en) 2012-03-08 2013-11-12 International Business Machines Corporation Method to detect reference data tables in ETL processes
US8892579B2 (en) * 2012-04-26 2014-11-18 Anu Pareek Method and system of data extraction from a portable document format file
US9177000B2 (en) 2012-04-30 2015-11-03 International Business Machines Corporation Data index using a linked data standard
US9418389B2 (en) 2012-05-07 2016-08-16 Nasdaq, Inc. Social intelligence architecture using social media message queues
US10304036B2 (en) 2012-05-07 2019-05-28 Nasdaq, Inc. Social media profiling for one or more authors using one or more social media platforms
US20130311166A1 (en) * 2012-05-15 2013-11-21 Andre Yanpolsky Domain-Specific Natural-Language Processing Engine
US9843823B2 (en) 2012-05-23 2017-12-12 Yahoo Holdings, Inc. Systems and methods involving creation of information modules, including server, media searching, user interface and/or other features
US9251180B2 (en) 2012-05-29 2016-02-02 International Business Machines Corporation Supplementing structured information about entities with information from unstructured data sources
US10417289B2 (en) 2012-06-12 2019-09-17 Oath Inc. Systems and methods involving integration/creation of search results media modules
US10303723B2 (en) 2012-06-12 2019-05-28 Excalibur Ip, Llc Systems and methods involving search enhancement features associated with media modules
US8849843B1 (en) * 2012-06-18 2014-09-30 Ez-XBRL Solutions, Inc. System and method for facilitating associating semantic labels with content
US9679077B2 (en) 2012-06-29 2017-06-13 Mmodal Ip Llc Automated clinical evidence sheet workflow
US9047587B2 (en) * 2012-07-16 2015-06-02 Sap Portals Israel Ltd Incorporating electronic communication data content into an enterprise workspace
US20140164417A1 (en) * 2012-07-26 2014-06-12 Infosys Limited Methods for analyzing user opinions and devices thereof
US20140046977A1 (en) * 2012-08-10 2014-02-13 Xurmo Technologies Pvt. Ltd. System and method for mining patterns from relationship sequences extracted from big data
CA2881564A1 (en) 2012-08-13 2014-02-20 Mmodal Ip Llc Maintaining a discrete data representation that corresponds to information contained in free-form text
US8762133B2 (en) 2012-08-30 2014-06-24 Arria Data2Text Limited Method and apparatus for alert validation
US8762134B2 (en) 2012-08-30 2014-06-24 Arria Data2Text Limited Method and apparatus for situational analysis text generation
US9135327B1 (en) 2012-08-30 2015-09-15 Ez-XBRL Solutions, Inc. System and method to facilitate the association of structured content in a structured document with unstructured content in an unstructured document
US9336193B2 (en) 2012-08-30 2016-05-10 Arria Data2Text Limited Method and apparatus for updating a previously generated text
US9405448B2 (en) 2012-08-30 2016-08-02 Arria Data2Text Limited Method and apparatus for annotating a graphical output
US9135244B2 (en) 2012-08-30 2015-09-15 Arria Data2Text Limited Method and apparatus for configurable microplanning
US9355093B2 (en) 2012-08-30 2016-05-31 Arria Data2Text Limited Method and apparatus for referring expression generation
US9424249B1 (en) * 2012-09-18 2016-08-23 Amazon Technologies, Inc. Encoding text units
US8725750B1 (en) * 2012-10-25 2014-05-13 Hulu, LLC Framework for generating programs to process beacons
US9600471B2 (en) 2012-11-02 2017-03-21 Arria Data2Text Limited Method and apparatus for aggregating with information generalization
WO2014076524A1 (en) 2012-11-16 2014-05-22 Data2Text Limited Method and apparatus for spatial descriptions in an output text
WO2014076525A1 (en) 2012-11-16 2014-05-22 Data2Text Limited Method and apparatus for expressing time in an output text
WO2014102569A1 (en) 2012-12-27 2014-07-03 Arria Data2Text Limited Method and apparatus for motion description
WO2014102568A1 (en) 2012-12-27 2014-07-03 Arria Data2Text Limited Method and apparatus for motion detection
US10776561B2 (en) 2013-01-15 2020-09-15 Arria Data2Text Limited Method and apparatus for generating a linguistic representation of raw input data
US9291608B2 (en) 2013-03-13 2016-03-22 Aclima Inc. Calibration method for distributed sensor system
US9297748B2 (en) 2013-03-13 2016-03-29 Aclima Inc. Distributed sensor system with remote sensor nodes and centralized data processing
US9128994B2 (en) 2013-03-14 2015-09-08 Microsoft Technology Licensing, Llc Visually representing queries of multi-source data
US9218568B2 (en) 2013-03-15 2015-12-22 Business Objects Software Ltd. Disambiguating data using contextual and historical information
US9607038B2 (en) * 2013-03-15 2017-03-28 International Business Machines Corporation Determining linkage metadata of content of a target document to source documents
US9262550B2 (en) 2013-03-15 2016-02-16 Business Objects Software Ltd. Processing semi-structured data
US9299041B2 (en) 2013-03-15 2016-03-29 Business Objects Software Ltd. Obtaining data from unstructured data for a structured data collection
US20140331179A1 (en) * 2013-05-06 2014-11-06 Microsoft Corporation Automated Presentation of Visualized Data
US9495347B2 (en) * 2013-07-16 2016-11-15 Recommind, Inc. Systems and methods for extracting table information from documents
US9342608B2 (en) 2013-08-01 2016-05-17 International Business Machines Corporation Clarification of submitted questions in a question and answer system
US10762276B2 (en) * 2013-08-27 2020-09-01 Paper Software LLC Cross-references within a hierarchically structured document
US9946711B2 (en) 2013-08-29 2018-04-17 Arria Data2Text Limited Text generation from correlated alerts
US9244894B1 (en) 2013-09-16 2016-01-26 Arria Data2Text Limited Method and apparatus for interactive reports
US9396181B1 (en) 2013-09-16 2016-07-19 Arria Data2Text Limited Method, apparatus, and computer program product for user-directed reporting
US20160217112A1 (en) * 2013-09-25 2016-07-28 Chartspan Medical Technologies, Inc. User-Initiated Data Recognition and Data Conversion Process
US9396031B2 (en) 2013-09-27 2016-07-19 International Business Machines Corporation Distributed UIMA cluster computing (DUCC) facility
WO2015084757A1 (en) * 2013-12-02 2015-06-11 Qbase, LLC Systems and methods for processing data stored in a database
US9547701B2 (en) 2013-12-02 2017-01-17 Qbase, LLC Method of discovering and exploring feature knowledge
US9424294B2 (en) 2013-12-02 2016-08-23 Qbase, LLC Method for facet searching and search suggestions
US9201744B2 (en) 2013-12-02 2015-12-01 Qbase, LLC Fault tolerant architecture for distributed computing systems
US9424524B2 (en) 2013-12-02 2016-08-23 Qbase, LLC Extracting facts from unstructured text
US9208204B2 (en) 2013-12-02 2015-12-08 Qbase, LLC Search suggestions using fuzzy-score matching and entity co-occurrence
US9223833B2 (en) 2013-12-02 2015-12-29 Qbase, LLC Method for in-loop human validation of disambiguated features
US9025892B1 (en) 2013-12-02 2015-05-05 Qbase, LLC Data record compression with progressive and/or selective decomposition
US9355152B2 (en) 2013-12-02 2016-05-31 Qbase, LLC Non-exclusionary search within in-memory databases
US9542477B2 (en) 2013-12-02 2017-01-10 Qbase, LLC Method of automated discovery of topics relatedness
US9177262B2 (en) 2013-12-02 2015-11-03 Qbase, LLC Method of automated discovery of new topics
US9922032B2 (en) 2013-12-02 2018-03-20 Qbase, LLC Featured co-occurrence knowledge base from a corpus of documents
US9659108B2 (en) 2013-12-02 2017-05-23 Qbase, LLC Pluggable architecture for embedding analytics in clustered in-memory databases
US9230041B2 (en) 2013-12-02 2016-01-05 Qbase, LLC Search suggestions of related entities based on co-occurrence and/or fuzzy-score matching
US9836708B2 (en) 2013-12-13 2017-12-05 Visier Solutions, Inc. Dynamic identification of supported items in an application
US10114878B2 (en) 2013-12-16 2018-10-30 International Business Machines Corporation Index utilization in ETL tools
US9015730B1 (en) 2013-12-17 2015-04-21 International Business Machines Corporation Natural language access to application programming interfaces
US10466217B1 (en) 2013-12-23 2019-11-05 Aclima Inc. Method to combine partially aggregated sensor data in a distributed sensor system
US10979295B2 (en) 2014-01-21 2021-04-13 Micro Focus Llc Automatically discovering topology of an information technology (IT) infrastructure
US20150213035A1 (en) * 2014-01-24 2015-07-30 Bit Stew Systems Inc. Search Engine System and Method for a Utility Interface Platform
KR101568346B1 (en) 2014-03-28 2015-11-12 주식회사 솔트룩스 Knowledge acquisition system based on un-structured data for never-ending and self-evolving
US10664558B2 (en) 2014-04-18 2020-05-26 Arria Data2Text Limited Method and apparatus for document planning
US10877955B2 (en) * 2014-04-29 2020-12-29 Microsoft Technology Licensing, Llc Using lineage to infer data quality issues
US10089409B2 (en) 2014-04-29 2018-10-02 Microsoft Technology Licensing, Llc Event-triggered data quality verification
WO2015165545A1 (en) * 2014-05-01 2015-11-05 Longsand Limited Embedded processing of structured and unstructured data using a single application protocol interface (api)
WO2016007162A1 (en) 2014-07-10 2016-01-14 Hewlett-Packard Development Company, L.P. Categorizing columns in a data table
BR112017000661A2 (en) * 2014-07-15 2018-01-09 Microsoft Technology Licensing Llc data intake management
US20160098405A1 (en) * 2014-10-01 2016-04-07 Docurated, Inc. Document Curation System
CN105786921B (en) * 2014-12-26 2019-06-18 北京航天测控技术有限公司 A kind of the data module method for transformation and device of non-structured document
US10909138B2 (en) 2015-03-10 2021-02-02 Microsoft Technology Licensing, Llc Transforming data to share across applications
US9836599B2 (en) * 2015-03-13 2017-12-05 Microsoft Technology Licensing, Llc Implicit process detection and automation from unstructured activity
US10242359B2 (en) 2015-03-18 2019-03-26 International Business Machines Corporation Mining unstructured online content for automated currency value conversion
US20160321578A1 (en) * 2015-05-02 2016-11-03 Vatbox, Ltd. System and method for verifying enterprise resource planning data
US10331633B2 (en) 2015-06-04 2019-06-25 International Business Machines Corporation Schema discovery through statistical transduction
WO2017019705A1 (en) * 2015-07-27 2017-02-02 Texas State Technical College System Systems and methods for domain-specific machine-interpretation of input data
WO2017017678A1 (en) * 2015-07-27 2017-02-02 Opisoft Care Ltd. System and method for phrase search within document section
US10776357B2 (en) 2015-08-26 2020-09-15 Infosys Limited System and method of data join and metadata configuration
US10025846B2 (en) 2015-09-14 2018-07-17 International Business Machines Corporation Identifying entity mappings across data assets
US20170116550A1 (en) * 2015-09-30 2017-04-27 Tata Consultancy Services Limited System and method for enterprise data management
US10055430B2 (en) 2015-10-14 2018-08-21 International Business Machines Corporation Method for classifying an unmanaged dataset
US20170116194A1 (en) 2015-10-23 2017-04-27 International Business Machines Corporation Ingestion planning for complex tables
US10628456B2 (en) * 2015-10-30 2020-04-21 Hartford Fire Insurance Company Universal analytical data mart and data structure for same
JP6893209B2 (en) * 2015-10-30 2021-06-23 アクシオム コーポレーション Automatic interpretation of structured multifield file layout
US10521464B2 (en) * 2015-12-10 2019-12-31 Agile Data Decisions, Llc Method and system for extracting, verifying and cataloging technical information from unstructured documents
AU2016228174B1 (en) * 2016-04-27 2017-03-30 Accenture Global Solutions Limited Machine for generating unstructured syntax
US10282454B2 (en) * 2016-04-27 2019-05-07 Accenture Global Solutions Limited Machine for generating unstructured syntax
US11151653B1 (en) 2016-06-16 2021-10-19 Decision Resources, Inc. Method and system for managing data
US10248702B2 (en) * 2016-07-29 2019-04-02 International Business Machines Corporation Integration management for structured and unstructured data
US10963634B2 (en) * 2016-08-04 2021-03-30 Servicenow, Inc. Cross-platform classification of machine-generated textual data
US10445432B1 (en) 2016-08-31 2019-10-15 Arria Data2Text Limited Method and apparatus for lightweight multilingual natural language realizer
US10467347B1 (en) 2016-10-31 2019-11-05 Arria Data2Text Limited Method and apparatus for natural language document orchestrator
US10824681B2 (en) * 2016-11-21 2020-11-03 Sap Se Enterprise resource textual analysis
US10628058B1 (en) 2017-02-15 2020-04-21 Bank Of America Corporation System for electronic data verification, storage, and transfer
US11921765B2 (en) 2017-02-24 2024-03-05 Red Hat, Inc. Systematic iterative analysis of unstructured data files
CN110476159A (en) 2017-03-30 2019-11-19 日本电气株式会社 Information processing system, characteristic value illustration method and characteristic value read-me
US11113259B2 (en) * 2017-08-02 2021-09-07 Tata Consultancy Services Limited Method and system for analyzing unstructured data for compliance enforcement
SG11202003814TA (en) 2017-10-05 2020-05-28 Dotdata Inc Feature generating device, feature generating method, and feature generating program
US11763077B1 (en) * 2017-11-03 2023-09-19 EMC IP Holding Company LLC Uniform parsing of configuration files for multiple product types
US10592738B2 (en) 2017-12-01 2020-03-17 International Business Machines Corporation Cognitive document image digitalization
EP3495968A1 (en) * 2017-12-11 2019-06-12 Tata Consultancy Services Limited Method and system for extraction of relevant sections from plurality of documents
US10296578B1 (en) 2018-02-20 2019-05-21 Paycor, Inc. Intelligent extraction and organization of data from unstructured documents
US10698911B1 (en) * 2018-03-15 2020-06-30 Keysight Technologies, Inc. Method for ranking possible causes for anomalies in large data sets
US11048762B2 (en) 2018-03-16 2021-06-29 Open Text Holdings, Inc. User-defined automated document feature modeling, extraction and optimization
US10762142B2 (en) 2018-03-16 2020-09-01 Open Text Holdings, Inc. User-defined automated document feature extraction and optimization
US10956436B2 (en) 2018-04-17 2021-03-23 International Business Machines Corporation Refining search results generated from a combination of multiple types of searches
CN109241144B (en) * 2018-04-24 2022-02-08 中国银行股份有限公司 Operation and maintenance data mining and compliance checking method and system
US20190354809A1 (en) * 2018-05-21 2019-11-21 State Street Corporation Computational model management
US10509813B1 (en) * 2018-06-01 2019-12-17 Droit Financial Technologies LLC System and method for analyzing and modeling content
WO2019241630A1 (en) * 2018-06-15 2019-12-19 Deep Insight Solutions, Inc. Systems and methods for an artificial intelligence data fusion platform
CN110826974A (en) * 2018-08-13 2020-02-21 山东大学 Scientific and technological achievement transformation/incubation big data cloud platform internet + system
US11061942B2 (en) 2018-09-07 2021-07-13 Graywell, Inc. Unstructured data fusion by content-aware concurrent data processing pipeline
US10936640B2 (en) * 2018-10-09 2021-03-02 International Business Machines Corporation Intelligent visualization of unstructured data in column-oriented data tables
US11790262B2 (en) * 2019-01-22 2023-10-17 Accenture Global Solutions Limited Data transformations for robotic process automation
US11210266B2 (en) 2019-01-25 2021-12-28 International Business Machines Corporation Methods and systems for natural language processing of metadata
US11176000B2 (en) 2019-01-25 2021-11-16 International Business Machines Corporation Methods and systems for custom metadata driven data protection and identification of data
US11914869B2 (en) 2019-01-25 2024-02-27 International Business Machines Corporation Methods and systems for encryption based on intelligent data classification
US11610277B2 (en) 2019-01-25 2023-03-21 Open Text Holdings, Inc. Seamless electronic discovery system with an enterprise data portal
US11113238B2 (en) 2019-01-25 2021-09-07 International Business Machines Corporation Methods and systems for metadata tag inheritance between multiple storage systems
US11093448B2 (en) 2019-01-25 2021-08-17 International Business Machines Corporation Methods and systems for metadata tag inheritance for data tiering
US11030054B2 (en) 2019-01-25 2021-06-08 International Business Machines Corporation Methods and systems for data backup based on data classification
US11113148B2 (en) 2019-01-25 2021-09-07 International Business Machines Corporation Methods and systems for metadata tag inheritance for data backup
US11100048B2 (en) 2019-01-25 2021-08-24 International Business Machines Corporation Methods and systems for metadata tag inheritance between multiple file systems within a storage system
US10592544B1 (en) * 2019-02-12 2020-03-17 Live Objects, Inc. Generation of process models in domains with unstructured data
US11922140B2 (en) * 2019-04-05 2024-03-05 Oracle International Corporation Platform for integrating back-end data analysis tools using schema
US11157777B2 (en) * 2019-07-15 2021-10-26 Disney Enterprises, Inc. Quality control systems and methods for annotated content
CA3208838A1 (en) * 2019-08-26 2021-02-26 Bank Of Montreal Systems and methods for data mart rationalization
US10741168B1 (en) 2019-10-31 2020-08-11 Capital One Services, Llc Text-to-speech enriching system
US11502905B1 (en) 2019-12-19 2022-11-15 Wells Fargo Bank, N.A. Computing infrastructure standards assay
US11237847B1 (en) 2019-12-19 2022-02-01 Wells Fargo Bank, N.A. Automated standards-based computing system reconfiguration
US11645579B2 (en) 2019-12-20 2023-05-09 Disney Enterprises, Inc. Automated machine learning tagging and optimization of review procedures
US11783079B2 (en) * 2019-12-27 2023-10-10 International Business Machines Corporation Privacy protection for regulated computing environments
US11630870B2 (en) * 2020-01-06 2023-04-18 Tarek A. M. Abdunabi Academic search and analytics system and method therefor
US11494425B2 (en) 2020-02-03 2022-11-08 S&P Global Inc. Schema-informed extraction for unstructured data
US11500840B2 (en) * 2020-02-28 2022-11-15 International Business Machines Corporation Contrasting document-embedded structured data and generating summaries thereof
US20210279606A1 (en) * 2020-03-09 2021-09-09 Samsung Electronics Co., Ltd. Automatic detection and association of new attributes with entities in knowledge bases
AU2021203456A1 (en) * 2020-05-29 2021-12-16 Lexx Technologies Pty Ltd Computer-Implemented Method of Providing Maintenance Instructions for Servicing Equipment
US11941565B2 (en) 2020-06-11 2024-03-26 Capital One Services, Llc Citation and policy based document classification
US11275776B2 (en) * 2020-06-11 2022-03-15 Capital One Services, Llc Section-linked document classifiers
US11410447B2 (en) 2020-06-19 2022-08-09 Bank Of America Corporation Information security assessment translation engine
US11082315B1 (en) * 2020-12-14 2021-08-03 Qualcomm Incorporated Method of sub flow or activity classification
WO2023215334A1 (en) * 2022-05-02 2023-11-09 Blueflash Software Llc System and method for classification of unstructured data
US11947561B2 (en) * 2022-06-21 2024-04-02 International Business Machines Corporation Heterogeneous schema discovery for unstructured data

Family Cites Families (108)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4652733A (en) 1984-10-15 1987-03-24 At&T Company Technique for cataloging pictorial and/or written database information on video tape or disk
US4632733A (en) * 1985-12-30 1986-12-30 Nippon Kokan Kabushiki Kaisha Method for manufacturing one-side electrogalvanized steel strip
US4871903A (en) * 1987-07-31 1989-10-03 General Electric Company Apparatus for rapidly accessing a large data base employing an optical disc reading system with multiple heads and track position compensation means
US5162992A (en) * 1989-12-19 1992-11-10 International Business Machines Corp. Vector relational characteristical object
US5361353A (en) 1991-10-02 1994-11-01 International Business Machines Corporation System for parsing message units from an unstructured message stream of interleaved message units to form structured messages
JP3189186B2 (en) * 1992-03-23 2001-07-16 インターナショナル・ビジネス・マシーンズ・コーポレ−ション Translation device based on patterns
US5600831A (en) 1994-02-28 1997-02-04 Lucent Technologies Inc. Apparatus and methods for retrieving information by modifying query plan based on description of information sources
DE4432714C1 (en) * 1994-09-14 1995-11-02 Deutsche Forsch Luft Raumfahrt Method for determining the size of airborne water drops
US5729730A (en) 1995-03-28 1998-03-17 Dex Information Systems, Inc. Method and apparatus for improved information storage and retrieval system
US5608904A (en) * 1995-02-13 1997-03-04 Hewlett-Packard Company Method and apparatus for processing and optimizing queries having joins between structured data and text data
US5708825A (en) * 1995-05-26 1998-01-13 Iconovex Corporation Automatic summary page creation and hyperlink generation
US5664109A (en) 1995-06-07 1997-09-02 E-Systems, Inc. Method for extracting pre-defined data items from medical service records generated by health care providers
US5867799A (en) 1996-04-04 1999-02-02 Lang; Andrew K. Information system and method for filtering a massive flow of information entities to meet user information classification needs
US6052693A (en) 1996-07-02 2000-04-18 Harlequin Group Plc System for assembling large databases through information extracted from text sources
US5819265A (en) * 1996-07-12 1998-10-06 International Business Machines Corporation Processing names in a text
JPH10240589A (en) * 1997-02-21 1998-09-11 Hitachi Ltd Database processing method taking out actual data delay
US6009462A (en) * 1997-06-16 1999-12-28 Digital Equipment Corporation Replacing large bit component of electronic mail (e-mail) message with hot-link in distributed computer system
US6167397A (en) 1997-09-23 2000-12-26 At&T Corporation Method of clustering electronic documents in response to a search query
US6236994B1 (en) 1997-10-21 2001-05-22 Xerox Corporation Method and apparatus for the integration of information and knowledge
US6061678A (en) * 1997-10-31 2000-05-09 Oracle Corporation Approach for managing access to large objects in database systems using large object indexes
US6078924A (en) 1998-01-30 2000-06-20 Aeneid Corporation Method and apparatus for performing data collection, interpretation and analysis, in an information platform
US6366921B1 (en) 1999-02-09 2002-04-02 International Business Machines Corporation System and method for data manipulation in a dynamic object-based format
US6629097B1 (en) 1999-04-28 2003-09-30 Douglas K. Keith Displaying implicit associations among items in loosely-structured data sets
US6470277B1 (en) * 1999-07-30 2002-10-22 Agy Therapeutics, Inc. Techniques for facilitating identification of candidate genes
US6332163B1 (en) * 1999-09-01 2001-12-18 Accenture, Llp Method for providing communication services over a computer network system
US6546133B1 (en) 1999-09-08 2003-04-08 Ge Capital Commercial Finance, Inc. Methods and apparatus for print scraping
US6601026B2 (en) * 1999-09-17 2003-07-29 Discern Communications, Inc. Information retrieval by natural language querying
US6665685B1 (en) * 1999-11-01 2003-12-16 Cambridge Soft Corporation Deriving database interaction software
US6564215B1 (en) 1999-12-16 2003-05-13 International Business Machines Corporation Update support in database content management
US6449620B1 (en) * 2000-03-02 2002-09-10 Nimble Technology, Inc. Method and apparatus for generating information pages using semi-structured data stored in a structured manner
EP1139603A1 (en) * 2000-03-27 2001-10-04 Tektronix, Inc. Method and Apparatus for data analysing
AU2001261084A1 (en) 2000-04-27 2001-11-07 Brio Technology, Inc. Method and apparatus for processing jobs on an enterprise-wide computer system
US6912498B2 (en) * 2000-05-02 2005-06-28 Scansoft, Inc. Error correction in speech recognition by correcting text around selected area
US6732098B1 (en) * 2000-08-11 2004-05-04 Attensity Corporation Relational text index creation and searching
US6732097B1 (en) * 2000-08-11 2004-05-04 Attensity Corporation Relational text index creation and searching
US6728707B1 (en) * 2000-08-11 2004-04-27 Attensity Corporation Relational text index creation and searching
US6741988B1 (en) * 2000-08-11 2004-05-25 Attensity Corporation Relational text index creation and searching
US6738765B1 (en) * 2000-08-11 2004-05-18 Attensity Corporation Relational text index creation and searching
US7024425B2 (en) 2000-09-07 2006-04-04 Oracle International Corporation Method and apparatus for flexible storage and uniform manipulation of XML data in a relational database system
US20020065857A1 (en) 2000-10-04 2002-05-30 Zbigniew Michalewicz System and method for analysis and clustering of documents for search engine
US8230323B2 (en) 2000-12-06 2012-07-24 Sra International, Inc. Content distribution system and method
US6862585B2 (en) 2000-12-19 2005-03-01 The Procter & Gamble Company System and method for managing product development
US7363308B2 (en) 2000-12-28 2008-04-22 Fair Isaac Corporation System and method for obtaining keyword descriptions of records from a large database
US20020156817A1 (en) * 2001-02-22 2002-10-24 Volantia, Inc. System and method for extracting information
US7076485B2 (en) 2001-03-07 2006-07-11 The Mitre Corporation Method and system for finding similar records in mixed free-text and structured data
US20020128998A1 (en) 2001-03-07 2002-09-12 David Kil Automatic data explorer that determines relationships among original and derived fields
US6694307B2 (en) * 2001-03-07 2004-02-17 Netvention System for collecting specific information from several sources of unstructured digitized data
US7392287B2 (en) * 2001-03-27 2008-06-24 Hemisphere Ii Investment Lp Method and apparatus for sharing information using a handheld device
US7043535B2 (en) * 2001-03-30 2006-05-09 Xerox Corporation Systems and methods for combined browsing and searching in a document collection based on information scent
US7191183B1 (en) * 2001-04-10 2007-03-13 Rgi Informatics, Llc Analytics and data warehousing infrastructure and services
US6904428B2 (en) 2001-04-18 2005-06-07 Illinois Institute Of Technology Intranet mediator
US20020161626A1 (en) 2001-04-27 2002-10-31 Pierre Plante Web-assistant based e-marketing method and system
US6970881B1 (en) * 2001-05-07 2005-11-29 Intelligenxia, Inc. Concept-based method and system for dynamically analyzing unstructured information
US7536413B1 (en) * 2001-05-07 2009-05-19 Ixreveal, Inc. Concept-based categorization of unstructured objects
US6735578B2 (en) 2001-05-10 2004-05-11 Honeywell International Inc. Indexing of knowledge base in multilayer self-organizing maps with hessian and perturbation induced fast learning
WO2002095616A1 (en) 2001-05-18 2002-11-28 Mastersoft Research Pty Limited Parsing system
AUPR511301A0 (en) 2001-05-18 2001-06-14 Mastersoft Research Pty Limited Parsing system
US20030014406A1 (en) 2001-06-07 2003-01-16 Urbanpixel Inc. Intelligent browser windows in a multi-browser environment
US6980976B2 (en) 2001-08-13 2005-12-27 Oracle International Corp. Combined database index of unstructured and structured columns
WO2003021480A1 (en) 2001-09-04 2003-03-13 International Limited Database management system
CA2463434A1 (en) 2001-10-12 2003-04-24 Swiss Reinsurance Company System and method for reinsurance placement
EP1440410A2 (en) 2001-11-02 2004-07-28 Siemens Corporate Research, Inc. Patient data mining for lung cancer screening
GB2399666A (en) 2001-11-07 2004-09-22 Enkata Technologies Inc Method and system for root cause analysis of structured and instructured data
US7219130B2 (en) 2001-11-28 2007-05-15 Appmail Llc System and method for integrating e-mail into functionality of software application
US20030130894A1 (en) 2001-11-30 2003-07-10 Alison Huettner System for converting and delivering multiple subscriber data requests to remote subscribers
US7493265B2 (en) 2001-12-11 2009-02-17 Sas Institute Inc. Integrated biomedical information portal system and method
US20030158865A1 (en) 2001-12-28 2003-08-21 Frank Renkes Managing multiple data stores
US7225183B2 (en) 2002-01-28 2007-05-29 Ipxl, Inc. Ontology-based information management system and method
US20030176976A1 (en) 2002-01-28 2003-09-18 Steve Gardner Bioinformatics system architecture with data and process integration for overall portfolio management
US20030144892A1 (en) 2002-01-29 2003-07-31 International Business Machines Corporation Method, system, and storage medium for providing knowledge management services
JP2003271389A (en) * 2002-03-19 2003-09-26 Shuichi Araki Method for operating software object in natural language and its program
SG106068A1 (en) * 2002-04-02 2004-09-30 Reuters Ltd Metadata database management system and method therefor
US7010520B2 (en) 2002-04-26 2006-03-07 International Business Machines Corporation Method and system for searching documents with numbers
US20030206201A1 (en) * 2002-05-03 2003-11-06 Ly Eric Thichvi Method for graphical classification of unstructured data
CA2485554A1 (en) 2002-05-14 2003-11-27 Verity, Inc. Searching structured, semi-structured, and unstructured content
US6996575B2 (en) 2002-05-31 2006-02-07 Sas Institute Inc. Computer-implemented system and method for text-based document processing
US6892198B2 (en) * 2002-06-14 2005-05-10 Entopia, Inc. System and method for personalized information retrieval based on user expertise
US20040010491A1 (en) 2002-06-28 2004-01-15 Markus Riedinger User interface framework
US20040049473A1 (en) 2002-09-05 2004-03-11 David John Gower Information analytics systems and methods
US20040049505A1 (en) 2002-09-11 2004-03-11 Kelly Pennock Textual on-line analytical processing method and system
DE10337934A1 (en) 2002-09-30 2004-04-08 Siemens Ag Unstructured text conversion method in which the text is structured using structuring rules that operate on text fragments and sort them using terminology and subject dependent structuring rules
US6886010B2 (en) 2002-09-30 2005-04-26 The United States Of America As Represented By The Secretary Of The Navy Method for data and text mining and literature-based discovery
AU2003290678B2 (en) 2002-11-08 2009-12-24 Arbitration Forums, Inc. A system and process for electronic subrogation, inter-organization workflow management, inter-organization transaction processing and optimized web-baser user interaction
US7197503B2 (en) * 2002-11-26 2007-03-27 Honeywell International Inc. Intelligent retrieval and classification of information from a product manual
WO2004051432A2 (en) 2002-12-03 2004-06-17 Siemens Medical Solutions Usa, Inc. Systems and methods for automated extraction and processing of billing information in patient records
EP1588277A4 (en) * 2002-12-06 2007-04-25 Attensity Corp Systems and methods for providing a mixed data integration service
JP2004258912A (en) 2003-02-25 2004-09-16 Toshiba Corp Document retrieval device, method and program
US7146356B2 (en) * 2003-03-21 2006-12-05 International Business Machines Corporation Real-time aggregation of unstructured data into structured data for SQL processing by a relational database engine
US20040194009A1 (en) * 2003-03-27 2004-09-30 Lacomb Christina Automated understanding, extraction and structured reformatting of information in electronic files
US7081650B2 (en) * 2003-03-31 2006-07-25 Intel Corporation Interposer with signal and power supply through vias
US8495002B2 (en) 2003-05-06 2013-07-23 International Business Machines Corporation Software tool for training and testing a knowledge base
WO2004104865A2 (en) 2003-05-12 2004-12-02 Sun Microsystems, Inc. Methods and systems for intellectual capital sharing and control
US20040243560A1 (en) * 2003-05-30 2004-12-02 International Business Machines Corporation System, method and computer program product for performing unstructured information management and automatic text analysis, including an annotation inverted file system facilitating indexing and searching
US7139752B2 (en) 2003-05-30 2006-11-21 International Business Machines Corporation System, method and computer program product for performing unstructured information management and automatic text analysis, and providing multiple document views derived from different document tokenizations
US20040243554A1 (en) 2003-05-30 2004-12-02 International Business Machines Corporation System, method and computer program product for performing unstructured information management and automatic text analysis
US7640051B2 (en) 2003-06-25 2009-12-29 Siemens Medical Solutions Usa, Inc. Systems and methods for automated diagnosis and decision support for breast imaging
CN100481096C (en) 2003-06-25 2009-04-22 美国西门子医疗解决公司 Automated regional myocardial assessment method for cardiac imaging
US7257585B2 (en) * 2003-07-02 2007-08-14 Vibrant Media Limited Method and system for augmenting web content
US7389306B2 (en) 2003-07-25 2008-06-17 Enkata Technologies, Inc. System and method for processing semi-structured business data using selected template designs
US7333997B2 (en) 2003-08-12 2008-02-19 Viziant Corporation Knowledge discovery method with utility functions and feedback loops
US7478100B2 (en) 2003-09-05 2009-01-13 Oracle International Corporation Method and mechanism for efficient storage and query of XML documents based on paths
US20050065941A1 (en) 2003-09-23 2005-03-24 Deangelis Stephen F. Systems for optimizing business processes, complying with regulations, and identifying threat and vulnerabilty risks for an enterprise
US7813947B2 (en) 2003-09-23 2010-10-12 Enterra Solutions, Llc Systems and methods for optimizing business processes, complying with regulations, and identifying threat and vulnerabilty risks for an enterprise
KR100533810B1 (en) 2003-10-16 2005-12-07 한국전자통신연구원 Semi-Automatic Construction Method for Knowledge of Encyclopedia Question Answering System
US7155444B2 (en) 2003-10-23 2006-12-26 Microsoft Corporation Promotion and demotion techniques to facilitate file property management between object systems
US7917548B2 (en) 2003-11-14 2011-03-29 Bottelle Memorial Institute Universal parsing agent system and method
US20050243604A1 (en) * 2004-03-16 2005-11-03 Ascential Software Corporation Migrating integration processes among data integration platforms
US7849049B2 (en) * 2005-07-05 2010-12-07 Clarabridge, Inc. Schema and ETL tools for structured and unstructured data

Also Published As

Publication number Publication date
US7849048B2 (en) 2010-12-07
EP1899855A2 (en) 2008-03-19
EP1899855B1 (en) 2018-12-19
WO2007005730A2 (en) 2007-01-11
EP1899855A4 (en) 2011-01-26
US20110161333A1 (en) 2011-06-30
WO2007005730A3 (en) 2007-04-05
US20070011134A1 (en) 2007-01-11

Similar Documents

Publication Publication Date Title
WO2007005730B1 (en) System and method of making unstructured data available to structured data analysis tools
CN110083805B (en) Method and system for converting Word file into EPUB file
CN101539904B (en) Automatic indexing method of quotations
Hawkins Bibliometrics of electronic journals in information science
Hinrichs et al. Trading consequences: A case study of combining text mining and visualization to facilitate document exploration
US8880463B2 (en) Standardized framework for reporting archived legacy system data
CN101206670B (en) System and method for transferring non construction information to content
US20070011183A1 (en) Analysis and transformation tools for structured and unstructured data
CN102566945B (en) Method and system for realizing automatic acquisition and on-demand printing of book
US9753977B2 (en) Method and system for managing database
WO2007059469A3 (en) System and method for delivering results of a search query in an information management system
WO2014105867A4 (en) Systems and methods for creating, editing, storing and retrieving knowledge contained in specification documents
US20100228794A1 (en) Semantic document analysis
CN102207948A (en) Method for generating incident statement sentence material base
CN101136013A (en) Method for quick updating data domain in full text retrieval system
CN101158958A (en) Fusion enquire method based on MySQL storage engines
WO2018127747A1 (en) A method, apparatus and computer program product for user-directed database configuration, and automated mining and conversion of data
CN112825069A (en) Method, device and system for analyzing database data and storage medium
US20150261837A1 (en) Querying Structured And Unstructured Databases
CN103473444A (en) Electronic medical record system based on intelligent analyzing data structure and processing method of system
CN111178057B (en) Content analysis and extraction system for government electronic documents
Lee et al. Database forensic investigation based on table relationship analysis techniques
CA2414230A1 (en) Computer method and device for transporting data
TWM578817U (en) Processing system for converting data of data system into relational data format
CN112463728A (en) Bibliographic data extraction method of scientific and technological literature

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application
NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 2006774414

Country of ref document: EP