System for processing at least partially structured data

Naama Bamberger et al.
An improved system for processing at least partially structured data includes a method for comparing a population of terms in a term repository, including standardizing each term within the population of terms based on at least one standardization rule. The method also includes comparing at least...
Inventors: Naama Bamberger, Uri Bernstein, Gil Reich, Tamar Rosen, Lev Reitblat, Rita Zlotnikov, Mike Berkowitz, Yehudit Halle, Jack Kustanowitz, Yedida Lubin, Oren Samuel
Assignee: Answers Corporation
Primary Examiner: Sana Al-Hashemi
Attorneys: Hoffman, Wasson & Gitler, P.C.

U.S. Classification
707/3; 707/102; 707/104.1; 707/103R

Citations

Patent Number | Title | Issue date
5115504 | Information management system | May 19, 1992
5544352 | Method and apparatus for indexing, searching and displaying data | Aug 6, 1996
5659732 | Document retrieval over networks wherein ranking and relevance scores are computed at the client for multiple database documents | Aug 19, 1997
5710918 | Method for distributed task fulfillment of web browser requests | Jan 20, 1998
5721911 | Mechanism for metadata for an information catalog system | Feb 24, 1998
5745895 | Method for association of heterogeneous information | Apr 28, 1998
5781911 | Integrated system and method of data warehousing and delivery | Jul 14, 1998
5794246 | Method for incremental aggregation of dynamically increasing database data sets | Aug 11, 1998
5826261 | System and method for querying multiple, distributed databases by selective sharing of local relative significance information for terms related to the query | Oct 20, 1998
5867799 | Information system and method for filtering a massive flow of information entities to meet user information classification needs | Feb 2, 1999
5870746 | System and method for segmenting a database based upon data attributes | Feb 9, 1999
5907838 | Information search and collection method and system | May 25, 1999
5920854 | Real-time document collection search engine with phrase indexing | Jul 6, 1999
5930788 | Disambiguation of themes in a document classification system | Jul 27, 1999
5940821 | Information presentation in a knowledge base search and retrieval system | Aug 17, 1999
5953718 | Research mode for a knowledge base search and retrieval system | Sep 14, 1999
5963944 | System and method for distributing and indexing computerized documents using independent agents | Oct 5, 1999
5983214 | System and method employing individual user content-based data and user collaborative feedback data to evaluate the content of an information entity in a large information communication network | Nov 9, 1999
5987454 | Method and apparatus for selectively augmenting retrieved text, numbers, maps, charts, still pictures and/or graphics, moving pictures and/or graphics and audio information from a network resource | Nov 16, 1999
6038560 | Concept knowledge base search and retrieval system | Mar 14, 2000
6044374 | Method and apparatus for sharing metadata between multiple data marts through object references | Mar 28, 2000
6067539 | Intelligent information retrieval system | May 23, 2000
6085190 | Apparatus and method for retrieval of information from various structured information | Jul 4, 2000
6101491 | Method and apparatus for distributed indexing and retrieval | Aug 8, 2000
6102969 | Method and system using information written in a wrapper description language to execute query on a network | Aug 15, 2000
6141659 | Systems, methods and computer program products for retrieving documents from multiple document servers via a single client session | Oct 31, 2000
6144958 | System and method for correcting spelling errors in search queries | Nov 7, 2000
6148298 | System and method for aggregating distributed data | Nov 14, 2000
6151601 | Computer architecture and method for collecting, analyzing and/or transforming internet and/or electronic commerce data for storage into a data storage area | Nov 21, 2000
6151604 | Method and apparatus for improved information storage and retrieval system | Nov 21, 2000
6154213 | Immersive movement-based interaction with large complex information structures | Nov 28, 2000
6161103 | Method and apparatus for creating aggregates for use in a datamart | Dec 12, 2000
6163781 | Object-to-relational data converter mapping attributes to object instance into relational tables | Dec 19, 2000
6163782 | Efficient and effective distributed information management | Dec 19, 2000
6167405 | Method and apparatus for automatically populating a data warehouse system | Dec 26, 2000
6178416 | Method and apparatus for knowledgebase searching | Jan 23, 2001
6178418 | Distributed data warehouse query and resource management system | Jan 23, 2001
6182082 | Method and system for managing object-oriented database | Jan 30, 2001
6185550 | Method and apparatus for classifying documents within a class hierarchy creating term vector, term file and relevance ranking | Feb 6, 2001
6185572 | Method for representing data from non-relational, non-object-oriented datastores as queryable datastore persistent objects | Feb 6, 2001
6189004 | Method and apparatus for creating a datamart and for creating a query structure for the datamart | Feb 13, 2001
6208975 | Information aggregation and synthesization system | Mar 27, 2001
6208988 | Method for identifying themes associated with a search query using metadata and for organizing documents responsive to the search query in accordance with the themes | Mar 27, 2001
6212524 | Method and apparatus for creating and populating a datamart | Apr 3, 2001
6226632 | Structured-text cataloging method, structured-text searching method, and portable medium used in the methods | May 1, 2001
6226635 | Layered query management | May 1, 2001
6233581 | Method for processing and accessing data objects, particularly documents, and system therefor | May 15, 2001
6233584 | Technique for providing a universal query for multiple different databases | May 15, 2001
6236768 | Method and apparatus for automated, context-dependent retrieval of information | May 22, 2001
6236987 | Dynamic content organization in information retrieval systems | May 22, 2001
6236988 | Data retrieval system | May 22, 2001
6236991 | Method and system for providing access for categorized information from online internet and intranet sources | May 22, 2001
6236994 | Method and apparatus for the integration of information and knowledge | May 22, 2001
6240407 | Method and apparatus for creating an index in a database system | May 29, 2001
6263341 | Information repository system and method including data objects and a relationship object | Jul 17, 2001
6272495 | Method and apparatus for processing free-format data | Aug 7, 2001
6275824 | System and method for managing data privacy in a database management system | Aug 14, 2001
6278990 | Sort system for text retrieval | Aug 21, 2001
6282537 | Query and retrieving semi-structured data from heterogeneous sources by translating structured queries | Aug 28, 2001
6314434 | Structured data management system and computer-readable method for storing structured data management program | Nov 6, 2001
6381592 | Candidate chaser | Apr 30, 2002
6411924 | System and method for linguistic filter and interactive display | Jun 25, 2002
6647391 | System, method and article of manufacture for fast mapping from a propertied document management system to a relational database | Nov 11, 2003

Claims

What is claimed is:

1. A computer system for displaying information pertaining to any of a universe of topics, said information comprising at least partially structured data culled from a plurality of online data sources each storing a first multiplicity of at least partially structured online data entries each pertaining to an individual topic from among a second multiplicity of topics, wherein, for at least one topic, more than one online data entry pertains thereto, the system comprising:

a topic repository;

a topic builder including a user interface and being operative at least partially automatically to employ structure in at least partially structured data in said plurality of online data sources to facilitate access to said data from said plurality of online data sources, by topic, including unification of online data entries pertaining to a single topic, for display under said single topic, said unification comprising analyzing a set of topics to identify therewithin subsets of identical topics and collapsing the set by redefining all identical topics in each subset as a single topic; and

a topic-oriented user interface employed by a user to access said data in association with said topic repository by topic.
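
The unification step recited in claim 1 (identifying subsets of identical topics and collapsing each subset into a single topic) can be illustrated with a minimal Python sketch. This is not the patented implementation: the `normalize` rule, the `(source, title, text)` entry shape, and the sample entries are all assumptions made for illustration, and "identical" is approximated here by equality of normalized titles.

```python
from collections import defaultdict


def normalize(title: str) -> str:
    """Hypothetical standardization rule: lowercase and collapse whitespace."""
    return " ".join(title.lower().split())


def unify(entries):
    """Collapse entries whose normalized titles are equal into one topic.

    `entries` is an iterable of (source, title, text) tuples (assumed shape).
    Returns a dict mapping each unified topic to its (source, text) entries.
    """
    topics = defaultdict(list)
    for source, title, text in entries:
        topics[normalize(title)].append((source, text))
    return dict(topics)


# Illustrative data only; the sources and texts are invented.
entries = [
    ("encyclopedia", "Albert Einstein", "Physicist, born 1879."),
    ("dictionary", "albert  einstein", "Proper noun."),
    ("almanac", "Marie Curie", "Chemist and physicist."),
]
repository = unify(entries)
```

Here the two Einstein entries, which come from different sources and differ in case and spacing, end up under one topic key, while the Curie entry remains a separate topic.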

2. A topic builder for use in a computer system for displaying information pertaining to any of a universe of topics,

said information comprising at least partially structured data culled from a plurality of online data sources each storing a first multiplicity of at least partially structured online data entries each pertaining to an individual topic from among a second multiplicity of topics, wherein, for at least one topic, more than one online data entry pertains thereto, the system comprising a topic repository and a topic-oriented user interface employed by a user to access said data in said topic repository by topic,

said topic builder including a user interface and being operative at least partially automatically to employ structure in at least partially structured data in said plurality of online data sources to facilitate access to said data from said plurality of online data sources, by topic, including unification of data entries pertaining to a single topic, for display under said single topic, said unification comprising analyzing a set of topics to identify therewithin subsets of identical topics and collapsing the set by redefining all identical topics in each subset as a single topic.

3. A system according to claim 1 and also comprising:

an access controller operative to selectively assign, to various users, permission to access data originating from various of the plurality of online data sources.
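
A rough sketch of how such an access controller might behave is given below. The class name `AccessController`, its methods, and the user and source names are hypothetical; the claim only requires that permissions be assigned selectively per user and per data source.

```python
class AccessController:
    """Hypothetical per-user, per-source permission table."""

    def __init__(self):
        self._grants = {}  # user name -> set of permitted source names

    def grant(self, user, source):
        """Allow `user` to see entries originating from `source`."""
        self._grants.setdefault(user, set()).add(source)

    def filter(self, user, entries):
        """Return only the (source, title) entries the user may access."""
        allowed = self._grants.get(user, set())
        return [entry for entry in entries if entry[0] in allowed]


# Illustrative data only.
controller = AccessController()
controller.grant("alice", "encyclopedia")
controller.grant("alice", "dictionary")
controller.grant("bob", "dictionary")

entries = [
    ("encyclopedia", "Einstein"),
    ("dictionary", "Einstein"),
    ("almanac", "Einstein"),
]
alice_view = controller.filter("alice", entries)
bob_view = controller.filter("bob", entries)
```

A user with no grants sees nothing, which is a deny-by-default choice made for this sketch rather than something the claim specifies.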

4. A system according to claim 1 wherein at least one logical combination of at least one sequence of at least one keyword is deemed to pertain to a single topic.

5. A system according to claim 1 wherein said unification comprises unification based on overlap of significant words or phrases between entries.
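
One plausible way to measure overlap of significant words is a Jaccard score over the words that remain after removing common function words. The claim does not name a metric, so the Jaccard measure, the `STOPWORDS` list, and the sample texts below are assumptions.

```python
# Hypothetical list of non-significant words.
STOPWORDS = frozenset({"a", "an", "and", "in", "of", "the", "was", "who"})


def significant_words(text):
    """Lowercase the text, strip edge punctuation, and drop stopwords."""
    words = (w.strip(".,;:!?()\"'").lower() for w in text.split())
    return {w for w in words if w and w not in STOPWORDS}


def overlap(text_a, text_b):
    """Jaccard overlap (0.0 to 1.0) of the significant words of two texts."""
    words_a = significant_words(text_a)
    words_b = significant_words(text_b)
    if not words_a or not words_b:
        return 0.0
    return len(words_a & words_b) / len(words_a | words_b)


a = "The physicist who developed the theory of relativity"
b = "Developed relativity theory; a physicist"
c = "A chemist in Paris"
```

With these inputs, `a` and `b` share all four significant words, while `a` and `c` share none. A system could treat entries scoring above some chosen threshold as candidates for unification.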

6. A system according to claim 1 wherein said unification comprises unification of data entries based on similarity in meaning between at least one data field which each data entry includes.

7. A system according to claim 1 wherein said unification comprises using fuzzy matching algorithms to compare texts of data entries.
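
The claim does not say which fuzzy matching algorithms are used. As one illustration, Python's standard-library `difflib.SequenceMatcher` yields a similarity ratio between two strings; the `is_fuzzy_match` helper and the 0.85 threshold are assumptions chosen for this sketch.

```python
from difflib import SequenceMatcher


def is_fuzzy_match(text_a, text_b, threshold=0.85):
    """Return True when the two texts are similar enough to be unified.

    The threshold is an arbitrary illustrative value, not taken from the patent.
    """
    ratio = SequenceMatcher(None, text_a.lower(), text_b.lower()).ratio()
    return ratio >= threshold


# Variant transliterations should match; unrelated names should not.
same = is_fuzzy_match("Tchaikovsky", "Tchaikovski")
different = is_fuzzy_match("Tchaikovsky", "Einstein")
```

In practice the threshold would need tuning against real entries, since a value that is too low merges distinct topics and one that is too high misses spelling variants.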

8. A system according to claim 1 wherein said unification comprises internal unification within a single data source.

9. A system according to claim 1 wherein said unification comprises title-based unification based on titles of data entries.

10. A system according to claim 1 wherein said unification comprises manually changing at least one system-made unification decision.
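
A manual change to a system-made decision could be modelled as an override table that is consulted before the automatic result is used. The function name, the use of entry titles as keys, and the sample data are hypothetical.

```python
def final_decision(entry_a, entry_b, system_says_same, overrides):
    """Return the manual decision for this pair if one exists.

    Otherwise fall back to the system-made decision. Pairs are stored as
    frozensets so that the order of the two entries does not matter.
    """
    key = frozenset((entry_a, entry_b))
    return overrides.get(key, system_says_same)


# An editor has ruled that these two entries are NOT the same topic.
overrides = {frozenset(("Mercury (planet)", "Mercury (element)")): False}

overridden = final_decision("Mercury (element)", "Mercury (planet)", True, overrides)
untouched = final_decision("Bach", "J. S. Bach", True, overrides)
```

Keeping overrides separate from the automatic matcher means a manual correction can survive a later re-run of unification.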

11. A topic builder according to claim 2 wherein at least one logical combination of at least one sequence of at least one keyword is deemed to pertain to a single topic.

12. A topic builder according to claim 2 wherein said unification comprises unification based on overlap of significant words or phrases between entries.

13. A topic builder according to claim 2 wherein said unification comprises unification of data entries based on similarity in meaning between at least one data field which each data entry includes.

14. A topic builder according to claim 2 wherein said unification comprises using fuzzy matching algorithms to compare texts of data entries.

15. A topic builder according to claim 2 wherein said unification comprises internal unification within a single data source.

16. A topic builder according to claim 2 wherein said unification comprises title-based unification based on titles of data entries.

17. A topic builder according to claim 2 wherein said unification comprises manually changing at least one system-made unification decision.