DE69901544D1 - Method and apparatus for creating a pattern dictionary for use in the detection of homologous sequences

Method and apparatus for creating a pattern dictionary for use in the detection of homologous sequences

Info

Publication number
DE69901544D1
Authority
DE
Germany
Prior art keywords
homological
sequences
recognition
creating
application
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE69901544T
Other languages
English (en)
Other versions
DE69901544T2 (de)
Inventor
Aris Floratos
Isidore Rigoutsos
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp
Publication of DE69901544D1
Application granted
Publication of DE69901544T2
Anticipated expiration
Expired - Lifetime

Classifications

    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16B BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00 ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/28 Determining representative reference patterns, e.g. by averaging or distorting; Generating dictionaries
    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16B BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00 ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • G16B30/10 Sequence alignment; Homology search
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10 TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10S TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00 Data processing: database and file management or data structures
    • Y10S707/953 Organization of data
    • Y10S707/959 Network
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10 TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10S TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00 Data processing: database and file management or data structures
    • Y10S707/99931 Database or file accessing
    • Y10S707/99933 Query processing, i.e. searching
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10 TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10S TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00 Data processing: database and file management or data structures
    • Y10S707/99931 Database or file accessing
    • Y10S707/99933 Query processing, i.e. searching
    • Y10S707/99934 Query formulation, input preparation, or translation
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10 TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10S TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00 Data processing: database and file management or data structures
    • Y10S707/99931 Database or file accessing
    • Y10S707/99933 Query processing, i.e. searching
    • Y10S707/99936 Pattern matching access
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10 TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10S TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S930/00 Peptide or protein sequence
    • Y10S930/01 Peptide or protein sequence
    • Y10S930/31 Linker sequence
DE69901544T 1998-10-30 1999-10-29 Method and apparatus for creating a pattern dictionary for use in the detection of homologous sequences Expired - Lifetime DE69901544T2 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10629598P 1998-10-30 1998-10-30
PCT/US1999/025367 WO2000026819A1 (en) 1998-10-30 1999-10-29 Methods and apparatus for performing pattern dictionary formation for use in sequence homology detection

Publications (2)

Publication Number Publication Date
DE69901544D1 (de) 2002-06-27
DE69901544T2 (de) 2003-01-16

Family

ID=22310644

Family Applications (2)

Application Number Title Priority Date Filing Date
DE69901544T Expired - Lifetime DE69901544T2 (de) 1998-10-30 1999-10-29 Method and apparatus for creating a pattern dictionary for use in the detection of homologous sequences
DE69904435T Expired - Lifetime DE69904435T2 (de) 1998-10-30 1999-10-29 Method and apparatus for detecting homologous sequences

Family Applications After (1)

Application Number Title Priority Date Filing Date
DE69904435T Expired - Lifetime DE69904435T2 (de) 1998-10-30 1999-10-29 Method and apparatus for detecting homologous sequences

Country Status (7)

Country Link
US (2) US6785672B1 (de)
EP (2) EP1057131B1 (de)
JP (2) JP3412618B2 (de)
CN (2) CN1108579C (de)
CA (1) CA2315147C (de)
DE (2) DE69901544T2 (de)
WO (2) WO2000026818A1 (de)

Families Citing this family (42)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001086577A2 (en) 2000-05-10 2001-11-15 E. I. Du Pont De Nemours And Company Method of discovering patterns in symbol sequences
WO2002005133A1 (en) * 2000-07-07 2002-01-17 Kent Ridge Digital Labs A method and apparatus for searching a database containing biological information
JP3871301B2 (ja) * 2001-05-15 2007-01-24 インターナショナル・ビジネス・マシーンズ・コーポレーション Database search apparatus and program
CA2387277C (en) * 2001-05-25 2015-03-03 Hitachi, Ltd. Information processing system using nucleotide sequence-related information
US20030125931A1 (en) * 2001-12-07 2003-07-03 Shannon Roy Campbell Method for matching strings
US6996558B2 (en) 2002-02-26 2006-02-07 International Business Machines Corporation Application portability and extensibility through database schema and query abstraction
US7110540B2 (en) * 2002-04-25 2006-09-19 Intel Corporation Multi-pass hierarchical pattern matching
US20040126840A1 (en) * 2002-12-23 2004-07-01 Affymetrix, Inc. Method, system and computer software for providing genomic ontological data
US8239400B2 (en) * 2003-08-21 2012-08-07 International Business Machines Corporation Annotation of query components
US7203680B2 (en) * 2003-10-01 2007-04-10 International Business Machines Corporation System and method for encoding and detecting extensible patterns
US7900133B2 (en) 2003-12-09 2011-03-01 International Business Machines Corporation Annotation structure type determination
US20060235845A1 (en) * 2005-04-15 2006-10-19 Argentar David R Identifying patterns of symbols in sequences of symbols using a binary array representation of the sequence
US7188032B2 (en) * 2005-06-30 2007-03-06 International Business Machines Corporation Incremental determination of Teiresias patterns
US7822759B2 (en) * 2005-12-13 2010-10-26 Microsoft Corporation Query-driven sharing and syndication
EP2021979B1 (de) * 2006-05-30 2012-03-21 Yissum Research Development Company of the Hebrew University of Jerusalem Pattern matching
CN1932040B (zh) * 2006-09-21 2010-06-09 武汉大学 Automated rapid detection system for members of a target gene family across the whole genome
JP5007803B2 (ja) * 2007-03-09 2012-08-22 独立行政法人農業生物資源研究所 Gene clustering apparatus, gene clustering method, and program
US7970614B2 (en) * 2007-05-08 2011-06-28 Nuance Communications, Inc. Continuous adaptation in detection systems via self-tuning from target population subsets
US8290921B2 (en) * 2007-06-28 2012-10-16 Microsoft Corporation Identification of similar queries based on overall and partial similarity of time series
US7693823B2 (en) * 2007-06-28 2010-04-06 Microsoft Corporation Forecasting time-dependent search queries
US7685100B2 (en) * 2007-06-28 2010-03-23 Microsoft Corporation Forecasting search queries based on time dependencies
US8090709B2 (en) * 2007-06-28 2012-01-03 Microsoft Corporation Representing queries and determining similarity based on an ARIMA model
US7693908B2 (en) * 2007-06-28 2010-04-06 Microsoft Corporation Determination of time dependency of search queries
US7685099B2 (en) * 2007-06-28 2010-03-23 Microsoft Corporation Forecasting time-independent search queries
US7689622B2 (en) * 2007-06-28 2010-03-30 Microsoft Corporation Identification of events of search queries
JP5193518B2 (ja) * 2007-07-13 2013-05-08 株式会社東芝 Pattern search apparatus and method
US9775554B2 (en) * 2007-12-31 2017-10-03 Invention Science Fund I, Llc Population cohort-linked avatar
EP2235836A4 (de) * 2008-01-24 2012-08-29 Sra International Inc System and method for matching variant strings
CN101714187B (zh) * 2008-10-07 2011-09-28 中国科学院计算技术研究所 Index acceleration method for large-scale protein identification and corresponding system
US9135396B1 (en) 2008-12-22 2015-09-15 Amazon Technologies, Inc. Method and system for determining sets of variant items
US8689172B2 (en) * 2009-03-24 2014-04-01 International Business Machines Corporation Mining sequential patterns in weighted directed graphs
EP2480991A2 (de) * 2009-09-25 2012-08-01 Adnan Fakeih Database and method for evaluating data from the database
JP5790006B2 (ja) * 2010-05-25 2015-10-07 ソニー株式会社 Information processing apparatus, information processing method, and program
CN102682226B (zh) * 2012-04-18 2015-09-30 盛司潼 Nucleic acid sequencing information processing system and method
US9092566B2 (en) 2012-04-20 2015-07-28 International Drug Development Institute Methods for central monitoring of research trials
US9348902B2 (en) 2013-01-30 2016-05-24 Wal-Mart Stores, Inc. Automated attribute disambiguation with human input
US10191929B2 (en) * 2013-05-29 2019-01-29 Noblis, Inc. Systems and methods for SNP analysis and genome sequencing
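CN104636636B (zh) * 2015-02-02 2018-01-05 哈尔滨工业大学深圳研究生院 Protein remote homology detection method and apparatus
CN107239458B (zh) * 2016-03-28 2021-01-29 阿里巴巴集团控股有限公司 Method and apparatus for inferring relationships between development objects based on big data
CN111178615B (zh) * 2019-12-24 2023-10-27 成都数联铭品科技有限公司 Method and system for constructing an enterprise risk identification model
CN111445962B (zh) * 2020-03-27 2022-12-16 上海祥耀生物科技有限责任公司 Method and apparatus for constructing an antibody library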
US11715022B2 (en) * 2020-07-01 2023-08-01 International Business Machines Corporation Managing the selection and presentation sequence of visual elements

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5265065A (en) * 1991-10-08 1993-11-23 West Publishing Company Method and apparatus for information retrieval from a database by replacing domain specific stemmed phases in a natural language to create a search query
US6303297B1 (en) * 1992-07-17 2001-10-16 Incyte Pharmaceuticals, Inc. Database for storage and analysis of full-length sequences
JPH0793370A (ja) * 1993-09-27 1995-04-07 Hitachi Device Eng Co Ltd Gene database search system
JP3611601B2 (ja) * 1994-09-01 2005-01-19 富士通株式会社 List processing system and method
US5940825A (en) * 1996-10-04 1999-08-17 International Business Machines Corporation Adaptive similarity searching in sequence databases
US6023659A (en) * 1996-10-10 2000-02-08 Incyte Pharmaceuticals, Inc. Database system employing protein function hierarchies for viewing biomolecular sequence data
US6189013B1 (en) * 1996-12-12 2001-02-13 Incyte Genomics, Inc. Project-based full length biomolecular sequence database
US5873052A (en) 1996-11-06 1999-02-16 The Perkin-Elmer Corporation Alignment-based similarity scoring methods for quantifying the differences between related biopolymer sequences
US6108666A (en) 1997-06-12 2000-08-22 International Business Machines Corporation Method and apparatus for pattern discovery in 1-dimensional event streams
US6373971B1 (en) 1997-06-12 2002-04-16 International Business Machines Corporation Method and apparatus for pattern discovery in protein sequences
US5977890A (en) 1997-06-12 1999-11-02 International Business Machines Corporation Method and apparatus for data compression utilizing efficient pattern discovery
US6029167A (en) * 1997-07-25 2000-02-22 Claritech Corporation Method and apparatus for retrieving text using document signatures
US6092065A (en) 1998-02-13 2000-07-18 International Business Machines Corporation Method and apparatus for discovery, clustering and classification of patterns in 1-dimensional event streams

Also Published As

Publication number Publication date
EP1057131B1 (de) 2002-05-22
JP2002529817A (ja) 2002-09-10
EP1044417B1 (de) 2002-12-11
WO2000026819A9 (en) 2002-04-11
CN1110761C (zh) 2003-06-04
CN1108579C (zh) 2003-05-14
US6571199B1 (en) 2003-05-27
EP1057131A1 (de) 2000-12-06
JP3412618B2 (ja) 2003-06-03
EP1044417A1 (de) 2000-10-18
JP4250339B2 (ja) 2009-04-08
CA2315147A1 (en) 2000-05-11
CA2315147C (en) 2004-12-28
DE69904435T2 (de) 2003-10-09
DE69904435D1 (de) 2003-01-23
CN1289424A (zh) 2001-03-28
US6785672B1 (en) 2004-08-31
WO2000026818A1 (en) 2000-05-11
JP2002529818A (ja) 2002-09-10
WO2000026819A1 (en) 2000-05-11
DE69901544T2 (de) 2003-01-16
CN1287641A (zh) 2001-03-14

Similar Documents

Publication Publication Date Title
DE69901544D1 (de) Method and apparatus for creating a pattern dictionary for use in the detection of homologous sequences
DE69534695D1 (de) Method and apparatus for generating patterns
DE69930560D1 (de) Method and apparatus for pattern recognition
DE19881919T1 (de) Method and apparatus for creating fingerprints and for authenticating various magnetic media
DE69726316D1 (de) Method and apparatus for forming nozzles
DE69411578D1 (de) Method and apparatus for generating a non-uniform stream of particles for application to a fibrous web
DE69510252T2 (de) Method and apparatus for extracting a moving object using background subtraction
DE69735920D1 (de) Method and apparatus for removing particles from an object surface
DE69417105D1 (de) Apparatus and method for recognizing handwritten symbols
DE69822237D1 (de) Apparatus and method for extracting patterns
ATE184631T1 (de) Method and device for stripping suspended solids and use in fluid cracking processes
DE69721941D1 (de) Apparatus and method for extracting patterns
DE19681378T1 (de) Method and apparatus for engraving
DE69825299D1 (de) Method and apparatus for applying weighted random patterns in partial scan
DE59608553D1 (de) Method and apparatus for producing screen cells in the surface of a gravure cylinder
DE60215075D1 (de) Method and apparatus for generating a spray pattern from a fuel injection valve
DE69904764T2 (de) Method and apparatus for pattern recognition
DE69709965D1 (de) Method and apparatus for pattern recognition
DE50308417D1 (de) Apparatus and method for removing surface areas of a component
DE69732168D1 (de) Method and apparatus for reading dot patterns
DE69616683T2 (de) Apparatus and method for removing residual monomers
DE19883010T1 (de) Method and apparatus for detecting a moving object in a sequence of color frames
DE69833707D1 (de) Method and apparatus for processing a slaughtered bird prior to its evisceration
DE69417273T2 (de) Method and apparatus for pattern recognition
DE69928456D1 (de) Method and apparatus for pattern recognition

Legal Events

Date Code Title Description
8364 No opposition during term of opposition
8320 Willingness to grant licences declared (paragraph 23)
8328 Change in the person/name/address of the agent

Representative's name: DUSCHER, R., DIPL.-PHYS. DR.RER.NAT., PAT.-ANW., 7