CA2020058C - Machine translation apparatus having a process function for proper nouns with acronyms - Google Patents

Machine translation apparatus having a process function for proper nouns with acronyms

Info

Publication number
CA2020058C
CA2020058C CA002020058A CA2020058A CA2020058C CA 2020058 C CA2020058 C CA 2020058C CA 002020058 A CA002020058 A CA 002020058A CA 2020058 A CA2020058 A CA 2020058A CA 2020058 C CA2020058 C CA 2020058C
Authority
CA
Canada
Prior art keywords
dictionary
words
acronym
sentence
word
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CA002020058A
Other languages
French (fr)
Other versions
CA2020058A1 (en
Inventor
Shuzo Kugimiya
Yoji Fukumochi
Ichiko Sata
Tokuyuki Hirai
Hitoshi Suzuki
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sharp Corp
Original Assignee
Sharp Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sharp Corp filed Critical Sharp Corp
Publication of CA2020058A1 publication Critical patent/CA2020058A1/en
Application granted granted Critical
Publication of CA2020058C publication Critical patent/CA2020058C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/247Thesauruses; Synonyms
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/268Morphological analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • G06F40/211Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • G06F40/295Named entity recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/55Rule-based translation

Abstract

ABSTRACT OF THE DISCLOSURE

A machine translation apparatus in which the sentence construction of a source language entered by means of an input device is analyzed in order to generate the corresponding translated text after being converted into a sentence construction in a target language, wherein the machine translation apparatus comprises a device for determining whether or not a word string obtained from a sentence construction analysis is a proper noun with an acronym, a device for examining whether or not the number of first letters of each of a certain number of words corresponds to the number of letters of the acronym, and also for examining whether or nor these words are registered in a dictionary, and a device for outputting the corresponding term after it is translated into a target language, when the words are registered in the dictionary, and for outputting directly the words, whose number of first letters corresponds to the number of the letters of the acronym, without translating them, when the words are not registered in the dictionary.

Description

2~2G~8 The present invention relates to a machine translation apparatus, and more particularly to a machine translation apparatus suitable for processing compound-word proper nouns with acronyms.
In a conventional machine translation apparatus, when a word in a sentence entered into the apparatus is used as a proper noun, and if the word is registered as a proper noun in a conversation dictionary which is consulted, a corresponding translated word is generated according to information obtained by consulting the dictionary. On the contrary, if the word is not registered as a proper noun in the dictionary, a translated word is generated based upon information, other than that for proper nouns, obtained from consulting the dictionary.
As mentioned above, despite the fact that a word in a sentence entered into the apparatus is used as a proper noun, if the word is not registered as a proper noun in a converstion dictionary, a translated word is generated based upon other information obtained by consulting the dictionary, thus resulting in incorrect translation.
The above incorrect translation often occurs, particularly in the case in which a plurality of words make up compound-word proper nouns, The following description illustrates an example of an input sentence and the corresponding output sentence which is incorrectly translated by the conventional machine translation apparatus.
When there is a proper noun, such as ~Organization of Petroleum Exporting Countries (OPEC)" in an entered ~entence, and if a group of these words iB not registered as a compound-word proper noun in the dictionary, a group of the~e translated words ie generated ba~ed upon other information obtained from consulting the dictionary. As a result, the proper noun ie incorrectly tran~lated as "Organization of Petroleum which Exports Countrie~," giving an incorrect meaning to the translated text.

202a~58 Japanese Patent Unexamined Publication No. 63-182775 discloses "a machine translation apparatus" which is constructed such that when a source sentence or a source text including proper nouns is machine-translated, information regarding proper nouns which relate to a preliminary knowledge of the source sentence or the source text, and yet which do not appear in the source sentence or the source text, is generated as part of a translated sentence or a translated text.

SUMMARY OF THE INVENTION

The present invention provides a machine translation apparatus having a process function for proper nouns with acronyms so as to prevent compound-word proper nouns from being incorrectly translated, the apparatus comprising input means for inputting a sentence in a source language;
dictionary means for storing dictionary information in order to translate a source language into a target language; display means for displaying the source language and the target language which has been translated; source language analysis means for obtaining the morphology element string of the entered sentence on the basis of information obtained by consulting the dictionary of said dictionary means, for an individual word in a sentence ,, ,. ,.~, ; ' .
, .

2~2~

entered by said input means; sentence construction analysis means for analyzing, by using the information obtained by consulting the dictionary, the sentence construction of the morphology element string which is obtained by the source language analysis means; converting means for converting, based upon the information obtained by consulting the dictionary, a sentence construction obtained by the sentence construction analysis means into a structure of a sentence construction in the target language; translation text generating means for translating, based upon the structure of the sentence construction in the target language which is obtained by the converting means and based upon the information obtained by consulting the dictionary, the indivldual word into a sentence in the target language; determining means for determining, as a result of consulting the dictionary in the source language analysis means, whether or not the word is a proper noun with an acronym; examining means for examining, when the proper noun with an acronym is determined as a proper noun, whether or not there exist words whose number of the capital letters corresponds to the number of the letter of the acronym, on the beginning or end side of the acronym, and for examining whether or not these words are registered as compound words in the dictionary means; and acronym , .
, . '' "' ' ' ' ' ' ;; --20~5~

processing means for translating the compound words into a sentence in the target language when the compound words are registered as compound words in the dictionary means, and for directly outputting the words, whose number of the capital letters corresponds to that of letters of the acronym, onto the display means without being translated when the compound words are not registered in the dictionary.
According to the present invention, during translation from a source language to a target language, it is determined whether or not a proper noun with an acronym exists in the source language. When it is determined that an acronym exits, it is then examined whether or not a group of words, whose number of the capital letters corresponds to the number of the letters of the acronym, exist at the beginning or the end of the acronym, and it is al80 examined whether or not the words are registered as compound words in the conversation dictionary. When the compound words are registered in the conversation dictlonary, they are output after being generated in a translated text, and when they are not registered in the paired translation dictionary, the words, whose number of the capital letter8 corresponds to the number of the letters of the acronym, are directly output in the source language wlthout being translated. Therefore, according to ~ y . . .
.

` 2~2~8 the present invention, since proper nouns made up of compound words with acronyms which are not registered in a dictionary are directly output in a source language without corresponding translations being generated, it is possible to prevent compound-word proper nouns from being output as incorrect translationsf thereby generatlng translated texts easy to understand.
Other features will become apparent from the following Description of the Preferred Embodiments when read with reference to the accompanying drawings.

DESCRIPTION OF THE DRAWINGS

Fig. 1 is a block diagram illustrating an embodiment of a machine translation apparatus to which the present invention is applied;
Fig. 2 i5 a view explaining translation levels in automatic translation;
Fig. 3 is a schematic illustration of a translation module;
Fig. 4 is a block diagram ~howing the translation module;
Figs. 5 through 9 are views illustrating examples of the contents of the memory for each buffer shown in Fig. 4;
Flg. 10 ls a flow chart showing the operatlon of a , " '"' ,, ~, .

20~a~

consulting dictionary process according to an embodiment;
Fig. 11 is a flow chart showing the operation of an acronym determining process according to the embodiment;
Figs. 12 through 14 are views illustrating the contents of the memory for a buffer B in steps for the consulting dictionary process;
Fig. 15 is a view showing an example of an input sentence and the corresponding output sentence translated by a machine translation apparatus to which the present invention is applied; and Fig. 16 is a view showing an example of an input sentence and the corresponding output sentence translated by a conventional machine translation apparatus.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

In Fig. 1, numeral 1 indicates a maln CPU (central processing unit), 2 indicates a main memory, 3 indicates a CRT (cathode-ray tube), 9 denotes a keyboard, 5 denotes a translation module, 6 denotes a memory which stores a conversation dictionary for translation (hereinafter referred to simply as a dictionary), syntax rules, tree 8tructure conversion rules, etc.
When a source language is entered, the translation module 5 translates the source language and outputs a , ~ - , , . . ~ . .

202~

corresponding target language. That is, the source language entered by using the keyboard 4 is transferred to the translation module 5 under the control of the main CPU
1. The translation module 5 translates, ln a manner which will be described later, the entered source language into a target language by using the dictionary, the syntax rules, the tree structure conversion rules and the like stored in the memory 6. The translated result is temporarily stored in the main memory 2 and is displayed on the CRT 3 at the same time.
The machine translation performed by the translation module 5 generally involves analysis levels as shown ln Flg. 2. As shown ln the upper left slde of Fig. 2, once a source language is entered, analysis advances from a consulting dictionary level L1 to a morphology element analysis level L2, a sentence constructlon analysis level L3, and so on.
The machine translation is largely divided into the following two systems, depending upon the analysls levels.
One i9 a plvot system ln which analysls advances up to a concept level L6 (referred to as an intermediate language) which depends neither on a source language nor on a target language, and from there, generation, such as a context generation level L7, a semantlcs generatlon level L8, a sentence construction generation level L9 and a morphology -"~ 2~2~

element generation level L10, advances in order to generate a target language.
The other system is a transfer system in which analysis advances to any of the following levels, the morphology element analysis level L2, the sentence construction analysis level L3, the semantics analysis level L4 or the context analysis level L5 in order to obtain the internal structure of a source language. The obtained internal structure of the source language is then converted into the internal structure of a target language whose levels are equal to those of the source language, whereby the target language is generated.
The above-mentioned analyses will now be explained.
Consulting dictionary, morphology element analysis In this analysis the dictionary in the memory 6 shown in Fig. 1 is consulted to divide an entered sentence into each morphology element string (word string), so that grammatical information such as a part of speech, and an equivalent translated term with respect to each word are obtained. Further, tense, person, number, etc are analyzed.
Sentence construction analysis In thls analysis, a sentence constructlon ~construction analysis tree) such as modifying ~_ , ' 'l .:
: ' . ... . ' . . . . :' ' ," ' ' ' ' "
,, :

--`` 2~23~58 relationships between words is determined as will be described later.
Semantics analysis In this analysis, a semantically correct sentence construction from a semantically incorrect sentence construction is distinguished based upon the results of a plurality of the sentence construction analyses.
Context analysis In this analysis topics are understood and omissions or ambiguities are eliminated.
The translation module 5 utilized in the present invention analyzes at least up to sentence construction analysis level L~, That is, the translation module 5 in the present invention is the one which comprises the sections shown ln Fig. 3.
Fig. 4 is a block diagram illustrating the above translation module 5. Figs. 5 through 9 are views illustrating examples of the contents of buffers A through E shown in Fig. 4, when an English sentence, such as "This is a pen." is translated into Japanese. The ~nglish-JApanese translation operation will be hereinafter described wlth reference to Flgs. 3 through 9. Flrst, a source sentence read by the apparatus is stored as shown in Fig. 5 in the bu~er A of Fig. 4. Under the control of a - - \
202~58 translation CPU 15 which is based upon a translation program 16, information required for each word of the sentence is obtained by a consulting dictionary morphology element analysis section 11 in Fig. 3 and by using the dictionary in the memory 6 in accordance with the source sentence ~stored in the buffer A, and this information is stored in a buffer B in Fig. 4. Although the part of speech which the word "this" has can vary, the part of speech which pertains to the "this" in the sentence "This ls a pen." is uniquely determined by a sentence construction analysis sectlon 12 connected to the consulting dictionary morphology element analysis section 11. In the sentence construction analysis section 12, a construction analysis tree showing modifying relationships between words ls determined as shown in Fig. 7 accordlng to the dlctlonary and the syntax rules in the memory 6, and is stored in a buffer C shown in Fig. 4. The construction analysis tree is determined in the following manner, that is, according to the following rules:
1. ~entence -~ the sub~ect and the predicate 2. the sub~ect -~ noun phrase 3. the predicate -~ verb and noun phrase 4. noun phrase -~ pronoun 5. noun phrase -~ article and noun The above rules are obtained from the syntax rules in the ~ ~0 - '". . ': ' ;''...' ' ' '' ':
: .. . . . , .. ~:

2~20~8 memory 6. For example, the first rule indicates that "A
sentence consists of the subject and the predicate." The construction analysis tree is hereinafter determined according to these rules.
In a converting section 13 of Fig. 3, in the same manner as in the above-mentioned sentence construction analysis section 12, the structure of the construction analysis tree (see Fig. 7) for the entered English sentence is converted, by using the tree structure conversion rules ln the memory 6, into the structure of the corresponding Japanese sentence construction, as shown in Fig. 8. The obtained result is then stored in a buffer D in Fig. 4. In a translation text generating section 14 of Fig. 3, an approprlate ~oshi or particle "wa" and an auxiliary verb are added to the obtained Japanese character string "This pen is.", so that the string i9 converted into the Japanese sentence shown in Fig. 9, and is stored in a buffer E of Fig. 4. The Japanese sentence "This is a pen." is not only output from the translation module 5 but is also stored in the main memory 2, and ls displayed as well on the CRT 3 at the same tlme.
The descriptlon hitherto given has pertained to the English-Japanese translation operation in a general case.
The features of the present invention will now be described.

.~,, /~

202~a~8 First, the operation of the consulting dictionary process in the consulting dictionary morphology element analysis section 11 according to the present invention will be hereinafter explained with reference to the flow chart shown in Fig. 10. For the explanation, reference should be made to the proper noun with an acronym "Organization of Petroleum Exporting Countries ~OPEC~" illustrated in Fig.
15.
The logical sequence needed for the consulting dictionary starts in step.nl, in which the consulting dictionary process for words in a sentence entered into the apparatus is performed, and when a word is registered in the dictionary, the information obtained by consulting the dictionary is retained in the buffer B so as to correspond to the word. Fig. 12 illustrates the contents of the memory retained ln the buffer B at the completlon of the process in step nl, i.e., at the completion of the consulting dictionary process for, e.g., the word "(OPEC)."
The logical sequence then proceeds to step n2, in which it is determined whether or not the word has been registered in the dictionary. The determination is made by examining whether or not the buffer B is flagged to indicate the registration ~whether or not the buffer ~ contains "1"). If the word has been registered in the dictionary, the logical sequence proceeds to ~tep n8, while on the contrary, if the ,' '' ~ ' , .:' . '' .:
. ~
~ ~ , .. .

2~20~8 word has not yet been registered in the dictionary, the logical sequence proceeds to step n3 in order to perform an acronym determining process which will be described later.
Fig. 13 illustrates the contents of the memory retained in the buffer B if the word "Organization of Petroleum Exporting Countries" has been registered in the dictionary.
After the completion of the acronym determining process, the logical sequence proceeds to step n4, ln which it is determined, based upon the result of the acronym determining process, whether or not the word is an acronym.
If the word is an acronym, the logical sequence proceeds to step n5, on the other hand, if the word is not an acronym, the logical sequence proceeds to step n8. In step n5, the number of the letters of the word which has been determined as an acronym is counted (the number of letters of the word OPEC shown in Fig. lS ls four). The loglcal sequence further proceeds to step n6, in which it is determined whether or not a group of words, whose number of the capital letters corresponds to the number of letters of the acronym, l.e., four, and which is positioned at the beginning of the acronym (OPEC), is registered as a compound-word proper noun in the dictionary. The number of the group of words is counted from the word whlch is positioned in the place closest to the acronym. If the group of words is registered in the dictionary, information 202~

obtained by consulting the dictionary is stored in the buffer B so as to correspond to the group of words. The logical sequence then proceeds to step n8. On the contrary, if the group of words is not registered in the dictionary, the logical sequence proceeds to step n7, in which information, indicating that the above group of words is a proper noun which has not yet been registered, is retained in the buffer B. Fig. lg illustrates the contents of the memory retained in the buffer B at the completion of step n7. The word "Organization" has two parts of speech:
a proper noun (the number of words is five) and a noun ~the number of words is one). The logical sequence proceeds to step n8, in which it is determined whether or not the consulting dlctionary process for all words in the entered sentence is completed. If the process ls completed, the program needed for the consulting dictionary process termlnates there, while on the contrary, if the process is not completed, the logical sequence returns to step nl to proceed with the consultlng dictlonary proces~ for next words.
The above-described acronym determinlng process will now be explained with reference to the flow chart of Fig.
11. The loglcal sequence needed for the acronym determlning process starts in step sl, in whlch it is determined whether or not an entered word is enclosed in 2 0 ~

parentheses, and then proceeds to step s2, in which it is determined whether or not all letters of the word are capital. If the entered word satisfies both of the above conditions required for the acronym determining, the logical sequence proceeds to step s3, while on the contrary, if the entered word satisfies only one of the above conditions, or satisfies neither of the above conditions, it is determined that the entered word is not an acronym.
Further, in step s3 it is determined whether or not "each letter of an acronym corresponds to each of the first letters of a group of words, whose number of the capital letters corresponds to the number of letters of the acronym, and which exists at the beginning of the acronym."
More specifically, ln the example of Fig. 15, in order to compare "C in OPEC" wlth "C in Countries," "E in OPEC" with "E in Exporting," "P in OPEC" with "P in Petroleum," and "O
in OPEC " wlth "O in Organization," a pointer i is set to the rightmost letter of the word OPEC which has been determined as a candldate for the acronym. The logical sequence then proceeds to step s4, in which a pointer ~ is set to the capital letter of the word positioned in the place closest to the word OPEC. The logical sequence proceeds to step 95, in which it is determined whether or not the letter to which the polnter i has been set is ., , .. . .

202~8 identical to the letter to which the pointer j has been set. If the two letters are not identical, it is determined that the word which has been a candidate for the acronym is not an acronym. On the contrary, if the two letters are identical, the logical sequence proceeds to step s6, in which it is determined whether or not there exists a letter to the left of the letter to which the pointer i has been set. If there exists no letter to the left, it is determined that the word is an acronym. If there exists a letter to the left, the logical sequence proceeds to step s7, in which the pointer i is shifted leftward from the position to which the pointer i has been set to a distance proportional to one letter. The logical sequence further proceeds to step s8, in which the pointer ~ ls shifted from the position to which the pointer ~ has been set to the capital letter of the word positioned to the left of the word to which the pointer ~ has been set.
The logical sequence returns to step s5 in order to proceed with the above described comparison of two letters for next letters. When the pointer ~ is shifted to a word whose first letter happens to be a lower-case letter ("of" in the example "Organization of Petroleum Exporting Countries ~OPEC)" shown in Fig. 15), the pointer ~ is further shifted to the word posltioned to the left of the word with the lower-case letter.

/G

, 2~2~8 Thus, it is determined whether or not each letter of a word which is the subject of the acronym determinatlon matches a corresponding capital letter of each word of a group of words which is positioned before the word. If any letters do not correspond to each other, upon finding this mismatch, the word is not regarded as an acronym, while on the contrary, if all the letters correspond to each other, at the completion of matching, the word is regarded as an acronym.
Further, the translation text generating section 14 outputs directly the above-mentioned word without qenerating the corresponding translated word, if information regarding the proper noun which has not yet been registered is added during the process of generating and outputting the translated sentence based upon information which is obtained by consulting the dictionary stored in the buffer B.
Fig. lS is a view showing an example of an input sentence and the corresponding output sentence translated by a machine translatlon apparatus to which the present inventlon is applied. If information regarding the proper noun which has not yet been registered is added with respect to each word of a group of words whose capital letters from an acronym, such as "Organization of Petroleum Exporting Countrie~ ~OPEC)", the above words are output in ' ' ' ' ~: ' : ' -2~2~8 the original form without being translated.
As has been described, according to the present invention, when there is a word which is an acronym in a sentence entered into the apparatus, it is determined whether or not the group of words, which is positioned before the word which is the acronym, and whose number of the capital letters corresponds to the number of letters of the word with the acronym, is registered as a compound-word pronoun in the dictionary. If the group of words is not registered, it is directly output without being translated during the generation of translation.
The invention has been described in detail with particular reference to the preferred embodiments thereof, but it will be understood that variations and modifications of the invention can be made within the spirit and scope of the invention.

"' ' ' '" , ', .,~ , ': ' .~ :: . .. .

Claims (5)

1. A machine translation apparatus having a process function for translating proper nouns used as acronyms, comprising:
input means for entering a sentence in a source language;
dictionary means for storing dictionary information for translating a source language into a target language;
display means for displaying the source language and the target language which has been translated;
source language analysis means for obtaining, on the basis of information obtained by consulting the dictionary information of said dictionary means, a morphology element string for the individual words in the sentence entered in the source language by said input means;
sentence construction analysis means for analyzing, by using said dictionary information obtained by consulting the dictionary, the sentence construction of the morphology element string which is obtained by said source language analysis means;
converting means for converting, based on said dictionary information obtained by consulting the dictionary, a sentence construction obtained by said sentence construction analysis means into a sentence construction in the target language;
translation text generating means for translating, based upon the structure of the sentence construction in the target language which is obtained by said converting means and based upon said dictionary information obtained by consulting the dictionary, the individual words of a sentence entered in the source language into a sentence in the target language;
determining means for determining, as result of consulting the dictionary by said source language analysis means, whether or not an individual word in said source language sentence is a proper noun used as an acronym;
examining means for examining, when said word is determined to be a proper noun used as an acronym, whether or not said target language sentence contains a selected group of words having a number of capital letters which corresponds to the number of letters in the acronym, said selected group of words being positioned on one side of the acronym, and for examining whether or not the selected group of words are registered as compound words in said dictionary means; and acronym processing means for translating the selected groups of words into a sentence in the target language when the selected group of words are registered as compound words in said dictionary means, and for directly outputting the words, whose number of capital letters corresponds to the number of letters in the acronym, onto said display means without being translated when the selected group of words are not registered as compound words in the dictionary.
2. A machine translation apparatus according to claim 1, wherein said dictionary information includes a dictionary, syntax rules and sentence construction conversion rules.
3. A machine translation apparatus according to claim 1, wherein said determining means determines whether or not a first condition specifying that a word string be enclosed in parentheses is satisfied, and whether or not a second condition specifying that the word string be made up solely of capital letters is satisfied, and determines whether or not both of the first and second conditions are satisfied.
4. A machine translation apparatus according to claim 1, wherein said examining means compares the rightmost letter of the acronym with a capital letter of a word positioned in the place closest to the acronym in order to determine whether or not these two letters are the same, and examines, by comparing each of the rest of the letters of the acronym with each corresponding capital letter of each word of a group of words to which the word belongs in a direction moving away from the acronym, to determine whether or not each of the rest of the letters of the acronym is the same as each corresponding capital letter of each word of the group of words, until the leftmost letter of the acronym is compared with the corresponding capital letter of the group of words.
5. A machine translation apparatus according to claim 4, wherein said examining means includes a means for skipping a word which is the subject of comparison when the first letter of the word is a lower-case letter.
CA002020058A 1989-06-30 1990-06-28 Machine translation apparatus having a process function for proper nouns with acronyms Expired - Fee Related CA2020058C (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP170301/1989 1989-06-30
JP1170301A JPH077419B2 (en) 1989-06-30 1989-06-30 Abbreviated proper noun processing method in machine translation device

Publications (2)

Publication Number Publication Date
CA2020058A1 CA2020058A1 (en) 1990-12-31
CA2020058C true CA2020058C (en) 1994-06-07

Family

ID=15902431

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002020058A Expired - Fee Related CA2020058C (en) 1989-06-30 1990-06-28 Machine translation apparatus having a process function for proper nouns with acronyms

Country Status (4)

Country Link
US (1) US5161105A (en)
JP (1) JPH077419B2 (en)
CA (1) CA2020058C (en)
GB (1) GB2234614A (en)

Families Citing this family (39)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH03222065A (en) * 1990-01-26 1991-10-01 Sharp Corp Machine translation device
US5287278A (en) * 1992-01-27 1994-02-15 General Electric Company Method for extracting company names from text
US5867812A (en) * 1992-08-14 1999-02-02 Fujitsu Limited Registration apparatus for compound-word dictionary
JPH0668137A (en) * 1992-08-21 1994-03-11 Hitachi Ltd Operation command object information generating system and operation command object recognition system
US5819260A (en) * 1996-01-22 1998-10-06 Lexis-Nexis Phrase recognition method and apparatus
JP3272288B2 (en) * 1997-12-24 2002-04-08 日本アイ・ビー・エム株式会社 Machine translation device and machine translation method
JP3647283B2 (en) * 1998-09-21 2005-05-11 シャープ株式会社 Document information extraction device
GB2366893B (en) * 2000-09-08 2004-06-16 Roke Manor Research Improvements in or relating to word processor systems or the like
JP2003271389A (en) * 2002-03-19 2003-09-26 Shuichi Araki Method for operating software object in natural language and its program
US7236923B1 (en) 2002-08-07 2007-06-26 Itt Manufacturing Enterprises, Inc. Acronym extraction system and method of identifying acronyms and extracting corresponding expansions from text
US20040044517A1 (en) * 2002-08-30 2004-03-04 Robert Palmquist Translation system
JP4398144B2 (en) * 2002-12-24 2010-01-13 インターナショナル・ビジネス・マシーンズ・コーポレーション Dictionary update system, update server, terminal, control method, program, and recording medium
CA2554890A1 (en) * 2003-12-17 2005-06-30 Speechgear, Inc. Translation tool
JP4356745B2 (en) * 2004-03-18 2009-11-04 日本電気株式会社 Machine translation system, machine translation method and program
JP4424057B2 (en) * 2004-05-10 2010-03-03 富士ゼロックス株式会社 Learning apparatus and program
JP4645065B2 (en) * 2004-05-25 2011-03-09 富士ゼロックス株式会社 Document processing apparatus and program
JP4018668B2 (en) * 2004-05-28 2007-12-05 株式会社東芝 Sino-Japanese machine translation device, Sino-Japanese machine translation method, and Sino-Japanese machine translation program
JP4208081B2 (en) * 2004-12-27 2009-01-14 インターナショナル・ビジネス・マシーンズ・コーポレーション System, web server, method and program for adding personalized value to multiple websites
JP2006276918A (en) * 2005-03-25 2006-10-12 Fuji Xerox Co Ltd Translating device, translating method and program
US20080003551A1 (en) * 2006-05-16 2008-01-03 University Of Southern California Teaching Language Through Interactive Translation
US8706471B2 (en) * 2006-05-18 2014-04-22 University Of Southern California Communication system using mixed translating while in multilingual communication
US8032355B2 (en) * 2006-05-22 2011-10-04 University Of Southern California Socially cognizant translation by detecting and transforming elements of politeness and respect
US8032356B2 (en) * 2006-05-25 2011-10-04 University Of Southern California Spoken translation system using meta information strings
JP4256891B2 (en) * 2006-10-27 2009-04-22 インターナショナル・ビジネス・マシーンズ・コーポレーション Technology to improve machine translation accuracy
US8234107B2 (en) * 2007-05-03 2012-07-31 Ketera Technologies, Inc. Supplier deduplication engine
US8364470B2 (en) * 2008-01-15 2013-01-29 International Business Machines Corporation Text analysis method for finding acronyms
US8706474B2 (en) * 2008-02-23 2014-04-22 Fair Isaac Corporation Translation of entity names based on source document publication date, and frequency and co-occurrence of the entity names
KR101023209B1 (en) * 2008-10-13 2011-03-18 한국전자통신연구원 Document translation apparatus and its method
JP4940325B2 (en) * 2010-03-29 2012-05-30 株式会社東芝 Document proofreading support apparatus, method and program
CN103678288B (en) * 2013-11-30 2016-08-17 武汉传神信息技术有限公司 A kind of method of Automatic proper noun translation
US10261995B1 (en) 2016-09-28 2019-04-16 Amazon Technologies, Inc. Semantic and natural language processing for content categorization and routing
US10223356B1 (en) 2016-09-28 2019-03-05 Amazon Technologies, Inc. Abstraction of syntax in localization through pre-rendering
US10235362B1 (en) 2016-09-28 2019-03-19 Amazon Technologies, Inc. Continuous translation refinement with automated delivery of re-translated content
US10229113B1 (en) 2016-09-28 2019-03-12 Amazon Technologies, Inc. Leveraging content dimensions during the translation of human-readable languages
US10275459B1 (en) * 2016-09-28 2019-04-30 Amazon Technologies, Inc. Source language content scoring for localizability
US10699074B2 (en) * 2018-05-22 2020-06-30 Microsoft Technology Licensing, Llc Phrase-level abbreviated text entry and translation
US10664658B2 (en) 2018-08-23 2020-05-26 Microsoft Technology Licensing, Llc Abbreviated handwritten entry translation
US11693374B2 (en) * 2020-05-28 2023-07-04 Johnson Controls Tyco IP Holdings LLP Building system with string mapping based on a sequence to sequence neural network
CN114997190A (en) * 2022-06-14 2022-09-02 平安科技(深圳)有限公司 Machine translation method, device, computer equipment and storage medium

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2197510A (en) * 1986-10-27 1988-05-18 Sharp Kk Translation apparatus
JPH0658673B2 (en) * 1987-01-23 1994-08-03 松下電器産業株式会社 Machine translation device
US4862408A (en) * 1987-03-20 1989-08-29 International Business Machines Corporation Paradigm-based morphological text analysis for natural languages
US4870610A (en) * 1987-08-25 1989-09-26 Bell Communications Research, Inc. Method of operating a computer system to provide customed I/O information including language translation

Also Published As

Publication number Publication date
CA2020058A1 (en) 1990-12-31
JPH0335354A (en) 1991-02-15
US5161105A (en) 1992-11-03
GB9014489D0 (en) 1990-08-22
GB2234614A (en) 1991-02-06
JPH077419B2 (en) 1995-01-30

Similar Documents

Publication Publication Date Title
CA2020058C (en) Machine translation apparatus having a process function for proper nouns with acronyms
US4814987A (en) Translation system
US5005127A (en) System including means to translate only selected portions of an input sentence and means to translate selected portions according to distinct rules
EP0176858B1 (en) Translation system
US5329446A (en) Translation machine
EP0403057B1 (en) Method of translating sentence including adverb phrase by using translating apparatus
US5023786A (en) Language translator which recognizes and indicates prepositional or indefinite phrases and relative clauses
EP0398513B1 (en) Method and apparatus for translating a sentence including a compound word formed by hyphenation
US5608623A (en) Special cooccurrence processing method and apparatus
US5353221A (en) Translation machine capable of translating sentence with ambiguous parallel disposition of words and/or phrases
US4931936A (en) Language translation system with means to distinguish between phrases and sentence and number discrminating means
EP0357344B1 (en) Computer assisted language translating machine
US4860206A (en) Translation system allowing user designation of postpositional words
US5075851A (en) System for translating a source language word with a prefix into a target language word with multiple forms
US5270928A (en) Translation machine that inhabits translation of selected portions of a sentence using stored non-translation rules
JPH0332106B2 (en)
JPS6190269A (en) Translation system
GB2193018A (en) Translation machine system
GB2199432A (en) Translation machine
GB2211641A (en) Language translation machine
JP2719453B2 (en) Machine translation equipment
JP3255294B2 (en) Machine translation equipment
JP2752025B2 (en) Machine translation equipment
NAGAO 2. 1 A Survey of Natural Language Processing and Machine Translation in Japan
JPH04235671A (en) Machine translation device

Legal Events

Date Code Title Description
EEER Examination request
MKLA Lapsed