CROSS REFERENCE TO RELATED APPLICATION

[0001]
This application is based on and claims the benefit of prior Japanese Patent Application No. 2001356571, filed on Nov. 21, 2001, the entire contents of which are incorporated herein by reference.
BACKGROUND OF THE INVENTION

[0002]
1. Field of the Invention

[0003]
The present invention relates to a semiconductor memory device such as a NANDtype flash memory, more particularly to a semiconductor memory device having an onchip error correcting function.

[0004]
2. Description of the Related Art

[0005]
The NANDtype flash memory is known to deteriorate its cell property through repeated operations of rewriting, and to vary data after it is left for a long time. In order to improve the reliability of the NANDtype flash memory, such a semiconductor memory that contains an ECC (Error Correcting Code) circuit mounted onchip for error detection and correction has been proposed in the art (for example, Japanese Patent Application LaidOpen Nos. 2000348497 and 200114888).

[0006]
[0006]FIG. 21 is a block diagram briefly showing an arrangement of the conventional NANDtype flash memory with ECC circuits mounted thereon.

[0007]
This memory comprises eight memory cell areas 1 _{0}, 1 _{1}, . . . , 1 _{7}. Each of the memory cell areas 1 _{0}, 1 _{1}, . . . , 1 _{7 }includes a plurality of memory cells, not depicted, arrayed in a matrix. Data of 528 bits (=one page) can be written in and read out from 528 memory cells connected to a common word line through 528 bit lines at a time. Page buffers 2 _{0}2 _{7 }are connected to the memory cell areas 1 _{0}1 _{7}, respectively. Each page buffer can hold 528bit write data and read data. Between the page buffers 2 _{0}2 _{7 }and I/O terminals 4 _{0}4 _{7 }located corresponding to the memory cell areas 1 _{0}1 _{7}, ECC circuits 3 _{0}3 _{7 }are provided for the memory cell areas 1 _{0}1 _{7}, respectively.

[0008]
Each ECC circuit 3 _{0}3 _{7 }has a coding function to add a certain bit number of check bits (ECC) to one page of information bits (528 bits) to be stored in each memory cell area 1 _{0}1 _{7}, and a decoding function to detect and correct a certain bit number of errors in the information bits with the check bits added thereto. BCH (BoseChaudhuriHocquenghem) code is employed as an error correcting code that can correct a plurality of bit errors with a relatively small circuit scale. Between the memory and external, data is read and written on a basis of 8 bits corresponding to the number of memory cells. Data is fed bit by bit into each ECC circuit 3 _{0}3 _{7}, and is circulated through and output from an internal cyclic shift register bit by bit to execute coding and decoding.

[0009]
Operations of coding and decoding in the conventional ECC circuit 3 _{0}_{37 }using BCH code will be described next.

[0010]
The number of check bits in BCH code for correcting 2bit errors and detecting 3bit errors is equal to 21 bits for 528 information bits. For convenience of description, a simple error detection and correction system is described, which employs BCH code capable of correcting 2bit errors and detecting 3bit errors for the number of information bits, k=7, a code length, n=15, and the number of check bits, t=2.

[0011]
In this case, a generating polynomial required for coding and decoding is given below as it is generally known:
$\begin{array}{cc}\begin{array}{c}\mathrm{Fundamental}\ue89e\text{\hspace{1em}}\ue89e\mathrm{Polynomial}\ue89e\text{:}\ue89e\text{\hspace{1em}}\ue89eF\ue8a0\left(X\right)={X}^{4}+X+1\\ \alpha \ue89e\text{\hspace{1em}}\ue89e\mathrm{Minmal}\ue89e\text{\hspace{1em}}\ue89e\mathrm{Polynomial}\ue89e\text{:}\ue89e\text{\hspace{1em}}\ue89e{M}_{1}\ue8a0\left(x\right)={X}^{4}+X+1\\ {\alpha}^{3}\ue89e\text{\hspace{1em}}\ue89e\mathrm{Minimal}\ue89e\text{\hspace{1em}}\ue89e\mathrm{Polynomial}\ue89e\text{:}\ue89e\text{\hspace{1em}}\ue89e{M}_{3}\ue8a0\left(x\right)={X}^{4}+{X}^{3}+{X}^{2}+{X}^{\text{\hspace{1em}}}+1\\ \mathrm{Generating}\ue89e\text{\hspace{1em}}\ue89e\mathrm{Polynomial}\ue89e\text{:}\ue89e\text{\hspace{1em}}\ue89eG\ue8a0\left(x\right)={M}_{1}\ue89e{M}_{3}\\ ={X}^{8}+{X}^{7}+{X}^{6}+{X}^{4}+1\end{array}& \left(1\right)\end{array}$

[0012]
(1) Coder

[0013]
[0013]FIG. 22 is a block diagram showing a coder 10 functionally configured inside the conventional ECC circuit 3 i (i=0, 1, . . . , or 7). The coder 10 comprises a shift register 11 consisting of registers D_{7}, D_{6}, D_{5}, D_{4}, D_{3}, D_{2}, D_{1}, D_{0}, XOR circuits 12 _{1}, 12 _{2}, 12 _{3}, 12 _{4 }for modulo2 operations, and circuit changing switches SW1, SW2.

[0014]
An operation for moving the shift register 11 once corresponds to multiplying each value in the shift register 11 by X. A value of data stored in the shift register 11 can be expressed by:

a_{0}X^{0}+a_{1}X^{1}+a_{2}X^{2}+a_{3}X^{3}+a_{4}X^{4}+a_{5}X^{5}+a_{6}X^{6}+a_{7}X^{7} (2)

[0015]
where a_{i }denotes a value stored in a register D_{i}, and a_{i}=0 or 1 (i=07). When this is shifted once, the following is obtained:

a_{0}X^{1}+a_{1}X^{2}+a_{2}X^{3}+a_{3}X^{4}+a_{4}X^{5}+a_{5}X^{6}+a_{6}X^{7}+a_{7}X^{8} (3)

[0016]
From the generating polynomial G(x) given by Expression (1), a relation of X^{8}=X^{7}+X^{6}+X^{4}+1 is derived. Therefore, Expression (3) can be represented by:

a_{7}X^{0}+a_{0}X^{1}+a_{1}X^{2}+a_{2}X^{3}+(a_{3}+a_{7})X^{4}+a_{4}X^{5 }+(a_{5}+a_{7})X^{6}+(a_{6}+a_{7})X^{7} (4)

[0017]
This corresponds to shifting each bit; storing the value a_{7 }of the register D_{7 }into the register D_{0}; adding the values a_{3}, a_{7 }of the registers D_{3}, D_{7 }at the XOR circuit 12 _{1 }and storing the sum into the register D_{4}; adding the values a_{5}+a_{7 }of the registers D_{5}, D_{7 }at the XOR circuit 12 _{2 }and storing the sum into the register D_{6}; and adding the values a_{6}+a_{7 }of the registers D_{6}, D_{7 }at the XOR circuit 12 _{3 }and storing the sum into the register D_{7}.

[0018]
On coding, the switches SW1, SW2 are first connected to ON sides to enter input data (information bits) I_{0}, I_{1}, I_{2}, I_{3}, I_{4}, I_{5}, I_{6 }(I_{0}I_{6}=0 or 1) bit by bit from external through the I/O terminal 4 i. Every time one bit of the input data I_{0}I_{6 }enters, the shift register 11 operates once. As the switch SW1 is kept ON during the input data I_{0}I_{6 }entering, the data is output bit by bit to the page buffer 2 i as it is. At the same time, the input data I_{0}I_{6 }is added to the value a_{7 }of the register D_{7 }at the XOR circuit 12 _{1 }and the sum is stored in turn into the shift register 11. After completion of the input data I_{0}I_{6 }entered into the page buffer 2 i, check bits I_{7}, I_{8}, I_{9}, I_{10}, I_{11}, I_{12}, I_{13}, I_{14 }are stored inside the registers D_{7}, D_{6}, D_{5}, D_{4}, D_{3}, D_{2}, D_{1}, D_{0 }of the shift register 11, respectively. The switches SW1, SW2 are then connected to OFF sides and, every time the shift register 11 operates, the check bits I_{7}I_{14 }are output serially to the page buffer 2 i through the switch SW1. The information bits and check bits stored in the page buffer 2 i are written into the memory cell area 1 i. At the same time, the value in the shift register 11 is reset.

[0019]
(2) Decoder

[0020]
A decoder is described next. The decoder comprises syndrome computational circuits and an error position detector. In the case of 2bit error detection, two syndromes S_{1}, S_{3 }are required for decoding. These syndromes can be derived from the minimal polynomial M_{1}(x)=X^{4}+X+1 as it is known. FIG. 23 specifically shows (A) a conventional S_{1 }syndrome computational circuit 20 and (B) a conventional S_{3 }syndrome computational circuit 30.

[0021]
Based on the minimal polynomial M_{1}(x), the S_{1 }syndrome computational circuit 20 in FIG. 23A comprises a shift register 21 consisting of registers D_{3, D} _{2}, D_{1}, D_{0}, and XOR circuits 22 _{1}, 22 _{2}. An operation for moving the shift register 21 once corresponds to multiplying a value in the shift register 21 by X. The value stored in the shift register 21 can be expressed by:

a_{O}X^{0}+a_{1}X^{1}+a_{2}X^{2}+a_{3}X^{3} (5)

[0022]
where a_{i }denotes a value stored in a register D_{i}, and a_{i}=0 or 1 (i=03). When this is shifted once, the following is obtained:

a_{0}X^{1}+a_{1}X^{2}+a_{2}X^{3}+a_{3}X^{4} (6)

[0023]
From the α minimal polynomial M_{1}(x), a relation of X^{4}=X +1 is derived. Accordingly:

a_{3}X^{0}+(a_{0}+a_{3})X^{1}+a_{1}X^{2}+a_{2}X^{3} (7)

[0024]
This corresponds to shifting each bit; storing the value a_{3 }of the register D_{3 }into the register D_{0}; and adding the values a_{0}, a_{3 }of the registers D_{0}, D_{3 }at the XOR circuit 12 _{2 }and storing the sum into the register D_{1}. The information bits I_{0}I_{6 }and check bits I_{7}I_{14 }are fed in this order into the S_{1 }syndrome computational circuit 20 bit by bit. The shift register 21 operates once every time one bit enters. After all bits I_{0}I_{14 }enter, the syndrome S_{1 }is generated in the shift register 21 (D_{0}D_{3}).

[0025]
Similar to the S_{1 }syndrome computational circuit 20, the S_{3 }syndrome computational circuit 30 in FIG. 23B comprises a shift register 31 consisting of registers D_{3}, D_{2}, D_{1}, D_{0}, and XOR circuits 32 _{1}, 32 _{2}, 32 _{3}, 32 _{4}. It is configured by the X^{3 }circuit of the minimal polynomial M_{1}(x). In the S_{3 }syndrome computational circuit 30, an operation for moving the shift register 31 once corresponds to multiplying a value in the shift register 31 by X^{3}. The value stored in the shift register 31 is expressed by Expression (5). When it is multiplied by X^{3}, the following is given:

a_{0}X^{3}+a_{1}X^{4}+a_{2}X^{5}+a_{3}X^{6} (8)

[0026]
From the α minimal polynomial M_{1}(x), a relation of X^{4}=X +1 is derived. Accordingly:

a_{1}X^{0}+(a_{1}+a_{2})X^{1}+(a_{2}+a_{3})X^{2}+(a_{0}+a_{3})X^{3} (9)

[0027]
This corresponds to shifting each bit; storing the value a_{1 }of the register D_{1 }into the register D_{0}; adding the values a_{1}, a_{2 }of the registers D_{1}, D_{2 }at the XOR circuit 32 _{2 }and storing the sum into the register D_{1}; adding the values a_{2}, a_{3 }of the registers D_{2}, D_{3 }at the XOR circuit 32 _{3 }and storing the sum into the register D_{2}; and adding the values a_{0}, a_{3 }of the registers D_{0}, D_{3 }at the XOR circuit 32 _{4 }and storing the sum into the register D_{3}. The information bits I_{0}I_{6 }and check bits I_{7}I_{14 }stored in the memory cells are also fed in this order into the S_{3 }syndrome computational circuit 30 bit by bit. The shift register 31 operates once every time one bit enters. After all bits I_{0}I_{14 }enter, the syndrome S_{3 }is generated in the shift register 31 (D_{0}D_{3}).

[0028]
[0028]FIG. 24 is a flowchart showing an algorithm for decoding. The S_{1}, S_{3 }syndrome computational circuits 20, 30 compute syndromes S_{1}, S_{3 }first based on the information bits and check bits read out from the memory cell area 1 i (step S1). If the syndromes S_{1}, S_{3 }are S1=S3=0, it is determined errorless, and the readout information bits are output as they are (steps S2, S3, S4). If only one of the syndromes S_{1}, S_{3 }is equal to 0, it is determined uncorrectable, and the data is output as it is (steps S2, S3, S5, S6, S7). If S_{i}≠0 and S_{3}≠0, computations are executed to derive σ_{1}=S_{1} ^{2 }and σ_{2}=S_{1} ^{3}+S_{3 }(steps S2, S6, S8). If σ_{2}=0 (step S9), it can be found that a 1bit error is present, and 1bit corrected data is output (step S10). If σ_{2}≠0 (step S9), it can be found that 2bit errors are present, and 2bit corrected data is output (step S11).

[0029]
The position of the error bit can be found by assigning Z=α^{I}(I=0, 1, 2, 3, 4, 5, 6) in turn to an error position polynomial σ(Z) represented by Expression (10) as it is known generally. The position of the error can be indicated by i that holds σ(α^{I})=0.

σ(Z)=S _{i}+σ_{1} ×Z+σ _{2} ×Z ^{2} (10)

[0030]
An arrangement of the error position detector is shown in FIGS. 25 and 26, which is configured based on such the point. FIG. 25 shows a first arithmetic section 40 a that computes and stores S_{1}, σ_{1 }and σ_{2}. FIG. 26 shows a second arithmetic section 40 b that executes the operation of Expression (10) based on the operated result from the first arithmetic section 40 a and outputs a detection signal to indicate the error position in the data. As shown in FIG. 25, the first arithmetic section 40 a comprises a shift register 41, an X arithmetic circuit 42, and an X^{2 }arithmetic circuit 43. A shift register 41 a stores the syndrome S_{1}, and shift registers 42 a and 43 a store the operated results, σ_{1}=S_{1} ^{2 }and σ_{22}=S_{1} ^{3}+S_{3}. It is assumed that the shift register 42 a has a value of:

a_{0}X^{0}+a_{1}X^{1}+a_{2}X^{2}+a_{3}X^{3} (11)

[0031]
where a_{i }denotes a value stored in a register D_{i}, and a_{i}=0 or 1 (i=03). As the X arithmetic circuit 42 multiplies it by X, the value of the shift register 42 a comes to:

a_{0}X^{1}+a_{1}X^{2}+a_{2}X^{3}+a_{3}X^{4} (12)

[0032]
From the α minimal polynomial M_{1}(x), a relation of X^{4}=X +1 is present. Accordingly, Expression (12) yields:

a_{3}X^{0}+(a_{0}+a_{3})X^{1}+a_{1}X^{2}+a_{2}X^{3} (13)

[0033]
This corresponds to shifting each bit; storing the value a_{3 }of the register D_{3 }into the register D_{0}; and adding the values a_{0}, a_{3 }of the registers D_{0}, D_{3 }at the XOR circuit 42 _{2 }and storing the sum into the register D_{1}.

[0034]
The X^{2 }arithmetic circuit 43 multiplies the value of the shift register 43 a by X^{2}. Therefore, when the value indicated by Expression (11) is stored in the shift register 43 a, and it is multiplied by X^{2}, the value of the shift register 43 a comes to:

a_{0}X^{2}+a_{1}X^{3}+a_{2}X^{4}+a_{3}X^{5} (14)

[0035]
From the α minimal polynomial M_{1}(x), a relation of X^{4}=X +1 is present. Accordingly, Expression (14) yields:

a_{2}X^{0}+(a_{2}+a_{3})X^{1}+(a_{0}+a_{3})X^{2}+a_{1}X^{3} (15)

[0036]
This corresponds to shifting each bit; storing the value a_{2 }of the register E_{2 }into the register E_{0}; storing the value a_{1 }of the register E_{1 }into the register E_{3}; adding the values a_{2}, a_{3 }of the registers E_{2}, E_{3 }at the XOR circuit 43 b _{1 }and storing the sum into the register E_{1}; and adding the values a_{0}, a_{3 }of the registers E_{0}, E_{3 }at the XOR circuit 43 b _{2 }and storing the sum into the register E_{2}.

[0037]
When 1bit data I_{0}I_{6 }is output, one shift operation of the shift registers 41 a, 42 a, 43 a multiplies the term of σ_{1 }by Z in the X arithmetic section 42 and the term of σ_{2 }by Z^{2 }in the X^{2 }arithmetic section 43. The NANDtype flash memory operates the shift registers 41 a, 42 a, 43 a in synchronization with the toggle signal that is employed to output the information bits stored in the memory cell to outside the chip. In the second arithmetic circuit 40 b, the result from the operation through an XOR circuit 44 and an NOR gate 45 exhibits “1” at the error position. This output is employed to invert the corresponding data Ii to detect and correct the error.

[0038]
Thus, in the conventional ECC circuit that employs BCH code, one shift and computation per 1bit input is the basic operation. The NANDtype flash memory receives parallel data input from external on a basis of 8I/O or 16I/O per address. Therefore, it is required to correct an error per I/O or compute 8 or 16 times during the one input. The 8 or 16time computation during the one input needs a fast operation for this part, which can not be achieved practically because a special process is required, for example.

[0039]
Therefore, an ECC circuit 3 i is provided for each memory cell area 1 i (each I/O) in the art to correct errors on a basis of each memory cell area 1 i. The NANDtype flash memory reads and programs data per page (528 bytes). If it intends to correct 2bit errors and detect 3bit errors per I/O, it requires 21 check bits for 528 information bits, 21×8=168 extra check bits in total for the entire chip. This is an inhibit factor for improving the chip integration density.

[0040]
The present invention has been made in consideration of such the problem and accordingly has an object to provide a semiconductor memory device capable of reducing the number of check bits relative to the number of information bits to improve a chip integration density.
BRIEF SUMMARY OF THE INVENTION

[0041]
According to an aspect of the invention, a semiconductor memory device comprises a plurality of memory cell areas, each of which includes a plurality of memory cells arrayed in a matrix and has a data I/O portion; a plurality of buffers, each of which is located on the data I/O portion at each memory cell area to temporarily store data to be written into the memory cell area and data read out from the memory cell area; a plurality of I/O terminals, each of which is configured to receive the data to be written into the memory cell area from external and output the data read out from the memory cell area to external; and an error correction circuit located between the plurality of I/O terminals and the plurality of buffers, the error correction circuit includes a coder configured to generate check bits for error correcting and to attach the check bits to the data to be written into the memory cell area and a decoder configured to process for error correcting the data read out from the memory cell area with the generated check bits, the error correction circuit operates to allocate a set of check bits to an information bit length of M×N (N denotes an integer of two or more) to execute at least one of coding and decoding by parallel processing Nbit data, where M denotes the number of bits in a unit of data to be written into and read out from the memory cell area.
BRIEF DESCRIPTION OF THE DRAWINGS

[0042]
The present invention will be more fully understood from the following detailed description with reference to the accompanying drawings, in which:

[0043]
[0043]FIG. 1 is a block diagram showing an arrangement of a coder for use in an ECC circuit mounted on a flash memory according to a first embodiment of the present invention;

[0044]
[0044]FIG. 2 is a block diagram showing an arrangement of a shift register for use in the coder;

[0045]
[0045]FIG. 3 is a truth table of an XOR circuit for use in the coder;

[0046]
FIGS. 4A and AB are block diagrams showing syndrome computational circuits in a decoder for use in the ECC circuit;

[0047]
[0047]FIG. 5 is a block diagram showing a first arithmetic section contained in an error position detector for use in the decoder;

[0048]
[0048]FIG. 6 is a block diagram showing a second arithmetic section contained in the error position detector;

[0049]
[0049]FIG. 7 is a block diagram showing a NANDtype flash memory according to a second embodiment of the present invention;

[0050]
[0050]FIG. 8 is a circuit diagram showing an arrangement of a memory cell area in the flash memory;

[0051]
[0051]FIG. 9 is a block diagram showing an ECC circuit in the flash memory;

[0052]
[0052]FIG. 10 shows registers contained in an arithmetic logic circuit on coding in the ECC circuit;

[0053]
[0053]FIG. 11 is a flowchart showing an operation of coding in the coder;

[0054]
[0054]FIG. 12 is a timing chart on coding;

[0055]
[0055]FIG. 13 shows registers contained in an arithmetic logic circuit for decoding in the ECC circuit;

[0056]
[0056]FIG. 14 is a flowchart showing an operation of decoding;

[0057]
[0057]FIG. 15 is a block diagram of an error position detector in the ECC circuit;

[0058]
[0058]FIG. 16 is a flowchart showing an algorithm for computing each term in an error position polynomial in the error position detector;

[0059]
[0059]FIGS. 17A, 17B and 17C are block diagrams of a Galois arithmetic circuit in the ECC circuit;

[0060]
[0060]FIG. 18 shows a second arithmetic section in the error position detector;

[0061]
[0061]FIG. 19 is a block diagram of another error position detector in the ECC circuit;

[0062]
[0062]FIGS. 20A and 20B are timing charts on decoding in the ECC circuit;

[0063]
[0063]FIG. 21 is a block diagram showing an arrangement of the NANDtype flash memory with conventional ECC circuits mounted thereon;

[0064]
[0064]FIG. 22 is a block diagram showing a coder in the conventional ECC circuit;

[0065]
[0065]FIGS. 23A and 23B are block diagrams showing conventional syndrome computational circuits;

[0066]
[0066]FIG. 24 is a flowchart showing a decoding algorithm in the conventional ECC circuit;

[0067]
[0067]FIG. 25 is a block diagram showing a first arithmetic section contained in an error position detector in the conventional ECC circuit; and

[0068]
[0068]FIG. 26 is a block diagram showing a second arithmetic section contained in the error position detector in the conventional ECC circuit.
DETAILED DESCRIPTION OF THE INVENTION

[0069]
Embodiments of the present invention will be described below with reference to the drawings.

[0070]
(1) First Embodiment

[0071]
In order to provide an understanding of the present invention, 2bit error correction is exemplified as a first embodiment with the number of information bits, k=7, a code length, n=15, and the number of correction bits, t 2.

[0072]
(11) Coder

[0073]
When input data I_{0 }enters the conventional coder 11 shown in FIG. 22, the input data I_{0 }is added at the XOR circuit 12 _{4 }to the term of X^{7 }in the coder, then multiplied by X. Each register 11 in the coder 10 in the initial state has a value of 0, which is referred to as (0). Accordingly:

(0+I_{0}X^{7})X (17)

[0074]
When next input data I_{1 }enters the coder 10, the input data I_{1 }is added to the term of X^{7 }in the coder 10, then multiplied by X to yield:

((0+I_{0}X^{7})X+I_{1}X^{7})X (18)

[0075]
When next input data I_{2 }enters the coder 10, the input data I_{2 }is added to the term of X^{7 }in the coder 10, then multiplied by X to yield:

(((0+I_{0}X^{7})X+I_{1}X^{7})X+I_{2}X^{7})X (19)

[0076]
Similarly, after input data, up to I_{6}, enters the coder 10, the following is given:

(((((((0+I_{0}X^{7})X+I_{1}X^{7})X+I_{2}X^{7})X+I_{3}X^{7})X +I_{4}X^{7})X+I_{5}X^{7})X+I_{6}X^{7})X (20)

[0077]
This expression can be altered in:

((((0+I_{0}X^{7})X+I_{1}X^{6})X^{2}+I_{2}X^{7})X+I_{3}X^{6})X^{2 }+I_{4}X^{7})X^{2}+I_{5}X^{6})X^{2}+I_{6}X^{7})X (21)

[0078]
This means that the pieces of input data I_{0}, I_{1 }are added to the terms of X^{7}, X^{6 }in the coder 10, respectively, then multiplied by X^{2}. Thereafter, the pieces of input data I_{2}, I_{3 }are added to the terms of X^{7}, X^{6 }in the coder 10, respectively, then multiplied by X^{2}. Finally the pieces of input data I_{4}, I_{5 }are added to the terms of X^{7}, X^{6 }in the coder 10, respectively, then multiplied by X^{2}. In a word, one operation of the shift register 11 after two bits input can multiply the data by X^{2}. As for the last data I_{6}, however, one bit input multiplies it by X as is in the art.

[0079]
When the value of the shift register 11 represented by Expression (2) is multiplied by X^{2}, it comes to:

a_{0}X^{2}+a_{1}X^{3}+a_{2}X^{4}+a_{3}X^{5}+a_{4}X^{6}+a_{5}X^{7}+a_{6}X^{8}+a_{7}X^{9} (22)

[0080]
From the generating polynomial G(x) given by Expression (1), a relation of X^{8}=X^{7}+X^{6}+X^{4}+1 is derived. Therefore, Expression (21) yields:

(a_{6}+a_{7})X^{0}+a_{7}X^{1}+a_{0}X^{2}+a_{1}X^{3}+(a_{2}+a_{6}+a_{7})X^{4}+(a_{3}+a_{7})X^{5}+(a_{4}+a_{6}+a_{7})X^{6}+(a_{5}+a_{6})X^{7} (23)

[0081]
[0081]FIG. 1 is a block diagram showing a circuit arrangement of a coder 50 according to the present embodiment that specifically configures Expression (23).

[0082]
The coder 50 comprises a shift register 51 consisting of registers D_{7}, D_{6}, D_{5}, D_{4}, D_{3}, D_{2}, D_{1}, D_{0}, XOR circuits 521, 522, 523, 524, 525, 526, 527, and four switches SW11, SW12, SW21, SW22 for changing input data and output data. The shift register 51 includes fourstage transfer gates 51 a and other necessary gate circuits 51 b as shown in FIG. 2. In the transfer gates 51 a, a reset signal RSTn is employed to reset the contents of data and a clock signal CLK to synchronously transfer 1bit data from an input terminal IN to an output terminal OUT. An XOR circuit 52 applies a modulo2 operation to data input from input terminals IN1, IN2, as shown in FIG. 3, and output the result from an output terminal OUT.

[0083]
Based on Expression (23), the coder 50 through one shift operation performs: adding the values a_{6}, a_{7 }of the registers D_{6}, D_{7 }at the XOR gate 52 _{6 }and storing the sum into the register D_{0}; storing the value a_{7 }of the register D_{7 }into the register D_{1}; storing the value a_{0 }of the register D_{0 }into the register D_{2}; storing the value a, of the register D, into the register D_{3}; adding the values a_{2}, a_{6}, a_{7 }of the registers D_{2}, D_{6}, D_{7 }at the XOR gates 52 1 , 52 _{6 }and storing the sum into the register D_{4}; adding the values a_{3}, a_{7 }of the registers D_{3}, D_{7 }at the XOR gate 52 _{2 }and storing the sum into the register D_{5}; adding the values a_{4}, a_{6}, a_{7 }of the registers D_{4}, D_{6}, D_{7 }at the XOR gates 52 _{3}, 52 _{6 }and storing the sum into the register D_{6}; and adding the values a_{5}, a_{6 }of the registers D_{5}, D_{6 }at the XOR gate 52 _{5 }and storing the sum into the register D_{7}.

[0084]
The pieces of input data (information bits) I_{0}, I_{1}, I_{2}, I_{3}, I_{4}, I_{5}, I_{6}, given from external to be written into the memory, are divided into two: input data I_{0}, I_{2}, I_{4 }and input data I_{1}, I_{3}, I_{5}. The input data I_{0}, I_{2}, I_{4 }is fed to ON sides of the switches SW11, SW21. The input data I_{1}, I_{3}, I_{5 }is fed to ON sides of the switches SW12, SW22. The pieces of input data are fed by two bits in parallel in an order of (I_{0}, I_{1}), (I_{2}, I_{3}), (I_{4}, I_{5}). After the input, the shift register 51 operates once. As the shift register 51 is connected to every other one, one shift operation multiplies the data by X^{2}. While the pieces of data (I_{0}, I_{1}), (I_{2}, I_{3}), (I_{4}, I_{5}) enter, the switches SW11, SW12, SW21, SW22 are all kept ON to allow these pieces of data to output by two bits in parallel as they are. At the same time, the data I_{0}, I_{2}, I_{4 }is added to the value a_{7 }of the register D_{7 }at the XOR circuit 52 _{7 }and sequentially stored in the shift register 51. The data I_{1}, I_{3}, I_{5 }is added to the value a_{7 }of the register D_{7 }at the XOR circuit 52 _{4 }and sequentially stored in the shift register 51. As the last I_{6 }of the input data is 1bit input, the connection is switched to the same as in the conventional coder 10 shown in FIG. 22. Such the switching is required because k=7 is selected as the number of information bits. After completion of input of the data I_{0}, I_{1}, I_{2}, I_{3}, I_{4}, I_{5}, I_{6}, check bits I_{7}, I_{8}, I_{9}, I_{10}, I_{11}, I_{12}, I_{13}, I_{14 }are stored inside the registers D_{7}, D_{6}, D_{5}, D_{4}, D_{3}, D_{2}, D_{1}, D_{0 }in the shift register 51, respectively. The switches SW11, SW12, SW21, SW22 are then all connected to OFF sides. Thus, every time the shift register 51 operates, the check bits I_{1}, I_{9}, I_{11}, I_{13 }are fed to the output of the switch SW11 and the check bits I_{8}, I_{10}, I_{12}, I_{14 }to the output of the switch SW12. At the same time, the value in the shift register 51 is reset. This allows check bits to be generated through 2bit input parallel processing.

[0085]
(12) Decoder

[0086]
{circle over (1)} S_{1 }Syndrome Computational Circuit

[0087]
In the conventional S_{1 }syndrome computational circuit 20 of FIG. 23A, the value in the S_{1 }syndrome computational circuit 20 is first multiplied by X, then the input data I_{0 }is added to the term of X^{0 }at the XOR circuit 22 _{1}. The shift register 21 in the S_{1 }syndrome computational circuit 20 in the initial state has a value of 0, which is referred to as (0). Accordingly:

0×X+I_{0} (24)

[0088]
After the value in the S_{1 }syndrome computational circuit 20 is multiplied by X, the input data I_{1 }is added to the term of X_{0}. Accordingly:

(0×X+I_{0})X+I_{1} (25)

[0089]
Subsequently, after the value in the S_{1 }syndrome computational circuit 20 is multiplied by X, the input data I_{2 }is added to the term of X^{0}. Accordingly:

((0×X+I_{0})X+I_{1})X+I_{2} (26)

[0090]
When the input data, up to I_{14}, enters the S_{1 }syndrome computational circuit 20, the following is given:

(((((((((((((0×X+I_{0})X+I_{1})X+I_{2})X+I_{3})X+I_{4})X+I_{5})X+I_{6})X+I_{7})X+I_{8})X+I_{9})X+I_{10})X+I_{11})X+I_{12})X+I_{13})X+I_{14} (27)

[0091]
The expression can be altered in:

(((((((0×X^{2}+I_{0}X+I_{1})X^{2}+I_{2}X+I_{3})X^{2}+I_{4}X+I_{5})X^{2}+I_{6}X+I_{7})X^{2}+I_{8}X+_{9})X^{2}+I_{10}X +I_{11})X^{2 }+I_{12}X+I_{13})X+I_{14} (28)

[0092]
This means that after the value in the S_{1 }syndrome computational circuit 20 is multiplied by X^{2}, the input data I_{0 }is added to the term of X^{1}, and the input data I_{1 }to the term of X^{0}. Then, after the value in the S_{1 }syndrome computational circuit 20 is multiplied by X^{2}, the input data I_{2 }is added to the term of X^{1}, and the input data I_{3 }to the term of X^{0}. Next, after the value in the S_{1 }syndrome computational circuit 20 is multiplied by X^{2}, the input data I_{4 }is added to the term of X^{1}, and the input data I_{5 }to the term of X^{0}. In a word, one operation of the shift register multiplies the data by X^{2}, then 2bit data enters. Finally, after the value in the S_{1 }syndrome computational circuit 20 is multiplied by X, the input data I_{14 }is added to the term of X_{0 }by 1bit input.

[0093]
When the value of the shift register 21, expressed by Expression (5), is multiplied by X^{2}, the following is given:

a_{0}X^{2}+a_{1}X^{3}+a_{2}X^{4}+a_{3}X^{5} (29)

[0094]
From the α minimal polynomial M_{1}(x), a relation of X^{4}=X +1 is derived. Accordingly:

[0095]
a_{2}X^{0}+(a_{2}+a_{3})X^{1}+(a_{0}+a_{3})X^{2}+a_{1}X^{3} 30

[0096]
[0096]FIG. 4A is a block diagram showing a circuit arrangement of an S_{1 }syndrome computational circuit 60 according to the present embodiment that specifically configures Expression (30).

[0097]
The S_{1 }syndrome computational circuit 60 comprises a shift register 61 consisting of registers D_{0}, D_{1}, D_{2}, D_{3}, and XOR circuits 62 _{1}, 62 _{2}, 62 _{3}, 62 _{4}.

[0098]
Based on Expression (30), the S_{1 }syndrome computational circuit 60 through one shift operation performs: storing the value a_{2 }of the register D_{2 }into the register D_{0}; adding the values a_{2}, a_{3 }of the registers D_{2}, D_{3 }at the XOR circuit 62 _{2 }and storing the sum into the register D_{1}; adding the values a_{0}, a_{3 }of the registers D_{0}, D_{3 }at the XOR circuit 62 _{4 }and storing the sum into the register D_{2}; and storing the value a_{1 }of the register D, into the register D_{3}.

[0099]
The information bits I_{0}, I_{1}, I_{2}, I_{3}, I_{4}, I_{5}, I_{6 }and check bits I_{7}, I_{8}, I_{9}, I_{10}I_{11}, I_{12}, I_{13}, I_{14 }read out from the memory cell area, not depicted, are divided into I_{0}, I_{2}, I_{4}, I_{6}, I_{8}, I_{10}, I_{12}, I_{14 }and I_{1}, I_{3}, I_{5}, I_{7}, I_{9}, I_{11}, I_{13 }and fed by two bits in parallel in an order of (I_{0}, I_{1}), (I_{2}, I_{3}), (I_{4}, I_{5}), . . . to the S_{1 }syndrome computational circuit 60. After the input, the shift register 61 operates once. As the shift register 61 is connected to every other one, one shift operation multiplies the data by X^{2}. The data I_{0}, I_{2}, I_{4}, . . . ,I_{14 }is added at the XOR circuit 62 _{3 }to the output, a_{2}+a_{3}, from the XOR circuit 62 _{2 }and the sum is stored in the register D_{1}. The data I_{1}, I_{3}, I_{5}, . . . , I_{13 }is added at the XOR circuit 62 _{1 }to the value a_{2 }of the register D_{2 }and the sum is stored in the register D_{0}. As the last I_{6 }of the information bits is 1bit input, the connection is switched to the same as in the circuit of FIG. 23. Alternatively, it is possible to input I_{15}=0 to the S_{1 }syndrome computational circuit 60 and, after a shift operation, multiply the shift register by X^{−1}. This allows 2bit input parallel processing to be performed.

[0100]
{circle over (2)} S_{3 }Syndrome Computational Circuit

[0101]
A S_{3 }syndrome computational circuit 70 in FIG. 4B is described next. In the conventional S_{3 }syndrome computational circuit 30 in FIG. 23A, the value in the S_{3 }syndrome computational circuit 30 is first multiplied by X^{3}, then the input data I_{0 }is added to the term of X^{0 }at the XOR circuit 32 _{1}. The shift register 31 in the S_{3 }syndrome computational circuit 30 in the initial state has a value of 0, which is referred to as (0). Accordingly:

0×X^{3}+I_{0} (31)

[0102]
After the value in the S_{3 }syndrome computational circuit 30 is multiplied by X^{3}, the input data I_{1 }is added to the term of X_{0}. Accordingly:

(0×X^{3}+I_{0})X+I_{1} (32)

[0103]
Subsequently, after the value in the S_{3 }syndrome computational circuit 30 is multiplied by X^{3}, the input data I_{2 }is added to the term of X_{0}. Accordingly:

((0×X^{3}+I_{0})X^{3}+I_{3}+I_{1})X^{3}+I_{2} (33)

[0104]
When the input data, up to I_{14}, enters the S_{3 }syndrome computational circuit 30, the following is given:

(((((0×X^{3}+I_{0})X^{3}+I_{1})X^{3}+I_{2})X^{3}+I_{3})X^{3}+I_{4})X^{3}+I_{5})X^{3}+I_{6})X^{3}+I_{7})X^{3}+I_{8})X^{3}+I_{9})X^{3}+I_{10})x^{3}+I_{11})X^{3}+I_{12})X^{3}+I_{13})X^{3}+I_{14} (34)

[0105]
The expression can be altered in:

(((((0×X^{6}+I_{0}X^{3}+I_{1})X^{6}+I_{2}X^{3}+I_{3})X^{6}+I_{4}X^{3 }+I_{5})X^{6}+I_{6}X^{3}+I_{7})X^{6}+I_{8}X^{3}+I_{9})X^{6}+I_{10}X^{3 }+I_{11})X^{6}+I_{12}X^{3}+I_{13})X^{3}+I_{14} (35)

[0106]
This means that after the value in the S_{3 }syndrome computational circuit 30 is multiplied by X^{6}, the input data I_{0 }is added to the term of X^{3}, and the input data I_{1 }to the term of X^{0}. Then, after the value in the S_{3 }syndrome computational circuit 30 is multiplied by X^{6}, the input data I_{2 }is added to the term of X^{3}, and the input data I_{3 }to the term of X^{0}. Next, after the value in the S_{3 }syndrome computational circuit 30 is multiplied by X^{6}, the input data I_{4 }is added to the term of X^{3}, and the input data I_{5 }to the term of X^{0}. In a word, one operation of the shift register multiplies the data by X^{6}, then 2bit data is input. Finally, after the value in the S_{3 }syndrome computational circuit 30 is multiplied by X^{3}, the input data I_{14 }is added to the term of X_{0 }by 1bit input.

[0107]
When the value of the shift register 31, expressed by Expression (5), is multiplied by X^{6}, the following is given:

a_{0}X^{6}+a_{1}X^{7}+a_{2}X^{8}+a_{3}X^{9} (36)

[0108]
From the α minimal polynomial M_{1}(x), a relation of X^{4}=X +1 is derived. Accordingly:

(a_{1}+a_{2})X^{0}+(a_{1}+a_{3})X^{1}+(a_{0}+a_{2})X^{2}+(a_{0}+a_{1}+a_{3})X^{3} (37)

[0109]
[0109]FIG. 4B is a block diagram showing a circuit arrangement of the S_{3 }syndrome computational circuit 70 according to the present embodiment that specifically configures Expression (37).

[0110]
The S_{3 }syndrome computational circuit 70 comprises a shift register 71 consisting of registers D_{0}, D_{1}, D_{2}, D_{3}, and XOR circuits 72 _{1}, 72 _{2}, 72 _{3}, 72 _{4}, 72 _{5}, 72 _{6}.

[0111]
Based on Expression (37), the S_{3 }syndrome computational circuit 70 through one shift operation performs: adding the values a_{1}, a_{2 }of the registers D_{1}, D_{2 }at the XOR circuit 72 _{2 }and storing the sum into the register D_{0}; adding the values a_{1}, a_{3 }of the registers D_{1}, D_{3 }at the XOR circuit 72 _{6 }and storing the sum into the register D_{1}; adding the values a_{0}, a_{2 }of the registers D_{0}, D_{2 }at the XOR circuit 72 _{4 }and storing the sum into the register D_{2}; and adding the values a_{0}, a_{1}, a_{3 }of the registers D_{0}, D_{1}, D_{3 }at the XOR circuits 72 _{5}, 72 _{6 }and storing the sum into the register D_{3}.

[0112]
The information bits I_{0}, I_{1}, I_{2}, I_{3}, I_{4}, I_{5}, I_{6 }and check bits I_{7}, I_{8}, I_{9}, I_{10}, I_{1}, I_{12}, I_{13}, I_{14 }read out from the memory cell area, not depicted, are divided into I_{0}, I_{2}, I_{4}, I_{6}, I_{8}, I_{10}, I_{12}, I_{14 }and I_{1}, I_{3}, I_{5}, I_{7}, I_{9}, I_{11}, I_{13 }and fed by two bits in parallel in an order of (I_{0}, I_{1}), (I_{2}, I_{1}), (I_{4}, I_{5}), . . . to the S_{3 }syndrome computational circuit 70. After the input, the shift register 71 operates once. The data I_{0}, I_{2}, I_{4}, . . . , I_{14 }is added at the XOR circuit 72 _{3 }to the output, a_{1}+a_{3}, from the XOR circuit 72 _{6 }and the sum is stored in the register D_{1}. The data I_{1}, I_{3}, I_{5}, . . . , I_{13 }is added to the output, a_{1}+a_{2}, from the XOR circuit 72 _{1 }at the XOR circuit 72 _{2 }and the sum is stored in the register D_{0}. As the last I_{6 }of the information bits is 1bit input, the connection is switched to the same as in the S_{3 }syndrome computational circuit 30 of FIG. 23. Alternatively, it is possible to input I_{15}=0 to the S_{3 }syndrome computational circuit 70 and, after a shift operation, multiply the shift register by X^{−3}. This allows 2bit input parallel processing to be performed.

[0113]
{circle over (3)} Error Position Detector

[0114]
An error position detector is described next. In the error position detector in the present embodiment, the S_{1}, S_{3 }syndrome computational circuits 60, 70 perform one shift operation corresponding to the conventional two shift operations. Therefore, the error position detector performs an arithmetic also corresponding to the conventional two shift operations. The error position polynomial (10) is also represented by:

σ(Z)=S _{1}+σ_{1} ×Z ^{2} +σ _{2} ×Z ^{4} (38)

[0115]
[0115]FIGS. 5 and 6 show an arrangement of the error position detector configured based on Expression (38).

[0116]
The error position detector 80 comprises a first arithmetic section 80 a (FIG. 5) that computes and stores S_{1}, σ_{1 }and σ_{2}, and a second arithmetic section 80 b that detects a data error position based on Expression (38) and outputs a detection signal. As shown in FIG. 5, the first arithmetic section 80 a comprises a shift register 81, an X^{2 }arithmetic circuit 82, and an X^{4 }arithmetic circuit 83. A shift register 81 a stores the syndrome S_{1 }as the initial state, and shift registers 82 a, 83 a store the operated results, σ_{1}=S_{1} ^{2 }and σ_{2}=S_{1} ^{3}+S_{3}, as the initial states. The error position detector 80 executes error detection in synchronization with every other data I_{0}, I_{2}, I_{4}, I_{6 }among the output data I_{0}, I_{1}, I_{2}, I_{3}, I_{4}, I_{5}, I_{6}. It operates the shift registers 81 a, 82 a, 83 a once to multiply the term of σ_{1 }by Z^{2 }in the X^{2 }arithmetic circuit 82, and the term of σ_{2 }by Z^{4 }in the X^{4 }arithmetic circuit 83. If any error is present, then σ=0.

[0117]
The X^{2 }arithmetic circuit 82 has the same arrangement as the X^{2 }arithmetic circuit 43 in FIG. 25: the shift register 43 a corresponds to the shift register 82 a; and the XOR circuits 43 b _{1}, 43 b _{2 }to the XOR circuits 82 b _{1}, 82 b _{2}. Therefore, detailed arrangement descriptions for those parts are omitted.

[0118]
The X^{4 }arithmetic circuit 83 multiplies the value expressed by Expression (11) of the shift register 83 a by X^{4}. Therefore, the shift register 83 a has a value expressed by:

a_{0}X^{4}+a_{1}X^{5}+a_{2}X^{6}+a_{3}X^{7} (39)

[0119]
From the α minimal polynomial M_{1}(x), a relation of X^{4}=X +1 is derived. Accordingly:

(a_{0}+a_{3})X^{0}+(a_{0}+a_{1}+a_{3})X^{1}+(a_{1}+a_{2})X^{2}+(a_{2}+a_{3})X^{3} (40)

[0120]
Based on Expression (40), the X^{4 }arithmetic section 83 through one shift operation performs: adding the values a_{0}, a_{3 }of the registers E_{0}, E_{3 }at the XOR circuit 83 b _{1 }and storing the sum into the register E_{0}; adding the values a_{0},a_{1}, a_{3 }of the registers E_{0}, E_{1}, E_{3 }at the XOR circuit 83 b _{1},83 b _{2 }and storing the sum into the register E_{1}; adding the values a_{1}, a_{2 }of the registers E_{1}, E_{2 }at the XOR circuit 83 b _{3 }and storing the sum into the register E_{2}; and adding the values a_{2}, a_{3 }of the registers E_{2}, E_{3 }at the XOR circuit 83 b _{4 }and storing the sum into the register E_{3}.

[0121]
The second arithmetic section 80 b in FIG. 6 includes a first detector 84 to detect error positions in the output data I_{0}, I_{2}, I_{4}, I_{6}; a second detector 85 to detect error positions in the output data I_{1}, I_{3}, I_{5}; an Xarithmetic circuit 86 to multiply the term of σ_{1 }by Z regarding the data I_{1}, I_{3}, I_{5}; and an X^{2}arithmetic circuit 87 to multiply the term of σ_{2 }by Z^{2 }regarding the data I_{1}, I_{3}, I_{5}. The output resulted from the operation at the XOR circuit 88 and the NOR gate 89 in each detector 84, 85 exhibits “1” at the error position. This output is employed to invert the corresponding data Ii to detect 2bit error positions in parallel at the same time by one shift operation. The X arithmetic circuit 86 and the X^{2 }arithmetic circuit 87 have the same arrangements as the conventional circuits shown in FIGS. 25 and 26 though they are not required to have registers for storing data.

[0122]
(2) Second Embodiment

[0123]
[0123]FIG. 7 is a block diagram showing a NANDtype flash memory according to a second embodiment, which mounts an ECC circuit on a chip.

[0124]
The memory comprises eight memory cell areas 101 _{0}, 101 _{1}, 101 _{2}, . . . , 101 _{7}. Eight page buffers 102 _{0}, 102 _{1}, 102 _{2}, . . . , 102 _{7 }are provided corresponding to the memory cell areas 101 _{0}, 101 _{1}, 101 _{2}, . . . , 101 _{7 }to temporarily store data to be written in and read out of the memory cell areas 101 _{0}, 101 _{1}, 101 _{2}, . . . , 101 _{7}. Between the page buffers 102 _{0}102 _{7 }and I/O terminals 104 _{0}, 104 _{1}, . . . , 104 _{7}, an ECC circuit 103 is provided to generate check bits, ECC, for correcting errors in the write data and to correct errors in the read data using the check bits (ECC). Different from the conventional type, for error detection and correction, the ECC circuit 103 adds 40 check bits commonly to information bits consisting of 528 bits×8 I/O=4224 bits data (M=528, N=8) that can be read out of and written into all memory cell areas 101 _{0}101 _{7 }at a time.

[0125]
Addresses and control signals, input to an I/O terminal 105, are fed to a control signal operation circuit 106 and an address decoder 107, respectively. The control signal operation circuit 106 receives various control signals, ALE, CLE, CE, WE, RE, WP, generates control voltages supplied to various parts, and outputs a signal, READY/BUSY, to an external circuit. On receipt of an address from external through the I/O terminal 105, the address decoder 107 temporarily stores it and drives a column decoder 108 and a block selector 109. The column decoder 108 activates one column in each of the page buffers 102 _{0}102 _{7}. The block selector 109 applies a voltage to a word line in the memory cell areas 101 _{0}101 _{7 }required for reading, writing and erasing.

[0126]
As shown in FIG. 8, each memory cell area 101 j (where j=07) includes electrically rewritable, nonvolatile memory cells MC arrayed in a matrix. In this example, 16 memory cells MC are serially connected in a unit. A drain of the memory cell MC at one end is connected to a bit line BL via a selection gate transistor SG1. A source of the memory cell MC at the other end is connected to a common source line SL via a selection gate transistor SG2. Control gates of the memory cells MC in the row direction are connected to a common word line WL. Gate electrodes of the selection gate transistors SG1, SG2 in the row direction are connected to a common selection gate line SGL1, SGL2. In this embodiment, data of 528 bits, stored in the memory cells arranged at odd or even numbers among 1056 memory cells MC along a control gate line, is treated as a page or a unit to be written or read at a time. In this example, data of 16 pages adjoining in the column direction is treated as a block or a unit to be erased at a time. In addition to 1056 (528×2) memory cells MC arranged along a word line WL to store information bits, the memory cell area 101 _{7 }is further provided with memory cells MC to store 80 (40×2) check bits for error correction.

[0127]
As shown in FIG. 8, each page buffer 102 j includes 528 data storage circuits 121. Each data storage circuit 121 is connected to two bit lines BLi, BLi+1. Data can be read out from a memory cell MC in the memory cell area 101 j via either bit line BL selected by the address. A state of a memory cell MC in the memory cell area 101 j can be detected via the bit line BL. Writing into a memory cell MC in the memory cell area 101 j can be performed when a write control voltage is applied to the memory cell MC via the bit line BL. Among 528 data storage circuits 121, either one is selected at the column decoder 108 and only the selected data storage circuit 121 is connected to the ECC circuit 103.

[0128]
Therefore, in the whole memory, the data storage circuits 121 of 8 bits (8I/O) having the same column address are connected to the ECC circuit 103 by the column decoder 108. In a read operation, the memory cells MC of one page surrounded by a dashed line in FIG. 8 are selected, and data of 528×8 bits is stored in all data storage circuits 121 at a time. The column decoder 108 increments the column address by one in synchronization with the read enable (RE) signal input from external. As a result, one in each of the memory cell areas 101 _{0}101 _{7}, eight data storage circuits 121 in total are selected in turn and 8bit (8I/O) data is sequentially output to the ECC circuit 103. In a write operation, 8bit (8I/O) data is sequentially input to the ECC circuit 103 from external via the I/O terminal 104 _{0}104 _{7}, and the 8bit data is sequentially output from the ECC circuit 103. The column decoder 108 increments the column address by one in synchronization with the write enable (WE) signal input from external. As a result, one in each of the memory cell areas 101 _{0}101 _{7}, eight data storage circuits 121 in total are selected in turn, and 8bit (8I/o) data from the ECC circuit 103 is sequentially input to the selected storage circuit 121.

[0129]
An ECC circuit 103 is explained next.

[0130]
[0130]FIG. 9 is a block diagram showing the ECC circuit 103 in detail. The ECC circuit 103 includes an arithmetic logic circuit 131 containing multiple stages of registers, XOR circuits and switches; a Galois arithmetic circuit 132 for use in a syndrome computation and so forth; and an error position detector 133 (mainly a second arithmetic section) and a data inverter 134 operative to decode. The arithmetic logic circuit 131 configures a check bit generator when the ECC circuit 103 serves as a coder, and configures mainly the syndrome arithmetic circuit and a first arithmetic section in the error position detector when the ECC circuit 103 serves as a decoder.

[0131]
(21) Coder

[0132]
In the ECC circuit
103, data is input by 8 bits (D
_{0}D
_{7}) to perform error detection and correction on a basis of data of 528×8=4228 bits. In the case of BCH code capable of correcting 3bit errors and detecting 4bit errors, the following condition can be considered: the number of information bits, k=4224; a code length, n=8191; the number of correction bits, t=3; and m=13. Therefore, a generating polynomial required for coding and decoding is given below:
$\begin{array}{cc}\begin{array}{c}\mathrm{Fundamental}\ue89e\text{\hspace{1em}}\ue89e\mathrm{Polynomial}\ue89e\text{:}\ue89e\text{\hspace{1em}}\ue89eF\ue8a0\left(X\right)={X}^{13}+{X}^{4}+{X}^{3}+X+1\\ \mathrm{Parity}\ue89e\text{\hspace{1em}}\ue89e\mathrm{Polynomial}\ue89e\text{:}\ue89e\text{\hspace{1em}}\ue89e{M}_{0}\ue8a0\left(x\right)=X+1\\ \alpha \ue89e\text{\hspace{1em}}\ue89e\mathrm{Minmal}\ue89e\text{\hspace{1em}}\ue89e\mathrm{Polynomial}\ue89e\text{:}\ue89e\text{\hspace{1em}}\ue89e{M}_{1}\ue8a0\left(x\right)={X}^{13}+{X}^{4}+{X}^{3}+X+1\\ {\alpha}^{3}\ue89e\text{\hspace{1em}}\ue89e\mathrm{Minimal}\ue89e\text{\hspace{1em}}\ue89e\mathrm{Polynomial}\ue89e\text{:}\ue89e\text{\hspace{1em}}\ue89e{M}_{3}\ue8a0\left(x\right)={X}^{13}+{X}^{10}+{X}^{9}+{X}^{7}+{X}^{5}+{X}^{4}+1\\ {\alpha}^{5}\ue89e\text{\hspace{1em}}\ue89e\mathrm{Minimal}\ue89e\text{\hspace{1em}}\ue89e\mathrm{Polynomial}\ue89e\text{:}\ue89e\text{\hspace{1em}}\ue89e{M}_{5}\ue8a0\left(x\right)={X}^{13}+{X}^{11}+{X}^{8}+{X}^{7}+{X}^{4}+{X}^{1}+1\\ \mathrm{Generating}\ue89e\text{\hspace{1em}}\ue89e\mathrm{Polynomial}\ue89e\text{:}\ue89e\text{\hspace{1em}}\ue89eG\ue8a0\left(x\right)={M}_{0}\ue89e{M}_{1}\ue89e{M}_{3}\ue89e{M}_{5}\\ =\begin{array}{c}\text{\hspace{1em}}\ue89e{X}^{40}+{X}^{39}+{X}^{38}+{X}^{35}+{X}^{34}+{X}^{33}+\\ \text{\hspace{1em}}\ue89e{X}^{32}+{X}^{28}+{X}^{27}+{X}^{26}+{X}^{25}+{X}^{23}+\\ \text{\hspace{1em}}\ue89e{X}^{22}+{X}^{20}+{X}^{18}+{X}^{17}+{X}^{16}+{X}^{15}+\\ \text{\hspace{1em}}\ue89e{X}^{14}+{X}^{10}+{X}^{9}+{X}^{5}+{X}^{4}+{X}^{2}+{X}^{1}+1\end{array}\end{array}& \left(41\right)\end{array}$

[0133]
Similar to the first embodiment, Expression (42) can be altered in Expression (43).

(((((0+I_{0}X^{39})X+I_{1}X^{39})X+I_{2}X^{39})X+I_{3}X^{39}) . . . )X+I_{527}X^{39})X (42)

(((((0+I_{0}X^{39}+I_{1}X^{38}+I_{2}X^{37 }. . . I_{7}X^{32})X^{8}+(I_{8}X^{39 . . . I} _{1.5}X^{32}))X^{8 }. . . (I_{520}X^{39}+I_{521}X^{38 }. . . I_{527}X^{32})X^{8} (43)

[0134]
Expression (43) means the following. The data of 8 bits D_{0}D_{7}=I_{0}, I_{1}, I_{2}, . . . , I_{7}, input by one clock of the WE signal, is multiplied on a bit basis by X^{39}, X^{38}, X^{37}, . . . x^{32}, respectively, and each product is added into an internal register value, which is then multiplied by X^{8}. Subsequently, the data of 8 bits D_{0}D_{7}=I_{8}, I_{9}, I_{10}, . . . I_{15}, input by the next clock of the WE signal, is multiplied on a bit basis by X^{39}, X^{38}, X^{37}, . . . , X^{32}, respectively, and each product is added into an internal register value, which is then multiplied by X^{8}. The same operations are repeated 528 times to the data of last 8 bits D_{0}D_{7}=I_{4216}, I_{4217, I} _{4218}, . . . , I_{4223. }

[0135]
[0135]FIG. 10 shows 40stage registers REG0, REG1, REG39 equipped in the arithmetic logic circuit 131. These registers configure a cyclic shift register in the coder. The registers REG0, REG1, . . . , REG39 have Inputs B0, B1, . . . , B39 and Outputs A0, A1, . . . , A39. Based on the above generating polynomial (41) and Expression (43), the arithmetic logic circuit 131 executes XOR operations represented by the following Expressions (45) and (46) for one data input. The XOR operations herein employed are represented by Expression (44). Prior to sending the Outputs A32A39, the registers REG32REG39 sends Outputs AA32AA39, which are resulted from XOR operations as shown by Expression (45) to add 8bit data D_{0}D_{7 }fed from external to register values. Outputs A031 and AA32AA39 are led to XOR circuits. The results from the XOR operations, B0B39, represented by Expression (46), are led to Inputs of the registers REG0REG39 and fetched in synchronization with the shift register clock. When this operation is repeated 528 times, 40 check bits I_{4224, I} _{4225}, I_{4226}, . . . , I_{4264 }are generated in the registers REG0REG39 of the arithmetic logic circuit 131.

XOR3(IN1,IN2,IN3)=XOR2(XOR2(IN1,IN2),IN3)

XOR4(IN1,IN2,IN3,IN4)=XOR2(XOR3(IN1,IN2,IN3),IN4)

XOR5(IN1,IN2,IN3,IN4,IN5)=XOR2(XOR4(IN1,IN2,IN3,IN4),IN5)

XOR6(IN1,IN2,IN3,IN4,IN5,IN6)=XOR2(XOR5(IN1,IN2,IN3,IN4,IN5),IN6)

XOR7(IN1,IN2,IN3,IN4,IN5,IN6,IN7)=XOR2(XOR6(IN1,IN2,IN3,IN4,IN5,IN6),IN7) (44)

AA39=XOR2(A39,D0)

AA38=XOR2(A38,D1)

AA37=XOR2(A37,D2)

AA36=XOR2(A36,D3)

AA35=XOR2(A35,D4)

AA34=XOR2(A34,D5)

AA33=XOR2(A33,D6)

AA32=XOR2(A32,D7) (45)

B0=XOR6(A32,A33,A35,A36,A37,A39)

B1=XOR5(A32,A34,A35,A38,A39)

B2=XOR2(A32,A37)

B3=XOR2(A33,A38)

B4=XOR6(A32,A33,A34,A35,A36,A37)

B5=XOR4(A32,A34,A38,A39)

B6=XOR3(A33,A35,A39)

B7=XOR2(A34,A36)

B8=XOR3(AO,A35,A37)

B9=XOR7(A1,A32,A33,A35,A37,A38,A39)

B10=XOR6(A2,A32,A34,A35,A37,A38)

B11=XOR6(A3,A33,A35,A36,A38,A39)

B12=XOR5(A4,A34,A36,A37,A39)

B13=XOR4(A5,A35,A37,A38)

B14=XOR6(A6,A32,A33,A35,A37,A38)

B15=XOR6(A7,A32,A34,A35,A37,A38)

B16=XOR4(A8,A32,A37,A38)

B17=XOR6(A9,A32,A35,A36,A37,A38)

B18=XOR4(A10,A32,A35,A38)

B19=XOR4(A11,A33,A36,A39)

B20=XOR7(A12,A32,A33,A34,A35,A36,A39)

B21=XOR6(A13,A33,A34,A35,A36,A37)

B22=XOR6(A14,A32,A33,A34,A38,A39)

B23=XOR5(A15,A32,A34,A36,A37)

B24=XOR5(A16,A33,A35,A37,A38)

B25=XOR7(A17,A32,A33,A34,A35,A37,A38)

B26=XOR5(A18,A32,A34,A37,A38)

B27=XOR5(A19,A32,A36,A37,A38)

B28=XOR5(A20,A32,A35,A36,A38)

B29=XOR5(A21,A33,A36,A37,A39)

B30=XOR4(A22,A34,A37,A38)

B31=XOR4(A23,A35,A38,A39)

B32=XOR5(A24,A32,A33,A35,A37)

B33=XOR7(A25,A32,A34,A35,A37,A38,A39)

B34=XOR4(A26,A32,A37,A38)

B35=XOR6(A27,A32,A35,A36,A37,A38)

B36=XOR6(A28,A33,A36,A37,A38,A39)

B37=XOR5(A29,A34,A37,A38,A39)

B38=XOR6(A30,A32,A33,A36,A37,A38)

B39=XOR6(A31,A32,A34,A35,A36,A38) (46)

[0136]
[0136]FIG. 11 is a flowchart showing an operation of coding in the ECC circuit 103 and FIG. 12 is a timing chart on coding in the same.

[0137]
When a data input command (80 h) enters from external (S21), the registers REG040 in the arithmetic logic circuit 131 are reset (S22), then an address (Add) is given. Subsequently, a WE (Write Enable) signal enters from external and, in synchronization with this signal, data is loaded by 8 bits into the page buffer 102 j (S23, S24, S25). At the same time, the data is sent to the arithmetic logic circuit 131 to compute check bits. When the column address reaches the last 528 (S25), the data loading is terminated. Subsequently, a program command (10 h) enters from external, and an operation of voltage boosting by a charge pump, not depicted, is started to write data into the memory cell MC. At the same time, prior to writing, check bits are output, using the internal oscillator and so forth, not depicted, from 40 bits REG0REG39 by 5 bytes sequentially, and stored in the data storage circuit 121 of the page buffer 102 _{7}. The data stored in the data storage circuit 121 is then written into the memory cells MC in the page (surrounded by the dashed line in FIG. 8) selected by the external address Add.

[0138]
(22) Decoder {circle over (1)} Syndrome Computational Circuits

[0139]
For 3bit error correction and 4bit error detection, four syndromes S_{0}, S_{1}, S_{3}, S_{5 }are required as it is known. The syndrome S_{0 }can be derived from the minimal polynomial M_{1}(x)=X^{4}+X+1. When X^{10}=X^{3}+1, derived from the minimal polynomial M_{1}(x)=X^{10}+X^{3}+1, is referred to as an a operator, the syndrome S_{1 }can be derived from the a operator, the syndrome S_{3 }from an α^{3 }operator, and the syndrome S_{5 }from an α^{5 }operator. Only one bit can enter by one clock of the WE signal in the conventional decoder. In contrast, 8bit data can be fetched by one clock of the WE signal in this embodiment by altering Expressions similar to the first embodiment that alters Expression from (27) to (28), and Expression from (34) to (35). Accordingly, the syndrome S_{1 }can be derived from an α^{8}operator, the syndrome S_{3 }from an α^{24 }operator, and the syndrome S_{5 }from an α^{40 }operator.

[0140]
[0140]FIG. 13 shows 40stage registers REG0, REG1, . . . , REG39 equipped in the arithmetic logic circuit 131. The register REG0 configures a cyclic shift register in the S_{0 }syndrome computational circuit. The registers REG113 configure a cyclic shift register in the S_{1 }syndrome computational circuit. The registers REG1426 configure a cyclic shift register in the S_{3 }syndrome computational circuit. The registers REG2739 configure a cyclic shift register in the S_{5 }syndrome computational circuit. The register REG0 has an Input PP0 and an Output P0. The registers REG113 have Inputs AA0, AA1, . . . , AA12 and Outputs A0, A1, . . . , A12. The registers REG1426 have Inputs BB0, BB1, . . . , BB12 and Outputs B0, B1, . . . , B12. The registers REG2739 have Inputs CC0, CC1, . . . , CC12 and Outputs C0, C1, . . . , C12. The arithmetic logic circuit 131 executes operations shown in Expressions (47), (48), (49) and (50) based on one data input. The 8bit data D0D7 read out of the data storage circuit 121 is added to the Outputs P0, A013, B013, C013 from the registers REG0REG39 at XOR circuits. The Outputs PP0, AA013, BB013, CC013 from the XOR circuits are led to the inputs of the registers REG039 and fetched in synchronization with the shift register clock. The XOR circuits connected to the registers REG113 configure an α^{8 }arithmetic circuit, which receives the data D0D7 input. The XOR circuits connected to the registers REG1426 configure an α^{24 }arithmetic circuit, which receives the data D0D7 input. The XOR circuits connected to the registers REG2739 configure an α^{40 }arithmetic circuit, which receives the data D0D7 input. In stead of the α^{40 }arithmetic circuit, because it has a large circuit scale, α^{40 }may be fed into one of inputs of the Galois arithmetic circuit 132 shown in FIG. 9, and the output thereof and the data D0D7 are appropriately operated at XOR circuits.

[0141]
<Computation of Syndrome SO>

PP0=XOR9(P0,D7,D6,D5,D4,D3,D2,D1,D0) (47)

[0142]
<Computation of Syndrome S_{1}>

AA0=XOR2(A5,D7)

AA1=XOR3(A5,A6,D6)

AA2=XOR3(A6,A7,D5)

AA3=XOR4(A5,A7,A8,D4)

AA4=XOR5(A5,A6,A8,A9,D3)

AA5=XOR5(A6,A7,A9,A10,D2)

AA6=XOR5(A7,A8,A10,A11,D1)

AA7=XOR5(A8,A9,A11,A12,D0)

AA8=XOR4(A0,A9,A10,A12)

AA9=XOR3(A1,A10,A11)

AA10=XOR3(A2,A11,A12)

AA11=XOR2(A3,A12)

AA12=A4 (48)

[0143]
<Computation of Syndrome S_{3}>

BB0=XOR5(B1,B2,B7,B9,D7)

BB1=XOR7(B0,B1,B3,B7,B8,B9,B10)

BB2=XOR8(B1,B2,B4,B8,B9,B10,B11,D2)

BB3=XOR10(B0,B1,B3,B5,B7,B10,B11,B12,D6,D2)

BB4=XOR8(B0,B4,B6,B7,B8,B9,B11,B12)

BB5=XOR9(B1,B5,B7,B8,B9,B10,B12,D2,D1)

BB6=XOR10(B0,B2,B6,B8,B9,B10,B11,D5,D2,D1)

BB7=XOR7(B1,B3,B7,B9,B10,B11,B12)

BB8=XOR8(B2,B4,B8,B10,B11,B12,D1,D0)

BB9=XOR8(B3,B5,B9,B11,B12,D4,D1,D0)

BB10=XOR4(B4,B6,B10,B12)

BB11=XOR5(B0,B5,B7,B11,D0)

BB12=XOR7(B0,B1,B6,B8,B12,D3,D0) (49)

[0144]
<Computation of Syndrome S_{5}>

CC0=XOR13(C0,C1,C2,C4,C5,C7,C8,C9,C10,C11,C12, D7,D2)

CC1=XOR5(C3,C4,C6,C7,D2)

CC2=XOR8(C0,C4,C5,C7,C8,D4,D2,D0)

CC3=XOR10(C2,C4,C6,C7,C10,C11,C12,D4,D2,D0)

CC4=XOR9(C0,C1,C2,C3,C4,C9,C10,D1,D0)

CC5=XOR11(C0,C1,C2,C3,C4,C5,C10,C11,D6,D4,D2)

CC6=XOR12(C0,C1,C2,C3,C4,C5,C6,C11,C12,D4,D1,D0)

CC7=XOR11(C1,C2,C3,C4,C5,C6,C7,C12,D3,D2,D0)

CC8=XOR10(C0,C2,C3,C4,C5,C6,C7,C8,D3,D0)

CC9=XOR10(C0,C1,C3,C4,C5,C6,C7,C8,C9,D0)

CC10=XOR12(C1,C2,C4,C5,C6,C7,C8,C9,C10,D5,D3,D1)

CC11=XOR12(C0,C2,C3,C5,C6,C7,C8,C9,C10,C11,D3,D0)

CC12=XOR13(C0,C1,C3,C4,C6,C7,C8,C9,C10,C11,C12, D2,D1) (50)

[0145]
{circle over (2)} Error Position Detector (First Arithmetic Section)

[0146]
[0146]FIG. 14 is a flowchart showing an operation of decoding in the ECC circuit 103.

[0147]
A data read command (00h) is input, then a read address (Add) from external to start reading (S31). The data of one page (528 bytes) selected by the address is read out from the memory cells MC into the page buffers 102 _{0}102 _{7 }(S32). Thereafter, in synchronization with a signal oscillated from the internal oscillator, the data D0D7 is input byte by byte to the ECC circuit 103 to compute the syndrome (S33). As shown in FIG. 27, after computations of the syndromes S_{0}, S_{1}, S_{3}, S_{5}, if S_{1}=S_{3}=S_{5}=0 (S34) and if S_{0}=0 (S35), it is determined errorless (Normal output: S36). If S_{0}≠0 (S35), it is determined uncorrectable (S37). Unless S_{1}=S_{3}=S_{5}=0 (S34), computations are made for σ_{2}=S_{1} ^{2}S_{3}+S_{5 }and σ_{0}=S_{1} ^{3}+S_{3 }(S38). If σ_{0}=0 (S39) and if σ_{2}=0 and S_{0}=0 (S40), it is determined 1bit error, and the control goes to an algorithm for 1bit error correction (S41). Unless σ_{2}=0 and S_{0}=0 (S40), it is determined uncorrectable (S42). If σ_{0}≠0 (S39), computations are made for σ_{1}=S_{1}(S_{1} ^{3}+S_{3}) and σ_{3}=(S_{1} ^{3}+S_{3})^{2}+S_{1}(S_{1} ^{2}S_{3}+S_{5}) (S43). If σ_{3}=0 (S44) and if σ_{2}≠0 and S_{0}=0 (S45), it is determined 2bit errors, and the control goes to an algorithm for 2bit error correction (S46). Unless σ_{2}≠0 and S_{0}=0 (S45), it is determined uncorrectable (S47). If σ_{3}≠0 (S44) and if S_{0}=1 (S48), it is determined 3bit errors, and the control goes to an algorithm for 3bit error correction (S49). The algorithm for 2bit error correction is same as that for 3bit error correction. If S_{0}≠1 (S48), it is determined uncorrectable (S50).

[0148]
[0148]FIG. 15 shows an error position detector that executes the above computations. This error position detector includes a first arithmetic section, consisting of four registers R, A, B, C of 13 bits each, and notdepicted XOR circuits, contained in the arithmetic logic circuit 131. The error position detector also includes a Galois arithmetic circuit 132, and a second arithmetic section 133 consisting of eight locators 141 and arithmetic circuits 142 interposed between the locators 141 to operate ×α, ×α^{2}, ×α^{3}. 13bit buses BUSR, BUSA, BUSB, BUSC are provided to connect them. The output from the Galois arithmetic circuit 132 is connected to the register R.

[0149]
[0149]FIG. 16 shows an algorithm to compute the terms of the error position polynomial, σ_{0}, σ_{1}, σ_{2}, σ_{3}. The registers A, B, C store the syndromes S_{1}, S_{3}, S_{5}, respectively. If these syndromes are all zero, it is determined errorless and no operation is executed (S61). If not, an operation is made for σ_{2}=S_{1} ^{2}S_{3}+S_{5 }and the operated result is sequentially stored in the register R. The operated result finally obtained is transferred from the register R to the register C (S62). Next, an operation is made for σ_{0}=S_{1} ^{3}+S_{3 }and the operated result is sequentially stored in the register R. The operated result finally obtained is transferred from the register R to the register B (S63). If the operated results stored in the registers B, C are both zero, then it is determined 1bit error (S64) and “1” is stored in the register R (S65). If not, computations are made for σ_{1}=S_{1}(S_{1} ^{3}+S_{3}) and σ_{3}=(S_{1} ^{3}+S_{3})^{2}+S_{1}(S_{1} ^{2}S_{3}+S_{5}) (S66, S67, S68).

[0150]
In the present embodiment, of the code length of n=8191, the information bits of k=4224 (528×8 bits) are subjected to the error correction, while the information bits can have 8151 bits except for 41 check bits originally in a code having the code length of n=8191. As a result, the error position is shifted by 8151−4224+1=3928 bits. On reading from a column address of 0, computations are performed to multiply σ_{1 }by α^{3928}, σ_{2 }by α^{7856(=3928×2)}, and σ_{3 }by α^{3593(=} 3928×3−8191) (S69, S70, S71). Similarly, on reading from a column address of i, computations are performed to multiply σ_{1 }by α^{3928+i}, σ_{2 }by α^{7858(=(3928+1)×2)}, and a by α^{3596(=(3928+i)×3−8191)}. Factors such as α^{3928+i }are written into a ROM, for example. The factor is stored in the vicinity of the column data storage or in the memory cell area 101, selected by the column selector 108 of FIG. 7, because it depends on the column address of i. Alternatively, only the factor at the column address of 0 is stored and, when another address is accessed, a dummy operation of detecting an error position is performed to provide a matched factor.

[0151]
[0151]FIG. 17 is a block diagram showing the Galois arithmetic circuit 132 in detail.

[0152]
13bit inputs A and B shown in FIG. 17A are respectively represented by:

[0153]
[0153]A=a _{0} X ^{0} +a _{1} X ^{1} +a _{2} X ^{2} +. . . +a _{12} X ^{12 }

B=b _{0} X ^{0} +b _{1} X _{1} +b _{2} X ^{2} +. . . +b _{12} X ^{12} (51)

[0154]
In this case, A×B can be represented by:
$\begin{array}{cc}\begin{array}{c}A\times B=A\ue8a0\left({b}_{0}\ue89e{X}^{0}+{b}_{1}\ue89e{X}^{1}+{b}_{2}\ue89e{X}^{2}+...\ue89e\text{\hspace{1em}}+{b}_{12}\ue89e{X}^{12}\right)\\ =A\ue89e\text{\hspace{1em}}\ue89e{b}_{0}+X\ue8a0\left(A\ue89e\text{\hspace{1em}}\ue89e{b}_{1}+X\ue8a0\left(A\ue89e\text{\hspace{1em}}\ue89e{b}_{2}+X\ue8a0\left(A\ue89e\text{\hspace{1em}}\ue89e{b}_{3}+......+X\ue8a0\left(A\ue89e\text{\hspace{1em}}\ue89e{b}_{12}\right)\right)\right)\right)))))))))\end{array}& \left(52\right)\end{array}$

[0155]
This circuit can be configured as shown in FIG. 17B, in which A and bi are subjected to the AND operation at an AND circuit 151. The operated result is then multiplied by X at an X multiplier 152, and the product is subjected at an XOR circuit 153 to the XOR operation with the ANDoperated result from the next A and bi+1. From the a Minimal Polynomial M_{1}(x) in Expression (41), a relation of x^{13}=x^{4}+X^{3}+X+1 is present. Therefore, as shown in FIG. 17C, the X multiplier 152 operates shifting the term of X^{12 }into the term of X^{0}; adding it into the terms of X^{3}, X^{1}, X^{0 }by the XOR circuit 154; and storing it in the terms of X^{4}, X^{3},X^{1}.

[0156]
As a result of the above operations, 13bit registers A, B, C, D are given σ_{1}, σ_{3}, σ_{2}, σ_{0 }as initial values, respectively. {circle over (2)} Error Position Detector (Second Arithmetic Section)

[0157]
Error bit positions can be detected based on the following error position polynomial (53) in the cases of 3bit correction and 4bit correction as it is known.

σ(Z)=S _{1}+σ_{1} ×Z+σ_{2} ×Z ^{2} +σ _{3} ×Z ^{3} (53)

[0158]
When Z=α^{I}(I=0, 1, 2, 3, . . . ) is assigned in turn to Expression (53), the position of the error can be indicated by i that holds σ(α^{I})=0. In the present embodiment, as 8bit data is output per WE clock, Expression (53) is altered to Expression (54), like Expression (10) is altered to Expression (38) in the first embodiment.

σ(Z)=σ_{0}+σ_{1} ×Z ^{8}+σ_{2} ×Z ^{16}+σ_{3} ×Z ^{24} (54)

[0159]
As a result, the error detection can be performed by 8 bits simultaneously at every other 8 bits. In a word, of the output data of 8 I/O, the error detection is performed to the I/O 0. If an error is present, then σ=0. As a result of the computations in FIG. 16, the 13bit registers A, B, C, D are given σ_{1}, σ_{3}, σ_{2}, σ_{0 }as initial values, respectively. The XOR circuits connected to the register A in the arithmetic logic circuit 131 configure an α^{8 }arithmetic circuit. The XOR circuits connected to the register B configure an α^{24 }arithmetic circuit. The XOR circuits connected to the register C configure an α^{16 }arithmetic circuit. The register A has Inputs AA0, AA1, . . . , AA12 and Outputs A0, A1, . . . , A12. The register B has Inputs BB0, BB1, . . . , BB12 and Outputs B0, B1, B12. The register C has Inputs CC0, CC1, . . . , CC12 and Outputs C0, C1, . . . , C12. In this case, the α^{8}, α^{16}, α^{24 }arithmetic circuits perform operations respectively represented by Expressions (55), (56) and (57):

AA0=A5

AA1=XOR2(A5,A6)

AA2=XOR2(A6,A7)

AA3=XOR3(A5,A7,A8)

AA4=XOR4(A5,A6,A8,A9)

AA5=XOR4(A6,A7,A9,A10)

AA6=XOR4(A7,A8,A10,A11)

AA7=XOR4(A8,A9,A11,A12)

AA8=XOR4(A0,A9,A10,A12)

AA9=XOR3(A1,A10,A11)

AA10=XOR3(A2,A11,A12)

AA11=XOR2(A3,A12)

AA12=A4 (55)

[0160]
<α^{16 }Arithmetic Circuit>

CC0=XOR4(C6,C7,C9,C10)

CC1=XOR4(C6,C8,C9,C11)

CC2=XOR4(C7,C9,C10,C12)

CC3=XOR6(C0,C6,C7,C8,C9,C11)

CC4=XOR5(C0,C1,C6,C8,C12)

CC5=XOR4(C1,C2,C7,C9)

CC6=XOR5(C0,C2,C3,C8,C10)

CC7=XOR6(C0,C1,C3,C4,C9,C11)

CC8=XOR6(C1,C2,C4,C5,C10,C12)

CC9=XOR5(C2,C3,C5,C6,C11)

CC10=XOR5(C3,C4,C6,C7,C12)

CC11=XOR4(C4,C5,C7,C8)

CC12=XOR4(C5,C6,C8,C9) (56)

[0161]
<α^{24 }Arithmetic Circuit>

BB0=XOR4(B1,B2,B7,B9)

BB1=XOR7(B0,B1,B3,B7,B8,B9,B10)

BB2=XOR7(B1,B2,B4,B8,B9,B10,B11)

BB3=XOR8(B0,B1,B3,B5,B7,B10,B11,B12)

BB4=XOR8(B0,B4,B6,B7,B8,B9,B11,B12)

BB5=XOR7(B1,B5,B7,B8,B9,B10,B12)

BB6=XOR7(B0,B2,B6,B8,B9,B10,B11)

BB7=XOR7(B1,B3,B7,B9,B10,B11,B12)

BB8=XOR6(B2,B4,B8,B10,B11,B12)

BB9=XOR5(B3,B5,B9,B11,B12)

BB10=XOR4(B4,B6,B10,B12)

BB11=XOR4(B0,B5,B7,B11)

BB12=XOR5(B0,B1,B6,B8,B12) (57)

[0162]
[0162]FIG. 18 is a circuit diagram showing a specific arrangement of the locator 141. The locator 141 includes XOR circuits 161 and NOR circuits 162 to compute σ(Z) and outputs “H” if an error is present (σ=0) at the I/O 0 (j =17). As a result, the data inverter 134 of FIG. 9 inverts the data from the data storage circuit 121 in the page buffer 102 _{0 }and outputs the inverted data. Alternatively, as indicated by a dashed arrow 135 in FIG. 9, error correction can be directly performed to the data at the error position in the page buffer 102.

[0163]
On the other hand, the data at the I/O 1 has values in σ(Z) with the term of σ_{1 }multiplied by Z, the term of σ_{2 }multiplied by Z^{2}, and the term of σ_{3 }multiplied by Z^{3}. Accordingly, as shown in FIG. 15, an arithmetic circuit 142 _{1 }is mounted to operate the term of σ_{1}×X, the term of σ_{2}×X^{2}, and the term of σ_{2}×X, and supplies the output to the locator 141 _{1 }to solve the error position polynomial. If an error is detected (σ=0), the output comes to “H”. When these X, X^{2}, X^{3 }arithmetic circuits are assumed to have Inputs X0X12 and Outputs Y0Y12, the arithmetic circuits execute the following operations. The arithmetic circuits are not required to have registers to store data.

[0164]
<X arithmetic circuit>

Y0=X12

Y1=XOR2(X0,X12)

Y2=X1

Y3=XOR2(X2,X12)

Y4=XOR2(X3,X12)

Y5=X4

Y6=X5

Y7=X6

Y8=X7

Y9=X8

Y10=X9

Y11=X10

Y12=X11 (58)

[0165]
<X^{2 }Arithmetic Circuit>

Y0=X11

Y1=XOR2(X11,X12)

Y2=XOR2(X0,X12)

Y3=XOR2(X1,X11)

Y4=XOR3(X2,X11,X12)

Y5=XOR2(X3,X12)

Y6=X4

Y7=X5

Y8=X6

Y9=X7

Y10=X8

Y11=X9

Y12=X10 (59)

[0166]
<X^{3 }Arithmetic Circuit>

Y0=X10

Y1=XOR2(X10,X11)

Y2=XOR2(X11,X12)

Y3=XOR3(X1,X10,X12)

Y4=XOR3(X1,X10,X11)

Y5=XOR3(X2,X11,X12)

Y6=XOR2(X3,X12)

Y7=X4

Y8=X5

Y9=X6

Y10=X7

Y11=X8

Y12=X9 (60)

[0167]
The data at the I/O 2 has values in c(Z) with the term of σ_{1 }multiplied by Z^{2}, the term of σ_{2 }multiplied by Z^{4}, and the term of σ_{3 }multiplied by Z^{6}. If arithmetic circuits are mounted to operate the term of σ_{1}×X^{2}, the term of σ_{2}×X^{4}, and the term of σ_{2}×X^{6 }on the basis of I/O 0, the arithmetic circuit for a large multiplication such as X^{6 }increases the circuit scale. Therefore, in this embodiment, an arithmetic circuit 141 _{1 }is provided to multiply the output from the arithmetic circuit 141 _{2 }by ×X, ×X^{2}, ×X^{3 }again. Similarly, arithmetic circuits are provided up to 141 _{7 }corresponding to the I/O 7.

[0168]
If there is a problem on a signal transmission time delay, the eight locators 141 configuring the error position detector (second arithmetic section) 133 may be divided in two groups of four locators, as shown in FIG. 19, which are arranged on both sides of the arithmetic logic circuit 131. This arrangement is effective to halve the signal transmission path to the locator 141.

[0169]
[0169]FIG. 20 is a timing chart on decoding in the ECC circuit 103. FIG. 20A shows data reading and error correcting after computations of all terms in the error position polynomial.

[0170]
When a data read command (00h) is input from external, followed by a read address (Add), a READY/BUSY signal is activated to start reading. First, the data of one page (528 bytes) selected by the address is read out from the memory cells MC into the page buffers 102 _{0}102 _{7}. Then, in synchronization with a signal oscillated from the internal oscillator, the data D0D7 is input byte by byte to the ECC circuit 103 to compute the syndromes and operate the terms of the error position polynomial using the computed syndromes S_{0}, S_{1}, S_{3}, S_{5}. Thereafter, the data is read out in synchronization with the write enable (RE) signal and the error correction is executed at the same time. In this case, compared to the absence of the ECC circuit 103, an additional busy time is derived from a computation time for syndromes plus a computation time for error correction operators in total. For example, if one syndrome computation requires 50 ns and an arithmetic time for an operator is equal to 3.6 μs, then 528×5 ns+3.6μs=30μ.

[0171]
[0171]FIG. 20B shows an example of computing the syndromes S_{0}, S_{1}, S_{3}, S_{5 }at the same time of data reading. After the reading is started similarly, the data of one page (528 bytes) is read out from the memory cells MC into the page buffers 102 _{0}102 _{7}. Then, the data is output from the page buffers 102 _{0}102 _{7 }byte by byte in synchronization with the RE signal and the ECC circuit 103 computes the syndromes. As a result of the syndrome computation, if an error is detected, a status fail command (70 h) is activated. Accordingly, an operator for error correction is computed and the data is output again to correct the error. In this case, if no error is present, an additional busy time in total is equal to zero.

[0172]
As for 2bit error correction and 3bit error detection, the number of permissible random failures (the number of random failures at a device failure probability of 1 ppm) is naturally better in the case of 528 information bits than in the case of 4224 information bits. Table 1 shows an application to a 256 Mb NANDtype flash memory.

[0173]
From Table 1, the number of permissible random failures is 100 bits at 2bit correction BCH code for 528 information bits, and only 30 bits for 4224 information bits. To the contrary, at 3bit correction BCH code for 4224 information bits, the random failures can be permitted up to 300 bits with a necessary code as short as 40 bits. Further, at 4bit correction BCH code for 4224 information bits, the random failures can be permitted up to 1000 bits with a necessary code as short as 53 bits effectively.
TABLE 1 


Number of random failures in 256 Mb at 
Device failure probability of 1 ppm 
 Code length per  Number of 
 Page (528 B)  Failures 
 
 2bit correction BCH code  21 × 8 = 168 bits  100 bits 
 (528 information bits) 
 2bit correction BCH code  27 bits  30 bits 
 (4224 information bits) 
 3bit correction BCH code  40 bits  300 bits 
 (4224 information bits) 
 4bit correction BCH code  53 bits  1000 bits 
 (4224 information bits) 
 

[0174]
Table 2 shows chip sizes of NANDtype flash memories of 128 Mbits and 512 Mbits when no ECC circuit is mounted, compared with those when the conventional 2bit correction ECC circuit is mounted, and those when the 2bit correction ECC circuit of the present embodiment is mounted.
 TABLE 2 
 
 
 128M (0.16 μm)  512M (0.16 μm) 
 

No ECC circuit  41.88 mm^{2 }(100.0%)  136.99 mm^{2 }(100.0%) 
ECC circuit mounted  44.72 mm^{2 }(106.8%)  143.96 mm^{2 }(105.1%) 
(Conventional) 
ECC circuit mounted  43.21 mm^{2 }(103.2%)  140.42 mm^{2 }(102.5%) 
(Embodiment) 


[0175]
Thus, the flash memory with the conventional ECC circuit mounted thereon has an increase in chip size of 6.8% (128M) and 5.1% (512M). To the contrary, the flash memory with the ECC circuit of the present embodiment mounted thereon has an increase in chip size of 3.2% (128M) and 2.5% (512M), which is half the conventional one.

[0176]
As obvious from the forgoing, the information bits are generated per Mbit that is a unit for accessing each memory area in the art. To the contrary, according to the embodiments of the invention, N bits can be processed in parallel. Therefore, it is possible to allocate a set of check bits to M×N bits and reduce the number of check bits in total relative to the number of information bits. This is effective to improve a chip integration density while mounting an onchip error correction circuit.

[0177]
Having described the embodiments consistent with the invention, other embodiments and variations consistent with the invention will be apparent to those skilled in the art. Therefore, the invention should not be viewed as limited to the disclosed embodiments but rather should be viewed as limited only by the spirit and scope of the appended claims.