Sign in

Evolution and learning in neural networks: the number and distribution of ...

 David G. Stork et al
The present invention relates to the interrelationships between nature (as mediated by evolution and genetic algorithms) and nurture (as mediated by gradient-descent supervised learning) in a population of neural networks for pattern recognition. The Baldwin effect is demonstrated that learning...
Inventors: David G. Stork, Ronald C. Keesing
Assignees: Ricoh Co. Ltd., Ricoh Corporation

U.S. Classification
395/13; 395/23; 395/24

International Classification
G06F 1518

View patent at USPTO

Citations

Patent NumberTitleIssue date
4935877Non-linear genetic algorithms for solving problemsJun 19, 1990
5048095Adaptive image segmentation systemSep 10, 1991

Referenced by

Patent NumberTitleIssue date
5428709Integer string rule detection systemJun 27, 1995
5581657System for integrating multiple genetic algorithm applicationsDec 3, 1996
5640492Soft margin classifierJun 17, 1997
5757924Network security device which performs MAC address translation without affecting the IP address May 26, 1998
5778317Method for allocating channels in a radio network using a genetic algorithm Jul 7, 1998
5832466System and method for dynamic learning control in genetically enhanced back-propagation neural networks Nov 3, 1998
5839120Genetic algorithm control arrangement for massively parallel computerNov 17, 1998
5841947Computer implemented machine learning method and systemNov 24, 1998
5946673Computer implemented machine learning and control systemAug 31, 1999
6128607Computer implemented machine learning method and systemOct 3, 2000
6151679System and method for preventing a first node from being emulated by another nodeNov 21, 2000
6304864System for retrieving multimedia information from the internet using multiple evolving intelligent agentsOct 16, 2001
6363368Optimum solution search method and optimum solution search apparatus as well as storage medium in which optimum solution search program is storedMar 26, 2002
6480832Method and apparatus to model the variables of a data setNov 12, 2002
6523016Learnable non-darwinian evolutionFeb 18, 2003
6553357Method for improving neural network architectures using evolutionary algorithmsApr 22, 2003
7139740System and method for developing artificial intelligenceNov 21, 2006

Claims

What is claimed is:

1. In a neural network system for use in application tasks including optical character recognition techniques, an evolutionary evaluation and learning method comprising the steps of:

forming plural genetic representations of an application task including recognizing a plurality of characters where each character has a bit configuration specifying initial connection strengths between filter lays and category units,
converting said plural genetic representations to neural networks,
assigning to each network i, k(i) learning trails, chosen randomly between zero and 2K, and
training each neural network to optimally perform the application task including optical character recognition where the number of learning trials per network is randomly chosen.

2. The system as in claim 1 wherein said training step includes gradient descent learning utilizing backpropagation.

3. The system as in claim 2 including the step of constructing a population of genes for said neural networks and converting said genes to neural networks.

4. The system as in claim 3 wherein the training of each network to perform said application task includes the step of testing each network on the task.

5. The system as in claim 4 including the step of determining whether the performance is within a predetermined percentage and if not discarding said network and if so utilizing a genetic algorithm process of crossover mutation and replication to determine new genes for the next generation.

6. The system as in claim 1 having initial parameters of N equals population size, K equals mean number of learning trials/net/generation, s equals portion of nets that survive each generation, c equals portion of surviving nets that crossover, M equals mutation rate, probability/bit/generation, and GenMax equals maximum number of generations, the method including the steps of:

initializing the population size having a size N, a mean number of learning trials per net per generation having a value K, a portion of networks that survive each generation having a value S, a portion of the surviving networks that crossover having a value C, a mutation rate, probability/bit/generation having a value M, and a maximum number of generations equal to Genmax,
randomly assigning bit values to each gene,
constructing N networks from said genes, and
presenting input/output data pairs k(i) times to said network i and performing gradient descent learning throughout said learning presentations.

7. The system as in claim 6 including the step of testing the fitness of each network, including the performance fitness on the problem to be solved.

8. The system as in claim 7 including the step of choosing the best S.times.N networks to survive in order to reproduce and randomly mutating genes at rate M by choosing C.times.S.times.N networks to crossover mutation so as to chose the cut point randomly in the gene.

9. The system as in claim 8, including the step of generating a new population of genes Gen where Gen=Gen+1 and if Gen=Genmax, displaying the fittest network in the generation Genmax and if not, constructing N networks from said genes.

10. In a neural network system for use in application tasks, an evolutionary evaluation and learning method comprising the steps of:

forming a genetic representation of an application task having a bit configuration specifying initial connection strengths between filter layers and category units,
converting said genetic representation to neural networks,
assigning to each network i, k(i) learning trials, chosen randomly between zero and 2K, and
training each neural network to optimally perform the application task where the number of learning trials per network is randomly chosen.

Drawings