CN105643625A - Working mechanism of single-station feeding production processing system based on mechanical arm - Google Patents

Working mechanism of single-station feeding production processing system based on mechanical arm Download PDF

Info

Publication number
CN105643625A
CN105643625A CN201610128381.0A CN201610128381A CN105643625A CN 105643625 A CN105643625 A CN 105643625A CN 201610128381 A CN201610128381 A CN 201610128381A CN 105643625 A CN105643625 A CN 105643625A
Authority
CN
China
Prior art keywords
mechanical arm
workpiece
time
alpha
buffer memory
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610128381.0A
Other languages
Chinese (zh)
Other versions
CN105643625B (en
Inventor
唐昊
张树林
吴攀飞
刘心报
谭琦
戴飞
宋聪颖
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hefei University of Technology
Original Assignee
Hefei University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hefei University of Technology filed Critical Hefei University of Technology
Priority to CN201610128381.0A priority Critical patent/CN105643625B/en
Publication of CN105643625A publication Critical patent/CN105643625A/en
Application granted granted Critical
Publication of CN105643625B publication Critical patent/CN105643625B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • BPERFORMING OPERATIONS; TRANSPORTING
    • B25HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
    • B25JMANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
    • B25J9/00Programme-controlled manipulators
    • B25J9/16Programme controls
    • B25J9/1656Programme controls characterised by programming, planning systems for manipulators
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B25HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
    • B25JMANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
    • B25J9/00Programme-controlled manipulators
    • B25J9/16Programme controls
    • B25J9/1602Programme controls characterised by the control system, structure, architecture

Abstract

The invention discloses a working mechanism of a single-station feeding production processing system based on a mechanical arm. The working mechanism is characterized in that the single-station feeding production processing system comprises a mechanical arm, a conveyor belt, an industrial camera, a cache library with capacity being M, a finished product warehouse with capacity being N and workpieces, wherein the mechanical arm is used for obtaining vacant amount m of the cache library and a gripper position p under a current state; and a visibility distance Vsmp is selected according to an optimal control strategy v*; and if workpiece are within the visibility distance Vsmp, unloading operation is carried out; and if the workpieces are not within the visibility distance Vsmp, service operation is carried out. The optimal control strategy v* is obtained by carrying out optimization solution on an SMDP model which is established by taking the vacant amount m of the cache library and the gripper position p as states through a strategy iteration algorithm. The working mechanism can be used for improving process balance and production efficiency of a single-station mechanical arm production line which has no fixed beat, has random material supply and has no fixed-point processing, so that basis is provided for optimal scheduling of the single-station mechanical arm production line in industrial production.

Description

A kind of working mechanism of the single site feed production and processing system of mechanically-based arm
Technical field
The present invention relates to automation field, the working mechanism of especially a kind of single site conveyer belt feed production and processing station system
Background technology
Along with the quick development of modern industry, mechanical arm automatic production line is increasingly extensive in the application of field of industrial production, as in fields such as electronic manufacture, automobile making, processing and packing, goods sortings. Particularly machine vision technique utilization on a production line, drastically increases the flexibility of system, intellectuality and automatization level. Such automatic production line generally configures one or more mechanical arm for picking or processing as concrete actuator; Configure one or more conveyer belt for conveying workpieces and packing box; Configuration industrial vision system is for different operations such as the location of workpiece, identification, dimensional measurements.
However as socioeconomic development, the particularization of production and processing, specialization, randomization to the production efficiency of automatic production line, produce versatility, intelligent and motility is had higher requirement. Traditional production line often adapts to single workpiece stream, and according to fixing beat at fixing point processing work. Adaptability and production efficiency for modes of production such as on-fixed beat, random feed, multi-varieties and small-batches are not high.
Summary of the invention
The present invention is the weak point in order to overcome prior art to exist, the working mechanism of the single site feed production and processing system of a kind of mechanically-based arm is provided, to the production efficiency of the single site mechanical arm production line processed for on-fixed beat, random feed, on-fixed point can be improved, thus providing foundation for the single site mechanical arm production line Optimized Operation in commercial production.
The present invention solves that technical problem adopts the following technical scheme that
The working mechanism of the single site feed production and processing system of a kind of mechanically-based arm of the present invention, described single site feed production and processing system includes: mechanical arm, conveyer belt, industrial camera, capacity are the buffer memory storehouse of M, capacity is warehouse for finished product and the workpiece of N;
Described mechanical arm is positioned at the side of described conveyer belt, is respectively arranged with described buffer memory storehouse and warehouse for finished product in the both sides of described mechanical arm; Described industrial camera is in the upstream of described mechanical arm, and vertical just to the workpiece on described conveyer belt, make described industrial camera vertical just to position be a P that takes picturescam; And described industrial camera can will be located in a P that takes picturescamThe location of workpiece in downstream passes to described mechanical arm; The front viewpoint defining described mechanical arm is Plook, and it is positioned at described front viewpoint PlookThe workpiece in downstream cannot be captured by described mechanical arm;
Vacant amount in definition buffer memory storehouse is m; M �� [0, M]; The position defining the handgrip of described mechanical arm is p; As p=1, represent that the position of handgrip of described mechanical arm is in warehouse for finished product; As p=0, represent that the position p of handgrip of described mechanical arm is in buffer memory storehouse; The united state S of described system of processing it is made up of the position p of vacant amount m and handgripm,p; Definition is with described front viewpoint PlookOne section of observed range for starting point is forward sight distanceDefinition workpiece is through described forward sight distanceThe time spent is the forward sight time
Defining described mechanical arm working range on described conveyer belt is Wpick, and at described working range WpickInside carry out unloading operation, buffer memory storehouse carries out service operations; Described unloading operation is from capturing described conveyer belt to described buffer memory storehouse by described workpiece; Described service operations is to put in described warehouse for finished product after being processed by the workpiece in described buffer memory storehouse; The time having defined unloading operation is discharge timeThe time having defined service operations is service timeAfter completing unloading operation, the handgrip of described mechanical arm is in described buffer memory storehouse; After completing service operations, the handgrip of described mechanical arm is in described warehouse for finished product; Being characterized in, described working mechanism carries out as follows:
Step 1, described system of processing bring into operation, and workpiece arrives mechanical arm operation interval at random; Set line speed, mechanical arm speed, workpiece arrival rate, work pieces process rate; Defined variable i, and initialize i=1;
Step 2, described mechanical arm obtain the vacant amount m of i & ltiWith handgrip position pi, according to optimal control policy v*Select i & lt forward sight distance
Step 3, judge the optimum forward sight distance of described i & ltInside whether there is workpiece, if there being workpiece, then the described mechanical arm optimum forward sight distance to being in described i & ltInterior near front viewpoint PlookWorkpiece carry out unloading operation after, perform step 4; If there is no workpiece, then after the workpiece in described buffer memory storehouse is carried out service operations by described mechanical arm, perform step 5;
Step 4, judge near front viewpoint PlookThe i & lt discharge time of workpieceWhether more than described near front viewpoint PlookI & lt forward sight time of workpieceIf more than, then after i+1 being assigned to i, return step 2 and perform; Otherwise, through the unloading time delay of i & ltAfter, i+1 is assigned to i, and returns step 2 and perform;
Step 5, judge near front viewpoint PlookI & lt service time of workpieceWhether more than described near front viewpoint PlookI & lt forward sight time of workpieceIf more than, then after i+1 being assigned to i, return step 2 and perform; Otherwise, time delay is serviced through i & ltAfter, i+1 is assigned to i, and returns step 2 and perform.
The feature of the working mechanism of the single site feed production and processing system of mechanically-based arm of the present invention lies also in:
Described system of processing is as state to control described forward sight distance using described vacant amount m and handgrip (7) position pLength, and then control the mode of operation of described mechanical arm (1);
When vacant amount m is less, then described mechanical arm (1) chooses shorter forward sight distanceDescribed mechanical arm trends towards service operations;
When vacant amount m is bigger, then described mechanical arm (3) chooses longer forward sight distanceDescribed mechanical arm trends towards unloading operation.
Described optimal control policy v*Obtain according to the following steps:
Step 1, described system of processing are using the vacant amount in buffer memory storehouse and handgrip position as state, forward sight distanceChoose as action, set up half Markov decision model;
Step 2, it is optimized by double Markov model of Policy iteration algorithm and solves, obtain optimal control policy v*��
Described half Markov model is set up according to the following steps:
Step 1, define the state in described buffer memory storehouse for its vacant amount m, then the state space in described buffer memory storehouse is ��1, and have ��1=0,1 ..., M};
The state space of definition handgrip position p is ��2, and have ��2={ 0,1}; As p=0, represent that the position p of handgrip of described mechanical arm is in buffer memory storehouse, as p=1, represent that the position of handgrip of described mechanical arm is in warehouse for finished product;
Define the united state S of described system of processingm,pState space be ��, and have ��=��1����2;
Step 2, the former apparent distanceAs described system of processing at united state Sm,pUnder action, a steady control strategy v of described system of processing is all of united state to the mapping of action, and hasWhereinRepresent when the vacant amount in buffer memory storehouse (4) is m, the action of system of processing when the position of handgrip (7) is in buffer memory storehouse (4);Represent when the vacant amount in buffer memory storehouse (4) is m, the action of system of processing when the position of handgrip (7) is in warehouse for finished product (5); Definition All Policies v set be ��, ��=v | v=(v (1), v (2), v (3) ..., v (N)), v (i) �� D}; Described system of processing is absent from SM,0And S0,1The two state, then corresponding action is designated as NaN;
Step 3, definition TiFor the i & lt decision-making moment of described system of processing, make T0=0; Defining unloading operation time delay when described system of processing carries out the process of i & lt Markov markov property isService operations time delay is
When the i & lt of described mechanical arm operates as unloading operation, then i+1 time decision-making moment Ti+1For:
T i + 1 = T i + m a x { t S m i , p i , t m i , p i l o a d } - - - ( 1 )
I+1 time united state is Xi+1=Sm-1,0, representing that handgrip (7) position is p=0, the vacant amount in buffer memory storehouse (4) is m-1;
I & lt unloading operation time delay is:
When the i & lt of described mechanical arm operates as service operations, then i+1 time decision-making moment Ti+1For:
T i + 1 = T i + m a x { t S m i , p i , t m i , p i s e r } - - - ( 3 )
I+1 time united state is Xi+1=Sm+1,0; Representing that handgrip (7) position is p=0, the vacant amount in buffer memory storehouse (4) is m+1;
I & lt service operations time delay is:
Define half Markov core Qv(t) be:
Q v ( t ) = [ Q ( s m , p , v s m , p , s m ′ , p ′ , t ) ] s m , p , s m ′ , p ′ ∈ Φ = P v ⊗ F v ( t ) - - - ( 5 )
In formula (5),The operator that representing matrix is multiplied; T represents system time; PvRepresent Markov embedded chain transfer matrix under strategy v, FvT () represents the distribution of residence-time matrix under strategy v, and have:
P v = [ p s m , p s m ′ , p ′ ( v s m , p ) ] s m , p s m ′ , p ′ ∈ Φ - - - ( 6 )
F v ( t ) = [ F s m , p s m ′ , p ′ ( t , v s m , p ) ] s m , p s m ′ , p ′ ∈ Φ - - - ( 7 )
In formula (6)It is from state Sm,pTake actionTransfer to Sm',p'Probability, in formula (7)It is at state Sm,pTake actionIt is S to NextStatem',p'Distribution of residence-time;
Step 4, definition expected performance function fvFor:
f v = [ f ( s m , p , v s m , p , s m ′ , p ′ ) ] s m , p , s m ′ , p ′ ∈ Φ - - - ( 8 )
In formula (8),Represent that system of processing is taken actionFrom united state Sm,pTransfer to NextState Sm',p'Unit interval expectation cost;
Step 5, foundation SMDP model as shown in formula (9):
X=(Xt,��,D,Qv(t),fv)(9)
In formula (9), XtRepresent the state procedure of t system of processing.
Described Policy iteration algorithm sequentially includes the following steps:
Step 1, according to half Markov core Qv(t) and expected performance function fv, define Equivalent InfinitesimalWith performance matrix of equal valueAnd by SMDP model X=(Xt,��,D,Qv(t),fv) be converted to continuous time Markov decision making process�� is discount factor, �� �� [0,1];
Step 2, defined variable k, define vkKth time strategy for system of processing; Initialize k=0; And the 0th tactful v0For:
Step 3, formula (11) is utilized to obtain kth time strategy vkPerformance potential vector
( α I - A α v k + ρ α eπ α v k ) g α v k = f α v k - - - ( 11 )
In formula (11); I is unit diagonal matrix; E is unit column vector; And have:
h α v k = ( I - Q α v k ) e / α - - - ( 12 )
ρ α = m a x { h α v k ( i ) - 1 , i = 1 , 2 ... N , v k ∈ Ω , α ∈ [ 0 , 1 ] } - - - ( 13 )
Represent the average waiting time at state i,Represent the steady-state distribution of kth time in Markov decision making process of equal value; And have:
A α v k π α v k = 0 π α v k e = 1 A α v k e = 0 - - - ( 14 )
Step 4, utilize formula (15) obtain+1 tactful v of kthk+1:
v k + 1 ∈ arg m i n v ∈ Ω { A α v g α v k + f α v } - - - ( 15 )
Step 5, utilize formula (11) obtain+1 tactful v of kthk+1Performance potential vector
Step 6, judgementWhether set up, �� is a less constant; If setting up, then it represents that+1 tactful v of the kth obtainedk+1It is optimal control policy v*, and exit algorithm, otherwise; After k+1 is assigned to k, return step 4.
Compared with the prior art, beneficial effects of the present invention is embodied in:
The present invention adopts a kind of single site mechanical arm production and processing system working mechanism based on half Markov model, can effectively process on-fixed beat, on-fixed picks point, workpiece arrives at random production line processing problems, improve balance and the production efficiency of each operation of production line.
1, the present invention adopts half Markov model that single site mechanical arm production and processing system is modeled, the random workpiece stream arrived is divided into the independent workpiece sequence meeting Markov process of a lot of segment, comparing traditional method according to fixing beat, workpiece to be processed, the present invention more effective process can arrive the picking of workpiece, processing problems at random.
2, before the present invention apparent distance control the action as system of processing, it is possible to the situation according to the real-time workpiece of production line, buffer memory storehouse and warehouse for finished product, adjust Processing Strategies. It is effectively increased the balance between each operation of production line, strengthens the robustness of production line.
3, the present invention adopts Policy iteration algorithm that the SMDP model of system of processing is optimized to solve, compare other algorithms, and Policy iteration algorithm avoids local optimum, fast convergence rate, it is possible to reduces system of processing and solves the time, improves production efficiency.
4, mechanical arm of the present invention adopts may move and picks mode, calculates according to the location of workpiece on operation interval and picks collision point, it is possible to reduces uninstall process and waits the time of workpiece.
Accompanying drawing explanation
Fig. 1 is single site feed production and processing system schematic of the present invention;
Fig. 2 is working mechanism's flow chart of single site feed production and processing system of the present invention.
Detailed description of the invention
Referring to Fig. 1, in the present embodiment, single site feed production and processing system includes: mechanical arm 1, conveyer belt 2, industrial camera 3, capacity are the buffer memory storehouse 4 of M, capacity is warehouse for finished product 5 and the workpiece 6 of N;
Mechanical arm 1 is positioned at the side of conveyer belt 2, and mechanical arm is 6DOF cascade machine mechanical arm, equipped with pneumatic gripping device 7 on the ring flange of mechanical arm 1. Definition mechanical arm speed is Vrobot, in the implementation case, Vrobot=20cm/s. The buffer memory storehouse 4 that capacity is M and the warehouse for finished product 5 that capacity is N it is respectively arranged with in the both sides of mechanical arm 1; In the implementation case, M=4, N=2147483648. Industrial camera 3 is in the upstream of mechanical arm 1, and vertical just to the workpiece 6 on conveyer belt 2; Conveyer belt is by driven by servomotor, and speed can accurately regulate. Definition conveyer belt transporting velocity is Vcon, in the implementation case, Vcon=10cm/s; Definition mechanical arm 1 working range on conveyor belt 2 is Wpick, wherein WpickLeft end point isRight endpoint isTaking pictures a little for P of definition industrial camera 3cam;It is positioned at a P that takes picturescamThe information such as the location of workpiece in downstream can pass to mechanical arm 1; The front viewpoint defining described mechanical arm (1) is Plook, and it is positioned at front viewpoint downstream PlookWorkpiece will lose;
Vacant amount in definition buffer memory storehouse 4 is m, m �� [0,4]; The position of the handgrip 7 of definition mechanical arm 1 is p; As p=1, represent that the position of handgrip 7 of mechanical arm 1 is in warehouse for finished product 5; As p=0, represent that the position p of handgrip 7 of mechanical arm 1 is in buffer memory storehouse 4; Definition mechanical arm handgrip coordinate is Pp; The united state S of system of processing it is made up of the position p of vacant amount m and handgrip 7m,p; Viewpoint P before definitionlookOne section of observed range for starting point is forward sight distanceD=[0, lmax]; Definition workpiece 6 is through forward sight distanceThe time spent is the forward sight timeAnd have:
P l o o k = P r i g h t p i c k + | | P p - P r i g h t p i c k | | V r o b o t V c o n - - - ( 1 )
lmax=Pcam-Plook(2)
t S m , p = v S m , p V c o n - - - ( 3 )
Mechanical arm is at working range WpickInside carry out unloading operation, buffer memory storehouse 4 carries out service operations; Unloading operation is from capturing conveyer belt 2 to buffer memory storehouse 4 by workpiece 6; Service operations is to put in warehouse for finished product 5 after being processed by the workpiece 6 in buffer memory storehouse 4; The time having defined unloading operation is discharge timeThe time having defined service operations is service timeAfter completing unloading operation, the handgrip 7 of mechanical arm 1 is in buffer memory storehouse 4; After completing service operations, the handgrip 7 of mechanical arm 1 is in warehouse for finished product 5;
System of processing controls forward sight distance using vacant amount m and handgrip 7 position p as stateLength and the mode of operation of mechanical arm 1; When vacant amount m is less, it was shown that the storage pressure in buffer memory storehouse 4 is less, then mechanical arm 1 chooses shorter forward sight distanceMechanical arm 1 trends towards service operations, and the discharge pressure to reduce conveyer belt 2 improves the utilization rate in buffer memory storehouse 4; When vacant amount m is bigger, it was shown that the storage pressure in buffer memory storehouse 4 is relatively big, then mechanical arm 1 chooses longer forward sight distanceMechanical arm 1 trends towards unloading operation, to alleviate the storage pressure in buffer memory storehouse 4.
Described system of processing is using the vacant amount in buffer memory storehouse 4 and handgrip position 7 as state, forward sight distanceChoose as action, set up half Markov decision model according to the following steps:
Step 1, definition buffer memory storehouse 4 state be its vacant amount m, then the state space in buffer memory storehouse 4 is ��1, and have ��1=0,1 ..., M};
The state space of definition handgrip 7 position p is ��2, and have ��2=0,1}, as p=1, represent that the position of handgrip 7 of mechanical arm 1 is in warehouse for finished product 5; As p=0, represent that the position p of handgrip 7 of mechanical arm 1 is in buffer memory storehouse 4;
The united state S of definition system of processingm,pState space be ��, and have ��=��1����2;
Step 2, the former apparent distanceAs system of processing at united state Sm,pUnder action, a steady control strategy v of described system of processing is all of united state to the mapping of action, and has:
In formula (4)Represent when vacant amount in buffer memory storehouse 4 is m, the action of system of processing during the position p=0 of handgrip 7;Represent when vacant amount in buffer memory storehouse 4 is m, the action of system of processing during the position p=1 of handgrip 7; The set of definition All Policies v is ��; ��=v | v=(v (1), v (2), v (3) ..., v (N)), v (i) �� D}. System of processing is absent from SM,0And S0,1The two state, then corresponding action is designated as NaN;
Step 3, definition TiMoment, wherein T is shifted for described system of processing i & lt0=0. Defining unloading operation time delay when described system of processing carries out the process of Markov markov property isService operations time delay is
When the operation of described mechanical arm i & lt is for unloading operation, then next decision-making moment Ti+1For:
T i + 1 = T i + m a x { t S m i , p i , t m i , p i l o a d } - - - ( 5 )
NextState is Xi+1=Sm-1,0;System uninstallation delay time is:
When the operation of described mechanical arm i & lt is for service operations, then next decision-making moment Ti+1For:
T i + 1 = T i + m a x { t S m i , p i , t m i , p i s e r } - - - ( 7 )
NextState is Xi+1=Sm-1,0; System service delay time is:
Define half Markov core Qv(t) be:
Q v ( t ) = [ Q ( s m , p , v s m , p , s m ′ , p ′ , t ) ] s m , p , s m ′ , p ′ ∈ Φ = P v ⊗ F v ( t ) - - - ( 9 )
In formula (9),The operator that representing matrix is multiplied; PvRepresent Markov embedded chain transfer matrix under strategy v, FvT () represents the distribution of residence-time matrix under strategy v, and have:
P v = [ p s m , p s m ′ , p ′ ( v s m , p ) ] s m , p s m ′ , p ′ ∈ Φ - - - ( 10 )
F v ( t ) = [ F s m , p s m ′ , p ′ ( t , v s m , p ) ] s m , p s m ′ , p ′ ∈ Φ - - - ( 11 )
In formula (10)It is from state Sm,pTake actionTransfer to Sm',p'Probability, and have:
p S 0 , 0 S 1 , 1 ( v S 0 , 0 ) = p S M , 1 S M - 1 , 0 ( v S M , 1 ) ≡ 1 p S m , p S m - 1 , 0 ( v S m , p ) = 1 - e - λv S m , p , m = 1 , 2 , ... , M - 1 p S m , p S m + 1 , 1 ( v S m , p ) = e - λv S m , p , m = 1 , 2 , ... , M - 1 - - - ( 12 )
In formula (11)It is at state Sm,pTake actionIt is S to NextStatem',p'Distribution of residence-time.
Step 4, definition expected performance function fvFor:
f v = [ f ( s m , p , v s m , p , s m ′ , p ′ ) ] s m , p , s m ′ , p ′ ∈ Φ - - - ( 13 )
In formula (13),Represent that system of processing is taken actionFrom united state Sm,pTransfer to NextState Sm',p'Unit interval expectation cost;
Step 5, foundation SMDP model as shown in Equation 14:
X=(Xt,��,D,Qv(t),fv)(14)
In formula (14), XtRepresent the state procedure of t.
According to following steps described half Markov model is optimized by Policy iteration algorithm and solves, obtain optimal control policy v*, and by optimal control policy v*It is cured in described mechanical arm 1.
Step 1, according to half Markov core Qv(t) and expected performance function fv, define Equivalent InfinitesimalWith performance matrix of equal valueAnd by SMDP model X=(Xt,��,D,Qv(t),fv) be converted to continuous time Markov decision making processWherein �� is discount factor, �� �� [0,1].
Step 2, defined variable k, define vkKth time strategy for system of processing; Initialize k=0; And the 0th tactful v0For:
Step 3, formula (16) is utilized to obtain kth time strategy vkPerformance potential vector
( α I - A α v k + ρ α eπ α v k ) g α v k = f α v k - - - ( 16 )
In formula (16); I is unit diagonal matrix; E is unit column vector; And have:
h α v k = ( I - Q α v k ) e / α - - - ( 17 )
ρ α = m a x { h α v k ( i ) - 1 , i = 1 , 2 ... N , v k ∈ Ω , α ∈ [ 0 , 1 ] } - - - ( 18 )
Represent the average waiting time at state i,Represent the steady-state distribution of kth time in Markov decision making process of equal value; And have:
A α v k π α v k = 0 π α v k e = 1 A α v k e = 0 - - - ( 19 )
Step 4, utilize formula (20) obtain+1 tactful v of kthk+1:
v k + 1 ∈ arg m i n v ∈ Ω { A α v g α v k + f α v } - - - ( 20 )
Step 5, utilize formula (16) obtain+1 tactful v of kthk+1Performance potential vector
Step 6, judgementWhether set up, �� is a less constant; If setting up, then it represents that+1 tactful v of the kth obtainedk+1It is optimal control policy v*, and exit algorithm, otherwise; After k+1 is assigned to k, return step 4.
By obtained optimal control policy v*After being cured in described mechanical arm 1, the working mechanism of system of processing carries out as follows:
Step 1, described system of processing bring into operation, and set line speed Vcon=10cm/s, mechanical arm speed Vrobot=20cm/s, workpiece arrival rate ��=1, work pieces process rate ��=2; Defined variable i, and initialize i=1.
Step 2, described mechanical arm 1 obtain the vacant amount m of i & ltiWith handgrip 7 position pi, according to optimal control policy v*Select i & lt forward sight distance
Step 3, judge the optimum forward sight distance of described i & ltInside whether there is workpiece, if there being workpiece, then the described mechanical arm 1 forward sight distance to being in described i & ltInterior near front viewpoint PlookWorkpiece carry out unloading operation after, perform step 4; If there is no workpiece, then after the workpiece in described buffer memory storehouse 4 is carried out service operations by described mechanical arm 1; Perform step 5;
Wherein uninstall process is divided into and picks, places and wait three processes; Select in forward sight distanceFrom front viewpoint PlookA nearest workpiece is as target workpiece, and definition observes that target workpiece coordinate is Pwork, handgrip is from initial position PpSetting out, following the tracks of the collision point coordinate definition picking workpiece isDefinition is mobile picks the timeMovement is standing timeWith the waiting time it isDefinition target workpiece arrives front viewpoint PlookTime be the time of adventThen:
DefinitionFor the separation of the workpiece time of advent, and have:
T p a b l e = | | P p - P l e f t p i c k | | V r o b o t + P l e f t p i c k - P l o o k V c o n - - - ( 22 )
IfThen directly pick workpiece;Then collision pointThe time is picked with mobileMeet equation group:
| | P w o r k - P p p i c k | | = t p p i c k V c o n | | P p - P p p i c k | | = t p p i c k V r o b o t - - - ( 23 )
Waiting time is:
t p w a i t = 0 - - - ( 24 )
IfThen mechanical arm handgrip first moves to operation interval left end pointWait that workpiece arrives to pick, then the waiting time is again:
Mobile pick the time and be
t p p i c k = | | P l e f t p i c k - P p | | V r o b o t - - - ( 26 )
Handgrip goes back to buffer memory storehouse 4 after picking workpiece and places, and movement is standing time:
t p p l a c e = t 0 p i c k p = 0 | | P 1 p i c k - P 0 | | V r o b o t p = 1 - - - ( 27 )
So the time of i & lt unloading operationFor:
t m i , p i l o a d = t p i p i c k + t p i p l a c e + t p i w a i t - - - ( 28 )
Then the next decision-making moment is:
NextState is Xi+1=Sm-1,0; The system delay time is:
Service operations is divided into mobile taking-up workpiece and the course of processing, and being divided into initial position equally is buffer memory storehouse 4 and 5 two kinds of situations of warehouse for finished product; If p=0, namely initial position is buffer memory storehouse 4, then mechanical arm is directly postponed and taken out a workpiece in warehousing 4 and be processed, and is positioned over by workpiece in warehouse for finished product 5 after machining; If p=1, namely initial position is warehouse for finished product 5, then mechanical arm first moves to buffer memory storehouse 4 and takes out workpiece and be processed; The time of definition i & lt service operations isWherein obeying fixing random distribution process time, its time isDefinition is T from warehouse for finished product to buffer memory storehouse spended timebank_buffer, then
τ m i , p i s e r = t i p r o c e s s p = 0 t i p r o c e s s + T b a n k _ b u f f e r p = 1 - - - ( 31 )
Then the next decision-making moment is:
T i + 1 = T i + m a x { t S m i , p i , τ m i , p i s e r } - - - ( 32 )
Then next state is Xi+1=Sm+1,1, the system delay time is:
Step 4, judge near front viewpoint PlookThe i & lt discharge time of workpieceWhether more than described near front viewpoint PlookI & lt forward sight time of workpieceIf more than, then after i+1 being assigned to i, return step 2 and perform; Otherwise, through the unloading time delay of i & ltAfter, i+1 is assigned to i, and returns step 2 and perform;
Step 5, judge near front viewpoint PlookI & lt service time of workpieceWhether more than described near front viewpoint PlookI & lt forward sight time of workpieceIf more than, then after i+1 being assigned to i, return step 2 and perform; Otherwise, time delay is serviced through i & ltAfter, i+1 is assigned to i, and returns step 2 and perform.

Claims (5)

1. a working mechanism for the single site feed production and processing system of mechanically-based arm, described single site feed production and processing system includes: mechanical arm (1), conveyer belt (2), industrial camera (3), capacity are the buffer memory storehouse (4) of M, capacity is warehouse for finished product (5) and the workpiece (6) of N;
Described mechanical arm (1) is positioned at the side of described conveyer belt (2), is respectively arranged with described buffer memory storehouse (4) and warehouse for finished product (5) in the both sides of described mechanical arm (1); Described industrial camera (3) is in the upstream of described mechanical arm (1), and vertical just to the workpiece (6) on described conveyer belt (2), make described industrial camera (3) vertical just to position be a P that takes picturescam; And described industrial camera (3) can will be located in a P that takes picturescamThe location of workpiece in downstream passes to described mechanical arm (1); The front viewpoint defining described mechanical arm (1) is Plook, and it is positioned at described front viewpoint PlookThe workpiece in downstream cannot be captured by described mechanical arm (1);
Vacant amount in definition buffer memory storehouse (4) is m; M �� [0, M]; The position of the handgrip (7) defining described mechanical arm (1) is p; As p=1, represent that the position of handgrip (7) of described mechanical arm (1) is in warehouse for finished product (5); As p=0, represent that the position p of handgrip (7) of described mechanical arm (1) is in buffer memory storehouse (4); The united state S of described system of processing it is made up of the position p of vacant amount m and handgrip (7)m,p; Definition is with described front viewpoint PlookOne section of observed range for starting point is forward sight distance Definition workpiece (6) is through described forward sight distanceThe time spent is the forward sight time
Defining the described mechanical arm (1) working range on described conveyer belt (2) is Wpick, and at described working range WpickInside carry out unloading operation, buffer memory storehouse (4) carry out service operations;Described unloading operation is from capturing described conveyer belt (2) to described buffer memory storehouse (4) by described workpiece (6); Described service operations is to put in described warehouse for finished product (5) after being processed by the workpiece (6) in described buffer memory storehouse (4); The time having defined unloading operation is discharge timeThe time having defined service operations is service timeAfter completing unloading operation, the handgrip (7) of described mechanical arm (1) is in described buffer memory storehouse (4); After completing service operations, the handgrip (7) of described mechanical arm (1) is in described warehouse for finished product (5);
It is characterized in that, described working mechanism carries out as follows:
Step 1, described system of processing bring into operation, and workpiece arrives mechanical arm operation interval at random; Set line speed, mechanical arm speed, workpiece arrival rate, work pieces process rate; Defined variable i, and initialize i=1;
Step 2, described mechanical arm (1) obtain the vacant amount m of i & ltiWith handgrip (7) position pi, according to optimal control policy v*Select i & lt forward sight distance
Step 3, judge the optimum forward sight distance of described i & ltInside whether there is workpiece, if there being workpiece, then the described mechanical arm (1) the optimum forward sight distance to being in described i & ltInterior near front viewpoint PlookWorkpiece carry out unloading operation after, perform step 4; If there is no workpiece, then after the workpiece in described buffer memory storehouse (4) is carried out service operations by described mechanical arm (1), perform step 5;
Step 4, judge near front viewpoint PlookThe i & lt discharge time of workpieceWhether more than described near front viewpoint PlookI & lt forward sight time of workpieceIf more than, then after i+1 being assigned to i, return step 2 and perform; Otherwise, through the unloading time delay of i & ltAfter, i+1 is assigned to i, and returns step 2 and perform;
Step 5, judge near front viewpoint PlookI & lt service time of workpieceWhether more than described near front viewpoint PlookI & lt forward sight time of workpieceIf more than, then after i+1 being assigned to i, return step 2 and perform; Otherwise, time delay is serviced through i & ltAfter, i+1 is assigned to i, and returns step 2 and perform.
2. the working mechanism of the single site feed production and processing system of mechanically-based arm according to claim 1, is characterized in that,
Described system of processing is as state to control described forward sight distance using described vacant amount m and handgrip (7) position pLength, and then control the mode of operation of described mechanical arm (1);
When vacant amount m is less, then described mechanical arm (1) chooses shorter forward sight distanceDescribed mechanical arm (1) trends towards service operations;
When vacant amount m is bigger, then described mechanical arm (3) chooses longer forward sight distanceDescribed mechanical arm (1) trends towards unloading operation.
3. the working mechanism of the single site feed production and processing system of mechanically-based arm according to claim 1, is characterized in that,
Described optimal control policy v*Obtain according to the following steps:
Step 1, described system of processing are using the vacant amount of buffer memory storehouse (4) and handgrip (7) position as state, forward sight distanceChoose as action, set up half Markov decision model;
Step 2, it is optimized by double Markov model of Policy iteration algorithm and solves, obtain optimal control policy v*��
4. the working mechanism of the single site feed production and processing system of mechanically-based arm according to claim 3, is characterized in that:
Described half Markov model is set up according to the following steps:
Step 1, define the state of described buffer memory storehouse (4) for its vacant amount m, then the state space of described buffer memory storehouse (4) is ��1, and have ��1=0,1 ..., M};
The state space of definition handgrip (7) position p is ��2, and have ��2={ 0,1}; As p=0, represent that the position p of handgrip (7) of described mechanical arm (1) is in buffer memory storehouse (4), as p=1, represent that the position of handgrip (7) of described mechanical arm (1) is in warehouse for finished product (5);
Define the united state S of described system of processingm,pState space be ��, and have ��=��1����2;
Step 2, the former apparent distanceAs described system of processing at united state Sm,pUnder action, a steady control strategy v of described system of processing is all of united state to the mapping of action, and hasWherein Represent when the vacant amount in buffer memory storehouse (4) is m, the action of system of processing when the position of handgrip (7) is in buffer memory storehouse (4);Represent when the vacant amount in buffer memory storehouse (4) is m, the action of system of processing when the position of handgrip (7) is in warehouse for finished product (5); Definition All Policies v set be ��, ��=v | v=(v (1), v (2), v (3) ..., v (N)), v (i) �� D}; Described system of processing is absent from SM,0And S0,1The two state, then corresponding action is designated as NaN;
Step 3, definition TiFor the i & lt decision-making moment of described system of processing, make T0=0; Defining unloading operation time delay when described system of processing carries out the process of i & lt Markov markov property isService operations time delay is
When the i & lt of described mechanical arm (1) operates as unloading operation, then i+1 time decision-making moment Ti+1For:
T i + 1 = T i + m a x { t S m i , p i , t m i , p i l o a d } - - - ( 1 )
I+1 time united state is Xi+1=Sm-1,0, representing that handgrip (7) position is p=0, the vacant amount in buffer memory storehouse (4) is m-1;
I & lt unloading operation time delay is:
When the i & lt of described mechanical arm (1) operates as service operations, then i+1 time decision-making moment Ti+1For:
T i + 1 = T i + m a x { t S m i , p i , t m i , p i s e r } - - - ( 3 )
I+1 time united state is Xi+1=Sm+1,0; Representing that handgrip (7) position is p=0, the vacant amount in buffer memory storehouse (4) is m+1;
I & lt service operations time delay is:
Define half Markov core Qv(t) be:
Q v ( t ) = [ Q ( s m , p , v s m , p , s m ′ , p ′ , t ) ] s m , p , s m ′ , p ′ ∈ Φ = P v ⊗ F v ( t ) - - - ( 5 )
In formula (5),The operator that representing matrix is multiplied; T represents system time; PvRepresent Markov embedded chain transfer matrix under strategy v, FvT () represents the distribution of residence-time matrix under strategy v, and have:
P v = [ p s m , p s m ′ , p ′ · ( v s m , p ) ] s m , p s m ′ , p ′ ∈ Φ - - - ( 6 )
F v ( t ) = [ F s m , p s m ′ , p ′ ( t , v s m , p ) ] s m , p s m ′ , p ′ ∈ Φ - - - ( 7 )
In formula (6)It is from state Sm,pTake actionTransfer to Sm',p'Probability, in formula (7)It is at state Sm,pTake actionIt is S to NextStatem',p'Distribution of residence-time;
Step 4, definition expected performance function fvFor:
f v = [ f ( s m , p , v s m , p , s m ′ , p ′ ) ] s m , p , s m ′ , p ′ ∈ Φ - - - ( 8 )
In formula (8),Represent that system of processing is taken actionFrom united state Sm,pTransfer to NextState Sm',p'Unit interval expectation cost;
Step 5, foundation SMDP model as shown in formula (9):
X=(Xt,��,D,Qv(t),fv)(9)
In formula (9), XtRepresent the state procedure of t system of processing.
5. the working mechanism of the single site feed production and processing system of mechanically-based arm according to claim 3, is characterized in that:
Described Policy iteration algorithm sequentially includes the following steps:
Step 1, according to half Markov core Qv(t) and expected performance function fv, define Equivalent InfinitesimalWith performance matrix of equal valueAnd by SMDP model X=(Xt,��,D,Qv(t),fv) be converted to continuous time Markov decision making process�� is discount factor, �� �� [0,1];
Step 2, defined variable k, define vkKth time strategy for system of processing; Initialize k=0; And the 0th tactful v0For:
v 0 = l max N a N l max l max · l max · · · · l max · · l max · · · · l max · N a N l max - - - ( 10 )
Step 3, formula (11) is utilized to obtain kth time strategy vkPerformance potential vector
( α I - A α v k + ρ α eπ α v k ) g α v k = f α v k - - - ( 11 )
In formula (11); I is unit diagonal matrix; E is unit column vector; And have:
h α v k = ( I - Q α v k ) e / α - - - ( 12 )
ρ α = m a x { h α v k ( i ) - 1 , i = 1 , 2 ... N , v k ∈ Ω , α ∈ [ 0 , 1 ] } - - - ( 13 )
Represent the average waiting time at state i,Represent the steady-state distribution of kth time in Markov decision making process of equal value;And have:
A α v k π α v k = 0 π α v k e = 1 A α v k e = 0 ( 14 )
Step 4, utilize formula (15) obtain+1 tactful v of kthk+1:
v k + 1 ∈ arg m i n v ∈ Ω { A α v g α v k + f α v } - - - ( 15 )
Step 5, utilize formula (11) obtain+1 tactful v of kthk+1Performance potential vector
Step 6, judgementWhether set up, �� is a less constant; If setting up, then it represents that+1 tactful v of the kth obtainedk+1It is optimal control policy v*, and exit algorithm, otherwise; After k+1 is assigned to k, return step 4.
CN201610128381.0A 2016-03-07 2016-03-07 A kind of single site based on mechanical arm is fed the method for work of production and processing system Active CN105643625B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610128381.0A CN105643625B (en) 2016-03-07 2016-03-07 A kind of single site based on mechanical arm is fed the method for work of production and processing system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610128381.0A CN105643625B (en) 2016-03-07 2016-03-07 A kind of single site based on mechanical arm is fed the method for work of production and processing system

Publications (2)

Publication Number Publication Date
CN105643625A true CN105643625A (en) 2016-06-08
CN105643625B CN105643625B (en) 2017-08-25

Family

ID=56493203

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610128381.0A Active CN105643625B (en) 2016-03-07 2016-03-07 A kind of single site based on mechanical arm is fed the method for work of production and processing system

Country Status (1)

Country Link
CN (1) CN105643625B (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105904444A (en) * 2016-06-24 2016-08-31 张家港市兰航机械有限公司 Operation method of mechanical arm
CN106965180A (en) * 2017-04-13 2017-07-21 北京理工大学 The mechanical arm grabbing device and method of bottle on streamline
CN109870992A (en) * 2019-03-26 2019-06-11 合肥工业大学 A kind of CSPS system control method for not considering delay and waiting
CN111517112A (en) * 2020-04-21 2020-08-11 合肥工业大学 Improved TP + LCM full-lamination system and control method
CN114193440A (en) * 2022-01-04 2022-03-18 中船重工鹏力(南京)智能装备系统有限公司 Robot automatic grabbing system and method based on 3D vision
CN114872054A (en) * 2022-07-11 2022-08-09 深圳市麦瑞包装制品有限公司 Method for positioning robot hand for industrial manufacturing of packaging container

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050035182A1 (en) * 2003-08-11 2005-02-17 Niigata Seimitsu Co., Ltd. Circuit board transferring apparatus and method and solder ball mounting method
US7356378B1 (en) * 2007-04-03 2008-04-08 Taiwan Semiconductor Manufacturing Company Method and system for smart vehicle route selection
CN101788787A (en) * 2009-12-31 2010-07-28 合肥工业大学 Optimization control method of multi-site conveyor belt feeding production processing site system
CN102629108A (en) * 2012-04-19 2012-08-08 合肥工业大学 Optimization control method for multi-procedure conveyor belt feeding processing station system with flexible sites
CN104850117A (en) * 2015-05-19 2015-08-19 合肥工业大学 Robot automatic production line experiment platform and control method thereof

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050035182A1 (en) * 2003-08-11 2005-02-17 Niigata Seimitsu Co., Ltd. Circuit board transferring apparatus and method and solder ball mounting method
US7356378B1 (en) * 2007-04-03 2008-04-08 Taiwan Semiconductor Manufacturing Company Method and system for smart vehicle route selection
CN101788787A (en) * 2009-12-31 2010-07-28 合肥工业大学 Optimization control method of multi-site conveyor belt feeding production processing site system
CN102629108A (en) * 2012-04-19 2012-08-08 合肥工业大学 Optimization control method for multi-procedure conveyor belt feeding processing station system with flexible sites
CN104850117A (en) * 2015-05-19 2015-08-19 合肥工业大学 Robot automatic production line experiment platform and control method thereof

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
吴慧: ""CSPS模型基于学习的Look_ahead控制研究"", 《中国优秀硕士学位论文全文数据库(电子期刊)》 *
唐昊等: ""可变服务率模式下基于需求驱动的传送带给料加工站系统的优化控制"", 《控制理论与应用》 *
唐昊等: ""搬运系统作业分配问题的小脑模型关节控制器Q学习算法"", 《控制理论与应用》 *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105904444A (en) * 2016-06-24 2016-08-31 张家港市兰航机械有限公司 Operation method of mechanical arm
CN105904444B (en) * 2016-06-24 2018-08-17 张家港市兰航机械有限公司 A kind of operating method of mechanical arm
CN106965180A (en) * 2017-04-13 2017-07-21 北京理工大学 The mechanical arm grabbing device and method of bottle on streamline
CN109870992A (en) * 2019-03-26 2019-06-11 合肥工业大学 A kind of CSPS system control method for not considering delay and waiting
CN111517112A (en) * 2020-04-21 2020-08-11 合肥工业大学 Improved TP + LCM full-lamination system and control method
CN111517112B (en) * 2020-04-21 2021-09-24 合肥工业大学 Improved TP + LCM full-lamination system and control method
CN114193440A (en) * 2022-01-04 2022-03-18 中船重工鹏力(南京)智能装备系统有限公司 Robot automatic grabbing system and method based on 3D vision
CN114193440B (en) * 2022-01-04 2023-09-12 中船重工鹏力(南京)智能装备系统有限公司 Robot automatic grabbing system and method based on 3D vision
CN114872054A (en) * 2022-07-11 2022-08-09 深圳市麦瑞包装制品有限公司 Method for positioning robot hand for industrial manufacturing of packaging container

Also Published As

Publication number Publication date
CN105643625B (en) 2017-08-25

Similar Documents

Publication Publication Date Title
CN105643625A (en) Working mechanism of single-station feeding production processing system based on mechanical arm
US10766139B2 (en) Method for avoiding collisions between two robots
CN109926342B (en) Express efficient sorting system and sorting method thereof
EP3624997B1 (en) Method and control system for controlling movement sequences of a robot
WO2019056840A1 (en) Palletizing control device, system and method and storage medium
US6779647B1 (en) Method for automatically grouping articles
JP6659744B2 (en) Robot system
US10955828B2 (en) Method and a robot system for handling objects
CN107977738B (en) Multi-objective optimization control method for conveyor belt feeding processing station system
EP3525990A1 (en) A method and a system for controlling a velocity of a conveyance path
CN109500809B (en) Optimization of an automated process for selecting and grasping objects by a robot
CN102674003A (en) Production system, processed object manufacturing method, and container transferring method
CN109870992B (en) CSPS system control method without considering delay waiting
CN111282827A (en) Automatic workpiece feeding and quick sorting control method
CN112441360A (en) Dispatching system and dispatching method for manufacturing mold
CN109731989A (en) A kind of punching automation system of Multistation press line
CN110371633A (en) A kind of automatic Material Handling System
CN114511167A (en) Material handling equipment scheduling method, device and system and storage medium
CN212190122U (en) Automatic workpiece feeding device
CN211520768U (en) Cooling fin spraying and feeding system
CN208150353U (en) Fully automatic stereo formula sorting line
CN110860488A (en) Article sorting device and method for multiple stations
CN111517112B (en) Improved TP + LCM full-lamination system and control method
CN104828627A (en) Intelligent storehouse inbound device used for sheet material production line and method thereof
CN212399574U (en) Automatic vehicle body transfer system based on robot

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant