Introduction
To date, many
An able bodied subject’s wrist angle (the palmar/dorsi- flexion) was controlled with the FEL controller to track a desired trajectory by stimulating two muscles (ECRL/ECRB and FCU). Before conducting the experiments, some parameters in the controller were determined in advance by computer simulation using a forward model of the wrist. The model had been prepared by training an artificial neural network so as to imitate the response of the subject’s wrist angle to electrical stimulation. The results showed that the FEL controller performed better than the conventional PID controller in the evaluation of an average error for a fast movement (cycle period: 2s). Furthermore, the performance of the controller was improved when an Inverse Dynamics Model (IDM) in the FEL controller had been trained in advance with the forward model. Only a few trials of stimulation were required to track the trajectory with a small error value.
Methods
Fig.1 shows the block diagram of the FEL (Feedback Error Learning) controller, which combines the feedforward controller (Inverse Dynamics Model; IDM) and the feedback controller. The feedforward controller enables fast movement without delay, whereas the feedback controller has the advantage of high stability and high performance for disturbance compensation. The FEL controller has both advantages. The inverse dynamics model is trained on-line using the output of the feedback controller. Therefore, both the control and the training of the IDM are carried out at the same time (i.e. the IDM captures the inverse characteristics of limbs while controlling them). The feedback controller acts mainly at the beginning of the control, then gradually shifts the role to the feedforward controller as the learning progresses. In this study, the four-layered perceptron was used to express the IDM, which was learnt by the back-propagation technique. The output of the feedback controller was used as the error signal for back-propagation. The error signal was back propagated through the IDM at a hundred milli-seconds past state to cancel out the delay present in electrically stimulated muscles. The initial values of connection weights in the IDM were determined by a random number generator. The traditional PID controller was adopted as the feedback controller; however, the PID controller was improved to be able to control redundant musculoskeletal systems [3].

Experiments were conducted with the controller to evaluate its
control and learning performance. Before
the experiments, some parameters in the controller, such as the number of
neurons in the hidden layer of the IDM and the learning coefficient for
back-propagation, were determined in advance by computer simulation. A forward model of the subject’s wrist angle
response to electrical stimulation was used in the simulation study. The forward model had been prepared by
training a neural network using the measured response of a subject’s
wrist. The palmar/dorsi-flexion angle
was controlled by stimulating ECR (ECRL & ECRB) and FCU. The subject was an able bodied male who had
enough experience with electrical stimulation.
His ECR and FCU were stimulated simultaneously through a pair of Ag/AgCl surface electrodes.
The laboratory
Figure 1. The block diagram of the FEL (Feedback Error Learning) controller.
respectively. The subject was seated on a
chair with his left arm hanging (i.e. in the gravitational direction); the
shoulder, the elbow and the hand were free.
The subject was instructed to relax his left arm and hand as much as
possible. The neutral angle of the wrist
was defined at this posture. The desired trajectory
was ten reciprocating motion of 40 degrees in amplitude and 2s in cycle
period. The control by the FEL was
repeated up to 40 times. The control
performance was evaluated (mean error) by comparison with the PID controller
alone. Control by the FEL in which the
IDM had been well trained with the forward model was also carried out.
Results
One of the results is shown in Fig.2. Fig.2 (a) is the result of the first trial (the iteration number is 1) and Fig.2 (b) is the 24th trial. The upper graphs show the time courses of the wrist angle, and the lower graphs describe the stimulation current. When the PID controller was used alone, the results were almost the same as Fig.2 (a), in which the average error was 7.3[degree]. It can be seen that there is a large delay in the measured trajectory on the first iteration (Fig.2 (a)). After 23 training cycles, the delay decreased as observed in Fig.2 (b), resulting in a 55% decrease of the average error. It was thought that the performance improved because the IDM successfully obtained the inverse characteristics of the subject’s wrist. The change in the error and the power is shown in Fig.2 (c). In the graph, power is the ratio of the output power of the IDM to the total power of both the IDM and the PID controller. The error decreased and the power ratio increased with increased iterations. The change of the power shows that the main controller was changed gradually from the PID to the IDM. At the 24th iteration, the IDM held 80% of the control. These results show that the FEL controller functioned as expected and its control performance was superior to the conventional PID controller. However, the error increased when the iteration number exceeded 30.

Figure 2. Results of the wrist angle
control: (a) The time course of the wrist angle (upper)
and the stimulation (lower) at the first trial (The iteration number is
1). (b) The time course of the wrist
angle and the stimulation at the 24th trial. (c) The change of the average
tracking error and the power ratio (Power of the IDM / Total Power). “random” is the
result when the initial connection weight values in the IDM were determined by
a random number generator. “trained”: the IDM was trained in advance with the forward
model.
The gray lines labeled as “trained” in Fig.2 (c) show the result with the model-trained IDM. Note that the forward model was not precise as several days had passed after the model was developed and the surface electrodes had since been replaced. Nevertheless, the error was small and the power ratio was high from the first iteration. Moreover, the error was minimized in all trials with high power ratio after a few additional iterations.
Discussion
The experimental results show that the delay in movement and the mean tracking error decreased to less than half of the value of that obtained with traditional PID controllers, after only 20 to 30 iterations. Only a few iterations were required to control with the smallest error when the IDM had been trained in advance with the artificial forward model. The required iteration number is so small in both the cases that the controller is practical for clinical applications. In the experiment, the forward model was not precise because several days had passed after the model had been developed. The results may be better if the control experiments are conducted soon after the development of the forward model. Moreover, adjustment of the learning coefficient will be required for smoother convergence of the error.
When multiple joints are controlled
simultaneously (e.g. reaching movements, etc.), delay in one joint movement
yields failure in tracking the trajectories of the fingertips because of the
kinematics. Thus, the controller as
described in this paper, which can control limbs with little delay, is required
for accurate movement control of paralyzed limbs by
References
[1] Kawato, M., Feedback-error-learning neural network for supervised motor learning. Adv. Neural Comput., 1990. p.365-372.
[2] Vojislav D. Kalanovic, Dejan Popovic, and Nils T. Skaug, Feedback Error Learning Neural Network for Trans-Femoral Prosthesis. IEEE Trans Rehab Eng, 2000. 8(1): p.71-80.
[3] Watanabe, T., Iibuchi, K., Kurosawa,
K., Futami, R., and Hoshimiya,
N., A Method for Solving Ill-Posed Problem in Multichannel
Closed-Loop
[4] Kurosawa, K., Watanabe, T., Futami, R., Hoshimiya, N., and Handa, Y., Development of a Closed-loop FES System Using a 3-D Magnetic Position and Orientation Measurement System. J. Automatic Control, 2002. 12: p.23-30.
Acknowledgments: This study was partly supported by The Japan Society for the Promotion of Science under the Bilateral Program, Takayanagi Foundation for Electronics Science and Technology, and Miyagi Joint-Research Project for Regional Intensive. We would like to thank these organizations.