Artificial Intelligence and Quantum Computing for Advanced Wireless Networks. Savo G. Glisic. Читать онлайн. Newlib. NEWLIB.NET

Информация о произведении:

Автор:	Savo G. Glisic
Издательство:	John Wiley & Sons Limited
Серия:
Жанр произведения:	Программы
Год издания:	0
isbn:	9781119790310

Скачать книгу

l plus 1 Baseline 2nd Row 1st Column 0 2nd Column otherwise period EndLayout"/>

Making all substitutions into Eq. (3.26), we get

(3.31)

where we have defined the vector

(3.32)

Schematic illustration of temporal backpropagation.

Figure 3.7 Temporal backpropagation.

Each term delta Subscript m Superscript l plus 1 Baseline left-parenthesis k right-parenthesis normal w Subscript italic j m Superscript l plus 1 within the sum corresponds to a reverse FIR filter. This is illustrated in Figure 3.7. The filter is drawn in such a way to emphasize the reversal of signal propagation through the FIR. Representing the forward propagation of states and the backward propagation of error terms requires simply reversing the direction of signal flow. In this process, unit delay operators q⁻¹ should be replaced with unit advances q⁺¹. The complete adaptation algorithm can be summarized as follows:

(3.33) normal upper Delta normal w Subscript italic i j Superscript l Baseline left-parenthesis k right-parenthesis equals minus mu delta Subscript j Superscript l plus 1 Baseline left-parenthesis k right-parenthesis normal a Subscript i Superscript l Baseline left-parenthesis k right-parenthesis

(3.34)

The bias weight normal w Subscript b Superscript l may again be adapted by letting normal a Subscript i Superscript l Baseline left-parenthesis normal k right-parenthesis equals 1 in Eq. (3.33). Observe the similarities between these equations and those for standard backpropagation. In fact, by replacing the vectors a, w, and δ by scalars, the previous equations reduce to precisely the backpropagation algorithm for static networks. Differences in the temporal version are due to implicit time relations. To find normal delta Subscript j Superscript l Baseline left-parenthesis normal k right-parenthesis , we filter the δ’s from the next layer backward through the FIR (see Figure 3.7). In other words, δ’s are created not only by taking weighted sums, but also by backward filtering. For each x(k) and desired vector d(k), the forward filters are incremented one time step, producing the current output y(k) and corresponding error e(k). Next, the backward filters are incremented one time step, advancing the δ(k) terms and allowing the filter coefficients to be updated. The process is then repeated for a new input at time k + 1.

The symmetry between the forward propagation of states and the backward propagation of error terms is preserved in temporal backpropagation. The number of operations per iteration now grows linearly with the number of layers and synapses in the network. This savings is due to the efficient recursive formulation. Each coefficient enters into the calculation only once, in contrast to the redundant use of terms when applying standard backpropagation to the unfolded network.

Design Example 3.1

As an illustration of the computations involved, we consider a simple network consisting of only two segments (cascaded linear FIR filters shown in Figure 3.8). The first segment is defined as

(3.35) u left-parenthesis k right-parenthesis equals sigma-summation Underscript i equals 0 Overscript upper M Endscripts a Subscript i Baseline x left-parenthesis k minus i right-parenthesis equals italic a x left-parenthesis k right-parenthesis period

For simplicity, the second segment is limited to only three taps:

(3.36) y left-parenthesis k right-parenthesis equals b 0 u left-parenthesis k right-parenthesis plus b 1 u left-parenthesis k minus 1 right-parenthesis plus b 2 u left-parenthesis k minus 2 right-parenthesis equals italic b u left-parenthesis k right-parenthesis period

Schematic illustration of oversimplified finite impulse response (FIR) network.

Figure 3.8 Oversimplified finite impulse response (FIR) network.

Here ( a is the vector of filter coefficient and should not be confused with the variable for the activation value used earlier). To adapt the filter coefficients, we evaluate the gradients ∂e²(k)/∂a and ∂e²(k)/∂b. For filter b, the desired response is available directly at the output of the filter of interest and the gradient is Скачать книгу