Figure 3.14 General locally recurrent–globally feedforward (LRGF) architecture.
Figure 3.15 An example of an Elman recurrent neural network (RNN).
Figure 3.16 An example of a Jordan recurrent neural network (RNN).

The following equations fully describe the RNN from Figure 3.17:

$$
\begin{aligned}
y_n(k) &= \Phi\big(v_n(k)\big), \qquad n = 1, \dots, N,\\
v_n(k) &= \sum_{l=1}^{p+N+1} w_{n,l}(k)\, u_l(k),\\
\mathbf{u}(k) &= \big[s(k-1), \dots, s(k-p),\; 1,\; y_1(k-1), \dots, y_N(k-1)\big]^{T},
\end{aligned} \tag{3.65}
$$

where the (p + N + 1) × 1 dimensional vector u(k) comprises both the external and feedback inputs to a neuron, as well as the unity-valued constant bias input.
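To make (3.65) concrete, the following is a minimal NumPy sketch of one forward step of such a fully connected RNN. The function name, the choice of tanh for Φ, and the particular values of p and N are illustrative assumptions, not taken from the text.

```python
import numpy as np

def rnn_step(W, s_taps, y_prev):
    """One forward step of the fully connected RNN in (3.65).

    W      : N x (p + N + 1) weight matrix (one row per neuron)
    s_taps : p past external inputs [s(k-1), ..., s(k-p)]
    y_prev : N previous outputs [y_1(k-1), ..., y_N(k-1)]
    Returns the N current outputs y(k) and the input vector u(k).
    """
    # u(k) = [external inputs, unity bias, feedback inputs], (p + N + 1) x 1
    u = np.concatenate([s_taps, [1.0], y_prev])
    v = W @ u                  # net inputs v_n(k), one per neuron
    return np.tanh(v), u       # Phi taken as tanh (an assumption)

# Example: p = 4 external taps, N = 3 fully connected neurons
p, N = 4, 3
rng = np.random.default_rng(0)
W = rng.normal(scale=0.1, size=(N, p + N + 1))
y, s_taps = np.zeros(N), np.zeros(p)
y, u = rnn_step(W, s_taps, y)  # y[0] is the output used for prediction
```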
Training: Here, we discuss training of the single fully connected RNN shown in Figure 3.17. For nonlinear time series prediction, only one output neuron of the RNN is used. Training of the RNN is based on minimizing the instantaneous squared error at the output of the first neuron, which can be expressed as
$$
E(k) = \tfrac{1}{2}\, e^2(k) = \tfrac{1}{2}\,\big[s(k) - y_1(k)\big]^2, \tag{3.66}
$$
where e(k) denotes the error at the output y₁ of the RNN and s(k) is the training signal. Hence, the correction for the l‑th weight of neuron n at the time instant k is
$$
\Delta w_{n,l}(k) = -\eta\, \frac{\partial E(k)}{\partial w_{n,l}(k)} = -\eta\, e(k)\, \frac{\partial e(k)}{\partial w_{n,l}(k)}. \tag{3.67}
$$
Figure 3.17 A fully connected recurrent neural network (RNN; Williams–Zipser network). The neurons (nodes) are depicted by circles and incorporate the operation Φ applied to the sum of their inputs.
Since the external signal vector s does not depend on the elements of W, the error gradient becomes $\partial e(k)/\partial w_{n,l}(k) = -\,\partial y_1(k)/\partial w_{n,l}(k)$. Using the chain rule gives
$$
\frac{\partial y_j(k)}{\partial w_{n,l}(k)} = \Phi'\big(v_j(k)\big)\left[\,\sum_{\alpha=1}^{N} w_{j,\alpha+p+1}(k)\,\frac{\partial y_\alpha(k-1)}{\partial w_{n,l}(k)} + \delta_{nj}\, u_l(k)\right], \tag{3.68}
$$
where the Kronecker delta $\delta_{nj} = 1$ if n = j and 0 otherwise. When the learning rate η is sufficiently small, we have $\partial y_\alpha(k-1)/\partial w_{n,l}(k) \approx \partial y_\alpha(k-1)/\partial w_{n,l}(k-1)$. By introducing the notation
$$
\pi_{n,l}^{\,j}(k) \equiv \frac{\partial y_j(k)}{\partial w_{n,l}(k)}, \tag{3.69}
$$
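The recursion in (3.68) together with the correction (3.67) is the essence of real-time recurrent learning (RTRL) for this network. As a rough illustration, one possible NumPy sketch of a single RTRL weight update is given below; the function name, the N × N × (p + N + 1) storage layout for the sensitivities π, and the tanh nonlinearity are assumptions made here for concreteness, not the book's own implementation.

```python
import numpy as np

def rtrl_update(W, Pi, u, y, s_k, eta):
    """One RTRL weight update for the network of Figure 3.17.

    W   : N x (p + N + 1) weights
    Pi  : N x N x (p + N + 1) sensitivities from the previous step,
          Pi[j, n, l] = dy_j(k-1) / dw_{n,l}, as in (3.69)
    u   : current input vector u(k) (external taps, bias, feedback)
    y   : current outputs y(k) = tanh(v(k))
    s_k : training signal s(k);  eta : learning rate
    """
    N, cols = W.shape
    p = cols - N - 1
    e = s_k - y[0]                    # e(k) = s(k) - y_1(k), Eq. (3.66)
    Wf = W[:, p + 1:]                 # feedback weights w_{j, alpha+p+1}
    dPhi = 1.0 - y**2                 # Phi'(v(k)) for Phi = tanh
    # Recursion (3.68): propagate old sensitivities through the feedback
    # weights and add the direct term delta_{nj} * u_l(k)
    Pi_new = np.einsum('ja,anl->jnl', Wf, Pi)
    Pi_new += np.einsum('jn,l->jnl', np.eye(N), u)
    Pi_new *= dPhi[:, None, None]
    # Correction (3.67): dw_{n,l}(k) = eta * e(k) * dy_1(k)/dw_{n,l}(k)
    W += eta * e * Pi_new[0]
    return W, Pi_new, e
```

In a training loop, each time step would first advance the network with a forward step such as rnn_step above and then call rtrl_update with the same u(k) and y(k), carrying the sensitivity array Pi from one step to the next.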