which implies that Fw is a contraction map (w.r.t. ‖·‖1) for any set of parameters w.
1. Nonlinear (nonpositional) GNN. In this case, hw is realized by a multilayered feedforward NN. Since three-layered neural networks are universal approximators [67], hw can approximate any desired function. However, not all the parameters w can be used, because it must be ensured that the corresponding transition function Fw is a contraction map. This can be achieved by adding a penalty term to Eq. (5.79), that is, minimizing ew + βL(‖∂Fw/∂x‖1), where the penalty term L(y) is (y − μ)2 if y > μ and 0 otherwise, β > 0 weights the penalty against the error, and the parameter μ ∈ (0, 1) defines the desired contraction constant of Fw. More generally, the penalty term can be any expression, differentiable with respect to w, that is monotone increasing with respect to the norm of the Jacobian. For example, in our experiments, we use the penalty term Σi L(‖Ai‖1), where Ai is the i-th column of ∂Fw/∂x. In fact, such an expression is an approximation of L(‖∂Fw/∂x‖1) = L(maxi ‖Ai‖1), since the induced 1-norm of a matrix is its maximal column 1-norm.
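As an illustration, the column-norm penalty described above can be sketched in a few lines of NumPy (a minimal sketch; the function name contraction_penalty and the sample Jacobians are our own illustrative choices, not part of the original model):

```python
import numpy as np

def contraction_penalty(J, mu=0.9):
    """Penalty sum_i L(||A_i||_1) over the columns A_i of the Jacobian
    J = dF_w/dx, where L(y) = (y - mu)**2 if y > mu and 0 otherwise."""
    col_norms = np.abs(J).sum(axis=0)         # ||A_i||_1, one value per column
    excess = np.maximum(col_norms - mu, 0.0)  # only columns exceeding mu count
    return float(np.sum(excess ** 2))

# Max column sum (the induced 1-norm) below mu: no penalty.
J_contractive = np.array([[0.2, 0.1],
                          [0.1, 0.2]])
print(contraction_penalty(J_contractive))  # → 0.0

# First column sums to 1.2 > mu = 0.9: penalized by (1.2 - 0.9)**2.
J_expansive = np.array([[0.8, 0.0],
                        [0.4, 0.2]])
print(contraction_penalty(J_expansive))
```

Only columns whose 1-norm exceeds μ contribute, so the penalty vanishes exactly when maxi ‖Ai‖1 ≤ μ, i.e., when Fw is already a contraction with constant μ.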
5.3.2 Computational Complexity
Here, we analyze the computational cost of GNNs. The analysis focuses on three different GNN models: positional GNNs, where the functions fw and gw of Eq. (5.74) are implemented by FNNs; linear (nonpositional) GNNs; and nonlinear (nonpositional) GNNs.
First, we will describe in more detail the most expensive instructions involved in the learning procedure (see Table 5.2, reproduced from [1]). Then, the complexity of the learning algorithm will be defined. For the sake of simplicity, the cost is derived assuming that the training set contains just one graph G. Such an assumption does not cause any loss of generality, since the graphs of the training set can always be merged into a single graph. The complexity is measured by the order of floating point operations. Following the common definition of time complexity, an algorithm is said to require O(l(a)) operations if there exist α > 0 and ā ≥ 0 such that c(a) ≤ αl(a) holds for every a ≥ ā, where c(a) is the maximal number of operations executed by the algorithm when the input has dimension a.
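The merging of the training set into a single graph mentioned above is simply a disjoint union of the graphs; a sketch, assuming adjacency matrices as input (merge_graphs is a hypothetical helper name):

```python
import numpy as np

def merge_graphs(adjacency_list):
    """Disjoint union of graphs: place each adjacency matrix as a block
    on the diagonal of a larger matrix. No edges connect different
    graphs, so states and outputs are computed exactly as before."""
    n_total = sum(a.shape[0] for a in adjacency_list)
    merged = np.zeros((n_total, n_total))
    offset = 0
    for a in adjacency_list:
        n = a.shape[0]
        merged[offset:offset + n, offset:offset + n] = a
        offset += n
    return merged

g1 = np.array([[0, 1], [1, 0]])                    # 2-node graph
g2 = np.array([[0, 1, 0], [0, 0, 1], [1, 0, 0]])   # 3-node graph
merged = merge_graphs([g1, g2])
print(merged.shape)  # → (5, 5)
```

Because the union introduces no cross-graph edges, running the GNN on the merged graph produces the same node states and outputs as running it on each graph separately.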
We will assume that there exist two procedures FP and BP, which implement the forward phase and the backward phase of the backpropagation procedure, respectively. Formally, given a function lw : Ra → Rb implemented by an FNN, FP(lw, x) returns the output y = lw(x), while BP, given x and the derivative ∂ew/∂y of the cost with respect to the output, returns ∂ew/∂x and ∂ew/∂w by the chain rule.
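As a concrete, simplified instance of these two procedures, consider a single-layer network lw(x) = tanh(Wx + c); FP and BP can then be written directly (our own minimal sketch, not the reference implementation):

```python
import numpy as np

def FP(W, c, x):
    """Forward phase of a one-layer FNN: y = l_w(x) = tanh(W x + c)."""
    return np.tanh(W @ x + c)

def BP(W, c, x, de_dy):
    """Backward phase: given de/dy, return (de/dx, de/dW, de/dc)
    by the chain rule through y = tanh(W x + c)."""
    z = W @ x + c
    delta = de_dy * (1.0 - np.tanh(z) ** 2)  # de/dz
    de_dx = W.T @ delta                      # propagate the error to the input
    de_dW = np.outer(delta, x)               # gradient w.r.t. the weights
    de_dc = delta                            # gradient w.r.t. the bias
    return de_dx, de_dW, de_dc

W = np.array([[0.1, 0.2],
              [0.3, 0.4]])
c = np.zeros(2)
x = np.array([1.0, 2.0])
y = FP(W, c, x)
de_dx, de_dW, de_dc = BP(W, c, x, np.ones(2))  # cost e = sum of the outputs
```

For W ∈ Rb×a both procedures run in O(ab) operations, so one backward phase costs the same order as one forward phase.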
Table 5.2 Time complexity of the most expensive instructions of the learning algorithm. For each instruction and each GNN model, a bound on the order of floating point operations is given. The table also displays the number of times per epoch that each instruction is executed.
Source: Scarselli et al. [1].
Instruction | Positional | Nonlinear | Linear | Execs.
---|---|---|---|---
z(t + 1) = z(t) ⋅ A + b | s²∣E∣ | s²∣E∣ | s²∣E∣ | itb
o = Gw(x(t), lw) | | | | 1
x(t + 1) = Fw(x(t), l) | | | s²∣E∣ | itf
∂ew/∂o | ∣N∣ | ∣N∣ | ∣N∣ | 1
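To see where the s²∣E∣ bound for the linear model comes from, note that the iteration x(t + 1) = Fw(x(t), l) touches one s × s block per edge. A sketch with placeholder block values (in the actual model the blocks and biases are produced by networks of the node and edge labels; the constants below are arbitrary illustrative choices):

```python
import numpy as np

s = 2                               # state dimension per node
edges = [(0, 1), (1, 2), (2, 0)]    # (target n, source u) pairs
# One s-by-s block per edge; placeholder values chosen so that the
# overall map is a contraction (every column sum is 0.2 < 1).
blocks = {e: np.full((s, s), 0.1) for e in edges}
b = {n: np.ones(s) for n in range(3)}  # per-node bias term

def linear_iteration(x):
    """One step x(t+1) = A x(t) + b, computed edge by edge.
    Each edge costs O(s^2) operations, so a step costs O(s^2 |E|)."""
    x_new = {n: b[n].copy() for n in x}
    for (n, u) in edges:
        x_new[n] += blocks[(n, u)] @ x[u]
    return x_new

x = {n: np.zeros(s) for n in range(3)}
for _ in range(10):        # iterate toward the fixed point
    x = linear_iteration(x)
print(x[0])                # each component approaches 1.25 = 1 / (1 - 0.2)
```

Because the placeholder blocks make the map a contraction, a few iterations reach the fixed point; each iteration costs O(s²∣E∣), matching the s²∣E∣ entries quoted above for the linear model.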