RT info:eu-repo/semantics/article T1 Shallow neural network with kernel approximation for prediction problems in highly demanding data networks A1 López Martín, Manuel A1 Carro Martínez, Belén A1 Sánchez Esguevillas, Antonio Javier A1 Lloret, Jaime K1 Intrusion detection K1 Detección de intrusos K1 Shallow neural network K1 Red neuronal superficial K1 3325 Tecnología de las Telecomunicaciones AB Intrusion detection and network traffic classification are two of the main research applications of machine learning to highly demanding data networks e.g. IoT/sensors networks. These applications present new prediction challenges and strict requirements to the models applied for prediction. The models must be fast, accurate, flexible and capable of managing large datasets. They must be fast at the training, but mainly at the prediction phase, since inevitable environment changes require constant periodic training, and real-time prediction is mandatory. The models need to be accurate due to the consequences of prediction errors. They need also to be flexible and able to detect complex behaviors, usually encountered in non-linear models and, finally, training and prediction datasets are usually large due to traffic volumes. These requirements present conflicting solutions, between fast and simple shallow linear models and the slower and richer non-linear and deep learning models. Therefore, the perfect solution would be a mixture of both worlds. In this paper, we present such a solution made of a shallow neural network with linear activations plus a feature transformation based on kernel approximation algorithms which provide the necessary richness and non-linear behavior to the whole model. We have studied several kernel approximation algorithms: Nystrom, Random Fourier Features and Fastfood transformation and have applied them to three datasets related to intrusion detection and network traffic classification.This work presents the first application of a shallow linear model plus a kernel approximation to prediction problems with highly demanding network requirements. We show that the prediction performance obtained by these algorithms is positioned in the same range as the best non-linear classifiers, with a significant reduction in computational times, making them appropriate for new highly demanding networks. PB Elsevier SN 0957-4174 YR 2019 FD 2019 LK https://uvadoc.uva.es/handle/10324/54303 UL https://uvadoc.uva.es/handle/10324/54303 LA eng NO Expert Systems with Applications Volume 124, 2019, Pages 196-208 NO Producción Científica DS UVaDOC RD 17-may-2024