Catalysis of neural activation functions: Adaptive feed-forward training for big data applications

Sagnik Sarkar, Shaashwat Agrawal, Thar Baker, Praveen Kumar Reddy Maddikunta, Thippa Reddy Gadekallu

Research output: Contribution to journal › Article › peer-review

Abstract

Deep learning has become essential for analyzing and perceiving trends in big data, and activation functions play a crucial role in the outcome of these deep learning frameworks. Existing activation functions are largely focused on translating data from one neural layer to the next. Although they have proven useful and give consistent results, they are static and mostly non-parametric. In this paper, we propose a new function for modified training of neural networks that is more flexible and adaptable to the data. The proposed catalysis function works over the Rectified Linear Unit (ReLU), sigmoid, tanh and all other activation functions to provide adaptive feed-forward training. The function uses vector components of the activation function to provide a variational flow of input. The performance of this algorithm is tested on the Modified National Institute of Standards and Technology (MNIST) and Canadian Institute for Advanced Research (CIFAR-10) datasets against conventional activation functions, using Visual Geometry Group (VGG) blocks and Residual Neural Network (ResNet) architectures. The proposed function shows significant improvements over the traditional functions, with 75 ± 2.5% accuracy across activation functions. The adaptive nature of training drastically decreases the probability of under-fitting, and the parameterization helps increase the data-learning capacity of models. A sensitivity analysis shows that the catalysis activation exhibits little or no change when the initialization parameters are varied.
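The abstract does not give the exact formulation of the catalysis function, but one plausible reading is a trainable wrapper composed over a conventional activation, so that the feed-forward pass can adapt to the data rather than remain static. The sketch below is purely illustrative: the class name `CatalysisActivation` and the parameters `alpha` and `beta` are assumptions, not the authors' definitions.

```python
import numpy as np

def relu(x):
    """Standard rectified linear unit."""
    return np.maximum(0.0, x)

class CatalysisActivation:
    """Hypothetical sketch: a parameterized wrapper over a base
    activation (ReLU, sigmoid, tanh, ...). The scalars `alpha` and
    `beta` stand in for trainable parameters that would be updated
    alongside the network weights, making the activation adaptive
    instead of fixed."""

    def __init__(self, base=relu, alpha=1.0, beta=0.0):
        self.base = base
        self.alpha = alpha  # scales the activated component
        self.beta = beta    # mixes in a linear (identity) component

    def __call__(self, x):
        x = np.asarray(x, dtype=float)
        return self.alpha * self.base(x) + self.beta * x

# With alpha=1 and beta=0 the wrapper reduces to the plain base activation.
cat = CatalysisActivation(base=relu, alpha=1.0, beta=0.0)
out = cat([-1.0, 2.0])
```

Because the wrapper only composes over the base function, the same mechanism applies unchanged to sigmoid or tanh, which is consistent with the abstract's claim that the catalysis function works across all activation functions.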
Original language: English
Pages (from-to): 13364–13383
Number of pages: 20
Journal: Applied Intelligence
Volume: 52
Issue number: 12
DOIs
Publication status: Published - 24 Mar 2022

Keywords

  • Activation function
  • Big data
  • Catalysis function
  • Neural networks
  • Rectified linear unit (ReLU)
