HOW TO BUILD THE EXECUTABLES AND RUN THE PROGRAM
- Unzip the archive at some location x
- Open a terminal and change directory to that location x
- Run the command make. This builds the executables
- Run export LD_LIBRARY_PATH=$(pwd) to add the current working directory (where the shared library was built) to the library search path
- Then run the desired executable with its arguments
- For the plot of cross-entropy, type make plot1
- For the plot of MSE, type make plot2
- For the comparison of the different backpropagation variants, type make plot3
- Type make run for the classification data set with tanh
- Type make run1 for the regression data set with tanh
- Type make run2 to demonstrate the vanishing gradient with the logistic activation
- Type make run3 to resolve the vanishing gradient with ReLU
- Type make run8 to see the accuracy on the normalized data set
- Type make run9 to see the accuracy on the standardized data set
- Type make run4 for classification in Python
- Type make run5 for regression in Python
- Type make run6 for your own data set
- Type make run7 for your own data set in Python
Problem Statement
- Implement a multi-layer perceptron (MLP) based neural network using C.
- The implementation should be a parameter-based library. The client or user can choose the activation function, cost function and backpropagation technique through arguments and then build the .so file. The activation function can be sigmoid, tanh, or ReLU; the backpropagation algorithm can be batch, stochastic, or mini-batch gradient descent; and the cost function can be cross-entropy or mean squared error (a hypothetical configuration sketch is shown after this list).
- Implement an MLP classifier using the scikit-learn library in Python.
- Design multiple multilayer perceptrons with different activation functions. Plot a loss-function graph, where the x-axis is the number of iterations and the y-axis is the loss value at each iteration.
- The loss-function graph should have multiple curves corresponding to the scikit-learn version of the MLP and to your implementation of the MLP classifier with the different activation functions (sigmoid, tanh, ReLU). You can have two versions of the loss-function graph, one with cross-entropy as the cost function and one with mean squared error.
- Record the accuracy of the classifier and state the observations in the report.
- Simulate the vanishing gradient problem using a suitable cost function, activation function and backpropagation algorithm. Can the vanishing gradient problem be eliminated? If yes, how; if no, why not?
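A rough idea of what such a parameter-based interface could look like is sketched below. This is a hypothetical illustration only: the type and field names (MlpConfig, ActivationType, and so on) are not taken from the actual submission; they simply show how the activation, cost function and gradient-descent variant could be selected through arguments before building the .so.

```c
#include <stdio.h>
#include <stddef.h>

/* Hypothetical sketch of a "parameter-based" MLP library interface.
 * None of these names come from the actual submission; they only show
 * how the activation, cost and gradient-descent choices could be passed
 * as arguments to the library built as a .so. */

typedef enum { ACT_SIGMOID, ACT_TANH, ACT_RELU } ActivationType;
typedef enum { COST_CROSS_ENTROPY, COST_MSE } CostType;
typedef enum { GD_BATCH, GD_STOCHASTIC, GD_MINI_BATCH } GradDescentType;

typedef struct {
    size_t          n_hidden_layers;   /* e.g. 3                       */
    size_t          neurons_per_layer; /* e.g. 30 or 100               */
    double          learning_rate;     /* e.g. 0.001                   */
    ActivationType  activation;
    CostType        cost;
    GradDescentType gd;
    size_t          batch_size;        /* used only for GD_MINI_BATCH  */
} MlpConfig;

int main(void)
{
    /* A client would fill this in and hand it to the (hypothetical)
     * constructor exported by the shared library. */
    MlpConfig cfg = {
        .n_hidden_layers   = 3,
        .neurons_per_layer = 30,
        .learning_rate     = 0.001,
        .activation        = ACT_TANH,
        .cost              = COST_CROSS_ENTROPY,
        .gd                = GD_MINI_BATCH,
        .batch_size        = 64,   /* the report suggests ~half the training-set size */
    };
    printf("layers=%zu neurons=%zu lr=%g\n",
           cfg.n_hidden_layers, cfg.neurons_per_layer, cfg.learning_rate);
    return 0;
}
```

Keeping all the choices in one plain struct keeps the .so interface small and lets the client switch activation, cost and descent variant without touching the library internals.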
Conclusion
- The vanishing gradient problem can be seen clearly with seven layers
- With 3 layers and the logistic activation, the vanishing gradient problem sometimes occurs, depending on the random initialization
- ReLU most of the time gives NaN because of range overflow
- The best hidden-layer activation function is tanh
- Increasing the number of neurons to 100 gives the best answer but increases the running time drastically
- The optimal number of neurons is found to be 30, which takes less time and also performs well
- The best number of hidden layers is found to be 3 with tanh
- The other activations need 1 or 2 layers for consistent results
- The modified ReLU gives the same results as the logistic activation.
- For the classification data set, accuracy ranges from 70% to 95% for tanh with a 0.001 learning rate and three layers of 100 neurons each.
- Accuracy with sigmoid ranges from 20% to 75% for three layers
- Tanh also gives the same accuracy
- The batch method gives the best results, but finding the batch size is difficult; the optimum is about half of the training-set size, which gives an accuracy of about 90%.
- The stochastic method gives results in fewer iterations, but the result is not always highly accurate; most of the time the accuracy is around 75%, and it can sometimes be biased and noisy.
- The basic method gives the highest accuracy, reaching about 90%
- The basic method gives high accuracy but again needs more time
- The normalized data set gives an accuracy of 98% on average.
- For classification, I have used cross-entropy
- With MSE in classification, the accuracy severely decreases, to 60% for tanh
- For the regression data set, the best model is again found to be tanh
- Accuracy ranges from 8% to 40% for tanh with three layers of 100 neurons each.
- Accuracy dips to 0%–20% for the other activation functions
- Regression gives an accuracy of 70% with a ±2 error tolerance and at most 40% with a ±1 error tolerance
- For other, relatively less complex data sets, accuracy turns out to range from 87% to 100%
- Increasing the number of layers to four increases accuracy for tanh
- Increasing the number of layers up to two increases accuracy for logistic and ReLU
- Increasing the number of neurons beyond 150 only has a negative effect
- The more neurons, the more time is needed for computation
- The Python computation time is relatively short compared to ours
- scikit-learn also computes ReLU in some cases (not all) before crossing the maximum bound
- For tanh and logistic, our library and the Python library perform similarly, considering average performance over long runs
- In a single run, either the Python implementation or mine may perform better
- The loss values for MSE and cross-entropy with the different activation functions, for both Python and my implementation, are given below
- For the normalized data set, both the Python and the C implementations work up to 4 layers with ReLU
- The normalized and the original data sets have almost the same accuracy
- The standardized data set performs somewhat worse than the normalized one
- I have not implemented any normalization or standardization functions; instead, I have used Weka for this
- For regression, ReLU does not work with 4 layers in Python
- For classification, ReLU works with 4 layers in Python
- Most of the time, the vanishing gradient problem is seen at 4 layers with logistic
- For the normalized data set, it can be seen clearly that running ReLU with 4 layers removes the vanishing gradient problem
- The loss function graphs are given below
- Note: in the Python plots, the straight line after some point is due to no significant change in the loss; the last value is simply copied for the remaining iterations
- ReLU for C is modified in both loss graphs because the original ReLU gives NaN due to very large values
- The modified ReLU divides both the output value and the derivative by 100000 (a short sketch is given after this list)
- Python gives NaN for ReLU with MSE, so there is no plot for it
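For reference, the scaling described in the last two items can be written as a plain scaled ReLU. This is a minimal, self-contained sketch rather than the exact routine from the library; the constant 100000 is the value stated above, and the function names are illustrative only.

```c
#include <stdio.h>

/* Scaled ("modified") ReLU used to avoid double overflow/NaN:
 * the positive output and its derivative are both divided by a fixed
 * constant, since d/dy (y / c) = 1 / c.  The constant 100000 comes from
 * the report; the function names are illustrative only. */
#define RELU_SCALE 100000.0

double relu_modified(double y)       { return y > 0.0 ? y / RELU_SCALE : 0.0; }
double relu_modified_deriv(double y) { return y > 0.0 ? 1.0 / RELU_SCALE : 0.0; }

int main(void)
{
    double y = 2.5e5;  /* a large pre-activation that would otherwise blow up */
    printf("relu_modified(%g) = %g, derivative = %g\n",
           y, relu_modified(y), relu_modified_deriv(y));
    return 0;
}
```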
Result
- I have made an MLP neural network library that performs all the significant functions of any other MLP library
- The library works similarly to the sklearn neural network module
- Weights and biases are randomly initialized (seeded with srand), so each run may give a different answer.
- The vanishing gradient problem can be seen with a high number of layers
- ReLU may not work correctly because of value overflow, so one possible workaround is to limit the range of ReLU by dividing its output by some fixed quantity. This changes its derivative as well.
- Suppose we divide the positive part of ReLU by x; then its derivative is also divided by x, because the derivative of f(y) = y/x is 1/x
- Softmax is used as the output activation function in my classifier (a minimal sketch is given after this list)
- The output function can also be changed
- The identity function is used as the output activation function in regression
- MSE is tested with the regression model, and cross-entropy with the classification model
- I have implemented three gradient-descent variants for backpropagation – basic, stochastic, and batch. The best performance is seen with basic (a toy comparison of the three update loops is given after this list)
- Batch also gives optimal performance if the batch size is chosen wisely.
- As mentioned above, I have implemented two loss functions (ways of error checking) – MSE and cross-entropy
- The logistic function is good as an output activation function, but it sometimes results in vanishing gradients even with a small number of layers.
- Tanh is found to be best in all cases.
- Over many runs, the performance of Python scikit-learn and of my C neural network is found to be the same for any data set.
- For small inputs and less complex data sets, the accuracy reaches 100%.
- NaN occurs in ReLU because the numbers exceed the range of a double.
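As a reference for the softmax output layer mentioned in the Result list, below is a minimal, numerically stabilised softmax in C. It is a generic sketch under the usual definition of softmax, not the exact routine used in my library; for regression the output activation is simply the identity, so no extra code is needed there.

```c
#include <stddef.h>
#include <stdio.h>
#include <math.h>

/* Numerically stable softmax: subtract the maximum logit before
 * exponentiating so exp() cannot overflow a double. */
void softmax(const double *z, double *out, size_t n)
{
    double max = z[0];
    for (size_t i = 1; i < n; i++)
        if (z[i] > max) max = z[i];

    double sum = 0.0;
    for (size_t i = 0; i < n; i++) {
        out[i] = exp(z[i] - max);
        sum += out[i];
    }
    for (size_t i = 0; i < n; i++)
        out[i] /= sum;
}

int main(void)
{
    double logits[3] = { 2.0, 1.0, 0.1 };
    double probs[3];
    softmax(logits, probs, 3);
    printf("%f %f %f\n", probs[0], probs[1], probs[2]);  /* sums to 1 */
    return 0;   /* compile with -lm */
}
```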
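The three gradient-descent variants named in the report ("basic", "stochastic", "batch") appear to correspond to full-batch, per-sample, and mini-batch updates; they differ only in how many samples contribute to each weight update. The toy program below, which fits a single weight of a linear model with an MSE loss, is a hypothetical, self-contained illustration of the three update loops, not code from the library (sample shuffling for the stochastic and mini-batch variants is omitted for brevity).

```c
#include <stdio.h>

/* Toy comparison of full-batch, stochastic and mini-batch gradient
 * descent on the one-weight model y = w * x with an MSE loss.  The real
 * library applies the same idea to full weight matrices via
 * backpropagation. */

#define N 8  /* number of training samples */

/* Mean gradient of (w*x_i - y_i)^2 over samples [start, start+count). */
static double grad_mse(double w, const double *x, const double *y,
                       int start, int count)
{
    double g = 0.0;
    for (int i = start; i < start + count; i++)
        g += 2.0 * (w * x[i] - y[i]) * x[i];
    return g / count;
}

int main(void)
{
    double x[N] = {1, 2, 3, 4, 5, 6, 7, 8};
    double y[N] = {2, 4, 6, 8, 10, 12, 14, 16};   /* true w = 2 */
    double lr = 0.01;

    /* basic / full-batch: one update per epoch using all N samples */
    double w = 0.0;
    for (int epoch = 0; epoch < 100; epoch++)
        w -= lr * grad_mse(w, x, y, 0, N);
    printf("full-batch  w = %f\n", w);

    /* stochastic: one update per sample */
    w = 0.0;
    for (int epoch = 0; epoch < 100; epoch++)
        for (int i = 0; i < N; i++)
            w -= lr * grad_mse(w, x, y, i, 1);
    printf("stochastic  w = %f\n", w);

    /* mini-batch: one update per batch; the report suggests a batch size
     * of about half the training-set size */
    w = 0.0;
    int batch = N / 2;
    for (int epoch = 0; epoch < 100; epoch++)
        for (int i = 0; i < N; i += batch)
            w -= lr * grad_mse(w, x, y, i, batch);
    printf("mini-batch  w = %f\n", w);

    return 0;
}
```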