Car Evaluation Dataset Test


An example of a multivariate data type classification problem using Neuroph

by Tijana Jovanovic, Faculty of Organisation Sciences, University of Belgrade

An experiment for the Intelligent Systems course

 

Introduction

In this example we will be testing Neuroph with the Car Evaluation dataset, which can be found here. Several architectures will be tried out, and we will determine which ones represent a good solution to the problem and which ones do not.

First, here is some useful information about our Car Dataset:

Data Set Characteristics: Multivariate
Number of Instances: 1728
Attribute Characteristics: Categorical
Number of Attributes: 6
Associated Tasks: Classification

 

Introducing the problem

The Car Evaluation Database was derived from a simple hierarchical decision model.
The model evaluates cars according to the following concept structure:

CAR: car acceptability
. . PRICE: overall price
. . . . buying: buying price
. . . . maint: price of the maintenance
. . COMFORT: comfort
. . . . doors: number of doors
. . . . persons: capacity in terms of persons to carry
. . . . lug_boot: the size of the luggage boot
. . safety: estimated safety of the car

Six input attributes: buying, maint, doors, persons, lug_boot, safety.

Attribute Information:

Class Values: unacc, acc, good, vgood

Attributes: 

buying: vhigh, high, med, low. 
maint: vhigh, high, med, low. 
doors: 2, 3, 4, 5more. 
persons: 2, 4, more. 
lug_boot: small, med, big.
safety: low, med, high. 

For this experiment to work, we had to transform our data set into binary format (0, 1). We replaced each attribute value with a suitable binary combination.

For example, the attribute buying has 4 possible values: vhigh, high, med, low. Since these values are in String format, we had to transform each of them into a number format. In this case, each string value is replaced with a combination of 4 binary numbers (a code sketch of this encoding is shown after the list). The final transformation looks like this:

Attributes: 

buying: 1,0,0,0 instead of vhigh, 0,1,0,0 instead of high, 0,0,1,0 instead of med, 0,0,0,1 instead of low. 
maint: 1,0,0,0 instead of vhigh, 0,1,0,0 instead of high, 0,0,1,0 instead of med, 0,0,0,1 instead of low. 
doors: 0,0,0,1 instead of 2, 0,0,1,0 instead of 3, 0,1,0,0 instead of 4, 1,0,0,0 instead of 5more. 
persons: 0,0,1 instead of 2, 0,1,0 instead of 4, 1,0,0 instead of more. 
lug_boot: 0,0,1 instead of small, 0,1,0 instead of med, 1,0,0 instead of big.
safety: 0,0,1 instead of low, 0,1,0 instead of med, 1,0,0 instead of high. 
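To make the transformation concrete, here is a minimal Java sketch of this kind of one-hot encoding. It is not part of the original tutorial, and the class and method names are illustrative only:

```java
import java.util.Arrays;
import java.util.List;

public class OneHotEncoder {

    // Category order chosen to match the table above: vhigh -> 1,0,0,0 ... low -> 0,0,0,1
    static final List<String> BUYING = Arrays.asList("vhigh", "high", "med", "low");

    // Replace a string category with a binary vector that has a single 1
    // at the position of that category.
    static double[] oneHot(List<String> categories, String value) {
        double[] encoded = new double[categories.size()];
        encoded[categories.indexOf(value)] = 1.0;
        return encoded;
    }

    public static void main(String[] args) {
        System.out.println(Arrays.toString(oneHot(BUYING, "vhigh"))); // [1.0, 0.0, 0.0, 0.0]
        System.out.println(Arrays.toString(oneHot(BUYING, "med")));   // [0.0, 0.0, 1.0, 0.0]
    }
}
```

The same helper works for every attribute; only the category list changes.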

Transformed Dataset


In this example we will be using 80% of the data for training the network and 20% for testing it.
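As a rough illustration of how such a split could be prepared with the Neuroph 2.x DataSet API (a sketch under that assumption, not the tutorial's exact code; the file name is hypothetical):

```java
import java.util.Collections;
import java.util.List;

import org.neuroph.core.data.DataSet;
import org.neuroph.core.data.DataSetRow;

public class SplitExample {
    public static void main(String[] args) {
        // Load the transformed (binary) data set: 21 input columns,
        // 4 output columns, comma-separated. "car_transformed.txt" is
        // a hypothetical file name for the transformed dataset above.
        DataSet dataSet = DataSet.createFromFile("car_transformed.txt", 21, 4, ",");

        // Shuffle the rows, then use the first 80% for training and the rest for testing.
        List<DataSetRow> rows = dataSet.getRows();
        Collections.shuffle(rows);
        int trainCount = (int) (rows.size() * 0.8);

        DataSet trainingSet = new DataSet(21, 4);
        DataSet testSet = new DataSet(21, 4);
        for (int i = 0; i < rows.size(); i++) {
            (i < trainCount ? trainingSet : testSet).addRow(rows.get(i));
        }
    }
}
```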

Before you start reading about our experiment, we suggest that you first get more familiar with Neuroph Studio and the Multi Layer Perceptron. You can do that by clicking on the links below:

Neuroph Studio Getting Started

Multi Layer Perceptron

Training attempt 1

Here you can see the structure of our network, with its inputs, outputs, and hidden neurons in the middle layer.

Network Type: Multi Layer Perceptron
Training Algorithm: Backpropagation with Momentum
Number of inputs: 21
Number of outputs: 4 (unacc, acc, good, vgood)
Hidden neurons: 14

Training Parameters:
Learning Rate: 0.2
Momentum: 0.7
Max. Error: 0.01
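For readers who prefer code over Neuroph Studio, a network with this structure and these parameters could be built roughly like this. This is a sketch under the assumption of the Neuroph 2.x API; the class and file names are illustrative only:

```java
import org.neuroph.core.data.DataSet;
import org.neuroph.nnet.MultiLayerPerceptron;
import org.neuroph.nnet.learning.MomentumBackpropagation;
import org.neuroph.util.TransferFunctionType;

public class TrainingAttempt1 {
    public static void main(String[] args) {
        // The 80% training portion prepared earlier; the file name is hypothetical.
        DataSet trainingSet = DataSet.createFromFile("car_training.txt", 21, 4, ",");

        // Multi Layer Perceptron: 21 inputs, 14 hidden neurons, 4 outputs,
        // using the Sigmoid transfer function.
        MultiLayerPerceptron neuralNet =
                new MultiLayerPerceptron(TransferFunctionType.SIGMOID, 21, 14, 4);

        // Backpropagation with Momentum, configured with the parameters above.
        MomentumBackpropagation learningRule = new MomentumBackpropagation();
        learningRule.setLearningRate(0.2);
        learningRule.setMomentum(0.7);
        learningRule.setMaxError(0.01);
        neuralNet.setLearningRule(learningRule);

        // Train until the Total Net Error drops below 0.01.
        neuralNet.learn(trainingSet);
    }
}
```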

Training Results: 
For this training, we used the Sigmoid transfer function.

As you can see, the neural network took 33 iterations to train. The Total Net Error is an acceptable 0.0095.

The Total Net Error graph looks like this:

Practical Testing: 

The final part of testing this network is to try it with several input values. To do that, we will select 5 random rows from our data set. Those are:

Inputs (buying | maint | doors | persons | lug_boot | safety) → real outputs (unacc, acc, good, vgood):

1. 0,0,0,1 (vhigh) | 0,0,0,1 (vhigh) | 1,0,0,0 (2) | 1,0,0 (2) | 0,1,0 (med) | 0,1,0 (med) → 1, 0, 0, 0
2. 1,0,0,0 (low) | 1,0,0,0 (low) | 0,0,0,1 (5more) | 0,1,0 (4) | 0,0,1 (big) | 0,0,1 (high) → 0, 0, 0, 1
3. 1,0,0,0 (low) | 1,0,0,0 (low) | 0,0,0,1 (5more) | 0,1,0 (4) | 1,0,0 (small) | 1,0,0 (low) → 1, 0, 0, 0
4. 1,0,0,0 (low) | 1,0,0,0 (low) | 0,0,0,1 (5more) | 0,0,1 (more) | 1,0,0 (small) | 0,1,0 (med) → 0, 1, 0, 0
5. 1,0,0,0 (low) | 0,1,0,0 (med) | 0,0,0,1 (5more) | 0,1,0 (4) | 0,0,1 (big) | 0,1,0 (med) → 0, 0, 1, 0

The outputs the neural network produced for these inputs are, respectively:

Inputs (buying | maint | doors | persons | lug_boot | safety) → network outputs (unacc, acc, good, vgood):

1. 0,0,0,1 (vhigh) | 0,0,0,1 (vhigh) | 1,0,0,0 (2) | 1,0,0 (2) | 0,1,0 (med) | 0,1,0 (med) → 1, 0, 0, 0
2. 1,0,0,0 (low) | 1,0,0,0 (low) | 0,0,0,1 (5more) | 0,1,0 (4) | 0,0,1 (big) | 0,0,1 (high) → 0.0009, 0.0002, 0.0053, 0.9931
3. 1,0,0,0 (low) | 1,0,0,0 (low) | 0,0,0,1 (5more) | 0,1,0 (4) | 1,0,0 (small) | 1,0,0 (low) → 1, 0, 0.0001, 0
4. 1,0,0,0 (low) | 1,0,0,0 (low) | 0,0,0,1 (5more) | 0,0,1 (more) | 1,0,0 (small) | 0,1,0 (med) → 0.0033, 0.9965, 0.0025, 0
5. 1,0,0,0 (low) | 0,1,0,0 (med) | 0,0,0,1 (5more) | 0,1,0 (4) | 0,0,1 (big) | 0,1,0 (med) → 0.0002, 0.0006, 0.9973, 0.0016

The network guessed correctly in all five instances. After this test, we can conclude that this solution does not need to be rejected: it can be expected to give good results in most cases.
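In code, testing a single instance amounts to setting the 21 encoded input values, calculating the network, and reading the 4 output neurons. Here is a sketch for instance 1 from the table above, continuing from the training sketch earlier (again an assumption of the Neuroph 2.x API, not the tutorial's exact code):

```java
// Instance 1 from the table: vhigh, vhigh, 2, 2, med, med
double[] input = {
    0, 0, 0, 1,   // buying   = vhigh
    0, 0, 0, 1,   // maint    = vhigh
    1, 0, 0, 0,   // doors    = 2
    1, 0, 0,      // persons  = 2
    0, 1, 0,      // lug_boot = med
    0, 1, 0       // safety   = med
};

neuralNet.setInput(input);
neuralNet.calculate();

// Outputs in the order unacc, acc, good, vgood; for this instance
// the network produced approximately [1, 0, 0, 0].
double[] output = neuralNet.getOutput();
```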

In our next experiment we will be using the same network, but some of the parameters will be different, and we will see how the result changes.

Training attempt 2

Network Type: Multi Layer Perceptron
Training Algorithm: Backpropagation with Momentum
Number of inputs: 21
Number of outputs: 4 (unacc, acc, good, vgood)
Hidden neurons: 14

Training Parameters:
Learning Rate: 0.3
Momentum: 0.6
Max. Error: 0.01

Training Results: 
For this training, we used the Sigmoid transfer function.

As you can see, the neural network took 21 iterations to train. The Total Net Error is an acceptable 0.0098.

The Total Net Error graph looks like this:

Practical Testing: 

The only thing left is to put the random inputs stated above into the neural network. The results of the test are shown in the table below. The network guessed right in all five cases.

Inputs (buying | maint | doors | persons | lug_boot | safety) → network outputs (unacc, acc, good, vgood):

1. 0,0,0,1 (vhigh) | 0,0,0,1 (vhigh) | 1,0,0,0 (2) | 1,0,0 (2) | 0,1,0 (med) | 0,1,0 (med) → 1, 0, 0, 0
2. 1,0,0,0 (low) | 1,0,0,0 (low) | 0,0,0,1 (5more) | 0,1,0 (4) | 0,0,1 (big) | 0,0,1 (high) → 0, 0, 0, 0.9996
3. 1,0,0,0 (low) | 1,0,0,0 (low) | 0,0,0,1 (5more) | 0,1,0 (4) | 1,0,0 (small) | 1,0,0 (low) → 1, 0, 0, 0
4. 1,0,0,0 (low) | 1,0,0,0 (low) | 0,0,0,1 (5more) | 0,0,1 (more) | 1,0,0 (small) | 0,1,0 (med) → 0, 1, 0, 0
5. 1,0,0,0 (low) | 0,1,0,0 (med) | 0,0,0,1 (5more) | 0,1,0 (4) | 0,0,1 (big) | 0,1,0 (med) → 0, 0, 1, 0

As we can see from this table, the network classified every instance in this test correctly, with almost no error, so we can say that the second combination of parameters is even better than the first one.

In the next two attempts we will build a new neural network. The main difference will be the number of hidden neurons in its structure, and other parameters will also be changed.

Training attempt 3

Network Type: Multi Layer Perceptron
Training Algorithm: Backpropagation with Momentum
Number of inputs: 21
Number of outputs: 4 (unacc, acc, good, vgood)
Hidden neurons: 10

Training Parameters:
Learning Rate: 0.3
Momentum: 0.6
Max. Error: 0.01

Training Results: 
For this training, we used the Sigmoid transfer function.

As you can see, the neural network took 37 iterations to train. The Total Net Error is an acceptable 0.00995.

The Total Net Error graph looks like this:

Practical Testing: 

The final part of testing this network is to try it with several input values. To do that, we will select 5 random rows from our data set. Those are:

Inputs (buying | maint | doors | persons | lug_boot | safety) → real outputs (unacc, acc, good, vgood):

1. 0,0,0,1 (vhigh) | 0,0,0,1 (vhigh) | 1,0,0,0 (2) | 1,0,0 (2) | 0,1,0 (med) | 0,1,0 (med) → 1, 0, 0, 0
2. 1,0,0,0 (low) | 1,0,0,0 (low) | 0,0,0,1 (5more) | 0,1,0 (4) | 0,0,1 (big) | 0,0,1 (high) → 0, 0, 0, 1
3. 1,0,0,0 (low) | 1,0,0,0 (low) | 0,0,0,1 (5more) | 0,1,0 (4) | 1,0,0 (small) | 1,0,0 (low) → 1, 0, 0, 0
4. 1,0,0,0 (low) | 1,0,0,0 (low) | 0,0,0,1 (5more) | 0,0,1 (more) | 1,0,0 (small) | 0,1,0 (med) → 0, 1, 0, 0
5. 1,0,0,0 (low) | 0,1,0,0 (med) | 0,0,0,1 (5more) | 0,1,0 (4) | 0,0,1 (big) | 0,1,0 (med) → 0, 0, 1, 0

The outputs the neural network produced for these inputs are, respectively:

Inputs (buying | maint | doors | persons | lug_boot | safety) → network outputs (unacc, acc, good, vgood):

1. 0,0,0,1 (vhigh) | 0,0,0,1 (vhigh) | 1,0,0,0 (2) | 1,0,0 (2) | 0,1,0 (med) | 0,1,0 (med) → 1, 0.0001, 0, 0
2. 1,0,0,0 (low) | 1,0,0,0 (low) | 0,0,0,1 (5more) | 0,1,0 (4) | 0,0,1 (big) | 0,0,1 (high) → 0, 0.001, 0.0129, 0.986
3. 1,0,0,0 (low) | 1,0,0,0 (low) | 0,0,0,1 (5more) | 0,1,0 (4) | 1,0,0 (small) | 1,0,0 (low) → 1, 0, 0, 0
4. 1,0,0,0 (low) | 1,0,0,0 (low) | 0,0,0,1 (5more) | 0,0,1 (more) | 1,0,0 (small) | 0,1,0 (med) → 0.0033, 0.9935, 0.0045, 0
5. 1,0,0,0 (low) | 0,1,0,0 (med) | 0,0,0,1 (5more) | 0,1,0 (4) | 0,0,1 (big) | 0,1,0 (med) → 0, 0.0191, 0.9568, 0.0237

The network guessed correctly in all five instances. After this test, we can conclude that this solution does not need to be rejected: it can be expected to give good results in most cases.

In our next experiment we will be using the same network, but some of the parameters will be different, and we will see how the result changes.

Training attempt 4

Network Type: Multi Layer Perceptron
Training Algorithm: Backpropagation with Momentum
Number of inputs: 21
Number of outputs: 4 (unacc, acc, good, vgood)
Hidden neurons: 10

Training Parameters:
Learning Rate: 0.5
Momentum: 0.7
Max. Error: 0.01

Training Results: 
For this training, we used the Sigmoid transfer function.

As you can see, the neural network took 187 iterations to train. The Total Net Error is an acceptable 0.0084.

The Total Net Error graph looks like this:

Practical Testing: 

The only thing left is to put the random inputs stated above into the neural network. The results of the test are shown in the table below. The network guessed right in all five cases.

Inputs (buying | maint | doors | persons | lug_boot | safety) → network outputs (unacc, acc, good, vgood):

1. 0,0,0,1 (vhigh) | 0,0,0,1 (vhigh) | 1,0,0,0 (2) | 1,0,0 (2) | 0,1,0 (med) | 0,1,0 (med) → 1, 0, 0, 0
2. 1,0,0,0 (low) | 1,0,0,0 (low) | 0,0,0,1 (5more) | 0,1,0 (4) | 0,0,1 (big) | 0,0,1 (high) → 0, 0, 0, 1
3. 1,0,0,0 (low) | 1,0,0,0 (low) | 0,0,0,1 (5more) | 0,1,0 (4) | 1,0,0 (small) | 1,0,0 (low) → 1, 0, 0, 0
4. 1,0,0,0 (low) | 1,0,0,0 (low) | 0,0,0,1 (5more) | 0,0,1 (more) | 1,0,0 (small) | 0,1,0 (med) → 0, 1, 0.0094, 0
5. 1,0,0,0 (low) | 0,1,0,0 (med) | 0,0,0,1 (5more) | 0,1,0 (4) | 0,0,1 (big) | 0,1,0 (med) → 0, 0.0736, 0.9996, 0

Training attempt 5

This time we will be making more significant changes to the structure of our network. Now we will try to train a network with 5 neurons in its hidden layer.

Network Type: Multi Layer Perceptron
Training Algorithm: Backpropagation with Momentum
Number of inputs: 21
Number of outputs: 4 (unacc, acc, good, vgood)
Hidden neurons: 5

Training Parameters:
Learning Rate: 0.2
Momentum: 0.7
Max. Error: 0.01

Training Results:

We stopped the training of the network at this number of iterations because it was obvious that, in this case, the network was not going to be trained successfully and would not be able to learn the data from the set.
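When a configuration like this one clearly will not converge, one practical safeguard is to cap the number of learning iterations so training stops on its own. This is our assumption of a reasonable approach (building on the training sketch from attempt 1), not something the original tutorial shows:

```java
import org.neuroph.nnet.learning.MomentumBackpropagation;

// Same parameters as above, but with an iteration limit so that a
// non-converging run stops even if the max error is never reached.
MomentumBackpropagation learningRule = new MomentumBackpropagation();
learningRule.setLearningRate(0.2);
learningRule.setMomentum(0.7);
learningRule.setMaxError(0.01);
learningRule.setMaxIterations(10000); // hard stop after 10,000 iterations
neuralNet.setLearningRule(learningRule);
```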


The Total Net Error graph looks like this:

So the conclusion of this experiment is that the choice of the number of hidden neurons is crucial to the effectiveness of a neural network.

One of the "rules" for determining the correct number of neurons in the hidden layers is that the number of hidden neurons should be between the size of the input layer and the size of the output layer. The formula that we used looks like this: ((number of inputs + number of outputs) / 2) + 1. With 21 inputs and 4 outputs, this gives ((21 + 4) / 2) + 1 = 13.5, which rounds up to the 14 hidden neurons used in our first network. In that case we made a good network that showed great results. Then we made a network with fewer neurons in its hidden layer, and the results were not as good as before. So, in the next example, we are going to see how the network reacts to a greater number of hidden neurons.
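The same calculation in code form, as a tiny illustrative helper (our own code, not from the original article):

```java
// Rule of thumb used in this tutorial: ((inputs + outputs) / 2) + 1, rounded up.
static int suggestedHiddenNeurons(int inputs, int outputs) {
    return (int) Math.ceil((inputs + outputs) / 2.0 + 1);
}

// suggestedHiddenNeurons(21, 4) = ceil(13.5) = 14
```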

Training attempt 6

Network Type: Multi Layer Perceptron
Training Algorithm: Backpropagation with Momentum
Number of inputs: 21
Number of outputs: 4 (unacc, acc, good, vgood)
Hidden neurons: 17

Training Parameters:
Learning Rate: 0.2
Momentum: 0.7
Max. Error: 0.01

Training Results: 
For this training, we used the Sigmoid transfer function.

As you can see, the neural network took 23 iterations to train. The Total Net Error is an acceptable 0.0099.

The Total Net Error graph looks like this:

Practical Testing: 

The final part of testing this network is to try it with several input values. To do that, we will select 5 random rows from our data set. Those are:

Inputs (buying | maint | doors | persons | lug_boot | safety) → real outputs (unacc, acc, good, vgood):

1. 0,0,0,1 (vhigh) | 0,0,0,1 (vhigh) | 1,0,0,0 (2) | 1,0,0 (2) | 0,1,0 (med) | 0,1,0 (med) → 1, 0, 0, 0
2. 1,0,0,0 (low) | 1,0,0,0 (low) | 0,0,0,1 (5more) | 0,1,0 (4) | 0,0,1 (big) | 0,0,1 (high) → 0, 0, 0, 1
3. 1,0,0,0 (low) | 1,0,0,0 (low) | 0,0,0,1 (5more) | 0,1,0 (4) | 1,0,0 (small) | 1,0,0 (low) → 1, 0, 0, 0
4. 1,0,0,0 (low) | 1,0,0,0 (low) | 0,0,0,1 (5more) | 0,0,1 (more) | 1,0,0 (small) | 0,1,0 (med) → 0, 1, 0, 0
5. 1,0,0,0 (low) | 0,1,0,0 (med) | 0,0,0,1 (5more) | 0,1,0 (4) | 0,0,1 (big) | 0,1,0 (med) → 0, 0, 1, 0

The outputs the neural network produced for these inputs are, respectively:

Inputs (buying | maint | doors | persons | lug_boot | safety) → network outputs (unacc, acc, good, vgood):

1. 0,0,0,1 (vhigh) | 0,0,0,1 (vhigh) | 1,0,0,0 (2) | 1,0,0 (2) | 0,1,0 (med) | 0,1,0 (med) → 1, 0.0001, 0, 0
2. 1,0,0,0 (low) | 1,0,0,0 (low) | 0,0,0,1 (5more) | 0,1,0 (4) | 0,0,1 (big) | 0,0,1 (high) → 0.0002, 0.0001, 0.0073, 0.9946
3. 1,0,0,0 (low) | 1,0,0,0 (low) | 0,0,0,1 (5more) | 0,1,0 (4) | 1,0,0 (small) | 1,0,0 (low) → 0.9987, 0.0012, 0.0002, 0
4. 1,0,0,0 (low) | 1,0,0,0 (low) | 0,0,0,1 (5more) | 0,0,1 (more) | 1,0,0 (small) | 0,1,0 (med) → 0.0031, 0.9912, 0.0236, 0
5. 1,0,0,0 (low) | 0,1,0,0 (med) | 0,0,0,1 (5more) | 0,1,0 (4) | 0,0,1 (big) | 0,1,0 (med) → 0, 0.0191, 0.9568, 0.0237

As you can see, this number of hidden neurons, with an appropriate combination of parameters, also gave good results, and the network guessed all five instances.

Training attempt 7

Now we will see how the same network works with a different set of parameters.

Network Type: Multi Layer Perceptron
Training Algorithm: Backpropagation with Momentum
Number of inputs: 21
Number of outputs: 4 (unacc, acc, good, vgood)
Hidden neurons: 17

Training Parameters:
Learning Rate: 0.6
Momentum: 0.2
Max. Error: 0.02

Training Results: 
For this training, we used the Sigmoid transfer function.

As you can see, the neural network took 19 iterations to train. The Total Net Error is an acceptable 0.0189.

The Total Net Error graph looks like this:

Practical Testing: 

The only thing left is to put the random inputs stated above into the neural network. The results of the test are shown in the table. The network guessed right in all five cases.
