
List:       wekalist
Subject:    Re: [Wekalist] Training set vs Test set in classification
From:       Harm op den Akker <h.opdenakker () gmail ! com>
Date:       2012-11-27 9:14:27
Message-ID: CABGYEa=6hvSmQjwGWTFqWRmCT_yYG3aQM5v62JwT8U4tKS9sPQ () mail ! gmail ! com

On Mon, Nov 26, 2012 at 4:56 PM, Brian Sanjeewa Rupasinghe <
jinkabs@gmail.com> wrote:

> Hi,
>
> I am really confused with identifying these two data sets. To me, Training
> set has classified examples and test data set consists of unclassified
> examples. Yet I cannot use these two sets for classification with ID3. It
> gives a message saying "Data sets are not compatible". Could somebody
> point me out the real difference with an example or some supporting
> material?
>
> Cheers, Brian.
>
>
>
> _______________________________________________
> Wekalist mailing list
> Send posts to: Wekalist@list.scms.waikato.ac.nz
> List info and subscription status:
> https://list.scms.waikato.ac.nz/mailman/listinfo/wekalist
> List etiquette:
> http://www.cs.waikato.ac.nz/~ml/weka/mailinglist_etiquette.html
>
>
Hi Brian,

Both training and test sets should contain "classified examples". For
example:

Training set:

@relation FruitsTraining

@attribute class {banana,apple,orange}
@attribute color {yellow,green,brown,orange}
@attribute weight numeric

@data
banana,brown,120
banana,green,116
apple,green,80
...

Your test set then also contains this "class" attribute:

@relation FruitsTesting

@attribute class {banana,apple,orange}
@attribute color {yellow,green,brown,orange}
@attribute weight numeric

@data
apple,green,96
orange,orange,96
banana,yellow,112
...
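
The two headers above declare the same attributes, in the same order and
with the same types. As a rough illustration of what "compatible" means
here, the following Python sketch compares the @attribute lines of two
ARFF texts. It is not Weka's own check; the function names
arff_attributes and same_header are made up for this example, and the
parsing is simplified (it does not handle quoted attribute names).

```python
import re

def arff_attributes(text):
    """Return the (name, type) pairs declared by @attribute lines, in order."""
    attrs = []
    for line in text.splitlines():
        match = re.match(r"@attribute\s+(\S+)\s+(.+)", line.strip(), flags=re.IGNORECASE)
        if match:
            name, spec = match.groups()
            # Drop spaces so "{a, b}" and "{a,b}" compare as equal.
            attrs.append((name, spec.replace(" ", "")))
    return attrs

def same_header(train_text, test_text):
    """True when both texts declare identical attributes in identical order."""
    return arff_attributes(train_text) == arff_attributes(test_text)

TRAIN = """@relation FruitsTraining
@attribute class {banana,apple,orange}
@attribute color {yellow,green,brown,orange}
@attribute weight numeric
@data
banana,brown,120
"""

TEST = """@relation FruitsTesting
@attribute class {banana,apple,orange}
@attribute color {yellow,green,brown,orange}
@attribute weight numeric
@data
apple,green,96
"""

# Hypothetical test file with the "class" attribute removed.
TEST_NO_CLASS = """@relation FruitsTesting
@attribute color {yellow,green,brown,orange}
@attribute weight numeric
@data
green,96
"""

print(same_header(TRAIN, TEST))
print(same_header(TRAIN, TEST_NO_CLASS))
```

The relation name is deliberately ignored in this sketch, since the two
example files above use different relation names.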

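It sounds to me like you removed the "class" attribute from the test set.
That would leave the two files with different headers, which is probably
why you get the "Data sets are not compatible" message. Without the class
attribute there are also no true labels to compare the predictions
against, so you would not be able to calculate any classification results.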
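
To make the last point concrete, here is a minimal Python sketch of how
a result such as accuracy depends on the class values in the test set.
The predicted labels are invented for illustration (they are not ID3
output), and the accuracy helper is a hypothetical name, not part of
Weka.

```python
def accuracy(true_labels, predicted_labels):
    """Fraction of test instances whose predicted class equals the true class."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("need one prediction per test instance")
    if not true_labels:
        raise ValueError("test set is empty")
    correct = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return correct / len(true_labels)

# Data rows from the test set shown above; the class is the first field.
TEST_ROWS = ["apple,green,96", "orange,orange,96", "banana,yellow,112"]
true_labels = [row.split(",")[0] for row in TEST_ROWS]

# Made-up classifier output, for illustration only.
predicted = ["apple", "orange", "apple"]

score = accuracy(true_labels, predicted)
print(f"accuracy = {score:.2f}")
```

If the class column were missing from the test rows, true_labels could
not be built and there would be nothing to score the predictions against.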

Regards,

Harm op den Akker
