
List:       wekalist
Subject:    [Wekalist] Question regarding RF weights
From:       Nathaniel Smoker <ntjs2 () kent ! ac ! uk>
Date:       2020-11-18 15:04:41
Message-ID: DB7PR01MB5292A7BDCE8D22DA760230A2C3E10 () DB7PR01MB5292 ! eurprd01 ! prod ! exchangelabs ! com

I'm using the random forest classifier in Weka. When the algorithm selects
the best feature for a tree node based on the class impurity measure, it
calculates the information (entropy) of the class for each value of a
categorical feature. (In my case, all features are binary.) It then computes
the class impurity measure as a weighted sum of those information values,
where the weight for each feature value is the relative frequency of that
feature value in the dataset.
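
To make the quantity I mean concrete, here is a minimal sketch of the
calculation as I understand it. This is illustrative Python, not Weka's
actual Java code; the names entropy, weighted_impurity and weight_fn are
my own placeholders, and weight_fn only marks the spot I would like to
change.

```python
import math


def entropy(class_counts):
    """Shannon entropy (in bits) of a list of class counts."""
    total = sum(class_counts)
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log2(c / total)
                for c in class_counts if c > 0)


def weighted_impurity(counts_by_value, weight_fn=None):
    """Weighted sum of per-value class entropies for one feature.

    counts_by_value[v][k] is the number of instances that have feature
    value v and class k. By default the weight of value v is its
    relative frequency; weight_fn(value_total, grand_total) is a
    hypothetical hook standing in for the modified weight.
    """
    totals = [sum(counts) for counts in counts_by_value]
    n = sum(totals)
    if n == 0:
        return 0.0
    if weight_fn is None:
        weights = [t / n for t in totals]  # relative frequency
    else:
        weights = [weight_fn(t, n) for t in totals]
    return sum(w * entropy(c) for w, c in zip(weights, counts_by_value))


# Binary feature, two classes: rows are feature values 0 and 1.
counts = [[8, 2], [3, 7]]
default_impurity = weighted_impurity(counts)
```

The relative-frequency line inside weighted_impurity is the step whose
counterpart I am trying to locate in the Weka source.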

I want to modify the calculation of that weight for each feature value, but
I cannot find the part of the code where that weight is calculated.

Can you please tell me which part of the code handles this?









