[prev in list] [next in list] [prev in thread] [next in thread] 

List:       wekalist
Subject:    [Wekalist] =?iso-8859-1?q?RE=3A_weka_load_error=3A_unable_to_det?= =?iso-8859-1?q?ermine_structure_a
From:       Hank Koerner <Hank.Koerner () varolii ! com>
Date:       2012-10-18 0:38:45
Message-ID: 3A3F7E961F74AD45B335195DFBC1FE830363747A () SN2PRD0410MB359 ! namprd04 ! prod ! outlook ! com
[Download RAW message or body]

Hi. I can't load a dataset. I receive the error "unable to determine structure as \
arff (reason: java.ioexception keyword @relation expected, read Token[], line 1)". \
I have googled this problem and not found a resolution.

I created the dataset by converting a.csv to .arff via Notepad, Wordpad, and now \
Notepad++, appending @relation, @attribute, @data, and comments. I have recreated the \
file with UTF-8 encoding, reading somewhere that was important. I have compared the \
header of this current dataset to sample datasets, which load successfully. I have \
compared the header of this current dataset to other datasets I've created that load \
successfully.

The header and first few lines of the data are shown below. The file size is ~540,000 \
instances of 44 attributes. I'm at a loss. Any help would be appreciated.

Thank you,

Hank Koerner

% Dataset of TWC early calling collections for FME Aug2012
%   includes attributes summarizing 13 months of collections behavior
%   H. Koerner, Varolii Corp., Oct2012

@relation Collections

@attribute ProductID string
@attribute CycleDate date "yyyy-MM-dd"
@attribute TREATMENT_ID \
{TREAT_110,TREAT_111,TREAT_112,TREAT_119,TREAT_120,TREAT_122,TREAT_124,TREAT_125,TREAT_126}
 @attribute DUNNING_GROUP {<200,>200,=200}
@attribute DRPs numeric
@attribute MOU numeric
@attribute Attempts numeric
@attribute Interact numeric
@attribute CURRBAL numeric
@attribute DELQ_AMT numeric
@attribute DELQ_DAYS numeric
@attribute Market {"Los Angeles","San Diego","National","Texas"}
@attribute Gender {F,M,U}
@attribute CABLE {Y,N}
@attribute BROADBAND {Y,N}
@attribute PHONE {Y,N}
@attribute numProducts numeric
@attribute MinServiceType {F,G,M,P,S,U}
@attribute Age numeric
@attribute Tenure numeric
@attribute NumPmts numeric
@attribute PmtAmt numeric
@attribute PmtThisMo numeric
@attribute BrokenPromThisMo numeric
@attribute DelqLast12Mos numeric
@attribute PmtsLast12Mos numeric
@attribute BrokenPromLast12Mos numeric
@attribute interactsLast12Mos numeric
@attribute DelqLastMo numeric
@attribute PmtsLastMo numeric
@attribute BrokenPromLastMo numeric
@attribute interactsLastMo numeric
@attribute PmtCycleDayLastMo numeric
@attribute minAge numeric
@attribute minTenure numeric
@attribute sumAttempts numeric
@attribute sumCost numeric
@attribute sumDRPs numeric
@attribute sumLateFees numeric
@attribute sumMOU numeric
@attribute avgCURRBAL numeric
@attribute avgDELQ_AMT numeric
@attribute avgPmtAmt numeric
@attribute avgPmtCycleDay numeric

@data
8260130020000157,"2012-07-09",TREAT_110,>200,0,0,0,0,183.92,91.96,43,"Texas",M,Y,N,N,1 \
,G,0,24.5,1,183.92,1,0,2,2,0,0,0,0,0,0,0,10.75,23.75,4,0.49575,3,5.99,4.5,176.82,88.41,176.82,35
 8260130020004621,"2012-07-07",TREAT_120,>200,0,0,0,0,334.93,163.23,47,"Texas",F,Y,Y,Y \
,3,G,0,5.75,2,363.72,1,0,5,5,0,1,1,1,0,0,34,10.5,4.75,15,1.457875,5,11.98,27.25,248.38,91.748,146.636,35
 8260130020010081,"2012-06-17",TREAT_120,>200,2,5.25,6,0,378.54,249.84,43,"Texas",F,Y, \
Y,Y,3,G,0,12,1,124.98,1,0,7,3,0,0,1,0,0,0,0,10.75,11.25,57,3.961125,13,5.99,70.75,259.2428,114.3342,69.86,31
 8260130020012749,"2012-07-08",TREAT_120,>200,0,0,0,0,382.07,177.15,47,"Texas",M,Y,Y,Y,3,G,0,13.5,1,382.07,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0.00,0,0.00,0.00,0.00,0




Hank Koerner : Director Analytics
Varolii Corporation  | 206.902.1459 | \
hank.koerner@varolii.com<mailto:hank.koerner@varolii.com>




This email and any files transmitted with it are confidential and intended solely for \
the use of the individual or entity to whom they are addressed. If you have received \
this email in error please notify the sender. 


[Attachment #3 (text/html)]

<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
<meta name="Generator" content="Microsoft Exchange Server">
<!-- converted from rtf -->
<style><!-- .EmailQuote { margin-left: 1pt; padding-left: 4pt; border-left: #800000 \
2px solid; } --></style> </head>
<body>
<font face="Calibri" size="2"><span style="font-size:11pt;">
<div>Hi. I can&#8217;t load a dataset. I receive the error &#8220;unable to determine \
structure as arff (reason: java.ioexception keyword @relation expected, read \
Token[], line 1)&#8221;. I have googled this problem and not found a resolution. \
</div> <div>&nbsp;</div>
<div>I created the dataset by converting a.csv to .arff via Notepad, Wordpad, and now \
Notepad&#43;&#43;, appending @relation, @attribute, @data, and comments. I have \
recreated the file with UTF-8 encoding, reading somewhere that was important. I have \
compared the header of this current dataset to sample datasets, which load \
successfully. I have compared the header of this current dataset to other datasets \
I&#8217;ve created that load successfully. </div> <div>&nbsp;</div>
<div>The header and first few lines of the data are shown below. The file size is \
~540,000 instances of 44 attributes. I&#8217;m at a loss. Any help would be \
appreciated. </div> <div>&nbsp;</div>
<div>Thank you,</div>
<div>&nbsp;</div>
<div>Hank Koerner</div>
<div>&nbsp;</div>
<div>% Dataset of TWC early calling collections for FME Aug2012</div>
<div>%&nbsp;&nbsp; includes attributes summarizing 13 months of collections \
behavior</div> <div>%&nbsp;&nbsp; H. Koerner, Varolii Corp., Oct2012</div>
<div>&nbsp;</div>
<div>@relation Collections</div>
<div>&nbsp;</div>
<div>@attribute ProductID string</div>
<div>@attribute CycleDate date &quot;yyyy-MM-dd&quot;</div>
<div>@attribute TREATMENT_ID \
{TREAT_110,TREAT_111,TREAT_112,TREAT_119,TREAT_120,TREAT_122,TREAT_124,TREAT_125,TREAT_126}</div>
 <div>@attribute DUNNING_GROUP {&lt;200,&gt;200,=200}</div>
<div>@attribute DRPs numeric</div>
<div>@attribute MOU numeric</div>
<div>@attribute Attempts numeric</div>
<div>@attribute Interact numeric</div>
<div>@attribute CURRBAL numeric</div>
<div>@attribute DELQ_AMT numeric</div>
<div>@attribute DELQ_DAYS numeric</div>
<div>@attribute Market {&quot;Los Angeles&quot;,&quot;San \
Diego&quot;,&quot;National&quot;,&quot;Texas&quot;}</div> <div>@attribute Gender \
{F,M,U}</div> <div>@attribute CABLE {Y,N}</div>
<div>@attribute BROADBAND {Y,N}</div>
<div>@attribute PHONE {Y,N}</div>
<div>@attribute numProducts numeric</div>
<div>@attribute MinServiceType {F,G,M,P,S,U}</div>
<div>@attribute Age numeric</div>
<div>@attribute Tenure numeric</div>
<div>@attribute NumPmts numeric</div>
<div>@attribute PmtAmt numeric</div>
<div>@attribute PmtThisMo numeric</div>
<div>@attribute BrokenPromThisMo numeric</div>
<div>@attribute DelqLast12Mos numeric</div>
<div>@attribute PmtsLast12Mos numeric</div>
<div>@attribute BrokenPromLast12Mos numeric</div>
<div>@attribute interactsLast12Mos numeric</div>
<div>@attribute DelqLastMo numeric</div>
<div>@attribute PmtsLastMo numeric</div>
<div>@attribute BrokenPromLastMo numeric</div>
<div>@attribute interactsLastMo numeric</div>
<div>@attribute PmtCycleDayLastMo numeric</div>
<div>@attribute minAge numeric</div>
<div>@attribute minTenure numeric</div>
<div>@attribute sumAttempts numeric</div>
<div>@attribute sumCost numeric</div>
<div>@attribute sumDRPs numeric</div>
<div>@attribute sumLateFees numeric</div>
<div>@attribute sumMOU numeric</div>
<div>@attribute avgCURRBAL numeric</div>
<div>@attribute avgDELQ_AMT numeric</div>
<div>@attribute avgPmtAmt numeric</div>
<div>@attribute avgPmtCycleDay numeric</div>
<div>&nbsp;</div>
<div>@data</div>
<div>8260130020000157,&quot;2012-07-09&quot;,TREAT_110,&gt;200,0,0,0,0,183.92,91.96,43 \
,&quot;Texas&quot;,M,Y,N,N,1,G,0,24.5,1,183.92,1,0,2,2,0,0,0,0,0,0,0,10.75,23.75,4,0.49575,3,5.99,4.5,176.82,88.41,176.82,35</div>
 <div>8260130020004621,&quot;2012-07-07&quot;,TREAT_120,&gt;200,0,0,0,0,334.93,163.23, \
47,&quot;Texas&quot;,F,Y,Y,Y,3,G,0,5.75,2,363.72,1,0,5,5,0,1,1,1,0,0,34,10.5,4.75,15,1.457875,5,11.98,27.25,248.38,91.748,146.636,35</div>
 <div>8260130020010081,&quot;2012-06-17&quot;,TREAT_120,&gt;200,2,5.25,6,0,378.54,249. \
84,43,&quot;Texas&quot;,F,Y,Y,Y,3,G,0,12,1,124.98,1,0,7,3,0,0,1,0,0,0,0,10.75,11.25,57,3.961125,13,5.99,70.75,259.2428,114.3342,69.86,31</div>
 <div>8260130020012749,&quot;2012-07-08&quot;,TREAT_120,&gt;200,0,0,0,0,382.07,177.15, \
47,&quot;Texas&quot;,M,Y,Y,Y,3,G,0,13.5,1,382.07,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0.00,0,0.00,0.00,0.00,0</div>
 <div>&nbsp;</div>
<div>&nbsp;</div>
<div>&nbsp;</div>
<div><font face="Arial" size="1" color="#E89419"><span style="font-size:8pt;"><b>Hank \
Koerner :</b><font color="#777777"><b> </b></font><font color="#606060"><b>Director \
Analytics</b></font><font color="#777777"><b> <br>

</b></font><font color="black"><b>Varolii Corporation</b></font><font \
color="#777777">&nbsp;</font><font color="navy"><b> |</b></font><font \
color="#777777">&nbsp;</font><font color="#606060"><b>206.902.1459</b></font><font \
face="Verdana" color="#777777"><b> </b></font><font color="#606060"><b>| \
</b></font><a href="mailto:hank.koerner@varolii.com"><font \
color="blue"><b><u>hank.koerner@varolii.com</u></b></font></a></span></font></div> \
<div>&nbsp;</div> <div>&nbsp;</div>
</span></font>
<br clear=all> <p><b><span lang="EN-GB" style="line-height: 125%; color: black">
<font size="1" face="arial,san-serif">This email and any files transmitted with it \
are confidential and  intended solely for the use of the individual or entity to whom \
they are  addressed. If you have received this email in error please notify the \
sender.</font></span></b></p> </body>
</html>



_______________________________________________
Wekalist mailing list
Send posts to: Wekalist@list.scms.waikato.ac.nz
List info and subscription status: https://list.scms.waikato.ac.nz/mailman/=
listinfo/wekalist
List etiquette: http://www.cs.waikato.ac.nz/~ml/weka/mailinglist_etiquette.=
html

--===============1993554444==--


[prev in list] [next in list] [prev in thread] [next in thread] 

Configure | About | News | Add a list | Sponsored by KoreLogic