<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.nanobiodata.org/index.php?action=history&amp;feed=atom&amp;title=WEKA_Steps_for_Loading_Data</id>
	<title>WEKA Steps for Loading Data - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.nanobiodata.org/index.php?action=history&amp;feed=atom&amp;title=WEKA_Steps_for_Loading_Data"/>
	<link rel="alternate" type="text/html" href="https://wiki.nanobiodata.org/index.php?title=WEKA_Steps_for_Loading_Data&amp;action=history"/>
	<updated>2026-05-14T13:27:08Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.38.2</generator>
	<entry>
		<id>https://wiki.nanobiodata.org/index.php?title=WEKA_Steps_for_Loading_Data&amp;diff=108&amp;oldid=prev</id>
		<title>Sysadmin: Revised formatting</title>
		<link rel="alternate" type="text/html" href="https://wiki.nanobiodata.org/index.php?title=WEKA_Steps_for_Loading_Data&amp;diff=108&amp;oldid=prev"/>
		<updated>2022-09-26T20:39:18Z</updated>

		<summary type="html">&lt;p&gt;Revised formatting&lt;/p&gt;
&lt;a href=&quot;https://wiki.nanobiodata.org/index.php?title=WEKA_Steps_for_Loading_Data&amp;amp;diff=108&amp;amp;oldid=103&quot;&gt;Show changes&lt;/a&gt;</summary>
		<author><name>Sysadmin</name></author>
	</entry>
	<entry>
		<id>https://wiki.nanobiodata.org/index.php?title=WEKA_Steps_for_Loading_Data&amp;diff=103&amp;oldid=prev</id>
		<title>Sysadmin: Imported from text file</title>
		<link rel="alternate" type="text/html" href="https://wiki.nanobiodata.org/index.php?title=WEKA_Steps_for_Loading_Data&amp;diff=103&amp;oldid=prev"/>
		<updated>2022-09-26T19:21:31Z</updated>

		<summary type="html">&lt;p&gt;Imported from text file&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;__TOC__&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div title=&amp;quot;header&amp;quot;&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{|&lt;br /&gt;
|width=&amp;quot;33%&amp;quot;| &amp;lt;br /&amp;gt;&lt;br /&gt;
|width=&amp;quot;33%&amp;quot;| &amp;lt;br /&amp;gt;&lt;br /&gt;
|width=&amp;quot;33%&amp;quot;| &amp;lt;span style=&amp;quot;background: #c0c0c0&amp;quot;&amp;gt;0&amp;lt;/span&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
Steps for Loading Data into WEKA&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
ARFF format consists of three parts: @RELATION, @ATTRIBUTE and @DATA.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
@RELATION &amp;#039;&amp;#039;“SPACE”&amp;#039;&amp;#039; name&lt;br /&gt;
&lt;br /&gt;
@ATTRIBUTE &amp;#039;&amp;#039;“SPACE”&amp;#039;&amp;#039; descriptor name &amp;#039;&amp;#039;“SPACE”&amp;#039;&amp;#039; data type (numeric, nominal …)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;span id=&amp;quot;_GoBack&amp;quot;&amp;gt;&amp;lt;/span&amp;gt; @DATA: numbers (integer or real) or strings&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
General rules for ARFF file can be found here&lt;br /&gt;
&lt;br /&gt;
https://www.cs.waikato.ac.nz/ml/weka/arff.html&lt;br /&gt;
&lt;br /&gt;
(Search “ARFF File Format”)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
In Excel:&lt;br /&gt;
&lt;br /&gt;
* File 1: the format in this file is three columns for: “@ATTRIBUTE”, the descriptors’ names, and “NUMERIC”&lt;br /&gt;
** Open the txt data file in Excel. Make sure you are searching from “All Files”.&lt;br /&gt;
&lt;br /&gt;
[[File:WEKA_Steps_for_Loading_Data_HTML_4ec5c9075dd9cb51.png|512x288px]] [[File:WEKA_Steps_for_Loading_Data_HTML_ae35e05dbfaa6f7c.gif|31x38px|Shape1]]&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Figure &amp;lt;span style=&amp;quot;background: #c0c0c0&amp;quot;&amp;gt;1&amp;lt;/span&amp;gt;, Opening your data file&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
* When &amp;#039;&amp;#039;&amp;#039;Text Import Wizard&amp;#039;&amp;#039;&amp;#039; prompts, choose &amp;#039;&amp;#039;&amp;#039;Delimited and&amp;#039;&amp;#039;&amp;#039; click on &amp;#039;&amp;#039;&amp;#039;Next&amp;#039;&amp;#039;&amp;#039; (&amp;#039;&amp;#039;&amp;#039;step 1&amp;#039;&amp;#039;&amp;#039;), check the &amp;#039;&amp;#039;&amp;#039;Tab&amp;#039;&amp;#039;&amp;#039; box (&amp;#039;&amp;#039;&amp;#039;step 2&amp;#039;&amp;#039;&amp;#039;) and click on &amp;#039;&amp;#039;&amp;#039;Finish&amp;#039;&amp;#039;&amp;#039;. (Leave the rest as default unless it is necessary to change.)&lt;br /&gt;
&lt;br /&gt;
[[File:WEKA_Steps_for_Loading_Data_HTML_d288ed8722be4fdc.png|393x300px]] [[File:WEKA_Steps_for_Loading_Data_HTML_369e56d90bf80141.gif|32x31px|Shape3]] [[File:WEKA_Steps_for_Loading_Data_HTML_bef7f76831abec8b.gif|14x32px|Shape2]]&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Figure &amp;lt;span style=&amp;quot;background: #c0c0c0&amp;quot;&amp;gt;2&amp;lt;/span&amp;gt;, Step 1&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
[[File:WEKA_Steps_for_Loading_Data_HTML_cd41ce313e7118b1.png|379x288px]] [[File:WEKA_Steps_for_Loading_Data_HTML_859149fac76acd64.gif|30x19px|Shape5]] [[File:WEKA_Steps_for_Loading_Data_HTML_6092b35113c6658c.gif|19x33px|Shape4]]&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Figure &amp;lt;span style=&amp;quot;background: #c0c0c0&amp;quot;&amp;gt;3&amp;lt;/span&amp;gt;, Step 2&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
* Create three blank columns.&lt;br /&gt;
* Copy the descriptors’ names (they are at the first row of your data file, normally) and paste them in the vertical form by using &amp;#039;&amp;#039;&amp;#039;“Transpose”&amp;#039;&amp;#039;&amp;#039; pasting option &amp;#039;&amp;#039;&amp;#039;to the second column&amp;#039;&amp;#039;&amp;#039;.&lt;br /&gt;
* Make sure the &amp;#039;&amp;#039;&amp;#039;cell format&amp;#039;&amp;#039;&amp;#039; is &amp;#039;&amp;#039;&amp;#039;Text&amp;#039;&amp;#039;&amp;#039; before the next step.&lt;br /&gt;
&lt;br /&gt;
[[File:WEKA_Steps_for_Loading_Data_HTML_5e70e797a5c2cba0.png|313x156px]] [[File:WEKA_Steps_for_Loading_Data_HTML_634c3e41e8a46d53.gif|59x34px|Shape6]]&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Figure &amp;lt;span style=&amp;quot;background: #c0c0c0&amp;quot;&amp;gt;4&amp;lt;/span&amp;gt;, Cell format&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
* For the first and third columns: make equal number of rows as the second column has for “@ATTRIBUTE” and “NUMERIC”, respectively.&lt;br /&gt;
* Example:&lt;br /&gt;
&lt;br /&gt;
[[File:WEKA_Steps_for_Loading_Data_HTML_e8fa2497f6fc4487.png|198x337px]]&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Figure &amp;lt;span style=&amp;quot;background: #c0c0c0&amp;quot;&amp;gt;5&amp;lt;/span&amp;gt;, the three columns&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* File 2: Here we separate the @DATA part (only numbers) of data.&lt;br /&gt;
** Keep only the numbers needed and delete everything else.&lt;br /&gt;
** Save it in &amp;#039;&amp;#039;&amp;#039;CSV (comma delimited)&amp;#039;&amp;#039;&amp;#039; format. (&amp;#039;&amp;#039;&amp;#039;Click YES&amp;#039;&amp;#039;&amp;#039; when asks “Some features in your workbook … Do you want to keep using that format?”)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[File:WEKA_Steps_for_Loading_Data_HTML_d4bc2328d2c3083f.png|279x64px]] [[File:WEKA_Steps_for_Loading_Data_HTML_4fc01181394c3a93.gif|43x28px|Shape7]]&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Figure &amp;lt;span style=&amp;quot;background: #c0c0c0&amp;quot;&amp;gt;6&amp;lt;/span&amp;gt;, Save as CSV&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
In Notepad++:&lt;br /&gt;
&lt;br /&gt;
* File A:&lt;br /&gt;
** In the first row, create two columns: @RELATION and a title for the relation name (which are just separated by a space).&lt;br /&gt;
** &amp;#039;&amp;#039;(For aesthetics:&amp;#039;&amp;#039; leave the second row blank.)&lt;br /&gt;
** Copy the three columns in File 1 of Excel and paste into row3.&lt;br /&gt;
** Example:&lt;br /&gt;
&lt;br /&gt;
[[File:WEKA_Steps_for_Loading_Data_HTML_1f0af9e857ead597.png|247x204px]]&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Figure &amp;lt;span style=&amp;quot;background: #c0c0c0&amp;quot;&amp;gt;7&amp;lt;/span&amp;gt;, start of the File A&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
* Depending on your data, the last @ATTRIBUTE row will need to be the response variable AKA the thing you are trying to predict. (CATEGORICAL may require this, NUMERIC may not)&lt;br /&gt;
* After all ATTRIBUTE information has been pasted leave 1 row blank &amp;#039;&amp;#039;(For aesthetics&amp;#039;&amp;#039;)&lt;br /&gt;
* After blank row Type “@DATA”.&lt;br /&gt;
* &amp;#039;&amp;#039;(For aesthetics:&amp;#039;&amp;#039; For the next row after “@DATA”, leave it blank.)&lt;br /&gt;
* Example: [[File:WEKA_Steps_for_Loading_Data_HTML_ca4125bd73bd6170.png|460x172px]]&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Figure &amp;lt;span style=&amp;quot;background: #c0c0c0&amp;quot;&amp;gt;8&amp;lt;/span&amp;gt;, middle of File A&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* File B:&lt;br /&gt;
** Open File 2 from Notepad++ (and you should see the data are separated by commas).&lt;br /&gt;
** Example: [[File:WEKA_Steps_for_Loading_Data_HTML_964b046e1656a7f.png|555x225px]]&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Figure &amp;lt;span style=&amp;quot;background: #c0c0c0&amp;quot;&amp;gt;9&amp;lt;/span&amp;gt;, File 2 open with NotePad++&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
* Copy and paste all the data to File A.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
* Already returned to File A:&lt;br /&gt;
** Save the file as “arff” format (by adding “.arff” at the end of the file name).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;The file is ready to open and run in WEKA (yay).&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
PS: (1) “%” would give you error if included in the descriptors’ names, specifically such as &amp;#039;&amp;#039;&amp;#039;“%N” or “%O”&amp;#039;&amp;#039;&amp;#039;, after “@ATTRIBUTE”. The error would occur because whatever after “%” is considered as comment, then in WEKA, it would be interpreted as a missing information for the descriptor name and data type. Simply remove “%” would avoid errors. (2)Also make sure the number of descriptors matches with the number of data in the “@DATA” section. (If you have 10 lines of @ATTRIBUTE + descriptors’ names, there should have 10 numbers in each line in @DATA part in the Notepad++.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br /&amp;gt;&lt;br /&gt;
&amp;lt;br /&amp;gt;&lt;/div&gt;</summary>
		<author><name>Sysadmin</name></author>
	</entry>
</feed>