<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.nanobiodata.org/index.php?action=history&amp;feed=atom&amp;title=Statistical_Machine_Learning</id>
	<title>Statistical Machine Learning - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.nanobiodata.org/index.php?action=history&amp;feed=atom&amp;title=Statistical_Machine_Learning"/>
	<link rel="alternate" type="text/html" href="https://wiki.nanobiodata.org/index.php?title=Statistical_Machine_Learning&amp;action=history"/>
	<updated>2026-05-14T11:10:14Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.38.2</generator>
	<entry>
		<id>https://wiki.nanobiodata.org/index.php?title=Statistical_Machine_Learning&amp;diff=151&amp;oldid=prev</id>
		<title>Sysadmin: Added LLM section</title>
		<link rel="alternate" type="text/html" href="https://wiki.nanobiodata.org/index.php?title=Statistical_Machine_Learning&amp;diff=151&amp;oldid=prev"/>
		<updated>2023-03-20T11:14:42Z</updated>

		<summary type="html">&lt;p&gt;Added LLM section&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 11:14, 20 March 2023&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l215&quot;&gt;Line 215:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 215:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Take high throughput data and simplify before work is done &amp;#039;&amp;#039;ex&amp;#039;&amp;#039;: 1080p @ 60fps &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;\rightarrow&amp;lt;/math&amp;gt; 240x360 @ 15fps, broken into component channels Gaussian blur filter kernel applied to high-res images Edge detection via double threshold ML or CNN used past this to determine actual features&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Take high throughput data and simplify before work is done &amp;#039;&amp;#039;ex&amp;#039;&amp;#039;: 1080p @ 60fps &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;\rightarrow&amp;lt;/math&amp;gt; 240x360 @ 15fps, broken into component channels Gaussian blur filter kernel applied to high-res images Edge detection via double threshold ML or CNN used past this to determine actual features&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&amp;lt;/div&amp;gt;&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt; &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;==== Large Language Models ====&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;Large language models (LLMs) are stochastic systems which attempt to capture the &amp;#039;shape&amp;#039; of a language (a recursively enumerable language by [[wikipedia:Chomsky_hierarchy|Chomsky Hierarchy]]) by pulling successive tokens from a &amp;#039;&amp;#039;&amp;#039;bag of words model&amp;#039;&amp;#039;&amp;#039;; in this system, a corpus of text is transformed into a mathematical space of the tokens, which are substrings of the input language, and weights, which are a cosine similarity between tokens in a Hilbert space. In other words, a large language model is a function which descends the gradient of this space, and uses probability to arrive at what word goes next in a given completion. This mathematical loss minimization is performed sequentially as the text is generated, in what is known as &amp;#039;&amp;#039;&amp;#039;self-attention&amp;#039;&amp;#039;&amp;#039;, which relates different positions of text sequence in order to compute a representation of the sequence. This function takes the form:&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt; &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&amp;lt;math display=&amp;quot;block&amp;quot;&amp;gt;\sum_{t=1}^T P(x_t | \vec{x}_{&amp;lt;t} \vec{x}_{i:j} )&amp;lt;/math&amp;gt;&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt; &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;where tokens are generated until limit &amp;#039;&amp;#039;&amp;#039;T&amp;#039;&amp;#039;&amp;#039; is reached, and where there is a probability of another token generated as a function of previous tokens and the input. This functional setup is called an &amp;#039;&amp;#039;&amp;#039;encoder-decoder&amp;#039;&amp;#039;&amp;#039;. &lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt; &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;So, based on the corpus, and given an input, an encoder-decoder will try to create a response given an input. Usually, encoder-decoders are further trained by a process called &amp;#039;&amp;#039;&amp;#039;Reinforcement Learning from Human Feedback&amp;#039;&amp;#039;&amp;#039; (RLHF), in which human feedback is given in an iterative process until the underlying language weights prefer the trained preferences.&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt; &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;====== To consider in regard to LLMs: ======&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt; &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;* They are stochastic word generator machines, and contain structure, rather than problem solving logic.&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;* They are typically computationally expensive to run.&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;* They are typically monstrously computationally expensive to train.&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;* If equipped with enough memory for self-attention, they become a Turing Machine with Type-0 Grammar. &lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;* They may occasionally generate incorrect information.&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;* They &amp;#039;&amp;#039;&amp;#039;hallucinate&amp;#039;&amp;#039;&amp;#039;, generating nonsense (verbal noise) when confronted with unexpected text not encountered in training. &lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;* Because the attention of a model is limited, LLMs perform badly with large multi-step processes which require dense context.&lt;/ins&gt;&amp;lt;/div&amp;gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;== Software Toolkit ==&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;== Software Toolkit ==&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l328&quot;&gt;Line 328:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 346:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* rdrr.io - meta-manual lookup and many other tools for R  &lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* rdrr.io - meta-manual lookup and many other tools for R  &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* swirlstats.com - learn R, in R  &lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* swirlstats.com - learn R, in R  &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;statslearning&lt;/del&gt;.com - statistical machine learning coursework&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;statlearning&lt;/ins&gt;.com - statistical machine learning coursework&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;=== Recommended Reading ===&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;=== Recommended Reading ===&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;

&lt;!-- diff cache key wiki-wiki-:diff::1.12:old-144:rev-151 --&gt;
&lt;/table&gt;</summary>
		<author><name>Sysadmin</name></author>
	</entry>
	<entry>
		<id>https://wiki.nanobiodata.org/index.php?title=Statistical_Machine_Learning&amp;diff=144&amp;oldid=prev</id>
		<title>Sysadmin: /* GNU Parallel */</title>
		<link rel="alternate" type="text/html" href="https://wiki.nanobiodata.org/index.php?title=Statistical_Machine_Learning&amp;diff=144&amp;oldid=prev"/>
		<updated>2022-11-08T19:16:12Z</updated>

		<summary type="html">&lt;p&gt;&lt;span dir=&quot;auto&quot;&gt;&lt;span class=&quot;autocomment&quot;&gt;GNU Parallel&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 19:16, 8 November 2022&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l280&quot;&gt;Line 280:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 280:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;simple vectorization of loops over processors for non-multithreaded processes  &lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;simple vectorization of loops over processors for non-multithreaded processes  &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&amp;#039;&amp;#039;ex&amp;#039;&amp;#039;: &amp;lt;code&amp;gt;parallel -j $NUM_PROC /path/to/thescript.sh ::: &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&amp;lt;/code&amp;gt;&amp;lt;span&amp;gt;&amp;lt;code&amp;gt;&lt;/del&gt;1..n&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;code&amp;gt; &lt;/del&gt;::: &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&amp;lt;/code&amp;gt;&amp;lt;span&amp;gt;&amp;lt;code&amp;gt;&lt;/del&gt;1..m&amp;lt;/code&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&amp;gt;&amp;lt;/span&lt;/del&gt;&amp;gt;&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&amp;#039;&amp;#039;ex&amp;#039;&amp;#039;: &amp;lt;code&amp;gt;parallel -j $NUM_PROC /path/to/thescript.sh ::: 1..n ::: 1..m&amp;lt;/code&amp;gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;=== CUDA/OpenCL ===&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;=== CUDA/OpenCL ===&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;

&lt;!-- diff cache key wiki-wiki-:diff::1.12:old-143:rev-144 --&gt;
&lt;/table&gt;</summary>
		<author><name>Sysadmin</name></author>
	</entry>
	<entry>
		<id>https://wiki.nanobiodata.org/index.php?title=Statistical_Machine_Learning&amp;diff=143&amp;oldid=prev</id>
		<title>Sysadmin at 21:08, 3 November 2022</title>
		<link rel="alternate" type="text/html" href="https://wiki.nanobiodata.org/index.php?title=Statistical_Machine_Learning&amp;diff=143&amp;oldid=prev"/>
		<updated>2022-11-03T21:08:20Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 21:08, 3 November 2022&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l9&quot;&gt;Line 9:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 9:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* Machine Learning  &lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* Machine Learning  &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* ML Techniques  &lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* ML Techniques  &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;SoftwareToolkit &lt;/del&gt;&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;Software Toolkit &lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* Continued Learning&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* Continued Learning&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l42&quot;&gt;Line 42:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 42:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;So, our combined dataset consists of &amp;lt;math&amp;gt;[(\vec{x}_1,\vec{y}_1),(\vec{x}_2, \vec{y}_2),...,(\vec{x}_n,\vec{y}_n)]&amp;lt;/math&amp;gt;&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;So, our combined dataset consists of &amp;lt;math&amp;gt;[(\vec{x}_1,\vec{y}_1),(\vec{x}_2, \vec{y}_2),...,(\vec{x}_n,\vec{y}_n)]&amp;lt;/math&amp;gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&amp;#039;&amp;#039;&amp;#039;Our Mission&amp;#039;&amp;#039;&amp;#039;: determine relationships between &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;\textbf{X}&amp;lt;/math&amp;gt; and &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;\textbf{Y}&amp;lt;/math&amp;gt; which are mathematically sound, leading to better &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;understandin&lt;/del&gt;&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&amp;#039;&amp;#039;&amp;#039;Our Mission&amp;#039;&amp;#039;&amp;#039;: determine relationships between &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;\textbf{X}&amp;lt;/math&amp;gt; and &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;\textbf{Y}&amp;lt;/math&amp;gt; which are mathematically sound, leading to better &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;understanding&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Typically a table has columns as features, rows as entries&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Typically a table has columns as features, rows as entries&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l202&quot;&gt;Line 202:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 202:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Generates the inner product space of two arbitrary-dimensional numeric matrix spaces, showing the shape of data&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Generates the inner product space of two arbitrary-dimensional numeric matrix spaces, showing the shape of data&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;=== &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&amp;#039;&amp;#039;&amp;#039;&lt;/del&gt;Genetic Algorithms&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&amp;#039;&amp;#039;&amp;#039; &lt;/del&gt;===&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;=== Genetic Algorithms ===&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Utilize some adversarial scoring method of initially randomized vectors:&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Utilize some adversarial scoring method of initially randomized vectors:&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l230&quot;&gt;Line 230:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 230:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;pip libraries &amp;lt;code&amp;gt;$ pip list outdated format=freeze | grep -v | cut -d=&amp;quot; &amp;quot; -f1 | xargs -n1 pip install -U&amp;lt;/code&amp;gt;&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;pip libraries &amp;lt;code&amp;gt;$ pip list outdated format=freeze | grep -v | cut -d=&amp;quot; &amp;quot; -f1 | xargs -n1 pip install -U&amp;lt;/code&amp;gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==== &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&amp;#039;&amp;#039;&amp;#039;&lt;/del&gt;Anaconda&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&amp;#039;&amp;#039;&amp;#039; &lt;/del&gt;====&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==== Anaconda ====&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;separate virtualenv system specifically for data science:&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;separate virtualenv system specifically for data science:&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l293&quot;&gt;Line 293:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 293:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;=== High Performance Computers ===&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;=== High Performance Computers ===&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;HPC or &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;supercomputing &lt;/del&gt;clusters provide high throughput analysis.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;HPC or &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;super-computing &lt;/ins&gt;clusters provide high throughput analysis.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Amazingly high amount of computational power.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Amazingly high amount of computational power.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l328&quot;&gt;Line 328:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 328:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* rdrr.io - meta-manual lookup and many other tools for R  &lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* rdrr.io - meta-manual lookup and many other tools for R  &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* swirlstats.com - learn R, in R  &lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* swirlstats.com - learn R, in R  &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;statlearning&lt;/del&gt;.com - statistical machine learning coursework&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;statslearning&lt;/ins&gt;.com - statistical machine learning coursework&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;=== Recommended Reading ===&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;=== Recommended Reading ===&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;

&lt;!-- diff cache key wiki-wiki-:diff::1.12:old-142:rev-143 --&gt;
&lt;/table&gt;</summary>
		<author><name>Sysadmin</name></author>
	</entry>
	<entry>
		<id>https://wiki.nanobiodata.org/index.php?title=Statistical_Machine_Learning&amp;diff=142&amp;oldid=prev</id>
		<title>Sysadmin at 21:04, 1 November 2022</title>
		<link rel="alternate" type="text/html" href="https://wiki.nanobiodata.org/index.php?title=Statistical_Machine_Learning&amp;diff=142&amp;oldid=prev"/>
		<updated>2022-11-01T21:04:05Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 21:04, 1 November 2022&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l328&quot;&gt;Line 328:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 328:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* rdrr.io - meta-manual lookup and many other tools for R  &lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* rdrr.io - meta-manual lookup and many other tools for R  &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* swirlstats.com - learn R, in R  &lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* swirlstats.com - learn R, in R  &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;statslearning&lt;/del&gt;.com - statistical machine learning coursework&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;statlearning&lt;/ins&gt;.com - statistical machine learning coursework&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;=== Recommended Reading ===&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;=== Recommended Reading ===&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;

&lt;!-- diff cache key wiki-wiki-:diff::1.12:old-141:rev-142 --&gt;
&lt;/table&gt;</summary>
		<author><name>Sysadmin</name></author>
	</entry>
	<entry>
		<id>https://wiki.nanobiodata.org/index.php?title=Statistical_Machine_Learning&amp;diff=141&amp;oldid=prev</id>
		<title>Sysadmin at 20:41, 21 October 2022</title>
		<link rel="alternate" type="text/html" href="https://wiki.nanobiodata.org/index.php?title=Statistical_Machine_Learning&amp;diff=141&amp;oldid=prev"/>
		<updated>2022-10-21T20:41:25Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 20:41, 21 October 2022&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l124&quot;&gt;Line 124:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 124:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* &amp;#039;&amp;#039;&amp;#039;Data Driven Models&amp;#039;&amp;#039;&amp;#039;  &lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* &amp;#039;&amp;#039;&amp;#039;Data Driven Models&amp;#039;&amp;#039;&amp;#039;  &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;** Use ML to provide &amp;#039;&amp;#039;&amp;#039;iterative&amp;#039;&amp;#039;&amp;#039; gain to reduce error  &lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;** Use ML to provide &amp;#039;&amp;#039;&amp;#039;iterative&amp;#039;&amp;#039;&amp;#039; gain to reduce error  &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;*** Known &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;data {\textstyle \rightarrow } model creation {\textstyle \rightarrow } point &lt;/del&gt;toward new factors.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;*** Known &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;data→model creation→point &lt;/ins&gt;toward new factors.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;** Uses a split in training and testing data, or sum of error to move toward the correct answer.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;** Uses a split in training and testing data, or sum of error to move toward the correct answer.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;** Human researchers more free to find more data, improve prediction, develop theories&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;** Human researchers more free to find more data, improve prediction, develop theories&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;

&lt;!-- diff cache key wiki-wiki-:diff::1.12:old-140:rev-141 --&gt;
&lt;/table&gt;</summary>
		<author><name>Sysadmin</name></author>
	</entry>
	<entry>
		<id>https://wiki.nanobiodata.org/index.php?title=Statistical_Machine_Learning&amp;diff=140&amp;oldid=prev</id>
		<title>Sysadmin at 20:39, 21 October 2022</title>
		<link rel="alternate" type="text/html" href="https://wiki.nanobiodata.org/index.php?title=Statistical_Machine_Learning&amp;diff=140&amp;oldid=prev"/>
		<updated>2022-10-21T20:39:41Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 20:39, 21 October 2022&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l104&quot;&gt;Line 104:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 104:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* A polynomial: &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;Y = \beta_0 + \beta_1 X + \beta_2 X + ... + \varepsilon&amp;lt;/math&amp;gt;  &lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* A polynomial: &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;Y = \beta_0 + \beta_1 X + \beta_2 X + ... + \varepsilon&amp;lt;/math&amp;gt;  &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* A natural function: &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;Y = e^{-\alpha_0 X^{\alpha_1}} + \varepsilon&amp;lt;/math&amp;gt;  &lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* A natural function: &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;Y = e^{-\alpha_0 X^{\alpha_1}} + \varepsilon&amp;lt;/math&amp;gt;  &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* A logistical function: &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;p = \frac{e^{\beta_0 + \beta_1 X}}{1 + e^{\beta_0 + \beta_1 X}}&amp;lt;/math&amp;gt;  &lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* A logistical function: &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;p = \frac{ e^{\beta_0 + \beta_1 X} }{1 + e^{\beta_0 + \beta_1 X} }&amp;lt;/math&amp;gt;  &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* A series of nested &amp;lt;code&amp;gt;if&amp;lt;/code&amp;gt; statements  &lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* A series of nested &amp;lt;code&amp;gt;if&amp;lt;/code&amp;gt; statements  &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* A series of differential equations: &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;x&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;^\prime &lt;/del&gt;= x_n - \bar{x}_n: y&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;^\prime &lt;/del&gt;= y_n - \beta_n &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;({&lt;/del&gt;x_n&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;}^\prime).&lt;/del&gt;&amp;lt;/math&amp;gt;&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* A series of differential equations: &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;x&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&amp;#039; &lt;/ins&gt;= x_n - \bar{x}_n: y&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&amp;#039; &lt;/ins&gt;= y_n - \beta_n x_n&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&amp;#039;&lt;/ins&gt;&amp;lt;/math&amp;gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==== Why Machine Learning? ====&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==== Why Machine Learning? ====&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l124&quot;&gt;Line 124:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 124:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* &amp;#039;&amp;#039;&amp;#039;Data Driven Models&amp;#039;&amp;#039;&amp;#039;  &lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* &amp;#039;&amp;#039;&amp;#039;Data Driven Models&amp;#039;&amp;#039;&amp;#039;  &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;** Use ML to provide &amp;#039;&amp;#039;&amp;#039;iterative&amp;#039;&amp;#039;&amp;#039; gain to reduce error  &lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;** Use ML to provide &amp;#039;&amp;#039;&amp;#039;iterative&amp;#039;&amp;#039;&amp;#039; gain to reduce error  &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;*** Known data &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;&lt;/del&gt;\rightarrow&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&amp;lt;/math&amp;gt; &lt;/del&gt;model creation &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;&lt;/del&gt;\rightarrow&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&amp;lt;/math&amp;gt; &lt;/del&gt;point toward new factors.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;*** Known data &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;{\textstyle &lt;/ins&gt;\rightarrow &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;} &lt;/ins&gt;model creation &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;{\textstyle &lt;/ins&gt;\rightarrow &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;} &lt;/ins&gt;point toward new factors.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;** Uses a split in training and testing data, or sum of error to move toward the correct answer.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;** Uses a split in training and testing data, or sum of error to move toward the correct answer.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;** Human researchers more free to find more data, improve prediction, develop theories&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;** Human researchers more free to find more data, improve prediction, develop theories&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;

&lt;!-- diff cache key wiki-wiki-:diff::1.12:old-138:rev-140 --&gt;
&lt;/table&gt;</summary>
		<author><name>Sysadmin</name></author>
	</entry>
	<entry>
		<id>https://wiki.nanobiodata.org/index.php?title=Statistical_Machine_Learning&amp;diff=138&amp;oldid=prev</id>
		<title>Sysadmin: created and edited page</title>
		<link rel="alternate" type="text/html" href="https://wiki.nanobiodata.org/index.php?title=Statistical_Machine_Learning&amp;diff=138&amp;oldid=prev"/>
		<updated>2022-10-21T20:23:33Z</updated>

		<summary type="html">&lt;p&gt;created and edited page&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;=== &amp;lt;span&amp;gt;Overview&amp;lt;/span&amp;gt; ===&lt;br /&gt;
&amp;lt;div class=&amp;quot;outline&amp;quot;&amp;gt;&lt;br /&gt;
A crash course to enable gentle introduction into the machine learning techniques and its applications into data science. &lt;br /&gt;
&lt;br /&gt;
Topics Covered: &lt;br /&gt;
&lt;br /&gt;
* Basic Concepts &lt;br /&gt;
* Data Models &lt;br /&gt;
* Machine Learning &lt;br /&gt;
* ML Techniques &lt;br /&gt;
* SoftwareToolkit &lt;br /&gt;
* Continued Learning&lt;br /&gt;
&lt;br /&gt;
== &amp;lt;span&amp;gt;Basic Concepts&amp;lt;/span&amp;gt; ==&lt;br /&gt;
&amp;lt;div class=&amp;quot;outline&amp;quot;&amp;gt;&lt;br /&gt;
=== Definitions: ===&lt;br /&gt;
&lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Statistical Machine Learning&amp;#039;&amp;#039;&amp;#039; is a set of tools used to model and understand complex data sets &lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Data Science&amp;#039;&amp;#039;&amp;#039; is a set of techniques in computing to support the analysis of data &lt;br /&gt;
** Not very useful without some domain knowledge: it is important to &amp;#039;&amp;#039;know your data&amp;#039;&amp;#039;. &lt;br /&gt;
* Includes analytic techniques: &lt;br /&gt;
** descriptive statistics &lt;br /&gt;
** data visualization &lt;br /&gt;
** statistical machine learning &lt;br /&gt;
** neural networks&lt;br /&gt;
** actor-environment models &lt;br /&gt;
* Also includes computational techniques: &lt;br /&gt;
** database administration &lt;br /&gt;
** management of information systems &lt;br /&gt;
** parallelization &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;\rightarrow&amp;lt;/math&amp;gt; high performance computing&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Basic Concepts ===&lt;br /&gt;
&amp;lt;div class=&amp;quot;outline&amp;quot;&amp;gt;&lt;br /&gt;
==== Knowing your data ====&lt;br /&gt;
Technical definition: &lt;br /&gt;
&lt;br /&gt;
Let &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;n&amp;lt;/math&amp;gt; represent a number of distinct &amp;#039;&amp;#039;&amp;#039;observations&amp;#039;&amp;#039;&amp;#039;, and let &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;p&amp;lt;/math&amp;gt; represent a number of &amp;#039;&amp;#039;&amp;#039;predictors&amp;#039;&amp;#039;&amp;#039; Then, our observed data &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;\textbf{X}&amp;lt;/math&amp;gt; is an &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;n\times p&amp;lt;/math&amp;gt; matrix with row observation vectors &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;\vec{x}_{1..n}&amp;lt;/math&amp;gt; and column predictor vectors &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;\vec{x}_{1..p}&amp;lt;/math&amp;gt;. &lt;br /&gt;
&lt;br /&gt;
In addition, we will also have &amp;#039;&amp;#039;&amp;#039;response&amp;#039;&amp;#039;&amp;#039; variable(s) &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;\textbf{Y}&amp;lt;/math&amp;gt;, which is a made up of some &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;n&amp;lt;/math&amp;gt;-length vectors. &lt;br /&gt;
&lt;br /&gt;
So, our combined dataset consists of &amp;lt;math&amp;gt;[(\vec{x}_1,\vec{y}_1),(\vec{x}_2, \vec{y}_2),...,(\vec{x}_n,\vec{y}_n)]&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Our Mission&amp;#039;&amp;#039;&amp;#039;: determine relationships between &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;\textbf{X}&amp;lt;/math&amp;gt; and &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;\textbf{Y}&amp;lt;/math&amp;gt; which are mathematically sound, leading to better understandin&lt;br /&gt;
&lt;br /&gt;
Typically a table has columns as features, rows as entries&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Entries&amp;#039;&amp;#039;&amp;#039; might be &amp;#039;&amp;#039;&amp;#039;numeric&amp;#039;&amp;#039;&amp;#039; or &amp;#039;&amp;#039;&amp;#039;categorical&amp;#039;&amp;#039;&amp;#039;. &lt;br /&gt;
&lt;br /&gt;
Data sources are either &amp;#039;&amp;#039;&amp;#039;Structured&amp;#039;&amp;#039;&amp;#039; or &amp;#039;&amp;#039;&amp;#039;Unstructured&amp;#039;&amp;#039;&amp;#039;: &lt;br /&gt;
&lt;br /&gt;
* Unstructured data will require some transformation. &lt;br /&gt;
&lt;br /&gt;
Some data may also be &amp;#039;&amp;#039;&amp;#039;time series&amp;#039;&amp;#039;&amp;#039; taking a sampling of points over time, contributing to a 3-dimensional &amp;#039;&amp;#039;&amp;#039;Data Cub&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
Several techniques can be used to reduce complex data: &lt;br /&gt;
&lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;numeric representation&amp;#039;&amp;#039;&amp;#039; mapping of categorical information into numbers. &lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;scaling&amp;#039;&amp;#039;&amp;#039; redefine a new range for a predictor vector. &lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;normalization&amp;#039;&amp;#039;&amp;#039; redefine a predictor by its mean and standard deviation, giving a normal distribution of values. &lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;dimension reduction&amp;#039;&amp;#039;&amp;#039; lose fine grain of data, but gain understandability. &lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;feature extraction&amp;#039;&amp;#039;&amp;#039; a data mining technique in which we can generate new predictors from known information&lt;br /&gt;
&lt;br /&gt;
==== Modeling ====&lt;br /&gt;
What is a model?&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Very well-known model&amp;#039;&amp;#039;&amp;#039;: Gravity is a functional model between masses, distances, and force. &amp;lt;math display=&amp;quot;block&amp;quot;&amp;gt;F = G\frac{m_1m_2}{r^2} \rightarrow g = \frac{G M}{r^2} \rightarrow v(t) = v(0) - gt.&amp;lt;/math&amp;gt; &amp;#039;&amp;#039;&amp;#039;&amp;lt;math display=&amp;quot;block&amp;quot;&amp;gt;k = Ae^{\frac{E_a}{K_b T}}.&amp;lt;/math&amp;gt;Statistics definition&amp;#039;&amp;#039;&amp;#039;: &lt;br /&gt;
&lt;br /&gt;
Let &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;X = (\vec{x}_1, \vec{x}_2, ..., \vec{x}_p)&amp;lt;/math&amp;gt; each of length &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;n&amp;lt;/math&amp;gt;, and &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;Y = (\vec{y})&amp;lt;/math&amp;gt; of length &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;n&amp;lt;/math&amp;gt;, then for &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;X&amp;lt;/math&amp;gt; and &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;Y&amp;lt;/math&amp;gt;, there exists a function with a &amp;#039;&amp;#039;&amp;#039;systematic&amp;#039;&amp;#039;&amp;#039; &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;f&amp;lt;/math&amp;gt; and &amp;#039;&amp;#039;&amp;#039;error term&amp;#039;&amp;#039;&amp;#039; &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;\varepsilon&amp;lt;/math&amp;gt;: &amp;lt;math display=&amp;quot;block&amp;quot;&amp;gt;Y = f(X) + \varepsilon&amp;lt;/math&amp;gt;Why do we even estimate &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;f&amp;lt;/math&amp;gt; at all? &amp;#039;&amp;#039;&amp;#039;Prediction&amp;#039;&amp;#039;&amp;#039; or &amp;#039;&amp;#039;&amp;#039;Inference&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Predictive models&amp;#039;&amp;#039;&amp;#039; create an estimator &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;\hat{f}&amp;lt;/math&amp;gt; which we can use to estimate &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;Y&amp;lt;/math&amp;gt; using a sample &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;X&amp;lt;/math&amp;gt; from a larger population: &amp;lt;math display=&amp;quot;block&amp;quot;&amp;gt;\hat{ Y} = \hat{f}(X)&amp;lt;/math&amp;gt; With error: &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;E(Y - \hat{Y})^2 = [f(X) - \hat{f}(X)]^2 + \varepsilon&amp;lt;/math&amp;gt;. &lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Inference models&amp;#039;&amp;#039;&amp;#039; are primarily interested in how &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;Y&amp;lt;/math&amp;gt; is affected by &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;X&amp;lt;/math&amp;gt;: &lt;br /&gt;
&lt;br /&gt;
* What predictors associated with response? &lt;br /&gt;
* What is the relationship of predictors to response? &lt;br /&gt;
* What is the overall nature of relationship between &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;Y&amp;lt;/math&amp;gt; and the predictors.&lt;br /&gt;
&lt;br /&gt;
==== Signal vs Noise ====&lt;br /&gt;
Consider &amp;#039;&amp;#039;&amp;#039;precision&amp;#039;&amp;#039;&amp;#039; and &amp;#039;&amp;#039;&amp;#039;accuracy&amp;#039;&amp;#039;&amp;#039;. &lt;br /&gt;
&lt;br /&gt;
* Both contribute into data set &lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;noise:&amp;#039;&amp;#039;&amp;#039; variation in data which detracts from constructing &amp;#039;&amp;#039;&amp;#039;information&amp;#039;&amp;#039;&amp;#039;, as opposed to &amp;#039;&amp;#039;&amp;#039;signal&amp;#039;&amp;#039;&amp;#039;–data which is representative of a system under study and contains information. &lt;br /&gt;
&lt;br /&gt;
High signal to noise allows us to minimize &amp;#039;&amp;#039;&amp;#039;reducible error&amp;#039;&amp;#039;&amp;#039;, caused by sampling technique. &lt;br /&gt;
&lt;br /&gt;
Different than &amp;#039;&amp;#039;&amp;#039;irreducible error&amp;#039;&amp;#039;&amp;#039;, created by factors we are not measuring.&lt;br /&gt;
&lt;br /&gt;
==== Error and Fit ====&lt;br /&gt;
In Modeling In the terms of modeling, precision of a model is referred to &amp;#039;&amp;#039;&amp;#039;variance&amp;#039;&amp;#039;&amp;#039; and the accuracy of a model its degree of &amp;#039;&amp;#039;&amp;#039;bias&amp;#039;&amp;#039;&amp;#039;. &lt;br /&gt;
&lt;br /&gt;
Generally, overly complex models generate high variance, and can &amp;#039;&amp;#039;&amp;#039;over-fit&amp;#039;&amp;#039;&amp;#039; to input data, making the model useless to new data.&lt;br /&gt;
&lt;br /&gt;
Generally, &amp;#039;&amp;#039;&amp;#039;Mean Square Error&amp;#039;&amp;#039;&amp;#039; or MSE, used to determine goodness-of-fit for model calibration: &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;MSE = \frac{1}{n}\sum^n_{i=1}(y_i - \hat{f}(x_i))^2&amp;lt;/math&amp;gt;. &lt;br /&gt;
&lt;br /&gt;
Error rate used in classification: &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;\frac{1}{n} \sum^n_{i=1} I(y_i \ne \hat{y}_i)&amp;lt;/math&amp;gt;. &lt;br /&gt;
&lt;br /&gt;
However for reporting, &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;R^2&amp;lt;/math&amp;gt; statistic is more often used, because it gives a value between 0 and 1 useful to determine how much of variance in &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;Y&amp;lt;/math&amp;gt; is explained by variance in &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;X&amp;lt;/math&amp;gt;: &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;R^2 = 1 - \frac{\sum(y_i - \hat{y}_i)^2}{\sum(y_i - \bar{y})^2} = \frac{RSS}{TSS}&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Example functions relating &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;X&amp;lt;/math&amp;gt; and &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;Y&amp;lt;/math&amp;gt;:&lt;br /&gt;
&lt;br /&gt;
* A linear function: &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;Y = \beta_0 + \beta_1 X + \varepsilon&amp;lt;/math&amp;gt; &lt;br /&gt;
* A polynomial: &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;Y = \beta_0 + \beta_1 X + \beta_2 X + ... + \varepsilon&amp;lt;/math&amp;gt; &lt;br /&gt;
* A natural function: &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;Y = e^{-\alpha_0 X^{\alpha_1}} + \varepsilon&amp;lt;/math&amp;gt; &lt;br /&gt;
* A logistical function: &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;p = \frac{e^{\beta_0 + \beta_1 X}}{1 + e^{\beta_0 + \beta_1 X}}&amp;lt;/math&amp;gt; &lt;br /&gt;
* A series of nested &amp;lt;code&amp;gt;if&amp;lt;/code&amp;gt; statements &lt;br /&gt;
* A series of differential equations: &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;x^\prime = x_n - \bar{x}_n: y^\prime = y_n - \beta_n ({x_n}^\prime).&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Why Machine Learning? ====&lt;br /&gt;
Types of Questions: &lt;br /&gt;
&lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Exact Solution is known&amp;#039;&amp;#039;&amp;#039; normal coding problems, linear models, and classical statistics. &lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Exact Solution is unknown, but can be extracted with work&amp;#039;&amp;#039;&amp;#039; work with systems experts and domain knowledge to create code. &lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Exact Solution is known, but not yet conveyable&amp;#039;&amp;#039;&amp;#039; ML is useful. &lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Exact Solution not known by humans&amp;#039;&amp;#039;&amp;#039; ML and/or Deep Learning needed&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;Example:&amp;#039;&amp;#039; consider a prediction of temperature: &lt;br /&gt;
&lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Knowledge Based Models&amp;#039;&amp;#039;&amp;#039; &lt;br /&gt;
** Physics and atmospheric science based model. &lt;br /&gt;
** Up to differential equations on chaotic systems. &lt;br /&gt;
** As fine granularity of prediction increases, number of factors and density of data quickly becomes too much for most humans to consider &lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Data Driven Models&amp;#039;&amp;#039;&amp;#039; &lt;br /&gt;
** Use ML to provide &amp;#039;&amp;#039;&amp;#039;iterative&amp;#039;&amp;#039;&amp;#039; gain to reduce error &lt;br /&gt;
*** Known data &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;\rightarrow&amp;lt;/math&amp;gt; model creation &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;\rightarrow&amp;lt;/math&amp;gt; point toward new factors.&lt;br /&gt;
** Uses a split in training and testing data, or sum of error to move toward the correct answer.&lt;br /&gt;
** Human researchers more free to find more data, improve prediction, develop theories&lt;br /&gt;
&lt;br /&gt;
== &amp;lt;span&amp;gt;Techniques&amp;lt;/span&amp;gt; ==&lt;br /&gt;
&lt;br /&gt;
=== Machine Learning in General ===&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Supervised Learning:&amp;#039;&amp;#039;&amp;#039; the model estimates, error verifies &lt;br /&gt;
&lt;br /&gt;
* If incorrect, needs user input for correction.&lt;br /&gt;
* &amp;#039;&amp;#039;Example&amp;#039;&amp;#039;: a computer vision system trained to find features in images via user annotated images. &lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Unsupervised Learning:&amp;#039;&amp;#039;&amp;#039; Clustering/Grouping of similar items&lt;br /&gt;
&lt;br /&gt;
* Need a similarity measure via feature vectors and ability to adjust weights &lt;br /&gt;
* &amp;#039;&amp;#039;Example&amp;#039;&amp;#039;: Taste prediction algorithms used in web advertising. &lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Reinforced Learning&amp;#039;&amp;#039;&amp;#039; Model estimates a sequence of guesses &lt;br /&gt;
&lt;br /&gt;
* Correct if and only if the entire sequence or a parameterized output scoring&lt;br /&gt;
* Instant feedback but high compute cost &lt;br /&gt;
* &amp;#039;&amp;#039;Example&amp;#039;&amp;#039;: Game-play in actor-environment model&lt;br /&gt;
&lt;br /&gt;
=== Types of Problems and Output ===&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Numeric&amp;#039;&amp;#039;&amp;#039; function maps &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;X&amp;lt;/math&amp;gt; to &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;Y&amp;lt;/math&amp;gt; and output is in &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;\mathbb{R}&amp;lt;/math&amp;gt;. &lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Categorization&amp;#039;&amp;#039;&amp;#039; non-orderable sorting &lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Clustering&amp;#039;&amp;#039;&amp;#039; finding principle ways groups differ &lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Anomaly Detection&amp;#039;&amp;#039;&amp;#039; finding data points which are out of the ordinary &lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Actor Models&amp;#039;&amp;#039;&amp;#039; real-time decision making or detailed simulation&lt;br /&gt;
&lt;br /&gt;
=== Predictive Models ===&lt;br /&gt;
Predictive ML models which are also Linear: &lt;br /&gt;
&lt;br /&gt;
* Utilize a split of training and test data: test-training or &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;k&amp;lt;/math&amp;gt;-fold cross-validation &lt;br /&gt;
* use a function mapping of one or more independent variables to the dependent variables, then re-evaluate to reduce error Includes techniques for mixed model reduction&lt;br /&gt;
* Reduction of the number of predictors via &amp;#039;&amp;#039;&amp;#039;Lasso&amp;#039;&amp;#039;&amp;#039;, &amp;#039;&amp;#039;&amp;#039;Ridge&amp;#039;&amp;#039;&amp;#039;, and &amp;#039;&amp;#039;&amp;#039;Elastic Net&amp;#039;&amp;#039;&amp;#039; techniques&lt;br /&gt;
&lt;br /&gt;
=== Feature-vector based models ===&lt;br /&gt;
Nested &amp;lt;code&amp;gt;if&amp;lt;/code&amp;gt; statements try to find decision boundaries by distance between independent data and dependent outcome features have weighted probability most information by Bayesian inference&lt;br /&gt;
&lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Decision Trees&amp;#039;&amp;#039;&amp;#039; can be used to create decision models for linearly separable data effectively a neural network with one neuron&lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Random Forest&amp;#039;&amp;#039;&amp;#039; utilizes a number of differently-tuned trees trees provide consensus voting-based approach for non-linearly separable data smaller tree depth typically prevents over-fit&lt;br /&gt;
&lt;br /&gt;
=== Clustering ===&lt;br /&gt;
Groups data into cluster such that distance within clusters is small, and between differing groups is large &lt;br /&gt;
&lt;br /&gt;
Works with any well-defined &amp;amp;quot;distance&amp;amp;quot; function: Euclidean, Hamming, Inner Product, etc. &lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;k-Means Clustering:&amp;#039;&amp;#039;&amp;#039; &lt;br /&gt;
&lt;br /&gt;
* choose &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;k&amp;lt;/math&amp;gt; number of clusters randomly distribute &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;k&amp;lt;/math&amp;gt; points, &amp;#039;&amp;#039;&amp;#039;centroids&amp;#039;&amp;#039;&amp;#039;, into feature space &lt;br /&gt;
* divide and classify data by distance to &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;k&amp;lt;/math&amp;gt; centroids &lt;br /&gt;
* move centroids based on center of groups repeats until convergence to some epsilon value&lt;br /&gt;
* where points no longer move across iterations &lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Goodness of Fit&amp;#039;&amp;#039;&amp;#039; for &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;N-M&amp;lt;/math&amp;gt; possible values of &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;k&amp;lt;/math&amp;gt;, an inflection in overall likelihood ratio given by probability function for set&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;K-nearest Neighbors:&amp;#039;&amp;#039;&amp;#039; &lt;br /&gt;
&lt;br /&gt;
Creates a probabilistic decision boundary within a feature space between &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;K&amp;lt;/math&amp;gt; centroids&lt;br /&gt;
&lt;br /&gt;
Unsupervised system to find structures of data works on majority voting system&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Gradient Boosting:&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
System attempts to find the direction and vector of change in a dimensional field, and follow these iteratively to find local extrema. &lt;br /&gt;
&lt;br /&gt;
Most use some &amp;#039;&amp;#039;&amp;#039;Quasi-Newton Method&amp;#039;&amp;#039;&amp;#039; for finding extrema for faster centroid convergence. &lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Support Vector Machine:&amp;#039;&amp;#039;&amp;#039; &lt;br /&gt;
&lt;br /&gt;
Also known as SVM, applies a classifier into high-dimensional data to split points into groups some use a &amp;#039;&amp;#039;&amp;#039;kernel trick.&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
Generates the inner product space of two arbitrary-dimensional numeric matrix spaces, showing the shape of data&lt;br /&gt;
&lt;br /&gt;
=== &amp;#039;&amp;#039;&amp;#039;Genetic Algorithms&amp;#039;&amp;#039;&amp;#039; ===&lt;br /&gt;
Utilize some adversarial scoring method of initially randomized vectors:&lt;br /&gt;
&lt;br /&gt;
* ’survivors’ become the basis of new models similar iterative concept to gradient methods.&lt;br /&gt;
* Does not have to understand topology of space requires creator to specify scoring for the machine .&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Convolutional Neural Networks:&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
Utilize iterative scoring between training and testing, along with &amp;#039;&amp;#039;&amp;#039;gradient descent&amp;#039;&amp;#039;&amp;#039; on a number of layered, weighted vectors to extract features from a complex data set. &lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Vision Systems&amp;#039;&amp;#039;&amp;#039; &lt;br /&gt;
&lt;br /&gt;
Take high throughput data and simplify before work is done &amp;#039;&amp;#039;ex&amp;#039;&amp;#039;: 1080p @ 60fps &amp;lt;math display=&amp;quot;inline&amp;quot;&amp;gt;\rightarrow&amp;lt;/math&amp;gt; 240x360 @ 15fps, broken into component channels Gaussian blur filter kernel applied to high-res images Edge detection via double threshold ML or CNN used past this to determine actual features&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Software Toolkit ==&lt;br /&gt;
&amp;lt;div class=&amp;quot;outline&amp;quot;&amp;gt;&lt;br /&gt;
=== Python ===&lt;br /&gt;
General purpose programming language with many libraries Interpreted language: each line is run one at a time by a virtual machine.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Dependency Structure&amp;#039;&amp;#039;&amp;#039; &lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;system&amp;#039;&amp;#039; vs &amp;#039;&amp;#039;user&amp;#039;&amp;#039; python &lt;br /&gt;
&lt;br /&gt;
virtualenv: &amp;lt;code&amp;gt;$ python3 -m venv /path/to/new/environment&amp;lt;/code&amp;gt; &lt;br /&gt;
&lt;br /&gt;
pip libraries &amp;lt;code&amp;gt;$ pip list outdated format=freeze | grep -v | cut -d=&amp;quot; &amp;quot; -f1 | xargs -n1 pip install -U&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== &amp;#039;&amp;#039;&amp;#039;Anaconda&amp;#039;&amp;#039;&amp;#039; ====&lt;br /&gt;
separate virtualenv system specifically for data science:&lt;br /&gt;
&lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Spyder&amp;#039;&amp;#039;&amp;#039; IDE with visual output &lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;JupyterLabs&amp;#039;&amp;#039;&amp;#039; notes with data visualizations &lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Orange&amp;#039;&amp;#039;&amp;#039; visual IDE for stats exploration&lt;br /&gt;
* &amp;lt;code&amp;gt;pandas, NumPy, and SciPy&amp;lt;/code&amp;gt; libraries for data serialization and numerical work in python &lt;br /&gt;
* &amp;lt;code&amp;gt;matplotlib&amp;lt;/code&amp;gt; for data visualization &lt;br /&gt;
* &amp;lt;code&amp;gt;scikit-learn&amp;lt;/code&amp;gt; main library for ML &lt;br /&gt;
* &amp;lt;code&amp;gt;DASK&amp;lt;/code&amp;gt; distributed abstraction layer with &amp;lt;code&amp;gt;pandas&amp;lt;/code&amp;gt; grammar to easily distribute python tasks into 1-1000 compute nodes &lt;br /&gt;
* &amp;lt;code&amp;gt;PyTorch, and TensorFlow&amp;lt;/code&amp;gt; deep learning and CNN generation systems massive compute overhead to train models require data map reduction and or imputation to run well&amp;lt;/div&amp;gt;&amp;lt;div class=&amp;quot;outline&amp;quot;&amp;gt;&lt;br /&gt;
=== R language ===&lt;br /&gt;
Statistical programming language: interpreter invokes compiled C or FORTRAN.&lt;br /&gt;
&lt;br /&gt;
Also works within Jupyter notebook for instant visualization, if wanted.&lt;br /&gt;
&lt;br /&gt;
Open-source and extended by the Comprehensive R Archive Network (CRAN), which includes extensive documentation.&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;code&amp;gt;rmarkdown&amp;lt;/code&amp;gt; format a document from R with optional LaTeXbindings &lt;br /&gt;
* &amp;lt;code&amp;gt;tidyverse&amp;lt;/code&amp;gt; &lt;br /&gt;
** &amp;lt;code&amp;gt;dplyr&amp;lt;/code&amp;gt; grammar for mass data manipulation &lt;br /&gt;
** &amp;lt;code&amp;gt;ggplot2&amp;lt;/code&amp;gt; a library for creating graphs and visualizations &lt;br /&gt;
* &amp;lt;code&amp;gt;doparallel&amp;lt;/code&amp;gt; cost-free abstraction, pooling of CPU threads &lt;br /&gt;
* &amp;lt;code&amp;gt;mlr&amp;lt;/code&amp;gt; interface to a large number of classification and regression techniques &lt;br /&gt;
* &amp;lt;code&amp;gt;shiny&amp;lt;/code&amp;gt; provides ability to create web servers similar to NodeJS or Python Flaskl&lt;br /&gt;
&lt;br /&gt;
=== Intel MKL (Math Kernel Library) ===&lt;br /&gt;
Improves performance for Fast Fourier Transforms, linear algebra operations, vector math, deep neural networks, and kernel solvers. &lt;br /&gt;
&lt;br /&gt;
Default math backend for NumPy, SciPy, and MATLAB &lt;br /&gt;
&lt;br /&gt;
Not hardware agnostic: chooses slowest solvers for non-Intel chips by default &lt;br /&gt;
&lt;br /&gt;
=== OpenBLAS and LAPACK ===&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;LAPACK&amp;#039;&amp;#039;&amp;#039; (Linear Algebra PACKage) provides APIs much like MKL &lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;OpenBLAS&amp;#039;&amp;#039;&amp;#039; (Basic Linear Algebra Subprograms) extends LAPACK with optimizations for parallel computing &lt;br /&gt;
&lt;br /&gt;
Default for R and Biopython&lt;br /&gt;
&lt;br /&gt;
=== Message Passing Interface (MPI) ===&lt;br /&gt;
Supported by all major compilers (Intel and OpenMP implementations) &lt;br /&gt;
&lt;br /&gt;
An API supporting shared-memory multiprocessing provides backend for many parallel computing systems, allowing for multi-threaded access &lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;ex&amp;#039;&amp;#039;: &amp;lt;code&amp;gt;mpirun -np $NUM_PROC /path/to/coolProgram &amp;amp;lt; $INPUT &amp;amp;gt; /path/to/output&amp;lt;/code&amp;gt; &lt;br /&gt;
&lt;br /&gt;
=== GNU Parallel ===&lt;br /&gt;
simple vectorization of loops over processors for non-multithreaded processes &lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;ex&amp;#039;&amp;#039;: &amp;lt;code&amp;gt;parallel -j $NUM_PROC /path/to/thescript.sh ::: &amp;lt;/code&amp;gt;&amp;lt;span&amp;gt;&amp;lt;code&amp;gt;1..n&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;code&amp;gt; ::: &amp;lt;/code&amp;gt;&amp;lt;span&amp;gt;&amp;lt;code&amp;gt;1..m&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== CUDA/OpenCL ===&lt;br /&gt;
Nvidia-specific &amp;#039;&amp;#039;&amp;#039;CUDA&amp;#039;&amp;#039;&amp;#039; and open-source &amp;#039;&amp;#039;&amp;#039;OpenCL&amp;#039;&amp;#039;&amp;#039; provide a hardware abstracting API for using GPU for compute tasks &lt;br /&gt;
&lt;br /&gt;
must-have for Pytorch or TensorFlow workloads &lt;br /&gt;
&lt;br /&gt;
Nomenclature Divergence &lt;br /&gt;
&lt;br /&gt;
* CUDA thread = OpenCL work item = CPU lane &lt;br /&gt;
* CUDA multiprocessor = OpenCL compute unit = CPU&lt;br /&gt;
&lt;br /&gt;
=== High Performance Computers ===&lt;br /&gt;
HPC or supercomputing clusters provide high throughput analysis.&lt;br /&gt;
&lt;br /&gt;
Amazingly high amount of computational power.&lt;br /&gt;
&lt;br /&gt;
Need to plan your analysis.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;NDSU Center for Computationally-Assisted Science and Technology (CCAST)&amp;#039;&amp;#039;&amp;#039; provides a platform for these workloads connect via &amp;lt;code&amp;gt;ssh&amp;lt;/code&amp;gt; uses loadable modules: &lt;br /&gt;
&lt;br /&gt;
* &amp;#039;&amp;#039;ex:&amp;#039;&amp;#039; &amp;lt;code&amp;gt;module load parallel&amp;lt;/code&amp;gt; &lt;br /&gt;
&lt;br /&gt;
batch processing via PBS scripting&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== &amp;lt;span&amp;gt;Continued Learning&amp;lt;/span&amp;gt; ==&lt;br /&gt;
&amp;lt;div class=&amp;quot;outline&amp;quot;&amp;gt;&lt;br /&gt;
=== General Programming/Computers Websites ===&lt;br /&gt;
&lt;br /&gt;
* StackOverflow.com - check before asking new questions &lt;br /&gt;
* RosettaCode.org - data structures and algorithms in many languages &lt;br /&gt;
* Linux.die.net/man/ - the Linux manual &lt;br /&gt;
* grymoire.com/Unix/ - more *nix CLI tutorials&lt;br /&gt;
&lt;br /&gt;
=== Python ===&lt;br /&gt;
&lt;br /&gt;
* docs.python.org/3/ - the official python documentation &lt;br /&gt;
* docs.python.org/3/tutorial - the official tutorial &lt;br /&gt;
* diveintopython.net - guided tutorial online &lt;br /&gt;
* pythontutor.com - visual debugger &lt;br /&gt;
&lt;br /&gt;
=== R ===&lt;br /&gt;
&lt;br /&gt;
* cran.r-project.org - CRAN &lt;br /&gt;
* cran.r-project.org/manuals.html &lt;br /&gt;
* rdrr.io - meta-manual lookup and many other tools for R &lt;br /&gt;
* swirlstats.com - learn R, in R &lt;br /&gt;
* statslearning.com - statistical machine learning coursework&lt;br /&gt;
&lt;br /&gt;
=== Recommended Reading ===&lt;br /&gt;
&lt;br /&gt;
* &amp;#039;&amp;#039;An Introduction to Statistical Machine Learning&amp;#039;&amp;#039; by Gareth James et al. &lt;br /&gt;
* &amp;#039;&amp;#039;A Primer on Scientific Programming with Python&amp;#039;&amp;#039; by Hans Petter Langtangen &lt;br /&gt;
* &amp;#039;&amp;#039;R for Data Science&amp;#039;&amp;#039; by Wickham and Grolmund&amp;lt;/div&amp;gt;&amp;lt;/div&amp;gt;&lt;/div&gt;</summary>
		<author><name>Sysadmin</name></author>
	</entry>
</feed>