<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>R-statistics blog &#187; machine learning</title>
	<atom:link href="http://www.r-statistics.com/tag/machine-learning/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.r-statistics.com</link>
	<description>Writing about statistics with R, and open source stuff (software, data, community)</description>
	<lastBuildDate>Mon, 30 Jan 2012 07:45:09 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Want to join the closed BETA of a new Statistical Analysis Q&amp;A site &#8211; NOW is the time!</title>
		<link>http://www.r-statistics.com/2010/07/want-to-join-the-closed-beta-of-a-new-statistical-analysis-qa-site-now-is-the-time/</link>
		<comments>http://www.r-statistics.com/2010/07/want-to-join-the-closed-beta-of-a-new-statistical-analysis-qa-site-now-is-the-time/#comments</comments>
		<pubDate>Fri, 16 Jul 2010 07:06:56 +0000</pubDate>
		<dc:creator>Tal Galili</dc:creator>
				<category><![CDATA[R]]></category>
		<category><![CDATA[R community]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[communites]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[online]]></category>
		<category><![CDATA[Q&A]]></category>
		<category><![CDATA[statistical analysis]]></category>

		<guid isPermaLink="false">http://www.r-statistics.com/?p=474</guid>
		<description><![CDATA[The bottom line of this post is for you to go to: Stack Exchange Q&#038;A site proposal: Statistical Analysis And commit yourself to using the website for asking and answering questions. (And also consider giving the contender, MetaOptimize a visit) * * * * Statistical analysis Q&#038;A website is about to go into BETA A [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:right;"><div class="socialize-in-button socialize-in-button-right"><iframe src="http://www.facebook.com/plugins/like.php?href=http://www.r-statistics.com/2010/07/want-to-join-the-closed-beta-of-a-new-statistical-analysis-qa-site-now-is-the-time/&amp;layout=box_count&amp;show_faces=false&amp;width=50&amp;action=like&amp;font=arial&amp;colorscheme=light&amp;height=65" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:50px !important; height:65px;" allowTransparency="true"></iframe></div><div class="socialize-in-button socialize-in-button-right"><g:plusone size="tall" href="http://www.r-statistics.com/2010/07/want-to-join-the-closed-beta-of-a-new-statistical-analysis-qa-site-now-is-the-time/"></g:plusone></div></div><p><strong>The bottom line of this post is for you to go to:<br />
<a href="http://area51.stackexchange.com/proposals/33/statistical-analysis?referrer=3OUOcMUJcOo1">Stack Exchange Q&#038;A site proposal: Statistical Analysis </a><br />
And commit yourself to using the website for asking and answering questions.</strong></p>
<p>(And also consider giving the contender, <a href="http://metaoptimize.com/qa">MetaOptimize</a> a visit)</p>
<p>* * * * </p>
<h3>Statistical analysis Q&#038;A website is about to go into BETA</h3>
<p>A month ago I <a href="http://www.r-statistics.com/2010/06/a-new-qa-website-for-data-analysis-based-on-stackoverflow-engine-is-waiting-for-you/">invited readers of this blog to commit to using a new Q&#038;A website for Data-Analysis</a> (based on StackOverFlow engine), once it will open (the site was originally proposed by <a href="http://robjhyndman.com/researchtips/">Rob Hyndman</a>).<br />
And now, a month later, I am happy to write that <strong>over 500 people</strong> have shown interest in the website, and choose to commit themselves.  This means we we have reached 100% completion of the website proposal process, and in the next few days we will move to the next step.</p>
<p>The next step is that the website will go into closed BETA for about a week.  If you want to be part of this &#8211; now is <a href="http://area51.stackexchange.com/proposals/33/statistical-analysis?referrer=3OUOcMUJcOo1">the time to join</a> (<--- call for action people).<br />
From being part in some other closed BETA of similar projects, I can attest that the enthusiasm of the people trying to answer questions in the BETA is very impressive, so I strongly recommend the experience.</p>
<p>If you won't make it by the time you see this post, then no worries - about a week or so after the website will go online, it will be open to the wide public.</p>
<p>(p.s: thanks Romunov for pointing out to me that the BETA is about to open)</p>
<h3>p.s: MetaOptimize</h3>
<p>I would like to finish this post with mentioning <a href="http://metaoptimize.com/qa/">MetaOptimize</a>.   This is a Q&#038;A website which is of a more &#8220;machine learning&#8221; then a &#8220;statistical&#8221; community.  It also started out some short while ago, and already it has <a href="http://metaoptimize.com/qa/users/">around 700 users</a> who have submitted ~160 questions with ~520 answers given.  From my experience on the site so far, I have enjoyed the high quality of the questions and answers.<br />
When I first came by the website, I feared that supporting this website will split the R community of users between this website and the <a href="http://area51.stackexchange.com/proposals/33/statistical-analysis?referrer=3OUOcMUJcOo1">area 51 StackExchange website</a>.<br />
But after a lengthy discussion (<a href="http://www.r-statistics.com/2010/07/statistical-analysis-qa-website-did-stackoverflow-just-lose-it-to-metaoptimize-and-is-it-good-or-bad/">published recently as a post</a>) with MetaOptimize founder, Joseph Turian, I came to have a more optimistic view of the competition of the two websites.  Where at first I was afraid, I am now <strong>hopeful</strong> that each of the two website will manage to draw a tiny bit of different communities of people (that would otherwise wouldn&#8217;t be present in the other website) &#8211; thus offering all of us a wider variety of knowledge to tap into.</p>
<p>See you there&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.r-statistics.com/2010/07/want-to-join-the-closed-beta-of-a-new-statistical-analysis-qa-site-now-is-the-time/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>StackOverFlow and MetaOptimize are battling to be the #1 &#8220;Statistical Analysis Q&amp;A website” &#8211; to whom would you signup?</title>
		<link>http://www.r-statistics.com/2010/07/statistical-analysis-qa-website-did-stackoverflow-just-lose-it-to-metaoptimize-and-is-it-good-or-bad/</link>
		<comments>http://www.r-statistics.com/2010/07/statistical-analysis-qa-website-did-stackoverflow-just-lose-it-to-metaoptimize-and-is-it-good-or-bad/#comments</comments>
		<pubDate>Fri, 02 Jul 2010 21:55:05 +0000</pubDate>
		<dc:creator>Tal Galili</dc:creator>
				<category><![CDATA[R community]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[area51]]></category>
		<category><![CDATA[artificial intelligence]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[data visualization]]></category>
		<category><![CDATA[information retrieval]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[natural language processing]]></category>
		<category><![CDATA[Q&A website]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[stack exchange]]></category>
		<category><![CDATA[stackoverflow]]></category>
		<category><![CDATA[statistical modeling]]></category>
		<category><![CDATA[text analysis]]></category>

		<guid isPermaLink="false">http://www.r-statistics.com/?p=442</guid>
		<description><![CDATA[A new statistical analysis Q&#38;A website launched While the proposal for a statistical analysis Q&#38;A website on area51 (stackexchange) is taking it&#8217;s time, and the website is still collecting people who will commit to it, Joseph Turian, who seems a nice guy from his various comments online, seem to feel this website is not what [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:right;"><div class="socialize-in-button socialize-in-button-right"><iframe src="http://www.facebook.com/plugins/like.php?href=http://www.r-statistics.com/2010/07/statistical-analysis-qa-website-did-stackoverflow-just-lose-it-to-metaoptimize-and-is-it-good-or-bad/&amp;layout=box_count&amp;show_faces=false&amp;width=50&amp;action=like&amp;font=arial&amp;colorscheme=light&amp;height=65" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:50px !important; height:65px;" allowTransparency="true"></iframe></div><div class="socialize-in-button socialize-in-button-right"><g:plusone size="tall" href="http://www.r-statistics.com/2010/07/statistical-analysis-qa-website-did-stackoverflow-just-lose-it-to-metaoptimize-and-is-it-good-or-bad/"></g:plusone></div></div><h3>A new statistical analysis Q&amp;A website launched</h3>
<p>While <a href="http://bit.ly/aDuRKV">the proposal for a statistical analysis Q&amp;A website</a> on area51 (stackexchange) is taking it&#8217;s time, and the website is still collecting people who will commit to it,<br />
<a href="http://www-etud.iro.umontreal.ca/~turian/">Joseph Turian</a>, who seems a nice guy from his various comments online, seem to feel this website is not what the community needs and that we shouldn&#8217;t hold up on our questions for the website to go online.  Therefore, Joseph is pushing with all his might his newest creation &#8220;<a href="http://metaoptimize.com/qa">MetaOptimize QA</a>&#8220;, a <a href="http://StackOverFlow.com">StackOverFlow </a>like website for (long list follows): <em>machine learning, natural language processing, artificial intelligence, text analysis, information retrieval, search, data mining, statistical modeling, and data visualization</em>.<br />
With all the bells and whistles that the <a href="http://www.osqa.net/">OSQA framework</a> (an open source stackoverflow clone, and more, system) can offer (you know, rankings, badges and so on).</p>
<p>Is this new website better then the area51 website?  Will all the people go to just one of the two websites. or will we end up with two places that attracts more people then we had to begin with?  These are the questions that come to mind when faced with the story in front of us.</p>
<p>My own suggestion is to try both websites (<a href="http://bit.ly/aDuRKV">the stackoverflow statistical analysis website to come</a> and &#8220;<a href="http://metaoptimize.com/qa">MetaOptimize QA</a>&#8220;) and let time tell.</p>
<p>More info on this story bellow.</p>
<h3>MetaOptimize online impact so far</h3>
<p>The need for such a Q&amp;A site is clearly evident.  With just several days after being promoted online, MetaOptimize has claimed the eyes of almost 300 users, submitting 59 questions and 129 answers.<br />
Already many bloggers in the statistical community have contributed their voices with encouraging posts, here is just a collection of the post I was able to find with some googling:</p>
<ul>
<li><a href="http://hunch.net/?p=1425">http://hunch.net/?p=1425</a></li>
<li><a href="http://ebiquity.umbc.edu/blogger/2010/06/30/training-examples-qa-stackoverflow-for-nlp-and-ml/">http://ebiquity.umbc.edu/blogger/2010/06/30/training-examples-qa-stackoverflow-for-nlp-and-ml/</a></li>
<li><a href="http://lingpipe-blog.com/2010/06/29/training-examples-a-stack-overflow-for-nlp-and-ml-and/">http://lingpipe-blog.com/2010/06/29/training-examples-a-stack-overflow-for-nlp-and-ml-and/</a></li>
<li><a href="http://www.stat.columbia.edu/~cook/movabletype/archives/2010/06/question_answer.html">http://www.stat.columbia.edu/~cook/movabletype/archives/2010/06/question_answer.html</a></li>
<li><a href="http://kaggle.com/blog/2010/07/02/new-machine-learning-and-natural-language-processing-qa-site/">http://kaggle.com/blog/2010/07/02/new-machine-learning-and-natural-language-processing-qa-site/</a></li>
<li><a href="http://www.jroller.com/otis/entry/metaoptimize_com_q_a_site">http://www.jroller.com/otis/entry/metaoptimize_com_q_a_site</a></li>
<li><a href="http://sbseminar.wordpress.com/2010/06/17/statistics-version-of-mathoverflow-looking-for-beta-testers/">http://sbseminar.wordpress.com/2010/06/17/statistics-version-of-mathoverflow-looking-for-beta-testers/</a></li>
<li><a href="http://myumbc3.my.umbc.edu/news/1841">http://myumbc3.my.umbc.edu/news/1841</a></li>
<li><a href="http://ebiquity.umbc.edu/blogger/2010/06/30/training-examples-qa-stackoverflow-for-nlp-and-ml/">http://ebiquity.umbc.edu/blogger/2010/06/30/training-examples-qa-stackoverflow-for-nlp-and-ml/</a></li>
</ul>
<h3>But is it goos to have two websites?</h3>
<p>But wait, didn&#8217;t we just start pushing forward another <a href="http://www.r-statistics.com/2010/06/a-new-qa-website-for-data-analysis-based-on-stackoverflow-engine-is-waiting-for-you/">statistical Q&amp;A website two weeks ago</a>?  I am talking about the <strong><a href="http://bit.ly/aDuRKV">Stack Exchange Q&amp;A site proposal: Statistical Analysis</a>.</strong></p>
<p>So what should we (the community of statistical minded people) to do the next time we have a question?</p>
<p>Should we wait for Stack Exchange offer for a new website to start?  Or should we start using MetaOptimize?</p>
<p><strong>Update: <span style="font-weight: normal;">after lengthy e-mail exchange with Joseph (the person who founded MetaOptimize), I decided to erase what I originally wrote as my doubts, and instead give a Q&amp;A session that him and I have had in the e-mails exchange.  It is a bit edited from what was originally, and some of the content will probably get updated &#8211; so if you are into this subject, check in again in a few hours <img src='http://www.r-statistics.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </span></strong></p>
<p><del datetime="2010-07-03T09:28:16+00:00"><br />
Honestly, I am split in two (and <a href="http://www-etud.iro.umontreal.ca/~turian/">Joseph</a>, I do hope you&#8217;ll take this in a positive way, since personally I feel confident you are a good guy).  I very strongly believe in the need and value of such a Q&amp;A website.  Yet I am wondering how I feel about such a website being hosted as MetaOptimize and outside the hands of the stackoverflow guys.<br />
On the one hand, open source lovers (like myself) tend to like decentralization and reliance on OSS (open source software) solutions (such as the one <a href="http://www.osqa.net/">OSQA framework</a> offers).  On the other hand, I do believe that the stackoverflow people  have (much) more experience in handling such websites then Joseph.  I can very easily trust them to do regular database backups, share the websites database dumps with the general community, smoothly test and upgrade to provide new features, and generally speaking perform in a more  experienced way with the online Q&amp;A community.<br />
It doesn&#8217;t mean that Joseph won&#8217;t do a great job, personally I hope he will.</del></p>
<h3><strong><span style="text-decoration: underline;">Q&amp;A session with Joseph Turian (MetaOptimize founder)</span></strong></h3>
<p><strong><span style="text-decoration: underline;">Tal</span></strong>: Let&#8217;s start with the easy question, should I worry about technical issues in the website (like, for example, backups)?</p>
<p><span style="text-decoration: underline;"><strong>Joseph</strong></span>:</p>
<div id="_mcePaste">The OSQA team (backed by DZone) have got my back. They have been very helpful since day one to all OSQA users, and have given me a lot of support. Thanks, especially Rick and Hernani!</div>
<p>They provide email and chat support for OSQA users.</p>
<p>I will commit to putting up regular automatic database dumps, whenever the OSQA team implements it:<br />
<a href="http://meta.osqa.net/questions/3120/how-do-i-offer-database-dumps">http://meta.osqa.net/questions/3120/how-do-i-offer-database-dumps</a><br />
If, in six months, they don&#8217;t have this feature as part of their core, and someone (e.g. you) emails me reminding me that they want a dump, I will manually do a database dump and strip the user table.</p>
<p>Also, I&#8217;ve got a scheduled daily database dump that is mirrored to Amazon S3.</p>
<p><span style="text-decoration: underline;"><strong><strong><span style="text-decoration: underline;">Tal</span></strong>:</strong></span> Why did you start MetaOptimize instead of supporting the area51 proposal?<br />
<span style="text-decoration: underline;"><strong>Joseph</strong></span>:</p>
<ol>
<li><span style="font-size: 13.1944px;">On Area51, people asked to have AI merged with ML, and ML merged with statistical analysis, but their requests seemed to be ignored. This seemed like a huge disservice to these communities.</span></li>
<li><span style="font-size: 13.1944px;">Area 51 didn&#8217;t have academics in ML + NLP. I know from experience it&#8217;s hard to get them to buy in to new technology. So why would I risk my reputation getting them to sign up for Area 51, when I know that I will get a 1% conversion? They aren&#8217;t early adopters interested in the process, many are late adopters who won&#8217;t sign up for something until they have too.</span></li>
<li><span style="font-size: 13.1944px;">If the Area 51 sites had a strong newbie bent, which is what it seemed like the direction was going, then the academic experts definitely wouldn&#8217;t waste their time. It would become a support<br />
</span><span style="font-size: 13.1944px;">community for newbies, without core expert discussion.  So basically, I know that I and a lot of my colleagues wanted the site I built. And I felt like area 51 was shaping the communities really incorrectly in several respects, and was also taking a while.  I could have fought an institutional process and maybe gotten half the results above and it took a few months, or I could just build the site and invite my friends, and shape the community correctly.</span></li>
</ol>
<p>Besides that, there are also personal motives:</p>
<ul>
<li><span style="font-size: 13.1944px;">I wanted the recognition for having a good vision for the community, and driving forward something they really like.</span></li>
<li><span style="font-size: 13.1944px;">I wanted to experiment with some NLP and ML extensions for the Q+A software, to help organize the information better. Not possible on a closed platform.</span></li>
</ul>
<p><span style="text-decoration: underline;"><strong><strong><span style="text-decoration: underline;">Tal</span></strong>:</strong></span> Me (and maybe some other people) fear that this might fork the people in the field to two websites, instead of bringing them together.  What are your thoughts about that?<br />
<span style="text-decoration: underline;"><strong>Joseph</strong></span>:<br />
How am I forking the community? I&#8217;m bringing a bunch of people in who wouldn&#8217;t have even been part of the Area 51 community.<br />
Area 51 was going to fork it into five communities: stat analysis, ML, NLP, AI, and data mining.  And then a lot fewer people would have been involved.</p>
<p><span style="text-decoration: underline;"><strong><strong><span style="text-decoration: underline;">Tal</span></strong>:</strong></span> What are the things that people who support your website are saying?<br />
<span style="text-decoration: underline;"><strong>Joseph</strong></span>:<br />
Here are some quotes about my site:</p>
<blockquote><p>Philip Resnick (UMD): &#8220;Looking at the questions being asked, the people responding, and the quality of the discussion, I can already see this becoming the go-to place for those &#8216;under the hood&#8217; details<br />
you rarely see in the textbooks or conference papers. This site is going to save a lot of people an awful lot of time and frustration.&#8221;</p>
<p>Aria Haghighi (Berkeley): &#8220;Both NLP and ML have a lot of folk wisdom about what works and what doesn&#8217;t. A site like this is crucial for facilitating the sharing and validation of this collective knowledge.&#8221;</p>
<p>Alexandre Passos (Unicamp): &#8220;Really thank you for that. As a machine learning phd student from somewhere far from most good research centers (I&#8217;m in brazil, and how many brazillian ML papers have you<br />
seen in NIPS/ICML recently?), I struggle a lot with this folk wisdom. Most professors around here haven&#8217;t really interacted enough with the international ML community to be up to date&#8221;<br />
(http://news.ycombinator.com/item?id=1476247)</p>
<p>Ryan McDonald (Google): &#8220;A tool like this will help disseminate and archive the tricks and best practices that are common in NLP/ML, but are rarely written about at length in papers.&#8221;</p>
<p>esoom on Reddit: &#8220;This is awesome. I&#8217;m really impressed by the quality of some of the answers, too. Within five minutes of skimming the site, I learned a neat trick that isn&#8217;t widely discussed in the literature.&#8221;<br />
(http://www.reddit.com/r/MachineLearning/comments/ckw5k/stackoverflow_for_machine_learning_and_natural/c0tb3gc)</p>
<p><span style="text-decoration: underline;"><strong><strong><span style="text-decoration: underline;">Tal</span></strong>:</strong></span> In order to be fair to area51 work, they have gotten wonderful responses for the &#8220;statistical analysis&#8221; proposal as well (<a href="http://bit.ly/aDuRKV">see it here</a>)<br />
I have also contacted area51 directly and asked them and invited them to come and join the discussion.  I&#8217;ll update this post with their reply.</p></blockquote>
<h3><span style="text-decoration: underline;">So what&#8217;s next?</span></h3>
<p><del datetime="2010-07-03T08:08:02+00:00">I don&#8217;t know.<br />
If the Stack Exchange website where to launch today, I would probably focus on using it and hint to the site for MetaOptimize (for the reasons I just mentioned, and also for some that Rob Hyndman maintained when he <a href="http://robjhyndman.com/researchtips/stack-exchange-for-statistical-analysis-needs-you/">first wrote on the subject</a>).<br />
If the stack exchange version of the website where to start in a few weeks, I would probably sit on the fence and see if people are using it.  I suspect that by that time, there wouldn&#8217;t be many people left to populate it (but I could always be wrong).<br />
And what if the website where to start in a week, what then?  I have no clue.</del><br />
Good question.<br />
My current feeling is that I am glad to let this play out.<br />
It seems this is a good case study for some healthy competition between platforms and models (OSQA vs stackoverflow/area51-system) &#8211; one that I hope will generate more good features from both companies.  And also will make both parties work hard to get people to participate.<br />
It also seems that this situation is getting many people in our field to be approached with the same idea (Q&amp;A website).  After Joseph input on the subject, I am starting to think that maybe at the end of the day this will benefit all of us.  Instead of forking one community into two, maybe what we&#8217;ll end up with is getting more (experienced) people online (into two locations) that would otherwise would have stayed in the shadows.</p>
<p>The verdict is still out, but I am a bit more optimistic than I was when first writing this post.  I&#8217;ll update this post after getting more input from people.</p>
<p>And as always &#8211; I would love to know <strong><span style="text-decoration: underline;">your thoughts</span></strong> on the subject.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.r-statistics.com/2010/07/statistical-analysis-qa-website-did-stackoverflow-just-lose-it-to-metaoptimize-and-is-it-good-or-bad/feed/</wfw:commentRss>
		<slash:comments>19</slash:comments>
		</item>
		<item>
		<title>Free statistics e-books for download</title>
		<link>http://www.r-statistics.com/2009/10/free-statistics-e-books-for-download/</link>
		<comments>http://www.r-statistics.com/2009/10/free-statistics-e-books-for-download/#comments</comments>
		<pubDate>Sun, 25 Oct 2009 07:31:02 +0000</pubDate>
		<dc:creator>Tal Galili</dc:creator>
				<category><![CDATA[R]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[book]]></category>
		<category><![CDATA[ebook]]></category>
		<category><![CDATA[Jerome Friedman]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[Robert Tibshirani]]></category>
		<category><![CDATA[statistical learning]]></category>
		<category><![CDATA[Trevor Hastie]]></category>

		<guid isPermaLink="false">http://www.r-statistics.com/?p=40</guid>
		<description><![CDATA[This post will eventually grow to hold a wide list of books on statistics (e-books, pdf books and so on) that are available for free download.  But for now we&#8217;ll start off with just one several books: The Elements of Statistical Learning written by Trevor Hastie, Robert Tibshirani and Jerome Friedman. you can legally download [...]]]></description>
			<content:encoded><![CDATA[<div class="socialize-in-content" style="float:right;"><div class="socialize-in-button socialize-in-button-right"><iframe src="http://www.facebook.com/plugins/like.php?href=http://www.r-statistics.com/2009/10/free-statistics-e-books-for-download/&amp;layout=box_count&amp;show_faces=false&amp;width=50&amp;action=like&amp;font=arial&amp;colorscheme=light&amp;height=65" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:50px !important; height:65px;" allowTransparency="true"></iframe></div><div class="socialize-in-button socialize-in-button-right"><g:plusone size="tall" href="http://www.r-statistics.com/2009/10/free-statistics-e-books-for-download/"></g:plusone></div></div><p>This post will eventually grow to hold a wide list of books on statistics (e-books, pdf books and so on) that are available for free download.  But for now we&#8217;ll start off with just <span style="text-decoration: line-through;">one </span> several books:</p>
<ul>
<ul>
<li><em><strong>The Elements of Statistical Learning</strong></em> written by Trevor Hastie, Robert Tibshirani and Jerome Friedman. you can legally download a copy of the book in pdf format from the <a href="http://www-stat.stanford.edu/~tibs/ElemStatLearn/">authors website</a>! <a href="http://www-stat.stanford.edu/~tibs/ElemStatLearn/download.html">Direct download</a> (First discovered on the &#8220;<a href="http://onertipaday.blogspot.com/2009/10/elements-of-statistical-learning.html">one R tip a day</a>&#8221; blog)</li>
<li><a href="http://en.wikibooks.org/wiki/Statistics">Statistics (Probability and Data Analysis)</a> &#8211; a wikibook. <strong><a href="http://upload.wikimedia.org/wikipedia/commons/8/82/Statistics.pdf" rel="nofollow">Download link</a></strong></li>
<li><a href="http://www.math.umass.edu/~lavine/Book/book.html">Introduction to Statistical Thought</a> by Michael Lavine.  The book is organized into seven chapters: “Probability,” “Modes of Inference,” “Regression,” “More Probability,” “Special Distributions,” “More Models,” and “Mathematical Statistics.” and makes extensive use of R.  Here is a favoring review the book received in <a href="http://www.math.umass.edu/~lavine/Book/jasareview.pdf">JASA</a>. 328 pages. <strong><a href="http://www.math.umass.edu/~lavine/Book/book.pdf" rel="nofollow">Download link</a></strong> (approx. 40 mbyte)</li>
<li><a href="http://mitpress.mit.edu/catalog/item/default.asp?ttype=2&amp;tid=12156">Street-Fighting Mathematics</a> by Sanjoy Mahajan. <strong><a href="http://mitpress.mit.edu/books/full_pdfs/Street-Fighting_Mathematics.pdf" rel="nofollow">Download link</a></strong></li>
<li><strong> </strong><a href="http://psy.otago.ac.nz/miller/index.htm#GLMBook">Statistical Analysis with the General Linear Model</a> by Miller and Haden. an introductory textbook describing statistical analysis with analysis of variance (ANOVA, including repeated-measures and mixed designs), simple and multiple regression, and analysis of covariance. 274 pages. <strong><a href="http://www.mediafire.com/?bdggpmmew0z">Download link</a> </strong>(p.s: this book makes no reference to R.  <a href="http://www.r-statistics.com/2010/04/repeated-measures-anova-with-r-tutorials/">see here for R tutorials and functions for performing repeated measures anova</a>)</li>
<li><a href="http://cnx.org/content/col10522/latest/">Collaborative Statistics</a> by Barbara Illowsky and Susan Dean.  This textbook is intended for introductory statistics courses.  627 pages.  R is not used in this book.  <strong><a href="http://cnx.org/content/col10522/1.38/pdf" rel="nofollow">Download link</a></strong></li>
<li><strong><em>Using R for Introductory Statistics</em></strong> by John Verzani Publisher: Chapman &amp; Hall/CRC 2004 ISBN/ASIN: 1584884509 ISBN-13: 9781584884507 Number of pages: 114 Description: The author presents a self-contained treatment of statistical topics and the intricacies of the R software. The book treats exploratory data analysis with more attention than is typical, includes a chapter on simulation, and provides a unified approach to linear models. This text lays the foundation for further study and development in statistics using R. <a href="http://cran.r-project.org/doc/contrib/Verzani-SimpleR.pdf"><strong>Download link</strong></a></li>
<li><strong><em>R Graphics</em></strong> (Three chapters only) by Paul Murrell ISBN: 9781584884866 ISBN 10: 158488486X Publication Date: July 29, 2005 Number of Pages: 328 Description: Chapter 1: An Introduction to R Graphics Chapter 4: Trellis Graphics: The Lattice Package Chapter 5: The Grid Graphics Model <a href="http://www.stat.auckland.ac.nz/~paul/RGraphics/RGraphicsChapters-1-4-5.pdf"><strong>Download link</strong></a> (see scripts and images <a href="http://www.stat.auckland.ac.nz/~paul/RGraphics/rgraphics.html">here</a>)</li>
<li><strong><em>Using R</em></strong> <a href="http://cran.r-project.org/doc/contrib/usingR.pdf"><strong>Download link</strong></a></li>
<li><strong><em>R intro</em></strong> <strong><a href="http://cran.r-project.org/doc/manuals/R-intro.pdf">Download link</a></strong></li>
<li><em><strong>Psychometric Theory with Applications in R</strong></em> by William Revelle (a work in progress) <a href="http://www.personality-project.org/r/book/" rel="nofollow"><strong>Download link</strong></a></li>
<li>A great long list of R related texts, for free download, can be <a href="http://cran.r-project.org/other-docs.html#english">found here</a>.</li>
<li><strong><em>Using Graphs Instead of Tables</em></strong> <a href="http://tables2graphs.com/doku.php"><strong>website link</strong></a> (This web page accompanies the article &#8220;Using Graphs Instead of Tables in Political Science&#8221;, by Jonathan Kastellec and Eduardo Leoni, which appears in the December 2007 issue of Perspectives on Politics. It contains complete replication code for all the graphs that appear in the text)</li>
<li><strong><a href="http://ipsur.r-forge.r-project.org/book/">IPSUR: Introduction to Probability and Statistics Using R</a></strong> by G. Jay Kerns, is <a href="http://www.gnu.org/copyleft/fdl.html">FREE</a> (in the <a href="http://www.r-statistics.com/2010/07/richard-stallman-talkqa-at-the-user-2010-conference-audio-files-attached/">GNU sense</a> of the word) and comes with<a href="http://ipsur.r-forge.r-project.org/rcmdrplugin/"> a plugin for Rcmdr</a>. 412 pages. <strong><a href="http://ipsur.r-forge.r-project.org/book/download.html" rel="nofollow">Download link</a> </strong>(first discovered through <a href="http://blog.revolutionanalytics.com/2010/07/a-free-book-on-probability-and-statistics-with-r.html">the Revolution blog</a>)</li>
<li><strong><a href="http://knowledgeforge.net/opentextbook/svn/multivariatestatistics/">Multivariate Statistics with R</a></strong> by Paul J. Hewson. 189 pages. <strong><a href="http://knowledgeforge.net/opentextbook/svn/multivariatestatistics/notes.pdf" rel="nofollow">Download link</a> </strong>(first discovered through <a href="http://www.opentextbook.org/2009/04/03/multivariate-statistics-with-r/">open text book blog</a>)</li>
<li><strong><a href="http://en.wikibooks.org/wiki/R_Programming">R Programming</a></strong> &#8211; a wikibook. (no PDF version is available as of yet)</li>
<li><a href="http://pluto.huji.ac.il/~msby/StatThink/index.html">Introduction to Statistical Thinking (With R, Without Calculus)</a> &#8211; By Benjamin Yakir</li>
<li><a href="http://www.math.ku.dk/~sjo/papers/HaldBook.pdf">A History of Parametric Statistical Inference from Bernoulli to Fisher, 1713 to 1935</a> &#8211; By Anders Hald</li>
<li><a href="http://uncertainty.stat.cmu.edu/?p=1">Principles of Uncertainty</a> (direct <a href="http://uncertainty.stat.cmu.edu/wp-content/uploads/2011/05/principles-of-uncertainty.pdf">link to pdf</a>), by Jay Kadane (got <a href="http://xianblog.wordpress.com/2011/10/14/principles-of-uncertainty/">a great review by xian</a>)</li>
</ul>
</ul>
<p>&nbsp;</p>
<p>Several of these books were discovered through a<a href="http://stats.stackexchange.com/questions/614/open-source-statistical-textbooks/"> CrossValidated discussion</a>.</p>
<p>* * *</p>
<p><em>Know of any more e-books freely available for download? Please write to me about them in the comments.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.r-statistics.com/2009/10/free-statistics-e-books-for-download/feed/</wfw:commentRss>
		<slash:comments>15</slash:comments>
		</item>
	</channel>
</rss>

