<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-8780666648094660666</id><updated>2011-07-31T03:17:29.329-07:00</updated><category term='books'/><category term='R'/><title type='text'>Systems</title><subtitle type='html'>Discussions on various computing systems, especially storage and scalability</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://stevenonsystems.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8780666648094660666/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://stevenonsystems.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Steven Jenkins</name><uri>http://www.blogger.com/profile/06556182604117556286</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>2</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-8780666648094660666.post-2491314661241313599</id><published>2011-07-06T16:57:00.000-07:00</published><updated>2011-07-06T16:57:07.939-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='books'/><category scheme='http://www.blogger.com/atom/ns#' term='R'/><title type='text'>Book review: The R Cookbook by Paul Teetor</title><content type='html'>I haven't seen a better introduction to R than Paul Teetor's R Cookbook, published by O'Reilly.&amp;nbsp; While it follows the&amp;nbsp; familiar O'Reilly cookbook format, it also provides a gentle introduction, with all the necessary information to get started&amp;nbsp; As a particularly nice touch for a cookbook, it includes basic statistics and input/output in the early chapters so that the reader doesn't need to wade through (or fearfully skip over) a lot of material before getting to the needed resources.&lt;br /&gt;&lt;br /&gt;A common complaint with other R resources is that the novice in&lt;br /&gt;statistics is overwhelmed with statistical terminology.&amp;nbsp; Teetor&lt;br /&gt;is not trying to provide a statistics textbook, but he includes refreshing&lt;br /&gt;explanations for the underlying statistics.&lt;br /&gt;&lt;br /&gt;Some chapters are particular standouts:&lt;br /&gt;&lt;br /&gt;Chapter 2: Some Basics.&amp;nbsp; This chapter is an appetizer of what R can do,&lt;br /&gt;and it's very helpful to get this early.&amp;nbsp; Aside from the basic usage of R covered in this chapter, section 2.6 (Computing Basic Statistics) provides a quick introduction to performing basic statistics with R.&lt;br /&gt;&lt;br /&gt;Chapter 4: Input and Output.&amp;nbsp; R's input/output support is a bit cumbersome, but the R Cookbook provides examples for many common cases that newcomers need to handle (text files, CSV's, etc).&lt;br /&gt;&lt;br /&gt;Chapter 9: General Statistics.&amp;nbsp; This is the meat and potatoes of R for many statistical users.&amp;nbsp; Students in a basic statistics course (or practitioners needing to do most fundamental analyses) will find chapter 9 to be indispensable.&lt;br /&gt;&lt;br /&gt;Chapter 10: Graphics provides a nice dessert as visualizing data is often critical to understanding it.&amp;nbsp; Teetor provides simple, concrete examples that cover many of the common graphics, as well as how to handle their titles, labels, and legends.&lt;br /&gt;&lt;br /&gt;As an added bonus, Teetor and O'Reilly provide Chapter 14: Time Series Analysis.&amp;nbsp; The coverage here goes beyond standard cookbook fare and provides a good starting point for those interested in Time Series Analysis.&lt;br /&gt;&lt;br /&gt;Overall, the R Cookbook is the best O'Reilly cookbook I've read since the release of the Perl Cookbook, and it's by far the best introduction to R that I've seen.&amp;nbsp; It's a must-have for every newcomer to R.&lt;br /&gt;&lt;br /&gt;[Disclaimer: I got this book for free as part of the Oreilly blogger review program I was not required to write a positive review. The opinions I have expressed are my own. I am disclosing this in accordance with the Federal Trade Commission’s 16 CFR, Part 255 : “Guides Concerning the Use of Endorsements and Testimonials in Advertising.”]&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8780666648094660666-2491314661241313599?l=stevenonsystems.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://stevenonsystems.blogspot.com/feeds/2491314661241313599/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8780666648094660666&amp;postID=2491314661241313599' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8780666648094660666/posts/default/2491314661241313599'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8780666648094660666/posts/default/2491314661241313599'/><link rel='alternate' type='text/html' href='http://stevenonsystems.blogspot.com/2011/07/book-review-r-cookbook-by-paul-teetor.html' title='Book review: The R Cookbook by Paul Teetor'/><author><name>Steven Jenkins</name><uri>http://www.blogger.com/profile/06556182604117556286</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8780666648094660666.post-3292494001411925224</id><published>2007-08-20T20:31:00.000-07:00</published><updated>2007-08-20T22:53:36.797-07:00</updated><title type='text'>OpenAFS, Acopia, and Panasas</title><content type='html'>AFS (as the Andrew File System, IBM AFS, and, now, OpenAFS) has been around for a long time.  That longevity has brought it some distinct advantages: the userbase is both broad and deep; the product is also stable.   There are lots of competitors to OpenAFS, though, with money to be made in the storage market.  Witness the recent acquisition of Acopia by F5 Networks.  Another vendor, Panasas, is clearly viewed as a potential good business (e.g., take a look at their Board of Directors -- venture capitalists would not be on the Board if they did not think the company would be profitable).&lt;br /&gt;&lt;br /&gt;Those two companies, Acopia and Panasas, represent two different market segments that have historically been in the sweet spot of AFS usage.  AFS is still strong in one of those areas, but it has soured a bit in the other.&lt;br /&gt;&lt;br /&gt;Acopia's claim-to-fame is virtualization, the ability to keep a namespace constant while changing the back ends around.  They also do data migrations.  They export via NFS or CIFS, so virtually any modern operating system can access data through their systems.  This is very  nice.  The downside, though, is that Acopia is a hardware solution.  Lori MacVittie's &lt;a href="http://www.vmware.com/products/product_index.html"&gt;neat article&lt;/a&gt; about her personal NAS notwithstanding, using Acopia's ARX to provide seamless migration for your failed personal NAS just does not make fiscal sense. &lt;br /&gt;&lt;br /&gt;AFS provides the same kind of virtualization, but at a different cost.  First, no special hardware is needed.  The cost comes in complexity: clients have to run the AFS client.  Ports exist to lots of modern operating systems (from AIX to Windows), but installing clients is definitely more expensive than plugging a network-transparent NFS proxy into your network.  The other cost is in administration: the ramp-up for AFS is fairly steep.  While efforts have been made to help people get started with AFS, there is still a lot of work to be done.&lt;br /&gt;&lt;br /&gt;The two key features of AFS that provide this virtualization are the @sys magic, and the separation of filesystems into volumes, with volume metadata managed by database servers.  These key pieces let administrators glue together namespaces seamlessly.  The stable semantics of volume migration also lets administrators migrate data around a site even while users are accessing that data, letting users stay even further from the underlying details of the storage infrastructure.&lt;br /&gt;&lt;br /&gt;Panasas, on the other hand, is a clear winner over AFS in its product niche: high-performance NFS.  Like Lustre and several other filesystem products that live in the High-Performance Filesystem niche, Panasas accomplishes this by parallelizing remote filesystem accesses.  AFS gets some performance benefits from its caching, but the filesystem accesses are done against a single filesystem.  AFS also doesn't really do NFS. &lt;br /&gt;&lt;br /&gt;So who is buying Panasas?  While l have no knowledge of the sales, I can make some fairly educated guesses:  Organizations with large data sets in NFS (or CIFS) that need greater performance.  The large research organizations (high-energy physics labs, TeraGRID research groups, etc) might be interested in Panasas (except they already have Lustre, with support via ClusterFS).  The most direct competitors to Panasas, then, are the NAS appliance vendors.  It is interesting that most of the large research organizations have historically been heavy AFS users as well, and many still are.  AFS is widely used for the cross-site sharing of data, but it simply doesn't perform well enough to be competitive with NAS appliances running NFS. &lt;br /&gt;&lt;br /&gt;My suggestion, then, to the OpenAFS community is to get serious about helping people get started with AFS, and compete with the Acopias of the world.  Also, look into improving AFS performance: as AFS is more complex than NFS, there is likely never to be a performance comparison in favor of AFS; however, parallel filesystem accesses have been around for quite a while, and  an implementation of it in AFS could be very interesting.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8780666648094660666-3292494001411925224?l=stevenonsystems.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://stevenonsystems.blogspot.com/feeds/3292494001411925224/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8780666648094660666&amp;postID=3292494001411925224' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8780666648094660666/posts/default/3292494001411925224'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8780666648094660666/posts/default/3292494001411925224'/><link rel='alternate' type='text/html' href='http://stevenonsystems.blogspot.com/2007/08/openafs-futures.html' title='OpenAFS, Acopia, and Panasas'/><author><name>Steven Jenkins</name><uri>http://www.blogger.com/profile/06556182604117556286</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry></feed>
