<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Linuxscalability | Linux</title>
	<atom:link href="http://linux.sjolshagen.net/tag/scalability/feed/" rel="self" type="application/rss+xml" />
	<link>http://linux.sjolshagen.net</link>
	<description>Linux for Businesses</description>
	<lastBuildDate>Wed, 01 Feb 2012 17:33:51 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Scaling up your virtualization solution on 8-socket HP ProLiant Servers</title>
		<link>http://linux.sjolshagen.net/2010/03/01/scaling-up-your-virtualization-solution-on-8-socket-hp-proliant-servers/</link>
		<comments>http://linux.sjolshagen.net/2010/03/01/scaling-up-your-virtualization-solution-on-8-socket-hp-proliant-servers/#comments</comments>
		<pubDate>Mon, 01 Mar 2010 19:10:28 +0000</pubDate>
		<dc:creator>Thomas S</dc:creator>
				<category><![CDATA[Linux]]></category>
		<category><![CDATA[Virtualization]]></category>
		<category><![CDATA[8-socket]]></category>
		<category><![CDATA[DL 785]]></category>
		<category><![CDATA[HP ProLiant]]></category>
		<category><![CDATA[KVM]]></category>
		<category><![CDATA[Performance]]></category>
		<category><![CDATA[Red Hat]]></category>
		<category><![CDATA[RHEL 5.4]]></category>
		<category><![CDATA[scalability]]></category>
		<category><![CDATA[server]]></category>
		<category><![CDATA[virtualization]]></category>

		<guid isPermaLink="false">http://linux.sjolshagen.net/?p=145</guid>
		<description><![CDATA[Some of the things we've learned while testing the KVM based virtualization solution in RHEL 5.4 on an 8-socket HP ProLiant server.]]></description>
			<content:encoded><![CDATA[<p>These days, when wearing my “Linux planner” hat, and with Virtualization being the “phrase that pays”, I’m often asked to help provide guidance on how to best take advantage of the technology included in our 8-socket HP ProLiant server offerings for Linux based virtualization solutions like Red Hat Enterprise Virtualization or Suse Linux Enterprise Server Xen (there’s a plethora of information out there about VMware ESX/ESXi 3.5.x and vSphere 4.0, so I’m not going to talk about that, this time around.)</p>
<p>The problem I’ve had, until recently, was providing actual – objective &#8211; data as a means to help illustrate my points.  For instance, I could not clearly illustrate how a snoop filter on the CPU interconnect can improve the linearity of the workload scalability in a virtualized environment (see Fig. 1).</p>
<div id="attachment_143" class="wp-caption aligncenter" style="width: 310px"><a href="http://linux.sjolshagen.net/wp-content/uploads/2010/03/Best-run-pinned-vs-unpinned.png"><img class="size-medium wp-image-143" title="Pinned and un-pinned tiles" src="http://linux.sjolshagen.net/wp-content/uploads/2010/03/Best-run-pinned-vs-unpinned-300x192.png" alt="" width="300" height="192" /></a><p class="wp-caption-text">Fig. 1: Average response time with pinned vs. un-pinned processors</p></div>
<p>I was unable to demonstrate benefits of the NUMA aware scheduler that the Linux kernel uses and how it <em>does</em> improve performance. (In figure 2, it’s represented by the improvement in average response times from the web-servers included in the workload) when your workloads run with memory interleaving disabled – see Fig. 2 and 3. Unless, for support reasons, your application vendor explicitly tells you otherwise, of course!</p>
<div id="attachment_151" class="wp-caption aligncenter" style="width: 310px"><a rel="attachment wp-att-151" href="http://linux.sjolshagen.net/2010/03/scaling-up-your-virtualization-solution-on-8-socket-hp-proliant-servers/non-interleaved-memory-avg_response/"><img class="size-medium wp-image-151" title="Non-interleaved Memory Config" src="http://linux.sjolshagen.net/wp-content/uploads/2010/03/non-interleaved-memory-avg_response-300x192.png" alt="" width="300" height="192" /></a><p class="wp-caption-text">Fig. 2: Average Response Times - Non-interleaved Memory Config</p></div>
<div id="attachment_153" class="wp-caption aligncenter" style="width: 310px"><a href="http://linux.sjolshagen.net/wp-content/uploads/2010/03/Interleaved-memory-avg-response.png"><img class="size-medium wp-image-153" title="Average Response Times - Interleaved RAM" src="http://linux.sjolshagen.net/wp-content/uploads/2010/03/Interleaved-memory-avg-response-300x225.png" alt="" width="300" height="225" /></a><p class="wp-caption-text">Fig. 3: Average Response Times - Interleaved memory</p></div>
<p>I also used to have a hard time explaining how and why to tune the Linux kernel for these systems. For instance, I only suspected how little (none) tuning of the host platform is required in order to drive pretty significant numbers of guests  (98) in these environments &#8211; see Fig. 4. But, if you engage in some very minor tuning activities of the network stack, how those very same workload performance results can be extended even further (to 256 guests) – see Fig. 5:</p>
<div id="attachment_161" class="wp-caption aligncenter" style="width: 310px"><a href="http://linux.sjolshagen.net/wp-content/uploads/2010/03/forgot-to-tune-linearity-graph.png"><img class="size-medium wp-image-161" title="Default tuning for Host server" src="http://linux.sjolshagen.net/wp-content/uploads/2010/03/forgot-to-tune-linearity-graph-300x192.png" alt="" width="300" height="192" /></a><p class="wp-caption-text">Fig. 4: The system has not been tuned beyond it&#39;s &quot;out of the box&quot; state.</p></div>
<div id="attachment_163" class="wp-caption aligncenter" style="width: 310px"><a href="http://linux.sjolshagen.net/wp-content/uploads/2010/03/tuned-slice.png"><img class="size-medium wp-image-163" title="Fully tuned and linear scalability" src="http://linux.sjolshagen.net/wp-content/uploads/2010/03/tuned-slice-300x192.png" alt="" width="300" height="192" /></a><p class="wp-caption-text">Fig. 5: System is tuned and exhibiting linear scalability to 256 KVM guests</p></div>
<p>As part of a joint documentation effort with Red Hat, all of the data collected has been brought together in a <a href="https://inquiries.redhat.com/go/redhat/rhel-hp-proliant">Reference Architecture document  &#8211; “Scaling RHEL 5.4 + KVM up to 256 Guests&#8221;</a> available for free from Red Hat’s website.</p>
<p>We obviously picked the guest density to prove a point about the platform, however it’s worth mentioning that <strong><em>256 guests</em></strong> <strong><em>does not represent the upper bound for the platform</em></strong>. It only represents where we thought the density went (far) beyond what is reasonable to expect in a production environment this day in age.</p>
]]></content:encoded>
			<wfw:commentRss>http://linux.sjolshagen.net/2010/03/01/scaling-up-your-virtualization-solution-on-8-socket-hp-proliant-servers/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Getting ready for an industry conference</title>
		<link>http://linux.sjolshagen.net/2009/05/09/getting-ready-for-an-industry-conference/</link>
		<comments>http://linux.sjolshagen.net/2009/05/09/getting-ready-for-an-industry-conference/#comments</comments>
		<pubDate>Sat, 09 May 2009 17:44:10 +0000</pubDate>
		<dc:creator>Thomas S</dc:creator>
				<category><![CDATA[Linux]]></category>
		<category><![CDATA[Performance]]></category>
		<category><![CDATA[scalability]]></category>
		<category><![CDATA[virtualization]]></category>

		<guid isPermaLink="false">http://linux.sjolshagen.net/?p=6</guid>
		<description><![CDATA[I wound up with more work on my plate this week in order to get ready for a large conference this summer where I need to talk about scalability, performance and virtualization of large scale-up platforms. ]]></description>
			<content:encoded><![CDATA[<p>This week was one of those &#8220;in spite of the best laid plans of mice and men&#8221; weeks.</p>
<p>One of the engineers I&#8217;ve been working with to help me get a better understanding for what the performance and scalability of a scale-up environment when being used to run Red Hat Virtualization (Xen) as well as the Kernel Virtual Machine (KVM) is changing roles within the company. Good for him!</p>
<p>Unfortunately, this means that I&#8217;ll have to do some of the data-collection work on my own. On top of my daily responsibilities. So, we met this week to try and figure out how we could compress what&#8217;s been, thus far, a multi-month effort into something a bit more schedule friendly for the both of us. And I think we&#8217;ve got a reasonable plan, all things considered.</p>
<p>One of the achilles heels of almost any of the virtualization solutions, thus far, has been the IO throughput (and latency) for disk related IO operations. If you look at testing done by various vendors and the Open Source community at large, the throughput has been either &#8220;not great&#8221; or inconsistent. The &#8220;not great&#8221; element of this hasn&#8217;t, historically, really been that huge of a deal for customers, since they seem to have planned around it by not including disk IO intensive workloads into their consolidation/virtualization plans.</p>
<p>However, more and more customers are looking to virtualize <em>everything</em> running on their server platforms in an effort to save power, cooling and management costs. As a result, the &#8220;not great&#8221; performance behavior has become enough of an issue that all of the virtualization vendors now support (or will in the near future), minimally, para-virtualized IO drivers and/or other performance optimizations.</p>
<p>Consequently, they also appear to be pushing Linux (and Windows) further &#8220;back&#8221; into their data centers and looking to use the two for more and more critical tasks, even in a virtualized/consolidated context. An on-going problem for myself and our customers has been the amount of fact-based information available in terms of how to best configure these environments to optimize IO performance or to help customers understand the actual limitations and benefits of the various IO options in a virtualized environment.</p>
<p>Also, there&#8217;s very little in terms of stated best practices for using things like raw devices, volume manager backed devices, file systems or file containers for the various types of workloads. So, I was hoping we&#8217;d be able to provide some of that in a presentation at a large conference this summer.  And my original plan was that I&#8217;d only be the consumer of the data, not the creator of it. Thus the &#8220;best laid plans of mice and men&#8221; statement early on.</p>
<p>So, for the next couple of weeks, I&#8217;ll be trying to collect whatever the engineer is unable to collect. Then I&#8217;ll have to graph it, &#8216;gussy it up&#8217; (with pretty colors), add some configuration recommendations and bring it all with me to the sessions I have planned for the conference.</p>
]]></content:encoded>
			<wfw:commentRss>http://linux.sjolshagen.net/2009/05/09/getting-ready-for-an-industry-conference/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

