Blog entries

Interview with Cloudscaling CEO on Cloud in the Mid-market

A recent interview I did with Alex Bewley of Uptime Software is finally available. Although the podcast is nominally about cloud computing for mid-tier enterprises, we actually cover much broader ground. Alex’s blog posting lists the core topics as:

  • what kinds of businesses are using cloud
  • how you should go about evaluating it
  • how to avoid being outsourced as an IT department
  • what are the barriers to adoption; monitoring in the cloud (near and dear to our hearts)
  • designing applications for failure awareness
  • where he thinks the cloud is going

More important, for me personally, is that I think this is one of my better podcasts. The audio is clear, my responses, while long, are reasonably crisp, and you can tell that the general thinking around here has evolved a lot.  Some key messages come through loud and clear, which I think aren’t well understood still:

  1. Cloud computing isn’t about virtualization
  2. This is disruptive sea change, be the disrupter, not the disrupted
  3. Whole new areas of opportunity, applications, etc. are opening up that didn’t exist before

I really think it’s worth a listen.  It’s a little less than 20 minutes and moves pretty quickly.  Please enjoy and a big thanks to Alex who did a great job with the interview. Head over to the original blog post to listen to the podcast with Flash in your browser or you can download the MP3 directly if you are using a non-flash capable system.

Post to Twitter

Getting Velocity – Economy of Motion

Last time, I implied that scale alone doesn’t always lead to the operational efficiency in the datacenter

There is no class someone can take that will teach them everything they need to know to run a datacenter, and the applications it ostensibly exists for, efficiently. Training exists for some of the tools, for some of the hardware, but gaps are still left to fill in with smart people, hopefully working together. (and sometimes getting the people aligned IS the hard part)

If there was a training covering every technical aspect of this, it would be expensive, take months, probably years, and each class would not be up to date by the time it was over.

Two weeks from today, you can get a glimpse of what different parts of that training might look like.

Velocity is the preeminent cloud operations conference. (Let’s face it, a cloud is a big web app. If you can’t do web ops efficiently at scale, keeping a cloud running effectively will be a chance for you to learn these lessons the hard way. These lessons will probably cost you on both sides, as you throw bodies at the problem while your customers leave after every outage.)

This is ‘the’ ops conference for building and managing cloud services at and on every level of whatever ‘cloud’ means to you.

I’ve been to every Velocity so far. Starting at the first Velocity, there were two recurring themes, first, automating dynamic infrastructures, and second, developers and operations working together as paramount and a differentiator. The change is only accelerating, no pun intended.

I’m leaving out half of the story because Velocity also gathers the planet’s expertise focused on client side performance, which is provably important. It’s not my focus at the moment, but we need each other to deliver the highest value.

Many lessons are best learned by doing. Come learn from and connect with the people who have experience building some of the most impressive infrastructures and ops teams on the planet (and the scars to prove it…). You’re at even greater disadvantage if you don’t, because we’ll be learning and sharing with each other.

RegNow_336x280

Cloudscaling will be there.

Register now for 20% off with this code: vel10s2d

Post to Twitter

More Economies of Scale: Efficiency, Head Count and TCO

James Hamilton’s presentation at Mix 10 illuminated cloud computing economics that few others have direct experience with, but I also believe that this presentation raises interesting questions that didn’t get addressed. (If you haven’t seen James Hamilton’s Mix10 presentation, go watch it now. You should probably also go through Randy’s follow up, and then watch James talk again… I’ve watched it 3 times now.)

This is the first post that will refer to aspects of James’ talk (and I plan at least one more about business models) and in case I haven’t stressed this enough, if you have any interest in understanding the economics of cloud computing, take the time to watch one of the best in this business.

Central to James’ presentation is the breaking down the total cost of ownership of computational infrastructure. His breakdown is based on his own data running web scale services and he provides us with a great analysis on the inevitability and sustainability of cloud computing business models. One of the key points James makes is that the only variable cost in the chart is the cost of power.

Cost Breakdown from James' Hamilton

Cost Breakdown from James' Hamilton

The thing I want to focus on here is the missing cost of personnel. James touches on this at different points discussing administration and automation. He gives out the number ‘as low as 3% for services’, so I’m assuming he is burying this as a negligible cost. I would argue that this cost is actually highly variable, and, while correlated with scale, is also a function of the types of services an organization provides and how those services relate to the core business. Additionally, automation investments can be scaled down effectively, but that’s what I’ve been working on for a couple years so this is likely a reflection of my bias.

Based on James’ biases (which he is straightforward about), that the cost of personnel can be driven down to almost nothing is essentially taken for granted. I contend, based on personal experience and observation, this is still a significant operational cost for many organizations. I would take this a step farther and posit that the level of efficiency James refers to only comes from the crucible of running web services at scale with considerable economic pressure for little or no downtime. Furthermore, this level of efficiency and cost reduction will never materialize in organizations who view IT as a cost center. Efficiency doesn’t just come for free, at scale or otherwise.

To keep the numbers easy, let’s assume an admin is paid $100,000 per year. Then neglecting the aspects of networking and storage, that admin can manage some number of machines. If that number is 100, then managing each machine costs $1000 per year or ~$83 per month. If that number is 1000 machines, then each machine costs $8.33. If those are $3000 servers and the servers are roughly 54%, then if I’m understand correctly the $8.33 is around 5% of the monthly cost when amortized over 3 years. James gave us price or efficiency ratios for storage, networking and admins. For a large service he listed ‘over 1000 servers/admin’. He did not give us a ratio or a price point per server, but in order to get down to ~3%, the admins need to manage significantly more than 1000, the server cost is significantly higher than $3000, or the admins get paid significantly less than $100,000. (this also assumes salary is the only cost and nothing is paid for any management tools…)

What do you pay your admins? What do you pay for servers? What is the ratio between them?

Which one do you have the most control over?

(hint: the way to optimize the ratio is not to hire less admins, unless your customers like down time…)

Watching the evolution of the cloud computing landscape, in the rush to bring new services to market or transition away from apparently disrupted business models, I believe many organizations may unnecessarily learn this lesson the hard way. The proper care and feeding of the infrastructure better be a core competency for those who intend to compete ‘as a Service’ at any level. The operational differentiators have as much to do with process and culture as they do with technology, but doing them well could be the difference between business success and failure. I believe this is difficult to retrofit, especially at scale.

So what should you do? Start by trying to understand what your costs look like today, and I’ll follow up with perspectives and resources that might help with operational efficiency, at any scale. Operations can be a competitive advantage, but only for those organizations who have made the investments in both the people and the infrastructure.

Post to Twitter

Lew Tucker, former Sun Cloud CTO, now Cloudscaling advisor

San Francisco, CA – May 11, 2010 – Cloudscaling today announced that Lew Tucker, former CTO of Sun’s Cloud Computing business unit, has joined Cloudscaling’s advisory board.  Cloudscaling CEO, Randy Bias, expanded on what this means to the company, “If you look at Lew’s history, you will see that he is a true visionary and always at the forefront of the next technology trend. His experiences at Salesforce.com, Sun Microsystems, and Thinking Machines, fit right alongside the deep expertise in cloud and distributed systems that makes Cloudscaling unique.”

Lew’s background spans more than 20 years during which he has been instrumental in driving several major  technology changes, including: AI and massively parallel systems, developer adoption of Java, Salesforce.com’s AppExchange,  and most recently, Cloud Computing.    According to Lew, “At Thinking Machines, in the early 1990’s, we were building massively parallel machines using thousands of individual processors.   At Sun, we drove the evolution of the web with Java and networking, often using the tagline, ‘The Network is the Computer’.   In this next phase, it’s becoming clear that the ‘Cloud is the Computer’  and this promises to be just as disruptive.”

Cloudscaling CEO, Randy Bias, and Lew Tucker both share a long-term interest in the design and architecture of large, scalable systems.  As former VP Technology Strategy of GoGrid, Randy was responsible for building out one of the most complete infrastructure services in the cloud.  As CTO of Cloud Computing for Sun Microsystems, Lew was responsible for the architecture and API for Sun Cloud.

Lew’s joining the advisory board continues to build up the Cloudscaling team’s unique set of resources.  “If you want to build significant clouds, you have to have the right team.” said Randy Bias.  We’re the only cloud engineering services team I know of that can point to not one, but many, large scale cloud environments they have built.”

About Cloudscaling

Cloudscaling is the leading cloud computing engineering services firm. We provide strategy, design and implementation to build cutting edge clouds. Located in San Francisco, the company was founded by experts who have built some of the largest public and private clouds operating today. Visit cloudscaling.com to read the blog and follow the team on twitter.com/cloudscaling.

Contact:
Pat Sharp
pat@cloudscaling.com
725 Cool Springs Blvd., Ste. 600
Franklin, TN
USA
Ph: +1 (615) 732-6192

###

Post to Twitter

Understanding Cloud Datacenter Economies of Scale

James Hamilton’s recent MIX’10 presentation on economies of scale for large cloud providers was quite impressive. James “gets it” like few others in the industry. If you haven’t watched his hour-long presentation, I suggest you do. I also recommend this excellent response from James Urquhart.  My goal in this posting is to highlight, clarify and expand on a few of James Hamilton’s points.  I will focus on Infrastructure-as-a-Service (IaaS) clouds, but the concepts are relevant for other kinds of cloud services.

In his presentation, James focuses on power: utilization, distribution, etc., and while an important element, like him, I don’t think it’s the most important factor.

I also want to dispel the myth that only the largest companies can achieve these economies of scale. Don’t get me wrong; providing a cloud service is a scale game. It requires a certain amount of buying power to compete. However,you don’t need to be MSFT, YHOO, AMZN, or GOOG to compete effectively. Buying power can be had at levels much lower than you might think.

In this article, I refer regularly to Jame’s comments in his presentation, so I suggest you watch his video first. In order to minimize confusion, I’ve borrowed some pictures from  his slides and inserted them here for your reference. This is a long entry, but it will be worth the read as I’ve got numbers for you which I hope you will find interesting.

Background
Like James, the Cloudscaling team has a history of building large scale services. I’ve worked in this area for 16+ years as has our COO, Adam Waters, and several of our team members. Understanding of the economies of scale, especially for service providers, cloud or otherwise is fundamental to our DNA. For example, see my previous piece describing how oversubscription works.

Enough of that! Let’s dig in and look at where you can achieve economies of scale, identifying areas James Hamilton may have neglected, and clarifying areas where I think there is still confusion.

Read more…

Post to Twitter

Twitter links powered by Tweet This v1.7.1, a WordPress plugin for Twitter.