Currently browsing author

Jian Zhen

The Rise of Cloud Privatization

Current results from Data Survey #1: Data Scientists. Thanks to everyone for helping the world understand Big Data better!

Please take the following 2-minute survey to help us understand your hadoop environment better.

The world of clouds these days is full of definitions and counter-definitions. There are many posts that try to define the concept of cloud computing; many that try to distinguish utility computing, grid computing and cloud computing; many that try to define public vs private clouds; and many that dismisses the notion of private clouds.

Jonh Foley, in his article “The Rise Of Enterprise-Class Cloud Computing“, referred to private cloud as an oxymoron,

That’s an oxymoron since cloud computing, by definition, happens outside of the corporate data center, but it’s the technology that’s important here, not the semantics.

But by whose definition? The industry as a whole haven’t even been able to nail down a concrete definition of cloud computing. Given that there’s no concrete definition, then by definition, private cloud is not an oxymoron. But I do agree with John, let’s focus on the technology and not the semantics.

Geva Perry, chief marketing officer at GigaSpace Technologies, did just that. By focusing on the technology and architectural aspects of cloud computing, he wrote in a GigaOM blog post,

Cloud computing is a broader concept than utility computing and relates to the underlying architecture in which the services are designed. It may be applied equally to utility services and internal corporate data centers, as George Gilder reported in a story for Wired Magazine titled The Information Factories.

Cloud Attributes

But instead of everyone trying to create their own definition of clouds, let’s look at the list of attributes that clouds have and compare public and private clouds.

  • Elasticity: The ability to dynamically provision (expand) or de-provision (shrink) the computing capacity as needed.
  • Utility: The ability to be charged by the amount of resources used. Great examples would include Amazon Web Services’ charge model. In an enterprise setting, sometimes business units are charged by the internal IT groups for the resources they requested. The utility model would allow IT groups to perform chargebacks in a similar model to AWS.
  • Scalability: The ability to either handle growing amounts of work in a graceful manner, or to be readily enlarged. For example, it can refer to the capability of a system to increase total throughput under an increased load when resources (typically hardware) are added. [ via wikipedia ]
  • Reliability & Availability: No failed whale! The cloud infrastructure and platforms must be reliable and available to the applications that are using them. There’s probably a lot of technology involved here to make this happen. For example, the ability to transparently migrate a virtual server when the running node has failed.
  • Manageability: The ability to effectively manage (start/stop/migrate/expand/shrink/etc) the different server and application instances in the cloud.
  • Security: The ability to secure the data and access to the cloud. Public clouds still have a trust issue with many of the enterprise customers, which is why the ? is there.
  • Performance: The ability to execute and complete tasks within the acceptable timeframe (defined by the SLA).
  • API: I consider this to be a desired attribute. This refers to the ability of doing resource management via some type of documented programming interface.
  • Virtualization: Applications are decoupled from the underlying hardware. Multiple applications can run on one computer (virtualization a la VMWare) or multiple computers can be used to run one application (grid computing). [ via GigaOM ]
  • Multi-Tenancy: The ability to house multiple customers using the same infrastructure and still be able to segregate the data.
  • SLA-Driven: The system is dynamically managed by service-level agreements that define policies such as how quickly responses to requests need to be delivered. If the system is experiencing peaks in load, it will create additional instances of the application on more servers in order to comply with the committed service levels — even at the expense of a low-priority application. [ via GigaOM ]
  • Support: The ability to smack someone upside the head when something fails.
Attributes Public Private
Elasticity
Utility
Scalability
Reliability & Availability
Security ?
Performance
API
Virtualization
Multi-Tenant
SLA-Driven ?
24×7 Support

So if we are looking purely from a technology perspective, private clouds can absolutely exist. In fact, given the questions for the public cloud, enterprises are more likely to experiment with private clouds for mission critical applications.

Market and Vendors

According to Merrill Lynch, the public and private cloud infrastructure, platform, applications and advertising together will be a $160 billion market by 2011, or roughly 12% of the total worldwide software market.

The total $160bn addressable market opportunity includes $95billion in
business and productivity apps, and another $65 billion in online advertising.

IBM and Sun have comprehensive solutions for ‘internal Clouds’. Dell targets large scale data centers, and HP provides ‘everything as a service’, making their solutions attractive for ‘external Clouds’.

So who are some of the private cloud infrastructure/platform startups that are taking advantage of this $160 billion market? (Feel free to leave a comment if I missed anyone.)

Company Product
3Tera AppLogic
Arjuna Agility
Cassatt Active Response?
Elastra Elastra Cloud Server
Enomaly Enomalism
GigaSpaces XAP, EDG, and Community Edition

Private Cloud Links

Architectures of extraordinarily large, self-sustaining systems

Current results from Data Survey #1: Data Scientists. Thanks to everyone for helping the world understand Big Data better!

Please take the following 2-minute survey to help us understand your hadoop environment better.

Visit InfoQ for synchronized slides and video.

Here’s the “revised and reimagined transcript of that talk” that Richard P. Gabriel gave on Design as his Wei Lun Public Lecture at the Chinese
University of Hong Kong.

Picture a system so large it cannot be comprehended. Can such a system be “designed” in any conventional sense? Will machines help design it? Will it help design itself? How will it keep running? Will it be alive? The foundations of computing are about to change. In this talk, Richard P. Gabriel explores why and how.

Web 2.0 Business Models

Current results from Data Survey #1: Data Scientists. Thanks to everyone for helping the world understand Big Data better!

Please take the following 2-minute survey to help us understand your hadoop environment better.

[ via Passion for Innovation ]

Web 2.0 Business Models
View SlideShare presentation or Upload your own. (tags: business models)

Adil Mohammed’s “Startups In The Cloud”

Current results from Data Survey #1: Data Scientists. Thanks to everyone for helping the world understand Big Data better!

Please take the following 2-minute survey to help us understand your hadoop environment better.

[via James Governor’s Monkchips]

Adil Mohammed, co-founder of entrip, with his take on why the cloud is perfect for startups. This was a presentation he gave at CloudCamp in London.

Startups In The Cloud
View SlideShare presentation or Upload your own. (tags: startup aws)

The Rise of Cloud Platforms and Why the OS Doesn’t Matter

Current results from Data Survey #1: Data Scientists. Thanks to everyone for helping the world understand Big Data better!

Please take the following 2-minute survey to help us understand your hadoop environment better.

Platform-as-a-Service (PaaS) is one of the buzzwords that’s mentioned often in the cloud computing space. I’ve written a blog post describing IaaS, PaaS and SaaS. In short, PaaS is a platform for delivering applications, similar to a pre-built system with hardware, OS and application stack all built in. In the PaaS case, this system is hosted. All you have to do is “upload” the application code and it should take care of the executing and scaling of it.

A quick survey of the land (by no means comprehensive, I am also including ONLY application platforms, not service-specific platforms such as DabbleDB) shows that there’s a plethora of PaaS players out there, each with their own target audience. Some provide more of a raw execution platform, some provide a full suite of tools for creating applications online. Unfortunately, most of these vendor approaches will lock you into their proprietary platform. If you ever want to move to another platform, you have to rewrite at least a portion of code using the new vendor’s API. Phil Wainewright has written about this in his blog post “A plethora of PaaS options.”

Company Application Type
Bungee Labs Web applications
Coghead Web applications
Google App Engine Python web applications
LongJump Business applications
NetSuite NS-BOS Business applications
Ning Social networking applications
Joyent Web applications
Mosso Web applications
Rollbase Business applications
Salesforce Force.com Business applications

In one of the CloudCamp SF sessions in July, one of the guys from Microsoft asked whether the OS matters in cloud computing. My answer to that was it depends on the type of application. If it’s a web centric application that has a web front end, uses a database for storage, and doesn’t use any of the low level file IO, then really there’s no need to know what the OS is. In that case, the OS doesn’t matter.

All these vendors have targeted applications that are delivered over the web, and almost all of the vendors listed above try to abstract the OS from the developers so that they don’t have to worry about the underlying infrastructure. As Mosso’s slogan claims, “Code, load and go.”

Even though cloud computing is still in its infancy; however, as it matures, cloud providers will move upmarket to provide additional business value to customers. We will see a rise of cloud application platforms appear on the horizon. Specifically, we will see more domain-specific cloud platforms for different verticals or application types. For example, I can imagine there are developers working on a MMORPG cloud platform (maybe it’s here already if you consider Metaplace to be that) that will provide execution and management (of virtual goods, zones, accounts) for MMO developers; or a data analytics cloud platform that provides all the basic OLAP functions.

Will BGP and DNS Exploits Affect the Future of Cloud Computing?

Current results from Data Survey #1: Data Scientists. Thanks to everyone for helping the world understand Big Data better!

Please take the following 2-minute survey to help us understand your hadoop environment better.

[poll id="5"]

Recently we seem to be hearing more and more security exploits aimed at core Internet protocols. In July, Dan Kaminsky revealed a critical exploit aimed at the DNS protocol.

A couple of days ago “[t]wo security researchers have demonstrated a new technique to stealthily intercept internet traffic on a scale previously presumed to be unavailable to anyone outside of intelligence agencies like the National Security Agency.” See Revealed: The Internet’s Biggest Security Hole | Threat Level from Wired.com for more detailed reporting.

According to Wired.com,

The tactic exploits the internet routing protocol BGP (Border Gateway Protocol) to let an attacker surreptitiously monitor unencrypted internet traffic anywhere in the world, and even modify it before it reaches its destination.”

. . .

Anyone with a BGP router (ISPs, large corporations or anyone with space at a carrier hotel) could intercept data headed to a target IP address or group of addresses. The attack intercepts only traffic headed to target addresses, not from them, and it can’t always vacuum in traffic within a network — say, from one AT&T customer to another.

The clever trip the researchers have done is to

use a method called AS path prepending that causes a select number of BGP routers to reject their deceptive advertisement. They then use these ASes to forward the stolen data to its rightful recipients.

All these core protocol exploits have direct impact to cloud computing as the nature of cloud computing is that computing will happen out there on the Internet somewhere. According to the article,

The method conceivably could be used for corporate espionage, nation-state spying or even by intelligence agencies looking to mine internet data without needing the cooperation of ISPs.