Skip to content

Chapter 1: Cloud Foundations

The word "cloud" has been a part of the technology lexicon for decades, evolving from a simple network diagram symbol to a term representing a paradigm shift in computing.

  • Early Vision (1994): The term first appeared publicly in a 1994 Wired magazine article about the company General Magic. The article envisioned a future where information and services were accessible from any device via "the entire Cloud out there."

  • Mainstream Introduction (2006): It gained widespread recognition when Google's then-CEO, Eric Schmidt, introduced the term "cloud computing" at a conference. He described a model where data and services reside in a "cloud," accessible from anywhere.

  • Evolution: Initially, the cloud was an engineering abstraction representing complex networks (like the internet) in system diagrams. It was later adopted by marketing to describe a vast range of internet-based solutions.

Defining Cloud Computing

To distinguish true cloud services from traditional IT hosting, several organizations developed formal definitions. The most widely accepted and comprehensive definition comes from the U.S. National Institute of Standards and Technology (NIST).

Gartner's Definition

Gartner, a leading research and advisory company, provided an early and influential definition:

INFO

"Cloud computing is a style of computing in which scalable and elastic IT-enabled capabilities are delivered as a service using Internet technologies."

This definition emphasizes two critical concepts:

  • Scalability: The ability to easily increase resources to meet growing demand.
  • Elasticity: The ability to both increase and decrease resources as needed.

The NIST Definition

The NIST definition is considered the de facto industry standard. It provides a precise framework by outlining five essential characteristics, three service models, and four deployment models. This section focuses on the five characteristics that define what makes a service a "cloud" service.

The Five Essential Characteristics (NIST)

These five traits were initially established to create a clear distinction between authentic cloud solutions and rebranded traditional hosting. They highlight the core benefits of the cloud: cost savings, rapid deployment, and user empowerment.

1. On-Demand Self-Service

INFO

A consumer can unilaterally provision computing capabilities, such as server time and network storage, as needed automatically without requiring human interaction with each service’s provider.

This characteristic means users can access and deploy IT resources through a web-based control panel or API without needing to contact a salesperson or support agent.

  • Analogy: Think of it as a vending machine for IT resources. A DevOps team can instantly provision a new server for a project, just like getting a snack from a machine, bypassing the slow, traditional procurement process of ordering and installing physical hardware.
  • Modern Context: While this remains a core principle, some advanced services from major cloud vendors may require a support request for activation and can take hours or days to become available, showing a slight deviation from the ideal of instant self-service.

2. Broad Network Access

INFO

Capabilities are available over the network and accessed through standard mechanisms that promote use by heterogeneous thin or thick client platforms (e.g., mobile phones, tablets, laptops, and workstations).

Cloud services are designed to be accessible from anywhere with an internet connection, using a wide variety of devices. This breaks down geographical barriers and enables remote work and collaboration.

  • Modern Context: While public internet access is the primary method, cloud providers also offer private, dedicated fiber connections for enhanced security and performance. Some providers even offer "disconnected cloud" solutions—hardware that runs cloud services entirely within a customer's data center. These scenarios, while valuable, do not strictly adhere to the "broad network access" principle via the public internet.

3. Resource Pooling

INFO

The provider’s computing resources are pooled to serve multiple consumers using a multi-tenant model, with different physical and virtual resources dynamically assigned and reassigned according to consumer demand.

In a multi-tenant model, multiple customers share the same physical infrastructure (servers, storage, networking) securely and efficiently. This pooling allows providers to achieve massive economies of scale and pass the savings on to customers.

  • Analogy: This is similar to living in an apartment building. Residents share large-scale utilities like water and power, making it more efficient and cost-effective than if each resident had to build their own infrastructure.
  • Modern Context: Cloud providers also offer dedicated or "bare metal" servers. These are single-tenant physical machines that are not shared with other customers, which technically violates the multi-tenancy aspect of resource pooling. However, because they can be provisioned on-demand, they are still considered a cloud service.

4. Rapid Elasticity

INFO

Capabilities can be elastically provisioned and released, in some cases automatically, to scale rapidly outward and inward commensurate with demand. To the consumer, the capabilities available for provisioning often appear to be unlimited.

This gives consumers the ability to dynamically adjust their resource consumption based on real-time needs. For example, an e-commerce site can automatically scale up its server capacity to handle a holiday traffic surge and then scale back down when demand subsides.

  • Analogy: This functions like an elastic band that can stretch to accommodate a large object and shrink back to its original size.
  • Modern Context: This feature is a powerful benefit of the cloud, but it is not always automatic. Customers often need to configure auto-scaling rules and architect their applications to take full advantage of elasticity.

5. Measured Service

INFO

Cloud systems automatically control and optimize resource use by leveraging a metering capability at some level of abstraction appropriate to the type of service (e.g., storage, processing, bandwidth, and active user accounts).

Resource usage is monitored, controlled, and reported, providing transparency for both the provider and the consumer. This typically enables a pay-as-you-go pricing model, where you only pay for the resources you consume, much like a utility bill for electricity or water.

  • Modern Context: Metering is fundamental for billing, monitoring, and capacity planning. However, the units of measurement can be complex (e.g., time, storage, I/O operations, function executions).

WARNING

Transparency Concerns

While the NIST definition envisioned a transparent system, vendor billing can sometimes be opaque, making it difficult for customers to independently verify the accuracy of their charges.

From Strict Rules to Common Guidelines

A decade ago, these five characteristics were strict requirements for a service to be considered "cloud." Today, the lines have blurred. They are now better understood as common guidelines or defining principles rather than a rigid checklist.

The industry has evolved to include hybrid solutions that deliver cloud benefits without perfectly conforming to every rule. For example:

  • Dedicated instances of bare metal servers offering a physical machine violate resource pooling but offer the security of a single-tenant environment.
  • Private fiber connections bypass the public internet but provide essential security and performance for enterprise customers.
  • Transparency in measured service is not always fully achieved by vendors.
  • Rapid elasticity often requires significant configuration effort from the customer.

Ultimately, while the NIST characteristics remain the foundational vocabulary for understanding cloud computing, the modern cloud ecosystem is diverse and flexible, adapting its models to meet a wide array of customer needs.

Made with ❤️ for students, by a fellow learner.