Atom feed of this document
  
 

 Chapter 21. Object Storage

OpenStack Object Storage (Swift) is a service that provides storage and retrieval of data over HTTP. Objects (blobs of data) are stored in an organizational hierarchy that offers anonymous read-only access or ACL defined access based on the authentication mechanism.

A consumer can store objects, modify them, or access them using the HTTP protocol and REST APIs. Backend components of Object Storage use different protocols for keeping the information synchronized in a redundant cluster of services. For more details on the API and the backend components see the OpenStack Storage documentation.

For this document the components will be grouped into the following primary groups:

  1. Proxy services

  2. Auth services

  3. Storage services

    • Account service

    • Container service

    • Object service

 

Figure 21.1. An example diagram from the OpenStack Object Storage Administration Guide (2013)


[Note]Note

An Object Storage environment does not have to necessarily be on the Internet and could also be a private cloud with the "Public Switch" being part of the organization's internal network infrastructure.

 First thing to secure – the network

The first aspect of a secure architecture design for Object Storage is in the networking component. The Storage service nodes use rsync between each other for copying data to provide replication and high availability. In addition, the proxy service communicates with the Storage service when relaying data back and forth between the end-point client and the cloud environment.

[Caution]Caution

None of these use any type of encryption or authentication at this layer/tier.

This is why you see a "Private Switch" or private network ([V]LAN) in architecture diagrams. This data domain should be separate from other OpenStack data networks as well. For further discussion on security domains please see Chapter 4, Security Boundaries and Threats.

[Tip]Tip

Rule: Use a private (V)LAN network segment for your Storage services in the data domain.

This necessitates that the Proxy service nodes have dual interfaces (physical or virtual):

  1. One as a "public" interface for consumers to reach

  2. Another as a "private" interface with access to the storage nodes

The following figure demonstrates one possible network architecture.

 

Figure 21.2. Object storage network architecture with a management node (OSAM)


 Securing services – general

 Service runas user

It is recommended that you configure each service to run under a non-root (UID 0) service account. One recommendation is the username "swift" with primary group "swift."

 File permissions

/etc/swift contains information about the ring topology and environment configuration. The following permissions are recommended:

                #chown -R root:swift /etc/swift/*
                #find /etc/swift/ -type f -exec chmod 640 {} \;
                #find /etc/swift/ -type d -exec chmod 750 {} \;
            

This restricts only root to be able to modify configuration files while allowing the services to read them via their group membership in "swift."

 Securing storage services

The following are the default listening ports for the various storage services:

Service Name Port Type
Account service 6002 TCP
Container service 6001 TCP
Object service 6000 TCP
Rsync 873 TCP

Authentication does not happen at this level in Object Storage. If someone was able to connect to a Storage service node on one of these ports they could access or modify data without authentication. In order to secure against this issue you should follow the recommendations given previously about using a private storage network.

 Object storage "account" terminology

An Object Storage "Account" is not a user account or credential. The following explains the relations:

OpenStack Object Storage Account Collection of containers; not user accounts or authentication. Which users are associated with the account and how they may access it depends on the authentication system used. See authentication systems later. Referred to in this document as OSSAccount.
OpenStack Object Storage Containers Collection of objects. Metadata on the container is available for ACLs. The meaning of ACLs is dependent on the authentication system used.
OpenStack Object Storage Objects The actual data objects. ACLs at the object level are also possible with metadata. It is dependent on the authentication system used.
[Tip]Tip

Another way of thinking about the above would be: A single shelf (Account) holds zero or more -> buckets (Containers) which each hold zero or more -> objects. A garage (Object Storage cloud environment) may have multiple shelves (Accounts) with each shelf belonging to zero or more users.

At each level you may have ACLs that dictate who has what type of access. ACLs are interpreted based on what authentication system is in use. The two most common types of authentication providers used are Keystone and SWAuth. Custom authentication providers are also possible. Please see the Object Storage Authentication section for more information.

 Securing proxy services

A Proxy service node should have at least two interfaces (physical or virtual): one public and one private. The public interface may be protected via firewalls or service binding. The public facing service is an HTTP web server that processes end-point client requests, authenticates them, and performs the appropriate action. The private interface does not require any listening services but is instead used to establish outgoing connections to storage service nodes on the private storage network.

 Use SSL/TLS

The built-in or included web server that comes with Swift supports SSL, but it does not support transmission of the entire SSL certificate chain. This causes issues when you use a third party trusted and signed certificate, such as Verisign, for your cloud. The current work around is to not use the built-in web server but an alternative web server instead that supports sending both the public server certificate as well as the CA signing authorities intermediate certificate(s). This allows for end-point clients that have the CA root certificate in their trust store to be able to successfully validate your cloud environment's SSL certificate and chain. An example of how to do this with mod_wsgi and Apache is given below. Also consult the Apache Deployment Guide

# apt-get install libapache2-mod-wsgi

Modify file /etc/apache2/envvars with

export APACHE_RUN_USER=swift
export APACHE_RUN_GROUP=swift

An alternative is to modify your Apache conf file with

User swift
Group swift

Create a swift directory in your Apache document root:

# mkdir /var/www/swift/

Create the file $YOUR_APACHE_DOC_ROOT/swift/proxy-server.wsgi:

from swift.common.wsgi import init_request_processor application, conf, logger, log_name = \
            init_request_processor('/etc/swift/proxy-server.conf','proxy-server')

 HTTP listening port

You should run your Proxy service web server as a non-root (no UID 0) user such as "swift" mentioned before. The use of a port greater than 1024 is required to make this easy and avoid running any part of the web container as root. Doing so is not a burden as end-point clients are not typically going to type in the URL manually into a web browser to browse around in the object storage. Additionally, for clients using the HTTP REST API and performing authentication they will normally automatically grab the full REST API URL they are to use as provided by the authentication response. OpenStack’s REST API allows for a client to authenticate to one URL and then be told to use a completely different URL for the actual service. Example: Client authenticates to https://identity.cloud.example.org:55443/v1/auth and gets a response with their authentication key and Storage URL (the URL of the proxy nodes or load balancer) of https://swift.cloud.example.org:44443/v1/AUTH_8980.

The method for configuring your web server to start and run as a non-root user varies by web server and OS.

 Load balancer

If the option of using Apache is not feasible or for performance you wish to offload your SSL work you may employ a dedicated network device load balancer. This is also the common way to provide redundancy and load balancing when using multiple proxy nodes.

If you choose to offload your SSL ensure that the network link between the load balancer and your proxy nodes is on a private (V)LAN segment such that other nodes on the network (possibly compromised) cannot wiretap (sniff) the unencrypted traffic. If such a breach were to occur the attacker could gain access to end-point client or cloud administrator credentials and access the cloud data.

The authentication service you use, such as Keystone or SWAuth, will determine how you configure a different URL in the responses to end-clients so they use your load balancer instead of an individual Proxy service node.

 Object storage authentication

Object Storage uses wsgi to provide a middleware for authentication of end-point clients. The authentication provider defines what roles and user types exist. Some use traditional username and password credentials while others may leverage API key tokens or even client-side x.509 SSL certificates. Custom providers can be integrated in using the wsgi model.

 Keystone

Keystone is the commonly used Identity provider in OpenStack. It may also be used for authentication in Object Storage. Coverage of securing Keystone is already provided in Chapter 18, Identity.

 SWAuth

SWAuth is another alternative to Keystone. In contrast to Keystone it stores the user accounts, credentials, and metadata in object storage itself. More information can be found on the SWAuth website at http://gholt.github.io/swauth/.

 Other notable items

In /etc/swift/swift.conf on every service node there is a "swift_hash_path_suffix" setting. This is provided to reduce the chance of hash collisions for objects being stored and avert one user overwriting the data of another user.

This value should be initially set with a cryptographically secure random number generator and consistent across all service nodes. Ensure that it is protected with proper ACLs and that you have a backup copy to avoid data loss.

Questions? Discuss on ask.openstack.org
Found an error? Report a bug against this page

loading table of contents...