TIL

October 3, 2016
10 minutes
2119 words

Transport Layer Security

Maybe you’ve noticed it—that green padlock in your browser’s address bar.

HTTPS

Users of online banking, e-commerce, and private chat take comfort in HTTPS’s promise of a secure connection. Without it, few would likely risk sharing their credit card numbers and sensitive messages.

But what does it guarantee, exactly? What can it guarantee? Are users vulnerable, in spite of using it? What is it?

Before we get to these questions, we must address the purpose of security.

Why do we need security?

In the most simple terms, information security refers to the protection of information from malicious use. Wikipedia elaborates:

Information security, sometimes shortened to InfoSec, is the practice of defending information from unauthorized access, use, disclosure, disruption, modification, inspection, recording or destruction.[1]

Protecting against malicious intent is a broad objective, so we will choose to focus on three essential features:

  • Confidentiality refers to the ability to send information without an eavesdropper intercepting and comprehending it. Without confidentiality, Eve the eavesdropper could intercept to your latest Amazon purchase, extract your credit card number, and use it with neither your knowledge nor consent.

  • Authentication refers to the ability to guarantee the identity of an actor. Without authentication, Eve could impersonate you, placing an order with Amazon on your behalf with neither your knowledge nor consent.

  • Integrity refers to the ability to guarantee the content of a message is not altered between the moment it is sent and the moment it is received by the intended target. Without integrity, Eve could alter your most recent Amazon order, changing the shipping address with neither your knowledge nor consent.

The triple of confidentiality, authentication, and integrity guarantees that a message from an authentic sender will remain intact as it is privately transmitted to the proper destination.

Because we are interested in the open Internet, we must assume that an eavesdropper will be able to intercept messages; therefore, it’s reasonable to substitute encryption—the obfuscation of information from all but the intended parties—for confidentiality, as they are one in the same in this context.

It is worth noting, then, that this concept of security is not necessarily a complete one! For instance, a system that guarantees confidentiality, authentication, and integrity may still allow an eavesdropper to know that some communication occurred between two identifiable parties. That is, if you need to disguise the fact that you contacted Amazon, our security concept will not be able to help.

As High Performance Browser Networking (HPBN) states,

When SSL is used correctly, a third-party observer can only infer the connection endpoints, type of encryption, as well as the frequency and an approximate amount of data sent, but cannot read or modify any of the actual data.[2]

With that security concept in mind, we can address the next most pressing issue: what is protecting us from this cruel and elaborate world?

Transport Layer Security!

Transport Layer Security—also known as TLS or SSL—is the cryptographic protocol that provides confidentiality, authenticity, and integrity for Internet communication. Whereas HTTP (Hypertext Transfer Protocol) establishes the procedure for sharing hypertext documents across the Internet, TSL (or SSL) establishes the procedure for securely communicating over a computer network, e.g. the Internet.

TLS communication occurs by transmission of records using symmetric encryption[6], which relies on a shared secret key, known exclusively by the two communicating parties (henceforth the client and server, for our purposes). How, then, do the client and server agree on a shared secret key? In TLS, it’s called the handshake, and it takes a dash of mathematical magic.

Interlude: Public-key cryptography

If you need to share a message securely, but you first need to share a key securely, how could you possibly communicate the key securely? If you had a way to send the key, why wouldn’t you just send the message? Public-key cryptography provides the seemingly-miraculous transfer mechanism. While we won’t be able to explore all the details of public-key cryptography in this post, here are the (very) short answers to our two questions:

  1. Asymmetric key pairs allow publishing of encrypted information that only a single individual can decrypt.
  2. You could! But swapping symmetric keys will make for a more efficient long-term communication session.

As far as the mechanics of public-key cryptography goes, you should know that it relies on a key pair—one public, one private—to communicate securely. Public keys, which are used to encrypt information, must be published openly; conversely, private keys, which are used to decrypt encryptions, must be kept secret. Think of public keys as pad locks and private keys as the combination to unlock them. Anyone can lock a message with your pad lock, but only you can open it!

For a deeper dive on public key concepts and techniques, I highly recommend Stanford’s Cryptography I course on Coursera, as well as Art of the Problem’s Public Key Cryptography: RSA Encryption Algorithm video lesson.

The Handshake

Prior to the TLS handshake, the client and server establish a TCP connection with a TCP handshake (SYN—SYN-ACK—ACK). Then, the TLS handshake begins with a simple hello over the fresh TCP connection:

TLS Handshake Image credit to HPBN [2]

  1. Client Hello (Client to Server) specifies the TLS version and the list of cipher suites the client supports, ordered by preference (generally, strongest first). There have been TLS attacks targeting the list of cipher suites, attempting to downgrade to breakable ciphers [8]

  2. Server Hello (Server to Client) specifies a cipher suite from the client’s list, defines a session ID, and provides the server’s SSL certificate. We will cover SSL certificates in the next section! For now, just imagine a fancy piece of paper with a gold star and a big, random-looking byte string.

  3. Client-side Authentication (Client) verifies the SSL certificate provided by the server. The certificate assures the client that the server’s identity is authentic, which can be guaranteed by the Certificate Authority (CA). This step protects against establishing a shared key with a malicious server acting as a man-in-the-middle, pretending to be the intended server. The SSL certificate also includes the server’s public key.

  4. Secret exchange (Client to Server) requires the client to use the server’s public key to encrypt a random secret, Kpm, sometimes known as the pre-master key [11]. We’ll refer to the encryption of Kpm under the server’s public key as Es(Kpm). The client sends Es(Kpm) to the server, which decrypts the secret with its private key. Recall that decrypting Es(Kpm) requires the server’s private key, which only the server knows!

  5. Shared key generation (Client and Server) is made possible because the client and server share exclusive knowledge of Kpm. Independently, the client and server each compute a shared symmetric key, Ks, using Kpm.

After step 3, the server might request a client SSL certificate, which operates the same way that the server’s SSL certificate does, allowing for reciprocal identity validation. IBM’s overview of the TLS protocol steps through that process.

Please note that this is an extremely brief and incomplete overview of a public-key cryptography concept that boasts many fascinating nuances beyond the scope of this post.

The Tunnel

At this point, the client and server share a few things:

  • an open TCP connection
  • a secret key, Ks
  • an agreed-upon cipher suite

Using the TLS Record protocol, client and server can communicate securely! The sender uses the cipher suite and Ks to encrypt and sign packets (e.g. using AES block cipher to encrypt and HMAC to sign), then uses the TCP connection to delivers the packets. The receiver decrypts and verifies the packets using the Ks. Receiver may then become sender, and vice-a-versa, completing a round-trip flow of confidential and authentic information with guaranteed integrity.

The aforementioned TLS Record protocol defines the organization of the following information within each packet:

  • Content type (e.g. handshake, alert, data)
  • Payload, which is encrypted
  • MAC, or signature

TLS Record Image credit to HPBN [2]

TLS/SSL in Practice

Now that the general process of TLS is clear, we can discuss how to go about using it in practice from both the client perspective and the server perspective.

Client-side: you, the user

As complicated a topic as TLS can be, this one is simple. In fact, as a user, there really isn’t much for you to do besides making sure that the services you’re using are serving their content over HTTPS. If you’re trying to access a website over HTTPS, but the connection is insecure, a good web browser will stop you right there. Google Chrome, Firefox, and Safari all do just that.

However, that’s not quite the end of the story. There have been tragic instances of spoofing security certificates, which undermines the entire protocol. Furthermore, some folks have been fooled into securely connecting to servers that look like banks, government services, etc., but are not! Rather, they obtain their own SSL certificates and hope that you don’t notice who owns the certificate. Oh, what an unforgiving world.

To check who owns the certificate, click that little green lock and find out! You can also go to DigiCert’s help page, which provides a neat tool. Try typing in “nikovacevic.io” and see what happens.

Check your SSL certificates

Server-side: you, the engineer

Luckily for you, the engineer, the process is fairly simple as well. You need to obtain a certificate and serve it, upon a client’s request of an SSL connection.

We’ll avoid the how-to aspect of actually serving the certificate, which your server (e.g. nginx) should handle, but it’s common to include such information in a server’s config file. Mine, for instance, looks like this:

## SSL Certificates
ssl_certificate     /etc/letsencrypt/live/nikovacevic.io/fullchain.pem;
ssl_certificate_key /etc/letsencrypt/live/nikovacevic.io/privkey.pem;

You’ll notice the format here looks familiar: one public and one private. So, how do you, the engineer, obtain one?

SSL Certificates

Depending on your needs, you may need one of a few types of certificates:

  • Domain validation (DV) guarantees that the server has the right to act on behalf of the given domain. This is the minimum necessary guarantee.
  • Organization validation (OV) guarantees domain rights and that the organization exists.
  • Extended validation (EV) guarantees domain rights, organization existence, and requires the owner to “persuade the certificate provider of its [the owner’s] legal identity, including manual verification checks by a human” [13]

There are many Certificate Authorities from which to obtain an SSL cert; most of them will do a fine job. For my own purposes with this domain (DV), I have chosen Let’s Encrypt because, as they say, it’s free, automated, and open!

Let's Encrypt

Let’s Encrypt makes the process so simple that there is simply no excuse to serve or consume content over regular, old HTTP any more.

You might be wondering

Why is “SSL” an abbreviation for Transport Layer Security?

TLS has a predecessor, Secure Sockets Layer. The original SSL protocols were developed by Netscape in the mid-1990s. Although TLS 1.0 was released in 1999, effectively spelling the end for the original SSL protocols, SSL 3.0 was not deprecated until 2015.

Although SSL 1.0, 2.0, and 3.0 are all deprecated now, the initialization is not. We use “SSL” and “TLS” interchangeably, except in specific cases where the distinction is useful.

Does TLS only protect HTTPS communication?

It does not! It was defined to abstractly protect an application-layer transport protocol. TCP, which is the Internet’s transport protocol, is only one such example. HPBN relates another:

TLS was designed to operate on top of a reliable transport protocol such as TCP. However, it has also been adapted to run over datagram protocols such as UDP. The Datagram Transport Layer Security (DTLS) protocol, defined in RFC 6347, is based on the TLS protocol and is able to provide similar security guarantees while preserving the datagram delivery model.[2]

  1. Wikipedia: Information security

  2. High Performance Browser Networking: Transport Layer Security from O’Reilly provides a good overview of the TLS protocol, including some really instructive graphics.

  3. Wikipedia: Transport Layer Security

  4. The Transport Layer Security (TLS) Protocol: Version 1.2 is the protocol, itself!

  5. Let’s Encrypt is a wonderful resource for obtaining free SSL certificates.

  6. Wikipedia: Symmetric-key Algorithm

  7. Wikipedia: Transmission Control Protocol

  8. IBM: An overview of the SSL or TLS handshake

  9. OpenSSL TLS Export Cipher Suite Downgrade (CVE-2015-0204; CVE-2015-1637)

  10. IBM: How certificate chains work

  11. Wikipedia: Man-in-the-middle attack

  12. SSL.com: The SSL/TLS Handshake: an Overview

  13. IBM: Secure Sockets Layer (SSL) and Transport Layer Security (TLS) concepts and the related articles (including [7]) give a technically thorough overview of TLS protocols.

  14. Wikipedia: Public key certificate

  15. Coursera: Cryptography I covers a wide array of basic cryptography topics, one of which is public-key cryptography. If you’re interested in learning about how and why certain techniques can guarantee security, from concept to proof, this is a good place to start.

  16. Art of the Problem: Public Key Cryptography: RSA Encryption Algorithm does a fantastic job of covering a mathematically intense topic in a short time span, and with relative thoroughness.