SatMagazine

Home >> February 2010 Edition >> Tech Channel — What Is Network Latency — Why Does It Matter?

Tech Channel — What Is Network Latency — Why Does It Matter?

By O3b Networks

Internet data is packaged and transported in small pieces of data. The flow of these small pieces of data directly affects a users internet experience. When data packets arrive in a smooth and timely manner the user sees a continuous flow of data; if data packets arrive with large and variable delays between packets the users experience is degraded.

What Is Latency?
According to Wikipedia, latency is a time delay between the moment something is initiated, and the moment one of its effects begins or becomes detectable. The word derives from the fact that during the period of latency the effects of an action are latent, meaning “potential” or “not yet observed”. Most people understand that it takes time for web pages to load and for emails to get from your outbox to the destination inbox and yes, this is a form of latency.

In order to understand why this happens, we need to think about latency at a lower level: Latency is a time delay imparted by each element involved in the transmission of data.

Networking 101
It’s important to understand the basic elements of networking to properly grasp the latency issue. Early networking engineers anticipated the need to be able to handle thousands to millions of users on one cohesive network, and thus the TCP/IP networking model was developed.

The key design feature of the TCP/IP networking model is the concept of encapsulation, which is the idea of taking data and wrapping it in a common container for shipping. The container that was developed is called the IP Datagram, also known as an IP Packet.

osi model o3b 0210

The IP Packet is a simple thing: a header, followed by data. The Header contains information used for routing the packet to the destination. The data can be any information that needs to be transported such as a snippet of streaming music or a portion of email traffic. The exact construct of the data portion of an IP Packet is defined by the data protocol that is being carried. Data protocols will be discussed later. To understand exactly where latency occurs, it’s valuable to know how this most basic unit of networking data is built and transported. For this we turn to the OSI Model:

The OSI model was created to describe the process of turning your application data into something that can be transported on the Internet. The upper layers of the OSI model describe things that happen within the applications that are running on the computer. These applications are web browsers, email programs, and so on. The lower layers are where information to and from applications are turned into data for transport on a network. This is where data encapsulation occurs and our basic networking data element — the IP Datagram or “packet” is built.

tcpip stack o3b 0210

This diagram shows the encapsulation process in what’s known as the TCP/IP Stack. The precise workings of the TCP/IP stack can be different between various computer operating systems. These differences may seem trivial as long as the protocols are implemented properly but when seeking the absolute highest levels of performance it’s important to know that the network stack implementation can be a significant cause of networking performance variability.

The transport of network data is a three step process:

Data from a source application is passed down through the stack. During this process the application data is wrapped into IP Datagrams which are commonly called “packets”. Packets are then transmitted by the sending computer in the network.

Packets are passed along the network (purple line) until they reach the destination computer.

Packets are received from the network by the destination computer and are passed up through the stack. During this process the application data is extracted and then passed along to the destination application.

The additional encapsulation at Layer 2 is called framing. This is the stage where the IP Datagram is turned into bits which are appropriate for a particular type of network.

Layer 1 is the physical network medium connection. This layer handles the conversion of the layer 2 bits into electrical, optical, or radio signals that can be transported. The network interface, often called the NIC or Network Interface Card, can be fiber-optic, copper wire, or a wireless radio interface.

What Causes Latency?
As described above there are many logical, electrical, and physical elements involved in computer networking. The OSI model identifies each of these elements with regard to specific functionality and delays, another name for latency, occur at every stage of the process.

Application Layer Latency
Layer 7, 6, 5 are the upper �application layers�. Regardless of the speed of the processor or the efficiency of the software, it takes a finite amount of time to manipulate and present data.

Whether the application is a web page showing the latest news, or a live camera shot showing a traffic jam, there are many ways in which an application can be affected by latency. One common source of application latency is the need to read and write data to a disk. There are also hardware limitations that affect application performance such as the amount of memory.

Serialization Latencies
The encapsulation of data, which occurs at the Transport Layers (1 though 4), is called serialization. Serialization takes a finite amount of time and is calculated as follows:

graphic 1 o3b 0210

For example:

Serialization of a 1500 byte packet used on a 56K modem link will take 214 milliseconds Serialization of the same 1500 byte packet on a 100 MBps LAN will take 120 microseconds

Serialization can represent a significant delay on links that operate a lower transmission rates, but for most links this delay is a tiny fraction of the overall latency when compared to the other contributors.

Data Protocols + Latency

Routing + Switching Latencies

Queuing + Buffer Management

queuing latency

WRED

Weighted Random Early Detection

What Is Propagation Delay?

velocity factor

Transmission Rate + Bandwidth

Transmission Rate

Radio Bandwidth

Higher modem data rates cause the modem to occupy more radio bandwidth

Lower modem data rates will let the modem occupy less radio bandwidth

Data Bandwidth

A 10 MBps copper LAN cannot sustain traffic flowing at a higher rate than 10 megabits every second

A satellite link using modems operating at a 600 MBps rate cannot flow any more than 600 megabits every second

Latency + TCP/IP

connectionless

connection based

User Datagram Protocol

UDP

Connection

Transmission Control Protocol

TCP

Establish the connection

Send the data

Close the collection

Phase 1

Establishing the connection requires 3 packets... the client sends a connect request SYN (synchronize) packet to the server

the server replies with a SYN-ACK (synchronize acknowledge) packet

the client confirms the receipt of the SYN-ACK by sending back an ACK (acknowledge)

Phase 2
Once the link is established, the data transfer can start. During the TCP data exchanges, ACK (acknowledge) and NACK (negative acknowledge) packet types are used to tell the sender that packets have been properly received. If a packet is not received or it contains a bit error, the transmission of the exact same packet is repeated.

Phase 3
Upon completion of the data transport session, the connection will be closed by the following 3 packet exchange...

the closing initiator sends a FIN (finish) packet the other side of the link replies with a ACK

the closing initiator send a coMBination FIN/ACK to end the connection

How does TCP know a link is operating poorly and what can it do about it? To protect the integrity of the data, TCP packets have several features...

Sequence NuMBers

TTimestamps

Flow Control

Congestion Control

Checksums

All of these features are used to guarantee the integrity of the data. They are also used by TCP to determine the quality of the link and to tune the flow of data to maximize the use of the available bandwidth.

pullquote1 o3b 0210

An example of this behavior is the way TCP responds to congestion control. The TCP congestion control process uses timers to examine the data flow and subsequent ACK/NACK responses. When TCP detects that ACK/NACKs are taking longer than normal to respond, TCP assumes that the link is being congested somewhere and will slow down the release of packets using flow control. This vital step helps reduce the impact of congestion at routing and switching buffers as well as receiving computer data processing limitations.

Congestion control is generally a valid response to a lethargic network as slow response does often indicate that a portion of the link has a data bottleneck.

A Fictional Data Download
Let�s examine the details of downloading a digital image which is 4 megabytes in size. For calculation purposes we need to use bits, so our 4 megabytes (4 MB) is actually 32 megabits (32 MB).

Our downlink will use TCP/IP but we cannot simply create a 32 MB packet to transport our image; such a large packet would be a very cuMBersome to deal with. Internet traffic is made up of packets of variable sizes, but the Maximum Transport Unit (MTU) is generally only around 1500 bytes. Packets larger than 1500 bytes are considered �JuMBo� packets, but handling these is not yet commonplace for many parts of the Internet. To ensure our image file makes it, we�ll stick with 1500 byte packets.

Our 1500 byte TCP packet has a header, which is required for transportation but not useful image data. The size of the header can vary in length from 20 bytes to 60 bytes depending on TCP packet options. If we assume that our header is a full 60 bytes in length, this leaves only 1440 bytes for our image file data. Based on the amount of data payload available, the maximum amount of image file data that can be transported is 11520 bits per packet. Even this nuMBer can be a little lower depending on upper layer formatting of the data, but for this exercise we�ll assume that the entire TCP payload is useful image data.

Our total file size divided by this maximum packet size tells us that transporting our entire image file will take 2777.7 packets. The last packet would normally become shortened instead of our full MTU but to keep the math easy we�ll round up to 2778. Our assumptions:

We are downloading the file from a local computer using a 10 Mbps LAN

There is no other traffic on that 10 Mbps LAN � we get the whole pipe

The link is operating perfectly, no need to repeat any data during this transmission

There is no appreciable latency. This is a copper LAN and a relatively short cable run which is not adding any appreciable propagation delay

Notice that even for a link that is operating perfectly, the transfer of our 32 Mb image will actually take 34.6 Mb of data. The overhead of TCP added 7.8 percent to the total amount of data which needs to be transported. This overhead is an absolute worst case since we assumed all packets need to be acknowledged.

Now that we have the number of bits transmitted, we need to calculate how long that should take.

This is simple...

34.6 Mb/10 Mbps = 3.46 seconds

Using our nice clean, short networking link we should be able to transport our image file in just over 3 seconds.

The Real World
We�ve made four assumptions in the above analysis which do not mimic the real world Internet at all.

Assumption 1: The file you want is available on a local computer.

Reality 1: The file is more likely to be some physical distance away. This is not necessarily a problem, but it means we need to traverse a much more complicated network path to get to our data.

Assumption 2: We get the whole 10 Mbps to ourselves.

Reality 2: This is feasible only for the local LAN. It is a certainty that once your little 1500 byte TCP/IP packet reaches the Internet backbone, it will be joined with millions of other packets working their way through the Internet. Your image file packets are going to be mixed in with other traffic such as emails, streaming music files, and so on. You simply don�t get the Internet all to yourself.

Assumption 3: The link is operating perfectly.

Reality 3: Internet traffic is routed through an extremely complex collection of hardware which is scattered all over the Earth. The reality is that sometimes a fiber or copper cable is cut or is mistakenly disconnected. A piece of networking equipment such as a router or a switch can break, leaving some other path to pick up and route the extra traffic. When this happens, Internet traffic can start to fill up queues and bottlenecks occur. As mentioned earlier, queuing delays can become significant when the network is operating through a bottleneck.

Assumption 4: There is no appreciable latency present in our network.

Reality 4: The reality is that all of the earlier discussed sources of latency are genuine factors in real-world networks. The impact of latency starts to become noticeable when the latency is significantly longer than the transmission time for the data.

The previous example discussed the data transmission rate in terms of the number of bits per second. To understand how the user is exposed to the effects of latency, we need to convert transmission rate into its measure of time.

Bit transmission time = 1/(bits per second)

It�s easy to see that slower transmission rates take longer to transport packets of data. If the latency on a network is the same as the bit transmission rate, then the impact is very low since the IP packets can still be streamed very close to each other. If the latency on a network is several times longer than the bit rate, the impact will become much more noticeable because the latency spreads the entire data TCP/IP data exchange session over time.

graphic 4 03b 0210

The following plots were made using a TCP/IP packet capture utility. These plots show the packet bit rate on the y-axis and time of day on the x-axis. The data being transmitted was the un-cached web-page reload of the content from the CNN web page. The only condition changed during was the delay between packets � the transmission rate remained the same.

As you can see, the added network latency and its affect on the flow of TCP data spread the web page load over time.

The 50 ms latency link took 3 sec

The 150 ms latency link took 5 sec

The 300 ms latency link took 11 sec

The 600 ms latency link tool 17 sec

The spreading of network data over time reduces what�s called the Effective Bandwidth of a link. Packets are still being transported at the same bit rate but due to latency it is taking much more time for all of the web-page packets to arrive.

It�s this �spreading over time� behavior of high latency networks which becomes noticeable to the user and creates the impression that a link is not operating at a high speed.

table 1 o3b 0210

O3b recently conducted another demonstration of real-world effects of latency using the time to load a web page. This is a very common activity and clearly shows users that latency directly affects the way a user obtains data from the Internet.

The following plots show the effects of latency on the time to load the Wall Street Journal web page...

table 2 o3b 0210

Satellite Link Latencies
Now that we know the effects of latency on real-world traffic, we�ll discuss the latency differences in two satellite technologies. Satellite links can introduce larger latencies than most terrestrial networking segments due to long distances from the ground stations to the satellite. The table on the next page shows the latency caused by propagation delays from two types of satellite configurations...

The O3b Networks MEO orbit constellation at an altitude of 8063 Kilometers

A geosynchronous satellite at 35,786 Kilometers

It is important to understand that for satellites which operate as a bent-pipe, the propagation delays are doubled as the signal has to travel both up to the satellite and back down to the Earth before it reaches the next segment of the network.

table 3 o3b 0210

The table above shows that a ground station in Lagos, Nigeria using an O3b Networks MEO satellite to connect to a teleport in Almeria, Spain, will experience round trip time (RTT) ranging from 122 to 133 milliseconds. If we add in the average internet latency from the Almeria teleport to most Internet destinations in Europe (60 ms), we end up with an overall latency from Lagos to a European internet site of 183 to 193 milliseconds. This range of latency is caused by the change in distance to the ground sites relative to the moving O3b Satellites. The AOS is the Acquisition of Signal, and the LOS is the point at which the O3b system will perform the handover to the next rising satellite. The Maximum Elevation is the point at which the satellite is closest to the customer ground station which explains why this point has the lowest latency.

By comparison, the same Lagos customer site using a geosynchronous satellite to a European internet site using will have to see latencies of 552 milliseconds. The last column in the chart shows the time required to make a data request and to start receiving the requested data. This data request time includes...

The request packet from the user to the web server

The web server acknowledging the request

The web server pushing to requested data to the user

The data arriving at the user�s computer

TCP also includes a returned ACK packet from the user to the web server but this time is not counted in the Data Request Cycle.

TCP also includes a returned ACK packet from the user to the web server but this time is not counted in the Data Request Cycle.

Geosynchronous satellite users must wait almost 1 second before they start getting data, whereas the lower latency O3b satellite link will receive it nearly 3x sooner.

When looking at the basic latency numbers, it�s easy to see that the O3b Satellite constellation will offer users a noticeably better Internet experience with more immediate feedback and quicker access to data.

Summary
We have described the structure of IP-based packet switched networks, the functions of the various protocol layers, and the causes of latency in packet switched data networks, such as the Internet.

Latency and overall throughput is dominated by two factors, the length of the route that the packets have to take between sender and receiver and the interaction between the TCP reliability and congestion control protocols and this path length. The O3b Networks satellite constellation in a much lower MEO orbit has significantly lower path length and therefore significantly lower latency than traditional geosynchronous satellites. Therefore, O3b�s network latency and throughput approximate and, in some cases, exceed that of fiber based terrestrial networks.