Editor's note: To view a PDF version of this article, click here.
Not long ago, quality of service was something phone company engineersBellheadscared most about, while Netheads scoffed at such outdated ideas. Now, as the poles of network infrastructure battle for the minds and dollars of tomorrow's networks, QoS is at the crux of the debate. This month, we will discuss the metrics encompassed by QoS.
In early December 2002, I attended one of the major cable industry trade shows. With attendance down about 40 percent, it had all the energy of a Bob Dole presidential campaign.
The show's first evening, our company, Motorola Inc., had a dinner reservation at one of Anaheim's finest restaurants serving French cuisinethat's French cuisine, not french fries. Unfortunately, due to the dearth of customers to schmooze in the business area's corner of our booth, the dinner ended up being for all Motorola employees.
Being the responsible employees we areand not just because the GM was part of the crowdwe changed the reservation to a Mexican restaurant the local guide told us was "best of Anaheim." Now, I love Mexican food, and this Mexican food was indeed excellent. The atmosphere-well, let's just say it was strip-mall chic and definitely a cut above the laundromat next door. Given the change in venueand menuour bill at the end of the meal was surely a small fraction of what the original plan would have yielded.
Despite our penny-pinching ways, the food indeed turned out to be very good. But what about the "best of Anaheim's" QoS? What can we say about the service experienced at budget burritoville? As every wait staffer quickly figures out with business customers, when serving rounds of drinks, the delivery of the meal can be delayed almost indefinitely. Not only that, but as time and bottles go by, diners become less and less concerned about the delay. This delay time would be quantified and described in networking circles as latency.
When the nachos finally ran low, the service providers realized the main course would have to be delivered soon, so they placed our order with the kitchen staff. Soon thereafter, the plates began to be served. Unfortunately, there were about 15 of us, and only one cook and one server. The food delivery was thus sporadically timed, as the server brought out the dishes as they rolled off the grill in sequence. Over the course of five or 10 minutes, three dishes would arrive, then a short wait, then two more, another wait, then another couple, etc. This variation in consistency of delivery is what network weenies would call jitter. This term is perhaps the most confusing one, because it has meaning as both a Layer 1 and Layer 2 parameter. Physical layer (Layer 1) gurus use "jitter" for a similarly critical, but different metric.
Finally, guess whose meal they forgot altogether? Had they known I was a rich and famous contributor to CSDwell, famous anywayI'm sure this would not have happened (OK, at least a contributor to CSD). After delivering 14 meals to everyone around me, the server asked what I had ordered. I repeated my order, and the server ran to the chef, who started right in on it. This, in networking lingo, is simply data loss.
This month, in addition to offering my dining critiquethree-and-a-half stars out of five, but skip the saladwe will dive into one of the key concepts of converging networksQoSand discuss what it represents.
That was Then
The current situation in network mindsets is the well-documented conflict that boils down to diehards associated with the telephony infrastructure and those associated with the data infrastructure. Carefully designed and implemented telephone networks were developed with rigor and rigidity. Ethernet-based local and wide area networks are built around flexible, low-cost, best-effort data service. The challenge of the former is to pass data efficiently and cost-effectivelyas a LAN can. The challenge for the latter is to obtain some, most or all of the rigorous capabilities offered originally in telephone networks and that we now broadly categorize as "high-QoS" networks.
Precisely what parameters do we mean by "QoS"? People seem to hurl the term around with impunity these days, often without making their meaning clear. QoS is more than just "good" servicewhatever that meansand typically boils down to five interrelated parameters, three of which and a couple of Corona's defined my evening out at the Anaheim Mexican restaurant. While interrelated, the five are also independently quantifiable metrics, and their relationship often amounts to trading one off against another.
There are the three that, along with digestive concerns, may come with your refried beans and nachos: latency, jitter and packet loss.
But the two items that find their way into QoS discussions, whether generally considered QoS parameters or not, are availability (i.e. the "five nines" definition of "carrier class") and throughput-usually a bit-per-second number.
Availability is generally more an architecture and cost issue, and less a technology one. Simply put, availability speaks to how often the network is "up" and thus available for service carriage. Carrier class, if going by the definition of 99.999 percent available, works out to a little more than five minutes a year of down time. You can understand, from just this one QoS parameter, why testing high-QoS solutions could itself become a significant part of a qualification process.
Of course, when considering the two polar end points of the service possibilitiesvoice and best-effort datathings are not always clear-cut. An IP network will route packets around downed network segments in a flexible and reliable manner, but likely without maintaining the same performance as if all network segments were functioning. But, if it is nonguaranteed traffic with no predefined minimum throughput, it is still "good enough" in terms of the service quality promised. It is easy to see how including multiple data service and quality tiers only leads to more fuzziness. Nonetheless, availability corresponds to what percentage of time the network is upwith what constitutes "up" sometimes difficult to pinpoint.
Now, let's move on to throughput. Throughput is often associated simply with increasing the bits per second as technology enables it, which stirs little controversy. As a physical last-mile link issue, it is simply always preferable to be able to offer more bandwidth at the largest bottleneck. This is as simple as the cable modem vs. the dial-up line.
However, bandwidth vs. traffic engineering is an argument that can stoke the flames of debate. Simplified QoS advocates will argue that the best way to ensure QoS is to build a super-big pipe, one in which the allocated traffic is merely a small part of what the pipe can handle. The idea, obviously, is that no one will be contending for network resources; everyone will get virtually all the throughput they want, and packets will traverse the network at nearly wire speed. QoS follows by default. Netheads who speak Internet Protocol (IP) would be especially attracted to this scenario, particularly because LAN equipment has ridden impressive cost curves, providing access to cheap bandwidth.
The opposite angle would be that it is senseless not to attempt to optimize or at least efficiently use a key valuable resource such as bandwidth. It is ill-advised to assume that underutilization will always be possible and that it will be an adequate approach. In this view, super pipes are merely the lazy man's traffic engineering. Since the modem scenario mentioned above is often played out in practicea key bottleneck limits end-to-end throughputit makes sense to design for throughput optimization at some level. Thus, throughput as a QoS term can stir a bit of controversy.
Latency, Jitter, and Loss
The three QoS parameters of latency, jitter and loss are depicted in Figure 1.
Loss: While traversing a network, packets of data may pass through multiple processing stages as they are routed from source to destination. Bottlenecks may ensue, depending on the popularity of a particular destination, the size of a particular pipe link chosen as the route or simply the volume of traffic in the network. In some cases, packets are filling queues faster than they are being forwarded out. Inevitably, when this occurs, the only way for the network to remain stable is to drop packets.
Obviously, a dropped packet can result in packet loss, but not necessarily in every case. When QoS parameters are enabled in the network, a means exists for establishing a pecking order of packets that can be dropped, rather than just discarding randomly or on the basis of location in the queue.
Again, the idea of dropping a packet from a queue itself does not automatically mean the payload information is lost for good, never to reach its destination. It is the job of higher-level protocols, when implemented as such, to recognize that some of the packet sequence is missing and request for it to be resent. Therefore, again we see that the parameters can be difficult to pinpoint and can be interrelated. The result of the situation described would potentially be a significant delay variation-increased jitter.
Don't Forget ATM
This concludes our introduction to QoS concepts. However, I would be remissnot to mention buried under e-mailif I did not at least mention asynchronous transfer mode. ATM was developed to support voice and data on the same network, properly allocating the resources for each, by design. The ATM protocol supports the concepts featured in this article but using different terms. ATM hardware did not take over the world as people once thought it would, for another complete column's worth of reasons and debate points.
Invented from the start with a multiple-service mindset, ATM is based around four different service classes, within which more parameters can be used to distinguish traffic on a per-flow basis. It's safe to say that ATM represented the first attempt at a comprehensive solution for using a single network to support a range of services that accounted for the extremesboth best-effort data and real-time voice.
As network designers try to converge on the same solution capabilities from entirely different angles, the debate will get hot and heavy about what technologies are necessary, which work better and which are destined for the ash heap. Ultimately, many of the questions will be answered in terms of dollar per decibel, throughput per dollar, operation and maintenance cost, or some other thing that ends with "cost."
These days and likely for the next several development cycles, until we all forget about the dot-bomb, cost will remain critical. I suspect the fine dining establishments in Anaheim will agree.
For more on QoS refer to:
- "Transport-Layer QoS Standards Improve Wireless Messaging Services"; www.commsdesign.com/story/OEG20021127S0034
- "Guaranteeing QoS in PON Designs"; www.commsdesign.com/story/OEG-20021003S0013
Rob Howald (rhowald@gi.com) is the director of systems engineering in the transmission network systems group of Motorola's Broadband Communications Sector in Horsham, Pa. He has a BSEE and an MSEE from Villanova University and a PhD from Drexel University.