- Overview
- Registration Information
- Registration Discounts
- Symposium Organizers
- At a Glance
- Calendar
- Technical Sessions
- Live Streaming
- Purchase the Box Set
- Tutorial on GENI
- Posters and Demos
- Sponsorship
- Activities
- Hotel and Travel Information
- Services
- Students
- Questions?
- Help Promote
- For Participants
- Call for Papers
- Past Proceedings
sponsors
usenix conference policies
EyeQ: Practical Network Performance Isolation at the Edge
Vimalkumar Jeyakumar, Stanford University; Mohammad Alizadeh, Stanford University and Insieme Networks; David Mazières and Balaji Prabhakar, Stanford University; Changhoon Kim and Albert Greenberg, Windows Azure
The datacenter network is shared among untrusted tenants in a public cloud, and hundreds of services in a private cloud. Today we lack fine-grained control over network bandwidth partitioning across tenants. In this paper we present EyeQ, a simple and practical system that provides tenants with bandwidth guarantees as if their endpoints were connected to a dedicated switch. To realize this goal, EyeQ leverages the high bisection bandwidth in a datacenter fabric and enforces admission control on traffic, regardless of the tenant transport protocol. We show that this pushes bandwidth contentionto the network’s edge, enabling EyeQ to support end-to-end minimum bandwidth guarantees to tenant endpoints in a simple and scalable manner at the servers. EyeQ requires no changes to applications and is deployable with support from the network available today. We evaluate EyeQ with an efficient software implementation at 10Gb/s speeds using unmodified applications and adversarial traffic patterns. Our evaluation demonstrates EyeQ’s promise of predictable network performance isolation. For instance, even with an adversarial tenant with bursty UDP traffic, EyeQ is able to maintain the 99.9th percentile latency for a collocated memcached application close to that of a dedicated deployment.
Open Access Media
USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.
author = {Vimalkumar Jeyakumar and Mohammad Alizadeh and David Mazi{\`e}res and Balaji Prabhakar and Albert Greenberg and Changhoon Kim},
title = {{EyeQ}: Practical Network Performance Isolation at the Edge},
booktitle = {10th USENIX Symposium on Networked Systems Design and Implementation (NSDI 13)},
year = {2013},
isbn = {978-1-931971-00-3},
address = {Lombard, IL},
pages = {297--311},
url = {https://www.usenix.org/conference/nsdi13/technical-sessions/presentation/jeyakumar},
publisher = {USENIX Association},
month = apr
}
Presentation Audio
by Lakshminarayanan Subramanian
The basic question addressed in this paper is: "How do we provide the abstraction of network isolation across different competing tenants in a datacenter network, where each tenant gets the network abstraction of a dedicated switch?" In other words, the datacenter network aims to provide bandwidth guarantees over short-time scales to thecompeting tenants in a datacenter. Achieving network-level isolation abstraction is a challenging problem, since the network resources are shared across multiple tenants and the datacenter network has limited knowledge about, and little control over, the volume of traffic from each tenant. This paper presents EyeQ, a system that aims to provide fine-grained control for bandwidth partitioning across tenants in a datacenter network, and which aims to provide bandwidth guarantees to tenants. I believe that this paper solves a timely problem with growing traffic needs and contention levels at datacenters.
The key idea of their approach is to map the network isolation problem to a distributed and reactive congestion control protocol where they use rate meters in receivers that send congestion feedback to rate limiters at senders. Using a distributed feedback and sender based rate-limiting mechanism, the authors show that they can achieve predictable networkisolation over short time scales. EyeQ aims to guarantee a minimum bandwidth per each flow. To achieve this, EyeQ uses a combination of three ideas: (a) the use of admission control at a flow level, using a hierarchical rate limiter at the sender side; (b) an aggregate tenant-level rate meter at the receiver, which provides control feedback to the sender; (c) the use of a variant of the Rate Control Protocol (RCP) at a per-flow level at the sender side, to provide per-flow max-min fairness within a pool of flows within a single tenant. One interesting result in this paper that EyeQ has a worst-case convergence time which is significantly lower than DCTCP and QCN, two related datacenter congestion control works. One should not view this paper as just a congestion control technique. The problem of providing predictable bandwidth guarantees turns out to be a much harder problem than achieving fairness and network sharing that conventional congestion control techniques aim to achieve.
There are several avenues to build upon this work. The bandwidth guarantees presented in this paper are probabilistic in nature. There is need for a more thorough theoretical analysis to quantify the nature of the guarantees, stability and convergence properties of the techniques presented in this paper. Achieving predictable network isolation in theface of misbehaving flows is inherently a hard problem, and this paper relies on the existence of a sender-level rate limiter to partially achieve this goal. The paper takes a specific design choice of using a rate control based approach to achieve predictable bandwidth guarantees. Given the large body of work on Internet Quality of Service (QoS), are there other techniques which could achieve a similar result or even improve the natureof achievable guarantees? To design a practical solution, one must also balance the tradeoff between deployability and achievable guarantees. Other avenues to consider in future work include exploring predictable guarantees for other network parameters like RTT and loss, and understanding the tradeoff between traffic volatility and achievable guarantees.
connect with us