|
Atomic Weapons Establishment Relies on Extreme
Networks to Meet the Demands of its High-Performance Computing Cluster
Also available in PDF
format (111K)| Download the ZIP
format (94K)
The Atomic Weapons Establishment (AWE) has been central to the defense
of the United Kingdom for more than 50 years, providing and maintaining
the warheads for the country's nuclear deterrent. Uniquely among
the nuclear powers, AWE covers the whole life cycle of nuclear warheads,
in a single establishment. This includes initial concept and design, through
component manufacture and assembly, to in-service support and, finally,
decommissioning and disposal.
Since 1993 AWE has been managed on behalf of the Ministry of Defense
under a government-owned/contractor-operated arrangement. AWE hosts one
of the largest and most advanced research and development centers in the
United Kingdom, facilitating the design and production of Britain's
nuclear warheads. These warheads must have assured performance, but with
the most stringent safe guards against unplanned operation.
Scientific and technical computing has always played a major role in
AWE's work, so when the organization needed to meet the switching
demands of its Galaxy Cluster Supercomputer, comprised of 115 dual-CPU
nodes running at 2.8 GHz and deliver a peak 64 bit performance of 5.6
Gflop/s, AWE looked to Extreme Networks.
Achieving Extreme Cluster Performance
AWE worked closely with Prime Business Solutions to evaluate high-performance
switching solutions for the cluster. Together, they developed a clear
understanding of the cluster's immediate requirements as well as
those well into the future, and developed criteria to assess several vendors.
Throughput, port density and the ability to scale the cluster without
significant or costly upgrades, were the primary focus.
"Some vendors clearly didn't meet our stringent performance
requirements, and others came close but price was an issue," explains
Neil McMahon, High-Performance Computing Network Manager. "After
evaluating the options, Extreme Networks was the clear winner."
AWE selected Extreme Networks BlackDiamond® 10808 core switch because
it delivered the most cost effective, high-performance solution. The BlackDiamond
10808 augments the BlackDiamond family of core switches with a new platform
incorporating major technological breakthroughs in the areas of scalability,
security, resiliency and extensibility—all requirements in a cluster
environment.
Keeping the Cluster Up and Running 24x7
At AWE, the cluster must be available 24x7. Downtime, whether due to network
failures or taking switches down to perform upgrades, dramatically affect
the cluster’s processing productivity.
"Our goal was to minimize network downtime, and that included
taking switches down for upgrades," explains McMahon. "With
the BlackDiamond 10808 every active component, including the software,
can be upgraded without taking the switch out of service. That's
a significant benefit in a clustering environment."
Built to deliver a high level of resiliency for cluster networks, the
BlackDiamond 10808 leverages a new state-of-the-art chassis designed to
help ensure that every possible redundancy is available. The BlackDiamond
10808 delivers high availability by providing carrier-class redundancy;
failover to a backup component occurs without affecting users or application
traffic. By combining innovative selfhealing hardware and software capabilities,
it is designed to be the industry's most resilient switch.
By providing the high network availability AWE's cluster applications
require, Extreme Networks offers greater flexibility to meet organizational
needs without sacrificing performance or increasing costs. Extreme Networks
supports high network availability by offering advanced features such
as industry standard VRRP and Extreme Standby Router Protocol (ESRP) for
fully coordinated Layer 2 and Layer 3 client router redundancy. ESRP also
delivers multi-layer redundancy for servers and clients. In addition,
Policy-Based Quality of Service (QoS) allows network managers to create
policies that provide network availability for mission critical applications
running on cluster servers.
Because AWE moves large amounts of data from disks to the compute area,
line rate throughput was also a paramount concern—degradation in
performance is unacceptable. For networks processing and managing large
amounts of traffic, such as AWE's clustering applications, Extreme
Networks supports jumbo frames. An Extreme Networks infrastructure supports
jumbo frames across every switch—for up to six times the efficiency
of smaller-sized frames, resulting in more efficient transmission of data
between the individual nodes in a cluster.
"The throughput we've been able to achieve with the BlackDiamond
10808 is tremendous," notes McMahon. "There has been a noticeable
increase in throughput and performance gains compared to the switches
we replaced."
Scaling to Meet Future Needs
In AWE's high-performance clustering environment, space is at a
premium. The organization needed a flexible, high port density solution
that would allow the cluster to grow without requiring additional space.
In addition, AWE needed the clustering solution it selected to scale to
meet future needs without requiring significant new investment, including
the ability to upgrade to 10 gigabit without a forklift overhaul of the
network.
"Because we don't know the availability of additional funds
in the future, it was important to select a solution that could meet our
evolving requirements, including moving to 10 gigabit without a forklift
upgrade," says McMahon. "Extreme was able to deliver the scalability
and extensibility our cluster demands."
Offering the greatest number of 10 gigabit and Gigabit Ethernet ports
of any switch in the industry, BlackDiamond 10808 scales to meet both
the immediate and potential future requirements of even the most demanding
cluster environments. In addition, Extreme Networks BlackDiamond 10808
incorporates programmable ASIC technology. Not only does the BlackDiamond
10808 hardware support a wide array of protocols from day-one, the programmable
ASIC technology helps ensure the platform will support emerging protocols
without costly hardware upgrades—offering revolutionary investment
protection for cluster networks.
Even though there are plans to grow the cluster from 115 nodes today
to more than 400 nodes in the future, network management will remain simple.
"Although the performance of the network has increased, the time
and resources required to manage it haven't," says McMahon.
"Extreme's management platform is easy to learn and intuitive."
By sharing a common operating system and management platform between Extreme
Networks switches, the Extreme Networks solution can deliver high-performance
clustering without creating greater management complexity. The ExtremeWare®
XOS™ operating system delivers the uncompromising management, control
and security needed for demanding cluster networks. With standards-based
multi-layer switching and Policy-based QoS, AWE is able to take advantage
of the tools it needs to make the most of network capacity. In addition,
the high port density available in Extreme Networks switches offers support
for a large number of server clusters without increasing the cost of supporting
those clusters.
McMahon concludes, "While other vendors had plans to deliver the
high-performance solution we needed, Extreme was able to deliver a solution
today."
|