Acquire and deploy hardware

From DISI
Revision as of 10:05, 13 March 2014 by Frodo (talk | contribs)
Jump to navigation Jump to search

Here is our recommendation of how to build a computer cluster that will work well for molecular docking and cheminformatics. This document can be used whether you already own hardware, or whether you are planning to buy new. All recommendations are the best we know as of Feb 2014. Things do change, but this advice should be ok through 2015. This is part of a series of articles called So you want to set up a lab.

Buying CPU and disk

We are currently buying CPU from Silicon Mechanics and Dell and disk from Silicon Mechanics and HP. We recommend buying two different kinds of machines in a modular fashion: head nodes to which disk enclosures may be attached and cpu nodes which contain large numbers of cores. We are currently buying enclosures holding 12 SAS disks of 4TB each for 48 TB raw for around $8000 or about 6 raw GB per dollar. Formatted RAID6 this works out to 36TB or 4.5 formatted GB per dollar. We like the HP P822 high performance RAID controller. Compare this to what we were paying just a year ago in spring 2013: 25 TB for $10,000 or 2.5 TB per dollar unformatted. And it is SAS instead of SATA so we only need RAID6 and not RAID10 formatting. An amazing development in the last 12 months.

For CPU, we like the C6145 from Dell. For around $20,000 you get 2 machines in a 2U form each with 64 cores and 256 GB memory and a pair of RAID1 formatted disks each. A single 42U rack could hold 2560 cores and still have room for a switch. Of course, this would cost you $400,000, pull 28 kW and need 5 T of cooling. Amazing density at commodity prices.

If you are on a budget, you can get a single C6145 or equivalent, less memory, and use the disk in the CPU machine, don't get a separate disk server. There are ways to get a serious computational lab started for as little as $10,000 of hardware, you'll just have to cut a few corners.

Setting up a network

Managed 24 port GigE switch with VPN support. About $250. This is not a full layer 3 switch, but it should do for most circumstances. Allows you to run both private and public networks. We do know why people spend $2500 for a full layer 3 switch, but you shouldn't have to, unless you are in a very funky networking environment.

Network topology will depend on your circumstances. We recommend running a private network, and keeping any machines that don't have to be on the public network off it for security. We recommend using a portal computer and ssh keys for remote access.

Minimum requirements for legacy equipment

At the very least, you really need 24 cores at 1GB/core and 10TB of RAID10 formatted disk space to get started. There have been substantial improvements in disk price/performance and CPU-core/dollar performance in the past 24 months that we really recommend getting new equipment if you can. If you do not have a 24 core / 48 GB / 4 TB disk system to run the hypervisor and core services, then you will need to reserve 3 older 2, 4 or 8 core machines to run central services. You can use two cheap switches to run private and public networks separately.

If you have more money

Our recommended CPU machine costs $20K (128 cores). Our recommended disk machine costs $20K (96 TB raw). We recommend just buying these two machines in a proportion that makes sense for your research. A single switch will handle an entire rack. If you have a lot more money, ask us. Don't forget about cooling, UPS and power.

Physical setup

You can fit everything in one rack. Put the switch in the middle.


Return to So you want to set up a lab.