Open main menu

Linux and Unix Users Group at Virginia Teck Wiki β

Changes

Amazon EC2: GPU Cluster

12 bytes removed, 07:43, 3 January 2019
no edit summary
An '''Amazon EC2: GPU cluster''' is a remote GPU cluster seriveservice.
=Creating a new image=
From this point on, we will deal with a SUSE instance, but the general steps will be the same. '''Only select an instance that can work with CUDA!'''
* With the Classic Wizard, select Cluster Instances HVM SUSE Linux Enterprise Server 11. * Change the Instance Type from T1 Micro M3 Extra Large to CG1 Cluster Compute Eight Extra Large Instance GPU (cg1.4xlarge).
* Enable Termination protection.
* Make sure you specify that your Elastic Block Storage volume NOT be destroyed when the instance is terminated (or your data will be lost when the instance is terminated, which may happen without your consent if you are using a spot instance and the variable price exceeds your maximum bid).
<pre>wget http://developer.download.nvidia.com/compute/cuda/$CUDA_MAJOR_VERSION/$CUDA_MINOR_VERSION/installers/cuda_5.0.35_$(Linux|Windows)_$ARCH_$IMG.run
chmod +x cuda_*.run
sudo ./cuda_*.run
</pre>
We only need the developer drivers. Ensure that the drivers install correctly. Note that it needs to run as root. The cuda dev drivers are already installed in some AMIs (for example the CentOS-GPU image).