Editing
CCU:GPU Cluster
(section)
Jump to navigation
Jump to search
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
== How to get started == Most up to date information for the current cluster: * [[CCU:GPU_Cluster_Quick_Start | Quick start tutorial]] * [[CCU:Perstistent storage on the Kubernetes cluster | How to use persistent storage]] * [[Tutorials:Link_to_container_registry_on_our_server | How to use the CCU image repository]] * [[Tutorials:Mount_cifs_storage_in_a_pod | How to mount cifs storage]] * [[Cluster:Compute nodes | How to target different compute nodes]] The following information is partially outdated, and refers to older system architectures (Ubuntu 18.04). In particular, the install scripts probably do not work anymore. Instead, refer to current online documentation on how to install e.g. nvidia drivers and nvidia docker. * Preparing your system ** Step 1: [[Tutorials:Install nVidia CUDA and GPU drivers|Install nVidia CUDA and GPU drivers]] ** Step 2: [[Tutorials:Install the nVidia docker system|Install the nVidia docker system]] ** For the impatient: [[Tutorials:Complete install script for a fresh Ubuntu 18.04|Complete install script for a fresh Ubuntu 18.04]] These tutorials should still work: * Learning the basics of Docker (requires docker or nvidia-docker for the GPU containers) ** [[Tutorials:Link to container registry on our server|Link to container registry on our server]] ** An in-depth look at a [[Example:container which trains MNIST using Tensorflow|container which trains MNIST using Tensorflow]], with the following steps: *** Step 1: create a local python tensorflow application. *** Step 2: wrap the application in a container. *** Step 3: run and test the container locally. *** Step 4: push the container to the registry server of the cluster. *** Step 5: remarks on persistent storage in docker containers * Learning the basics of Kubernetes and how to run jobs on the cluster: ** Step 1: [[Tutorials:Install the Kubernetes infrastructure|Install the Kubernetes infrastructure]] ** Step 2: [[Tutorials:Set up your Kubernetes user account|Set up your Kubernetes user account]] ** Step 3: [[Tutorials:Run the example container on the cluster|Run the example container on the cluster]] and make sure that it works correctly.
Summary:
Please note that all contributions to Collective Computational Unit may be edited, altered, or removed by other contributors. If you do not want your writing to be edited mercilessly, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource (see
CCU:Copyrights
for details).
Do not submit copyrighted work without permission!
Cancel
Editing help
(opens in new window)
Navigation menu
Personal tools
Not logged in
Talk
Contributions
Create account
Log in
Namespaces
Project page
Discussion
English
Views
Read
Edit
View history
More
Search
Navigation
Collective Computational Unit
Main page
Projects
Tutorials
GPU Cluster
Core Facilitys
Mediawiki
Recent changes
Random page
Help
Tools
What links here
Related changes
Page information