General Singularity info
Why the name “Singularity”?
The name “Singularity” for me (Greg) stems back from my past experience working at a company called Linuxcare where the Linux Boot-able Business Card (LNX-BBC) was developed. The BBC, was a Linux rescue disk which paved the way for all live CD bootable distributions using a loop back file system called the “singularity”.
This nomenclature represented that all files within the environment were contained within a single file, and for the same reason Singularity emphasizes the same nomenclature. (Thanks LNX-BBC!)
Which namespaces are virtualized? Is that select-able?
The goal of Singularity is to run an application within a contained environment such as it was not contained. Thus there is a balance between what to separate and what not to separate. At present the virtualized namespaces are process, mount points, and certain parts of the contained file system.
When you run your Singularity container, you may find that the process IDs start with 1 (one) and increment from there. You will also find that while the file system is contained starting with ‘/’ (root), you can have access outside the container via your starting path. This means that relative paths will resolve outside the container, and fully qualified paths will resolve inside the container.
To achieve this behavior, you will find that several Linux namespaces are separated (PIDS, file systems and descriptors, mounts, and root file system). These can be enabled or disabled by the build and what namespaces the host system supports as well as through environment variables.
Why can’t you just use RunC or any other container system on a shared system?
This is a copy/paste from a discussion on the email list describing RunC Vs. Singularity on a shared HPC system.. But most of these tenants apply to all of the popular container systems.
There are a number of reasons why RunC will not work on (my) shared multi-tenant environments (your system may vary):
- Requires root to run (there is however a submitted patch to allow non-root, but it has not been accepted at this point)
- Even with the proposed patch, no mitigation of user escalation within the container
- The container files themselves are owned by root, thus a user can not “bring their own environment”
- No facility or optimization’s for MPI or parallel job launch
- Requires a very recent host operating system (RHEL7 and compats, and similar vintage Debian derivatives)
- No automatic resolution of which namespaces to use (e.g. automatic disable PID namespace separation for OMPI shared memory optimization’s)
- It is not a “mobility of compute” solution (it is an example implementation of the OCI)
- Users can escalate to root and potentially get access to shared file systems, run daemons and escape the standard user and scheduler limitations
Singularity on the other hand addresses these issues and more:
- Singularity runs as the user that invoked it, and it prevents escalation pathways to obtain root within a container
- Singularity can be used without any modification within an HPC environment (resource managers, interact with HPC file systems, interconnects, GPUs, etc..)
- Because Singularity uses a single file for the container, that single file can be owned by a user but contain root owned files inside (thus a user can copy from another system)
- Single file also optimizes parallel runs with lots of open()s (large python runs can take 10-30 minutes to start on a big system, but not in a container image)
- Designed for: mobility/portability, speed, HPC, application virtualization (running apps within the container as if they are running on the host)
- Works on all currently maintained vintages of Linux (e.g. RHEL 5 and compats)
- No limitations on vintage of Container OS (e.g. I have a 17 year old install (RHL8) running in a Singularity image)
How does Singularity relate/differ from Docker?
Docker has been used for a variety of purposes, but it is designed as a platform to provide replicatable, network service vitalization. Because of this basic assumption and design model, it makes it difficult to implement on shared HPC platforms (and thus Singularity was born). Additionally, Docker supports the notion of emulating full operating system environments including user context escalation.
Singularity on the other hand does not support user escalation or context changes, nor does it have a root owned daemon process managing the container namespaces. It also exec’s the process work-flow inside the container and seamlessly redirects all IO in and out of the container directly between the environments. This makes doing things like MPI, X11 forwarding, and other kinds of work tasks trivial for Singularity.
If you already have a Docker container you can import it directly into Singularity!
How does Singularity relate/differ from Shifter?
NERSC (like most HPC centers) are feeling the pressure from users asking for support for containers, specifically Docker. Due to the architecture of Docker it is very difficult (if not impossible) to properly and securely implement in a multi-tenant HPC environment. Shifter is NERSC’s implementation to provide a Docker compatible front-end interface to their extreme scale HPC resources. It is system/resource specific in that you must import an existing container (from Docker, Singularity, or other), to the host/Shifter implementation.
Singularity on the other hand does not leverage the Docker work-flow and targets a different premise - Mobility of Compute. This makes the integration of Singularity non-HPC specific (even though it works very well with HPC) and allows the image to become the primary unit of mobility (you can share and operate directly on Singularity images).
Singularity is more of a general purpose mobility of compute solution that is very capable at HPC, Shifter’s primary focus is targeting extreme scale HPC and integration with Cray and the resource manager.
How does Singularity relate/differ from Flatpak
Flatpak is a packaging subsystem that uses some container technologies to create distribution neutral packages and it is more similar to the initial proof of concept of Singularity. But the use-cases of Singularity dictated that we should support full operating system containers that contain the entire user’s environment.
How does Singularity relate/differ from other container systems like OpenVz, LXC/LXD, etc.?
Singularity differs from other container systems in several major ways that impact usability on shared systems. For example, most container systems emulate standard systems in that there is the ability and necessity to escalate to root, run on separate IP/network address, run services, and in some cases even virtually boot the container system.
Singularity on the other hand focuses on the ability to virtualized only what is necessary to achieve run-time application container and portable environments. For instance, you can not obtain root from within a Singularity container.
There are some additional performance and design enhancements which make Singularity also more appropriate in a scheduled HPC environment. The back-end image type is one such feature that negates the need for temporary caching of container images and optimizes meta-data IO (especially on parallel file systems). Another feature is how Singularity interacts with the host operating system to facilitate application work-flows like X11 and MPI.
How does Singularity relate/differ from statically compiled binaries?
Statically compiled binaries are a good comparison to what Singularity can do for a single program because it will package up all of the dynamic libraries and package them into a single executable (interpreted) format.
But because Singularity is actually wrapping operating system files in to a container, you can do much more with it… Such as include other files, scripts, work-flows, pipe lines, data, and multi program processes and package them into a single portable executable format.
What Linux distributions are you trying to get on-board?
All of them! Help us out by letting them know you want Singularity to be included!
Basic Singularity usage
Do you need administrator privileges to use Singularity?
You do not need admin/sudo to use Singularity containers. You do however need admin/root access to install Singularity and to build/manage your containers and images, but to use the containers you do not need any additional privileges to run programs within it.
This then defines the work-flow to some extent… Singularity container images must be built and configured on a host where you have root access (this can be a physical system or on a VM or Docker image). Once the container image has been configured it can be used on a system where you do not have root access as long as Singularity has been installed there.
Can you edit/modify a Singularity container once it has been instantiated?
Yes, if you call it with the -w/–writable flag. (e.g. ‘singularity shell –writable Container.img’).
Can multiple applications be packaged into one Singularity Container?
Yes! You can even create entire pipe lines and work flows using many applications, binaries, scripts, etc.. Look into the RunScript bootstrap definition option to define what happens when a Singularity container is run (note: you can accomplish this by also creating an executable file within your container at /singularity and when the container is executed directly or via the ‘run’ command, this will get executed).
How are external file systems and paths handled in a Singularity Container?
Because Singularity is based on container principals, when an application is run from within a Singularity container its default view of the file system is different from how it is on the host system. This is what allows the environment to be portable. This means that root (‘/’) inside the container is different from the host!
Singularity automatically tries to resolve directory mounts such that things will just work and be portable with whatever environment you are running on. This means that /tmp and /var/tmp are automatically shared into the container as is /home. Additionally, if you are in a current directory that is not a system directory, Singularity will also try to bind that to your container.
There is a caveat in that a directory must already exist within your container to serve as a mount point. If that directory does not exist, Singularity will not create it for you! You must do that.
What is the difference between full and relative paths?
See the above answer to “How are external file-systems and paths handled in a Singularity Container?”.
How does Singularity handle networking?
Singularity does no network isolation because it is designed to run like any other application on the system. It has all of the same networking privileges as any program running as that user.
Can I import an image from Docker?
Yes, there are several ways to do this! First, Docker has the ability to export the data of a particular container and Singularity has the ability to import using the same format that Docker exports. In a nutshell, it is as easy as:
$ docker export [container name] | sudo singularity import /path/to/container.img
We also now support “bootstrapping” Docker images, and to do this you would create a definition file, an image, and then bootstrap. First, here is most simplest definition file, “ubuntu.def”:
Bootstrap: docker From: ubuntu:latest IncludeCmd: yes
Now let’s create an image and bootstrap using the file:
$ sudo singularity create ubuntu-latest.img $ sudo singularity bootstrap ubuntu-latest.img ubuntu.def
Make sure you are running Singularity >2.2 to make full use of this feature.
Can a Singularity container be multi-threaded?
Yes. Singularity imposes no limitations on forks, threads or processes in general.
Can a Singularity container be suspended or check-pointed?
Yes and maybe respectively. Any Singularity application can be suspended using standard Linux/Unix signals. Check-pointing requires some preloaded libraries to be automatically loaded with the application but because Singularity escapes the hosts library stack, the checkpoint libraries would not be loaded. If however you wanted to make a Singularity container that can be check-pointed, you would need to install the checkpoint libraries into the Singularity container via the specfile
Are there any special requirements to use Singularity through a job scheduler?
Singularity containers can be run via any job scheduler without any modifications to the scheduler configuration or architecture. This is because Singularity containers are designed to be run like any application on the system, so within your job script just call Singularity as you would any other application!
Does Singularity work in multi-tenant HPC cluster environments?
Yes! HPC was one of the primary use cases in mind when Singularity was created.
Most people that are currently integrating containers on HPC resources do it by creating virtual clusters within the physical host cluster. This precludes the virtual cluster from having access to the host cluster’s high performance fabric, file systems and other investments which make an HPC system high performance.
Singularity on the other hand allows one to keep the high performance in High Performance Computing by containerizing applications and supporting a runtime which seamlessly interfaces with the host system and existing environments.
Can I run X11 apps through Singularity?
Yes. This works exactly as you would expect it to.
Can I containerize my MPI application with Singularity and run it properly on an HPC system?
Yes! HPC was one of the primary use cases in mind when Singularity was created.
While we know for a fact that Singularity can support multiple MPI implementations, we have spent a considerable effort working with Open MPI as well as adding a Singularity module into Open MPI (v2) such that running at extreme scale will be as efficient as possible.
note: We have seen no major performance impact from running a job in a Singularity container.
Why do we call ‘mpirun’ from outside the container (rather then inside)?
With Singularity, the MPI usage model is to call ‘mpirun’ from outside the container, and reference the container from your ‘mpirun’ command. Usage would look like this:
$ mpirun -np 20 singularity exec container.img /path/to/contained_mpi_prog
By calling ‘mpirun’ outside the container, we solve several very complicated work-flow aspects. For example, if ‘mpirun’ is called from within the container it must have a method for spawning processes on remote nodes. Historically ssh is used for this which means that there must be an sshd running within the container on the remote nodes, and this sshd process must not conflict with the sshd running on that host! It is also possible for the resource manager to launch the job and (in Open MPI’s case) the Orted processes on the remote system, but that then requires resource manager modification and container awareness.
In the end, we do not gain anything by calling ‘mpirun’ from within the container except for increasing the complexity levels and possibly loosing out on some added performance benefits (e.g. if a container wasn’t built with the proper OFED as the host).
See the Singularity on HPC page for more details.
Does Singularity support containers that require GPUs?
Yes, Singularity has been tested to run some test and diagnostic code from within a container without modification. There are however potential issues that can come into play when using GPUs, for instance there are version API compatibilities between kernel and user land which will have to be considered.
Are Singularity containers kernel dependent?
No, never. But sometimes yes.
Singularity is using standard container principals and methods so if you are leveraging any kernel version specific or external patches/module functionality (e.g. OFED), then yes there maybe kernel dependencies you will need to consider.
Luckily most people that would hit this are people that are using Singularity to inter-operate with an HPC (High Performance Computing) system where there are highly tuned interconnects and file systems you wish to make efficient use of. In this case, See the documentation of MPI with Singularity.
There is also some level of glibc forward compatibility that must be taken into consideration for any container system. For example, I can take a Centos-5 container and run it on Centos-7, but I can not take a Centos-7 container and run it on Centos-5.
note: If you require kernel dependent features, a container platform is probably not the right solution for you.
Can a Singularity container resolve GLIBC version mismatches?
Yes. Singularity containers contain their own library stack (including the Glibc version that they require to run).
What is the performance trade off when running an application native or through Singularity?
So far we have not identified any appreciable regressions of performance (even in parallel applications running across nodes with InfiniBand). There is a small start-up cost to create and tear-down the container, which has been measured to be anywhere from 10 - 20 thousandths of a second.
The following are miscellaneous questions.
Are there any special security concerns that Singularity introduces?
No and yes.
While Singularity containers always run as the user launching them, there are some aspects of the container execution which requires escalation of privileges. This escalation is achieved via a SUID portion of code. Once the container environment has been instantiated, all escalated privileges are dropped completely, before running any programs within the container.
Additionally, there are precautions within the container context to mitigate any escalation of privileges. This limits a user’s ability to gain root control once inside the container.
You can read more about the Singularity security overview here.
When I try mounting my container with the
--bind option I receive an unknown option or Invalid argument error.
Make sure that you are using the most recent Singularity release to mount your container to the host system, and that the
--bind argument is placed after the execution command (e.g.
shell). An example might look like this:
$ singularity run -B $PWD:/data my_container.img
Also, make sure you are using an up-to-date Singularity to bootstrap your container. Some features (such as
--bind) will not work in earlier versions.
A little bit of help.
How to use Singularity with GRSecurity enabled kernels
To run Singularity on a GRSecurity enabled kernel, you must disable several security features:
$ sudo sysctl -w kernel.grsecurity.chroot_caps=0 $ sudo sysctl -w kernel.grsecurity.chroot_deny_mount=0 $ sudo sysctl -w kernel.grsecurity.chroot_deny_chmod=0 $ sudo sysctl -w kernel.grsecurity.chroot_deny_fchdir=0
The container isn’t working on a different host!
Singularity by default mounts your home directory. While this is great for seamless communication between your host and the container, it can introduce issues if you have software modules installed at
$HOME. For example, we had a user run into this issue.
Solution 1: Specify the home to mount
A first thing to try is to point to some “sanitized home,” which is the purpose of the
--home option. For example, here we are creating a home directory under
/tmp/homie, and then telling the container to mount it as home:
rm -rf /tmp/homie && mkdir -p /tmp/homie && \ singularity exec -H /tmp/homie analysis.img /bin/bash
Solution 2: Specify the executable to use
It may be the issue that there is an executable in your host environment (eg, python) that is being called in preference to the containers. To avoid this, in your runscript (the
%runscript section of the bootstrap file) you should specify the path to the executable exactly. This means:
%runscript # This specifies the python in the container exec /usr/bin/python "$@" # This may pick up a different one exec python "$@"
This same idea would be useful if you are issuing the command to the container using
exec. Thanks to yarikoptic for the suggestions on this issue.
Error running Singularity with sudo
This fix solves the following error when Singularity is installed into the default compiled prefix of /usr/local:
$ sudo singularity create /tmp/centos.img sudo: singularity: command not found
The cause of the problem is that
sudo sanitizes the PATH environment variable and does not include /usr/local/bin in the default search path. Considering this program path is by default owned by root, it is reasonable to extend the default sudo PATH to include this directory.
To add /usr/local/bin to the default sudo search path, run the program visudo which will edit the sudoers file, and search for the string ‘secure_path’. Once found, append :/usr/local/bin to that line so it looks like this:
Defaults secure_path = /sbin:/bin:/usr/sbin:/usr/bin:/usr/local/bin