It is important to understand that none of our main file systems are backed up on other devices. This means that in case of a hardware crash data will be lost.
Some file systems, have snapshots, which allow a rollback of the file system to a certain point in time. Snapshots reside on the same file system as the data itself, and, while CSE storage systems have raid protection that guards against single disk failure, without backup, data will be lost if a multiple disk/shelf crash occurs.
The main storage systems are:
These directories (/cs/usr/login accessible as "~") reside on a highly capable file server with provided snapshots and have the lowest chance of crashing but are relatively more limited in space.
These are directories that reside under /cs/labs. They provide snapshots and are the volume of choice for large amounts of data. Some labs have directories under /cs/snapless which are desired for large volume volatile data and, hence, contain no snapshots. Lab users that intend to run heavy IO intensive jobs should consult with the CSE System Group first.
Each faculty member is allocated up to 2 terabytes of space, which should cover most users. Anyone requiring more space will be asked to pay a yearly amount that will cover the price of additional storage resources.
Other filesystems can have varying snapshot policies. Contact the CSE System Group if you want to better understand their status.
To safely store code, we suggest using our Github. The github virtual machine and data are backed up on two different file-systems, providing high-level resilience.
Another easy backup option is using the unlimited space Google provides along with your mail HUJI account. Some students and faculty members are using this option and are pleased with the results. Please note that these backups are solely the responsibility of the user and are in no way supported by the CSE System Group.
Snapshots do not behave well with heavy writes. If your lab needs to write a lot of data or do heavy reads please contact us for further assistance.
Users are allocated disk quota as follows:
Account type | Group | Home |
---|---|---|
undergraduate | stud | 6G |
graduate | grad | 7G |
doctoral | phd | 9G |
post-doctoral | pdoc | 35G |
visting lecturer | visitor | 50G |
faculty | staff | 25G |
guest | guest | 6G |
external graduate | cgrad | 7G |
external phd | cphd | 9G |
To see how much quota you have, type nquota. Note that the data for nquota is updated once every few minutes.
Snapshots are short-term online backups. There are three types of snapshots: hourly, daily and weekly. Snapshots cycle out -- i.e. if N hourly snapshots are defined for a filesystem, when the next snapshot is created, the oldest (N+1) snapshot disappears permanently. Note that online means that the snapshots are part of the filesystem. If a filesystem becomes unavailable due to a fileserver failure, the snapshots are unavailable as well.
Snapshots are available in the special .snapshot directory and can be accessed using the snapshot utility or directly (e.g. "cd .snapshot"). Note that the .snapshot doesn't show up when searching a directory (i.e. "ls -a" won't show it), but it exists and can be "cd"-ed into.
The current snapshot schedule is as follows:
File System Type | Path | Hourly | Daily | Weekly |
---|---|---|---|---|
Home | ~ | 6 | 14 | 4 |
Graduate Labs | /cs/labs/supervisor/login | 24 | 7 | 4 |
If there is insufficient disk space, some window managers might not start at all. When this happens you'll receive some message of “Disk quota exceeded”, or an error message saying the Xsession has terminated too quickly. In this case, to be able to login and clean up your quota, you'll have to login using xterm.
There are two ways to login in xterm:
After you've logged in, you'll get a simple xterm window. There you can start cleaning up your quota as described in the following sections. When finished, simply type exit, and re-login again.
Note: Without a window manager, to write commands to the xterm window, you'll have to place the mouse cursor on that window.
Some basic utilities to manage disk space:
du ~ | sort -rn
\ls -A1 | sed "s#'#\\\'#" | sed 's/ /\\ /g' | xargs du -sckx -- | sort -n
tar cvzf <archive file> <list of files>
rm <list of files>
tar --remove-files cvfz <archive file> <list of files>
tar xvzf <archive file>
du -sh <directory>
mv ~/.cache ~/.config ~/.local ~/.ntprofile.V2 /cs/labs/supervisor/login/
ln -s /cs/labs/supervisor/login/.cache ~
ln -s /cs/labs/supervisor/login/.config ~
ln -s /cs/labs/supervisor/login/.local ~
ln -s /cs/labs/supervisor/login/.ntprofile.V2 ~