Moving project data from Rocket to Comet

At the end of 2025, as Rocket is retired, projects may be transferred to Comet using the HPC Portal, but this will not transfer any data from Rocket to Comet. All data transfer is the responsibility of the project owner and members.

Rsync for transfers between Rocket, Comet and RDW on Linux Command Line 

rsync is usually faster than cp and scp for repeated transfers because its delta-transfer algorithm sends only the parts of files that have changed, so updating an existing copy is quick. It also offers compression, efficient directory handling, and the ability to resume interrupted transfers. scp copies every file in full each time, which makes it simpler but less efficient for syncing existing data. Also see our File Transfers and RDW page.

Basic syntax: rsync -a [options] source destination

Transfer files direct from Rocket to Comet

We think ALL projects should have an RDW share. Over the past months you have been advised to back up your Rocket data to RDW. If your data is on the Research Data Warehouse (RDW), there is no rush; it can be downloaded efficiently to Comet as required, because RDW is mounted (available) on Comet as /rdw. However, you can also copy data directly from Rocket to Comet:

On a Comet login node

$ pwd
/nobackup/proj/rocket_code
$ rsync -a --stats user@rocket.hpc.ncl.ac.uk:/nobackup/proj/myproject/MyData ./fromRocket

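If your transfer takes a long time, or you can't easily check the result using ls, add --itemize-changes to list each change rsync makes; --size-only skips files whose size already matches at the destination: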

(OSS 2410.0) [user@cometlogin02(comet) comet_training]$ rsync -a --itemize-changes --inplace --whole-file --size-only --stats user@rocket.hpc.ncl.ac.uk:/nobackup/proj/myproject/MyData ./fromRocket

Permissions related errors

If you attempt to copy files to which you do not have read access, rsync will show an error like:

rsync: send_files failed to open "/nobackup/proj/jshpcu/bonnie_rocket_64g.txt": Permission denied (13)

This usually happens when someone other than you created the file. Ask the file owner to fix the permissions; if this isn't possible, contact the RSE-HPC team for help.
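As an illustration of what "fixing permissions" might look like for the file owner, the sketch below grants group read access recursively. It assumes that project members share a Unix group that owns the files, which may not match how your project is set up; the directory and file names are hypothetical and the demonstration runs in a throwaway sandbox.

```shell
#!/bin/sh
# Hypothetical directory standing in for data you created on Rocket.
shared=$(mktemp -d)
mkdir -p "$shared/results"
echo "data" > "$shared/results/run1.txt"

# Start from owner-only permissions, which other members cannot read.
chmod 700 "$shared/results"
chmod 600 "$shared/results/run1.txt"

# g+r lets the group read files. Capital X adds execute (search)
# permission only to directories and to files that are already
# executable, so ordinary data files do not become executable.
chmod -R g+rX "$shared/results"

# Show the resulting modes: the directory is now group-searchable
# and the file group-readable.
ls -ld "$shared/results" "$shared/results/run1.txt"
```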

Transfer from your campus workstation to Comet 

Command line Linux or Mac:

scp is only suitable for small data transfers, but rsync is fine for all transfers to Comet.

on campus

For example, suppose you have a directory named 'forComet' in your home directory on your campus workstation:

$ pwd
/home/user
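# The colon after the hostname marks the destination as remote; with no path after it, files go to your home directory on Comet
$ rsync -az --stats ./forComet user@comet.ncl.ac.uk: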

off campus

Transferring data from off-campus is to be avoided where possible, because speed will be poor. Rsync is the best option when this must be done, because it allows proxy connections and resuming after failures. First check that you can ssh to the proxy server. Setting up ssh keys on the proxy (unix.ncl.ac.uk) can be helpful.

$ pwd
/home/user/myHPCdata
# ssh -J (ProxyJump, OpenSSH 7.3 or later) routes the connection through the proxy server
$ rsync -az -e "ssh -J user@unix.ncl.ac.uk" ./forComet user@comet.ncl.ac.uk:

Graphical options on Linux, Mac and Windows:

NB: the free versions of these applications only work on campus, as they do not support connecting through a gateway (proxy). Connect to the wired LAN rather than campus WiFi to improve transfer speeds.

Transfer files from RDW to Comet

RDW will always be mounted on Comet. If your data is on RDW, there is no need to copy it to Comet until you need to work with it. Your permissions to access RDW expire after an hour, so use kinit to refresh your login before running a long transfer.

Try out a dry run: 

[user@cometlogin01(comet) ~]$ rsync --dry-run -rltv --inplace --itemize-changes --progress --stats --whole-file --size-only /rdw/path/to/my/share/source/ /nobackup/myproj/destination

Run ‘for real’:

[user@cometlogin01(comet) ~]$ kinit
[user@cometlogin01(comet) ~]$ rsync -rltv --inplace --itemize-changes --progress --stats --whole-file --size-only /rdw/path/to/my/share/source/ /nobackup/myproj/destination

