At the end of 2025, as Rocket is retired, projects may be transferred to Comet using the HPC Portal, but this will not transfer any data from Rocket to Comet. All data transfer is the responsibility of the project owner and members.
rsync is usually faster than cp or scp for repeated transfers because its delta-transfer algorithm sends only the changed parts of files rather than whole files, so updating an existing copy is quick. It also supports compression, handles directories efficiently, and can resume an interrupted transfer. scp copies every file in full each time, which makes it simpler but less efficient for syncing existing data. Also see our File Transfers and RDW page.
Basic syntax: rsync -a [--options] source destination
We think ALL projects should have a Research Data Warehouse (RDW) share. Over the past months you have been advised to back up your Rocket data to RDW. If your data is on RDW, there is no rush; it can be downloaded efficiently to Comet as required because RDW is mounted (available) on Comet as /rdw. However, data that exists only on Rocket must be copied to Comet directly, as follows.
Log in to Rocket and find the full path of the data you want to copy (use pwd to get the path), e.g. /nobackup/projectcode/MyData/
Log in to Comet and cd to your new project directory. (On Comet, project directory names are prefaced with their origins, like comet_aprj or rocket_aprj.)
Run rsync from Comet, replacing the user ID and directory names with your own:
rsync -a userid@rocket.hpc.ncl.ac.uk:/nobackup/proj/projectcode/MyData ./fromRocket
$ pwd
/nobackup/proj/rocket_code
$ rsync -a --stats user@rocket.hpc.ncl.ac.uk:/nobackup/proj/myproject/MyData ./fromRocket
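Whether the source path ends in a slash changes where the files land: in the command above, MyData has no trailing slash, so the data arrives as ./fromRocket/MyData. The sketch below shows the difference using throwaway local directories only. The directory names withoutSlash and withSlash are made up for illustration, and nothing is copied to or from Rocket or Comet.

```shell
# Local demonstration only: create a small throwaway source directory.
work=$(mktemp -d)
mkdir -p "$work/MyData"
echo "example" > "$work/MyData/results.txt"

if command -v rsync >/dev/null 2>&1; then
    # No trailing slash: the MyData directory itself is created in the destination.
    rsync -a "$work/MyData" "$work/withoutSlash"

    # Trailing slash: only the contents of MyData are copied into the destination.
    rsync -a "$work/MyData/" "$work/withSlash"

    # List the copied files to compare the two layouts.
    find "$work/withoutSlash" "$work/withSlash" -type f
fi
```

The first copy ends up at withoutSlash/MyData/results.txt and the second at withSlash/results.txt.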
If your transfer takes a long time, or you can't easily check the result using ls, these options can help:
--itemize-changes --inplace --whole-file --size-only
--dry-run shows what would happen if you ran the command, but doesn't copy any files. Add this to the end of your command to check, then remove it and run the command 'for real'.
(OSS 2410.0) [user@cometlogin02(comet) comet_training]$ rsync -a --itemize-changes --inplace --whole-file --size-only --stats user@rocket.hpc.ncl.ac.uk:/nobackup/proj/myproject/MyData ./fromRocket
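One way to keep the check and the real transfer identical is to build both from the same string. The sketch below is a hypothetical helper, not part of rsync: the function name rsync_cmd and the SRC and DEST values are assumptions to be replaced with your own. It only prints the commands, so you can inspect them before running one.

```shell
# Placeholder source and destination; replace with your own user ID and paths.
SRC="user@rocket.hpc.ncl.ac.uk:/nobackup/proj/myproject/MyData"
DEST="./fromRocket"

# Hypothetical helper: print the rsync command for a dry run ("check")
# or for the real transfer (any other argument).
rsync_cmd() {
    opts="-a --itemize-changes --stats"
    if [ "$1" = "check" ]; then
        opts="$opts --dry-run"
    fi
    echo "rsync $opts $SRC $DEST"
}

rsync_cmd check   # command that reports what would be copied
rsync_cmd run     # same command without --dry-run
```

Because both commands come from the same options string, the real transfer cannot drift from what the dry run reported.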
Permissions related errors
If you attempt to copy files to which you do not have read access, 'rsync' will show an error like:
rsync: send_files failed to open "/nobackup/proj/jshpcu/bonnie_rocket_64g.txt": Permission denied (13)
This usually happens when someone other than you created the file. Ask the file owner to fix the permissions; if this isn't possible, contact the RSE-HPC team for help.
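To find such files before starting a large transfer, you could list everything under a directory that your account cannot read. This is a minimal sketch: list_unreadable is a hypothetical function name, and it assumes file names do not contain newlines. It would be run on Rocket, where the source data lives.

```shell
# Hypothetical helper: print every path under directory $1 that the
# current user cannot read. Prints nothing if everything is readable.
list_unreadable() {
    find "$1" -print | while IFS= read -r path; do
        [ -r "$path" ] || printf '%s\n' "$path"
    done
}

# Usage (placeholder path):
# list_unreadable /nobackup/proj/projectcode/MyData
```

Any paths it prints are the ones to raise with the file owner before running rsync.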
scp is only suitable for small data transfers, but rsync is fine for all transfers to Comet.
For example, on your campus workstation, you have a directory named 'forComet' in your home directory:
$ pwd
/home/user
$ rsync -az --stats ./forComet user@comet.ncl.ac.uk:
Transferring data from off-campus is to be avoided where possible, because speed will be poor. Rsync is the best option when this must be done, because it allows proxy connections and resuming after failures. First check that you can ssh to the proxy server. Setting up ssh keys on the proxy (unix.ncl.ac.uk) can be helpful.
$ pwd
/home/user/myHPCdata
$ rsync -az -e "ssh -J user@unix.ncl.ac.uk" ./forComet user@comet.ncl.ac.uk:
NB the free versions of these applications will only work on campus as they do not support using a gateway (proxy). Connect to wired LAN rather than campus WiFi to improve transfer speeds.
RDW will always be mounted on Comet. If your data is on RDW, there is no need to copy it to Comet until you need to work with it. Use kinit to refresh your login before running a long transfer. Your permissions to access RDW expire after an hour.
Try out a dry run:
[user@cometlogin01(comet) ~] rsync --dry-run -rltv --inplace --itemize-changes --progress --stats --whole-file --size-only /rdw/path/to/my/share/source/ /nobackup/myproj/destination
Run ‘for real’:
[user@cometlogin01(comet) ~] kinit
[user@cometlogin01(comet) ~] rsync -rltv --inplace --itemize-changes --progress --stats --whole-file --size-only /rdw/path/to/my/share/source/ /nobackup/myproj/destination
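The two steps can also be chained so that the copy only starts if the login refresh succeeded. This is a minimal sketch, assuming kinit and rsync are both on your PATH; rdw_copy is a hypothetical function name, and the paths in the usage comment are the placeholders from the example above.

```shell
# Hypothetical helper: refresh the login with kinit, then copy from RDW.
# $1 = source directory on RDW, $2 = destination directory on Comet.
rdw_copy() {
    if kinit; then
        rsync -rltv --inplace --itemize-changes --progress --stats \
            --whole-file --size-only "$1" "$2"
    else
        # Do not start a long transfer without a fresh login.
        echo "kinit failed; transfer not started" >&2
        return 1
    fi
}

# Usage (placeholder paths):
# rdw_copy /rdw/path/to/my/share/source/ /nobackup/myproj/destination
```

This only refreshes the login at the start; a transfer that runs for longer than the expiry period may still need to be rerun, which rsync handles by skipping files that have already been copied.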