Transferring Data To and From MSI
This page outlines common ways to move data between your local computer, external services, and MSI storage.
Choose the transfer method that matches your source and destination:
Google Drive and Tier 1: use
rcloneLocal computer and Tier 1: use SFTP
Local computer and Tier 1 or Tier 2: use Globus
External Globus user and MSI: use the
globus-incomingworkflow on the MSI Globus endpoint
Before starting any large transfer:
confirm that the destination path is correct
make sure you have enough quota available at the destination
verify the copied data before deleting the original copy
Transfer Between Google Drive and Tier 1 with rclone
For Google Drive transfers, MSI recommends rclone.
Regular transfers can be run from the command line on MSI, typically from an interactive job for larger copies. The one-time Google authentication step may require a browser session, so many users complete the initial setup from an Open OnDemand desktop session and then use the command line for later transfers.
Load the MSI module:
module load rclone/1.64.1
One-Time Configuration
You only need to configure the remote once.
Start the configuration tool:
rclone config
Use answers like these during the setup:
nto create a new remoteprovide a name such as
your_umn_gdrivechoose
google drivefor the storage typeleave
client_idblankleave
client_secretblankchoose full access when prompted for scope
leave
root_folder_idblankleave
service_account_fileblankchoose
Nofor advanced configchoose
Nofor auto config on the headless MSI systemopen the provided URL in a browser, sign in to your UMN Google account, authorize access, and paste the verification code back into the terminal
choose
Nofor team drive unless you specifically need oneconfirm the configuration and quit
The resulting rclone configuration is stored in your home directory for later use.
If you cannot complete the browser-based authorization from a plain SSH session, start an Open OnDemand desktop session temporarily, run rclone config there, and then return to the CLI for routine use.
Routine Transfers
Copy a Google Drive folder into a new directory in your current Tier 1 location:
rclone copy -v your_umn_gdrive:Work_MSI/Projects/data ./data
Copy the contents of a Google Drive folder into your current Tier 1 directory:
rclone copy -v your_umn_gdrive:Work_MSI/Projects/data .
Copy a single file from Google Drive into your current Tier 1 directory:
rclone copy -v your_umn_gdrive:Work_MSI/Projects/file.txt .
rclone also works in the opposite direction, so you can reverse the source and destination to move data from Tier 1 back to Google Drive when needed.
Transfer Between Your Local Computer and Tier 1 with SFTP
SFTP is a straightforward option when you want to move files between your local computer and MSI Tier 1 storage over SSH.
For a graphical client, see Transfer Between Your Local Computer and Tier 1 with FileZilla. For large or long-running transfers, Globus is usually the better choice.
Connect to MSI with SFTP:
sftp INTERNETID@agate.msi.umn.edu
Common SFTP commands:
lpwdshows your current local directorypwdshows your current remote MSI directorylcd LOCAL_PATHchanges your local directorycd REMOTE_PATHchanges your remote MSI directoryput myfile.txtuploads a file from your computer to MSIput -r mydiruploads a directory from your computer to MSIget myfile.txtdownloads a file from MSI to your computerget -r mydirdownloads a directory from MSI to your computer
Example session:
sftp INTERNETID@agate.msi.umn.edu
sftp> cd /projects/standard/PROJECT_NAME/shared/INTERNETID
sftp> lcd ~/Desktop
sftp> put -r analysis-output
Use SFTP for Tier 1 locations such as your home directory or project space. SFTP is not an access method for Tier 2 object storage.
Transfer Between Your Local Computer and Tier 1 or Tier 2 with Globus
Globus is often the easiest choice for large or long-running transfers because it manages retries and continues transfers even if your browser session ends.
At MSI, Globus can be used with:
UMN MSI Homefor Tier 1 storageUMN MSI Tier2for Tier 2 storageyour local system through Globus Connect Personal or another Globus endpoint you can access
Typical Globus workflow:
Sign in to Globus.
Open the Files view.
Select your local endpoint in one pane.
Select
UMN MSI HomeorUMN MSI Tier2in the other pane.Browse to the source and destination folders.
Start the transfer.
Review transfer status in Globus and confirm the files arrived where expected.
For more on MSI’s Globus setup, see MSI’s Globus transfer page.
Receive Data from an External Globus User
If someone outside MSI needs to send you data through Globus, use the MSI Globus endpoint’s globus-incoming location described on the MSI Globus page.
Typical staging locations look like this:
Tier 1:
/projects/standard/PROJECT_NAME/shared/globus-incomingTier 2:
s3://BUCKET_NAME/globus-incoming
Create the Tier 1 directory from the CLI if it does not already exist:
mkdir -vp "$SHARED/globus-incoming"
For Tier 2, create the globus-incoming prefix from the Globus interface when you set up the transfer destination.
Typical workflow:
Coordinate with the external collaborator on the directory name and expected contents.
Have the collaborator transfer the data into your designated location under
globus-incomingon the MSI endpoint.After the transfer completes, log in to MSI and verify the files.
Move the data from
globus-incominginto its long-term destination in Tier 1 or transfer it onward to Tier 2 if that is the better storage location.
Treat globus-incoming as a staging area rather than a permanent storage location.
Choosing the Right Transfer Method
Use this quick guide when deciding how to move data:
Use
rclonefor Google Drive and Tier 1 transfersUse SFTP for simple local-to-Tier-1 copies over SSH
Use Globus for large local transfers, managed transfers, and Tier 1 to Tier 2 workflows
Use
globus-incomingwhen receiving data from external Globus users