(but not always easy!)
You've already been working on the cloud instance and moving data from external servers onto your instance as well as moving data around your instance.
Remember, when you downloaded the E. coli reference genome
$ wget ftp://bacteria.ensemble....
In this case, you are using a command line tool, wget, to download content from a webserver. This command supports downloading files from FTP and HTTP(s). The tool wget also supports recursive download (with the parameter -r), allowing you to download content from a directory or folder. For your information, there are other command line tools that can also be used to download data (e.g., curl), but wget should serve you well for this lesson and bioinformatic analysis.
These directions are platform specific so please follow the instructions for your system
Using PC, we recommend you use the PSCP program. This program is from the same suite of tools as the putty program we have been using to connect.
If you haven't done so, download pscp from http://the.earth.li/~sgtatham/putty/latest/x86/pscp.exe
Make sure the PSCP program is somewhere you know on your computer. In this case, your Downloads folder is appropriate.
Open the windows PowerShell; go to your start menu/search enter the term 'cmd'; you will be able to start the shell (the shell should start from C:\Users\your-pc-username>).
Change to the download directory
$ cd Downloads
C:\User\your-pc-username\Downloads> pscp.exe local_file.txt dcuser@ip.address:/home/dcuser/
$ scp local_file.txt dcuser@ip.address:/home/dcuser/
Let's download a zipped file from our remote machine. You should have a fastqc report in ~/dc_workshop/results/fastqc_untrimmed_reads/SRR097977_fastqc.zip
Tip: If you are looking for another (or any really) zip file in your home directory to use instead try
$ find ~ -name *.zip
C:\User\your-pc-username\Downloads> pscp.exe dcuser@ip.address:/home/dcuser/dc_workshop/results/fastqc_untrimmed_reads/SRR097977_fastqc.zip C:\User\your-pc-username\Downloads
$ scp dcuser@ip.address:/home/dcuser/dc_workshop/results/fastqc_untrimmed_reads/SRR097977_fastqc.zip ~/Downloads
If you have an iPlant account, or storage on an iRods server, we can you iCommands to move the data to your personal Data Store.
If you are unfamiliar with iCommands see their documentation
$ iinit
You will then be asked to setup your account and will need to enter the following information
Prompt | Entry |
---|---|
irodsHost | data.iplantcollaborative.org |
port | 1247 |
zone | iplant |
irodsUserName | your iplant username |
Current iRODS password | your iplant password |
Verify that you have connected to your iPlant Data Store; view the contents of your home directory using the following the ils command:
$ ils
To move data from a local source to iPlant use the iput command. In this case we pass the -P option to see the progress of the transfer.
$ iput -P remote_machine_file.txt .
Tip: in this case, the '.' stands for the current iPlant working directory which is by default '/iplant/home/your-iplant-username'
Use the iget command to move files from the Data Store into your remote machine:
$ iget -P iplant_datastore_file.txt .
There are several other iCommands options (including how to use the -T option for more reliable big data transfers, and -r command for recursive transfers of directories - See iCommands documentation.