Now that you have access to the shared S3 bucket, we will download data from the S3 bucket to your machine.
aws s3 ls --profile training --region ap-southeast-1 s3://slchen-lab-transfer/GIS-training/
You will find the sub-folders of the slchen-lab-transfer in the list under the “GIS-training” folder.
NOTE: Only a specific few folders may have been shared. Users are able to select which folders are available to the public within a bucket.
We will now copy files from a specific location within this folder to your machine.
Before that, let’s create a directory on your machine to store this data.
mkdir -p /tmp/fastq/SRR6327950
cd /tmp/fastq/SRR6327950
aws s3 cp --profile training s3://slchen-lab-transfer/GIS-training/SRR6327950/SRR6327950_1.fastq.gz . --region ap-southeast-1
aws s3 cp --profile training s3://slchen-lab-transfer/GIS-training/SRR6327950/SRR6327950_2.fastq.gz . --region ap-southeast-1
md5sum *.fastq.gz
The original MD5 hashes for these files are:
mkdir -p /tmp/fastq/SRR6327875
cd /tmp/fastq/SRR6327875
# note here the --no-sign-request makes an anonymous request to this public S3 bucket
aws s3 sync --no-sign-request s3://sra-pub-run-odp/sra/SRR6327875/ /tmp/fastq/SRR6327875/
# convert the sra formatted file to fastq, then gzip them and clean up
fasterq-dump ./SRR6327875
gzip SRR6327875_1.fastq
gzip SRR6327875_2.fastq
rm -f SRR6327875
We are now ready to run some data analyses.