This lesson is still being designed and assembled (Pre-Alpha version)

CyberInfrastructure Training to Advance Climate Scienc (CI-TRACS) - Data Movement Workshop

Presenters

Past Presenter and Contributor

Synopsis

This workshop will be an introduction to understanding the challenges and options in moving scientific data over the network. In particular, attendees will learn about some of the different network infrastructure and tools available and the use cases to apply them towards and highlight any disadvantages or drawbacks to a particular technology. Lessons will also address what to do if a transfer is experiencing less than expected performance, potential common contributors to a transfer bottleneck and when and who to ask for assistance will also be covered. Different data transfer tools like SFTP, Globus and Rclone for moving data will be covered in hands-on exercises.

Prerequisites

  • Basic SSH and Command line experience. We recommend the participants to go through shell-novice if you are new to the command-line
  • Have an account on Mana
  • Have UH Duo/MFA enabled
  • Have a modern web browser

Learning Outcomes:

By the end of this workshop attendees will know how to:

Schedule

Setup Download files required for the lesson
00:00 1. Introduction to Scientific Data Networks How do networks connect everything?
How is UH connected?
00:10 2. Networks What do networks look like?
What does the equipment that connects everything look like?
00:20 3. Data Transfer Evaluation Of The Network What are some tools we can use to test network throughput?
00:30 4. Processes and Queues What are Queues/Buffers?
How does data actually move from machine to machine?
00:40 5. Transmission Control Protocol (TCP) What is TCP?
00:50 6. Transfer Programs What are some of the most common/best transfer applications?
01:00 7. Scientific Data Transfer Examples What are some real world examples of data transfer issues that can be fixed?
01:10 8. Transferring files with remote computers How do I transfer files using wget, scp or rsync?
01:40 9. Introduction to Globus What does Globus do and how can I use it with my data?
01:45 10. Creating Globus Account/Install Globus Connect Personal Installation How do I create a Globus account using your UH credential?
How do I install Globus Connect Personal on my PC, Mac, Linux, or Unix?
02:05 11. Transferring Data How do I move data from my machine using Globus?
How do I move data tom my maching with Globus?
02:20 12. Configuring and Using Rclone How do I Configure Rclone?
02:40 13. Transferring Files with Rclone How do I move data from google drive to MANA?
How do I move data from MANA to google drive?
03:00 Finish

The actual schedule may vary slightly depending on the topics and exercises chosen by the instructor.