Refactor data upload section (#16)
* Link to filezilla client instead of homepage

* Move upload images into separate folder and fix links

* Rename files within metadata.txt and folder to show that not only *.gz files can be registered

* WIP

* Update images and specify folder and file structure

* Apply code review
Steffengreiner authored Jul 24, 2024
1 parent dff3819 commit 53fd9ea
Showing 14 changed files with 53 additions and 30 deletions.
83 changes: 53 additions & 30 deletions docs/rawdata/raw_data_upload.md
@@ -8,7 +8,7 @@ This section gives an overview of how to upload data to measurements from QBiC's
The following is required in order to successfully execute the measurement data upload.

- Access to the project of interest
- An SFTP client software (e.g. [FileZilla](https://filezilla-project.org)
- An SFTP **client** software (e.g. [FileZilla](https://filezilla-project.org/download.php?type=client)
or [WinSCP](https://winscp.net))
- An LDAP account of the University Of Tübingen
- A connection to the University Of Tübingen network (
@@ -33,23 +33,23 @@ Uploading your files to us was never this easy! SFTP is a broadly used file tran
widespread use ensures that many client software products
support uploading files to us.
In this section we will go through the process of connecting to our server
using [FileZilla](https://filezilla-project.org) as an example.
using the [FileZilla](https://filezilla-project.org/download.php?type=client) client as an example.

![An image showing the button leading to the site manager](./images/raw_data_upload_open_site_manager.png)
**Open the Site Manager:** You need to add QBiC's upload server as a site to _FileZilla_.
**Open the Site Manager:** You need to add QBiC's upload server as a site to _FileZilla_ within its site manager.
To open the site manager select it from the menu or press on the highlighted icon.
![An image showing the button leading to the site manager](./images/upload/raw_data_upload_open_site_manager.png)

![An image showing the input fields for a new site. upload.qbic.uni-tuebingen.de is entered in the Host field and SFTP is selected as protocol.](./images/raw_data_upload_add_qbic_site.png)
**Add the upload server:** In the _Site Manager_ you can add sites to which you want to connect. For
measurement data registration, select `SFTP - SSH File Transfer Protocol` and
enter `upload.qbic.uni-tuebingen.de` into the `Host` field.
![An image showing the users site manager highlighting the host and connection field](images/upload/raw_data_upload_host_field.png)

You can log in with your _University of Tübingen_ credentials. Enter your university user account
into the `User` field.

![An image showing the users home folder. You can see three directories named registration, error and upload.](./images/raw_data_upload_initial_user_directory.png)

![An image showing the users site manager highlighting the user and password field](images/upload/raw_data_upload_user_field.png)
**Connect to the server:** Make sure you are in the network of the _University of Tübingen_. You can connect to the server by pressing `Connect` in
the _Site Manager_. After connecting to the server, _FileZilla_ shows you the contents of your home directory.

the _Site Manager_. After connecting to the server, _FileZilla_ shows you the contents of your home directory on the server side.
![An image showing the users home folder. You can see three directories named registration, error and upload.](images/upload/raw_data_upload_remote_filesystem.png)
!!! warning
When you first log in, the server will create some folders. Do not delete these folders!

@@ -63,53 +63,76 @@ For every registration task, the data needs to reside in a folder with the follo

```text
|- my-registration-batch // folder name is irrelevant
|- file1_1.fastq.gz
|- file1_2.fastq.gz
|- file2_1.fastq.gz
|- file2_2.fastq.gz
|- metadata.txt // mandatory!
|- file1_1.fastq.gz // all files except for metadata.txt serve as examples
|- file1_2.fastq
|- report.pdf
|- summary.html
```



!!! warning
Ensure that the names of the uploaded folder and its files do not contain whitespace.
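
The whitespace rule above can be checked locally before uploading. The following is a minimal sketch, not part of QBiC's tooling: the function name `names_with_whitespace` is hypothetical, and the check only looks at the folder name and its top-level entries, which is an assumption based on the statement that everything at the top level of the folder is considered.

```python
import re
from pathlib import Path


def names_with_whitespace(batch_dir: Path) -> list[str]:
    """Return the folder name and top-level entry names that contain whitespace.

    Hypothetical local pre-check; the server's own validation may differ.
    """
    candidates = [batch_dir.name] + sorted(p.name for p in batch_dir.iterdir())
    return [name for name in candidates if re.search(r"\s", name)]
```

An empty result means no offending names were found; any returned name should be renamed before the folder is uploaded.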

You can upload folders in the same way. Everything at the top level of your created folder is
considered. For uploading folders, specify the name of the folder instead of a file name.
Uploading only specific files from a subdirectory is not supported at the moment.

To register a folder the data needs to reside in a folder with the following structure:

```text
|- upload-example // folder name is irrelevant
|- metadata.txt // mandatory!
|- my-registration-batch/
```

!!! info
You can upload folders in the same way. Everything at the top level of your created folder is
considered. For uploading folders, specify the name of the folder instead of a file name.
Uploading only specific files from a sub directory is not supported at the moment.
The `metadata.txt` file for the folder example would look like this:

```text
NGSQTN23015AS-1225978199074484 my-registration-batch
```

The folder `my-registration-batch` represents an atomic registration unit and must contain the
`metadata.txt` with information about the measurement identifier and the files belonging to this
measurement
dataset.
measurement dataset.
One registration task can register data for multiple measurements. The `metadata.txt` file for the
previous example would look like this:

!!! note
Please ensure that measurement identifier and filename are separated by a TAB `\t` character and not
by spaces.

```text
NGSQTEST001AE-1234512312 file1_1.fastq.gz
NGSQTEST001AE-1234512312 file1_2.fastq.gz
NGSQTEST002BC-3321314441 file2_1.fastq.gz
NGSQTEST002BC-3321314441 file2_2.fastq.gz
NGSQTN23015AS-1225978199074484 file1_1.fastq.gz
NGSQTN23015AS-1225978199074484 file1_2.fastq
NGSQTN23015AS-1225978199074484 report.pdf
NGSQTN23015AS-1225978199074484 summary.html
```

!!! note
Ensure that measurement identifier and filename are separated by a TAB `\t` character and not
by spaces.
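
Because a TAB is easy to confuse with spaces in an editor, it can help to generate and check `metadata.txt` with a small script. The sketch below is an illustration under assumptions, not QBiC's validation logic: the helper names `write_metadata` and `check_metadata` are hypothetical, and the checks (exactly one TAB per row, referenced file or folder present next to `metadata.txt`) are inferred from the examples above.

```python
from pathlib import Path


def write_metadata(batch_dir: Path, entries: list[tuple[str, str]]) -> Path:
    """Write metadata.txt with one '<measurement id>TAB<file name>' row per entry."""
    lines = [f"{measurement_id}\t{file_name}" for measurement_id, file_name in entries]
    target = batch_dir / "metadata.txt"
    target.write_text("\n".join(lines) + "\n", encoding="utf-8")
    return target


def check_metadata(batch_dir: Path) -> list[str]:
    """Return a list of problems found in metadata.txt (empty if none).

    Hypothetical local pre-check; the server may apply further rules.
    """
    metadata = batch_dir / "metadata.txt"
    if not metadata.is_file():
        return ["metadata.txt is missing"]
    problems = []
    rows = metadata.read_text(encoding="utf-8").splitlines()
    for number, line in enumerate(rows, start=1):
        if not line.strip():
            continue  # ignore blank lines
        fields = line.split("\t")
        if len(fields) != 2:
            problems.append(f"line {number}: expected exactly one TAB separator")
            continue
        # The second column names a file or folder located next to metadata.txt.
        if not (batch_dir / fields[1]).exists():
            problems.append(f"line {number}: {fields[1]} not found in {batch_dir.name}")
    return problems
```

For the file example, `write_metadata(Path("my-registration-batch"), [("NGSQTN23015AS-1225978199074484", "report.pdf")])` would write one row, and `check_metadata` on the same folder would report a row that was separated by spaces instead of a TAB.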

## Upload your data

Once you have prepared your folder, upload it to your user directory on our server. Please do not
upload directly to the registration folder but stage it instead in your user directory.
Once your folder is prepared and uploaded to `upload.qbic.uni-tuebingen.de`, move it to
the `registration` folder.

!!! tip
You can easily drag and drop the folder with your mouse
from your local filesystem to our server within FileZilla.

Our system will then transfer the folder and proceed with data registration.

!!! success
Congratulations, you have uploaded your data!

Finally, you can view summarized information for your uploaded data within the raw data view of the data manager.
![raw_data_upload_shown_in_view.png](images/upload/raw_data_upload_shown_in_view.png)

## Handle failed uploads

Uploading data to a measurement can fail in certain cases. When an upload fails, a folder is created
in `/home/<your-user>/error`.
![An image showing the error folder. It contains a directory with a long name.](./images/raw_data_upload_error_directory.png)
![An image showing the error folder. It contains a directory with a long name.](./images/upload/raw_data_upload_error_folder.png)
In this new folder, you can find an `error.txt` file describing the error, and the data you tried to upload.
You can then try to fix the error and upload again.
You can then try to fix the error. To retry the upload of the fixed folder, move it to the registration folder again.
