Overview

Data storage and preservation are integral to CRC1261’s commitment to data integrity and reproducibility.

Data Handling Guidelines

  • Use Central Storage
    Research data may be stored locally only temporarily during acquisition or initial processing.
    All data must be synchronized to designated central storage systems to ensure:

    • Security and protection of data

    • Traceability and reproducibility

    • Access and collaboration within the research team

  • Organize and Document Data
    Use clear folder structures organized by data type. Each dataset must include an INFO file or README.md documenting:

    • Methods and instruments used to generate the data

    • Directory structure and file naming conventions

    • Any context necessary for reuse or interpretation

    See: Guidelines for a Good README.

  • Assign Responsibilities
    Principal Investigators (PIs) are responsible for:

    • Ensuring proper data storage practices within their research group

    • Confirming timely synchronization of locally stored data to central storage

  • Manage Experimental Data
    Non-sensitive experimental data and lab records must be maintained in eLabFTW for traceability and collaborative use.

  • Handle Sensitive or Personal Data Carefully
    Sensitive or personal data must be stored in pseudonymized form in the REDCap system
    (hosted by the Department of Neurology, fully EU GDPR compliant).

  • Store and Share Code
    Software, scripts, and documentation must be managed in the CRC1261 GitLab Group.
    For platform usage, see: GitLab RZ CAU.
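The README requirement under "Organize and Document Data" can be bootstrapped with a small helper. The following is only a sketch: the function name `write_readme_skeleton` and the section headings are illustrative assumptions derived from the three documentation items listed above, not an official CRC1261 template.

```shell
# write_readme_skeleton DATASET_DIR
# Hypothetical helper: writes a README.md skeleton into DATASET_DIR.
# The headings mirror the three documentation items in the guideline;
# they are an assumption, not a prescribed template.
write_readme_skeleton() {
    dataset_dir=$1
    readme="$dataset_dir/README.md"

    if [ ! -d "$dataset_dir" ]; then
        echo "Dataset directory not found: $dataset_dir" >&2
        return 2
    fi
    # Never overwrite documentation that already exists
    if [ -e "$readme" ]; then
        echo "README.md already exists in $dataset_dir, leaving it untouched" >&2
        return 1
    fi

    cat > "$readme" <<'EOF'
# Dataset title

## Methods and instruments
(How the data were generated and with which instruments)

## Directory structure and file naming
(Folder layout and file naming conventions)

## Context for reuse
(Anything needed to interpret or reuse the data)
EOF
}
```

Calling `write_readme_skeleton /path/to/dataset` creates the file once; a second call leaves the existing README.md unchanged and returns a non-zero status.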

Storage Services

| Service | Purpose / Notes | Quota | Backup |
|---|---|---|---|
| Home Network Drive | Personal files or temporary local storage (must sync centrally). | 50 GB[1] | Daily incremental (60 days) |
| Project Network Drive | Shared project data ≤ 1 TB. For small/medium project datasets. | On request | Daily incremental (60 days) |
| Research Data Storage | Shared project data > 1 TB or long-term collaborative work. | On request | Daily incremental (60 days) |
| CAU-Cloud Sync & Share | Synchronization across devices / external collaboration. | 20 GB per user[2] | Managed via recycle bin; 7–30 day retention[3] |
| GitLab RZ CAU | Version control and collaborative development (code & documentation). | 20 GB per repo | Daily incremental (60 days) |
| SAMBA Storage Service | Legacy / transition storage. Only for legacy use. | - | - |

Which storage to use?

The flowchart below guides the selection of appropriate storage based on data type, size, and sharing needs.

        flowchart TD

A["Start: You need to store research data"] --> B{"Is the data code or documentation?"}
B -->|"Yes"| G["GitLab RZ CAU"]
B -->|"No"| C{"Is the data experimental / lab data (non-sensitive)?"}
C -->|"Yes"| H["eLabFTW"]
C -->|"No"| D{"Is the data sensitive?"}
D -->|"Yes"| M["REDCap (pseudonymous, GDPR-compliant)"]
D -->|"No"| E{"Is it personal work?"}
E -->|"Yes"| I["Home Network Drive (temporary)"]
E -->|"No"| F{"Is total project data ≤ 1 TB?"}
F -->|"Yes"| J["Project Network Drive"]
F -->|"No"| K["Research Data Storage"]

G --> L["Ensure daily backup / sync"]
H --> L
I --> L
J --> L
K --> L
M --> L

L --> N["Data stored according to CRC1261 policy<br><small>Local storage only temporary, must be synced centrally</small>"]
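The decision path in the flowchart can also be written as a short shell function, which may help when scripting data intake. This is a sketch under stated assumptions: the function name `choose_storage`, its yes/no arguments, and the convention that 1 TB equals 1000 GB are illustrative choices, not part of the CRC1261 policy.

```shell
# choose_storage IS_CODE IS_LAB_DATA IS_SENSITIVE IS_PERSONAL SIZE_GB
# Hypothetical helper following the flowchart's question order.
# IS_* arguments are "yes" or "no"; IS_LAB_DATA means non-sensitive
# experimental / lab data. SIZE_GB is a whole number of gigabytes.
choose_storage() {
    is_code=$1
    is_lab_data=$2
    is_sensitive=$3
    is_personal=$4
    size_gb=$5

    if [ "$is_code" = "yes" ]; then
        echo "GitLab RZ CAU"
    elif [ "$is_lab_data" = "yes" ]; then
        echo "eLabFTW"
    elif [ "$is_sensitive" = "yes" ]; then
        echo "REDCap"
    elif [ "$is_personal" = "yes" ]; then
        echo "Home Network Drive"
    elif [ "$size_gb" -le 1000 ]; then
        # Assumption: 1 TB is treated as 1000 GB
        echo "Project Network Drive"
    else
        echo "Research Data Storage"
    fi
}
```

For instance, `choose_storage no no no no 2500` prints `Research Data Storage`, matching the "> 1 TB" branch of the flowchart.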
    

Backup

Data stored on the central CRC1261 storage systems (Home Network Drive, Project Network Drive, Research Data Storage) is protected by the Computing Centre with daily incremental backups (60-day retention).

However, data located on local devices (laptops, lab PCs, acquisition computers) is not backed up and is at risk of loss.

Therefore, local research data must be synchronized daily to central storage using secure transfer methods.

  • When files are synchronized to the central storage systems, they are automatically included in the Computing Centre’s backup schedule.

  • Software and code stored in GitLab RZ CAU are version-controlled, but raw data and results still need to be synchronized to central storage.

  • PIs are responsible for ensuring that daily synchronization to central storage occurs and that research group members understand this workflow.

Example

Synchronizing local data to the central storage

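# Mount the network drive at a local mount point
# (create the mount point once with: sudo mkdir -p /mnt/my_home_drive)
# uid/gid are taken from the current user so the mounted files are writable
sudo mount -t cifs //my_network_path /mnt/my_home_drive -o uid=$(id -u),gid=$(id -g),rw,user,username=suabc123,domain=uni-kiel.de

# Use rsync to mirror the source directory to the mounted network drive.
# Without a trailing slash on the source, rsync creates backup/directory/ on the target.
rsync -av --delete /path/to/source/directory /mnt/my_home_drive/backup/

# Unmount the network drive after the backup is completed
sudo umount /mnt/my_home_drive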

Explanation of rsync options used:

  • -a: Archive mode (preserves permissions, ownership, timestamps, etc.)

  • -v: Verbose mode (provides detailed output)

  • --delete: Deletes files in the backup directory that are no longer present in the source directory. Because deletions cannot be undone by rsync, preview the run first with -n (--dry-run).
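One risk in the example above is that rsync writes to the local disk if the mount step failed, and that `--delete` removes files without a preview. The following sketch guards against both. The function name `sync_to_central`, its arguments, and the default dry-run behaviour are assumptions for illustration; the `backup/` subfolder follows the example above.

```shell
# sync_to_central SRC_DIR MOUNT_POINT [apply]
# Hypothetical wrapper around the rsync call shown above.
# Performs a dry run unless the third argument is "apply".
sync_to_central() {
    src=$1
    mount_point=$2
    mode=${3:-dry-run}

    if [ ! -d "$src" ]; then
        echo "Source directory not found: $src" >&2
        return 1
    fi
    # Refuse to run if the network drive is not actually mounted;
    # otherwise rsync would silently fill the local mount directory.
    if ! mountpoint -q "$mount_point"; then
        echo "Not a mount point: $mount_point" >&2
        return 2
    fi

    if [ "$mode" = "apply" ]; then
        rsync -av --delete "$src" "$mount_point/backup/"
    else
        # -n (--dry-run) lists transfers and deletions without performing them
        rsync -avn --delete "$src" "$mount_point/backup/"
    fi
}
```

A typical sequence would be `sync_to_central /path/to/source/directory /mnt/my_home_drive` to review the planned changes, followed by the same command with `apply` as the third argument.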