CSL-Daily Dataset

Introduction

CSL-Daily is a large-scale dataset for continuous sign language translation (SLT). It provides both spoken-language translations and gloss-level annotations. Its content centers on everyday life (e.g., travel, shopping, medical care), the most likely application scenario for SLT.

Download

The CSL-Daily dataset is released to universities and research institutes for research purposes only. To request access to the data, please follow the instructions below:

  1. Download the CSL-Daily Dataset Release Agreement.
  2. Read all terms and conditions carefully.
  3. Complete the agreement. Note that it must be signed by a full-time staff member; a student's signature is not accepted.
  4. Scan the signed agreement and send it to ustc_vslrg At 126.com, with a CC to Prof. Zhou (zhwg At ustc.edu.cn). If you are a student, please also CC the full-time staff member who signed the agreement.

Reference

Please cite the following paper if you use CSL-Daily in your research:

  • Hao Zhou, Wengang Zhou, Weizhen Qi, Junfu Pu, and Houqiang Li, "Improving Sign Language Translation with Monolingual Data by Sign Back-Translation," IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021.

In addition, you may refer to the following work on continuous sign language recognition (SLR) published by our group:

  • Hezhen Hu, Weichao Zhao, Wengang Zhou, and Houqiang Li, "SignBERT+: Hand-model-aware Self-supervised Pre-training for Sign Language Understanding," IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI), 2023.

Contact

If you have any questions about the dataset and our papers, please feel free to contact us: