Skip to content

SCUT-CTW1500 Datasets

Data Downloading

SCUT-CTW1500 Datasets download link

Please download the data from the website above and unzip the file. After unzipping the file, the data structure should be like:

ctw1500
 ├── ctw1500_train_labels
 │   ├── 0001.xml
 │   ├── 0002.xml
 │   ├── ...
 ├── gt_ctw_1500
 │   ├── 0001001.txt
 │   ├── 0001002.txt
 │   ├── ...
 ├── test_images
 │   ├── 1001.jpg
 │   ├── 1002.jpg
 │   ├── ...
 ├── train_images
 │   ├── 0001.jpg
 │   ├── 0002.jpg
 │   ├── ...

Data Preparation

For Detection task

To prepare the data for text detection, you can run the following commands:

python tools/dataset_converters/convert.py \
    --dataset_name ctw1500 --task det \
    --image_dir path/to/ctw1500/train_images/ \
    --label_dir path/to/ctw1500/ctw_1500_train_labels \
    --output_path path/to/ctw1500/train_det_gt.txt
python tools/dataset_converters/convert.py \
    --dataset_name ctw1500 --task det \
    --image_dir path/to/ctw1500/test_images/ \
    --label_dir path/to/ctw1500/gt_ctw_1500 \
    --output_path path/to/ctw1500/test_det_gt.txt

Then you can have two annotation files train_det_gt.txt and test_det_gt.txt under the folder ctw1500/.

Back to dataset converters