Niger-Mali Audio Collection

French from Niger, Fulfulde, Hausa, Tamasheq and Zarma

About

This unannotated audio collection corresponds to 671 hours of radio broadcasts in five languages: French from Niger, Fulfulde, Hausa, Tamasheq, and Zarma. This data was collected by Avignon University in the context of the SELMA EU Project, and shared under the license CC BY-NC-ND-3.0.

Downloading the data

The datasets are available for download here.

Citing us

When using our dataset, please cite the following paper:

@inproceedings{zanon-boito-etal-2022-speech,
    title = "Speech Resources in the {T}amasheq Language",
    author = {Boito, Marcely Zanon  and
      Bougares, Fethi  and
      Barbier, Florentin  and
      Gahbiche, Souhir  and
      Barrault, Lo{\"i}c  and
      Rouvier, Mickael  and
      Est{\`e}ve, Yannick},
    editor = "Calzolari, Nicoletta  and
      B{\'e}chet, Fr{\'e}d{\'e}ric  and
      Blache, Philippe  and
      Choukri, Khalid  and
      Cieri, Christopher  and
      Declerck, Thierry  and
      Goggi, Sara  and
      Isahara, Hitoshi  and
      Maegaard, Bente  and
      Mariani, Joseph  and
      Mazo, H{\'e}l{\`e}ne  and
      Odijk, Jan  and
      Piperidis, Stelios",
    booktitle = "Proceedings of the Thirteenth Language Resources and Evaluation Conference",
    month = jun,
    year = "2022",
    address = "Marseille, France",
    publisher = "European Language Resources Association",
    url = "https://aclanthology.org/2022.lrec-1.222/",
    pages = "2066--2071",
}
Creative Commons License