From cac929954acbcc5edc298f2d3328778b31a2e76c Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Micha=C5=82=20Junczyk?= Date: Sun, 10 Mar 2024 14:47:47 +0100 Subject: [PATCH] Update README.md --- README.md | 52 +++++++++++++++++++++++++++++++++------------------- 1 file changed, 33 insertions(+), 19 deletions(-) diff --git a/README.md b/README.md index e261a26..3e84efa 100644 --- a/README.md +++ b/README.md @@ -1,17 +1,44 @@ Polish ASR speech datasets survey and catalog. - -Important - before getting familiar with the Polish ASR speech data survey and catalog, please consider completing the short feedback [form](https://forms.gle/FL1obRfYMvgHVoEu5).
-Your feedback will help to assess the state of Polish ASR datasets from the community perspective and improve the design of the [Polish ASR speech datasets catalog.](https://docs.google.com/spreadsheets/d/181EDfwZNtHgHFOMaKNtgKssrYDX4tXTJ9POMzBsCRlI/edit?usp=sharing)
-Each response is awarded by donation of 50 PLN to the chosen charity organization. Thank you! -[Feedback form link](https://forms.gle/FL1obRfYMvgHVoEu5) - # Polish ASR speech data survey goals - Organization of publicly available information about Polish ASR speech datasets into the catalog. - Providing the ASR community with a collectively maintained, up-to-date source of information about available datasets. - Identification of ASR speech datasets for Polish, which are freely available for research and commercial purposes.
+# Request for feedback +Before getting familiar with the Polish ASR speech data survey and catalog, please consider completing the short (5 min) feedback [form](https://forms.gle/FL1obRfYMvgHVoEu5).
+Your feedback will help to assess the state of Polish ASR datasets from the community perspective. +Each response is awarded by donation of 50 PLN to the chosen charity organization. Thank you! +[Feedback form link](https://forms.gle/FL1obRfYMvgHVoEu5) + +# How to cite? +If you use the raw data from the catalog, please cite. +```bibtex +@Misc{pl-asr-speech-data-survey, + author = {Micha{\l} Junczyk}, + title = {Polish ASR speech data catalog}, + howpublished = {Github}, + year = {2023}, + url = {https://github.com/goodmike31/pl-asr-speech-data-survey} +} +``` +If you use the survey results, please cite the corresponding article: +```bibtex +@article{Junczyk+2024+27+52,
+url = {https://doi.org/10.1515/psicl-2023-0019},
+title = {A survey of Polish ASR speech datasets},
+author = {Micha{\l} Junczyk},
+pages = {27--52},
+volume = {60},
+number = {1},
+journal = {Poznan Studies in Contemporary Linguistics},
+doi = {doi:10.1515/psicl-2023-0019},
+year = {2024},
+lastchecked = {2024-03-10}
+} +``` + # Survey results: - **Fifty-three datasets (53)** intended for Polish ASR development were identified based on the information available in the public domain. - **Fourty-four (44)** out of 53 identified datasets are accessible. @@ -42,19 +69,6 @@ Each response is awarded by donation of 50 PLN to the chosen charity organizatio 1. Open the [catalog](https://docs.google.com/spreadsheets/d/181EDfwZNtHgHFOMaKNtgKssrYDX4tXTJ9POMzBsCRlI/edit?usp=sharing) 2. Set filter on **Usage cost** column to **free** -# How to cite this resource? -@article{Junczyk+2024+27+52,
-url = {https://doi.org/10.1515/psicl-2023-0019},
-title = {A survey of Polish ASR speech datasets},
-author = {MichaƂ Junczyk},
-pages = {27--52},
-volume = {60},
-number = {1},
-journal = {Poznan Studies in Contemporary Linguistics},
-doi = {doi:10.1515/psicl-2023-0019},
-year = {2024},
-lastchecked = {2024-03-10}
-}
# Addendum - Survey design