No. 13 (2019)

Articles

Speech analysis tools - overview of available programs and libraries

DOI: https://doi.org/10.25312/2391-5137.13/2019_14kp
Published: 2020-03-25

Abstract

The article surveys popular speech analysis tools, both standalone programs available for download and libraries for various programming languages. The first part presents programs used to visualise, edit, and analyse the speech signal (for example, to measure fundamental frequency, intensity, or formants) and to annotate recordings (segmentation, transcription, and labelling). The second part presents selected libraries available on GitHub that are used for acoustic, phonetic-phonological, and prosodic analysis of speech. Each tool is described in terms of its functions and capabilities, source, authors, and the license under which it is distributed. The final section evaluates the described programs by the number and usability of their functions.
